Responsible for the development of guidance and best practice for build, install, deploy, configure, analyze, tune, and troubleshoot systems in the cloud
Responsible for resolving difficult system problems
Responsible for mentoring other IT Ops team members
Responsible for ensuring high availability and reliability of company’s production system
Responsible for ensuring operability of company’s production system
Responsible for ensuring company’s system are monitored and analyzed
Responsible for recommending enhancements to system capabilities and performance to the engineering teams
Responsible for ensuring system’s events get notified and runbook executed on system events
Responsible for working on cost optimization of company’s production systems
Responsible for working with IT Security team on configuration, implementation, and maintenance system security strategies, policies, and procedures
Collaborating with other members of company’s technology team, to ensure that services meet operations requirements for both internal and external clients and ensuring that standards for consistent documentation are met
Document and follow processes and procedures for the infrastructure and systems.
Responsible for with DevOps on continuous deployment and integration
Responsible for working with company’s support teams to resolve support issues
Responsible for providing company’s support teams with appropriate tools and knowledge tools
Job Requirements
5+ years’ work experience in AWS required
2 years of experience with Azure preferred
5+ years’ experience in Linux, docker and database operations