求人ID : 973708
Cloud Lead DevOps Engineer
As AXA continues its journey to enable first-class software engineering and operations, we are looking for new members that want be part of the Cloud Centre of Excellence Team who will work in close collaboration with development and Operations teams for aligning the best practices of the Cloud. You will have the opportunity to work in a community that is leading the Digital transformation including exciting technology opportunities with cloud, including infrastructure as code, and software delivery pipelines.
The Lead DevOps Engineer, within the Cloud CoE Team, works with development and operations teams to solve operational issues and create tools, processes, and systems that improve our overall operational reliability and resilience. This role is hybrid software developer to develop infrastructure code, operations admin, and general problem solver. Furthermore, the focus is on enabling engineering teams with expert guidance and tools in the Cloud to deliver frequent, high quality and reliable components.
a) Configuring and setting up infrastructure
e) Production support
・Contribute to application code base and architecture with a focus on optimization for performance, reliability, scalability, security and cost
・Implement CI/CD best practices, deployment pipelines and test automation frameworks to enable frequent, high quality releases.
・Define and implement application deployment strategy based on application type.
・Guide and train agile engineering teams to optimize service quality and ensure adoption of operational best practices
・Day to day operational support of continuous integration, continuous delivery and source control tooling.
・Collaborate with software, infrastructure, network engineers and DBAs to solve productivity challenges, drive efficiency, automate and streamline environment builds
・Build and maintain monitoring & alert configuration to detect, triage and resolve issues quickly .
・Create Proof-of-Concepts using new technologies
・Build tools to enhance production triage and improve time to detect issues
・Take charge of escalated outages, lead until they are resolved, and make sure the root cause has been found and fixed
・Suggest architecture improvements, recommend process improvements.
・Provide training and coaching in a capacity as Subject Matter Expert to other engineers
・Contribute to documentation required to guide on-call engineers and on-board team members
・Provide off-hours support for production applications
・Good understanding of continuous integration and continuous deployment tools such as Jenkins
・Good understanding of Software Development Life Cycle (SDLC)
・Good understanding of code versioning tools, including Git and/or Subversion.
・Good understanding of infrastructure automation such as Ansible, Chef, Puppet etc.
・A minimum of 1-2 years of relevant experience administering cloud or infrastructure services across a large, diverse, enterprise environment.
・Knowledge of Private Cloud platforms and hybrid cloud architecture models.
・knowledge of Public Cloud platforms namely Amazon Web Services (AWS) and Azure Cloud..
・Intermediate knowledge of primary AWS services (S3, EC2, RDS, Route53 & VPC).
・Experience with agile development and Continuous Integration/Continuous Delivery
・Good understanding of container based virtualization technology (aka containerization) such as Docker, Rocket etc. . ・Experience with container orchestration software such as Kubernetes, Mesos etc. is a plus.
・Good understanding of application and http servers such as Apache, Nginx, Tomcat, JBoss, Websphere etc.
・Good understanding of databases such as Oracle, MySQL, MongoDB etc.
・Basic understanding of integration middleware such as MQ, EAI, ESB, API Gateway etc.
・Basic understanding of monitoring tools such as Nagios, Dynatrace etc.
・Basic understanding of network technologies such as DNS, Router, Load balancer, Firewall etc.
・Good understanding of Agile methodology, including Scrum.
・Basic understanding of programming language such as Java, SSJS, Python, Ruby etc.
・Basic understanding of job scheduling software such as cron, control-M, JP1 etc.
・Basic understanding of OpenAPI specification
・Bachelor’s degree in Computer Science or related field from accredited college/university or equivalent work experience
・3+ years operational experience.
・Preferable experience with software development
・Some experience with application performance monitoring, alerting mechanisms, and automated remediation
・Administration experience in Linux system
・Ability to quickly triage problems under pressure, determine root cause and drive resolution
・Desire to work in a fast paced, evolving, growing, and dynamic environment
・Curiosity to explore new ideas and passion to make them happen・
・A desire to be mentored and to grow
・A strong desire to work with any technologies in an effort to become a cross functional member in DevOps community