As a Lead DevOps Engineer, your role involves the design and development of robust, scalable, and resilient distributed systems. You'll define product specifications, leveraging your technical expertise to create optimal solutions hosted in Kubernetes on AWS Cloud. This position requires extensive collaboration with various teams throughout the software development lifecycle. You will lead design discussions and code reviews, contributing to the overall quality of engineering within the organization.
Your responsibilities also include creating and supporting reusable application components and patterns, considering both business and technology perspectives. You'll utilize developer tools and a range of AWS services for task management, source code handling, building, deployment, operations, and real-time communication. You are expected to demonstrate advanced skills in application design, implementation, and maintenance, often with minimal supervision.
Beyond technical tasks, you will mentor other engineers, sharing your knowledge and actively contributing to the enhancement of best practices and processes within and across teams.
Responsibilities:
Design, build, and maintain the scalable cloud infrastructure and CI/CD pipelines necessary to support our cutting-edge AI and optimization services.
Champion Infrastructure as Code (IaC) practices using tools like Terraform and Kubernetes to automate the deployment, scaling, and management of our production environments.
Implement robust monitoring, logging, and alerting systems to ensure the high availability, performance, and reliability of all services.
Partner with development teams to streamline the software development lifecycle, improve deployment velocity, and embed best practices for security and operational excellence.
JR314438
Requirements:
4+ years of hands-on experience in DevOps Concepts and Cloud Architecture.
4+ years of experience with AWS (knowledge of S3, SQS, DynamoDB, IAM, and KMS is mandatory). Experience with equivalent services from other cloud providers, e.g., GCP and Azure, is optional.
4+ years of experience deploying and managing CI/CD pipelines, e.g., Jenkins and/or Spinnaker.
Advanced programming experience with at least two modern languages such as Go, Java, C++, or Python, including object-oriented design.
Proven understanding of microservices-oriented architecture and extensible REST and gRPC APIs. Experience with the architecture and design of new and existing systems, including design patterns, reliability, and scaling.
Knowledge and experience in Kubernetes cluster management, ensuring that clusters and their workloads (Deployments and StatefulSets) remain reliable, available, and secure and meet performance expectations.
Experience with Kubernetes packaging technologies such as Helm, and experience administering Kubernetes ConfigMaps, Services, Deployments, and StatefulSets.
Experience monitoring production, staging, test, and development environments for multiple applications in a dynamic organization.
Good command of version control tools, including but not limited to Git.
Strong expertise in troubleshooting complex production issues. Excellent problem-solving, critical thinking, and communication skills.
Degree or equivalent relevant experience required. Experience will be evaluated based on the core competencies for the role (e.g. extracurricular leadership roles, military experience, volunteer roles, work experience, etc.).
This position is open to all candidates.