In this role, you will be responsible for architecting and maintaining scalable, secure, and highly available environments across AWS and on-premises infrastructure. You will drive automation initiatives, improve CI/CD processes, enhance system reliability, and support engineering teams in delivering production-ready solutions. The ideal candidate is an AWS expert with deep Kubernetes experience, strong infrastructure automation skills, and a passion for operational excellence.
Responsibilities:
* Design, implement, and maintain scalable, secure, and highly available AWS and hybrid infrastructure
* Build, manage, and optimize Kubernetes platforms and containerized workloads
* Develop and maintain Infrastructure as Code (IaC) using Terraform
* Build and optimize CI/CD pipelines using GitHub Actions, Jenkins, and related technologies
* Drive automation initiatives to improve reliability, scalability, and operational efficiency
* Monitor, troubleshoot, and resolve production issues, and implement observability solutions
* Collaborate with development, security, and customer-facing teams to improve deployment and operational processes
Advantages:
* Experience with Microsoft Azure cloud services
* Experience with monitoring and observability tools such as Prometheus, Grafana, ELK/OpenSearch, or Datadog
* Experience with configuration management tools such as Ansible
* Knowledge of security best practices, vulnerability management, and cloud security
* Experience working in regulated or security-focused environments
Requirements:
* 7+ years of experience as a DevOps Engineer
* Strong hands-on experience with Kubernetes and Docker, including platform architecture, deployment, and operations
* Expert-level experience with AWS services and cloud architecture
* Strong Linux administration skills
* Experience with Infrastructure as Code tools such as Terraform
* Strong experience with CI/CD platforms such as GitHub Actions and Jenkins
* Proficiency in scripting and automation using Bash and Python
* Strong understanding of networking, security, and system reliability principles
This position is open to all candidates.