The DevOps Engineer builds, automates, and operates cloud‑native infrastructure across AWS and Red Hat OpenShift, enabling scalable, secure, and reliable application delivery. This role combines hands‑on platform engineering, CI/CD automation, container orchestration, and the integration of AI‑powered tools for observability, anomaly detection, and operational efficiency. The engineer collaborates closely with development, security, and SRE teams to streamline deployments and improve system resilience.
Core Responsibilities
Cloud & Platform Engineering
Design, deploy, and maintain cloud‑native infrastructure on AWS (EC2, VPC, IAM, EKS, S3, RDS, Lambda).
Operate and optimize Red Hat OpenShift clusters, including cluster upgrades, operator management, and workload orchestration.
Implement Infrastructure‑as‑Code using Terraform, CloudFormation, or Ansible.
Build secure, scalable network architectures including VPC design, load balancing, service mesh, and ingress/egress controls.
CI/CD & Automation
Develop and maintain CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, or Argo Workflows.
Automate build, test, and deployment workflows for microservices and containerized applications.
Implement GitOps practices using Argo CD or Flux.
Create reusable automation modules and scripts in Python, Bash, or Go.
Containers & Kubernetes
Manage containerized workloads using Docker, Kubernetes, and OpenShift Operators.
Configure namespaces, RBAC, secrets, ConfigMaps, and resource quotas.
Troubleshoot cluster performance, networking, and scheduling issues.
Support service mesh technologies (Istio, Linkerd) when applicable.
AI‑Driven Operations
Integrate AI/ML‑based tools for monitoring, anomaly detection, predictive scaling, and automated remediation.
Work with data and platform teams to operationalize AI/ML pipelines on Kubernetes or OpenShift.
Evaluate emerging AI‑Ops platforms and contribute to automation strategies.
Observability & Reliability
Implement monitoring, logging, and tracing using Prometheus, Grafana, ELK, Loki, CloudWatch, or Datadog.
Build alerting, dashboards, and SLO‑based reliability metrics.
Participate in on‑call rotations and incident response, driving root‑cause analysis and long‑term fixes.
Security & Compliance
Apply DevSecOps practices including image scanning, secrets management, and policy enforcement.
Work with security teams to implement IAM best practices, encryption, and compliance controls.
Integrate tools such as Vault, OPA/Gatekeeper, or Kyverno.
Requirements: 2-5 years of experience in DevOps, cloud engineering, or platform operations.
Strong hands‑on experience with AWS services and cloud architecture fundamentals.
Practical experience with Kubernetes and Red Hat OpenShift.
Proficiency with Terraform, Ansible, or similar IaC tools.
Experience building CI/CD pipelines and automating deployments.
Solid Linux administration and networking fundamentals.
Scripting skills in Python, Bash, or Go.
Understanding of container security, cloud security, and DevSecOps practices.
Preferred Qualifications
Certifications: AWS Solutions Architect, CKA/CKAD, Red Hat OpenShift, Terraform Associate.
Experience with AI‑Ops platforms or ML pipeline orchestration.
Familiarity with service mesh, API gateways, or event‑driven architectures.
Experience with multi‑cluster or hybrid cloud environments.
Background in SRE practices (SLOs, error budgets, chaos engineering).
What Success Looks Like
Reliable, automated, and secure cloud‑native infrastructure supporting rapid development cycles.
Stable and observable Kubernetes/OpenShift environments with clear operational metrics.
Reduced manual work through automation and AI‑driven insights.
Strong collaboration with engineering teams and continuous improvement of DevOps practices.
This position is open to all candidates.