This role involves leading a diverse team of engineers with various specializations, guiding them to maintain high standards and drive consistent performance across the platform.
As part of your role, you will:
Team Leadership & Growth - Mentor engineers, conduct performance reviews, and build a high-trust, collaborative team.
Leverage AI to Accelerate Development and Delivery - Use industry-leading tools and our company's agentic concepts to speed up development and operational workflows.
Operational Excellence - Set high standards for code quality, monitoring, and incident response; ensure systems are robust and secure.
Technical Direction - Guide architectural decisions, balancing immediate product needs with long-term scalability.
Enable Autonomous Execution - Build self-service platforms (GitOps, observability, cost dashboards) so product teams deploy and monitor independently.
Stay Hands-On - Lead architecture reviews, debug critical incidents, and mentor through code and config reviews - you're a player-coach.
Requirements:
5+ years in infrastructure or DevOps engineering, including at least 2 years leading teams (managing 3+ engineers as a manager or tech lead).
Deep Kubernetes expertise - Production experience with EKS and/or GKE, GitOps (ArgoCD/Flux), service mesh, resource management, and multi-tenant cluster operations.
GitOps & CI/CD mastery - Deep hands-on experience with ArgoCD or Flux, kustomize/Helm, rollout strategies, deployment safety, change ownership, and audit trails.
AWS platform depth - EC2, EKS, S3, CloudFront, IAM/IRSA, VPC networking, RDS, and cost-optimization levers.
Observability mindset - Proven experience building and operating monitoring stacks (Prometheus/Grafana/Datadog/Coralogix); SLO-driven thinking, actionable alerting, runbook culture.
AI-first mindset - You actively use AI tools (Copilot, Claude) to accelerate development and ops workflows.
Platform engineering background - Experience building self-service developer platforms, golden paths, and internal tooling that reduces team dependency on DevOps.
People leadership skills - You've hired, mentored, and grown engineers; conducted performance reviews; and built high-trust, high-performing teams.
Ownership & Communication - You own outcomes end-to-end, escalate only when truly blocked, and translate infrastructure complexity into business impact.
This position is open to all candidates.