We are looking for a highly motivated Senior DevOps Engineer.
Responsibilities:
Infrastructure & Platform Engineering
Cloud Architecture: Design and operate high-scale, production-grade infrastructure primarily on GCP (leveraging GCE, Cloud SQL, and networking).
Kubernetes Mastery: Own the lifecycle of GKE clusters, including advanced networking, security hardening, and scaling strategies.
IaC & Automation: Architect modular infrastructure using Terraform or Pulumi and manage deployments via Helm.
Data & Messaging: Support high-throughput data platforms, including Databricks, Confluent Kafka, and RabbitMQ.
Continuous Delivery & GitOps
GitOps Excellence: Standardize deployments using ArgoCD for automated, declarative environment synchronization.
CI/CD Innovation: Build and optimize GitHub Actions workflows to minimize lead time for changes.
Developer Enablement: Reduce friction by building internal scripts and automation in Python and Bash that empower dev teams.
Reliability & AI Integration
AI-Augmented Workflow: Integrate AI tools (Copilot, ChatGPT, Claude) into the daily DevOps lifecycle to accelerate scripting, complex troubleshooting, and documentation.
Observability: Monitor system health with VictoriaMetrics and Datadog to ensure actionable alerting and high visibility.
Incident Response: Participate in on-call rotations, maintain runbooks, and drive blameless postmortems.
Requirements: Experience: 5+ years in DevOps, SRE, or Platform Engineering with a deep focus on GCP and Kubernetes.
Technical Stack: Proficiency with Terraform/Pulumi, Helm, ArgoCD, and GitHub Actions.
Data Infrastructure: Hands-on experience managing distributed systems like Kafka/RMQ and data platforms like Databricks.
Coding: Strong proficiency in Python and Bash for automation and tool development.
AI-Powered Mindset: Active Practitioner: Demonstrated ability to use AI tools (GitHub Copilot, LLMs) to write better code faster, debug complex infrastructure issues, and generate high-quality documentation.
Prompt Engineering: Ability to leverage AI to "rubber-duck" architectural decisions and automate repetitive "toil."
Operational Excellence: Experience with observability stacks (Datadog/VictoriaMetrics) and managing stateful workloads in cloud environments.
This position is open to all candidates.