As we rapidly scale from a handful of pilot stores to dozens of deployments each week, were looking for a DevOps engineer to take the lead on one of the most critical technical challenges in our business: how we deploy software at scale.
Today, deploying our core system into each store is a complex, multi-stage process. Tomorrow, it needs to be seamless, automated, and capable of onboarding dozens of stores per weekwithout sacrificing versatility and quality
This role isnt just about CI/CD or scripting. Its about refining the automation infrastructure that enables repeatable, self-service deployments across hundreds of live, mission-critical environments. Youll sit at the heart of our Ops Technology team, working at the intersection of system engineering and in-store execution, and serving as the technical backbone for deployment scale.
You'll help shape and evolve our DevOps toolsetworking hands-on with cutting-edge technologies to streamline deployments, boost reliability, and scale our platform with speed and confidence. For the right person, this role is a stepping stone toward technical leadership in one of our most strategic teams.
A day in the life
Design and build the software deployment framework that powers our in-store systems at scale
Set up and manage Kubernetes clusters for scalable microservices in diverse environments
Deploy and monitor services with a focus on resilience, observability, and recovery
Troubleshoot complex issues across CI/CD, multi discipline environments, services, and infrastructure
Collaborate with System Engineering, SRE, RnD peers to ensure smooth end-to-end deployment flows
Continuously evolve our CI/CD pipelines, deployment logic, and infrastructure-as-code practices
Build tooling, templates, and documentation to enable fast, low-touch deployments by others
Serve as a technical leader and force multiplier within the Ops Tech team.
Requirements: 3+ years in DevOps, Infrastructure, or SRE roles with deep end-to-end ownership
Solid background in Kubernetes, container orchestration, and microservices
Solid experience in Working in a Linux environment
Experience deploying and supporting systems in live production environments
Strong CI/CD skills with GitHub Actions, GitLab CI, or Jenkins; hands-on experience with ArgoCD is a major plus.
Scripting proficiency in Python
Familiarity with monitoring, alerting, and diagnostics (Prometheus, Grafana, VictoriaMetrics, Loki, ELK, Coralogix etc)
Experience with SQL database
Good knowledge of networking concepts (Load Balancers, DNS, VPC)
Experience with infrastructure-as-code tools like Terraform
Excellent troubleshooting skills and a bias toward automation and scale
Strong communication and the ability to work cross-functionally and independently
Nice to have
Experience with GCP and Azure
Understanding of K8s networking, service meshes, ingress controllers
Experience with MLOps.
This position is open to all candidates.