Required Senior DevOps Engineer
What will you do?
As a Senior DevOps Engineer, you will:
Own Core Data Platforms: Design, manage, and scale our diverse portfolio of datastores, including Cassandra, RDS/Aurora, Redis, Elasticsearch, and more.
Evolve Observability: Champion and advance our observability stack (Prometheus/Thanos, Grafana & ELK) to provide critical, real-time insights for hundreds of services.
Strengthen Reliability (SRE): Drive SRE best practices, including automating disaster recovery drills, managing our alerting strategy (AlertC), and improving system-wide resilience.
Modernize CI/CD: Help administer and optimize our CI/CD infrastructure, which includes Jenkins, TeamCity, and GitHub Actions.
Automate Everything: Leverage our Kubernetes and GitOps (ArgoCD/Flux) foundation to manage infrastructure as code, enhance developer self-service, and eliminate toil.
Collaborate & Consult: Act as a subject matter expert, partnering with development teams to help them choose, implement, and operate their data and observability solutions effectively.
Requirements:
At least 7 years of industry experience as a Senior DevOps, SRE, or Platform Engineer.
Proven experience designing, analyzing, and troubleshooting large-scale distributed systems.
Strong experience with at least one major public cloud (AWS, GCP, or Azure).
Experience managing and scaling CI/CD systems (e.g., Jenkins, GitHub Actions, TeamCity).
Experience with modern observability stacks (e.g., Prometheus/Thanos, Grafana, ELK/OpenSearch).
Experience with production datastores (e.g., Cassandra, RDS/Aurora, Redis, Elasticsearch) - this is a strong advantage.
Excellent problem-solving and collaboration skills, with a "customer-first" attitude toward supporting internal developers.
This position is open to all candidates.