Were growing and looking to hire Site Reliability Engineer (SRE) who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.
We are looking for a skilled and motivated Site Reliability Engineer (SRE) to join our team and help ensure our production cloud environment's reliability, performance, and scalability. As an SRE, you will work at the intersection of software engineering and operations, taking ownership of system stability, incident response, automation, and continuous improvement of our infrastructure.
This role is ideal for engineers who thrive in dynamic environments, value reliability, and enjoy building resilient and scalable systems.
As an SRE, Your impact will be:
Production Reliability: Ensure system uptime and performance by identifying and addressing potential issues before they affect end users.
Incident Response: Serve as part of the on-call rotation, rapidly diagnosing and resolving incidents, and conducting root cause analysis and postmortems.
Monitoring and Alerting: Build and maintain monitoring dashboards and alerting systems to detect and respond to anomalies in real time.
Automation and Tooling: Develop and maintain automation tools for deployments, scaling, and operational efficiency using Terraform, Ansible, Bash, or Python.
Infrastructure Maintenance: Perform regular maintenance and upgrades of production infrastructure to ensure security, stability, and performance.
Release Engineering: Support and optimize the rollout of new features and updates, minimizing risk and impact on production environments.
Staging Environment Management: Ensure staging environments accurately reflect production for robust testing and validation of changes.
Requirements: Experience in SRE, DevOps, or production engineering roles
Strong skills in system troubleshooting, incident response, and root cause analysis
Proficiency with tools such as:
Jenkins, Terraform, Ansible, GIT, GitHub
Bash, Python
AWS, ArgoCD, or similar CI/CD and cloud platforms
Familiarity with observability tools and practices (metrics, logging, tracing)
Ability to work effectively in cross-functional teams
Strong communication and documentation skills
Bachelor's degree in Computer Science, Information Technology, or a related field (preferred)
Familiarity with Agile development methodologies
This position is open to all candidates.