Position Summary The DevOps team leader is responsible for leading both Israel-based and offshore DevOps teams in Australia in the design, implementation, and maintenance of scalable, secure, and reliable CI/CD pipelines and infrastructure environments across cloud and on-premises deployments. This role includes end-to-end management of Kubernetes-based production sites in Israel and Australia, ensuring high availability, stability, and observability of mission-critical environments. The position combines strong hands-on technical expertise with leadership, mentoring, and operational ownership to drive continuous delivery, automation excellence, and infrastructure resilience.
Requirements: Professional Experience
* Minimum 7 years of experience in DevOps roles, including at least 2 years in a team leadership position.
* Proven experience managing and coordinating distributed DevOps teams across multiple geographies (Israel and Australia).
* Hands-on leadership in hybrid DevOps operations spanning Azure Cloud and on-prem OpenShift environments.
* Experience in managing Kubernetes production clusters across multiple sites (Israel and Australia) including HA, DR, and monitoring.
* Proven experience defining and implementing DevOps strategy, standards, and best practices across development, QA, and production environments.
* Responsible for mentoring, task planning, and performance management of both local and offshore engineers.
* Familiarity with Agile/Scrum methodologies and collaboration with development, QA, IT, and product management teams.
Technical Expertise
* Azure AKS (Kubernetes Service): Setup, scaling, HA, networking, security, monitoring, and production management.
* Red Hat OpenShift (On-Prem): Cluster management, namespace isolation, RBAC, image registry, and CI/CD integration.
* Kubernetes Production Operations: Managing multiple production sites (Israel and Australia) including lifecycle management, capacity planning, security hardening, upgrades, and observability.
* Azure DevOpsgreenTxtBg!: Pipelines, Boards, Artifacts, Repositories, and full build & release automation.
* ArgoCD: GitOps-based deployment management, App-of-Apps architecture, sync policies, and Helm chart management.
* JFrog Artifactory: Repository management for Docker, Helm, Maven, NPM, and version lifecycle control.
* Terraform a must: Infrastructure as Code, modular template design, remote state management, and multi-environment provisioning.
* AWS (optional): EKS, EC2, S3, IAM, and cross-cloud CI/CD integration.
* Scripting: Advanced skills in Bash, PowerShell, or Python for automation and monitoring.
* Helm: Writing, maintaining, and templating Helm charts for Kubernetes deployments.
* Monitoring & Logging: Prometheus, Grafana, and ELK/EFK Stack for observability and troubleshooting.
* Security: Familiarity with TLS, secrets management, Azure Key Vault, RBAC, IAM, and secure CI/CD practices.
Troubleshooting & Operational Excellence
* Expertise in troubleshooting distributed systems and resolving production incidents across multiple Kubernetes sites.
* Skilled at identifying root causes and implementing preventive measures in CI/CD pipelines, clusters, and infrastructure layers.
* Proficiency in analyzing logs, metrics, and traces using Kibana, Grafana, and Prometheus.
* Lead incident response and problem management, coordinating between Israel and Australian teams to ensure timely resolution.
* Define and maintain runbooks, playbooks, and SLA-based operational workflows.
* Drive system reliability, performance optimization, and disaster recovery readiness across all production sites.
Optional Security & Compliance
* Understanding of DevSecOps practices, vulnerability management, and secure pipeline design.
This position is open to all candidates.