Senior DevOps Engineer

7 hours ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a hands-on technical leader to drive the design, implementation, and evolution of our cloud infrastructure. You'll take ownership of building scalable, reliable, and secure systems that empower engineering teams to deliver with speed and confidence. You'll operate in a fast-paced environment, balancing innovation and pragmatism, and fostering a culture of continuous improvement and operational excellence.
Responsibilities:
Lead end-to-end technical initiatives to design and manage cloud infrastructure across AWS and other multi-cloud environments.
Build and evolve automation frameworks and internal tools using Python, Bash, and modern infrastructure-as-code technologies (Terraform or equivalents).
Architect, implement, and maintain CI/CD pipelines that streamline delivery and improve developer productivity.
Champion observability, reliability, and performance - establishing best practices around monitoring, alerting, and system health visibility.
Collaborate closely with cross-functional engineering teams to enable scalable deployments, efficient development workflows, and resilient production systems.
Drive technical tradeoff decisions that balance speed, cost, and reliability in a dynamic, growth-oriented environment.
Act as a mentor and advocate for DevOps culture, enabling teams to take ownership of infrastructure and operations.
Requirements:
6+ years of DevOps or Infrastructure Engineering experience in high-growth product environments.
Proven track record of leading or owning complex system deployments and cloud architectures end-to-end.
Expertise in cloud platforms (preferably AWS) with strong understanding of networking, security, and scalability best practices.
Proficiency in scripting and automation using Python and Bash.
Hands-on experience with Infrastructure as Code (Terraform, CloudFormation, or equivalent).
Deep familiarity with CI/CD systems (GitHub Actions, Jenkins, or similar) and container orchestration (Kubernetes, ECS, or EKS).
Experience building observability stacks (Prometheus, Grafana, ELK, Datadog, etc.).
Excellent collaboration and communication skills - a positive, pragmatic team player who thrives in high-velocity, tradeoff-driven environments.
This position is open to all candidates.
 
8674620
Similar jobs that may interest you
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a hands-on Senior DevOps Engineer with a strong cloud-native mindset to build, maintain, and evolve our highly scalable, highly available cloud infrastructure. This role is pivotal in driving operational excellence, security, and automation across our entire engineering organization. You will promote communication, integration, and collaboration to significantly enhance our software development productivity and reliability. You'll work closely with engineering and product teams to streamline delivery, enforce platform standards, and enable a high-velocity development environment, all while keeping reliability and security top of mind.
Responsibilities:
Design, automate, and manage complex cloud infrastructure on AWS using best-in-class Infrastructure as Code (IaC) practices.
Lead the operation and enhancement of our production Kubernetes environments (EKS), focusing on automation, security, observability, and seamless CI/CD integration.
Drive continuous improvement across platform tooling, developer experience, and operational processes to meet our ambitious performance and uptime goals.
Implement and enforce security-first infrastructure patterns, including strong IAM, network segmentation, and secure secrets management.
Actively contribute to high-level technical design discussions and cross-functional architectural decision-making, ensuring solutions align with long-term platform strategy.
Requirements:
7+ years of experience as a DevOps Engineer, Platform Engineer, or in a similar infrastructure-focused role.
Strong hands-on expertise across the AWS Stack (e.g. EC2, EKS, RDS, VPC, IAM, S3, Lambda).
Mastery of Infrastructure as Code - Terraform or equivalent.
Deep operational knowledge of Kubernetes, including architecture, cluster management, networking, and advanced debugging in production environments.
Strong expertise in designing and managing CI/CD methodologies and platforms (e.g. Jenkins, GitHub Actions).
Experience with monitoring tools such as Prometheus, Datadog, Coralogix (OTEL), Grafana, etc.
Proven prior experience building and maintaining highly-available, production-grade, and service-oriented systems.
Strong scripting and automation background in languages such as Python or Bash.
Exceptional communication and collaboration skills with the ability to articulate complex technical needs and influence cross-functional teams.
Strong knowledge of AWS Networking - an advantage.
This position is open to all candidates.
 
8625782
1 hour ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an outstanding Senior DevOps Engineer to join our revolutionary, large-scale mobile content discovery platform used by millions of users worldwide. In this role, you won't just keep the lights on; you will take a leading role in shaping our data infrastructure, setting architectural standards for AI/ML workloads, and bridging the gap between DevOps and Data Engineering.
Why you'll love this team: We move fast, use cutting-edge technologies, and value absolute technical excellence over rigid bureaucracy. If you are passionate about solving complex, high-traffic infrastructure puzzles and want to see your work directly impact millions of daily users, this is the sandbox you've been looking for.
What you'll be doing
Design Data-Native Cloud Solutions: Design and implement scalable data and AI/ML infrastructure across multiple environments using Kubernetes, orchestration platforms, and IaC to power our AI, ML, and analytics ecosystem
Accelerate Data/ML Engineer Experience: Spearhead improvements to data pipeline deployment, monitoring tools, and self-service capabilities that empower data teams to deliver insights faster with higher reliability
Engineer Robust Data/ML Platforms: Build and optimize infrastructure that supports diverse data workloads from real-time streaming to batch processing, ensuring performance and cost-effectiveness for critical analytics systems
Drive DevOps Excellence: Collaborate with engineering leaders across backend and ML teams, champion modern infrastructure practices, and mentor team members to elevate how we build, deploy, and operate data systems at scale
Collaborate on high-level technical designs with ML and Backend engineers to build resilient systems.
Requirements:
5+ years of hands-on DevOps experience building, shipping, and operating production systems
Infrastructure as Code: design and implement infrastructure automation using tools such as Terraform, Pulumi, or CloudFormation (modular code, reusable patterns, pipeline integration)
Cloud platforms: deep experience with AWS, GCP, or Azure (core services, networking, IAM)
Kubernetes: strong end-to-end understanding of Kubernetes as a system (routing/networking, scaling, security, observability, upgrades), with proven experience integrating data-centric components (e.g., Kafka, RDS, BigQuery, Aerospike).
GitOps & CI/CD: practical experience implementing pipelines and advanced delivery using tools such as Argo CD / Argo Rollouts, GitHub Actions, or similar

Observability: metrics, logs, and traces; actionable alerting and SLOs using tools such as Prometheus, Grafana, ELK/EFK, OpenTelemetry, or similar
Scalability & Performance: Proven experience managing production environments characterized by high traffic volumes and large amounts of data, with a focus on maintaining system reliability and cost-efficiency at scale.
You might also have
Coding proficiency in at least one language (e.g., Python or TypeScript); able to build production-grade automation and tools.
Data Pipeline Orchestration: Demonstrated success building and optimizing data pipeline deployment using modern tools (Airflow, Temporal, Kubernetes operators) and implementing GitOps practices for data workloads
Data Engineer Experience Focus: Track record of creating and improving self-service platforms, deployment tools, and monitoring solutions that measurably enhance data engineering team productivity
Data Infrastructure Deep Knowledge: Extensive experience designing infrastructure for data-intensive workloads including streaming platforms (Kafka, Kinesis), data processing frameworks (Spark, Flink), storage solutions, and comprehensive observability systems.
This position is open to all candidates.
 
8675462
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior DevOps Engineer to join our R&D team in developing the next rising product in the health tech landscape. If you are looking for a challenging, influential position and are passionate about making an impact, this might be the role for you.

As a Senior DevOps Engineer, you'll play a key role in the design, development, testing, deployment, and monitoring of our infrastructure and products. In this position, you'll make significant contributions to our observability stack, helping build and maintain robust systems for logs, metrics, traces, and alerting.

Our ideal candidate is passionate about DevOps and observability, has strong communication skills, and thrives on constant improvement for both technology and processes. If you enjoy working on multiple projects in parallel and are a proactive team player, you'll fit right in.

This is a unique opportunity to join the core team of a fast-growing startup, where your contributions will have a direct impact on our product and success.

Responsibilities

Support and collaborate with cross-functional engineering teams using cutting-edge technologies.
Contribute to the design, implementation, and maintenance of monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, Loki)
Secure, scale, and manage our cloud environments (AWS and GCP)
Design and implement automation solutions for both development and production
Manage and improve our CI/CD pipelines for fast and safe delivery
Lead best practices in infrastructure, observability, configuration management, and system hardening
Continuously assess and improve existing infrastructure in line with industry standards
Requirements:
BSc in Computer Science, Engineering, or equivalent experience
5+ years of experience as a DevOps Engineer or similar software engineering role
Proven experience with Docker and Kubernetes (EKS preferred)
Hands-on experience with monitoring and observability tools, including Prometheus, Grafana, Datadog, or similar.
Expertise in Terraform for AWS infrastructure-as-code deployments
Strong collaboration and interpersonal communication skills
Excellent analytical thinking and problem-solving mindset
Proficiency with relational databases
Solid knowledge of Python and Bash scripting
Experience with test automation - an advantage
This position is open to all candidates.
 
8671069
26/04/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Engineer to join our fast-growing company at a breakthrough stage, as we build our dream team with the most passionate and professional people in the industry. Our team thinks differently and quickly, delivering high-quality, innovative solutions with the latest technologies and frameworks. And we never forget to enjoy the ride along the way!
What You'll Be Doing:
Championing DevOps culture: Identify friction points and implement scalable automation to improve reliability, delivery speed, and developer experience.
Operating and optimizing production environments: Leverage infrastructure as code and modern observability tools to ensure stability and performance.
Empowering R&D: Build intuitive internal tools that foster a self-service culture and enable teams to own their operational workflows.
Leading deployment lifecycles: Drive architecture, design, and hands-on implementation for deployments across multiple regions and environments.
Driving innovation with impact: Collaborate across teams, challenge the status quo, and take ownership of technical decisions.
Requirements:
4+ years as a DevOps, SRE, or Production Engineer.
Cloud expertise: Hands-on experience with production workloads on public cloud (GCP/AWS/Azure).
Containers & Microservices: Proficiency with Docker, Kubernetes, and microservices architectures.
CI/CD & release engineering: Strong knowledge of pipelines and tools (GitHub Actions, GitLab, Jenkins, ArgoCD, Argo Rollouts).
Infrastructure as Code: Experience with Terraform, Terragrunt, or similar tools.
Programming & automation: Scripting/programming skills in Bash, Python, or Go.
Observability: Familiarity with OpenTelemetry, Prometheus, Grafana, VictoriaMetrics, or similar tools.
Team spirit: Collaborative, curious, and driven. You challenge ideas constructively, and thrive in an empowering, positive environment.
Innovation & adaptability: Ability to learn quickly, adopt new technologies, and push boundaries while understanding real-world constraints.
This position is open to all candidates.
 
8623673
05/05/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Site Reliability Engineer on the SASE Platform team, you will play a critical role in building and operating highly available, secure, and globally distributed services. Your mission is to ensure our cloud-native security and networking platform is reliable, scalable, and performant from day one, protecting the users, applications, and data for the world's largest enterprises as they adopt cloud, remote work, and AI.
Key Responsibilities
Proactively collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages.
Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance.
Build and operate automation for provisioning, deploying, and managing global infrastructure using Infrastructure as Code (IaC).
Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments.
Drive observability best practices, including metrics, logs, traces, and SLIs/SLOs to enable data-driven incident analysis.
Participate in on-call rotations, reducing mean time to resolution (MTTR) through automation and proactive reliability improvements.
Challenge existing processes by championing reliability, security, and operational maturity across the organization.
Requirements:
Your experience:
5+ years of experience working with Unix/Linux systems, including shell, tools, networking, and kernel concepts.
2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms.
Proven experience operating workloads in public cloud environments (e.g., AWS, GCP, Azure) at scale.
Proficiency in building automation and tools in at least one scripting or programming language (e.g., Python, Go, Java).
Strong experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Bachelor's degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience.
Preferred Qualifications
Deep expertise in designing and operating monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK Stack).
Advanced networking expertise, including TCP/IP, DNS, BGP, routing, and cloud networking concepts relevant to SASE architectures.
Prior experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms.
Familiarity with using AI/LLM technologies to improve operational workflows (e.g., incident analysis, automation).
This position is open to all candidates.
 
8638178
05/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior IT SRE Engineer, you will be a key player in ensuring the reliability, scalability, and performance of our critical IT infrastructure. You will leverage SRE principles and an automation-first mindset to build and maintain resilient hybrid cloud environments. This role is ideal for a candidate who thrives in a fast-paced, innovative setting and is passionate about solving complex challenges with cutting-edge technology.
Key Responsibilities
Provision, configure, and support resilient hybrid cloud deployment architectures using an Infrastructure-as-Code framework.
Proactively collaborate with development teams to ensure new applications are production-ready, scalable, and reliable from inception.
Develop and maintain tools and frameworks to automate operational tasks, including deployment, monitoring, and recovery.
Conduct thorough root cause analysis of production issues and implement preventative measures to improve system resilience, demonstrating strong problem-solving skills.
Manage CI/CD platforms, Linux infrastructure, and contribute to capacity planning and operational runbooks.
Design and implement proactive service monitoring, alerting, and trend analysis to maintain service availability and performance SLAs.
Participate in an on-call rotation to support critical applications and services, responding to and resolving incidents efficiently.
Contribute to comprehensive documentation related to infrastructure design, deployment, and operational procedures.
Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
6+ years of DevOps engineering experience on mission-critical, enterprise-level systems in a hybrid (both cloud and on-prem) environment.
3+ years of hands-on experience with cloud environments, preferably Google Cloud Platform (GCP).
Expertise in configuration management and Infrastructure-as-Code using frameworks such as Terraform and Ansible.
Strong programming/scripting knowledge in languages like Python, Bash, or Go for infrastructure automation.
Demonstrated experience with CI/CD pipelines (e.g., GitHub, Jenkins, Artifactory) and a strong foundation in Linux/Unix administration.
Preferred Qualifications
Experience with containerization and orchestration technologies, particularly Kubernetes.
Hands-on experience with monitoring and observability tools such as Datadog, Grafana, or Prometheus.
Understanding of networking principles including firewalls, load balancers, and complex network designs.
A curious and positive mindset with a passion for applied learning and challenging existing processes for continuous improvement.
This position is open to all candidates.
 
8637997
7 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced Senior DevOps Engineer who is passionate about software design, development, and deployment to join our DevOps team in the Posture R&D Group. The role goes beyond traditional DevOps: it focuses on building the infrastructure and platforms that enable AI models and autonomous agents to run in production at scale, across both cloud and on-prem environments. The job involves writing production-grade, modern DevOps solutions that will be shipped to cloud and on-prem environments, while working with cutting-edge technologies and architectures that push the boundaries of AI-driven cybersecurity systems.
Responsibilities
Build the best solutions for our production platform, enabling high-scale, AI-driven systems and agents to operate reliably in production-scale environments
Everything-as-code approach (IaC): Run our infrastructure with a wide range of technologies, including Terraform and Kubernetes.
Build and maintain tools for automation, deployment, monitoring, and operations, with a strong focus on scalability, resilience, and observability of distributed systems.
Troubleshoot complex issues in our development, production, and test environments, including large-scale, distributed, and AI-integrated systems
Excellent communication and people skills.
Requirements:
8+ years of experience with DevOps technologies.
Extensive background leading the design, build, and evolution of end-to-end DevOps platforms, including infrastructure, tooling, and operational frameworks across the software lifecycle.
Deep expertise with one of the major cloud providers: AWS (preferred), GCP, Azure.
Extensive experience with modern deployment strategies (GitOps, blue/green, canary, Kubernetes-based deployments)
Strong experience designing and optimizing end-to-end CI/CD pipelines, enabling high velocity, reliable software delivery.
Experienced with bootstrapping projects, introducing new technologies and building systems from scratch.
Background in working with AI components and understanding the challenges of bringing AI workloads into production.
Good coding capabilities (Python, Bash, etc.)
Experience mentoring engineers, leading cross-functional initiatives, and influencing technical direction.
Advantages:
Experience with on-prem environments and solutions.
Prior experience with endpoint security products (agents, sensors, collectors).
Tech Stack: AWS, Kubernetes, EKS, Jenkins, IaC, GitHub, Terraform, Python, Docker, ArgoCD, MongoDB, RabbitMQ, Redis, Go, Neo4J, AI, and more.
This position is open to all candidates.
 
8664565
05/05/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Site Reliability Engineer on the SASE Platform team, you will play a critical role in building and operating highly available, secure, and globally distributed services. Your mission is to ensure our cloud-native security and networking platform is reliable, scalable, and performant from day one, protecting the users, applications, and data for the world's largest enterprises as they adopt cloud, remote work, and AI.
Your Impact:
Proactively collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages.
Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance.
Build and operate automation for provisioning, deploying, and managing global infrastructure using Infrastructure as Code (IaC).
Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments.
Drive observability best practices, including metrics, logs, traces, and SLIs/SLOs to enable data-driven incident analysis.
Participate in on-call rotations, reducing mean time to resolution (MTTR) through automation and proactive reliability improvements.
Challenge existing processes by championing reliability, security, and operational maturity across the organization.
Requirements:
Your Experience
5+ years of experience working with Unix/Linux systems, including shell, tools, networking, and kernel concepts.
2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms.
Proven experience operating workloads in public cloud environments (e.g., AWS, GCP, Azure) at scale.
Proficiency in building automation and tools in at least one scripting or programming language (e.g., Python, Go, Java).
Strong experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Bachelor's degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience.
Nice to have:
Deep expertise in designing and operating monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK Stack).
Advanced networking expertise, including TCP/IP, DNS, BGP, routing, and cloud networking concepts relevant to SASE architectures.
Prior experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms.
Familiarity with using AI/LLM technologies to improve operational workflows (e.g., incident analysis, automation).
This position is open to all candidates.
 
8638041
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are seeking a skilled Site Reliability Engineer (SRE) to join our team and help build, maintain, and improve the reliability, scalability, and performance of our systems. As an SRE, you will be responsible for owning and evolving our observability tooling, using real-time insights to make data-driven decisions about system behavior and performance at runtime, and implementing automation to enhance our infrastructure. This role involves collaborating across teams to ensure a robust and efficient technology stack supporting mission-critical systems.

You will:
Proactively enhance system reliability, scalability, and performance through automation, monitoring, and capacity planning.

Develop and maintain observability systems, including distributed tracing, logging, and metrics platforms.

Establish and maintain organizational standards for monitoring, leveraging tools like Prometheus, Grafana, and OpenTelemetry.

Use observability tools to analyze runtime behavior and make data-driven decisions that improve system performance and reliability.

Partner with development teams to integrate reliability best practices into the software development lifecycle.

Manage infrastructure at scale in cloud services (AWS advantage) and platforms like Kubernetes.

Optimize resource utilization to reduce costs while maintaining service quality.

Contribute to the development and adoption of AI-driven tools and practices for engineering and observability.
What success looks like:

You are a trusted technical leader within the organization, mentoring others and helping shape the evolution of our SRE and observability practices.

You reduce the frequency and impact of production incidents by building resilient systems and using observability insights to address issues before they escalate.

You significantly improve observability: key metrics, logs, and traces are consistently available, well instrumented, and actionable across all critical services, enabling fast, informed decisions and rapid resolution of issues.

You are actively engaged in proactive problem solving: you identify and resolve systemic issues before they impact customers, and continuously refine SLOs and SLIs to reflect evolving business needs.
Requirements:
We are looking for:

At least 6 years of experience as an SRE or DevOps engineer.

Strong experience with Observability Tools such as OpenTelemetry, Grafana, Prometheus, and ELK stack (Elasticsearch, Logstash, Kibana).

In-depth experience with Cloud Platforms: AWS services, including EC2, S3, RDS, and CloudFormation/Terraform for infrastructure-as-code.

Strong experience working in Kubernetes environments, with a focus on Helm for deployment and configuration management

Experience working with AI and LLM tools such as Cursor, Claude Code or similar.

Proficiency in scripting and/or development languages such as Bash or Python.

Thorough understanding of CI/CD pipelines and automation tools.

Strong experience with automation tools like Terraform and/or Ansible, and understanding of Infrastructure as Code.

Solid troubleshooting and debugging skills.

A team player with a strong can-do mentality.
This position is open to all candidates.
 
8656402
21/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
A global leader in performance marketing is looking for a talented DevOps Engineer to join us on our mission to simplify decision-making for millions!
Responsibilities:
Design, own, and evolve DevOps tooling, CI/CD architecture, and infrastructure automation strategies across multi-cloud environments (AWS & GCP), supporting high-performance, resilient production systems.
Manage Kubernetes clusters (EKS, GKE) and containerized microservices at scale, leveraging Helm, IaC, and other cloud-native technologies.
Collaborate with engineering and data teams to optimize cloud-native architectures for performance, cost-efficiency, scalability, and high availability.
Automate infrastructure and pipeline workflows using Python, Bash, and Groovy, with IaC tools like Terraform and CloudFormation, and CI/CD platforms such as Jenkins and GitHub Actions.
Support data workflows and ML deployments using orchestration tools like Airflow and CI/CD for data pipelines.
Work with AI-native tooling (e.g., MCP, agent frameworks, Cursor, OpenAI, Gemini and Vertex).
Bring out-of-the-box thinking, excellent problem-solving skills, and the ability to debug complex systems.
Requirements:
3+ years of hands-on experience with AWS in production environments, with strong working knowledge of Linux-based systems for deployment, debugging, and automation.
3+ years of DevOps experience supporting production-grade systems with high availability, scalability, and operational reliability.
Strong expertise in Kubernetes-based orchestration (EKS, KOps, GKE).
Extensive experience with CI/CD tools such as Git, GitHub, Jenkins, GitHub Actions, and Nexus.
Proficiency in scripting/programming languages, including Bash, Python, or Groovy, for automating infrastructure and pipelines.
Experience with Infrastructure as Code (IaC) tools like Terraform and CloudFormation.
Experience with logging, metrics, and observability stacks, such as Datadog, Telegraf, Elasticsearch, Kibana, Prometheus, and Grafana.
Ability to troubleshoot and debug complex, distributed systems across multiple cloud environments.
Only candidates meeting the above requirements will be considered.
This position is open to all candidates.
 
8660493