Jobs » Computers and Networks » DevOps Engineer

31/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a DevOps Engineer, you will be responsible for the reliability, scalability, and efficiency of our SaaS products. Your success will be measured by your ability to achieve the following:

First 3 Months: Master our GitOps-based deployment pipelines. You will be expected to independently manage and troubleshoot deployments using ArgoCD and Kargo, and contribute to the team's on-call rotation.

First 6 Months: Enhance our CI/CD processes and workflow efficiency. You will lead the project to reduce average build and deployment times by 20% by optimizing GitHub Actions, Helm charts, and introducing initial AI-assisted automation.

First 12 Months: Improve system scalability and reliability. You will design and implement infrastructure enhancements using Terraform to support a 25% increase in customer workload while maintaining 99.9% uptime.

Core Responsibilities

Deployment Pipeline Management: Build and maintain our GitOps-based deployment pipelines to ensure a 99% success rate for all deployments and reduce manual intervention by 30% within the first year.

Infrastructure Management: Manage and scale our Kubernetes infrastructure on GCP, with a goal of optimizing resource utilization to achieve a 15% cost reduction in our GCP spending over the next 18 months.

Automation and CI/CD: Enhance and maintain our GitHub Actions CI/CD pipelines to decrease the lead time for changes to production by 25% within the first year.

AI-Assisted Workflow Integration: Integrate AI-assisted tooling into day-to-day DevOps and engineering workflows to improve productivity, scalability, and operational efficiency. You will leverage AI tools to generate initial configuration drafts, validate infrastructure code, and utilize AI-driven automation to reduce repetitive manual tasks by 20% within the first 6 months, accelerating engineering execution while maintaining high-quality standards.

System Reliability: Proactively improve system reliability and availability, with the objective of reducing the number of critical production incidents by 50% through improved monitoring, logging, and alerting within 12 months.
Requirements:
What We're Looking For

3+ years in DevOps/SRE: You have proven experience in a high-growth SaaS environment and can hit the ground running to help us scale our platform.

Google Cloud Platform (GCP): You possess a deep understanding of GCP services, particularly GKE, which is essential as our entire infrastructure is on GCP.

ArgoCD and Kargo: You have hands-on experience with GitOps and progressive delivery, which is key to our goal of achieving faster, more reliable deployments.

Kubernetes and Helm: You bring strong experience in managing and deploying applications on Kubernetes, as you will be responsible for the container orchestration of our microservices.

Terraform: You have expertise in infrastructure as code, which will be crucial for our project to scale our infrastructure and reduce costs.

Forward-Thinking Automation: You have a strong interest in or experience with leveraging emerging technologies, including AI tools, to modernize workflows, validate code, and eliminate repetitive manual tasks.
This position is open to all candidates.
 
Similar jobs that may interest you
7 days
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior DevOps Engineer within the XSPM team, you will be a critical, go-to technical expert responsible for the health, performance, and evolution of our database and infrastructure systems. When production databases degrade or behave unexpectedly, you are the person who dives deep, investigating root causes hands-on, understanding the underlying mechanics of the problem, and designing lasting solutions. Your mastery of database systems makes you the authority the team relies on to diagnose complex performance issues, architect better data solutions, and ensure our infrastructure scales with confidence.

Beyond databases, you will drive our DevOps practices end-to-end - CI/CD pipelines, infrastructure automation, and operational reliability across the XSPM platform. This is a high-impact, highly visible role at the intersection of database engineering and DevOps, where your expertise directly shapes how the team delivers and operates at scale.

We're a highly collaborative, friendly, inclusive and diverse group that prizes collaboration over competition. We provide opportunities to learn new skills, mentor fellow engineers, and contribute to the direction of both the team and the products for which we're responsible. We work in a distributed, high-trust environment where you manage your own time and have the flexibility to balance your work and personal life.

What You Will Do:

Serve as the team's database expert, the first person to investigate, diagnose, and resolve complex performance problems across our production database systems (MongoDB, OpenSearch, PostgreSQL, Cassandra).

Perform deep-dive root cause analysis on database performance issues, understanding query execution internals, resource consumption patterns, cluster behavior, and system-level interactions to identify the real source of problems, not just symptoms.

Design and propose better database architectures and solutions, recommending when to re-architect data models, migrate workloads, introduce new technologies, or redesign how services interact with their data layer.

You will make every effort within the team to ensure the data architecture is well designed.

Own capacity planning, scaling strategies, and high-availability designs for database clusters, ensuring systems are built to handle the team's growth trajectory.

Act as the bridge between development and infrastructure, advising engineers on how their application patterns impact database performance and guiding them toward sustainable solutions.

Build and maintain CI/CD pipelines, infrastructure-as-code (Terraform, Helm, Kubernetes manifests), and automated deployment workflows for the XSPM team's services.

Design and manage observability stacks, dashboards, alerting rules, and SLOs, to maintain best-in-class availability for critical data pipelines and services.

Drive infrastructure automation to reduce operational toil, including automated scaling, self-healing systems, and configuration management.

Participate in on-call rotations, incident response, and post-incident reviews, driving root-cause analysis and long-term reliability improvements.

Evaluate and adopt new database technologies and infrastructure tooling that align with the team's evolving data architecture needs.
Requirements:
7+ years of experience in DevOps, SRE, DBA, or infrastructure engineering, with significant hands-on responsibility for production database systems at scale.

Expert-level knowledge of a common database such as MongoDB, OpenSearch, or PostgreSQL, with a deep understanding of its internals, performance characteristics, replication, and sharding, and the ability to diagnose and solve complex issues from first principles.
This position is open to all candidates.
 
05/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior IT SRE Engineer, you will be a key player in ensuring the reliability, scalability, and performance of our critical IT infrastructure. You will leverage SRE principles and an automation-first mindset to build and maintain resilient hybrid cloud environments. This role is ideal for a candidate who thrives in a fast-paced, innovative setting and is passionate about solving complex challenges with cutting-edge technology.
Key Responsibilities
Provision, configure, and support resilient hybrid cloud deployment architectures using an Infrastructure-as-Code framework.
Proactively collaborate with development teams to ensure new applications are production-ready, scalable, and reliable from inception.
Develop and maintain tools and frameworks to automate operational tasks, including deployment, monitoring, and recovery.
Conduct thorough root cause analysis of production issues and implement preventative measures to improve system resilience, demonstrating strong problem-solving skills.
Manage CI/CD platforms, Linux infrastructure, and contribute to capacity planning and operational runbooks.
Design and implement proactive service monitoring, alerting, and trend analysis to maintain service availability and performance SLAs.
Participate in an on-call rotation to support critical applications and services, responding to and resolving incidents efficiently.
Contribute to comprehensive documentation related to infrastructure design, deployment, and operational procedures.
Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
6+ years of DevOps engineering experience on mission-critical, enterprise-level systems in a hybrid (both cloud and on-prem) environment.
3+ years of hands-on experience with cloud environments, preferably Google Cloud Platform (GCP).
Expertise in configuration management and Infrastructure-as-Code using frameworks such as Terraform and Ansible.
Strong programming/scripting knowledge in languages like Python, Bash, or Go for infrastructure automation.
Demonstrated experience with CI/CD pipelines (e.g., GitHub, Jenkins, Artifactory) and a strong foundation in Linux/Unix administration.
Preferred Qualifications
Experience with containerization and orchestration technologies, particularly Kubernetes.
Hands-on experience with monitoring and observability tools such as Datadog, Grafana, or Prometheus.
Understanding of networking principles including firewalls, load balancers, and complex network designs.
A curious and positive mindset with a passion for applied learning and challenging existing processes for continuous improvement.
This position is open to all candidates.
 
04/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a skilled and motivated DevOps Infrastructure Engineer to join our DevOps Infra team. Our team is responsible for managing and evolving the cloud-native infrastructure that powers our microservices architecture. Core responsibilities span our EKS-based Kubernetes platform, ArgoCD-driven GitOps pipelines, infrastructure observability, Helm-based deployments, and mission-critical web services running on AWS.
We are looking for a DevOps engineer who can hit the ground running, take ownership of critical infrastructure components, and contribute meaningfully from day one. The ideal candidate brings deep Kubernetes expertise, strong hands-on experience with observability tooling, and the maturity to work independently.
In this role, you will be responsible for:
Managing and evolving our EKS-based Kubernetes platform and Helm-based deployment pipelines
Owning and maintaining GitOps workflows using ArgoCD, including troubleshooting sync and rollout issues
Designing, building, and maintaining observability solutions using Prometheus, VictoriaMetrics, and Grafana
Writing and maintaining infrastructure as code using Terraform, including modules, remote state, and CI/CD automation
Taking full ownership of AWS infrastructure components - including networking, compute, IAM, and storage - ensuring reliability, security, and operational excellence across environments
Collaborating with developers and SREs to support reliable, scalable, and secure AWS infrastructure
Requirements:
1-3 years of hands-on experience in DevOps or infrastructure engineering roles.
Deep expertise in Kubernetes and Helm, including production-grade deployments and live incident troubleshooting.
Strong proficiency in Terraform or equivalent IaC tooling
Solid working knowledge of AWS core services (EC2, IAM, S3, VPC, CloudWatch, EKS).
Practical experience with Prometheus, VictoriaMetrics, Grafana, and alerting stack design.
Proven ability to work independently, take ownership end-to-end, and communicate effectively across engineering teams.
Agentic DevOps experience working with common AI assistant tools, MCPs and Agents.
Advantages:
Experience with cloud cost optimization strategies and tooling.
Background in cloud-native security practices (RBAC, policy enforcement, SSL, mTLS, etc.).
Prior involvement in designing or operating high-availability, fault-tolerant systems.
Experience with nginx and IIS web servers.
This position is open to all candidates.
 
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
Required: AI Infrastructure & Reliability Engineer
What this role is really about
You'll join a 3-person platform team within our Business Technology group, owning the internal infrastructure that our AI platform and its users depend on. This isn't a product engineering role, and it isn't ticket work or babysitting pipelines someone else built. You're building and operating the internal foundation that the company runs on. The work covers the full stack of platform engineering: core cloud infrastructure (AWS, Kubernetes, IaC), CI/CD pipelines, AI-driven infrastructure components, and the SRE and observability practice that keeps it all honest - metrics, alerting, incident response, and reliability standards. As our AI capabilities grow, so does the complexity underneath them, and staying ahead of that is central to the role. If you treat infrastructure as a product (reusable, automated, observable, and built to last), this is your kind of role.
Job responsibilities
DevOps & AI-Driven Infrastructure - own CI/CD, deployment processes, and release reliability. Build and operate cloud infrastructure that is automated, intelligent, and continuously self-improving - not just managed.
Design and build our Terraform repository and IaC pipeline from scratch - AI-assisted generation, drift detection, and policy enforcement built in.
Build AI-driven GitHub Actions pipelines - automated code review, risk assessment, and intelligent deployment decisions.
Manage Kubernetes workloads across AWS accounts - zero downtime, fully automated, nothing left behind.
Embed AI into the operational layer - proactive drift detection, automated remediation, and intelligent scaling toward a self-healing runtime.
Reliability & SRE - improve uptime, resilience, and incident response.
Define and enforce SLOs/SLIs, error budgets, and on-call practices.
Lead incident response, postmortems, and systemic reliability improvements.
Own AI-specific reliability: model latency SLOs, token quota monitoring, rate limit handling, fallback and retry strategies, and cost-per-request alerting.
Observability & Telemetry - increase visibility, reduce noise, improve troubleshooting.
Establish and continuously evolve the observability stack: metrics, logs, distributed tracing, and alerting tuned for both application and AI workloads.
AI / LLM Operations - bringing AI systems to production and operating them at scale, with a focus on reliability, performance, and trust.
Own the AI infrastructure layer: rate limits, quota management, latency SLOs, and fallback strategies (retries, circuit breakers).
Operate LLM APIs in production with resilience and cost attribution per team/model.
Requirements:
2-4 years of hands-on DevOps, SRE, or infrastructure engineering experience in production SaaS environments.
Strong AWS experience: multi-account architecture, cross-account IAM, serverless and event-driven services (Lambda, SQS, SNS, EventBridge), and EKS cluster management.
Proven Kubernetes experience in production, including cross-account migrations and stateful workload management.
Proficiency with Terraform - repository structure design, module architecture, and CI/CD pipeline implementation.
Hands-on experience building and maintaining GitHub Actions pipelines for end-to-end CI/CD workflows.
Working Python proficiency for scripting, internal tooling, and workflow automation.
Practical experience implementing observability stacks from scratch: metrics, logging, distributed tracing, and alerting.
Experience owning reliability practices: SLOs, incident response, and postmortem culture.
Nice to have
Hands-on experience operating LLM APIs in production: rate-limit and quota management, cost attribution per team/model, latency monitoring, and resilience patterns (retries, fallbacks, circuit breakers).
FinOps experience across cloud, AI, and observability spend.
Experience introducing self-healing or auto-remediation patterns in production.
This position is open to all candidates.
 
05/05/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
As a DevOps Engineer, you won't just "maintain" infrastructure; you will own it.
You will lead the charge in cloud security automation, using an AI-first mindset to drive extreme efficiency. Your mission is to identify, rectify, and prevent misconfigurations and bottlenecks in advance. You are expected to operate with high autonomy, shaping the future of secure cloud environments by being a proactive force, not a reactive one.
Key Responsibilities:
Innovate & Implement: Design and implement cloud infrastructure solutions with a focus on GCP, including compute reservations, BigQuery, Pub/Sub, GCS, and networking.
Release Engineering: Lead weekly production upgrade cycles across global multi-region environments, including branch-out processes, hotfix management, version gating, and rollback procedures.
Service Deployment & Lifecycle: Own end-to-end service deployments on Kubernetes - from Helm chart creation and Flux/GitOps configuration to production rollout and scaling.
Database Administration: Manage and optimize database infrastructure including MySQL, Redis, BigQuery, Neo4j, Scylla, MongoDB, and PostgreSQL in production environments.
AI Integration: Utilize AI-supporting tools to optimize coding, automate repetitive tasks, and solve complex architectural puzzles. Contribute to AI-native infrastructure such as Vertex AI and AI Gateway services.
Tenant & Customer Infrastructure: Manage customer-specific infrastructure including dedicated compute reservations, tenant provisioning, licensing configuration, and feature flag management across multi-tenant and single-tenant environments.
Infrastructure Automation & Tooling: Develop internal CLI tools and automation scripts (Python) to streamline operations.
Cost Optimization: Drive cloud cost optimization through resource right-sizing, reservation management, database disk reduction, and efficiency improvements.
Service Reliability: Enhance uptime by establishing SLAs, setting up comprehensive monitoring (Prometheus, Grafana, Stackdriver), and participating in the production on-call rotation (including off-hours support).
Requirements:
3+ years in DevOps/SRE with a focus on multi-region production environments.
AI-First Workflow: Must be proficient in using LLM-based agents (Cursor, Claude Code, etc.) for coding and architecture.
Cloud & Containers: Deep expertise in GCP (GKE), including Kubernetes orchestration (HPA, Node Pools) and Terraform for IaC.
Automation: Strong Python/Bash skills and experience with GitOps workflows (Flux/ArgoCD) and CI/CD (GitLab/Jenkins).
Data & Infrastructure: Experience managing production databases (SQL/NoSQL) and standard Linux/Networking troubleshooting.
Nice to have:
Experience with AI-native infrastructure (Vertex AI, AI Gateways).
Experience with observability stacks (Prometheus/Grafana) and managing multi-tenant SaaS platforms.
Willingness to participate in on-call rotations.
This position is open to all candidates.
 
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Engineer to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will be part of a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring 4+ years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.

What will you do?

Automation & Infrastructure
- Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.
- Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.
- Drive the adoption of infrastructure-as-code practices across the organization.
- Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.

Monitoring & Observability
- Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.
- Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.
- Define and track SLIs, SLOs, and error budgets across key services.
- Partner with development teams to embed observability earlier in the software development lifecycle.

Database & Platform Support
- Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.
- Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

4+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.


Ability to support the following:
Experience with cloud providers - AWS, GCP, or Azure.
Exposure to containerization technologies such as Docker and Kubernetes.
Familiarity with infrastructure provisioning using Terraform.
Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.
Exposure to and experience with migrating/building AI tools to improve processes.
This position is open to all candidates.
 
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Team Lead to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will lead a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 3-4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.



What will you do?

Leadership & Team Management

Lead, mentor, and grow a team of SREs, providing technical direction, career development guidance, and day-to-day management.

Own the team roadmap for reliability, observability, and automation initiatives - prioritizing work, removing blockers, and driving delivery.

Conduct regular 1:1s, performance reviews, and hiring processes to build and sustain a high-performing team.

Foster a culture of operational excellence, blameless post-mortems, and continuous improvement.

Act as an escalation point for complex incidents and reliability issues, leading post-incident reviews and ensuring follow-through on action items.


Automation & Infrastructure

Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.

Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.

Drive the adoption of infrastructure-as-code practices across the organization.

Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.


Monitoring & Observability

Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.

Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.

Define and track SLIs, SLOs, and error budgets across key services.

Partner with development teams to embed observability earlier in the software development lifecycle.


Database & Platform Support

Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.

Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

Experience & Leadership

3-4+ years of experience in a people management or team lead capacity within SRE, DevOps, or infrastructure engineering.

5-8+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Proven track record of building, coaching, and retaining high-performing engineering teams.

Experience owning an engineering roadmap and driving cross-functional reliability initiatives.



Technical Skills

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.



Ability to support the following:

Experience with cloud providers - AWS, GCP, or Azure.

Exposure to containerization technologies such as Docker and Kubernetes.

Familiarity with infrastructure provisioning using Terraform.

Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.

Exposure to and experience with migrating/building AI tools to improve processes.
This position is open to all candidates.
 
13/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
At UVeye, we're on a mission to redefine vehicle safety and reliability on a global scale. Founded in 2016, we have pioneered the world's first fully automated suite of vehicle inspection systems. At the heart of this innovation lies our advanced AI-centric technology, representing the pinnacle of computer vision, machine learning, and generative AI within the automotive sector. With over $380M in funding and strategic partnerships with industry giants such as Toyota, Amazon, General Motors, Volvo, and Hertz, our technology is utilized in manufacturing plants, dealerships, wholesale auctions, delivery fleets, seaports, and more. Our growing global team of over 200 employees is committed to creating a workplace that celebrates diversity, encourages teamwork, and strives for excellence.
We are looking for a driven, systems-minded Release Engineer to join our AI-Ops team. In this role, you will be the execution layer of our delivery pipeline—the technical gatekeeper who owns the safe, predictable deployment of software and AI models to global edge and cloud systems. You will balance high-speed deployment velocity with rock-solid operational stability. But you won't just be deploying code; you'll be acting as an internal project manager, driving our organizational roadmap by building Agentic AI tools and automating processes to scale our delivery capabilities continuously.
A day in the life and how you’ll make an impact:
* Act as the technical gatekeeper, validating and transitioning versions through strict release gates. Enforce rigorous governance and ensure strict "Definition of Done" criteria are met.
* Lead risk-mitigated rollouts across diverse global hardware environments.
* Monitor real-time deployment performance, Quality of Service (QoS), and algorithmic accuracy, making decisive, crisis-resilient calls to proceed, pause, or rollback to prevent regressions.
* Define and execute comprehensive test plans that verify cross-team dependencies. Validate that new versions meet detection accuracy requirements without degrading infrastructure.
* Triage complex production failures. Look beyond immediate issues to identify root causes using system metrics, logs, and container states, delivering actionable evidence to R&D.
* Build and integrate Agentic AI and LLM-based tools to accelerate log analysis, risk assessment, and deployment troubleshooting.
* Architect automated workflows to eliminate manual overhead and enhance system observability with robust monitors and dashboards.
Requirements:
* 2+ years of experience in Release Engineering, DevOps, QA, or a similar operations-centric role.
* Strong systems-level troubleshooting skills with the ability to analyze data, system metrics, logs, and container states.
* Demonstrated ability to maintain decisive control and make smart risk-management decisions during live, high-stakes deployments.
* Experience enforcing data integrity and process governance using Jira or similar issue-tracking tools.
Bonus if you have:
* Experience building or integrating AI, LLMs, or Agentic workflows into operational tooling.
* Familiarity with deploying software to both cloud environments and distributed edge hardware.
* Experience with performance benchmarking (throughput, bandwidth, algorithmic accuracy).
* Prior experience acting as a project manager for internal engineering initiatives or tools.
Why UVeye:
* Pioneer Advanced Solutions: Harness cutting-edge technologies in AI, machine learning, and computer vision to revolutionize vehicle inspections.
* Drive Global Impact: Your innovations will play a crucial role in enhancing automotive safety and reliability, impacting lives and businesses on an international scale.
* Career Growth Opportunities: Participate in a journey of rapid development, surrounded by groundbreaking advancements and strategic industry partnerships.
This position is open to all candidates.
 
8649166
21/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a talented DevOps engineer to join our team.
As a DevOps Engineer, you will work on our highly scalable gaming backend and our massive data processing pipeline, which ingests billions of events daily. You will make sure our systems and infrastructure scale and perform at optimal levels.
Responsibilities
Responsible for DevOps, Cloud and Monitoring environments - Build infrastructure, tools and services to improve delivery and availability
Be part of product architectural and infrastructure design - Design and leverage the backend infrastructure and its security aspects
Responsible (together with the team members) for the platform, uptime, infrastructure, scale, and costs.
Requirements:
3+ years of experience in a DevOps role, with a comprehensive understanding of practices that promote streamlined development and operational efficiency
Strong track record of developing custom AI-driven automations or agents to accelerate DevOps workflows, enhance system monitoring and optimize cloud costs
Proven success in maintaining a production-ready Kubernetes cluster, ensuring its high availability and scalability to support large-scale operations
Extensive management experience of multiple AWS accounts spanning various regions, proficient in services like Lambda, CloudFront, SQS, VPC, and IAM
Expertise in utilizing CI/CD platforms such as GitHub Actions, Jenkins and ArgoCD
Solid experience with Terraform
An innovative thinker with a propensity for fast learning, known for possessing a strong sense of ownership and accountability in delivering quality solutions
Advantage:
Advanced skills in managing, installing, and configuring Kafka solutions like MSK and Apache Kafka
Experience with Redis, AWS ElastiCache, or any other key-value databases
Cloudflare management
This position is open to all candidates.
 
8660454
13/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Senior DevOps Engineer who owns infrastructure end-to-end, ships with confidence, and raises the reliability bar without being asked. You will work closely with backend, full-stack, and security teams to build a highly available, reliable, and secure production environment. If you get energized by building systems that scale, pipelines that teams love, and platforms that never sleep - this role is for you.
Key Responsibilities
Own and evolve our cloud infrastructure across multi-region production environments, end-to-end.
Lead our GitOps deployment model - designing and maintaining declarative, automated deployment workflows with zero manual gates.
Build, maintain, and optimize CI/CD pipelines with a strong focus on developer experience, reliability, and speed.
Drive DevSecOps "shift-left" culture: integrate security scanning, SBOM generation, and supply chain hardening directly into every pipeline.
Develop automation frameworks for provisioning, scaling, observability, and incident response - increasingly leveraging AI-assisted tooling to reduce toil.
Operate and improve our observability platform: metrics, logs, alerting, dashboards, SLOs/SLIs, and on-call tooling.
Champion zero-trust secrets management and credential-less authentication patterns across the stack.
Partner with architects and engineering leadership on cloud cost optimization, availability, and performance.
Build internal tooling and automation that multiplies engineering velocity across the organization.
Requirements:
5+ years of hands-on DevOps experience in a SaaS product environment - Must.
Deep, hands-on AWS expertise; multi-cloud experience is a strong plus - Must.
Strong understanding of containers and orchestration - Docker, Kubernetes, including workloads, networking, service mesh (Istio), Helm/Kustomize, and autoscaling (KEDA, HPA, VPA).
Strong experience with:
Infrastructure-as-Code - Terraform, Crossplane, and/or cloud-native declarative tooling.
GitOps principles and tooling (ArgoCD or equivalent).
CI/CD platforms - building reusable, scalable, security-hardened pipeline templates (GitHub Actions or equivalent).
Secrets management - dynamic injection, IRSA/Workload Identity, zero long-lived credentials.
Experience embedding security into CI/CD: vulnerability scanning, SBOM generation, and supply chain security (Trivy, Grype, Syft, JFrog Xray).
Solid observability fluency - OpenTelemetry, Prometheus, Grafana, Datadog, ELK/OpenSearch, distributed tracing.
Exposure to AI/ML workloads or LLMOps infrastructure is a meaningful advantage - not required, but will set you apart.
FinOps mindset - you think about cloud spend as a product metric, not just a finance problem.
A clear communicator who can align engineers, security teams, and leadership around infrastructure decisions.
A builder and owner - you see the system, spot the gaps, and raise the bar without being asked.
This position is open to all candidates.
 
8650200