Principal DevOps Engineer - AI Platforms (Cortex)

1 day ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are establishing a new role focused on bridging AI capabilities with our internal DevOps platform. You will build AI-powered tools and services that run on top of the DevOps platform developed by the infrastructure team. This role combines hands-on DevOps/platform engineering work with close collaboration with software development teams, enabling them to build, deploy, and operate AI solutions efficiently, securely, and in alignment with organizational standards. You will act as a technical enabler, helping teams bring AI-driven applications into production while ensuring best practices across architecture, compliance, DevSecOps, FinOps, and AI governance. The role also includes direct contributions to the platform itself.
Your Impact
Enhance DevOps platform capabilities to support AI-based tools and workloads.
Work closely with development teams to enable and support AI application delivery end-to-end.
Assist in designing architecture and production-grade implementation of AI systems.
Lead and enforce DevSecOps practices, compliance requirements, and governance standards for AI solutions.
Develop platform components and internal services as part of the AI infrastructure layer.
Support production readiness, monitoring, performance, reliability, and cost optimization (FinOps).
Serve as a technical interface between multiple engineering teams.
Participate in future on-call rotations.
Requirements:
Your Experience
5-7 years of strong experience in DevOps / Platform Engineering roles.
Good understanding of the AI ecosystem and modern AI workflows.
Hands-on familiarity with AI tools such as Claude and other GenAI platforms.
Ability to present a personal AI-related project.
Solid understanding of production systems and modern CI/CD practices.
Experience with cloud infrastructure, automation, and deployment pipelines.
Strong communication skills and ability to work across multiple stakeholders.
Nice to have
Exposure to DevSecOps, compliance, and FinOps practices.
Experience building or integrating AI-driven systems in production environments.
Experience building tools or automations using Claude or similar GenAI tools.
This position is open to all candidates.
 
8705680
Similar jobs that may interest you
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Engineer to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will be part of a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.

What will you do?

Automation & Infrastructure
- Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.
- Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.
- Drive the adoption of infrastructure-as-code practices across the organization.
- Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.

Monitoring & Observability
- Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.
- Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.
- Define and track SLIs, SLOs, and error budgets across key services.
- Partner with development teams to embed observability earlier in the software development lifecycle.

Database & Platform Support
- Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.
- Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

4+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.


Ability to support the following:
Experience with cloud providers - AWS, GCP, or Azure.
Exposure to containerization technologies such as Docker and Kubernetes.
Familiarity with infrastructure provisioning using Terraform.
Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.
Exposure to and experience with migrating or building AI tools to improve processes.
This position is open to all candidates.
 
8662378
25/05/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Full Stack Engineer who enjoys building end-to-end systems in fast-moving environments. You'll work across backend services, frontend applications, and AI-integrated workflows to shape the foundation of our company's AI-powered platform.
The AI Application Foundation team is responsible for the core platform powering our company's next-generation AI applications and agentic systems. We build the infrastructure, developer tooling, orchestration layers, and user-facing experiences that transform advanced AI capabilities into production-grade cybersecurity products.
If you enjoy owning features from architecture to production, care deeply about developer and user experience, and want to work on meaningful AI infrastructure at scale, this role is for you.
Responsibilities
Design and build full-stack applications powering AI-driven cybersecurity workflows
Develop scalable backend services, APIs, and orchestration layers for applications and agent systems
Build intuitive frontend experiences for complex workflows, data exploration, and operational tooling
Collaborate with AI and platform engineers to integrate LLMs, retrieval systems, and agent capabilities into production applications
Own features end-to-end, from technical design and implementation to deployment, monitoring, and ongoing improvement
Improve platform reliability, observability, and performance across frontend and backend systems
Contribute to engineering standards, architecture reviews, testing practices, and CI/CD workflows
Work closely with Product and Design teams to rapidly iterate on new AI product experiences.
Requirements:
5+ years of experience as a Full Stack Engineer in production environments
Strong backend development experience with Python, Node.js, Go, or similar modern backend technologies
Frontend experience with React, TypeScript, and modern frontend frameworks
Experience designing and consuming REST APIs, event-driven systems, and distributed services
Solid understanding of cloud-native architectures and scalable web applications
Experience working with or integrating AI systems, LLMs, or agent-based architectures.
Strong product mindset and ability to work closely with cross-functional teams
Excellent debugging, problem-solving, and communication skills
Nice to Have
Experience with agent orchestration, context management, or retrieval-augmented architectures
Experience with Kubernetes, Docker, Terraform, or modern DevOps practices
Background in cybersecurity, data-intensive systems, or highly regulated environments.
This position is open to all candidates.
 
8664622
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Team Lead to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will lead a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 3-4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.



What will you do?

Leadership & Team Management

Lead, mentor, and grow a team of SREs, providing technical direction, career development guidance, and day-to-day management.

Own the team roadmap for reliability, observability, and automation initiatives - prioritizing work, removing blockers, and driving delivery.

Conduct regular 1:1s, performance reviews, and hiring processes to build and sustain a high-performing team.

Foster a culture of operational excellence, blameless post-mortems, and continuous improvement.

Act as an escalation point for complex incidents and reliability issues, leading post-incident reviews and ensuring follow-through on action items.


Automation & Infrastructure

Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.

Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.

Drive the adoption of infrastructure-as-code practices across the organization.

Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.


Monitoring & Observability

Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.

Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.

Define and track SLIs, SLOs, and error budgets across key services.

Partner with development teams to embed observability earlier in the software development lifecycle.


Database & Platform Support

Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.

Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

Experience & Leadership

3-4+ years of experience in a people management or team lead capacity within SRE, DevOps, or infrastructure engineering.

5-8+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Proven track record of building, coaching, and retaining high-performing engineering teams.

Experience owning an engineering roadmap and driving cross-functional reliability initiatives.



Technical Skills

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.



Ability to support the following:

Experience with cloud providers - AWS, GCP, or Azure.

Exposure to containerization technologies such as Docker and Kubernetes.

Familiarity with infrastructure provisioning using Terraform.

Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.

Exposure to and experience with migrating or building AI tools to improve processes.
This position is open to all candidates.
 
8662300
2 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Grip Security is looking for a Senior Data Platform Engineer to join our community!
We are a fast-growing startup in the software-as-a-service and AI security industry. We provide innovative solutions for securing the whole organization-to-SaaS surface. (More details: https://grip.security)
Using the newest technologies, we're working on solving a huge problem all enterprises face today - to govern the accessibility of all their employees to all 3rd party vendors (GitHub, SendGrid, Atlassian, and thousands more!), and ensure there is no leftover/unwanted access to any of the organization's SaaS and AI assets. The SaaS and AI security field is complex and challenging; therefore, we're looking for super-talented people, who are not afraid of technical challenges and breaking down barriers to achieve good solutions.
The job
As a Senior Data Platform Engineer, you will play a key role in building and evolving Grip's modern data platform - the infrastructure that powers product features and analytics across the company.
You will focus on designing and operating scalable, reliable data systems and platform tooling that support our Data Lakehouse, enabling engineers, analysts and research teams to work with data efficiently and with minimal friction.
Responsibilities:
Design, build and operate a cloud-native modern data platform.
Develop and optimize data processing frameworks and pipelines across batch and streaming workloads.
Improve developer experience and platform usability through tooling and automation.
Lead and support large-scale data migrations and architectural improvements.
Drive best practices around infrastructure, CI/CD, testing, and system design.
Collaborate with developers, analysts, data scientists and other stakeholders to develop new products and features.
Contribute to a strong engineering culture of ownership, learning, and knowledge sharing.
Requirements:
5+ years of hands-on experience building scalable data infrastructure, particularly around data lake or data warehouse architectures.
Proven experience designing, building and operating production-grade systems and services.
Strong understanding of cloud infrastructure (AWS, GCP, or Azure) and hands-on experience with modern data platforms and tools (e.g., Spark, Kafka, Airflow, dbt, open table formats, or similar).
Strong programming skills in Python and SQL.
Independent, proactive, and ownership-driven mindset.
Background in data platform engineering, backend engineering, DevOps, or DBA - strong advantage.
Experience with containerization technologies - advantage.
This position is open to all candidates.
 
8703334
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior AI Engineer in the global CTO group, you will play a central role in building the next generation of AI-powered security capabilities across our product portfolio. This role is focused on rapid prototyping, experimentation, and innovation, turning emerging ideas into working product features that can scale across multiple products and technology stacks.

You will design and build AI-driven systems end-to-end, from agent-based workflows and model integrations to backend services, data pipelines, and product-facing capabilities. You will work closely with product, engineering, and research teams across the company to explore new use cases, validate ideas quickly, and bring impactful AI features into production.

This role is ideal for an experienced AI engineer who enjoys moving fast, working across boundaries, and building real production systems, not just experiments. Your work will directly influence how AI is embedded across our platforms and how customers experience secure AI at enterprise scale.
Requirements:
What You Will Need:
8 or more years of professional experience in software engineering, with significant hands-on experience in AI engineering or applied machine learning.
Strong expertise in building AI-powered systems, including LLM-based applications, agents, and orchestration workflows.
Proven experience integrating and operating AI and ML models in production environments.
Proficiency in multiple programming languages, including Python and at least one of the following: .NET, Go, or similar backend languages.
Experience working across diverse technology stacks and product architectures.
Solid understanding of backend system design, APIs, and distributed systems.
Strong experience with databases, including data modeling, performance considerations, and working with both relational and non-relational systems.
Practical experience with DevOps practices, including CI/CD pipelines, containerization, and cloud-based deployment.
Comfort working in cloud environments and modern infrastructure platforms.
Ability to rapidly prototype, iterate, and evolve ideas into production-ready features.
Strong ownership mindset, curiosity, and ability to collaborate across teams.

Nice to Have:
Experience designing and building AI agents for real-world workflows.
Hands-on experience training, fine-tuning, or evaluating machine learning models.
Familiarity with MLOps practices and model lifecycle management.
Experience working in security, cloud platforms, or large-scale SaaS products.
Ability to communicate complex AI concepts clearly to both technical and non-technical audiences.
This position is open to all candidates.
 
8676774
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
AI Infrastructure & Reliability Engineer
What this role is really about
You'll join a 3-person platform team within our Business Technology group, owning the internal infrastructure that our AI platform and its users depend on. This isn't a product engineering role, and it isn't ticket work or babysitting pipelines someone else built. You're building and operating the internal foundation that the company runs on. The work covers the full stack of platform engineering: core cloud infrastructure (AWS, Kubernetes, IaC), CI/CD pipelines, AI-driven infrastructure components, and the SRE and observability practice that keeps it all honest - metrics, alerting, incident response, and reliability standards. As our AI capabilities grow, so does the complexity underneath them, and staying ahead of that is central to the role. If you treat infrastructure as a product (reusable, automated, observable, and built to last), this is your kind of role.
Job responsibilities
DevOps & AI-Driven Infrastructure - own CI/CD, deployment processes, and release reliability. Build and operate cloud infrastructure that is automated, intelligent, and continuously self-improving - not just managed.
Design and build our Terraform repository and IaC pipeline from scratch - AI-assisted generation, drift detection, and policy enforcement built in.
Build AI-driven GitHub Actions pipelines - automated code review, risk assessment, and intelligent deployment decisions.
Manage Kubernetes workloads across AWS accounts - zero downtime, fully automated, nothing left behind.
Embed AI into the operational layer - proactive drift detection, automated remediation, and intelligent scaling toward a self-healing runtime.
Reliability & SRE - improve uptime, resilience, and incident response.
Define and enforce SLOs/SLIs, error budgets, and on-call practices.
Lead incident response, postmortems, and systemic reliability improvements.
Own AI-specific reliability: model latency SLOs, token quota monitoring, rate limit handling, fallback and retry strategies, and cost-per-request alerting.
Observability & Telemetry - increase visibility, reduce noise, improve troubleshooting.
Establish and continuously evolve the observability stack: metrics, logs, distributed tracing, and alerting tuned for both application and AI workloads.
AI / LLM Operations - bringing AI systems to production and operating them at scale, with a focus on reliability, performance, and trust.
Own the AI infrastructure layer: rate limits, quota management, latency SLOs, and fallback strategies (retries, circuit breakers).
Operate LLM APIs in production with resilience and cost attribution per team/model.
Requirements:
2-4 years of hands-on DevOps, SRE, or infrastructure engineering experience in production SaaS environments.
Strong AWS experience: multi-account architecture, cross-account IAM, serverless and event-driven services (Lambda, SQS, SNS, EventBridge), and EKS cluster management.
Proven Kubernetes experience in production, including cross-account migrations and stateful workload management.
Proficiency with Terraform - repository structure design, module architecture, and CI/CD pipeline implementation.
Hands-on experience building and maintaining GitHub Actions pipelines for end-to-end CI/CD workflows.
Working Python proficiency for scripting, internal tooling, and workflow automation.
Practical experience implementing observability stacks from scratch: metrics, logging, distributed tracing, and alerting.
Experience owning reliability practices: SLOs, incident response, and postmortem culture.
Nice to have
Hands-on experience operating LLM APIs in production: rate-limit and quota management, cost attribution per team/model, latency monitoring, and resilience patterns (retries, fallbacks, circuit breakers).
FinOps experience across cloud, AI, and observability spend.
Experience introducing self-healing or auto-remediation patterns in production.
This position is open to all candidates.
 
8659781
27/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Software Engineer with strong backend and cloud expertise to join our AI team. You'll design and build backend systems that integrate Generative AI capabilities, contributing across the stack from API development to cloud deployment.
This role is ideal for someone who thinks in systems, loves solving complex problems, and can go deep into implementation details while keeping the big picture in mind. You'll work closely with our team lead on architectural decisions and take ownership of project delivery.
Key Responsibilities
Systems Design & Development
Design and build complete backend systems: APIs, databases, queues, authentication, and integrations
Develop Python services with strong focus on code quality, performance, and reliability
Design solutions that scale across different client environments (cloud and on-prem)
AI Integration
Integrate LLMs (multi-agent systems) and GenAI capabilities into production systems
Build orchestration layers and workflows that combine AI with traditional backend logic
Optimize AI system performance, cost, and reliability
Cloud & Infrastructure
Deploy and manage systems across major cloud platforms (GCP, Azure, AWS)
Work with cloud-native services, databases, and infrastructure tools
Work with production systems, monitoring, and operational health
Collaboration & Delivery
Communicate with client development teams, DevOps, and business stakeholders
Guide frontend developers on API contracts and integration requirements
Contribute to technical decisions and solution design within the team
Client Engagement & Advisory
Lead technical discussions with clients and translate business needs into technical architectures
Present GenAI solutions, design decisions, and trade-offs to technical and non-technical stakeholders
Provide strategic technical guidance.
Requirements:
Must Have
3+ years of relevant software engineering experience
Working knowledge of GenAI concepts and LLM integration
Strong proficiency in Python for backend development
Solid experience designing and building production systems (APIs, Microservices, databases, message queues, authentication)
Hands-on experience with at least one major cloud platform (GCP, Azure, or AWS)
Understanding of software architecture patterns and best practices
Bachelor's degree in Computer Science or a related field, or equivalent practical experience
Advantages
Experience with multiple cloud platforms
Docker and Kubernetes experience
Experience with LLM provider APIs (OpenAI, Anthropic, Google) including function calling and streaming
Data pipeline development experience
What We're Looking For
Strong problem-solving skills: the ability to model a problem correctly and propose complete solutions
Attention to detail and ability to go deep into implementation
Ability to quickly learn and apply new technologies, especially in the GenAI space
Clear technical communication skills
Comfort working in a fast-paced consulting environment with varying project types.
This position is open to all candidates.
 
8668388
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a DevOps Architect to help shape the infrastructure strategy behind our Revenue AI platform. This role sits at the center of our engineering ecosystem, driving architectural direction, improving operational excellence, and enabling teams to scale with confidence. You'll work across engineering groups to identify systemic gaps, define scalable standards, and accelerate execution without becoming a delivery bottleneck.
You'll Own:
Infrastructure Strategy & Standards: Define and evolve our cloud and infrastructure architecture across Kubernetes, networking, observability, security, and data platforms. Establish clear standards and scalable best practices that enable teams to move faster with consistency and reliability.
Technical Debt & System Health Visibility: Continuously identify, prioritize, and drive resolution of cross-team technical debt, architectural gaps, and operational inefficiencies. Create organizational visibility around the most critical infrastructure challenges and opportunities.
Cross-Org Technical Leadership: Partner closely with engineering leaders and teams to influence architectural decisions, challenge assumptions, and ensure solutions are scalable, maintainable, and secure. Lead through expertise and influence, not direct ownership.
Developer Enablement & Engineering Velocity: Provide frameworks, tooling direction, and lightweight prototypes or POCs that empower teams to execute independently with higher quality and efficiency.
Critical Infrastructure Initiatives: Drive major cross-functional initiatives around reliability, scalability, security, observability, and cost optimization from identification through execution and measurable impact.
You'll Solve:
Scaling Complexity: How do we maintain simplicity, reliability, and operational clarity while supporting rapid growth and increasingly complex distributed systems?
Cross-Team Alignment: How do we create architectural consistency across independent engineering groups without slowing down innovation and execution?
Operational Excellence at Scale: How do we proactively surface and resolve systemic weaknesses before they become production issues?
Balancing Speed & Sustainability: How do we enable fast delivery today while protecting the long-term health and scalability of the platform?
AI Infrastructure Evolution: How do we build infrastructure that supports modern AI/ML workloads, GPUs, large-scale data pipelines, and future platform requirements?
You'll Impact:
Platform Reliability & Scalability: Your work will directly improve the resilience, scalability, and operational maturity of our infrastructure platform.
Engineering Efficiency: By creating better standards, tooling, and architectural guidance, you'll act as a force multiplier for engineering teams across the company.
Long-Term System Health: You'll help reduce operational friction, minimize technical debt, and ensure our infrastructure can support long-term business growth.
Execution Quality Across Teams: Your influence will elevate engineering quality, decision-making, and operational discipline throughout the organization.
Requirements:
A Deep Technical Expert: Someone with 8+ years of hands-on experience with AWS and cloud-native infrastructure at scale, including strong Kubernetes expertise and distributed systems knowledge.
An Infrastructure Architect: Someone with deep experience in Infrastructure as Code and GitOps methodologies using tools like Terraform, Crossplane, or Pulumi.
A Pragmatic Builder: A strong engineer with programming experience in Python or Go who can build tools, prototypes, and automation when needed.
A Systems Thinker: Someone who can identify patterns, uncover systemic issues, and drive improvements across complex technical environments.
An Influential Technical Leader: Someone with proven experience leading cross-team initiatives and driving alignment without direct authority.
This position is open to all candidates.
 
8665155
13/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
At UVeye, we're on a mission to redefine vehicle safety and reliability on a global scale. Founded in 2016, we have pioneered the world's first fully automated suite of vehicle inspection systems. At the heart of this innovation lies our advanced AI-centric technology, representing the pinnacle of computer vision, machine learning, and generative AI within the automotive sector. With over $380M in funding and strategic partnerships with industry giants such as Toyota, Amazon, General Motors, Volvo, and Hertz, our technology is utilized in manufacturing plants, dealerships, wholesale auctions, delivery fleets, seaports, and more. Our growing global team of over 200 employees is committed to creating a workplace that celebrates diversity, encourages teamwork, and strives for excellence.
We are looking for a driven, systems-minded Release Engineer to join our AI-Ops team. In this role, you will be the execution layer of our delivery pipeline—the technical gatekeeper who owns the safe, predictable deployment of software and AI models to global edge and cloud systems. You will balance high-speed deployment velocity with rock-solid operational stability. But you won't just be deploying code; you'll be acting as an internal project manager, driving our organizational roadmap by building Agentic AI tools and automating processes to scale our delivery capabilities continuously.
A day in the life and how you’ll make an impact:
* Act as the technical gatekeeper, validating and transitioning versions through strict release gates. Enforce rigorous governance and ensure strict "Definition of Done" criteria are met.
* Lead risk-mitigated rollouts across diverse global hardware environments.
* Monitor real-time deployment performance, Quality of Service (QoS), and algorithmic accuracy, making decisive, crisis-resilient calls to proceed, pause, or roll back to prevent regressions.
* Define and execute comprehensive test plans that verify cross-team dependencies. Validate that new versions meet detection accuracy requirements without degrading infrastructure.
* Triage complex production failures. Look beyond immediate issues to identify root causes using system metrics, logs, and container states, delivering actionable evidence to R&D.
* Build and integrate Agentic AI and LLM-based tools to accelerate log analysis, risk assessment, and deployment troubleshooting.
* Architect automated workflows to eliminate manual overhead and enhance system observability with robust monitors and dashboards.
Requirements:
* 2+ years of experience in Release Engineering, DevOps, QA, or a similar operations-centric role.
* Strong systems-level troubleshooting skills with the ability to analyze data, system metrics, logs, and container states.
* Demonstrated ability to maintain decisive control and make smart risk-management decisions during live, high-stakes deployments.
* Experience enforcing data integrity and process governance using Jira or similar issue-tracking tools.
Bonus if you have:
* Experience building or integrating AI, LLMs, or Agentic workflows into operational tooling.
* Familiarity with deploying software to both cloud environments and distributed edge hardware.
* Experience with performance benchmarking (throughput, bandwidth, algorithmic accuracy).
* Prior experience acting as a project manager for internal engineering initiatives or tools.
Why UVeye:
* Pioneer Advanced Solutions: Harness cutting-edge technologies in AI, machine learning, and computer vision to revolutionize vehicle inspections.
* Drive Global Impact: Your innovations will play a crucial role in enhancing automotive safety and reliability, impacting lives and businesses on an international scale.
* Career Growth Opportunities: Participate in a journey of rapid development, surrounded by groundbreaking advancements and strategic industry partnerships.
This position is open to all candidates.
 
8649166
25/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced Senior DevOps Engineer, passionate about software design, development, and deployment, to join our DevOps team in the Posture R&D Group. The role goes beyond traditional DevOps - it focuses on building the infrastructure and platforms that enable AI models and autonomous agents to run in production at scale, across both cloud and on-prem environments. The job involves writing production-grade, modern DevOps solutions that will be shipped to cloud and on-prem environments, while working with cutting-edge technologies and architectures that push the boundaries of AI-driven cybersecurity systems.
Responsibilities
Build the best solutions for our production platform, enabling high-scale, AI-driven systems and agents to operate reliably in production-scale environments
Everything-as-code approach (IaC): Run our infrastructure with a wide range of technologies, including Terraform and Kubernetes.
Build and maintain tools for automation, deployment, monitoring, and operations, with a strong focus on scalability, resilience, and observability of distributed systems.
Troubleshoot complex issues in our development, production, and test environments, including large-scale, distributed, and AI-integrated systems
Excellent communication and people skills.
Requirements:
8+ years of experience with DevOps technologies.
Extensive background leading the design, build, and evolution of end-to-end DevOps platforms, including infrastructure, tooling, and operational frameworks across the software lifecycle.
Deep expertise with one of the major cloud providers: AWS (preferred), GCP, Azure.
Extensive experience with modern deployment strategies (GitOps, blue/green, canary, Kubernetes-based deployments)
Strong experience designing and optimizing end-to-end CI/CD pipelines, enabling high velocity, reliable software delivery.
Experience bootstrapping projects, introducing new technologies, and building systems from scratch.
Background in working with AI components and understanding the challenges of bringing AI workloads into production.
Good coding capabilities (Python, Bash, etc.)
Experience mentoring engineers, leading cross-functional initiatives, and influencing technical direction.
Advantages:
Experience with on-prem environments and solutions.
Prior experience with endpoint security products (agents, sensors, collectors).
Tech Stack: AWS, Kubernetes, EKS, Jenkins, IaC, GitHub, Terraform, Python, Docker, ArgoCD, MongoDB, RabbitMQ, Redis, Go, Neo4J, AI, and more.
This position is open to all candidates.
 
8664565