דרושים » תוכנה » Site Reliability Engineer - AI Infrastructure

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
We're hiring for a new AI Engineering team in Tel Aviv, and you would be the first infrastructure hire. You will own the platform layer for AI agents the team builds: deployment architecture, observability, and production reliability.
The team's first two projects: an agent that automates internal governance processes (vendor reviews, security questionnaires, tool provisioning), and an agent that helps engineering teams prepare for architecture reviews. Both integrate with external APIs (LLM providers, OneTrust, ServiceNow), handle structured decision logic, and manage sensitive data flows with audit requirements.
Highlights
- Greenfield, but with real constraints. You're building on Azure/AWS with enterprise security requirements. The challenge is designing deployment and observability for LLM-backed services. You need to track output quality, cost per invocation, and model drift.
- Enterprise complexity, startup autonomy. Ownership and greenfield environment of a startup, with the integration challenges of a Fortune 200: connecting AI services to real enterprise systems.
- More than infrastructure. Your core is SRE, but you'll also write agent code in TypeScript and Python, work with data pipelines, and ship features alongside the team.
What the Work Looks Like
AI Service Infrastructure - Design and maintain deployment and release infrastructure for AI agents. The stack is cloud-native (Azure/AWS), with services that call LLM APIs, connect to enterprise systems, and handle structured data.
Observability & Reliability - Build monitoring and observability for AI services. Ensure model response quality doesn't degrade silently by tracking errors, logging cost spikes, and monitoring upstream API changes.
Security & Compliance - These agents handle sensitive workflows with elevated security requirements. You will work with our company's security team on standards, but you own how they're implemented in the infrastructure.
Developer Experience - Create tooling that makes it easy for the team to build, test, and deploy. The patterns you set become the team's defaults.
Requirements:
Required:
- 5+ years in SRE, platform engineering, DevOps, or infrastructure roles, with experience owning infrastructure end-to-end
- Strong experience with cloud platforms (Azure or AWS), containerization (Docker, Kubernetes), and CI/CD pipelines
- Infrastructure-as-code experience (Terraform, CDK, or CloudFormation)
- Monitoring and observability (Datadog, Splunk, CloudWatch, or similar)
- Infrastructure fundamentals: Linux, networking, security
- Incident management experience: on-call, production incidents, post-mortems
- Comfortable working independently with broad ownership and high accountability
- Strong written and verbal English for async collaboration with distributed teams
Preferred:
- Experience with AI/ML infrastructure: model serving, LLM API integration, vector databases, or evaluation pipelines
- Comfortable writing production code in TypeScript or Python, not just scripts
- Experience building self-service developer tooling or internal platforms
- Cost optimization for cloud and API-based workloads
- Security engineering experience, especially in enterprise or compliance-heavy environments.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8600507
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior AI Engineer to join our Cybersecurity team in Tel Aviv. You will design, build, and productionize LLM-powered applications, multi-agent systems, and MLOps infrastructure that power our company's next-generation cybersecurity capabilities. This is a high-impact, hands-on role at the intersection of applied AI, agentic systems, and network securit
What You'll Do
Design and develop LLM-powered security features and internal AI tools, including RAG pipelines, multi-agent workflows, and prompt-engineered systems tailored for cybersecurity use cases
Architect and operate multi-agent systems in production - including agent orchestration, inter-agent communication, task delegation, and failure handling at scale
Build robust agent monitoring and observability pipelines: tracing agent execution, detecting drift or failure, alerting on anomalous behavior, and maintaining agent reliability SLAs
Build and maintain scalable MLOps infrastructure: model serving, evaluation frameworks, experiment tracking, and CI/CD for ML models
Work with internal datasets (network telemetry, security logs, threat intelligence) to fine-tune and adapt foundation models for domain-specific detection and response tasks
Partner with the Cybersecurity, R&D, and infrastructure teams to define AI-driven security features and deliver them end-to-end
Establish best practices for model observability, safety, and responsible AI deployment within the organization
Stay current with the fast-moving LLM/GenAI and agentic AI ecosystem and evaluate emerging frameworks, models, and tools for adoption.
Requirements:
Must-Have
5-8 years of software engineering experience, with at least 2-3 years focused on AI/ML engineering
Hands-on experience building production-grade LLM applications - RAG, agents, tool use, or fine-tuning
Proven experience designing and running multi-agent systems in production: orchestration patterns, agent state management, retries, and graceful degradation
Experience monitoring and observing AI agents in production - execution tracing, latency tracking, failure detection, and alerting (e.g., LangSmith, Arize, custom observability stacks)
Proficiency with agentic frameworks: LangChain, LangGraph, and/or AWS Bedrock AgentCore
Strong Python skills and comfort working across the full AI application stack
Experience designing and operating MLOps pipelines (model versioning, deployment, monitoring)
Solid understanding of transformer-based models, embeddings, and vector databases (e.g., Pinecone, Weaviate, pgvector)
Comfortable working in cloud environments (AWS, GCP, or Azure) and containerized deployments (Docker, Kubernetes)
Strong problem-solving skills and ability to work autonomously in a fast-paced environment
Nice-to-Have
Background in cybersecurity - threat detection, SIEM, SOC automation, or security data analysis - a significant plus for this role
Familiarity with networking concepts (SDN, cloud-native networking, BGP, telemetry)
Experience with model evaluation and benchmarking (LLM-as-judge, RAGAS, or custom eval harnesses)
Exposure to MCP (Model Context Protocol) for tool-augmented agentic workflows
Prior experience in enterprise SaaS, networking, or telecom domains
Publications, open-source contributions, or projects in the LLM/GenAI or agentic AI space
Our Stack
Python PyTorch OpenAI / Anthropic APIs LangChain LangGraph AWS Bedrock AgentCore LangSmith Kubernetes Kafka Elasticsearch AWS PostgreSQL GitHub Jira Confluence.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8595648
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
At our company, we believe people are capable of more than a single job description. Youre not hired just to fill a position- youre empowered to shape it, grow it, and make it your own.
We call this being Positionless.
And Positionless isnt just our culture. Its our product.
we are the creator of Positionless Marketing, an AI-powered platform that gives every marketer the power to analyze, create, launch, and optimize independently. The result is faster execution, deeper personalization, and 88% greater campaign efficiency.
Recognized as a Visionary in Gartners Magic Quadrant, we partner with leading brands like Sephora, Staples, and Entain. Today, more than 550 our company's across NYC, London, Tel Aviv, Scotland, Brazil, Estonia, and beyond are building the future of marketing together, in an environment that actively encourages ownership and growth, with two out of every three managers promoted from within.
If youre looking for a place where you can do more, be more, come grow with us.
Are you passionate about ensuring system reliability, scalability, and performance? Do you thrive in a dynamic environment where automation and operational excellence are key?
we are looking for a Site Reliability Engineer (SRE) to join our team and play a crucial role in designing, implementing, and maintaining our cloud-based infrastructure. In this role, you will collaborate across teams to drive automation, improve system resilience, and optimize performance while fostering a culture of reliability.
Responsibilities:
System Reliability- Ensure high availability and performance of services through effective monitoring, incident management, and root cause analysis.
Automation & Tooling- Develop and maintain automation for infrastructure provisioning, configuration management, and application deployment.
Performance Optimization- Analyze and enhance system performance, including load balancing, caching, and database tuning. Conduct regular capacity planning.
Incident Response & Troubleshooting- Lead incident response efforts, participate in on-call rotations, and troubleshoot complex infrastructure issues.
Security & Compliance- Collaborate with security teams to implement best practices and ensure compliance with relevant standards (ISO 27001, SOC 2, etc.).
Collaboration & Mentorship- Work closely with developers, DevOps, Support, and product teams to enhance application reliability and implement SRE best practices.
Requirements:
4+ years in Site Reliability Engineering, DevOps, or related roles.
Proven experience managing large-scale, cloud-based infrastructure in GCP, AWS, or Azure.
Expertise in container orchestration (Kubernetes, Docker) and microservices architecture.
Strong proficiency in scripting and programming languages (Python, Go, Bash, etc.).
Experience with CI/CD pipelines, infrastructure as code (Terraform, CloudFormation), and configuration management (Ansible, Puppet, Chef).
Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK Stack).
Experience using AI tools to enhance SRE processes, such as intelligent monitoring, incident prediction, and automation of incident response.
Deep understanding of networking concepts, DNS, load balancing, and distributed systems.
Strong problem-solving skills, excellent communication, and a proactive mindset.
Advantages:
Certifications- AWS Certified Solutions Architect, GCP Professional Cloud Architect, or Kubernetes certifications (CKA, CKAD).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8594736
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
7 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a highly motivated AI Full stack Engineer with GenAI background in production to join our team and help us shape the future of the Agentic engineering platform (AEP).
What youll do:
At our company, were a platform by developers, for developers. Your role will encompass end-to-end design, implementation, and daily feature delivery across both backend and frontend systems.
You will:
Implement high scale AI-powered features deeply integrated into our platform
Design and build production-grade backend systems serving a wide and growing user base
Build agent-based workflows using frameworks such as AI SDK
Integrate LLMs into real production systems with attention to reliability, latency, observability, and cost
Work across frontend (React + TypeScript) and backend (NodeJS, Python, Go) to deliver complete AI-driven user experiences
Own features end-to-end: design, implementation, testing, deployment, and monitoring
Help define standards and best practices around AI reliability and evaluation
Contribute to technical planning, mentor teammates, and help recruit top talent
Develop retrieval-augmented generation (RAG) pipelines over structured and unstructured data
Our stack includes React + TypeScript on the frontend, and NodeJS + TypeScript, Python, and Golang on the backend, and Vercels AI-SDK + AWS Bedrock + Azure OpenAI for GenAI. We use Kafka + Kafka Connect, Redis, PostgreSQL, MongoDB and other modern infrastructure components.
Requirements:
5+ years of professional software engineering experience
Experience in NodeJS + TypeScript
Strong experience designing and developing complex systems from design to production
Experience dealing with scale and performance-related challenges
Experience building or integrating AI/LLM-powered applications in production or meaningful production systems
Experience building agent workflows and tool integrations
Ability to think critically about model limitations, hallucinations, latency, and cost tradeoffs
A collaborative team player with a can-do approach
Strong written and verbal communication skills in English and Hebrew
Advantages:
Experience with AWS or other cloud platforms
Experience with vercels AI SDK
Experience with embeddings, vector databases, or semantic search
Expierence with AWS Bedrock / Azure Open-AI
Experience building tool-using agents or workflow engines
Experience with AI evaluation, observability, and monitoring
Experience in DevOps-related tools
Experience with PostgreSQL, Kafka, DocumentDB, OpenSearch, Redis.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8597066
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/03/2026
Job Type: Full Time
We're looking for a Senior AI/MLOps Engineer to join a group that specializes in Security and Networking, and specifically ML, AI and agent development. As a Senior AI/MLOps Engineer, youll build and maintain the infrastructure, tools and processes necessary to support the AI lifecycle in a production environment. You will collaborate closely with data scientists, software engineers, security architects and DevOps teams to ensure smooth deployment, modeling and optimization of AI models. This role involves creative problem solving alongside engineering teams, and is pivotal for the continued success of AI networking security.

What youll be doing:

Developing, improving and optimizing scalable infrastructure for handling and deploying security and networking AI models and agents in production, ensuring high availability, scalability, reproducibility, and performance.

Optimizing AI models and agents for performance, scalability, and resource utilization, considering factors such as latency, efficiency, and cost.

Monitoring and deploying agentic systems, LLMs, and ML models in production.

Designing and implementing frameworks/pipelines for AI training, inference, and experimentation.

Collaborating closely with data scientists, security architects and software engineers to operationalize and deploy AI models and agents, including packaging and integration with existing systems. Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

Collaborating with DevOps teams to integrate pipelines and workflows into the CI/CD process, ensuring flawless deployments and rollbacks.

Building and maintaining monitoring and alerting systems to proactively identify and resolve issues relating to quality, performance and infrastructure.

Implementing access controls, authentication mechanisms, and encryption standards for AI models and data.

Documenting guidelines, and standard operating procedures for MLOps/AI processes and sharing knowledge with the wider team.

Develop proof-of-concepts for new features.
Requirements:
What we need to see:

BSc/MSc in CS/CE or related field (or equivalent experience).

Strong background in AI with experience deploying and monitoring AI/ML models, LLMs and agents to production systems at scale, including distributed and multi-node environments - at least 5 years of experience.

Proficiency in programming languages such as Python, Java, or Scala, along with experience in using ML/AI frameworks and libraries (e.g. TensorFlow, PyTorch).

Proficiency in microservices architecture, container orchestration, cloud platforms, and scalable infrastructure for training and inference workloads.

Knowledge of inference optimization techniques.

Understanding of build infrastructure and CI/CD tools and practices (e.g. GitLab, GitHub Actions, Jenkins).

You are detail-oriented and care deeply about robust, well tested, high-performance code in production environments.

You are proactive, take full ownership of your deliverables, have a can-do approach, and excellent communication and collaboration skills, able to work effectively in multifunctional teams.

Ways to stand out from the crowd:

Knowledge of network protocols and Linux internals.

Security and networking background, with knowledge of security protocols, network architectures, firewalls, intrusion detection systems, and other relevant security and networking concepts.

Experience deploying and optimizing generative models and agents.

Knowledge of network security principles and practices.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8586605
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Were looking for a highly motivated AI Developer to help design, build, and deploy intelligent agentic systems across our product ecosystem. In this role, you'll work at the intersection of machine learning, backend systems, and modern frontend technologies to deliver AI-first features that feel magical to users.
This is a hands-on, cross-functional role ideal for engineers who love building full-fledged features-from data pipelines and LLM orchestration to intuitive UI experiences-with a strong product mindset.
Responsibilities:
AI Agent Design & Integration
Design and implement autonomous or semi-autonomous agents using LLMs (e.g., OpenAI, Anthropic, open-source models).
Work with prompt engineering, RAG pipelines, and tool/plugin integrations to enable agents to interact with internal and external systems.
Build scalable agent runtimes and orchestration layers (e.g., LangChain, Semantic Kernel, ReAct-based agents).
Fullstack Product Development
Own full-stack features end-to-end: from backend APIs and data models to React-based frontend interfaces.
Integrate AI/agent capabilities into customer-facing products with clean UX and measurable performance.
Collaborate closely with design, product, and data teams to bring ideas from concept to production.
Systems & Infrastructure
Build and maintain backend services and pipelines that support AI agents, including vector search, embeddings, function calling, and observability.
Optimize inference flows for performance and cost, potentially using streaming, caching, or local model inference.
Ensure systems are secure, reliable, and compliant with InfoSec standards.
Experimentation & Continuous Improvement
Rapidly prototype and iterate on new AI capabilities and user experiences.
Analyze performance and usage metrics to drive product and model improvements.
Stay up to date with the evolving AI toolchain and emerging agent architectures.
Requirements:
8+ years of fullstack development experience with strong skills in TypeScript/JavaScript, React, and Python (or Node/Go for backend).
Solid understanding of LLM APIs, agent frameworks (e.g., LangChain, AutoGPT, CrewAI), or custom AI pipelines- Advantage
Experience with modern cloud infrastructure (e.g., AWS, GCP, Docker, CI/CD).
Familiarity with vector databases (e.g., Pinecone, Weaviate, FAISS) and retrieval-augmented generation (RAG)- Advantage
Product-oriented mindset: you care deeply about building things that work well for users.
Bonus: experience with observability, feedback loops for AI agents, or embedded AI evaluation techniques.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8600287
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
6 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
The Falcon Cloud Security team is looking for a hands-on Engineering Manager / Team Lead to lead the development of Agentic Workflows - a transformative initiative aimed at automating complex security operations using AI-native agents. You will lead a team of talented engineers while remaining deeply technical, helping architect and build autonomous systems that don't just alert but actively reason, investigate, and remediate security risks across multi-cloud environments.
As a player-coach, you will help shape the "brain" of our cloud security platform, guiding both the people and the technology that leverages large-scale data and AI-driven logic to help customers discover misconfigurations, prioritize risks, and automate defensive actions at scale.

What You'll Do:

Lead & Grow a Team: Manage, mentor, and develop a team of backend engineers, fostering a high-trust, high-performance culture. Conduct regular 1:1s, support career growth, and drive hiring to scale the team.

Stay Hands-On: Remain an active technical contributor - designing, reviewing, and writing production-quality code alongside your team. Lead by example and maintain a strong engineering presence.

Design & Architect: Drive backend engineering efforts to build autonomous agentic frameworks, guiding the team from rapid prototypes to large-scale production applications.

Develop Core Logic: Contribute to and oversee the development of decision-making engines and workflows that allow security agents to interact with cloud APIs (AWS, Azure, GCP) and internal data streams.

Data Integration: Guide the development of high-performance data integrations and streaming services (Kafka) to feed real-time security data into agentic models for continuous reasoning.

Scale Systems: Architect and oversee distributed systems capable of processing billions of security events to provide actionable posture intelligence and automated remediation.

Drive Cross-Functional Collaboration: Partner with Product, Design, and peer engineering teams in a "startup-like" environment to define and deliver new platform capabilities with speed and quality.

Raise the Bar: Champion engineering excellence, new technologies, and best practices across the team and broader engineering organization.
Requirements:
Experience: 8+ years of backend engineering experience, with at least 2 years in an engineering leadership role (Tech Lead, Staff Engineer, or Engineering Manager). Strong proficiency in Go and Python.

People Leadership: Demonstrated ability to hire, mentor, and develop engineers at varying levels. Comfortable balancing technical contribution with team management responsibilities.

AI/LLM Experience: Prior experience building workflows powered by LLMs, RAG, or autonomous agents. Strong understanding of agent frameworks and key components including model integration, tool calling patterns, and Model Context Protocol (MCP).

Cloud Expertise: Deep knowledge of at least two major cloud providers (AWS, Azure, or GCP).

Systems Engineering: Strong understanding of distributed systems, scalability, concurrency, and resilient architecture.

Data Proficiency: Solid experience with data modeling, RDBMS (SQL), and distributed caching solutions like Redis.

Education: BS/MS in Computer Science or equivalent professional experience in data structures and algorithms.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8598636
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
7 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
The DevOps Engineer builds, automates, and operates cloud‑native infrastructure across AWS and Red Hat OpenShift, enabling scalable, secure, and reliable application delivery. This role combines hands‑on platform engineering, CI/CD automation, container orchestration, and the integration of AI‑powered tools for observability, anomaly detection, and operational efficiency. The engineer collaborates closely with development, security, and SRE teams to streamline deployments and improve system resilience.
Core Responsibilities
Cloud & Platform Engineering
Design, deploy, and maintain cloud‑native infrastructure on AWS (EC2, VPC, IAM, EKS, S3, RDS, Lambda).
Operate and optimize Red Hat OpenShift clusters, including cluster upgrades, operator management, and workload orchestration.
Implement Infrastructure‑as‑Code using Terraform, CloudFormation, or Ansible.
Build secure, scalable network architectures including VPC design, load balancing, service mesh, and ingress/egress controls.
CI/CD & Automation
Develop and maintain CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, or Argo Workflows.
Automate build, test, and deployment workflows for microservices and containerized applications.
Implement GitOps practices using Argo CD or Flux.
Create reusable automation modules and scripts in Python, Bash, or Go.
Containers & Kubernetes
Manage containerized workloads using Docker, Kubernetes, and OpenShift Operators.
Configure namespaces, RBAC, secrets, ConfigMaps, and resource quotas.
Troubleshoot cluster performance, networking, and scheduling issues.
Support service mesh technologies (Istio, Linkerd) when applicable.
AI‑Driven Operations
Integrate AI/ML‑based tools for monitoring, anomaly detection, predictive scaling, and automated remediation.
Work with data and platform teams to operationalize AI/ML pipelines on Kubernetes or OpenShift.
Evaluate emerging AI‑Ops platforms and contribute to automation strategies.
Observability & Reliability
Implement monitoring, logging, and tracing using Prometheus, Grafana, ELK, Loki, CloudWatch, or Datadog.
Build alerting, dashboards, and SLO‑based reliability metrics.
Participate in on‑call rotations and incident response, driving root‑cause analysis and long‑term fixes.
Security & Compliance
Apply DevSecOps practices including image scanning, secrets management, and policy enforcement.
Work with security teams to implement IAM best practices, encryption, and compliance controls.
Integrate tools such as Vault, OPA/Gatekeeper, or Kyverno.
Requirements:
2-5 years of experience in DevOps, cloud engineering, or platform operations.
Strong hands‑on experience with AWS services and cloud architecture fundamentals.
Practical experience with Kubernetes and Red Hat OpenShift.
Proficiency with Terraform, Ansible, or similar IaC tools.
Experience building CI/CD pipelines and automating deployments.
Solid Linux administration and networking fundamentals.
Scripting skills in Python, Bash, or Go.
Understanding of container security, cloud security, and DevSecOps practices.
Preferred Qualifications
Certifications: AWS Solutions Architect, CKA/CKAD, Red Hat OpenShift, Terraform Associate.
Experience with AI‑Ops platforms or ML pipeline orchestration.
Familiarity with service mesh, API gateways, or event‑driven architectures.
Experience with multi‑cluster or hybrid cloud environments.
Background in SRE practices (SLOs, error budgets, chaos engineering).
What Success Looks Like
Reliable, automated, and secure cloud‑native infrastructure supporting rapid development cycles.
Stable and observable Kubernetes/OpenShift environments with clear operational metrics.
Reduced manual work through automation and AI‑driven insights.
Strong collaboration with engineering teams and continuous improvement of DevOps practices.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8597223
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
05/03/2026
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a Backend Team Lead to spearhead the development of ludeo.ai, our GenAI-powered product that enables users to generate interactive (gaming experiences) directly from prompts or video content. This is a high-impact leadership role at the intersection of backend architecture, multimodal AI, and real-time systems. You will architect and lead the AI engine that transforms unstructured inputs (text/video) into structured, interactive gaming playable moments.

What Youll Do

Lead & Mentor: Build and manage a high-performing backend/AI engineering team, drive architectural decisions, and foster rapid innovation while maintaining production-grade reliability.
Design AI-Native Systems: Architect scalable microservices powering complex AI workflows. Design and implement Retrieval-Augmented Generation (RAG) pipelines, embedding strategies, and vector database infrastructure (e.g., Pinecone, Weaviate, Milvus, PGVector). Optimize retrieval, prompt orchestration, latency, and cost.
Agentic Workflows: Design multi-agent systems using planner/executor/tool-calling patterns. Implement stateful, multi-step AI workflows with frameworks such as LangChain, CrewAI, AutoGen, or similar. Build evaluation, observability, and safety mechanisms for LLM systems.
Multimodal AI: Integrate multimodal models (vision + text) to understand video and translate it into structured form.
Scale & Infrastructure: Ensure robustness, security, and high availability on AWS/Kubernetes. Design distributed systems that handle real-time data and AI workloads efficiently.
Collaborate: Work closely with Product and Design to translate GenAI capabilities into stable, scalable production features.
Requirements:
Expreince leading engineering teams in fast-paced environments with strong ownership and architectural responsibility.
Backend Expertise: 6+ years of backend development experience with deep expertise in Node.js and microservices. Strong distributed systems and API design experience.
GenAI Systems Experience: Hands-on experience building production LLM systems. Proven experience with RAG architectures, vector databases, embedding pipelines, and prompt orchestration. Experience designing multi-step or agentic AI workflows.
Infrastructure: Strong experience with AWS and Kubernetes in production environments. Deep knowledge of SQL & NoSQL systems.
Communication: Ability to translate complex AI systems into clear product and business decisions.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8569784
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
6 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
The Falcon Cloud Security team is looking for a hands-on Engineering Manager / Team Lead to lead the development of Agentic Workflows - a transformative initiative aimed at automating complex security operations using AI-native agents. You will lead a team of talented engineers while remaining deeply technical, helping architect and build autonomous systems that don't just alert but actively reason, investigate, and remediate security risks across multi-cloud environments.
As a player-coach, you will help shape the "brain" of our cloud security platform, guiding both the people and the technology that leverages large-scale data and AI-driven logic to help customers discover misconfigurations, prioritize risks, and automate defensive actions at scale.

What You'll Do:

Lead & Grow a Team: Manage, mentor, and develop a team of backend engineers, fostering a high-trust, high-performance culture. Conduct regular 1:1s, support career growth, and drive hiring to scale the team.

Stay Hands-On: Remain an active technical contributor - designing, reviewing, and writing production-quality code alongside your team. Lead by example and maintain a strong engineering presence.

Design & Architect: Drive backend engineering efforts to build autonomous agentic frameworks, guiding the team from rapid prototypes to large-scale production applications.

Develop Core Logic: Contribute to and oversee the development of decision-making engines and workflows that allow security agents to interact with cloud APIs (AWS, Azure, GCP) and internal data streams.

Data Integration: Guide the development of high-performance data integrations and streaming services (Kafka) to feed real-time security data into agentic models for continuous reasoning.

Scale Systems: Architect and oversee distributed systems capable of processing billions of security events to provide actionable posture intelligence and automated remediation.

Drive Cross-Functional Collaboration: Partner with Product, Design, and peer engineering teams in a "startup-like" environment to define and deliver new platform capabilities with speed and quality.

Raise the Bar: Champion engineering excellence, new technologies, and best practices across the team and broader engineering organization.
Requirements:
Experience: 8+ years of backend engineering experience, with at least 2 years in an engineering leadership role (Tech Lead, Staff Engineer, or Engineering Manager). Strong proficiency in Go and Python.

People Leadership: Demonstrated ability to hire, mentor, and develop engineers at varying levels. Comfortable balancing technical contribution with team management responsibilities.

AI/LLM Experience: Prior experience building workflows powered by LLMs, RAG, or autonomous agents. Strong understanding of agent frameworks and key components including model integration, tool calling patterns, and Model Context Protocol (MCP).

Cloud Expertise: Deep knowledge of at least two major cloud providers (AWS, Azure, or GCP).

Systems Engineering: Strong understanding of distributed systems, scalability, concurrency, and resilient architecture.

Data Proficiency: Solid experience with data modeling, RDBMS (SQL), and distributed caching solutions like Redis.

Education: BS/MS in Computer Science or equivalent professional experience in data structures and algorithms.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8598630
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for an AI and Automation Engineer to join our Automation team and help build the internal tools, AI agents and automation systems that power how we operate.
This role combines hands-on engineering with impactful product development. The work is split between writing code (TypeScript, React, Python) and designing and implementing automation workflows using low-code platforms.
You'll take part in building intuitive internal web applications that teams enjoy using, developing AI-driven solutions that streamline repetitive processes, creating insightful dashboards to support decision-making, and integrating systems that help the business run smoothly and efficiently.
You'll work alongside our Tech Lead and a US-based IT Automation Engineer, contributing across the full spectrum from designing React-based internal tools to building RAG pipelines to orchestrating multi-step business workflows.
What You'll Do
Design and build internal web applications using TypeScript and React that serve teams across the organization.
Create intuitive interfaces that make AI capabilities and automation outputs accessible to non-technical users.
Build and maintain RAG pipelines - including document processing, embedding, vector storage, retrieval, and evaluation.
Work with LLM APIs (Claude API, OpenAI API, AWS Bedrock, Vertex) and implement prompt engineering patterns (tool/function calling, structured outputs, few-shot) with proBuild connectors, APIs, and data flows that keep systems in sync and processes running smoothly.
Work with data from across the business to support decision-making.
Design and maintain automation workflows using low-code/no-code platforms (Workato).
Integrate AI solutions into core business tools - Slack, Jira, Salesforce etc.
Requirements:
2+ years of coding experience, with strong proficiency in TypeScript/JavaScript and Python.
Experience building web applications with React (or similar modern frontend frameworks).
Practical experience with LLMs and AI application patterns - RAG, tool use, function calling, prompt engineering.
Solid understanding of APIs, webhooks, authentication methods, and system integrations.
Familiarity with AWS (or GCP) cloud services, including AWS Bedrock (or Vertex).
Comfortable with databases (SQL / NoSQL) for data shaping, analysis, and powering application logic.
Experience with Git and CI/CD pipelines, particularly GitHub Actions.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8600476
סגור
שירות זה פתוח ללקוחות VIP בלבד