דרושים » תוכנה » AI Evaluation & Reliability Engineer

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Merkaz
Job Type: Full Time
We are looking for a Reliability Engineer! We are looking for a Reliability Engineer who will take part in building the next-generation agentic analytics platform, the first Real-Time database optimized for AI agents at scale. Were looking for a Senior AI Evaluation & Reliability Engineer to define and build how AI agents are measured, validated, monitored, and improved in production. This role sits at the intersection of LLM systems, evaluation research, and production-grade engineering. You will design evaluation methodologies, build LLM-as-a-judge systems, and develop agent-based testing frameworks to ensure correctness, robustness, and reliability of complex multi-agent workflows operating on Real-Time data.
What Youll Do:
* Design and implement evaluation frameworks for AI agents and multi-agent systems.
* Build LLM-as-a-judge pipelines to assess correctness, reasoning quality, and output quality.
* Develop agent-based evaluation systems (agents evaluating agents) for scalable testing.
* Define metrics, benchmarks, scorecards, and methodologies for agent reliability and performance.
* Build data -driven evaluation pipelines using synthetic and real-world datasets.
* Identify and analyze failure modes, edge cases, and non-deterministic behaviors.
* Improve agent robustness, consistency, and reliability in production environments.
* Work with tools such as Google ADK, Opik, and related evaluation frameworks.
* Collaborate closely with AI, platform, and database teams to shape agent- data interaction quality.
Requirements:
Must have:
* 4-8+ years of experience in software engineering, AI systems, or evaluation/ QA engineering.
* Strong programming skills in Python.
* Hands-on experience working with LLMs in production environments.
* Experience building evaluation systems, automation frameworks, or testing infrastructure.
* Strong understanding of prompt engineering, tool use, and agent behavior.
* Ability to think in terms of metrics, correctness, and system reliability. Nice to have:
* Experience with LLM evaluation frameworks (Opik, LangSmith, etc.).
* Experience with Google ADK / agent frameworks.
* Experience implementing LLM-as-a-judge or ranking systems.
* Background in data systems, analytics, or Real-Time pipelines.
* Experience with multi-agent systems.
* Familiarity with statistical evaluation methods or experimentation (A/B testing, scoring systems).
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8612583
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior AI Engineer to join our Cybersecurity team in Tel Aviv. You will design, build, and productionize LLM-powered applications, multi-agent systems, and MLOps infrastructure that power our company's next-generation cybersecurity capabilities. This is a high-impact, hands-on role at the intersection of applied AI, agentic systems, and network securit
What You'll Do
Design and develop LLM-powered security features and internal AI tools, including RAG pipelines, multi-agent workflows, and prompt-engineered systems tailored for cybersecurity use cases
Architect and operate multi-agent systems in production - including agent orchestration, inter-agent communication, task delegation, and failure handling at scale
Build robust agent monitoring and observability pipelines: tracing agent execution, detecting drift or failure, alerting on anomalous behavior, and maintaining agent reliability SLAs
Build and maintain scalable MLOps infrastructure: model serving, evaluation frameworks, experiment tracking, and CI/CD for ML models
Work with internal datasets (network telemetry, security logs, threat intelligence) to fine-tune and adapt foundation models for domain-specific detection and response tasks
Partner with the Cybersecurity, R&D, and infrastructure teams to define AI-driven security features and deliver them end-to-end
Establish best practices for model observability, safety, and responsible AI deployment within the organization
Stay current with the fast-moving LLM/GenAI and agentic AI ecosystem and evaluate emerging frameworks, models, and tools for adoption.
Requirements:
Must-Have
5-8 years of software engineering experience, with at least 2-3 years focused on AI/ML engineering
Hands-on experience building production-grade LLM applications - RAG, agents, tool use, or fine-tuning
Proven experience designing and running multi-agent systems in production: orchestration patterns, agent state management, retries, and graceful degradation
Experience monitoring and observing AI agents in production - execution tracing, latency tracking, failure detection, and alerting (e.g., LangSmith, Arize, custom observability stacks)
Proficiency with agentic frameworks: LangChain, LangGraph, and/or AWS Bedrock AgentCore
Strong Python skills and comfort working across the full AI application stack
Experience designing and operating MLOps pipelines (model versioning, deployment, monitoring)
Solid understanding of transformer-based models, embeddings, and vector databases (e.g., Pinecone, Weaviate, pgvector)
Comfortable working in cloud environments (AWS, GCP, or Azure) and containerized deployments (Docker, Kubernetes)
Strong problem-solving skills and ability to work autonomously in a fast-paced environment
Nice-to-Have
Background in cybersecurity - threat detection, SIEM, SOC automation, or security data analysis - a significant plus for this role
Familiarity with networking concepts (SDN, cloud-native networking, BGP, telemetry)
Experience with model evaluation and benchmarking (LLM-as-judge, RAGAS, or custom eval harnesses)
Exposure to MCP (Model Context Protocol) for tool-augmented agentic workflows
Prior experience in enterprise SaaS, networking, or telecom domains
Publications, open-source contributions, or projects in the LLM/GenAI or agentic AI space
Our Stack
Python PyTorch OpenAI / Anthropic APIs LangChain LangGraph AWS Bedrock AgentCore LangSmith Kubernetes Kafka Elasticsearch AWS PostgreSQL GitHub Jira Confluence.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8595648
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
Were looking for an Engineering Team Lead, who will be responsible for the foundational infrastructure framework used by all engineering teams to build, deploy, and operate AI agents safely in production. We are building the "operating system" for AI , covering agent sessions, memory management, tool orchestration, durable execution, and multi-tenant isolation. You will lead a high-impact team of 3-4 engineers to create the runtime and platform that defines the future of autonomous enterprise intelligence.
Youll Own:
Agentic Framework Architecture: Designing and building internal agentic framework, leveraging and integrating industry-standard tools such as LangChain, LangSmith, ADK, and similar ecosystems.
Evaluation and Quality Systems: Building evaluation frameworks and workflows for AI agents, including offline and online evaluations, quality metrics, regression detection, and experimentation infrastructure.
Team Leadership & Mentorship: Leading a squad of 3-4 senior engineers, fostering a culture of technical excellence, and managing end-to-end delivery in a fast-paced environment. You will spend approximately 50% of your time hands-on, architecting core systems and reviewing code, and 50% leading the team, mentoring engineers, and aligning with cross-functional stakeholders.
Observability, Monitoring, and Guardrails: Providing the organization with robust observability capabilities for AI agents, including tracing, logging, monitoring, cost tracking, and safety guardrails to ensure reliable and responsible usage.
Developer Enablement Platforms: Creating APIs, SDKs, and abstractions that enable product teams to easily build, test, and operate agents while adhering to platform standards.
Cross-Language Integrations: Designing integrations and tooling across Python and Java to enable seamless adoption of the AI framework within broader backend ecosystem.
Youll Solve:
Agent Lifecycle and Orchestration Complexity: Managing agent execution, tool usage, memory, workflows, and failure modes in production-grade systems.
AI System Reliability at Scale: Ensuring agents remain observable, debuggable, and safe as usage scales across teams and products.
Evaluation and Drift Challenges: Detecting quality regressions, model behavior changes, and unintended agent behaviors through robust evaluation and monitoring systems.
Platform Adoption Friction: Balancing flexibility with guardrails so teams can innovate quickly without compromising reliability, security, or cost controls.
Requirements:
8+ years of backend engineering experience, with strong system design and platform-building expertise. Tech leadership or team leading experience is an advantage.
Strong analytical and problem-solving skills, with the ability to debug and resolve complex technical issues efficiently.
Hands-on experience with agentic systems and frameworks such as LangChain, LangSmith, ADK, or equivalent agent orchestration platforms.
Strong understanding of AI evaluation methodologies, including agent evaluations, prompt evaluation, regression testing, and quality monitoring.
High proficiency in Python for building production-grade AI frameworks and services.
Familiarity with Java and experience integrating backend platforms or tooling into Java-based systems.
Experience building observability, monitoring, or platform tooling for distributed systems.
Strong analytical skills and the ability to reason about complex, evolving AI-driven systems.
Experience with cloud platforms and scalable microservices architectures.
Excellent communication skills and a strong platform mindset, with experience enabling multiple teams.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8606378
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
09/04/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior AI Engineer - Applied AI Engineering Group
The Dream Job
It starts with you - an engineer driven to build the agentic AI platform that turns LLMs into reliable, production-grade capabilities. You care about clean APIs, well-defined service boundaries, and systems that teams can build on with confidence. Dream is AI-first across the board - every team builds and operates agents. You'll architect and ship the platform that makes this possible: agent orchestration frameworks, LLM gateways, evaluation pipelines, tool-calling infrastructure, and retrieval systems. Without this platform, agents don't ship - you own the layer that turns AI research into Sovereign AI products, deployed across cloud and on-prem environments.
If you want to make a meaningful impact, join our mission and build the agentic AI platform that drives Sovereign AI products - this role is for you.
The Dream-Maker Responsibilities
Design and build agentic systems - single and multi-agent workflows with planning, memory, context engineering, and tool use - for both internal automation and product-facing autonomous capabilities operating over long time horizons.
Build and operate the AI platform layer - LLM gateways, prompt management, structured output handling, tool-calling infrastructure, and cost/latency optimization - deployed on Kubernetes, consumed by every team for their agentic work.
Own the agent framework layer - orchestration primitives, execution environments, state management, and sandboxed tool execution - giving every team the building blocks to create and operate their own agents.
Build evaluation infrastructure that gives teams confidence in agent behavior - automated LLM and agent evals for quality, correctness, safety, latency, cost, and regressions, including human-in-the-loop oversight for mission-critical workflows.
Productionize and harden backend services (APIs, gRPC, async workers) that integrate LLMs - with proper error handling, retries, circuit breakers, and high-availability patterns.
Own RAG pipelines and retrieval systems - indexing, chunking, embedding, vector database management, filtering, and relevance tuning for production retrieval.
Optimize performance and cost across the AI stack - model routing, caching, batching, and inference cost management.
Ship shared tooling - libraries, SDKs, agent templates, and documentation - while working closely with ML Platform, Data Platform, DevOps, and other teams across the Applied AI Engineering group. Own architecture, documentation, and operations end-to-end.
דרישות:
5+ years in backend or distributed systems engineering, with 2+ years focused on production systems that integrate AI/ML models or LLMs.
Engineering craft - Strong Python, Go, or Java, system architecture, API design, testing, and secure coding practices.
Agentic systems - Experience designing and building agent orchestration, tool-use systems, and autonomous workflows; familiarity with frameworks like LangGraph or similar, or having built equivalent from scratch
Backend engineering - Experience building production APIs and services (FastAPI or similar); async programming, service architecture, high-availability, and reliability patterns (retries, circuit breakers, backpressure)
LLM integration - Hands-on experience integrating LLMs via SDKs and APIs; context engineering, structured outputs, tool calling, and model routing
RAG & retrieval - Experience with embedding pipelines, vector databases (e.g., Milvus, Qdrant, Pinecone), chunking strategies, and relevance tuning
Evaluation & observability - Experience designing LLM and agent evals, monitoring AI system quality, and building observability for non-deterministic systems
Nice to Have:
Platform & infra - Kubernetes, AWS, Terraform or similar IaC, CI/CD, container orchestration, deploying and operating production services
Experience with MCP or similar tool-use protocols for agent-to-service communication
Hands-on ML experience - המשרה מיועדת לנשים ולגברים כאחד.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8603620
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
09/04/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
Required AI Engineering Team Lead - Applied AI Engineering Group
Tel Aviv Full-time
The Dream Job
It starts with you - a technical leader driven to build both the agentic AI platform and the engineering team behind it. You care about backend quality, platform reliability, and growing engineers through real ownership. We are AI-first across the board - every team builds and operates agents. You'll set the technical direction for the platform that makes this possible: agent orchestration frameworks, LLM gateways, evaluation infrastructure, tool-calling systems, and retrieval pipelines. Without this platform, agents don't ship - you own the layer that turns AI research into Sovereign AI products, deployed across cloud and on-prem environments. You stay close enough to the codebase to debug production incidents, unblock your engineers, and make sound architecture calls.
If you want to make a meaningful impact, join our mission and lead the team that builds the agentic AI platform driving Sovereign AI products - this role is for you.
The Dream-Maker Responsibilities
Architect and evolve the AI platform - agent orchestration, LLM gateways, context engineering pipelines, evaluation infrastructure, tool-calling systems, and retrieval pipelines - through RFCs, prototypes, and design reviews.
Lead and grow a small team of AI Engineers building the agent framework, production backend services, and AI platform infrastructure - hire, mentor, pair on hard problems, and raise the bar through hands-on code and design reviews.
Contribute to critical systems, debug production incidents, and maintain enough codebase context to make sound technical calls.
Own reliability across AI and agent services - set and enforce SLAs, build observability for non-deterministic systems, and harden tool execution environments for cost and security.
Set the standard for AI engineering practices - agent testing strategies, evaluation frameworks with human-in-the-loop oversight, retrieval quality benchmarks, and CI/CD for AI systems.
Work closely with ML Platform, Data Platform, DevOps, Data Science, and Product teams across the Applied AI Engineering group - ensure the AI platform evolves to serve teams building agentic workflows across the organization.
Measure and improve developer experience - deploy friction, onboarding time, CI turnaround - as seriously as system performance.
Requirements:
6+ years in backend software engineering, with 4+ years focused on production systems that integrate AI/ML models or LLMs.
2+ years leading an engineering team - hiring, mentoring, conducting design reviews, and shipping alongside your team.
Engineering craft - Strong Python, Go, or Java, system architecture, API design, testing, and secure coding practices.
Agentic systems & LLM integration - Deep understanding of agent orchestration, tool-use architectures, LLM integration patterns, context engineering, and frameworks like LangGraph or similar, or custom-built equivalents
Backend & platform engineering - Experience building and operating production APIs, services, and platform infrastructure at scale; comfortable working with relational databases, message queues, and event-driven architectures
RAG & retrieval - Experience with production RAG pipelines, vector databases, embedding systems, and retrieval quality
Evaluation & observability - Experience building LLM and agent eval infrastructure, monitoring AI quality, and observability for non-deterministic systems
Nice to Have:
Platform & infra - Kubernetes, AWS, Terraform or similar IaC, CI/CD, service architecture, incident management
Experience with MCP or similar tool-use protocols for agent-to-service communication
Hands-on ML experience - model training, fine-tuning, or working directly with ML pipelines.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8603446
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
5 ימים
Location: Herzliya
Job Type: Full Time
We are looking for a Senior AI Prompt Engineer who will own the design, development, and optimization of AI Agent experiences built on the Zowie AI platform. You will engineer the prompts, system instructions, guardrails, and multi-turn conversational flows that power our customer-facing AI Agents across chat and email-automation channels.
This is not a surface-level content role - you will operate at the intersection of language, logic, and AI behavior, shaping how our agents reason, respond, escalate, and self-correct. As part of the Digital & AI team, you will collaborate closely with Product, Engineering, AI/ML, Analysts, CX, Operations, and Localization teams to deliver intelligent, scalable, and trustworthy conversational solutions.
What you'll do:
Design, write, and optimize AI-driven conversational experiences, including system prompts, guardrails, tool-use instructions, and multi-turn flows across chatbot and email-automation channels.
Engineer and maintain reusable prompt frameworks, templates, and conversation patterns that ensure consistency in tone, safety, domain accuracy, and localization across markets on multiple channels such as AI Chat, Ai email bot, AI Voice bot.
Define and refine AI Agent behavior across user scenarios, edge cases, error states, escalation paths, and regulatory/compliance requirements.
Own end-to-end conversational journeys - from problem discovery and use-case research through design, prompt engineering, testing, deployment, and iterative optimization.
Build and maintain prompt evaluation pipelines - designing test cases, scoring rubrics, and regression tests to systematically measure prompt quality, hallucination rates, and task-completion accuracy.
Monitor, analyze, and improve AI Agent performance using analytics dashboards, QA outputs, hallucination findings, user feedback, and operational metrics; translate insights into concrete prompt improvements.
Collaborate cross-functional with Product, Engineering, AI/ML, CX, and Operations teams to identify high-impact use cases, define agent capabilities, and deliver scalable solutions.
Contribute to internal prompt engineering guidelines, conversational design systems, and AI best practices - helping establish our standards for responsible, effective AI Agent deployment.
Stay current with advancements in LLMs, agentic AI patterns, prompt optimization strategies, and conversational AI tooling; bring relevant innovations into the teams workflow.
דרישות:
3-4+ years of hands-on experience working with Large Language Models (LLMs) - including prompt engineering, system prompt design, and LLM-based application development (e.g., OpenAI, Azure OpenAI, Anthropic Claude, Google Gemini).
English proficiency - Mandatory (native or near-native written English; this role is language-critical).
Proven experience designing, deploying, and optimizing AI-powered conversational experiences (chatbots, AI agents, email automation, Voice bot or virtual assistants).
Coding/scripting experience (Python, JavaScript) for prototyping, automation, or prompt testing.
Experience with AI Agent architectures and concepts - tool use, function calling, RAG, multi-step reasoning, guardrails, and escalation logic.
Strong analytical skills - comfortable working with conversation analytics, A/B testing prompt variants, and using data to drive design decisions.
Experience designing for multilingual and multicultural audiences.
Ability to collaborate with developers and data teams to implement, test, and iterate on AI flows.
Familiarity with version control practices for prompt management and documentation.
Excellent stakeholder management - able to align multiple teams around conversational strategy and priorities.
Advantages:
Experience with conversational AI platforms (e.g., Zowie ai, Kore.ai, Yellow.ai, Ada, Cognity).
Knowledge of SQL or analytics/BI tools for performance analysis.
Background in customer service, contact center, or fintech environmen המשרה מיועדת לנשים ולגברים כאחד.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8605973
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
30/03/2026
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a highly motivated AI Full stack Engineer with GenAI background in production to join our team and help us shape the future of the Agentic engineering platform (AEP).
What youll do:
At our company, were a platform by developers, for developers. Your role will encompass end-to-end design, implementation, and daily feature delivery across both backend and frontend systems.
You will:
Implement high scale AI-powered features deeply integrated into our platform
Design and build production-grade backend systems serving a wide and growing user base
Build agent-based workflows using frameworks such as AI SDK
Integrate LLMs into real production systems with attention to reliability, latency, observability, and cost
Work across frontend (React + TypeScript) and backend (NodeJS, Python, Go) to deliver complete AI-driven user experiences
Own features end-to-end: design, implementation, testing, deployment, and monitoring
Help define standards and best practices around AI reliability and evaluation
Contribute to technical planning, mentor teammates, and help recruit top talent
Develop retrieval-augmented generation (RAG) pipelines over structured and unstructured data
Our stack includes React + TypeScript on the frontend, and NodeJS + TypeScript, Python, and Golang on the backend, and Vercels AI-SDK + AWS Bedrock + Azure OpenAI for GenAI. We use Kafka + Kafka Connect, Redis, PostgreSQL, MongoDB and other modern infrastructure components.
Requirements:
5+ years of professional software engineering experience
Experience in NodeJS + TypeScript
Strong experience designing and developing complex systems from design to production
Experience dealing with scale and performance-related challenges
Experience building or integrating AI/LLM-powered applications in production or meaningful production systems
Experience building agent workflows and tool integrations
Ability to think critically about model limitations, hallucinations, latency, and cost tradeoffs
A collaborative team player with a can-do approach
Strong written and verbal communication skills in English and Hebrew
Advantages:
Experience with AWS or other cloud platforms
Experience with vercels AI SDK
Experience with embeddings, vector databases, or semantic search
Expierence with AWS Bedrock / Azure Open-AI
Experience building tool-using agents or workflow engines
Experience with AI evaluation, observability, and monitoring
Experience in DevOps-related tools
Experience with PostgreSQL, Kafka, DocumentDB, OpenSearch, Redis.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8597066
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
2 ימים
חברה חסויה
Location:
Job Type: Full Time
abra R&D is looking for a AI Engineer! abra R&D is looking for an AI Engineer that will take part of building a next-generation agentic analytics platform powered by a real-time, AI-optimized data infrastructure. We are looking for an experienced AI Engineer to design, build, and deploy intelligent systems that operate at scale and in real time. This role is hands-on and product-oriented, focusing on developing, integrating, and productionizing AI and machine learning models as part of a complex, high-performance platform. What You Will Do:
* Design, develop, and deploy AI and machine learning models into production systems
* Build scalable AI services that operate on large-scale and real-time data
* Implement deep learning and machine learning solutions using modern frameworks
* Integrate AI models into end-to-end product flows and backend systems
* Collaborate closely with software engineers and AI teams to deliver production-ready solutions
* Optimize model performance, reliability, and scalability in real-world environments
* Develop and maintain data pipelines and model-serving infrastructure
* Contribute to the evolution of AI-powered, agent-based systems and analytics capabilities
Requirements:
* 3+ years of experience in AI engineering, machine learning engineering, or applied ML in production
* Strong programming skills in Python
* Hands-on experience with PyTorch or TensorFlow
* Experience implementing ML models using frameworks such as scikit-learn, XGBoost, or LightGBM
* Solid experience with data processing tools ( Pandas, NumPy, Spark
* Experience working with large-scale or real-time data systems
* Strong software engineering mindset with a focus on reliability and maintainability Strong Advantages
* Experience deploying AI models in production environments
* Familiarity with LLM-based systems, AI agents, or agentic workflows
* Experience with event-driven or real-time analytics systems
* Background in AI-powered platforms or data-driven products
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8610855
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
01/04/2026
חברה חסויה
Location: Ramat Gan
Job Type: Full Time
We are looking for a Senior Applied AI Researcher to design, evaluate, and deploy AI systems that operate over complex security data and workflows.
This role focuses on building reliable agent-based systems, developing semantic representations of security data, and enabling AI systems to reason across structured and unstructured sources. The work spans research, experimentation, and productionization.
You will work on problems such as relational discovery, schema alignment, semantic modeling, and graph-based retrieval, and turn promising approaches into real systems used by security teams.
What Youll Do
Design and build AI systems for security analysis and automation.
Develop methods for relational discovery, schema matching, and semantic modeling across heterogeneous security data.
Design and run evaluation and benchmarking frameworks for models, agents, and end-to-end systems.
Experiment with agent architectures, tools, and orchestration strategies.
Investigate system behavior, analyze failure modes, and improve system reliability.
Collaborate with engineering teams to bring research prototypes into production systems.
Requirements:
Strong background in applied AI, machine learning, or AI systems.
MSc or PhD in Computer Science, Machine Learning, AI, or a related field, or equivalent practical experience.
Hands-on experience with LLMs, AI agents, or complex AI systems.
Experience designing evaluation and benchmarking methodologies for AI systems.
Experience working with structured and semi-structured data systems.
Proficiency in Python and modern AI frameworks.
Ability to work independently on ambiguous, real-world problems.
Preferred
Familiarity with knowledge graphs, graph databases, or graph-based reasoning.
Experience applying AI in security or adversarial environments.
Experience evaluating system-level behavior of AI systems in production.
Publications, open-source contributions, or prior research in AI systems, agents, or data reasoning systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8599466
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
30/03/2026
Location: Ra'anana
Job Type: Full Time
What We're Looking For
Role Overview: We are developing Ask our company - a high-load financial analysis system based on Large Language Models. The architecture is built on complex multi-agent orchestration using LangGraph, FastAPI, and Elasticsearch.
We are looking for a Senior Backend Engineer specialized in Generative AI to design agent workflows, optimize interactions with models (OpenAI, AWS Bedrock), and ensure the reliability of non-deterministic systems in production.
Tech Stack: Python (Asyncio), FastAPI, LangChain, LangGraph, Pydantic, Elasticsearch, AWS Bedrock / OpenAI API, LangSmith.
What You'll Do
Agent Architecture: Design and implement complex agent orchestration logic using LangGraph. You will define state management, conditional routing, and error handling within the agent graph.
Tool Engineering: Build and optimize the tool layer (function calling) that allows LLMs to interact with internal financial APIs and databases accurately.
Performance Optimization:
-Reduce end-to-end latency through asynchronous processing and streaming (SSE).
-Implement semantic caching strategies to minimize API costs and response time.
-Optimize token usage without sacrificing answer quality.
Observability & Evaluation: Implement automated evaluation pipelines using LangSmith. You will be responsible for setting up regression testing for prompts and agents to measure quality (correctness, faithfulness) before deployment.
Advanced RAG: Refine retrieval strategies. Work on hybrid search implementation (Keyword + Vector), re-ranking, and query expansion to feed the most relevant context to the model.
Requirements:
Python Expert: Strong proficiency in modern Python. Deep understanding of asynchronous programming (asyncio) patterns is mandatory, as our entire I/O pipeline (Network, DB, LLM) is non-blocking. Experience with FastAPI and Pydantic (v2).
Agentic Frameworks: Production experience with LangChain. Hands-on experience or deep conceptual understanding of LangGraph (or similar state-machine based agent frameworks).
Deep LLM Expertise (What we mean by "Deep"):
Non-determinism Management: Strategies for handling LLM hallucinations and ensuring reliable outputs (e.g., self-correction loops, specific prompting techniques like CoT/ReAct).
Structured Outputs: Experience forcing LLMs to adhere to strict schemas (Pydantic/JSON mode) for reliable downstream processing.
Context Optimization: Advanced strategies for managing limited context windows (summarization chains, sliding windows, selective context injection) beyond simple truncation.
Inference Economics: Understanding the trade-offs between model size, latency, and cost (e.g., when to route to GPT-4 vs. a smaller/faster model).
Nice to Have
Experience with Elasticsearch (DSL queries, analyzers).
Knowledge of vector databases and embedding models.
Background in FinTech or familiarity with financial data structures.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8596950
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
26/03/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
you will work at the intersection of Machine Learning and software engineering - selecting the right models, feedback strategies, and evaluation frameworks to make ai-generated code reliable, high-quality, and trustworthy.
what you'll be doing:
design and build ai-powered development pipelines - from code generation and automated review to feedback loops and evaluation systems.
evaluate and select ml approaches for specific problems: when to use llm prompting vs. fine-tuning (qlora), classical ml (random forest, linear regression) vs. reinforcement learning, rag vs. structured extraction.
architect feedback and evaluation systems that measure and improve ai output quality over time.
review and refine ai solution architectures - evaluate design decisions, identify weaknesses, propose alternatives with reasoning.
lead proof-of-concept development to validate new ai/ml approaches for development tooling.
collaborate with the core team to define risk-based development levels and calibrate ai review depth per level.
Requirements:
what we need to see:
hold a m.sc. or ph.d. in Computer Science, electrical or computer engineering from a leading university (or equivalent experience).
5+ years of industry experience (or equivalent) in ai pipelines architecture or related fields.
industry experience building and shipping ai-powered tools or ml pipelines (not just training models - end-to-end delivery).
strong understanding of llm capabilities and limitations - prompt engineering, fine-tuning, rag, agent architectures.
experience with at least two of: reinforcement learning, classical ml, NLP /information retrieval, evaluation framework design.
can reason about trade-offs: when to use which approach, with real reasoning backed by shipping experience.
strong programming skills ( Python required; familiarity with ml frameworks - pytorch, huggingface, etc.).
ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
ways to stand out from the crowd:
experience with llm-based code generation, code review, or Developer tooling.
familiarity with eval frameworks and feedback loop design (online and offline evaluation).
experience with ai agent orchestration (multi-agent systems, tool use, planning).
shown research track record (publications, open-source contributions).
knowledge of ai-assisted development tools and their underlying architectures.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593814
סגור
שירות זה פתוח ללקוחות VIP בלבד