
Confidential company
Location: Merkaz
We are seeking a skilled MLOps Engineer.
Responsibilities:
Deploy machine learning models in development and production environments, ensuring seamless integration and reliability.
Optimize models for scalability, stability and performance in real-world scenarios.
Monitoring & Maintenance: Monitor training and production infrastructure, analyze model performance, identify potential issues and implement solutions to ensure continuous operation.
Leverage MLOps platforms and tools to automate and streamline the development of machine learning models.
Infrastructure Management: Work with containerization, orchestration, and infrastructure-as-code tools to support scalable and efficient ML workflows.
Requirements:
At least 2 years of experience as an MLOps Engineer.
Strong programming skills in Python.
Hands-on experience with machine learning frameworks such as PyTorch and TensorRT.
Proficiency in infrastructure-as-code tools like Terraform.
Familiarity with MLOps best practices and tools for model lifecycle management.
Advantages:
Experience working with cloud platforms such as AWS or GCP.
Knowledge in additional MLOps tools and platforms.
This position is open to all candidates.
 
Similar jobs that may interest you
15/07/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior ML Engineer.
As a Senior ML Engineer, you will lead the design, deployment, and maintenance of production-ready machine learning systems that power legal insights at scale. You'll work across disciplines, integrating LLMs, building pipelines, and optimizing infrastructure, to deliver real-world impact.
You'll gain full ownership over the systems you build, from selecting the tools and platforms to deploying services in production. You'll work closely with DevOps and Data Engineering to ship robust solutions that drive meaningful change in the world.
Responsibilities:
Design and implement production-grade ML systems, including APIs, batch jobs, and streaming pipelines using Databricks, MLflow, AWS/GCP.
Build and manage ML infrastructure, including data pipelines, model training, deployment, and monitoring.
Develop and maintain end-to-end ML/LLM pipelines, from data ingestion and labeling to synthetic data generation, model registry, and rollout.
Own and improve MLOps practices: automated testing, CI/CD, monitoring, alerting, and model governance.
Write clean, maintainable Python code and uphold best practices in engineering and documentation.
Collaborate with DevOps and Data Engineering teams to scale systems and improve performance.
Research and recommend the best tools, platforms, and practices to support ML at scale.
Requirements:
BSc in Computer Science or a related field.
6+ years of experience building and deploying ML systems in production environments.
Proficiency in Python and production frameworks like FastAPI, Databricks, SageMaker, and MLflow.
Proven track record in deploying and maintaining ML/LLM services (APIs, microservices, serverless, or containerized).
Strong understanding of software engineering fundamentals: object-oriented design, testing, version control, CI/CD, and performance optimization.
Experience working with agentic workflows or LLM-based agents.
Ability to work independently and break down complex, ambiguous problems into structured solutions.
Strong communication skills: able to explain technical concepts to both technical and non-technical stakeholders.
Advantages:
Hands-on experience with Kubernetes, Airflow, Spark, ArgoCD, and Docker.
Experience working with databases such as Elasticsearch, vector databases, PostgreSQL, and SQL.
Experience fine-tuning or integrating open-source LLMs in production environments (e.g., RAG, LoRA, agent frameworks).
Contributions to open-source ML or MLOps projects.
This position is open to all candidates.
 
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior MLOps Engineer
Realize your potential by joining the leading performance-driven advertising company!
As a Senior MLOps Engineer in the Infra group, you'll play a vital role in developing, enhancing and maintaining highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How you'll make an impact:
As a Senior MLOps Engineer, you'll:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring, alerting and more
Have end-to-end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Directly influence the way billions of people discover the internet
Our tech stack:
Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, Elasticsearch, Airflow, BigQuery, Google Cloud Platform, Kubernetes, Docker, Git and Jenkins.
Requirements:
To thrive in this role, you'll need:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years of experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environments
Excellent coding skills in Java & Python
Experience with TensorFlow is a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of Computer Science fundamentals: object-oriented design, data structures, systems, application programming and multithreaded programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.
This position is open to all candidates.
 
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Staff MLOps Engineer
Realize your potential by joining the leading performance-driven advertising company!
As a Staff MLOps Engineer in the Infra group, you'll play a vital role in developing, enhancing and maintaining highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How you'll make an impact:
As a Staff MLOps Engineer, you'll:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring, alerting and more
Have end-to-end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Directly influence the way billions of people discover the internet
Our tech stack:
Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, Elasticsearch, Airflow, BigQuery, Google Cloud Platform, Kubernetes, Docker, Git and Jenkins.
Requirements:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years of experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environments
Excellent coding skills in Java & Python
Experience with TensorFlow is a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of Computer Science fundamentals: object-oriented design, data structures, systems, application programming and multithreaded programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework
This position is open to all candidates.
 
04/08/2025
Location:
Job Type: Full Time
We are a well-funded AI startup on a mission to revolutionize passenger journeys with intelligent threat detection powered by machine learning and computer vision. We're building cutting-edge AI tools to ensure safer and more efficient travel.

We are seeking a highly motivated and experienced Data Engineer & MLOps Specialist to join our dynamic Data team. This is a pivotal role where you will design and implement scalable data solutions, develop automation for machine learning workflows, and manage critical infrastructure supporting our AI/ML initiatives.

Responsibilities:
Develop and maintain automated pipelines for data transformation, model inference, and monitoring.
Build and manage an annotation database to ensure high-quality labeled data for AI training.
Design and deploy infrastructure for automated ML inference and neural network analysis using tools like Voxel51 and ClearML.
Develop ETL pipelines to integrate production data into the data warehouse for analysis and reporting.
Work on local servers and in the AWS cloud, ensuring efficient and secure processing, storage, and deployment of data and ML models.

A chance to design and implement solutions from scratch, shaping the future of data and AI infrastructure.
Work with AI technology in a high-growth startup environment.
Opportunities for career growth and continuous learning.
Join us in building the future of intelligent security systems.
Requirements:
6+ years of experience in data engineering, MLOps, Software or a related field.
Strong programming skills in Python and experience with SQL and data platforms.
Familiarity with tools like Voxel51, ClearML, or similar open-source frameworks.
Hands-on experience in building scalable ETL pipelines and managing large datasets.
Solid understanding of modern data technologies, including data processing frameworks, data storage solutions, and workflow orchestration tools.
Knowledge of machine learning workflows and AI environments is a plus.
Excellent communication and collaboration skills.
Bachelor's degree in Information Systems, Computer Science, or a related field; advanced degrees are a bonus.
This position is open to all candidates.
 
14/07/2025
Location: Haifa
Job Type: Full Time and English Speakers
We are seeking an exceptional Data Scientist to lead applied research in machine learning and drive the AI transformation. This pivotal role will shape the next generation of customer-facing products and internal tools, helping to redefine the trading experience through advanced AI.
Key Responsibilities:
Design and build end-to-end AI workflows including synthetic dataset generation, data preprocessing, model development, deployment, and agent orchestration.
Design and implement LLM (Large Language Model) evaluation frameworks to assess new AI models, techniques, and parameters.
Process and analyze large-scale structured and unstructured data from databases and APIs, and develop machine learning models for time series forecasting, regression, and classification tasks.
Design and implement enterprise-grade RAG systems using LLMs, vector databases, GraphRAG, and intelligent agents.
Optimize AI-user interactions through advanced prompt engineering and retrieval strategies.
Translate cutting-edge AI research into practical, scalable tools for internal and customer-facing applications and collaborate with global and cross-functional R&D teams to drive innovation across product lines.
Apply MLOps and LLMOps best practices to ensure robust model training, deployment, and monitoring pipelines.
Requirements:
BSc or MSc in Computer Science, Software Engineering, Data Science, or a related field.
3+ years of experience as a Data Scientist or ML Engineer.
Hands-on expertise with machine learning techniques: regression, classification, clustering, time series.
Deep interest and foundational knowledge, including hands-on experience with:
Transfer learning & fine-tuning: adapting large pre-trained models (like GPT, LLaMA, Mistral) to domain-specific tasks.
LLMs, Prompt Engineering, and Chain-of-thought (CoT) prompting
Designing MCP APIs, Agents, and Multi-Agent Orchestration
Vector databases (RAG) and GraphRAG-based solutions
Models for Content Moderation
Familiarity with MLOps & LLMOps practices: automating model training, testing, deployment, monitoring, and governance.
Strong proficiency in Python (or other relevant programming languages).
Excellent written and verbal communication skills in English.
This position is open to all candidates.
 
31/07/2025
Confidential company
Location: Petah Tikva
Job Type: Full Time
We are at the forefront of developing cutting-edge, real-time, mission-critical platforms in the fields of energy, utilities, and smart infrastructure. We are looking for a passionate and experienced DevOps Engineer with deep expertise in MLOps, Azure DevOps, and ArgoCD to help drive scalable deployments and intelligent automation in production environments.
Responsibilities:

* Design and maintain CI/CD pipelines in Azure DevOps for both traditional applications and ML workloads.
* Develop and manage MLOps workflows for model training, validation, deployment, and monitoring.
* Automate deployment of services using ArgoCD, Helm, and GitOps best practices.
* Manage and operate production-grade Kubernetes clusters (on-prem and Azure AKS).
* Implement Infrastructure as Code (IaC) using Terraform.
* Collaborate closely with data scientists, software engineers, and infrastructure teams to support end-to-end delivery.
* Monitor system performance, availability, and reliability using tools like Prometheus, Grafana, and the EFK stack.
* Ensure best practices in security, version control, logging, and compliance.
Why Join Us:

* Be part of a leading-edge technology company solving real-world challenges in energy and infrastructure.
* Work with a multidisciplinary team of experts in a dynamic and collaborative environment.
* Enjoy continuous professional development.
* Opportunity to work on impactful, large-scale projects with national and global reach.
Requirements:
* 3+ years of experience as a DevOps Engineer in a production environment.
* Proven expertise in Azure DevOps (Pipelines, Repos, Artifacts).
* Proficient in ArgoCD, Kubernetes, and container orchestration.
* Experience with Docker, Git, and versioning strategies.
* Scripting knowledge in Python, Bash, or PowerShell.
* Solid understanding of CI/CD, GitOps, and cloud-native architecture.
* Experience working in hybrid (on-prem + cloud) environments.
* Customer-facing role; fluent English (spoken and written) is required.
* Basic networking understanding is a must.
Nice to Have:
* Experience in OpenShift platforms
* Experience working in AWS
* Familiarity with monitoring/logging solutions: Prometheus, Grafana, Elasticsearch, Kibana.
* Background working with data pipelines or Real-Time streaming systems.
* Hands-on experience with MLOps platforms (e.g., MLflow, Kubeflow, Azure ML).
* Azure Certifications (e.g., AZ-400, AZ-104).
* Knowledge in different architecture environments
This position is open to all candidates.
 
13/07/2025
Location: Tel Aviv-Yafo and Netanya
Job Type: Full Time
As a Big Data & GenAI Engineering Lead within our company's Data & AI Department, you will play a pivotal role in building the data and AI backbone that empowers product innovation and intelligent business decisions. You will lead the design and implementation of our company's next-generation lakehouse architecture, real-time data infrastructure, and GenAI-enriched solutions, helping drive automation, insights, and personalization at scale. In this role, you will architect and optimize our modern data platform while also integrating and operationalizing Generative AI models to support go-to-market use cases. This includes embedding LLMs and vector search into core data workflows, establishing secure and scalable RAG pipelines, and partnering cross-functionally to deliver impactful AI applications.
As a Big Data & GenAI Engineering Lead in our company you will...
Design, lead, and evolve our company's petabyte-scale Lakehouse and modern data platform to meet performance, scalability, privacy, and extensibility goals.
Architect and implement GenAI-powered data solutions, including retrieval-augmented generation (RAG), semantic search, and LLM orchestration frameworks tailored to business and developer use cases.
Partner with product, engineering, and business stakeholders to identify and develop AI-first use cases, such as intelligent assistants, code insights, anomaly detection, and generative reporting.
Integrate open-source and commercial LLMs securely into data products using frameworks such as LangChain or similar, augmenting those products with AI capabilities.
Collaborate closely with engineering teams to drive instrumentation, telemetry capture, and high-quality data pipelines that feed both analytics and GenAI applications.
Provide technical leadership and mentorship to a cross-functional team of data and ML engineers, ensuring adherence to best practices in data and AI engineering.
Lead tool evaluation, architectural PoCs, and decisions on foundational AI/ML tooling (e.g., vector databases, feature stores, orchestration platforms).
Foster platform adoption through enablement resources, shared assets, and developer-facing APIs and SDKs for accessing GenAI capabilities.
Requirements:
8+ years of experience in data engineering, software engineering, or MLOps, with hands-on leadership in designing modern data platforms and distributed systems.
Proven experience implementing GenAI applications or infrastructure (e.g., building RAG pipelines, vector search, or custom LLM integrations).
Deep understanding of big data technologies (Kafka, Spark, Iceberg, Presto, Airflow) and cloud-native data stacks (e.g., AWS, GCP, or Azure).
Proficiency in Python and experience with GenAI frameworks like LangChain, LlamaIndex, or similar.
Familiarity with modern ML toolchains and model lifecycle management (e.g., MLflow, SageMaker, Vertex AI).
Experience deploying scalable and secure AI solutions with proper attention to privacy, hallucination risk, cost management, and model drift.
Ability to operate in ambiguity, lead complex projects across functions, and translate abstract goals into deliverable solutions.
Excellent communication and collaboration skills, with a passion for pushing boundaries in both data and AI domains.
This position is open to all candidates.
 
Confidential company
Location: Netanya
Job Type: Full Time
We are seeking a skilled and passionate Machine Learning Engineer who combines strong software engineering expertise with a deep understanding of data science and machine learning. In this role, you will be instrumental in developing and improving our ML systems, ensuring that both the underlying infrastructure and the algorithms powering our models are optimized for performance and scalability.
You will collaborate closely with cross-functional teams, including data scientists and software engineers, to implement end-to-end machine learning models that drive impactful business outcomes. Additionally, your work will ensure the continued improvement of our ML platform, which supports algorithmic work across the company.
What will you do?
As a Senior Machine Learning Engineer, your mission will be:
Improve the existing ML library and tools so that more data can be explored and analyzed and accurate feedback can be provided on the activities.
Create, design, develop, test and monitor your code in production autonomously and reliably.
Design, develop, and implement ML models and algorithms with a focus on robustness, performance, and scalability.
Collaborate with data scientists to identify and apply effective machine learning techniques and strategies for improved system performance.
Mentor other team members to improve their autonomy and Software Engineering skills.
Collaborate with a variety of teams on production cases to develop services from design to production.
Make sure the software is in good hands by writing, running and automating tests (unit, functional, load...).
Keep up to date with the latest Machine Learning technologies to make sure we constantly improve our ML system.
Requirements:
Strong programming skills with a focus on Python, Java, Scala and proficiency in software engineering practices such as testing, debugging, and performance tuning.
Hands-on experience implementing machine learning algorithms, particularly in production environments.
Strong problem-solving skills.
Strong communication skills, with the ability to collaborate effectively across teams and explain complex technical concepts clearly.
Bonus Points:
Knowledge of Data Engineering tools (e.g. Spark, Airflow, ...).
Familiarity with ML frameworks and tools (e.g., TensorFlow, PyTorch, ...).
Performance engineering capabilities, including profiling and optimizing algorithms and systems for high efficiency and low latency.
Experience working with cloud-based infrastructures (e.g., AWS, GCP).
A Master's or PhD in CS/Math.
This position is open to all candidates.
 
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for an experienced MLOps / DevOps Engineer to design and manage the infrastructure powering large-scale machine learning systems. You'll be responsible for deploying GPU-heavy models (including LLMs) on cost-efficient, production-grade infrastructure, supporting both ML workflows and application artifact delivery.

You'll work with cutting-edge technologies like vLLM, Triton, SageMaker, ClearML, Karpenter, KEDA, and EKS, ensuring the right balance between performance, scalability, and cost.

What You'll Do

Deploy and manage LLMs and deep learning models using vLLM, Triton Inference Server, and custom API endpoints.

Build and maintain GPU-aware autoscaling clusters using AWS EKS, Karpenter, and KEDA, optimizing for cost-efficiency and performance.

Develop CI/CD pipelines using Jenkins and GitHub Actions to automate ML model delivery and application deployments.

Orchestrate training, fine-tuning, and inference jobs on AWS SageMaker and ClearML, with support for experiment tracking, versioning, and reproducibility.

Support backend teams in deploying app artifacts and runtime environments; implement rollback and release strategies.

Integrate observability tooling (e.g., Prometheus, Grafana, ELK, or OpenTelemetry) for both infrastructure and model performance.

Collaborate with SREs to enforce high availability, disaster recovery, and incident response procedures for mission-critical AI services.
Requirements:
6+ years of experience in DevOps, MLOps, or infrastructure roles with a focus on ML model delivery.

Proven hands-on experience deploying GPU-based models (LLMs, vision, transformers) using vLLM or Triton.

Deep knowledge of AWS EKS and Kubernetes, with practical experience configuring Karpenter and KEDA for auto-scaling GPU workloads.

Experience building pipelines using Jenkins, GitHub Actions, and managing releases for ML and application codebases.

Familiarity with AWS SageMaker, ClearML, or similar platforms for ML orchestration and experimentation.

Strong scripting and automation skills in Python, Bash, and working knowledge of containerization (Docker).

Solid grasp of networking, IAM, and cloud security fundamentals.
This position is open to all candidates.
 
1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
At UVeye, we're on a mission to redefine vehicle safety and reliability on a global scale. Founded in 2016, we have pioneered the world's first fully automated suite of vehicle inspection systems. At the heart of this innovation lies our advanced AI-driven technology, representing the pinnacle of machine learning, GenAI, and computer vision within the automotive sector. With close to $400M in funding and strategic partnerships with industry giants such as Amazon, General Motors, Volvo, and CarMax, UVeye stands at the forefront of automotive technological advancement. Our growing global team of over 200 employees is committed to creating a workplace that celebrates diversity and encourages teamwork. Our drive for innovation and pursuit of excellence are deeply embedded in our vibrant company culture, ensuring that each individual's efforts are recognized and valued as we unite to build a safer automotive world.
We are seeking a highly motivated and skilled Release Engineer to join our AIOps group. In this role, you'll play a critical part in bridging the gap between development and operations, ensuring the seamless qualification, deployment, and monitoring of our AI algorithms and infrastructure, and be responsible for the end-to-end operationalization of our core technology.
A day in the life and how you’ll make an impact:
* Manage the end-to-end release process of machine learning algorithms and infrastructure components, from qualification through deployment.
* Validate and test new algorithm releases to ensure they meet performance, stability, and compliance standards.
* Create and execute deployment plans across various environments (staging, production), ensuring minimal risk and downtime.
* Collaborate closely with AI researchers, MLOps, and software engineers to understand release requirements, share feedback, and resolve pre-release issues.
* Identify and drive automation opportunities within the release pipeline to improve efficiency, reliability, and traceability.
* Oversee updates to infrastructure components, ensuring compatibility and performance across systems.
* Monitor deployments, proactively identify issues related to model behavior or infrastructure anomalies, and drive resolution with relevant teams.
* Maintain clear and accurate release documentation, including version history, deployment notes, and incident reports.
Requirements:
* Bachelor's degree in Computer Science, Software Engineering, or industry equivalent.
* 2+ years of experience in QA & Automation.
* Proficiency in scripting languages (e.g., Python, Bash).
* Experience with containerization technologies (e.g., Docker, Kubernetes).
* Familiarity with CI/CD pipelines (e.g., GitLab CI/CD, Jenkins).
* Experience with cloud platforms (e.g., AWS, GCP).
* Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
* Excellent problem-solving skills and attention to detail.
* Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams.
Bonus if you have:
* Strong understanding of the machine learning lifecycle, from experimentation to deployment and monitoring.
* Experience with specific MLOps platforms or tools.
* Experience in a fast-paced startup environment.

Why UVeye:
* Pioneer Advanced Solutions: Harness cutting-edge technologies in AI, machine learning, and computer vision to revolutionize vehicle inspections.
* Drive Global Impact: Your innovations will play a crucial role in enhancing automotive safety and reliability, impacting lives and businesses on an international scale.
* Career Growth Opportunities: Participate in a journey of rapid development, surrounded by groundbreaking advancements and strategic industry partnerships.
This position is open to all candidates.
 