דרושים » תוכנה » VLA Deep Learning Engineer, End-To-End Autonomous Driving

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 15 שעות
Location: More than one
Job Type: Full Time
We have been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Its a unique legacy of innovation thats fueled by great technologyand amazing people. Today, were tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing whats never been done before takes vision, innovation, and the worlds best talent. Youll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are in need of skilled engineers to join our autonomous driving team to invent, execute, and deploy pioneering end-to-end autonomous driving systems. Our strategy has progressed from AI 1.0 constructing a driver from the ground up to AI 2.0 training an intelligent agent to drive. This is achieved by developing LLMs, VLMs, and VLAs to offer exceptional reasoning, planning capabilities, and interaction with the driving system for autonomous driving and general robotics. Lets innovate the future of autonomytogether!

What you will be doing:

Build and train innovative large-scale modelsincluding generative, imitation, and reinforcement learningto improve the planning and reasoning capabilities of our driving systems.

Explore novel data generation and collection strategies to improve diversity and quality of training datasets. Develop, pre-train, and optimize LLM/VLM/VLA models for autonomous driving and robotics applications.

Collaborate cross-functionally to deploy and integrate AI models into vehicle firmware.

Deliver production-quality, safety-critical software that meets performance, safety, and reliability standards.
Requirements:
What we need to see:

PhD or Master's degree with equivalent experience.

8+ years of experience.

Hands-on experience training LLMs/VLMs/VLAs from scratch, or a proven record as a top-tier ML engineer/researcher passionate about autonomous systems.

Strong programming skills in Python and proficiency with major deep learning frameworks. Basic familiarity with C++ for model deployment and integration in safety-critical systems.

Comprehensive grasp of current deep learning structures and improvement methods. Consistent track record of deploying production-grade ML models for self-driving, robotics, or related fields at scale.

Ways to stand out from the crowd:

Experience developing and shipping LLM/VLM/VLA solutions for autonomous vehicles or general robotics products.

Publications, contributions to open-source projects, or victories in competitions connected to LLM/VLM/VLA systems.

Profound comprehension of behavior and motion planning in real-world autonomous vehicle (AV) applications.

Experience building and training large-scale datasets and models and/or training agents with reinforcement learning.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8418914
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 15 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are in search of a Senior Software Architect- a creative, forward-thinking, and practical researcher to improve the framework for widespread LLM learning and prediction. As part of our dynamic E2E Architecture group, you will design and optimize systems driving generative AI workloads, working at the intersection of software and hardware on some of the most advanced GPU clusters worldwide. You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and compute scheduling to system-level optimization. This is an opportunity to collaborate with best-in-class engineers and researchers and shape the future of generative AI in real-world applications. Your work will make a lasting impact by enabling generative AI technologies to reach real-world applications and improve global computing capabilities.

What Youll Be Doing:

Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, and placement.

Analyze current deployments, develop prototypes, and recommend architectural improvements.

Stay abreast of the latest research; become the teams authority in emerging networking techniques and technologies.

Design, simulate, and validate new systems using novel, scalable network simulator NSX.

Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).

Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.

Publish patents and present research at leading conferences.
Requirements:
What We Need to See:

M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related fieldor B.Sc. with research experience and publications.

5+ years of relevant experience.

Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).

Strong software engineering skills in C++ and/or Python.

Excellent system-level design and problem-solving abilities.

Outstanding communication and collaboration skills across technical domains.

Ways to Stand Out from the Crowd:

Proven passion for solving sophisticated technical problems and delivering impactful solutions.

Record of publications in top-tier conferences.

Experience in designing and building large-scale AI training clusters.

Post-PhD research experience

Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8418932
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
2 ימים
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture group, where we build cutting-edge systems powering the next generation of generative AI workloads. In this role, you will work across software and hardware domains to design and optimize inference infrastructure for large language models running on some of the most advanced GPU clusters in the world.

Youll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across us and leave a mark on the way generative AI reaches real-world applications.

What Youll Be Doing:

Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.

Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.

Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.

Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.

Evaluate and integrate new software and hardware technologies relevant to Core Spectrum-X technologies, such as load balancing, telemetry, congestion control, vertical application integration.

Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.

Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
Requirements:
What We Need to See:

Bachelors, Masters, or PhD in Computer Science, Electrical Engineering, or equivalent experience.

8+ years of experience building large-scale distributed systems or performance-critical software.

Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows and/or high performance networking.

Solid software engineering skills in C++ and/or Python, preferably demonstrate strong familiarity with CUDA or similar platforms.

Strong system-level thinking across memory, networking, scheduling, and compute orchestration.

Excellent communication skills and ability to collaborate across diverse technical domains.

Ways to Stand Out from the Crowd:

Experience working on LLM - training or inference pipelines, transformer model optimization, or model-parallel deployments.

Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.

AI Accelerators and distributed communication patterns, congestion control and/or load balancing.

Proven optimization process for complex systems, deployed at scale to make impact.

Passion for solving tough technical problems and finding high-impact solutions.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8415674
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
7 ימים
Location: More than one
Job Type: Full Time
We are seeking an AI Networking Exploration Architect for our Networking Insights Group to bridge the gap between cutting-edge, hyper-scale AI workloads and the datacenter infrastructure that enables them. You will join a small, focused team of multidisciplinary engineers driving AI workload optimization through deep application understanding and end-to-end systems thinking. Your insights will directly shape our products across the full stackfrom applications and software libraries to hardware architecture and physical design.

What You'll Be Doing:

Model the performance of complex AI workloads to identify bottlenecks and recommend system-level optimizations.

Translate state-of-the-art research into actionable infrastructure, software, and hardware features in partnership with architecture teams.

Rapidly master new AI domains (LLMs, generative models, multimodal systems) and distill key findings for product teams.

Incorporate your deep knowledge of AI applications into our hardware and software roadmaps.

Conduct independent research by formulating hypotheses about workload behavior and validating them through rigorous analysis.

Drive architectural innovation and network optimization by applying your domain expertise to exploratory analysis of real-world Deep Learning (DL) workloads.
Requirements:
What we need to see:

M.Sc. or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.

+5 years of experience.

Strong ML/Data Science background with hands-on experience in LLMs or generative AI.

A systems-level mindset with the ability to estimate end-to-end requirements across the entire AI stack.

Proven ability to translate research and product requirements into clear software/hardware specifications.

Exceptional research skills: you can digest academic papers, self-learn new domains, and independently test hypotheses.

Advanced Python programming skills for performance modeling and data analysis.

Excellent communication skills, with the ability to present complex findings with clarity and conviction.

A pragmatic approach: you are detail-oriented but can prioritize effectively to focus on the most critical issues.

Ways to Stand Out from the Crowd:

Deep understanding of datacenter infrastructure, network topologies, and protocols.

Expertise in distributed training methods and their impact on infrastructure.

Knowledge of AI performance metrics and the impact of different deployment strategies.

Experience extrapolating academic research into tangible hardware architecture requirements.

A track record of leading complex, multidisciplinary research projects that result in production impact.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8409343
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
5 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly motivated Senior Deep Learning Researcher to join our team! This is an outstanding opportunity to conduct impactful research and develop the next generation of large language model (LLM) inference algorithms. You will work on technologies that directly enhance our software, making the latest LLMs more efficient and accessible for users worldwide. By joining us, you will be part of a strategic effort to establish us as the definitive platform for high-performance LLM inference. You will engage with skilled problem-solvers and top organizations, crafting AI technology advancements.

What you'll be doing:
Research, invent, and implement groundbreaking algorithms for LLM inference to advance the state of the art in both low-latency and high-throughput scenarios.
Translate research into practical software solutions that directly impact our products and customers.
Collaborate with internal research, engineering, and product teams across the globe to drive the development of sophisticated inference technologies.
Analyze the performance of new algorithms on our latest hardware, identifying bottlenecks and opportunities for algorithmic optimizations.
Partner with leading scientific organizations and industry pioneers to remain at the forefront of technological advancements and integrate the latest innovations into practical applications.
Requirements:
What we need to see:
MSc/PhD in Computer Science, Electrical Engineering, or a closely related field.
At least 3 years of proven experience in deep learning research or applied research.
At least one publication in a top-tier AI/ML conference (e.g., NeurIPS, ICLR, ICML).
Deep understanding of LLM architectures coupled with hands-on experience in training large-scale models.
Excellent programming skills, particularly in Python and deep learning frameworks like PyTorch, and experience with software engineering best practices.
A strong problem-solving mentality and a proactive demeanor, driven by the ambition to deliver solutions with real-world impact.

Ways to stand out from the crowd:
Hands-on research experience in LLM inference optimization algorithms such as speculative decoding or parallelization strategies.
Proven experience with High-Performance Computing (HPC) environments, including training or running inference on large-scale GPU clusters (tens to hundreds of GPUs).
Deep familiarity and experience with popular LLM inference systems (e.g., vLLM, TensorRT-LLM).
Experience from a world-class industrial research group or a top-tier institution.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8412839
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
20/10/2025
Location: Ra'anana
Job Type: Full Time and Hybrid work
The Ecosystems Engineering group is seeking a Principal Software Engineer to join our rapidly growing team. This is a game-changing opportunity to join an open-source AI platform that harnesses the power of hybrid cloud to drive innovation. In this role, you will work with a diverse team of highly talented engineers on designing, implementing, and productizing new AI solutions, with a focus on deep integration of the AI stack, hardware accelerators, and leading OEMs and Cloud Computing Service Providers (CCSPs).

You'll play a critical role in shaping the next generation of hybrid cloud platforms by directly contributing to our innovative AI and Edge products. This is your chance to be at the forefront of AI's exciting evolution, joining an ecosystem that champions continuous learning, career growth, and professional development. You'll also collaborate closely with product management, other engineering teams, and key partners and lighthouse customers.

What You Will Do:

Architect and lead the implementation of new features and solutions for our AI and Edge products.

Explore deep code integration into various products, ensuring optimal integration between the company`s portfolio, hardware accelerators and partners.

Provide technical vision and leadership on critical and high-impact projects, ensuring non-functional requirements including security, resiliency, and maintainability are met.

Integrate software that leverages hardware accelerators (e.g., DPUs, GPUs, AIUs) and perform performance analysis and optimization of AI workloads with accelerators.

Work with major AI and hardware partners such as NVIDIA, AMD, Dell, and others on building joint integrations and products.

Collaborate closely with UX, UI, QE, and cross-functional teams to deliver a great experience to our partners and customers.

Coordinate with team leads, architects, and other engineers on the design and architecture of our offerings.

Become responsible for the quality of our offerings, participate in peer code reviews and continuous integration (CI), and respond to security threats.

Mentor, influence, and coach a distributed team of engineers, contributing to a culture of continuous improvement by sharing recommendations and technical knowledge.
Requirements:
What You Will Bring:

7+ years of relevant technical experience in software development.

Advanced experience working in a Linux environment with at least one language like Golang, Rust, Java, C, or C++.

Advanced experience with a container orchestration ecosystem like Kubernetes, or Red Hat OpenShift.

Strong experience with microservices architectures and concepts including APIs, versioning, monitoring, etc.

Experience with AI/ML technologies, including foundational frameworks, large language models (LLMs), Retrieval Augmented Generation (RAG) paradigms, vector databases, and LLM orchestration tools.

Ability to quickly learn and guide others on using new tools and technologies.

Proven ability to innovate and a passion for staying at the forefront of technology.

Excellent system understanding and troubleshooting capabilities.

Autonomous work ethic, thriving in a dynamic, fast-paced environment.

Technical leadership acumen in a global team environment.

Proficient written and verbal communication skills in English.

The Following is Considered a Plus
Experience with cloud development for public cloud services (AWS, GCE, Azure).

Familiarity with virtualization, networking, or storage.

Background in DevOps or site reliability engineering (SRE).

Experience with hardware accelerators (e.g., GPUs, FPGAs) for AI workloads.

Recent hands-on experience with distributed computation, either at the end-user or infrastructure provider level.

Experience with performance analysis tools.

Experience with Linux kernel development.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8378022
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 15 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are rapidly growing into various verticals as stated above and looking to grow our team with a developer relations leader. This role is a key member of the team that would manage our technical engagements in leading industries such as general AI/ML applications development, consumer internet, AI on embedded systems and others; and to evangelize our portfolio of technologies and accelerate its adoption among developers. Have strong technical competence and the ability to be a leader who can function effectively and independently in a matrixed environment to groom and guide developers. This is a regional role covering Israel's AI ecosystem.

What you will be doing:

Develop a technical strategy to ensure our platforms adoption for selected industries, focusing on your priority industries and use cases

Establish relationships with technical leaders in organizations and communities, specifically developers, startups and ISVs within the industries.

Evangelize and develop our leadership position by accelerating the availability of GPU-accelerated AI and data Science applications in the specified market by helping developers understand the value of our hardware products and SDKs in addressing critical development opportunities

Lead participation in targeted customer and industry developer events and activities. Chair technical activities spanning product divisions and sales geographies, particularly with solution architects, software developers and engineering resources, developer marketing contribute towards our local ecosystem strategy and development of our value messaging for your customers.

Be the key advisor to how we are differentiated. You will become a technology mentor and focal point for the software developer community.
Requirements:
What we need to see:

Bachelor's Degree in Engineering, Science, Technical or other related discipline or equivalent experience. Master's or Ph.Ds is preferred. Intellectual curiosity and passion for innovation.

Experience in several verticals/industries with good knowledge and trends in the industry.

Expertise in CUDA programming, GPU platforms and Deep Learning and Machine Learning frameworks.

You will show a deep understanding of who and how to engage the developers product and engineering organizations with at least 8 years related experience.

5+ years experience in an AI and ML software development environment or working with developers in these areas; and at least 3 years experience in business development activities.

Able to work independently and possess excellent communication skills to drive customer and internal engagements.

Demonstrate ability to influence, evangelize and persuade at both operational and executive level (including engineering/ product management) to achieve a targeted outcome.

Execute and accelerate strategic decisions.

Ability to effectively deliver value propositions for specific and targeted industries.

Ensure a positive experience for external customers and partners while working cross functionally within our organization.

Ways to stand out from the crowd:

Experience working on AI Deep Learning and Machine Learning Applications, AI Model Training/Inferencing and other GPU related technologies and application domain.

Strong technical understanding of Data Analytics, Generative AI, Embedded System/Jetson.

Experience in network communication protocol is an added advantage.

Strong analytical, problem solving, and negotiation skills and the ability to use data analysis to support strategic decisions.

Excellent organizational, planning, and execution skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8418997
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
23/10/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Machine Learning Engineer, youll work on cutting-edge
code-focused LLMs and AI agent systems
that power our companys next-generation developer platform. Youll be at the center of research, model training, and productionization of intelligent systems that understand software deeply, collaborate with developers, and help automate engineering workflows end-to-end. Your work will immediately impact millions of engineers worldwide.
Responsibilities:
Push LLM Innovation: Research, design, and fine-tune domain-specific LLMs for code generation, refactoring, debugging, and multi-turn reasoning.
Agent-Oriented Development: Build multi-agent coding systems that integrate retrieval-augmented generation (RAG), code execution, testing, and tool use to create autonomous, context-aware coding workflows.
Production-Grade AI: Own the training-to-inference pipeline for large code modelsoptimize inference with quantization, distillation, and caching techniques.
Rapid Experimentation: Prototype and validate ideas quickly; leverage reinforcement learning, human feedback, and synthetic data generation to push accuracy and reasoning.
Cross-Functional Collaboration: Partner with product, engineering, and design teams to ship AI-powered features that help developers focus on high-impact work.
Scale the Platform: Contribute to distributed training, scalable serving systems, and GPU/TPU-efficient architectures for ultra-low-latency developer tools.
Requirements:
2+ years of hands-on experience designing, training, and deploying machine-learning models
M.Sc. or higher in Computer Science / Mathematics / Statistics or equivalent from a university, or B.Sc. with strong hands-on ML experience
Practical experience with Natural Language Processing (NLP) and LLMs
Experience with data acquisition, data cleaning, and data pipelines
A passion for building products and helping people, both customers and colleagues
All-around team player, fast, self-learning individual
Nice to have:
3+ years of development experience with a passion for excellence
Experience building AI coding assistants, code reasoning models, or dev-focused LLM agents.
Familiarity with RAG, function-calling, and tool-using LLMs.
Knowledge of model optimizations (quantization, distillation, LoRA, pruning).
Startup or product-driven ML experience, especially in high-scale, latency-sensitive environments.
Contributions to open-source AI or developer tools.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8383587
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
13/10/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly motivated Senior Deep Learning Researcher to join our team! This is an outstanding opportunity to conduct impactful research and develop the next generation of large language model (LLM) inference algorithms. You will work on technologies that directly enhance NVIDIA's software, making the latest LLMs more efficient and accessible for users worldwide. By joining us, you will be part of a strategic effort to establish NVIDIA as the definitive platform for high-performance LLM inference. You will engage with skilled problem-solvers at NVIDIA and top organizations, crafting AI technology advancements.

What you'll be doing:
Research, invent, and implement groundbreaking algorithms for LLM inference to advance the state of the art in both low-latency and high-throughput scenarios.
Translate research into practical software solutions that directly impact NVIDIA's products and customers.
Collaborate with internal research, engineering, and product teams across the globe to drive the development of sophisticated inference technologies.
Analyze the performance of new algorithms on NVIDIAs latest hardware, identifying bottlenecks and opportunities for algorithmic optimizations.
Partner with leading scientific organizations and industry pioneers to remain at the forefront of technological advancements and integrate the latest innovations into practical applications.
Requirements:
What we need to see:
MSc/PhD in Computer Science, Electrical Engineering, or a closely related field.
At least 3 years of proven experience in deep learning research or applied research.
At least one publication in a top-tier AI/ML conference (e.g., NeurIPS, ICLR, ICML).
Deep understanding of LLM architectures coupled with hands-on experience in training large-scale models.
Excellent programming skills, particularly in Python and deep learning frameworks like PyTorch, and experience with software engineering best practices.
A strong problem-solving mentality and a proactive demeanor, driven by the ambition to deliver solutions with real-world impact.

Ways to stand out from the crowd:
Hands-on research experience in LLM inference optimization algorithms such as speculative decoding or parallelization strategies.
Proven experience with High-Performance Computing (HPC) environments, including training or running inference on large-scale GPU clusters (tens to hundreds of GPUs).
Deep familiarity and experience with popular LLM inference systems (e.g., vLLM, TensorRT-LLM).
Experience from a world-class industrial research group or a top-tier institution.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8369931
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 15 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We have improved AI infrastructure by merging GPU virtualization with Kubernetes-native tech to power innovative AI factories. We aim to speed up enterprise AI projects with smart orchestration, and scalability for AI workloads. Seeking a skilled Senior Software Engineer for our Infrastructure Group to innovate AI technology. The Infrastructure Group is tasked with composing and evolving the core systems responsible for thousands of GPUs and nodes driving enterprise AI. We invent the foundation that facilitates elastic, secure, and observable AI operations at extensive scale. We are seeking engineers who are passionate about distributed systems, modern cloud-native infrastructure, and AI performance optimization.

What youll be doing:

Crafting and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.

Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.

Collaborating closely with backend engineers, product managers, and other partners to deliver impactful solutions.

Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.

Conducting code and build reviews to uphold high-quality standards and mentor team members.

Leading the development and implementation of advanced identity management systems that secure our innovative AI and GPU cloud.

Developing scalable multi-tenant solutions that allow our diverse clientele to harness the power of our platforms securely and efficiently.

Collaborating with multi-functional teams to integrate identity and access management features seamlessly into our products, from cloud services to edge computing devices.
Requirements:
What we need to see:

B.Sc. in Computer Science or a related field (or equivalent experience).

5+ years of experience

Experience in backend software development, including system design and architecture.

Proficiency in at least one backend programming language (Go preferred).

Strong knowledge in microservices architecture, RESTful APIs, and relational databases.

Proficient knowledge of security guidelines and experience applying them in large-scale systems.

Expertise in implementing OAuth, OIDC, SAML, and other modern authentication protocols - Advantage

Ways to stand out from the crowd:

Expertise in Kubernetes internals and advanced cloud-native technologies.

Experience working in Linux environments with knowledge of networking, security, and virtualization.

Contributions to open-source projects or active participation in tech communities.

Agile approach and familiarity with standard methodologies.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8418975
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
09/11/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
If you're looking for an exciting opportunity to make a significant impact and grow with a passionate team, we are the place to be.
What You Will Do:
Be a core member of a cross-functional AI innovation team, collaborating closely with software engineers, MLOps specialists, and analysts.
Design, build, and optimize end-to-end AI-driven features - from experimentation and model training to validation and deployment.
Develop production-grade code (Python and C#) to integrate advanced models and data-driven capabilities into our core products.
Continuously evaluate model performance and data quality in real-world settings, proactively driving improvements, retraining, and robust monitoring
Requirements:
B.Sc. in Computer Science, Mathematics, or similar.
4+ years of hands-on experience in applied data science or machine learning roles.
Experience with building solutions based on agents, LLMs, generative AI, and advanced AI/ML frameworks.
Proficient in writing clean, maintainable, and efficient Python code.
Optional (preferred):
Experience with additional languages such as Java or C#.
M.Sc. in Computer Science, Mathematics, or similar.
Independent, proactive, and agile.
Skilled at problem-solving and creative solution-finding.
Strong teamwork and communication skills; thrive in highly collaborative, diverse teams.
Innovative and committed to continuous learning and staying up-to-date with the latest technologies.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8405133
סגור
שירות זה פתוח ללקוחות VIP בלבד