AI Framework Software Engineer, Performance Optimizations

לפני 8 שעות

AI Framework Software Engineer, Performance Optimizations

חברה חסויה

Location: Haifa and Hod Hasharon

Job Type: Full Time

Our team at the Huawei Computing Network Innovation Lab is looking for exceptional talent to join us and lead the development of next generation data centers. We create cutting-edge technologies that synergize software and hardware in tandem to accelerate compute, storage and networking at large-scale. We aim to drive innovations and deliver software defined infrastructure and algorithms to HPC, AI/ML, and Big Data applications.
We are looking for outstanding candidates with hands-on experience in development and optimization of AI frameworks. If you are a team player with excellent communication skills and motivation to revolutionize application performance, youre welcome on board!
What will you be doing?
Work as part of an innovative research team to analyze, develop, test and deploy improvements that enhance Huaweis distributed AI framework.
Develop optimizations that leverage hardware accelerator capabilities, minimize communication overhead and improve training/inference throughput
Push the boundaries of the state of the art in LLM performance and efficiency, including model compression and quantization
Analyze, profile and optimize the latest LLM AI algorithms, and implement as production-quality software libraries for latency-critical use-cases on next-generation hardware.
Work in a distributed computing environment to optimize for both scale-up (multi-device) and scale-out (multi-node) systems
Utilize advanced concepts such as Uncertainty Quantification, Mixed Precision Computing and Model Sparsity to improve performance and enable training of very large AI models
Collaborate with partners from top universities, and open-source communities to conduct state-of-the-art research.

Requirements:
B.Sc. degree in computer science, computer engineering, or a closely related field
5+ years of experience in AI kernel and performance optimizations
Excellent C/C++ programming and software design skills, including debugging, performance analysis, and testing
Strong technical skills and experience with developing code in a Linux environment
Excellent teamwork and interpersonal skills
Ability to work independently, define project goals and scope, and lead your own development effort
Innovative thinking
Ways to stand out from the crowd:
M.Sc. or Ph.D. degree
Proven track record of conducting and publishing independent research
Experience in optimizing distributed deep learning pipelines with TensorFlow / PyTorch
Experience in analyzing workloads on large scale heterogeneous clusters
Hands-on experience in developing code to target heterogeneous architectures (e.g. CPU/GPU/TPU)
Experience in developing and contributing to large open-source libraries.

This position is open to all candidates.

Hide

עדכון קורות החיים לפני שליחה

8550305

שירות זה פתוח ללקוחות VIP בלבד

משרות דומות שיכולות לעניין אותך

דיווח על תוכן לא הולם או מפלה

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

לפני 7 שעות

Parallel Computing Tech Lead

חברה חסויה

Location: Hod Hasharon and Haifa

Job Type: Full Time

What will you be doing?
Envision, design and develop new features in all layers of the HPC / AI stack, from the application level, through the programming model (e.g. OpenMP, MPI) and down to the supporting libraries/middleware
Research, design and implement features for HPC / AI applications using parallel programming models (e.g. OpenMP, MPI), accelerators offloading and memory tiers.
Contribute to open-source scientific computing, networking and I/O libraries
Research, design and assist in developing hardware offloads for features relevant for scientific, deep learning, and data-intensive workloads.

Requirements:
What do we want to see?
B.Sc. degree in computer science, computer engineering, or a closely related field
Proficiency in one or more low-level programming languages: C / C++
3+ years of experience in object-oriented software development (OOD)
Ways to stand out from the crowd:
M.Sc. or Ph.D. degree
Hands-on experience in parallel programming or distributed application development (MPI / OpenMP / SHMEM)
Experience with code optimizations (I/O, data structure, communication patterns, vectorization) and profiling
Experience in development and utilization of innovative algorithms and data-structures to optimize code performance
Experience in development and deployment of AI models and, with string proficiency in Python programming
Experience in developing and contributing to large codebases.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8550322

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

לפני 7 שעות

Distributed Computing Software Engineer

חברה חסויה

Location: Haifa

Job Type: Full Time

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8550319

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

5 ימים

Performance Modeling Manager, Software Engineering, Cloud

חברה חסויה

Location: Tel Aviv-Yafo and Haifa

Job Type: Full Time

Be part of a team that pushes boundaries, developing custom silicon solutions that power the future of our company's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration.
As the Performance Modeling Manager, you will lead a team of high-performing engineers responsible for the software performance models that are used to guide the future of our companys custom silicon. You will bridge the gap between software architecture and hardware implementation, ensuring that our next-generation CPUs and NICs are optimized for our companys most critical workloads, such as Gemini, Search, and Cloud AI.
In this role, you will lead a team of high-performing engineers in the development of sophisticated performance models and oversee the conduct of deep-dive architectural analyses. By guiding the team to explore various architectural alternatives under different scenarios, you will ensure the identification of bottlenecks and provide data-driven guidance to architects on preferred hardware implementations. Your focus will be on leading the technical roadmap and career development while maintaining high-level architectural influence across both CPU and NIC modeling domains.
The AI and Infrastructure team is redefining whats possible. We empower our company customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers, our company Cloud customers, and billions of our company users worldwide.
We're the driving channel behind our company's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for our company Cloud, our company Global Networking, Data Center operations, systems research, and much more.
Responsibilities
Build, lead, and mentor a team of performance modeling engineers, fostering a culture of technical excellence and innovation in the Tel Aviv and Haifa silicon space.
Define the goal for performance modeling tools and methodologies, ensuring they provide highly accurate projections for future architectural pivots.
Partner with Hardware Architects, Silicon Designers, and Software Workload teams to influence SoC specifications based on data-driven modeling results.
Drive the development of advanced SystemC performance simulators and infrastructure, ensuring scalability and correlation with post-silicon performance.
Serve as a subject matter expert in CPU/NIC micro-architecture, guiding the team through complex trade-off analyses (power, area, performance).

Requirements:
Minimum qualifications:
Bachelors degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent practical experience.
15 years of experience in people management, leading engineering teams in a hardware or system-software environment.
Experience in C++ performance modeling or micro-architectural simulator development.
Experience with Central Processing Unit (CPU) or Network Interface Card (NIC) accelerator architectures.
Preferred qualifications:
PhD in Electrical Engineering, Computer Engineering, or equivalent practical experience.
Experience delivering performance models for large-scale, high-performance silicon projects (from pre-silicon projection to post-silicon validation).
Ability to translate complex technical data into clear business recommendations for executive leadership.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8544166

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

לפני 7 שעות

Hands-On Software Engineer and AI Compilation Technology Expert

חברה חסויה

Location: Hod Hasharon

Job Type: Full Time

What will you be doing?
Complex static code analysis to determine possible bottlenecks and time-consuming operations within the code of AI model for inference
Architecture, design and implementation of compilation passes, compiling high-level languages to a unique HW
Take initiative to solve technical and business problems
Collaborate with other development and product teams in our company and in China to ensure the successful implementation and delivery of a solution.

Requirements:
What do we want to see?
B.Sc. in Computer Engineering / Computer Science or equivalent
At least 5 years experience in implementation and design of SW / SW+HW systems (mainly in C / C++)
Hands on experience with compilers design, architecture and implementation
At least 3 years experience using LLVM / MLIR
At least 3 years proven experience working with GPU instruction set architecture
At least 3 years proven experience using compilers for optimizing given AI models to run on GPU
System view, together with profound understanding of related technologies
Hands-on system design and PoC bring-up experience
Excellent communications skills and ability to work as part of an international team
Innovation, fast learning skills
Ways to stand out from the crowd:
M.Sc. or Ph.D. degree with expertise in fields related to compilation / static analysis / AI model optimizations
Experience in Triton compilation
Experience in working with Torch Inductor
Proven experience in optimizing applications performance
Proficiency in C++ programming language
Understanding in multiprocessing and multithreaded code.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8550313

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

25/01/2026

Machine Learning Engineer, Multimodal models (LLM/VLM)

חברה חסויה

Location: Haifa

Job Type: Full Time

We are looking for a deep learning algorithm developer to join our growing team under Algorithms Group.
The team is building an innovative multimodal learning framework aimed at improving autonomous driving performance by understanding long-tail cases and providing actionable navigation insights. Were combining state-of-the-art vision-language models to revolutionize how we train, validate, and scale our autonomous systems.
If youre passionate about deep learning and engineering high-impact autonomous solutions - this is the place for you.
What will your job look like:
Contribute to dataset curation activities - collecting, cleaning, labeling, and preparing multimodal data for training and validation.
Train and fine-tune LLMs, VLMs, and VLA models to interpret visual scenes and produce actionable navigation insights supporting autonomous vehicle decision-making.
Support validation of multimodal models - evaluating vision-language-action behavior and helping identify performance gaps across driving scenarios.
Collaborate closely with AV planners, perception teams, and infrastructure engineers to ensure seamless deployment in a real-time ecosystem.
Youll have the opportunity to influence the strategic direction of language-driven autonomy - proposing new ideas, shaping model capabilities, and driving innovation from research to real-world deployment.

Requirements:
M.Sc. in Deep Learning, Computer Vision, NLP, or a related field (Ph.D. an advantage).
Hands-on experience in developing deep learning models.
Strong programming skills in Python (additional C++ is an advantage).
Experience with modern DL frameworks (e.g., PyTorch, TensorFlow).
Experience with large multimodal or language models (LLMs/VLMs/VLA models) and their real-world integration - advantage.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8515967

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

27/01/2026

Senior Network Software Engineer , SRD Annapurna Labs

חברה חסויה

Location: Haifa

Job Type: Full Time

AWS Utility Computing (UC) provides product innovations - from foundational services such as our companys Simple Storage Service (S3) and our company Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWSs services and features apart in the industry. As a member of the UC organization, youll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS. Within AWS UC, our company Dedicated Cloud (ADC) roles engage with AWS customers who require specialized security solutions for their cloud services.
Annapurna Labs, as part of AWS, is looking for a Senior Network SW Engineer to join the SW group and take a major part in redefining the future of AWS cloud.
Were searching for engineers with a passion for networking to develop SRD (Scalable Reliable Datagram). SRD is a high-performance, low-latency transport protocol used within our company Web Services (AWS) infrastructure to optimize network performance. SRD powers several high-impact, cutting-edge products, including on-demand ML and HPC platforms leveraging EFA, next-generation storage services built on EBS, and the future of AWS network traffic.
Key job responsibilities
As a Senior Network Software Engineer on SRD team, your primary role will be to develop and optimize the implementation of SRD technology across AWS's network infrastructure. Your responsibilities will encompass designing, deploying, and maintaining SRD code-base, ensuring its reliability and scalability to accommodate the demands of various applications. An essential part of your role will involve conducting extensive network simulations to evaluate system performance under different conditions, enabling you to identify potential bottlenecks and inefficiencies. Using these simulations, you will troubleshoot network issues and implement robust solutions, leading to minimal data loss and latency. Leveraging your in-depth understanding of network protocols and AWS infrastructure, you'll refine and improve the SRD system's performance. Furthermore, you will be expected to mentor junior team members, leading projects to advance the SRD capabilities within the AWS environment. Staying updated with the latest industry trends and incorporating them into strategic network service planning.

Requirements:
Basic Qualifications
- Bachelors (or higher) Degree in Computer Science (CS), Electrical Engineering (EE) or related area.
- 8+/10+ years of programming with at least one software programming language experience
Preferred Qualifications
- Experience as a mentor, tech lead or leading an engineering team
- Experience leading the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
- Data-center Networking
- Network related simulators
- Large-scale distributed environments
- Storage and/or Transport protocols
- Real-Time development.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8520081

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

21/01/2026

Technical Expert - Multimodal GenAI (Vision-Language Models)

חברה חסויה

Location: Haifa

Job Type: Full Time

We are looking for an exceptional Technical Expert with proven experience in LLMs/VLMs/VLA models to join our growing team under Mobileyes Algorithms Group. The team is building an innovative multimodal learning framework aimed at improving autonomous driving performance by understanding long-tail cases and providing actionable navigation insights. Were combining state-of-the-art vision-language models to revolutionize how we train, validate, and scale our autonomous systems.
If youre passionate about deep learning and engineering high-impact autonomous solutions - this is the place for you.
What will your job look like:
Lead dataset curation strategy - designing scalable pipelines for multimodal data to drive high-quality training and validation.
Architect and optimize LLMs, VLMs, and VLA models - transforming scene understanding into reliable driving guidance.
Own the validation strategy - defining methodologies, metrics, and failure-analysis workflows for robust vision-language-action alignment.
Make an immediate impact by applying deep expertise to guide model design, methodology, and development priorities from day one.
Collaborate closely with AV planners, perception teams, and infrastructure engineers to ensure seamless deployment in a real-time ecosystem.
Influence the technical direction of language-driven autonomy - contributing experience, shaping capabilities, and driving innovation to production.

Requirements:
M.Sc. in Deep Learning, Computer Vision, NLP, or a related field (Ph.D. an advantage).
Proven experience with large multimodal or language models (LLMs/VLMs/VLA models) and their real-world integration.
At least 7 years of hands-on experience in developing deep learning models.
Strong programming skills in Python (additional C++ is an advantage).
Experience with modern DL frameworks (e.g., PyTorch, TensorFlow).

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8511782

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

28/01/2026

Senior Software Development Engineer, Annapurna Labs

חברה חסויה

Location: Haifa

Job Type: Full Time

Annapurna Labs, a key division within our company Web Services (AWS), seeks a Senior Software Development Engineer to design, develop, and optimize mission-critical embedded software for cloud infrastructure. You will join teams focused on networking, machine learning acceleration, and high-performance computing (HPC), impacting millions of AWS services globally.

Requirements:
- Experience as a mentor, tech lead or leading an engineering team
- Experience leading the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
- Bachelor's degree
- 8+ years of professional experience in embedded software development, with strong proficiency in C/C++
- Hands-on experience developing firmware, device drivers, or user-space applications for embedded systems, including low-level hardware interaction
Preferred Qualifications
- Expertise in networking protocols and performance optimization for high-throughput, low-latency systems
- Ability to work in cross-functional, agile teams and communicate technical concepts effectively to stakeholders
- Experience with AWS cloud infrastructure or other large-scale distributed systems.
- Knowledge of hardware/software co-design.
- Familiarity with storage protocols .

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8521496

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

22/01/2026

Senior Machine Learning Engineer, Multimodal models (LLM/VLM)

חברה חסויה

Location: Haifa

Job Type: Full Time

We are looking for a senior deep learning algorithm developer to join our growing team under Algorithms Group.
The team is building an innovative multimodal learning framework aimed at improving autonomous driving performance by understanding long-tail cases and providing actionable navigation insights.
Were combining state-of-the-art vision-language models to revolutionize how we train, validate, and scale our autonomous systems.
If youre passionate about deep learning and engineering high-impact autonomous solutions - this is the place for you.
What will your job look like:
Own dataset curation activities- acquiring, cleaning, labeling, and tailoring multimodal data to meet model training and validation requirements.
Train and fine-tune LLMs, VLMs, and VLA models to interpret visual scenes and produce actionable navigation insights supporting autonomous vehicle decision-making.
Lead robust validation of advanced multimodal models - ensuring reliable vision-language-action alignment and consistent performance across diverse real-world driving scenarios.
Collaborate closely with AV planners, perception teams, and infrastructure engineers to ensure seamless deployment in a real-time ecosystem.
Youll have the opportunity to influence the strategic direction of language-driven autonomy - proposing new ideas, shaping model capabilities, and driving innovation from research to real-world deployment.

Requirements:
M.Sc. in Deep Learning, Computer Vision, NLP, or a related field (Ph.D. an advantage).
At least 5 years of hands-on experience developing deep learning models, including a minimum of 2 years of industry experience in the field.
Strong programming skills in Python (additional C++ is an advantage).
Experience with modern DL frameworks (e.g., PyTorch, TensorFlow).
Experience with large multimodal or language models (LLMs/VLMs/VLA models) and their real-world integration - advantage.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8513542

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

5 ימים

CPU CAD Front-End Engineer, Cloud

חברה חסויה

Location: Tel Aviv-Yafo and Haifa

Job Type: Full Time

our company's custom-designed machines make up one of the largest and most powerful computing infrastructures in the world. The Hardware Testing Engineering team ensures that this cutting-edge equipment is reliable. In the R&D lab, you design test equipment for prototypes of our machinery and develop the protocols used to scale these tests for the entire global team. Working closely with design engineers, you give input on designs to improve our hardware until you're sure it meets our company's standards of quality and reliability.
The AI and Infrastructure team is redefining whats possible. We empower our company customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers, our company Cloud customers, and billions of our company users worldwide.
We're the driving force behind our company's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for our company Cloud, our company Global Networking, Data Center operations, systems research, and much more.
Responsibilities
Design, develop, and maintain CAD tools and scripts to automate and streamline design tasks, verification processes, and data analysis.
Administer and optimize the front-end compute environment, ensuring reliability, performance, and scalability.
Provide technical support and training to Design and Verification teams on the use of CAD tools, scripts, and the compute environment.
Identify opportunities to enhance front-end development workflows, implement improvements, and document best practices.
Work closely with design, verification, and CAD teams to understand their needs, gather requirements, and deliver effective solutions.

Requirements:
Minimum qualifications:
Bachelors degree or equivalent practical experience.
3 years of experience in coding or scripting languages (e.g., Python, TCL).
Experience with front-end design, verification, integration teams on tools development, maintenance, or support.
Preferred qualifications:
Experience in CPU or SoC design, debug, and verification flows.
Experience working with RTL teams and design integration methodologies that improve team productivity and velocity.
Experience with delivering chip design infrastructure or methodology including multi-HDL model builds and CI/CD systems.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8544143

שירות זה פתוח ללקוחות VIP בלבד