דרושים » הנדסה » Software Engineer, DOCA

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
חברה חסויה
Location: Yokne`am and Ra'anana
Job Type: Full Time
We are looking for a Senior Software Engineer. You will work with highly experienced engineers to provide the world's outstanding SmartNIC products for cloud-computing, research, medical, automotive, finance, weather, telco, and more.

What you'll be doing:

You will architect, design, and develop the next-generation technology in network acceleration, as well as work with best-in-class technical leaders in this domain.

Engage with customers and architects to understand the requirements and derive the software design accordingly.

Collaborate with other engineering teams that develop the upper layers applications like virtual switches (OVS, VPP, and etc.) and lower layers like driver, kernel, FW, and HW.
Requirements:
What we need to see:

B.Sc. (or equivalent experience) in computer science/software engineering.

3+ years confirmed experience of Programming C/C++.

3+ years confirmed experience in Linux environment and tools.

Experience with Networking Protocols.

Experience with virtualization technologies.

Strong analytical, debugging, and problem-solving skills.

Deep knowledge of computer architecture and operating systems.

Experience in performance optimizations.

Ways to stand out from the crowd:

Knowledge and experience in DPDK.

Knowledge and experience with designing SDKs.

Open Source Software Contributor to relevant projects (OvS, DPDK, Linux Kernel..).

A positive demeanor, a growth mindset, and excellent interactions with colleagues.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8583562
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Yokne`am
Job Type: Full Time
The DOCA team is seeking a highly motivated and experienced engineer with hands-on technical expertise to develop the SDK for the next-generation Data Processing Unit, with a wide range of features and embedded systems. This position offers the opportunity to have a real impact in a dynamic, technology-focused company, impacting data centers across the world. We're united in our quest to transform the way Smart Adapters are used.

What you'll be doing:

Collaborate with skilled engineers to design and implement the DOCA SDK and infrastructure.

Take charge of end-to-end development for various features within DOCA.

Design and develop SDK libraries focusing on networking, storage, and high-performance computing solutions.

Partner with major companies to address real-world challenges that demand innovative solutions and a customer-centric approach.
Requirements:
What we need to see:

B.Sc in Computer Engineering, Computer Science, Electrical Engineering, or equivalent practical experience.

8+ years of hands-on experience with the software development lifecycle, especially in performance-critical systems.

Strong programming skills in C/C++.

Familiarity with Linux environments and associated tools.

Strong analytical, debugging, and problem-solving skills.

Excellent communication skills and teamwork abilities. Self-motivated and well organized.

Experience with OOD/OOP.

Ways to stand out from the crowd:

Background in networking and low-level programming (C/C++).

Experience with performance optimizations.

Experience in RT embedded development or driver development within a Linux environment.

Technical leadership skills.

Confirmed experience in solving customer issues.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8583579
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
08/02/2026
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are looking for a passionate Software Engineer to join our Simulation development team. Our team is growing, and we are looking for hardworking and experienced engineers to take part in building advanced networking simulation solutions. You will be part of a team developing next generation Switch simulation, works closely with other SW R&D teams and SW Architects.

What you will be doing:

Design and develop advanced features simulating our world class NICs and Switches.

Develop solutions using advanced virtualization technology.

Write clean, efficient and maintainable code.

Collaborate with team members, SW R&D, Architects, Chip Design and FW.
Requirements:
What we need to see:

B.Sc. degree or equivalent experience in Computer Science / Software Engineering.

2+ years of experience.

Proficient knowledge and experience in C.

Strong design, coding, analytical, debugging and problem-solving skills.

Full ownership & end-to-end responsibility.

Excellent social and written communication skills.

Ways To Stand Out From The Crowd:

Can do attitude, independency and agility.

Ability to quickly adapt to new technology and go deep into new areas.

Understanding of Networking Protocols - Ethernet, InfiniBand is an advantage.

Knowledge of Virtualization, especially with KVM/QEMU is an advantage.

Knowledge of Linux/Windows kernel and drivers development is an advantage.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8536036
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 3 שעות
חברה חסויה
Location: Ra'anana
Job Type: Full Time
We are looking for an excellent Software Manager to join the Switch SDK Group and lead a team of powerful software engineers. You will join the SDK group and take our product to next level, working closely with various other design and architecture teams and gain a deep understanding of our products and technologies. We have been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. Its a unique legacy of innovation thats fueled by great technology-and amazing people.

Today, were tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing whats never been done before takes vision, innovation, and the worlds best talent. Youll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

What you'll be doing:

Leading and managing a dynamic team of engineers, ensuring smooth and efficient project execution.

Developing and implementing our Ethernet SDK with a focus on high-performance and reliability.

Collaborating with teams from other subject areas to identify technical requirements and successfully implement innovative solutions.

Following established guidelines and standards to ensure the delivery of high-quality products.

Continuously striving for excellence, improving team performance, and driving innovation.
Requirements:
What we need to see:

A bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.

8+ years of overall relevant experience.

Proven experience with over 4 years of managing engineering teams of 4 or more members.

Outstanding leadership skills and a proven record of effectively managing complex projects.

Strong knowledge and experience in Ethernet communication, network protocols and real time software.

Experience with C/C++ programming.

Excellent problem-solving abilities and a collaborative approach.

Strong analytical, creative, debugging, and problem-solving skills.

Ways to stand out from the crowd:

Python programming experience.

Linux internals experience.

Excellent understanding of embedded SW and real-time programming.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8584089
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
02/03/2026
Location: Yokne`am
Job Type: Full Time
The Networking Advanced Development Software team develops new groundbreaking technologies to enable new market shares for the company and tighten customer relationships. These are emerging technologies in networking and distributed computing for the booming AI factories and data centers. They span areas such as AI neural networks, Deep Learning, High Performance Computing (HPC), Storage, Cloud, SW Defined Network, Network Function Virtualization and more. We develop the solutions top-down, all the way from application behavioral analysis, to architecture definition and down to the implementation, using the world-leading our devices. The development traverses any needed component - application SW, middleware SW, OS kernel subsystems, device drivers, embedded SW (Firmware) and CUDA GPU. We collaborate with partners and key customers in the analysis processes and engage with open source communities introducing our leading features.

What youll be doing:

Design and implement solutions throughout all layers from high level application, OS and driver subsystem to firmware.

Work on impactful projects involving state-of-the-art high-performance computing hardware and software.

Provide insight and technical guidance and collaborate with peers from across the company - including software architecture, chip architecture, and engineering departments to improve our future technology.

Collaborate with our partners and customers.
Requirements:
What we need to see:

B.Sc. in Computer Science, Electrical Engineering, Computer Engineering, or a related field.

5+ overall years of industry experience in system programming or related fields.

Understanding of multi core hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers, real-time

Excellent programming skills.

Ability to learn complex concepts in a fast pace environment.

A teammate with a can-do attitude, high energy and excellent interpersonal skills.

Ways to stand out from a crowd:

Familiarity with networking protocols.

Hands-on experience with CUDA programming and GPU acceleration.

Hands-on experience with LLM serving frameworks.

Experience with open-source projects (coursework, personal, or contributions).

Working in a fast-paced and dynamic environment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8566056
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are looking for an excellent Software Engineer for the Switch SDK Group. You will join the SDK group and take our product to next level, working closely with various other design and architecture teams and gain a deep understanding of NVIDIAs products and technologies. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Its a unique legacy of innovation thats fueled by great technology-and amazing people.

What youll be doing:

Design, develop, optimize and maintain APIs, tools and libraries for Switching, Routing, Analytics, Telemetry and many other modules.

Collaborate with team members, Architects, QA teams, and customers (both external and internal).

Innovate & rapidly develop POC prototypes that can then be developed into full-fledged products/solutions.
Requirements:
What we need to see:

B.Sc. in Software Engineering / Computer Science / related field or equivalent work experience will be considered as well.

10+ years of experience as a Software Engineer, including experience with C programming.

Experience with Embedded/ RT Embedded systems.

Excellent C programming skills, with a keen eye for performance and writing optimized code.

Strong analytical skills, deep knowledge of algorithms and proficiency with data structures.

Excellent communication and documentation skills.

Ways to stand out from the crowd:

Previous experience with Ethernet Switching or Routing protocols.

Hands on Linux development, user-space and/or kernel-space.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8583591
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
11/02/2026
Location: Yokne`am
Job Type: Full Time
We are looking for a Senior networking test engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. We are widely considered to be one of the technology worlds most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What youll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8541388
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
02/03/2026
Location: Yokne`am
Job Type: Full Time
We are looking for a skilled Software Developer with strong hardware knowledge to join our diverse team passionate about developing low-level software and test infrastructure for our networking and Interconnect products. As a technical focal point, you will work at the intersection of hardware and software, taking ownership of driver development, hardware bring-up, and test system architecture. You will be instrumental in driving innovation by developing low-level software that directly controls hardware, debugging sophisticated hardware-software interactions, and creating robust testing solutions. If you're passionate about code development, automation, innovation, reading schematics, debugging hardware with software tools, and becoming a key technical contributor, we'd love to have you on board!


What You'll Be Doing:

Collaborate with multi-functional teams including hardware, electrical, thermal, and mechanical engineers to understand hardware specifications and develop corresponding test requirements.

Take the lead in debugging hardware-software integration issues using instruments, and software tools to pinpoint root causes.

Develop low-level drivers in C/C++ for hardware components and build DLL wrappers for integration with higher-level C# applications.

Review and interpret hardware schematics and datasheets to understand signal flows, timing requirements, and implement appropriate software control mechanisms.

Lead all aspects of hardware bring-up, validation, and deployment of test solutions to production environments, ensuring accurate integration.
Requirements:
What We Need to See:

Bachelor's or master's degree in electrical engineering, Computer Engineering, or Software Engineering with significant hardware/electronics coursework or hands-on experience.

5+ years of hands-on experience in hardware-software integration, low-level driver development, or firmware development with proven ability to lead technical projects.

Strong proficiency in one or more programming languages such as Python, Java, C#, with additional experience in C/C++ for low-level programming.

Demonstrated ability to read and interpret electrical schematics, block diagrams, and hardware datasheets.

Experience with hardware debugging tools such as oscilloscopes, logic analyzers, JTAG debuggers, or similar instruments.

Excellent problem-solving skills in developing software solutions for sophisticated hardware-software interactions. Ability to collaborate with hardware teams, demonstrating deep technical ownership of hardware-software integration projects.


Ways To Stand Out from the Crowd:

Hands-on experience with PCIe, I2C, SPI, UART, or other hardware communication protocols. Experience developing device drivers for Windows or Linux environments.

Background in embedded systems, microcontrollers, DSPs, FPGAs, or custom ASIC integration. Experience wrapping native C/C++ libraries into managed DLLs for .NET/C# applications.

Knowledge of hardware validation methodologies and experience with automated hardware test equipment. Previous work in networking hardware, high-speed interconnects, or semiconductor validation environments.

Strong ability to bridge communication between hardware and software teams, translating hardware requirements into software solutions.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8566021
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 3 שעות
Location: Ra'anana and Yokne`am
Job Type: Full Time
In this role, you will help shape the evolution of our DOCA Networking software stack - the accelerated infrastructure framework powering AI factories and cloud environments. You will drive software innovation from vision to real-world impact, influencing some of the most advanced computing systems in the world. As part of the DOCA Product Group, you will lead software strategy for ConnectX NIC and BlueField DPU platforms - key pillars of our data center and AI networking strategy - helping build the intelligent infrastructure of tomorrow.

What You'll Be Doing:

Lead the Product-strategy for DOCA networking stack and products across their life-cycle: from vision and inception, through detailed customer & ecosystem requirements, roadmap crafting, market introduction, growing into in-scale delivery, and product improvement cycles.

Orchestrate a unified technical strategy between AI product teams, engineering teams, and customers to advocate the use of DOCA libraries & microservices, and to develop new DOCA APIs and services for new deployments.

Drive multidisciplinary engineering and architecture teams to establish priorities and define precise, actionable requirements for breakthrough projects.

Forge strong partnerships with customers and ecosystem partners - actively listening to their technical needs, delivering expert guidance, and supporting successful, large-scale AI (and other) deployments that drive their strategic goals.

Create use cases and reference applications to demonstrate product value to technical and executive audiences.

Gather insights to define future products, including analysis of complementary and competitive products and customer feedback.
Requirements:
What We Need to See:

BSc/MSc in Computer Science, Communication Engineering, Software Engineering, or equivalent experience.

12+ years of experience in R&D, architecture, and program management, with primary focus on product management leadership in Data Center Networking with proven track record in defining and driving both inbound and outbound product strategy across complex technologies and cross-functional organizations.

MBA or similar experience, with a balance of technical and business knowledge.

Deeply versed in hardware-accelerated networking protocols (RDMA, ETH, and more), technologies (DPDK, OVS, and more), and full Product-solutions in Data Center and Cloud environments.

Strong ability to deliver complex, Linux-based networking software frameworks and SDKs specifically architected for cloud providers, hyperscalers, and large-scale enterprise deployments.

Translate global customer and business insights into high-impact networking solutions, bridging the gap between deep technical requirements and long-term strategic goals.

Proven experience driving vision into reality by navigating complex, global organizational matrices, using exceptional communication to align cross-functional engineering and product teams.

Highly motivated, fast learner, and a team-player.

Ways to Stand Out from the Crowd:

Strong background in Data-Centre clusters and topologies, AI-driven networking, storage, security, and orchestration techniques.

Hands-on experience with networking infrastructure and DPU architecture, programmable networking pipelines, and NVIDIA technologies (CUDA, embedded solutions), with deep platform ecosystem knowledge

Proven leadership in complex hardware/software systems development, Including: SW, embedded SW, and HW. Supplying complete solutions: from the networking infrastructure, through SDKs and services, and into the application-level.

Success in partnering with Tier-1 customers on networking and cloud infrastructure deployments.

Extensive product management experience in international organizations, with a focus on adaptability and cross-cultural collaboration as well as vast experience as a R&D manager, or engineering program manager.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8584108
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 9 שעות
Location: Ra'anana and Yokne`am
Job Type: Full Time
We are looking for an excellent Software Developer whos passionate about Networking, Encryption and Virtualization to join the Cloud networking R&D team. We are seeking an experienced engineer who is deeply technical, hands-on, and has a wide system view. You will work on NVIDIA innovative Virtual switch which use our DOCA SDK for accelerated packet processing using NVIDIA's superior ConnectX NICs and Bluefield DPUs. We are looking to grow our teams with the most resourceful people in the world. If you're creative and autonomous, we want to hear from you!

What youll be doing:

Design and implement new features to accelerate Network and Storage

Develop pioneering virtual switch which use our HW offload capabilities.

Driving a complete engineering process, including refining requirements, engineering design of data structures/algorithms, implementation, peer review, developer testing, and post-GA support.

Drive sophisticated technical issues to closure that may occur in the cross-team boundary.
Requirements:
What we need to see:

Bachelor's degree or equivalent experience in Computer Science/Engineering.

3+ years of proven experience in C Programming.

Strong background in networking.

Deep understanding of the system software stack, with a focus on software/hardware interaction, including platform firmware, device drivers, Linux kernel, and how user-space applications use system services to achieve for accelerated packet processing.

Knowledge of Network Protocols L2/L3/L4 - Ethernet / IP / TCP / UDP.

Ways to stand out from the crowd:

Background with SR-IOV, DPDK, RoCE technologies.

Hands-on experience with Open Virtual Switch (or an equivalent solution).

Experience with RDMA (InfiniBand or RoCE) fabrics.

Hands-on experience with HW offloads.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8583501
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 3 שעות
Location: Yokne`am
Job Type: Full Time
We are looking for a Senior networking test engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. NVIDIA is widely considered to be one of the technology worlds most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What youll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8584095
סגור
שירות זה פתוח ללקוחות VIP בלבד