דרושים » הנדסה » Firmware Engineer GPU Networking

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Ra'anana and Tel Aviv-Yafo
Job Type: Full Time
We are seeking to hire an excellent Firmware Engineer to join the GPU networking (NVLink) FW group in Tel-Aviv / Raanana. You will have the opportunity to take part in designing, implementing, and driving major development firmware features, delivering the next-generation GPU networking products.

We drive the data growth of the worlds biggest companies. With dedicated engineers around the globe, the work environment is dynamic, exciting, and fast-paced. Are you ready for the challenge?

What you will be doing:

Design and implement new features in the core of our GPU firmware.

Collaborate with architecture and different software design teams throughout the software development lifecycle.

Work in Pre & Post-silicon development environments of next-generation GPU products.

Gain a deep understanding of system debugging, networking technology, and stacks, as well as the HW/FW/SW relationship.

Innovate! Bring our FW switching products to shine in customers view.

Collaborate with other R&D teams around the globe.
Requirements:
What we need to see:

2+ years of relevant professional experience.

BS or MS in Computer Science/ Computer Engineering / Electrical Engineering or equivalent experience.

Proficient knowledge of C.

Excellent understanding of embedded SW and real-time programming.

Strong analytical, creative, debugging and problem-solving skills.

Meticulous and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

Strong programming skills in Python.

An excellent teammate with good social skills.

Ways to stand out from the crowd:

Knowledge of network protocols.

Assembly programming and debugging hand-on experience.

Background in Linux internals.

Scripting abilities (Python, Bash).

Motivation to learn and constantly improve processes and tools.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203468
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are seeking an excellent Firmware Manager to join to GPU networking (NVLink) FW group in Tel-Aviv / Raanana. You will have the opportunity to lead development team responsible for the firmware of the next-generation networking products while being hands-on with development activities. We drive the data growth of the worlds biggest companies. With dedicated engineers around the globe, the work environment is dynamic, exciting, and fast-paced. Are you ready for the challenge?

What you will be doing:

Lead a team of engineers and provide technical guidance to the team of highly skilled engineers. Empower the team members to excel and increase team productivity.

Lead the design and implementation of new features in the core of our GPU Networking firmware.

Drive and facilitate the planning, scheduling, and execution of the project and activities of the team.

Collaborate with architecture and different software design teams as part of the software development lifecycle.

Work in pre and post-silicon development environments of next-generation GPU networking products.

Gain a deep understanding of networking technology, system debugging, and stacks, as well as the HW/FW/SW relationship.

Innovate! Bring our Networking products to shine in customers view.
Requirements:
What we need to see:

B.Sc. in Computer Science/ Computer Engineering / Electrical Engineering.

2+ years of managerial experience.

6+ overall years of relevant overall professional experience.

Proficient knowledge of C (Real-time).

In-depth understanding of firmware and real-time programming, working closely with HW.

Strong analytical, creative, debugging, and problem-solving skills.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

An excellent teammate with good social skills.

Strong programming skills in Python.

Ways to stand out from the crowd:

Knowledge of network protocols.

Experience with Agile/Scrum.

Background in Linux internals.

Experience in operating systems concepts like memory management, and user-space vs Kernel space.

Practical OOP hands-on design experience (Python / C++).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203488
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
03/06/2025
חברה חסויה
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are looking for an excellent Senior Firmware Engineer to join the Core group developing NVLink and Infiniband FW for Switches and GPU feature in Tel-Aviv / Raanana. As a Senior Engineer, you will be taking part in designing, implementing, and leading major firmware features, for next-generation networking & GPU products

The Firmware team develops cutting-edge networking features for AI workloads, Cloud, HPC and storage. We drive the data growth of the worlds biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced.

Are you ready for the challenge?

What you will be doing:

Design and implement new features in the core of our Switch firmware.

Collaborate with architecture and different software design teams as part of the software development lifecycle.

Work in pre and post-silicon development environments for next-generation networking products.

Gain a deep understanding of system debugging, networking technology, and stacks, as well as the HW/FW/SW relationship.

Innovate! Bring our FW switching products to shine in customers view.
Requirements:
What we need to see:

B.Sc. in Computer Science/ Computer Engineering / Electrical Engineering.

5+ years of relevant professional experience.

Proficient knowledge of C.

Strong analytical, creative, debugging and problem-solving skills.

Excellent understanding of embedded SW and real-time programming.

Strong programming skills in Python.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

An excellent teammate with good social skills.


Ways to stand out from the crowd:

Knowledge of network protocols.

Background in Linux internals (Kernel & Userspace).

Agile experience.

Motivation to learn and constantly improve processes and tools.

Scripting abilities (Python, Bash).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8201444
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
03/06/2025
חברה חסויה
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are looking for an excellent Software Engineer to join the InfiniBand Switch and NVLink FW group in Tel-Aviv / Raanana.
As the team member, you will be part of a major development effort for the next-generation networking products. The verification team develops cutting-edge networking features for cloud, HPC and storage. We drive the data growth of the worlds biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced. Are you ready for the challenge?

What you will be doing:

As a Senior Software Engineer, you will part of a group of firmware verification engineers, using your expertise in C++ and Python in an Agile methodology.

Develop the verification process, aligning with quality standards and project timelines.

Design and implement new features in the core of our Switch and GPUs.

Drive and facilitate the planning, scheduling, and execution of the project and activities of the team.

Collaborate with architecture and different software design teams as part of the software development lifecycle.

Work in pre and post-silicon development environments of next-generation networking products.

Gain a deep understanding of networking technology, system debugging and stacks, as well as the HW/FW/SW relationship.

Innovate! Bring our FW switching products to shine in customers view.
Requirements:
What we need to see:

B.Sc degree or equivalent experience in Computer Science, Computer Engineering, or Electrical Engineering.

5+ years of relevant overall professional experience.

Proficient knowledge of C++, Python.

Experience with Embedded SW development.

Strong analytical, creative, debugging, and problem-solving skills.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

An excellent teammate with good interpersonal skills.


Ways to stand out from the crowd:

Knowledge of network protocols.

Background with CI methodology & tools (Git, Gerrit, Jenkins etc.).

Background in Linux internals.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8201438
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
03/06/2025
חברה חסויה
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are looking for an excellent SW Engineer to join the InfiniBand Switch and NVLink Core FW group in Tel-Aviv / Raanana. As the team member, you will be part of a major development effort for the next-generation networking products. The verification team develops cutting-edge networking features for cloud, HPC and storage. We drive the data growth of the worlds biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced. Are you ready for the challenge?

What you will be doing:

As a SW Engineer, you will part of a group of firmware verification engineers, using your expertise in C++ and Python in an Agile methodology.

Develop the verification process, aligning with quality standards and project timelines.

Design and implement new features in the core of our Switch and GPUs.

Drive and facilitate the planning, scheduling, and execution of the project and activities of the team.

Collaborate with architecture and different software design teams as part of the software development lifecycle.

Work in pre and post-silicon development environments of next-generation networking products.

Gain a deep understanding of networking technology, system debugging and stacks, as well as the HW/FW/SW relationship.

Innovate! Bring our FW switching products to shine in customers view.
Requirements:
What we need to see:

B.Sc degree or equivalent experience in Computer Science, Computer Engineering, or Electrical Engineering.

3+ years of relevant overall professional experience.

Proficient knowledge of C++, Python.

Experience with Embedded SW development.

Strong analytical, creative, debugging, and problem-solving skills.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

An excellent teammate with good interpersonal skills.


Ways to stand out from the crowd:

Knowledge of network protocols.

Background with CI methodology & tools (Git, Gerrit, Jenkins etc.).

Background in Linux internals.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8201420
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
28/05/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an excellent Firmware Design Engineer for NVIDIA FW PHY Group. The person will closely work with NVIDIA FW development, architecture, chip design teams and gain deep understanding of NVIDIA's Networking products and technologies.

We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous engineer who loves a challenge? Are you ready to become the engineer you always wanted to be? Come and be part of the best chip design team in the industry!

What youll be doing:

Enabling new SerDes and Optical Technologies.

Implement FW functionality in signal processing units of NVIDIA products.

Work closely with the architecture, HW, and SW design teams.

Define implement and maintain FW algorithm to control the Silicon.

Develop and test FW on emulation & simulation environments during the Pre-silicon phase.

Debug and screen HW/FW/SW issues.

Take an active part in silicon bring-up and SW development phases.

Lead data-driven discussions about the product functionality and areas for improvement.
Requirements:
What we need to see:

B.Sc. or M.Sc. in Electrical or Computer Engineering.

5+ years of relevant experience.

Proficient programming in C.

Debugging experience and ability to investigate and triage difficult problems in embedded FW.

Experience with developing the physical layer of communication protocols.

Good communication skills and the ability to work with people across several countries.

Excellent English verbal and written communication skills.

Good understanding of analog, mixed-signal circuit operation.


Ways to stand out from the crowd:

Proficient in Python and MatLab.

Good understanding of SerDes operation.

Knowledgeable of Hardware/Software Development Process.

Strong collaborative and interpersonal skills, with an ability to successfully guide and influence.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8196517
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
28/05/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are seeking a highly skilled and expert Verification Lead to join our PCIe Firmware team. In this role, you will be responsible for supervising the verification plans, monitoring their execution within the teams, guiding engineers through the verification implementation, and tracking coverage, Coverity, and other relevant statistics. You will be joining a team dedicated to developing groundbreaking technology and building the core technology of next-generation devices across various fields, including low-level C layer between hardware and firmware, C++ verification environment, automation challenges, and Python testing environment.

What youll be doing:

Develop and lead verification plans, ensuring they are driven effectively within the teams.

Provide guidance to engineers on verification implementation within the verification environment.

Monitor coverage, Coverity, and other statistics to ensure comprehensive verification and optimization of next-generation NVIDIA products.

Work closely with firmware design, chip design, software, and architecture teams to define and craft both legacy and new low-level firmware flows.

Enhance methodologies and automated processes to improve efficiency and effectiveness.
Requirements:
What we need to see:

B.Sc. in Electrical Engineering, Computer Science, or Computer Engineering, or equivalent experience.

12+ years of relevant experience.

Knowledge of object-oriented programming, computer structure, operating systems, and familiarity with Python or Bash is advantageous.

Problem-solving skills, independence, curiosity, strong interpersonal skills, and self-learning ability. Multi-disciplinary capabilities and the ability to work with a wide interface of people are crucial.

Ways to stand out from the crowd:

Familiarity with hardware verification concepts and tools such as C++, Jenkins automation, hardware familiarity, and test-driven development (TDD).

Experience in partnering with software and architecture teams to define and implement firmware.

Knowledge of PCIe, networking, Linux, and scripting languages, along with experience in solving in-depth problems.

Knowledge of object-oriented programming, computer structure, operating systems, and familiarity with Python or Bash is advantageous.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8196206
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU -- our invention -- serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions that were once considered science fiction, including artificial intelligence to autonomous cars. Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable communications software.

DL and HPC applications have a huge compute demands and already run at scales of up to tens of thousands of GPUs. GPUs are connected with high-speed interconnects (e.g. NVLink, PCIe) within a node and with high-speed networking (e.g. InfiniBand, Ethernet) across nodes. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the state-of-the-art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize our vision?

What you will be doing:

Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.

Design and implement new communication technologies to accelerate AI and HPC workloads.

Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.

Build proofs-of-concept, conduct experiments, and perform quantitive modeling to evaluate and drive new innovations.

Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs).
Requirements:
What we need to see:

M.S./Ph.D. degree in CS/CE or equivalent experience.

5+ years of relevant experience.

Excellent C/C++ programming and debugging skills.

Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).

Deep understanding of operating systems, computer and system architecture.

Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.

Strong experience with Linux.

Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

Ways to stand out from the crowd:

Expertise in related technology and passion for what you do. Experience with CUDA programming and our GPUs. Knowledge of high-performance networks like InfiniBand, RoCE, NVLink, etc.

Experience with Deep Learning Frameworks such PyTorch, TensorFlow, etc. Knowledge of deep learning parallelisms and mapping to the communication subsystem. Experience with HPC applications.

Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203578
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
28/05/2025
Location: More than one
Job Type: Full Time
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML / AI computing. Our next-generation Infiniband and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used at almost every industry today, such as car and Pharmaceutical. As a high-performance system architect, you will have the opportunity to work on some of the most cutting-edge technology and help to drive the innovation of our next generation networks that will be used by top researchers and engineers around the world.

What youll be doing:

Define the Infiniband and NVL system architecture end-to-end, by internal requirements and customers requirements through all product life cycles (post/pre silicon, on deployments).

research of various solutions to enable the next large-scale-high-performance computing clusters. The position spans over various layers from algorithms, software, firmware, and HW.

The architect should have experience in developing models for simulations, analyzing simulation results and development of optimization algorithms.

Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements:
What we need to see:

B.Sc, M.Sc, or Ph. D degree in Computer Science, Computer Engineer, or Electrical Engineer.

At least 5 years of industry or research experience in computer networks.

Excellent understanding of large-scale networks behaviour and the effect of distributed computing workloads effect on the network.

Experience in development of simulation environments.

Possess strong managerial, problem solving and critical thinking skills.

Ability to work and operate in a highly dynamic environment.

Partner with multiple groups in the organization.

Ways to stand out from the crowd:

Good knowledge in network protocols - such as InfiniBand, IP, TCP and RoCE and network topologies.

Good knowledge in Python, C++.

Familiarity with HPC environments, routing algorithms, Omnet++ and NS3 simulation environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8196190
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
28/05/2025
Location: More than one
Job Type: Full Time
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML / AI computing. Our next-generation Infiniband and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used at almost every industry today, such as car and Pharmaceutical. As a high-performance system architect, you will have the opportunity to work on some of the most cutting-edge technology and help to drive the innovation of our next generation networks that will be used by top researchers and engineers around the world.

What youll be doing:

Define the Infiniband and NVL system architecture end-to-end, by internal requirements and customers requirements through all product life cycles (post/pre silicon, on deployments).

research of various solutions to enable the next large-scale-high-performance computing clusters. The position spans over various layers from algorithms, software, firmware, and HW.

The architect should have experience in developing models for simulations, analyzing simulation results and development of optimization algorithms.

Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements:
What we need to see:

B.Sc, M.Sc, or Ph. D degree in Computer Science, Computer Engineer, or Electrical Engineer.

At least 5 years of industry or research experience in computer networks.

Excellent understanding of large-scale networks behaviour and the effect of distributed computing workloads effect on the network.

Experience in development of simulation environments.

Possess strong managerial, problem solving and critical thinking skills.

Ability to work and operate in a highly dynamic environment.

Partner with multiple groups in the organization.

Ways to stand out from the crowd:

Good knowledge in network protocols - such as InfiniBand, IP, TCP and RoCE and network topologies.

Good knowledge in Python, C++.

Familiarity with HPC environments, routing algorithms, Omnet++ and NS3 simulation environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8196183
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (Infiniband, RoCE, Ethernet). This is an outstanding opportunity to get an end to end understanding of the AI networking stack. Are you ready for to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Engage with our partners and customers to root cause functional and performance issues reported with NCCL.

Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters.

Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.).

Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters.

Document and conduct trainings/webinars for NCCL.

Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
What we need to see:

B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.

Experience working with engineering or academic research community supporting HPC or AI.

Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control.

Expert in Linux fundamentals and a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible).

Adaptability and passion to learn new areas and tools.

Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp for large clusters. Experience debugging network configuration issues in large scale deployments.

Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning Frameworks such PyTorch, TensorFlow.

Deep understanding of technology and passionate about what you do.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203558
סגור
שירות זה פתוח ללקוחות VIP בלבד