דרושים » חשמל ואלקטרוניקה » Senior Advanced Development Engineer, GPU Networking

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: More than one
Job Type: Full Time
As a Senior Software Engineer in the GPU Networking Architecture team, youll lead advanced development and POC efforts for AI infrastructure solutions. You'll be collaborating with experts across distributed AI, deep learning, networking OS, virtualization, storage, and more. Come and join us for the cutting edge in GPU Networking!

What youll be doing:
Drive advanced development and POCs for AI infrastructure solutions in cutting-edge AI networks, integrating innovative software and hardware.
Demonstrate team architectural concepts with larger NVIDIA AI software stacks.
Work closely with various groups within NVIDIA to bring AI network technologies to reality, including GPU and Switch HW and SW teams, Product as well as fellow architects.
Requirements:
What we need to see:
Hold a B.Sc., M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a reputable university (or equivalent experience).
Experienced in virtualization, networking and storage.
Proficient in C/C++ over Linux OS Development.
8+ years of proven experience as a software engineer.
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

Ways to stand out from the crowd:
Knowledge in Deep Learning frameworks and AI communication libraries (NCCL, UCX, MPI and equivalents).
Experience in Kubernetes.
Stellar communication skills.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496609
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 7 שעות
Location: More than one
Job Type: Full Time
As a Senior Software Architect in the GPU Networking Architecture team, you will define Software Defined Networking architectural solutions. You'll also be part of a team of specialists who span across numerous technological fields related to the modern data center, such as distributed AI and deep learning systems, Networking Operating Systems, Virtualization, Storage, and more.

What youll be doing:
Define system and software architecture for Software Defined Networking (SDN) of ground breaking emerging AI networks which involves innovative software and hardware.
Be an active member in setting the use-cases and metrics for Monitoring Complex High-speed Networks Control-plane.
Work closely with various groups within us to bring AI network technologies to reality, including GPU and Switch HW and SW teams, Product as well as fellow architects.
Requirements:
What we need to see:
Hold a B.Sc., M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering (or equivalent experience).
8+ years of proven experience as a software architect.
Proven Networking experience.
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

Ways to stand out from the crowd:
SDN definition/development experience.
InfiniBand hands-on experience.
Experience in Kubernetes.
Stellar communication skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496613
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: More than one
Job Type: Full Time
As a Senior Software Architect in the Accelerated Computing System and Software team, you will define Software Defined Networking (SDN) architectural solutions and be part of a team of specialists who span across numerous technological fields related to the modern data center, such as distributed AI and deep learning systems, High Performance Computing (HPC), Networking Operating Systems, Virtualization, Storage, and more.

What youll be doing:

Define system and software architecture for Software Defined Networking (SDN) of ground breaking emerging AI and HPC networks which involves innovative software and hardware.

Be an active member in setting the use-cases and metrics for Monitoring Complex High-speed Networks Control-plane.

Work closely with various groups within NVIDIA to bring AI and HPC network technologies to reality, including GPU and Switch HW and SW teams, Product as well as fellow architects.
Requirements:
What we need to see:

Hold a B.Sc., M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a reputable university (or equivalent experience).

8+ years of proven experience as a software architect.

Proven Networking experience

A teammate with a can-do attitude, high energy and excellent interpersonal skills.

Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.


Ways to stand out from the crowd:

SDN definition/development experience

InfiniBand hands-on experience

Experience in Kubernetes.

Stellar communication skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496601
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
21/12/2025
Location: More than one
Job Type: Full Time
As a Senior Software Architect in the Accelerated Computing System and Software team, you will define Software Defined Networking (SDN) architectural solutions and be part of a team of specialists who span across numerous technological fields related to the modern data center, such as distributed AI and deep learning systems, High Performance Computing (HPC), Networking Operating Systems, Virtualization, Storage, and more.
What youll be doing:
Define system and software architecture for Software Defined Networking (SDN) of ground breaking emerging AI and HPC networks which involves innovative software and hardware.
Be an active member in setting the use-cases and metrics for Monitoring Complex High-speed Networks Control-plane.
Work closely with various groups within our company to bring AI and HPC network technologies to reality, including GPU and Switch HW and SW teams, Product as well as fellow architects.
Requirements:
Hold a B.Sc., M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a reputable university (or equivalent experience).
8+ years of proven experience as a software architect.
Proven Networking experience
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
SDN definition/development experience
InfiniBand hands-on experience
Experience in Kubernetes.
Stellar communication skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465502
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
21/12/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
The company Networking Advanced Development Software team develops new groundbreaking technologies to enable new market shares for the company and tighten customer relationships. These are emerging technologies in networking and distributed computing for the booming AI factories and data centers. They span areas such as AI neural networks, Deep Learning, High Performance Computing (HPC), Storage, Cloud, SW Defined Network, Network Function Virtualization, 5G NR and more. We develop the solutions top-down, all the way from application behavioral analysis, to architecture definition and down to the implementation, using the world-leading company devices. The development traverses any needed component - application SW, middleware SW, OS kernel subsystems, device drivers, embedded SW (Firmware) and CUDA GPU. We collaborate with partners and key customers in the analysis processes and engage with open source communities introducing our leading features.
What youll be doing:
Design and implement solutions throughout all layers from high level application, OS and driver subsystem to firmware
Work on impactful projects involving state-of-the-art high-performance computing hardware and software
Provide insight and technical guidance and collaborate with peers from across the company - including software architecture, chip architecture, and engineering departments to improve our future technology
Collaborate with our company partners and customers.
Requirements:
B.Sc. in Computer Science, Electrical Engineering, Computer Engineering, or a related field
Understanding of multi core hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers, real-time
Programming skills
Ability to learn complex concepts in a fast pace environment.
A teammate with a can-do attitude, high energy and excellent interpersonal skills
Ways to stand out from a crowd:
Familiarity with networking protocols
Experience with open-source projects (coursework, personal, or contributions)
Working in a fast-paced and dynamic environment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465199
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: More than one
Job Type: Full Time
We seek a highly motivated and experienced System Architect specializing in Data-Center, AI Fabric, and Ethernet Networking to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining end-to-end solutions, networking protocols and features, interworking with orchestration systems, and help address new business opportunities in exciting areas. Our Architects also represent us in open-source projects, technical conferences, and standard development organizations.

What you'll be doing:

Explore new technologies and end-to-end solutions for our Ethernet Networking Platforms.

Be familiar with data-center and AI fabric network topologies, AI/ML clusters operation and network usage, as well as with the Ethernet Switch platforms' design and characteristics.

Define robust architectures and technical requirements for network operating systems and end-to-end solution offering for AI/ML workloads' needs and highly performing network operations.

Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.

Collaboration with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.

Close work with product marketing, program managers, and account managers to ensure the successful execution of projects.

Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.

Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
What we need to see:

B.Sc., M.Sc. or Ph.D. in Computer Science, Computer Engineering, or Electrical Engineering.

15+ years of experience in embedded software development for networking products, including 7+ years functioning as a System and/or Networking Architect.

Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.

Highly experienced in system software design and networking fundamentals.

Excellent understanding of large-scale network behavior and the effect of distributed computing workloads on the network.

Demonstrated ability to maintain technical foresight, conducting deep research and development into new technologies to generate innovative ideas and functional applications.

Leadership skills and accountability, including of past projects.

Clear verbal and written communication with the ability to build consensus within a large organization.

Possess problem-solving and critical thinking skills.

Ability to operate in a highly dynamic environment.

Ways to stand out of the crowd:

Extensive knowledge in various Switch ASIC hardware and Software Development Kit (SDK).

Demonstrated ability to prototype ideas and demonstrate their value.

Applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496586
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/12/2025
Location: More than one
Job Type: Full Time
we are building state-of-the-art accelerated computing platforms that know no boundaries. Our technology is crucial for global innovators, scientists, researchers, and engineersempowering them to transform their boldest concepts into tangible outcomes. Our next-generation Infiniband, NVLink, and Ethernet systems will continue to be at the forefront of connecting and powering the world's most advanced AI clusters.
We seek a highly motivated and experienced Software Architect specializing in Ethernet Switch ASICs to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining switching software stacks and Linux kernel networking, and help address new business opportunities in exciting areas. Our Architects also represent our company in open-source projects, technical conferences, and standard development organizations.
What you'll be doing:
Explore networking technologies, features and protocols, hardware/software capabilities, open-source software and drivers for our Ethernet Switch ASICs and Networking platforms.
Be familiar with the Ethernet Switch ASIC hardware and software stacks, as well as with the Ethernet Switch platforms design and characteristics.
Define robust architectures and technical requirements for embedded software, meeting AI/ML workloads' needs and highly performing network operations.
Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.
Collaboration with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.
Close work with product marketing, program managers, and account managers to ensure the successful execution of projects.
Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.
Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
B.Sc. or M.Sc. in Computer Science, Computer Engineering, or Electrical Engineering.
8+ years of experience in embedded software development for networking products, including 5+ years functioning as a Software Architect responsible for significant modules.
Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.
Highly experienced in embedded software design and operating system fundamentals.
Proven track record of proactively researching and integrating emerging technologies to develop practical applications and innovative solutions.
Leadership skills and accountability, including of past projects.
Clear verbal and written communication with the ability to build consensus within a large organization.
Possess problem-solving and critical thinking skills.
Ability to operate in a highly dynamic environment.
Ways to stand out of the crowd:
Wide knowledge in Switch ASIC hardware and Software Development Kit (SDK).
Deep understanding of the Linux kernel and networking.
Demonstrated ability to prototype ideas and demonstrate their value.
Applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8467594
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: More than one
Job Type: Full Time
We seek a highly motivated and experienced Software Architect specializing in Ethernet Switch ASICs to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining switching software stacks and Linux kernel networking, and help address new business opportunities in exciting areas. Our Architects also represent us in open-source projects, technical conferences, and standard development organizations.

What you'll be doing:

Explore networking technologies, features and protocols, hardware/software capabilities, open-source software and drivers for our Ethernet Switch ASICs and Networking platforms.

Be familiar with the Ethernet Switch ASIC hardware and software stacks, as well as with the Ethernet Switch platforms design and characteristics.

Define robust architectures and technical requirements for embedded software, meeting AI/ML workloads' needs and highly performing network operations.

Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.

Collaboration with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.

Close work with product marketing, program managers, and account managers to ensure the successful execution of projects.

Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.

Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
What we need to see:

B.Sc. or M.Sc. in Computer Science, Computer Engineering, or Electrical Engineering.

8+ years of experience in embedded software development for networking products, including 5+ years functioning as a Software Architect responsible for significant modules.

Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.

Highly experienced in embedded software design and operating system fundamentals.

Proven track record of proactively researching and integrating emerging technologies to develop practical applications and innovative solutions.

Leadership skills and accountability, including of past projects.

Clear verbal and written communication with the ability to build consensus within a large organization.

Possess problem-solving and critical thinking skills.

Ability to operate in a highly dynamic environment.

Ways to stand out of the crowd:

Wide knowledge in Switch ASIC hardware and Software Development Kit (SDK).

Deep understanding of the Linux kernel and networking.

Demonstrated ability to prototype ideas and demonstrate their value.

Applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496596
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 10 שעות
חברה חסויה
Location: Yokne`am
Job Type: Full Time
We are looking for an AI Test Architect joining E2E Verification group to profile Innovative large scale Distributed training on our AI End-to-End solutions in a large scale supercomputing clusters. Provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated Computing and Deep Learning software and hardware platforms, with researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, Switch, HCA, CPU and GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms.

What youll be doing:

Profiling, benchmarking, and analyzing deep learning models to identify areas for optimization and improvement in terms of performance, efficiency, and accuracy, with a strong emphasis on networking aspects.

Collaborating closely with data scientists, researchers, development, automation teams to design and implement scalable training pipelines and frameworks that demonstrate large scale high -performance networking capabilities.

Staying up-to-date with the latest advancements in deep learning algorithms, architectures, our GPU technologies, and high-performance networking solutions.

Optimizing deep learning models for performance, memory usage, and power efficiency while maximizing high-performance networking features on our supercomputers.

Providing insights and recommendations based on the analysis of large-scale training results, specifically focusing on networking bottlenecks and optimizations, to improve model outcomes and achieve business objectives.

Collaborating with hardware engineers to guide the development and integration of efficient networking solutions for deep learning, including exploring network architecture optimizations and bringing to bear technologies such as RDMA or InfiniBand.
Requirements:
What we need to see:

B.Sc. in Computer Science, Software Engineering, or equivalent experience.

Strong understanding and practical experience with machine learning algorithms and techniques, with a specialization in deep learning and expertise in high-performance networking.

8+ years of overall experience, with CUDA programming for deep learning frameworks like TensorFlow, PyTorch, combined with expertise in networking libraries and protocols.

Ability to profile and optimize deep learning workflows, focusing on networking-related bottlenecks and optimizations, to improve overall performance and efficiency.

Exceptional analytical and problem-solving skill, with a keen attention to detail, particularly in identifying and resolving networking performance issues.

Excellent communication and collaboration skills, enabling effective teamwork and cooperation.

Familiarity with supercomputers, parallel computing, distributed systems, and high- performance networking technologies like RDMA or InfiniBand.

Ways to stand out from the crowd:

Demonstrated experience in successfully profiling and optimizing large-scale deep learning training on our supercomputers, with a significant focus on high-performance networking enhancements.

Experience with distributed deep learning, distributed training frameworks, or large-scale data pipelines enhanced by high-performance networking solutions.

Expertise in optimizing networking parameters, such as bandwidth, latency, or congestion control, for deep learning workloads.

Familiarity with our networking technologies, such as Mellanox InfiniBand, and their integration with deep learning workflows.

Strong understanding of high-performance networking protocols and standards and their application to deep learning.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496288
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 9 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a passionate and innovative Senior Software Engineer with expertise in enterprise and data center networking to join our Cumulus Linux team. Cumulus Linux is a leading open-networking operating system and a cornerstone of the AI Factory - the next-generation data center designed to power the training, fine-tuning, and deployment of AI models at scale. As part of the team, you will design and implement core features of Cumulus Linux that enable the worlds most advanced data centers. You will work closely with cross-functional architecture and design teams, shaping the future of our networking technologies while gaining hands-on experience across our hardware and software ecosystem - from advanced networking ASICs to large-scale distributed systems.

What you'll be doing:

Design, develop, integrate, and test data forwarding and routing features in our Cumulus Linux.

Enable our Cumulus Linux on next-generation ASICs.

Work collaboratively with team members, product managers, architects, QA, and other engineering teams to deliver high-quality solutions.

Innovate and rapidly develop proof-of-concept (POC) prototypes that can evolve into fully developed products or solutions.

Engage closely with customers to understand their challenges, use cases, and deployment strategies, and devise innovative solutions.
Requirements:
What we need to see:

BS or MS degree in Computer Engineering, Computer Science, or a related field.

Over 5 years of experience as a Software Engineer.

Excellent C programming skills on Linux.

Familiarity with forwarding and routing networking concepts.

Strong analytical skills and a deep understanding of data structures and algorithms.

Excellent communication skills.

Ways to stand out from the crowd:

Deep understanding of forwarding and routing architectures and protocols.

Strong expertise in Linux systems, with a focus on kernel-level networking.

Hands-on experience with merchant silicon platforms for high-performance switching and routing.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8496553
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
21/12/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We seek a highly motivated Network Performance Exploration Engineer to join our team of experts and help shape the foundational infrastructure for the AI revolution. Our next-generation networking systems are at the forefront of connecting and powering the world's most advanced AI clusters. As a key member of our architecture team, you will be responsible for exploring and identifying critical network optimization opportunities across our entire hardware and software stack, analyzing how system-level changes impact application-level performance.
What Youll Be Doing:
Explore and validate end-to-end application performance, defining comprehensive test plans and critical metrics to identify optimization opportunities in both hardware and software.
Establish and maintain a comprehensive database of benchmark results, tracking performance across releases to drive data-informed decisions.
Conduct deep-dive analysis into communication libraries (like NCCL), system software, and hardware configurations to investigate performance characteristics, validate architectural theories, and identify bottlenecks.
Provide critical performance data to correlate and enhance simulation tools, ensuring our models accurately predict real-world hardware behavior.
Analyze application-level traffic patterns (e.g., LLMs) on our advanced networking fabrics to identify hardware and software optimization opportunities and tune system parameters.
Lead Proof-of-Concept (POC) projects to prototype and evaluate potential hardware and software optimizations and their impact on application performance.
Requirements:
B.Sc. or M.Sc. degree in Computer Science, Computer Engineering, or Electrical Engineering, or equivalent experience.
5+ years of relevant industry or research experience in high-performance computing, computer architecture, or computer networks.
Hands-on programming skills in Python and/or C/C++ for system analysis, automation, and customizing benchmarks.
Excellent understanding of large-scale system behavior and the effect of distributed computing workloads on network and system performance.
Proven experience in performance analysis, benchmarking, and identifying system bottlenecks.
Exceptional analytical, problem-solving, and systems-thinking skills, with the ability to dive deep into complex software and hardware interactions.
Ability to thrive in a a fast-paced, dynamic environment and work concurrently with multiple cross-functional teams.
Ways To Stand Out From The Crowd:
Deep understanding of and hands-on experience with communication libraries such as NCCL, UCX, or MPI.
Direct experience debugging or modifying the source code of a major communication library.
Expertise in the architecture and system-level requirements of large-scale, distributed Deep Learning workloads (e.g., LLMs).
Expertise in high-performance network protocols (Ethernet, InfiniBand, RoCE) and interconnect technologies like NVLink.
Familiarity with the PyTorch ecosystem, especially for distributed workloads.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465097
סגור
שירות זה פתוח ללקוחות VIP בלבד