Senior AI Networking System Architect

2 hours ago
Location: More than one
Job Type: Full Time
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML/AI computing. Our next-generation NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used in almost every industry today, such as automotive and pharmaceuticals. As a high-performance system architect, you will have the opportunity to work on cutting-edge technology and help drive the innovation of our next-generation networks, which will be used by top researchers and engineers around the world.

What you'll be doing:

Define the NVL system architecture end-to-end, based on internal and customer requirements, across all product life-cycle stages (pre-silicon, post-silicon, and deployment).

Research various solutions to enable the next large-scale, high-performance computing clusters. The position spans various layers, from algorithms and software to firmware and hardware.

Develop models for simulations and performance testing, analyze the results, and help develop future HW and SW.

Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements:
What we need to see:

B.Sc., M.Sc., or Ph.D. degree in Computer Science, Computer Engineering, or Electrical Engineering.

At least 5 years of industry or research experience in computer networks.

Excellent understanding of large-scale network behavior and the effect of distributed computing workloads on the network.

Experience in development of simulation environments.

Strong managerial, problem-solving, and critical thinking skills.

Ability to work and operate in a highly dynamic environment.

Ability to partner with multiple groups in the organization.

Ways to stand out from the crowd:

Strong understanding of network protocols (such as InfiniBand, IP, TCP, and RoCE) and network topologies.

Good knowledge of Python and C++.

Good knowledge of AI models.

Familiarity with HPC environments, routing algorithms, and the OMNeT++ and NS-3 simulation environments.
This position is open to all candidates.
 
8506728
Similar jobs that may interest you
22/12/2025
Location: More than one
Job Type: Full Time
We are building state-of-the-art accelerated computing platforms that know no boundaries. Our technology is crucial for global innovators, scientists, researchers, and engineers, empowering them to transform their boldest concepts into tangible outcomes. Our next-generation InfiniBand, NVLink, and Ethernet systems will continue to be at the forefront of connecting and powering the world's most advanced AI clusters.
We seek a highly motivated and experienced Software Architect specializing in Ethernet Switch ASICs to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining switching software stacks and Linux kernel networking, and help address new business opportunities in exciting areas. Our Architects also represent our company in open-source projects, technical conferences, and standard development organizations.
What you'll be doing:
Explore networking technologies, features and protocols, hardware/software capabilities, open-source software and drivers for our Ethernet Switch ASICs and Networking platforms.
Be familiar with the Ethernet Switch ASIC hardware and software stacks, as well as with the Ethernet Switch platforms' design and characteristics.
Define robust architectures and technical requirements for embedded software that meet the needs of AI/ML workloads and high-performance network operations.
Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.
Collaborate with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.
Work closely with product marketing, program managers, and account managers to ensure the successful execution of projects.
Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.
Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
B.Sc. or M.Sc. in Computer Science, Computer Engineering, or Electrical Engineering.
8+ years of experience in embedded software development for networking products, including 5+ years functioning as a Software Architect responsible for significant modules.
Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.
Highly experienced in embedded software design and operating system fundamentals.
Proven track record of proactively researching and integrating emerging technologies to develop practical applications and innovative solutions.
Leadership skills and accountability, demonstrated on past projects.
Clear verbal and written communication with the ability to build consensus within a large organization.
Possess problem-solving and critical thinking skills.
Ability to operate in a highly dynamic environment.
Ways to stand out from the crowd:
Broad knowledge of Switch ASIC hardware and Software Development Kits (SDKs).
Deep understanding of the Linux kernel and networking.
Demonstrated ability to prototype ideas and demonstrate their value.
Experience applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
8467594
7 days ago
Location: More than one
Job Type: Full Time
We seek a highly motivated and experienced System Architect specializing in Data-Center, AI Fabric, and Ethernet Networking to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining end-to-end solutions, networking protocols and features, interworking with orchestration systems, and help address new business opportunities in exciting areas. Our Architects also represent us in open-source projects, technical conferences, and standard development organizations.

What you'll be doing:

Explore new technologies and end-to-end solutions for our Ethernet Networking Platforms.

Be familiar with data-center and AI fabric network topologies, AI/ML clusters operation and network usage, as well as with the Ethernet Switch platforms' design and characteristics.

Define robust architectures and technical requirements for network operating systems and end-to-end solution offerings that meet the needs of AI/ML workloads and high-performance network operations.

Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.

Collaborate with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.

Work closely with product marketing, program managers, and account managers to ensure the successful execution of projects.

Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.

Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
What we need to see:

B.Sc., M.Sc. or Ph.D. in Computer Science, Computer Engineering, or Electrical Engineering.

15+ years of experience in embedded software development for networking products, including 7+ years functioning as a System and/or Networking Architect.

Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.

Highly experienced in system software design and networking fundamentals.

Excellent understanding of large-scale network behavior and the effect of distributed computing workloads on the network.

Demonstrated ability to maintain technical foresight, conducting deep research and development into new technologies to generate innovative ideas and functional applications.

Leadership skills and accountability, demonstrated on past projects.

Clear verbal and written communication with the ability to build consensus within a large organization.

Possess problem-solving and critical thinking skills.

Ability to operate in a highly dynamic environment.

Ways to stand out from the crowd:

Extensive knowledge of various Switch ASIC hardware and Software Development Kits (SDKs).

Demonstrated ability to prototype ideas and demonstrate their value.

Experience applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
8496586
21/12/2025
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We seek a highly motivated Network Performance Exploration Engineer to join our team of experts and help shape the foundational infrastructure for the AI revolution. Our next-generation networking systems are at the forefront of connecting and powering the world's most advanced AI clusters. As a key member of our architecture team, you will be responsible for exploring and identifying critical network optimization opportunities across our entire hardware and software stack, analyzing how system-level changes impact application-level performance.
What You'll Be Doing:
Explore and validate end-to-end application performance, defining comprehensive test plans and critical metrics to identify optimization opportunities in both hardware and software.
Establish and maintain a comprehensive database of benchmark results, tracking performance across releases to drive data-informed decisions.
Conduct deep-dive analysis into communication libraries (like NCCL), system software, and hardware configurations to investigate performance characteristics, validate architectural theories, and identify bottlenecks.
Provide critical performance data to correlate and enhance simulation tools, ensuring our models accurately predict real-world hardware behavior.
Analyze application-level traffic patterns (e.g., LLMs) on our advanced networking fabrics to identify hardware and software optimization opportunities and tune system parameters.
Lead Proof-of-Concept (POC) projects to prototype and evaluate potential hardware and software optimizations and their impact on application performance.
Requirements:
B.Sc. or M.Sc. degree in Computer Science, Computer Engineering, or Electrical Engineering, or equivalent experience.
5+ years of relevant industry or research experience in high-performance computing, computer architecture, or computer networks.
Hands-on programming skills in Python and/or C/C++ for system analysis, automation, and customizing benchmarks.
Excellent understanding of large-scale system behavior and the effect of distributed computing workloads on network and system performance.
Proven experience in performance analysis, benchmarking, and identifying system bottlenecks.
Exceptional analytical, problem-solving, and systems-thinking skills, with the ability to dive deep into complex software and hardware interactions.
Ability to thrive in a fast-paced, dynamic environment and work concurrently with multiple cross-functional teams.
Ways To Stand Out From The Crowd:
Deep understanding of and hands-on experience with communication libraries such as NCCL, UCX, or MPI.
Direct experience debugging or modifying the source code of a major communication library.
Expertise in the architecture and system-level requirements of large-scale, distributed Deep Learning workloads (e.g., LLMs).
Expertise in high-performance network protocols (Ethernet, InfiniBand, RoCE) and interconnect technologies like NVLink.
Familiarity with the PyTorch ecosystem, especially for distributed workloads.
This position is open to all candidates.
 
8465097
21/12/2025
Confidential company
Location: More than one
Job Type: Full Time
We are seeking a highly motivated SoC Architect to join our team and define the next generation of our company's high-performance networking SoCs. Our Ethernet and NVL switch silicon powers the world's most advanced AI compute clusters - from hyperscale GPU systems used to train and run inference on massive foundation models, to the AI factories shaping the future of computing.
As an SoC Architect at our company, you will drive end-to-end SoC definition, connecting system-level requirements with chip-level implementation across multiple domains. You will work closely with cross-functional teams to craft scalable, power-efficient, and feature-rich SoCs that enable the next leap in networking and AI infrastructure.
What You'll Be Doing:
Lead SoC architecture across multiple teams and disciplines - including firmware, security, debug, power management, and peripheral/IP owners - ensuring holistic architectural alignment and system coherence.
Ensure next-generation architectures meet the requirements and constraints of all stakeholder teams, and drive clear specification and communication of those requirements.
Architect and analyze multi-chip solutions, including die-to-die connectivity, chip partitioning, package/board constraints, system requirements, chip fabric, PCIe subsystem and how SoC subsystems must support them.
Define top-level SoC structure: subsystem partitioning, interconnect, memory subsystem, coherency, clocking, power architecture, and system integration.
Define system flows: power-up sequences, boot sequences, and software updates.
Own the SoC architecture specification and guide it throughout the entire product lifecycle - concept, modeling, implementation, and silicon bring-up.
Perform trade-off analyses across performance, area, power, and feature complexity to drive architectural decisions.
Collaborate deeply with chip architects, logic design, verification, physical design, firmware, and system software to ensure seamless integration of all SoC components.
Contribute to innovation and long-term architectural direction, including patent development.
Requirements:
BSc or MSc in Electrical Engineering, Computer Engineering, or related field
6+ years of experience in SoC or chip architecture, microarchitecture, or complex ASIC design
Strong understanding of SoC fundamentals - interconnects, memory systems, coherency, clock/power architecture, security, and HW/SW integration
Ability to work across hardware, firmware, and system software boundaries with strong system-level reasoning
Hands-on experience writing and owning architecture specifications
Proven ability to collaborate across many teams and drive alignment in complex technical environments
Ways to Stand Out from the Crowd:
Expertise in networking, switch silicon, high-speed IO, or data-path acceleration
Experience defining multi-chip or disaggregated architectures (e.g., chiplets, advanced packaging, die-to-die protocols)
Experience with fabric and memory subsystems.
Strong background in system modeling, performance analysis, or traffic simulation
Experience with security architecture, power management, or debug infrastructure.
This position is open to all candidates.
 
8465552
7 days ago
Location: More than one
Job Type: Full Time
We seek a highly motivated and experienced Software Architect specializing in Ethernet Switch ASICs to join our team of experts and help shape the future of high-performance ML/AI computing. You will have the opportunity to work on some of the most pioneering technologies and help drive the innovation of our next-generation networks. You will play a key role in defining switching software stacks and Linux kernel networking, and help address new business opportunities in exciting areas. Our Architects also represent us in open-source projects, technical conferences, and standard development organizations.

What you'll be doing:

Explore networking technologies, features and protocols, hardware/software capabilities, open-source software and drivers for our Ethernet Switch ASICs and Networking platforms.

Be familiar with the Ethernet Switch ASIC hardware and software stacks, as well as with the Ethernet Switch platforms' design and characteristics.

Define robust architectures and technical requirements for embedded software that meet the needs of AI/ML workloads and high-performance network operations.

Lead the work with R&D and Validation teams, providing technical guidelines and close support and thorough reviews for detailed designs and test plans.

Collaborate with architects across various fields, including Chip Design, Firmware, Hardware Platforms, and System teams.

Work closely with product marketing, program managers, and account managers to ensure the successful execution of projects.

Support engagements with key customers, issue patents, publish white papers and blogs, and be proactive in technical forums and industry working groups.

Promote innovation through the design and implementation of Proof-of-Concept (PoC).
Requirements:
What we need to see:

B.Sc. or M.Sc. in Computer Science, Computer Engineering, or Electrical Engineering.

8+ years of experience in embedded software development for networking products, including 5+ years functioning as a Software Architect responsible for significant modules.

Expert-level knowledge in Ethernet/IP technologies, network topologies, and networking features in data center, telco and/or edge networks.

Highly experienced in embedded software design and operating system fundamentals.

Proven track record of proactively researching and integrating emerging technologies to develop practical applications and innovative solutions.

Leadership skills and accountability, demonstrated on past projects.

Clear verbal and written communication with the ability to build consensus within a large organization.

Possess problem-solving and critical thinking skills.

Ability to operate in a highly dynamic environment.

Ways to stand out from the crowd:

Broad knowledge of Switch ASIC hardware and Software Development Kits (SDKs).

Deep understanding of the Linux kernel and networking.

Demonstrated ability to prototype ideas and demonstrate their value.

Experience applying ML/AI methods to solve networking problems.
This position is open to all candidates.
 
8496596
08/01/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
We are searching for a strong technical leader to own the backbone of our Networking Research capabilities. We are looking for an Engineering Manager to lead the development of our high-fidelity Network Simulation platform and the extensive on-premise infrastructure that powers it.

In this role, you will lead a team of performance simulation software engineers and DevOps/Infrastructure specialists. You will own the "Simulation-as-a-Service" product, a critical platform used by internal researchers to model next-generation data center architectures. Your mission is to ensure our simulations are accurate, performant, and accessible, while managing the large-scale compute clusters required to run them.

What you'll be doing:

Team Leadership: Manage and mentor a team of C++ software engineers and DevOps infrastructure engineers, fostering a culture of performance, reliability, and code quality.

Product Ownership (Sim-as-a-Service): Treat the internal simulation platform as a product. Work with research partners to define the roadmap, prioritize features, and ensure high availability for users.

High-Performance Simulation: Be responsible for the architecture and optimization of complex network simulation engines (C++ based), ensuring they can scale to model extensive data center topologies with high fidelity.

Infrastructure Management: Own the lifecycle of our on-premise compute clusters and servers. Drive decisions on hardware upgrades, prioritisation, and managing system resources.

DevOps & Automation: Lead the strategy for CI/CD pipelines, automated testing, and containerized deployments to ensure rapid iteration and stability of the simulation platform.

Cross-Functional Collaboration: Partner with the AI Agents team to expose simulation APIs, enabling agents to run experiments and gather data autonomously.
Requirements:
What we need to see:

MSc, Ph.D. or equivalent experience in Computer Science, Electrical Engineering, or a related field.

8+ years of hands-on software engineering experience, with a proven track record of leading technical teams in systems or infrastructure domains for 3+ years.

3+ years of managerial experience.

C++ Expertise: Strong background in C++ development for high-performance applications (System-level programming, concurrent programming).

Infrastructure & DevOps: Practical experience managing on-premise servers, Linux environments, and modern DevOps tools (Kubernetes, Slurm, Docker, Ansible).

Operational Rigor: Ability to manage "heavy" operations, including ensuring uptime, monitoring system health, and optimizing hardware utilization.

Ways to stand out from the crowd:

Networking Knowledge: Deep understanding of computer networking fundamentals (TCP/IP, Ethernet, InfiniBand, Congestion Control) and data center architectures.

Simulation/Modeling: Experience with discrete event simulation (DES) or modeling complex systems.

HPC Background: Experience working with MPI, CUDA, or other High-Performance Computing frameworks.

Specific Simulators: Familiarity with standard network simulators like OMNeT++, NS-3, or similar proprietary tools.

Hardware Knowledge: Understanding of switch micro-architecture or NIC design is a significant plus.
This position is open to all candidates.
 
8494134
5 days ago
Location: Yokne'am
Job Type: Full Time
We are looking for an excellent first-line System Engineering manager to join our group in Yokne'am.

As the team manager, you will lead a development team responsible for the system bring-up of next-generation networking products. The team focuses on firmware validation tests for cutting-edge NVIDIA networking and AI platforms and silicon. We drive the data growth of the world's biggest companies. We work with talented engineers around the globe, and the work environment is dynamic, meaningful, and fast-paced.


What you will be doing:

Lead a team of highly skilled engineers, providing technical guidance and career mentorship. Empower team members to excel and increase team productivity.

Define and implement new product integration and validation in the core of our HCA and DPU products.

Drive and facilitate the planning, scheduling, and execution of the project and activities of the team.

Collaborate with architecture, product and different software design teams as part of the software development lifecycle.

Work on pre- and post-silicon activities for our next-generation networking products.

Gain a deep understanding of networking technology, system debugging, as well as the HW/FW/SW relationship.
Requirements:
What we need to see:

B.Sc. in Computer Science, Computer Engineering, or Electrical Engineering.

2+ years of managerial experience.

6+ years of overall relevant experience in firmware development or system validation.

Strong system-level understanding and hands-on lab experience

Programming skills in Python.

Strong analytical, creative, debugging, and problem-solving skills.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

Excellent interpersonal and intercultural skills, including written and verbal communication in English


Ways to stand out from the crowd:

Knowledge of SW-HW interfaces, PCIe, memory subsystems, storage components, network applications, device drivers, and real-time software.

Programming knowledge in Python/C/C++

Deep understanding of computing architectures, hardware system block diagrams, and schematic sheets.

Previous roles in system validation, as a technical lead, or as a silicon bring-up leader.
This position is open to all candidates.
 
8499631
7 days ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a passionate and innovative Senior Software Engineer with expertise in enterprise and data center networking to join our Cumulus Linux team. Cumulus Linux is a leading open-networking operating system and a cornerstone of the AI Factory - the next-generation data center designed to power the training, fine-tuning, and deployment of AI models at scale. As part of the team, you will design and implement core features of Cumulus Linux that enable the world's most advanced data centers. You will work closely with cross-functional architecture and design teams, shaping the future of our networking technologies while gaining hands-on experience across our hardware and software ecosystem - from advanced networking ASICs to large-scale distributed systems.

What you'll be doing:

Design, develop, integrate, and test data forwarding and routing features in Cumulus Linux.

Enable Cumulus Linux on next-generation ASICs.

Work collaboratively with team members, product managers, architects, QA, and other engineering teams to deliver high-quality solutions.

Innovate and rapidly develop proof-of-concept (POC) prototypes that can evolve into fully developed products or solutions.

Engage closely with customers to understand their challenges, use cases, and deployment strategies, and devise innovative solutions.
Requirements:
What we need to see:

BS or MS degree in Computer Engineering, Computer Science, or a related field.

Over 5 years of experience as a Software Engineer.

Excellent C programming skills on Linux.

Familiarity with forwarding and routing networking concepts.

Strong analytical skills and a deep understanding of data structures and algorithms.

Excellent communication skills.

Ways to stand out from the crowd:

Deep understanding of forwarding and routing architectures and protocols.

Strong expertise in Linux systems, with a focus on kernel-level networking.

Hands-on experience with merchant silicon platforms for high-performance switching and routing.
This position is open to all candidates.
 
8496553
7 days ago
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an AI Test Architect to join the E2E Verification group and profile innovative large-scale distributed training on our AI end-to-end solutions in large-scale supercomputing clusters. You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated Computing and Deep Learning software and hardware platforms, and with researchers, developers, and customers, to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, Switch, HCA, CPU and GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms.

What you'll be doing:

Profiling, benchmarking, and analyzing deep learning models to identify areas for optimization and improvement in terms of performance, efficiency, and accuracy, with a strong emphasis on networking aspects.

Collaborating closely with data scientists, researchers, and development and automation teams to design and implement scalable training pipelines and frameworks that demonstrate large-scale, high-performance networking capabilities.

Staying up-to-date with the latest advancements in deep learning algorithms, architectures, our GPU technologies, and high-performance networking solutions.

Optimizing deep learning models for performance, memory usage, and power efficiency while maximizing high-performance networking features on our supercomputers.

Providing insights and recommendations based on the analysis of large-scale training results, specifically focusing on networking bottlenecks and optimizations, to improve model outcomes and achieve business objectives.

Collaborating with hardware engineers to guide the development and integration of efficient networking solutions for deep learning, including exploring network architecture optimizations and bringing to bear technologies such as RDMA or InfiniBand.
Requirements:
What we need to see:

B.Sc. in Computer Science, Software Engineering, or equivalent experience.

Strong understanding and practical experience with machine learning algorithms and techniques, with a specialization in deep learning and expertise in high-performance networking.

8+ years of overall experience, including CUDA programming for deep learning frameworks such as TensorFlow and PyTorch, combined with expertise in networking libraries and protocols.

Ability to profile and optimize deep learning workflows, focusing on networking-related bottlenecks and optimizations, to improve overall performance and efficiency.

Exceptional analytical and problem-solving skills, with keen attention to detail, particularly in identifying and resolving networking performance issues.

Excellent communication and collaboration skills, enabling effective teamwork and cooperation.

Familiarity with supercomputers, parallel computing, distributed systems, and high-performance networking technologies like RDMA or InfiniBand.

Ways to stand out from the crowd:

Demonstrated experience in successfully profiling and optimizing large-scale deep learning training on our supercomputers, with a significant focus on high-performance networking enhancements.

Experience with distributed deep learning, distributed training frameworks, or large-scale data pipelines enhanced by high-performance networking solutions.

Expertise in optimizing networking parameters, such as bandwidth, latency, or congestion control, for deep learning workloads.

Familiarity with our networking technologies, such as Mellanox InfiniBand, and their integration with deep learning workflows.

Strong understanding of high-performance networking protocols and standards and their application to deep learning.
This position is open to all candidates.
 
8496288
5 days ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a 100% hands-on Storage Services Software Engineer to join the block storage group. You will be a member of a team that builds the next-generation block storage capabilities. You will work closely with a variety of teams and architects, including the networking team, and with external customers. You will take part in defining the software architecture and implementation of the most advanced storage services! Services that will need to meet extreme performance and scalability demands! We have crafted a team of extraordinary people stretching around the globe, whose mission is to push the frontiers of what is possible today and define the platform of tomorrow.

We work, think and learn as a team. We thrive in a deeply strong environment, and we're passionate about a culture that demands innovation and the highest standards. The rewards are sweet and include collaborating with some of the smartest people in the industry, an aggressive compensation plan that rewards top performers, and the opportunity to work on products that transform the way people work and play.

What you'll be doing:

100% hands-on coding role in C, in both kernel and userspace.

Research, design, implement, and test new and existing networking features for distributed storage services and features of our block storage solution, in both host and DPU environments.

Acquire an understanding of the algorithms, the technical details, and the interaction with other components across our block storage ecosystem.

Analyze and solve challenging bugs and customer cases in large-scale production systems, identifying issues in our own or inbox kernel modules and often in other components. Drive new solutions based on any issues that arise.
Requirements:
What we need to see:

B.Sc. or M.Sc. in Computer Science, Electrical Engineering, or a related discipline (or equivalent experience).

15+ years of experience as a senior developer, preferably in the domains of storage, networking, and/or operating systems.

Strong proficiency in C/C++ programming.

Knowledge of networking fundamentals and experience in Linux-based networking environments.

Familiarity with RDMA technologies, including InfiniBand, RoCE, or iWARP, and experience with RDMA programming models and control and data paths. Comprehension of large and complex systems.

Proven professional experience in designing and developing distributed systems; experience in block storage and/or networking systems is an advantage.

Ability to work autonomously, with a proactive mindset and the perseverance to solve day-to-day challenges.

Ability to quickly adapt to new technology and go deep into new areas.

Excellent communication skills and a collaborative mindset.

Innovative approach, identifying opportunities to improve, accelerate, and reuse existing solutions.

Knowledge of cloud computing concepts, including virtualization, scalability, and data management.

Ways to Stand Out From the Crowd:

Linux Kernel coding experience.

Linux Kernel internals knowledge including memory management, scheduling, etc.
This position is open to all candidates.
 
8499984