We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML / AI computing. Our next-generation Infiniband and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used at almost every industry today, such as car and Pharmaceutical. As a high-performance system architect, you will have the opportunity to work on some of the most cutting-edge technology and help to drive the innovation of our next generation networks that will be used by top researchers and engineers around the world.
What youll be doing:
Define the Infiniband and NVL system architecture end-to-end, by internal requirements and customers requirements through all product life cycles (post/pre silicon, on deployments).
research of various solutions to enable the next large-scale-high-performance computing clusters. The position spans over various layers from algorithms, software, firmware, and HW.
The architect should have experience in developing models for simulations, analyzing simulation results and development of optimization algorithms.
Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements: What we need to see:
B.Sc, M.Sc, or Ph. D degree in Computer Science, Computer Engineer, or Electrical Engineer.
At least 5 years of industry or research experience in computer networks.
Excellent understanding of large-scale networks behaviour and the effect of distributed computing workloads effect on the network.
Experience in development of simulation environments.
Possess strong managerial, problem solving and critical thinking skills.
Ability to work and operate in a highly dynamic environment.
Partner with multiple groups in the organization.
Ways to stand out from the crowd:
Good knowledge in network protocols - such as InfiniBand, IP, TCP and RoCE and network topologies.
Good knowledge in Python, C++.
Familiarity with HPC environments, routing algorithms, Omnet++ and NS3 simulation environments.
This position is open to all candidates.