Our technology has no boundaries! we are building the worlds most groundbreaking and state of the art accelerated compute platforms for the world to use. Its because of our work that scientists, researchers and engineers can advance their ideas. We pioneered a supercharged form of computing loved by the fastest paced computer users in the world - scientists, designers, artists, and gamers.
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML / AI computing. Our next-generation Infiniband and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used at almost every industry today, such as car and Pharmaceutical. As a high-performance system architect at our company, you will have the opportunity to work on some of the most cutting-edge technology and help to drive the innovation of our next generation networks that will be used by top researchers and engineers around the world.
What youll be doing:
Define the NVL system architecture end-to-end, by internal requirements and customers requirements through all product life cycles (post/pre silicon, on deployments).
Research of various solutions to enable the next large-scale-high-performance computing clusters. The position spans over various layers from algorithms, software, firmware, and HW.
Developing models for simulations and performance testing, analysing the results and development of future HW and SW.
Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements: B.Sc, M.Sc, or Ph. D degree in Computer Science, Computer Engineer, or Electrical Engineer.
At least 5 years of industry or research experience in computer networks.
Excellent understanding of large-scale networks behaviour and the effect of distributed computing workloads effect on the network.
Experience in development of simulation environments.
Possess strong managerial, problem solving and critical thinking skills.
Ability to work and operate in a highly dynamic environment.
Partner with multiple groups in the organization.
Ways to stand out of the crowd:
Strong understanding in network protocols - such as InfiniBand, IP, TCP and RoCE and network topologies.
Good knowledge in Python, C++.
Good knowledge with AI models.
Familiarity with HPC environments, routing algorithms, Omnet++ and NS3 simulation environments.
This position is open to all candidates.