Senior System Software Engineer, NCCL - Partner Enablement

3 days ago
Location: Yokne'am and Tel Aviv-Yafo
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (InfiniBand, RoCE, Ethernet). This is an outstanding opportunity to get an end-to-end understanding of the AI networking stack. Are you ready to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Engage with our partners and customers to root cause functional and performance issues reported with NCCL.

Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters.

Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.).

Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters.

Document and conduct trainings/webinars for NCCL.

Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
What we need to see:

B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.

Experience working with engineering or academic research community supporting HPC or AI.

Practical experience with high-performance networking: InfiniBand/RoCE/Ethernet networks, RDMA, topologies, congestion control.

Expert in Linux fundamentals and a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible).

Adaptability and passion to learn new areas and tools.

Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, especially for large clusters. Experience debugging network configuration issues in large-scale deployments.

Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning frameworks such as PyTorch and TensorFlow.

Deep understanding of technology and passionate about what you do.
This position is open to all candidates.
 
8203558
Similar jobs that may interest you
3 days ago
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance Engineer to influence the roadmap of our communication libraries. Today's DL and HPC applications have huge compute demands and run at scales of up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (e.g., NVLink, PCIe) within a node and with high-speed networking (e.g., InfiniBand, Ethernet) across nodes. Communication performance between the GPUs has a direct impact on end-to-end application performance, and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.

Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack.

Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available.

Triage and root-cause performance issues reported by our customers.

Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information.

Collaborate with a very dynamic team across multiple time zones.
Requirements:
What we need to see:

M.S. (or equivalent experience) or Ph.D. in Computer Science or a related field, with relevant performance engineering and HPC experience.

3+ years of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Experience conducting performance benchmarking and triage on large scale HPC clusters.

Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals).

Ability to implement micro-benchmarks in C/C++ and to read and modify the code base when required.

Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker).

Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Practical experience with InfiniBand/Ethernet networks in areas like RDMA, topologies, and congestion control.

Experience debugging network issues in large scale deployments.

Familiarity with CUDA programming and/or GPUs.

Experience with Deep Learning frameworks such as PyTorch and TensorFlow.
This position is open to all candidates.
 
8203543
3 days ago
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions that were once considered science fiction, from artificial intelligence to autonomous cars. Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable communications software.

DL and HPC applications have huge compute demands and already run at scales of up to tens of thousands of GPUs. GPUs are connected with high-speed interconnects (e.g., NVLink, PCIe) within a node and with high-speed networking (e.g., InfiniBand, Ethernet) across nodes. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next-generation systems. This is an outstanding opportunity to advance the state of the art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize our vision?

What you will be doing:

Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.

Design and implement new communication technologies to accelerate AI and HPC workloads.

Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.

Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations.

Use simulation to explore the performance of large GPU clusters (think scales of hundreds of thousands of GPUs).
Requirements:
What we need to see:

M.S./Ph.D. degree in CS/CE or equivalent experience.

5+ years of relevant experience.

Excellent C/C++ programming and debugging skills.

Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).

Deep understanding of operating systems, computer and system architecture.

Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.

Strong experience with Linux.

Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

Ways to stand out from the crowd:

Expertise in related technology and passion for what you do. Experience with CUDA programming and our GPUs. Knowledge of high-performance networks like InfiniBand, RoCE, NVLink, etc.

Experience with Deep Learning frameworks such as PyTorch, TensorFlow, etc. Knowledge of deep learning parallelisms and their mapping to the communication subsystem. Experience with HPC applications.

Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.
This position is open to all candidates.
 
8203578
26/05/2025
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms.

What you'll be doing:

Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.

Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.

Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.

Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.

Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.

Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.

Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.

Drive Innovation: Support R&D activities and engage in proofs of concept (POCs) and proofs of value (POVs) for future improvements.
Requirements:
What we need to see:

B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.

Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.

Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.

Familiarity with Jenkins, Ansible, Puppet/Chef.

Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu), networking, and OS-level security.

Deep understanding of networking protocols such as InfiniBand and Ethernet.

Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.

Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.

Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).

Familiarity with cloud platforms (AWS, Azure, Google Cloud).

Ways to stand out from the crowd:

Architectural Insight: Knowledge of CPU and/or GPU architecture.

Container Expertise: Understanding of Kubernetes and container-related microservice technologies.

GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).

RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8193596
3 days ago
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms.

What you'll be doing:

Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.

Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.

Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.

Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.

Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.

Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.

Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.

Drive Innovation: Support R&D activities and engage in proofs of concept (POCs) and proofs of value (POVs) for future improvements.
Requirements:
What we need to see:

B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.

Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.

Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.

Familiarity with Jenkins, Ansible, Puppet/Chef.

Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu), networking, and OS-level security.

Deep understanding of networking protocols such as InfiniBand and Ethernet.

Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.

Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.

Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).

Familiarity with cloud platforms (AWS, Azure, Google Cloud).

Ways to stand out from the crowd:

Architectural Insight: Knowledge of CPU and/or GPU architecture.

Container Expertise: Understanding of Kubernetes and container-related microservice technologies.

GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).

RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8203606
28/05/2025
Location: More than one
Job Type: Full Time
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML/AI computing. Our next-generation InfiniBand and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used in almost every industry today, such as automotive and pharmaceutical. As a high-performance system architect, you will have the opportunity to work on some of the most cutting-edge technology and help drive the innovation of our next-generation networks, which will be used by top researchers and engineers around the world.

What you'll be doing:

Define the InfiniBand and NVL system architecture end-to-end, based on internal and customer requirements, through all product life cycles (pre-/post-silicon and deployments).

Research various solutions to enable the next large-scale high-performance computing clusters. The position spans various layers, from algorithms and software to firmware and HW.

The architect should have experience in developing models for simulations, analyzing simulation results, and developing optimization algorithms.

Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements:
What we need to see:

B.Sc., M.Sc., or Ph.D. degree in Computer Science, Computer Engineering, or Electrical Engineering.

At least 5 years of industry or research experience in computer networks.

Excellent understanding of large-scale network behaviour and the effect of distributed computing workloads on the network.

Experience in development of simulation environments.

Possess strong managerial, problem solving and critical thinking skills.

Ability to work and operate in a highly dynamic environment.

Partner with multiple groups in the organization.

Ways to stand out from the crowd:

Good knowledge of network protocols (such as InfiniBand, IP, TCP, and RoCE) and network topologies.

Good knowledge of Python and C++.

Familiarity with HPC environments, routing algorithms, and the OMNeT++ and ns-3 simulation environments.
This position is open to all candidates.
 
8196190
28/05/2025
Location: More than one
Job Type: Full Time
We are seeking a highly motivated High-Performance System Architect to join our team of experts and help shape the future of high-performance and ML/AI computing. Our next-generation InfiniBand and NVL systems will be at the forefront of connecting and powering the world's most advanced compute clusters, from supercomputers used in AI research to high-performance clusters used in almost every industry today, such as automotive and pharmaceutical. As a high-performance system architect, you will have the opportunity to work on some of the most cutting-edge technology and help drive the innovation of our next-generation networks, which will be used by top researchers and engineers around the world.

What you'll be doing:

Define the InfiniBand and NVL system architecture end-to-end, based on internal and customer requirements, through all product life cycles (pre-/post-silicon and deployments).

Research various solutions to enable the next large-scale high-performance computing clusters. The position spans various layers, from algorithms and software to firmware and HW.

The architect should have experience in developing models for simulations, analyzing simulation results, and developing optimization algorithms.

Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
Requirements:
What we need to see:

B.Sc., M.Sc., or Ph.D. degree in Computer Science, Computer Engineering, or Electrical Engineering.

At least 5 years of industry or research experience in computer networks.

Excellent understanding of large-scale network behaviour and the effect of distributed computing workloads on the network.

Experience in development of simulation environments.

Possess strong managerial, problem solving and critical thinking skills.

Ability to work and operate in a highly dynamic environment.

Partner with multiple groups in the organization.

Ways to stand out from the crowd:

Good knowledge of network protocols (such as InfiniBand, IP, TCP, and RoCE) and network topologies.

Good knowledge of Python and C++.

Familiarity with HPC environments, routing algorithms, and the OMNeT++ and ns-3 simulation environments.
This position is open to all candidates.
 
8196183
28/05/2025
Job Type: Full Time
We are hiring a Solution Architect to join customer-facing teams supporting different technical areas such as IB/ETH networking, DPU, Cloud infrastructure DevOps, HPC/AI workloads, and customer success. In the process, you will have the opportunity to become a specialist in our enterprise products, including the DGX/HGX systems and networking DPUs and switches, as well as our developer software platforms, including Omniverse, HPC, and AI. It may also be possible to rotate with our program management or engineering teams. Are you ready for the challenge?

What you'll be doing:
Be responsible for the setup of experiments, tests, equipment, and otherwise facilitate evaluations that help solve customer problems using our technologies.
Partner with Build and Program Management teams to support AI Factory deliveries globally.
Establish close technical ties to the customer account, establishing personal relationships to facilitate rapid resolution of customer issues.
Work closely and collaborate with the customer account team, other Solution Architects, and/or product engineering teams during quarterly rotation assignments.
Raise and provide timely advance warning of critical customer issues that require additional attention.
Present platform solutions to customers, partners, community, etc.
Some rotation assignments might require up to 15% travel.
Requirements:
What we need to see:
Bachelor's degree in computer science or electrical/mechanical engineering.
1+ years of experience in data center infrastructure.
Knowledge in Data Sciences, Deep Learning, or Machine Learning.
Strong programming skills in one or more high-level languages such as Python, C, C++, or Rust.
Motivated self-starter with an equal balance of strong problem-solving skills and strong customer-facing communication skills including presenting.
Strong collaboration and interpersonal skills.
Passion for continuous learning, knowledge transfer and working in a dynamic environment without losing focus.

Ways to stand out from the crowd:
Background working on AI Deep Learning and Machine Learning applications, AI model training/inference, or other GPU-related technologies, including TensorFlow, PyTorch, other DL frameworks, or CUDA.
Experience working with Docker, Kubernetes, or Slurm on both on-prem and cloud-based infrastructure, including HPC or AI supercomputer clusters.
System, networking and storage hardware, software and administrative experience.
Exposure to cloud service platforms such as AWS, Azure, GCP or OCI through coursework or through certification programs.
This position is open to all candidates.
 
8196550
28/05/2025
Confidential company
Location: More than one
Job Type: Full Time
We are looking for an enthusiastic software engineer to join our AI networking acceleration team, to work on a groundbreaking open-source library, using hardware offloads, GPU Kernels and RDMA network cards. Our product is a performance-oriented low-level infrastructure, crafted to change the way inference works.

We thrive as a team in a deeply strong environment, and we're passionate about innovation. The rewards are sweet and include working with some of the brightest people in the industry, an aggressive compensation plan that rewards top performers, and the opportunity to collaborate on products that transform daily the way people work and play.

What you'll be doing:

Developing a highly optimized inference framework

Running on the world's largest supercomputers and data centers.

The work environment is dynamic and challenging as our employees work on innovative, next-generation products at the forefront of technology in terms of performance, scalability, and features.
Requirements:
What we need to see:

B.Sc. or equivalent experience in Computer Science or Software Engineering.

5 years of experience in modern C++ / C / Rust development.

3 years of experience in a Linux environment and familiarity with development tools.

Deep knowledge of the TCP/IP network stack.

Understanding of computer architecture and operating systems concepts.

Ways to stand out from the crowd:

Background in Linux internals and low-level software optimizations (benchmarking, bottleneck research, performance tuning).

Experience in programming CUDA kernels is an advantage.

Familiarity with ML frameworks and LLMs.

Background in parallel programming / high-performance computing / RDMA technology.
This position is open to all candidates.
 
8196337
Location: Tel Aviv-Yafo
Job Type: Full Time
About the job
Our company's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to our company's needs, with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.
In this role, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
Our company Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage our company's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to our company Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities
Write product or system development code.
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
Requirements:
Minimum qualifications:
Bachelor's degree or equivalent practical experience.
5 years of experience with software development in one or more programming languages, and with data structures/algorithms.
5 years of experience in C or C++ programming.
Preferred qualifications:
Master's degree or PhD in Computer Science or related technical fields.
2 years of experience with performance, large scale systems data analysis, visualization tools, or debugging.
Experience developing accessible technologies.
Experience in code and system health, diagnosis and resolution, and software test engineering.
Interest in the following areas: performance debugging and optimization of workloads, design of performance tools, compiler design and code optimization, high-performance software development techniques, concurrent programming, or multi-core computer architectures.
This position is open to all candidates.
 
8187427
3 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are now looking for a motivated Chip Architecture Engineer to use your creativity to work on the Spectrum Switch with a highly inventive and knowledgeable team.

Our technology has no boundaries! NVIDIA is building the world's most groundbreaking and state-of-the-art compute platforms for the world to use. It's because of our work that scientists, researchers and engineers can advance their ideas. At its core, our visual computing technology not only enables an outstanding computing experience, it is energy efficient! We pioneered a supercharged form of computing loved by the most fast-paced computer users in the world: scientists, designers, artists, and gamers. It's not just technology though! It is our people, some of the brightest in the world, and our diverse company culture that make us one of the most fun, innovative and dynamic places to work in the world! At the center of our culture are our core values, such as innovation, excellence, determination, and team, which guide us to be the best we can be.

What you'll be doing:

Be part of the team that defines the Spectrum Switch chip architecture end to end from the market requirements through design and all product life cycles (post/pre-silicon, on deployments).

Be part of the team that defines the GPU interconnect protocol, and defines the Architecture for GPU interconnect.

Work with related industry standards & customers on deploying your tech.

Collaborate across teams (physical design, logic design, system software, firmware, applications).

Perform research and analysis for current and future architectures.

Develop Proof of Concepts using our technology, collaborating with our most sophisticated customers on state-of-the-art innovations.
Requirements:
What we need to see:

B.Sc. in Electrical or Computer Engineering.

3+ years of relevant experience.

Programming skills.

Knowledge and understanding of computing and networking systems.

A can-do attitude and high energy, with leadership and excellent interpersonal skills, and the ability to learn complex concepts in a fast-paced environment.

You have the utmost passion for attention to detail on design and a high focus on design quality.

Ways to stand out from the crowd:

Experience and love for system architecture, CPU/GPU/Memory/Storage/Networking.
This position is open to all candidates.
 
8204145