Research Software Engineer, Advanced Development

Posted 4 hours ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are searching for world-class Software Engineers to join our growing software architecture Research team. The ideal candidate will conduct cutting-edge research at the intersection of Networking, Security, Communications, AI and Distributed GPU computing, working alongside top experts in these fields. With incredible resources in networking and compute, you will be able to impact, contribute to and advance these domains for scalable accelerated computing. Topics include but are not limited to remote direct memory access, hardware offloading and hardware acceleration, distributed accelerator networks, AI for networking and security, storage management, cryptography accelerators and architecture, LLM network traffic optimizations and AI collectives. With our unique open culture, we are one of the best industry labs to do Accelerated Computing research.
What you'll be doing:
Enhance our company's GPU Networking offerings for accelerating AI workloads, such as our company's Dynamo or NIXL.
Develop and evaluate new technologies and innovations relevant to scientific, Deep Learning, and data-intensive workloads.
Create proofs of concept to evaluate and drive such new technologies.
Work on impactful projects involving state-of-the-art high-performance computing software and hardware.
Design and implement services, runtime systems, and applications over the SDK.
Partner and collaborate with other forward-thinking team members and external researchers.
Requirements:
Hold a B.Sc. or M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a leading university.
0-2 years of industry experience (or equivalent) in system programming or related fields.
Background in algorithm design, system programming, and computer architecture.
Strong programming and software development skills.
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Proven research track record.
Experience and passion for system architecture, CPU/GPU/Memory/Storage/Networking.
Stellar communication skills.
Knowledge in Deep Learning frameworks and AI communication libraries (NCCL, UCX, MPI and equivalents).
This position is open to all candidates.
 
8465302
Similar jobs that may interest you
18/11/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are in search of a Senior Software Architect: a creative, forward-thinking, and practical researcher to improve the framework for widespread LLM learning and prediction. As part of our dynamic E2E Architecture group, you will design and optimize systems driving generative AI workloads, working at the intersection of software and hardware on some of the most advanced GPU clusters worldwide. You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and compute scheduling to system-level optimization. This is an opportunity to collaborate with best-in-class engineers and researchers and shape the future of generative AI in real-world applications. Your work will make a lasting impact by enabling generative AI technologies to reach real-world applications and improve global computing capabilities.

What You'll Be Doing:

Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, and placement.

Analyze current deployments, develop prototypes, and recommend architectural improvements.

Stay abreast of the latest research; become the team's authority in emerging networking techniques and technologies.

Design, simulate, and validate new systems using the novel, scalable network simulator NSX.

Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).

Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.

Publish patents and present research at leading conferences.
Requirements:
What We Need to See:

M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related field, or B.Sc. with research experience and publications.

5+ years of relevant experience.

Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).

Strong software engineering skills in C++ and/or Python.

Excellent system-level design and problem-solving abilities.

Outstanding communication and collaboration skills across technical domains.

Ways to Stand Out from the Crowd:

Proven passion for solving sophisticated technical problems and delivering impactful solutions.

Record of publications in top-tier conferences.

Experience in designing and building large-scale AI training clusters.

Post-PhD research experience.

Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
This position is open to all candidates.
 
8418932
Posted 4 hours ago
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
The company Networking Advanced Development Software team develops new groundbreaking technologies to enable new market shares for the company and tighten customer relationships. These are emerging technologies in networking and distributed computing for the booming AI factories and data centers. They span areas such as AI neural networks, Deep Learning, High Performance Computing (HPC), Storage, Cloud, SW Defined Network, Network Function Virtualization, 5G NR and more. We develop the solutions top-down, all the way from application behavioral analysis, to architecture definition and down to the implementation, using the world-leading company devices. The development traverses any needed component - application SW, middleware SW, OS kernel subsystems, device drivers, embedded SW (Firmware) and CUDA GPU. We collaborate with partners and key customers in the analysis processes and engage with open source communities introducing our leading features.
What you'll be doing:
Lead a team of 5 engineers in advanced technology development
Design and implement solutions throughout all layers from high level application, OS and driver subsystem to firmware
Work on impactful projects involving state-of-the-art high-performance computing hardware and software
Provide insight and technical guidance and collaborate with peers from across the company - including software architecture, chip architecture, and engineering departments to improve our future technology
Collaborate with our company partners and customers.
Requirements:
B.Sc. in Computer Science, Electrical Engineering, Computer Engineering, or a related field, or equivalent practical experience
10+ overall years of industry experience in system programming or related fields and 3+ years of experience leading a team
Understanding of multi core hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers, real-time
Excellent programming skills
Ability to learn complex concepts in a fast-paced environment
A teammate with a can-do attitude, high energy and excellent interpersonal skills
Ways to stand out from a crowd:
Familiarity with networking protocols
Experience with open-source projects (coursework, personal, or contributions)
Working in a fast-paced and dynamic environment.
This position is open to all candidates.
 
8465195
Posted 4 hours ago
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
The company Networking Advanced Development Software team develops new groundbreaking technologies to enable new market shares for the company and tighten customer relationships. These are emerging technologies in networking and distributed computing for the booming AI factories and data centers. They span areas such as AI neural networks, Deep Learning, High Performance Computing (HPC), Storage, Cloud, SW Defined Network, Network Function Virtualization, 5G NR and more. We develop the solutions top-down, all the way from application behavioral analysis, to architecture definition and down to the implementation, using the world-leading company devices. The development traverses any needed component - application SW, middleware SW, OS kernel subsystems, device drivers, embedded SW (Firmware) and CUDA GPU. We collaborate with partners and key customers in the analysis processes and engage with open source communities introducing our leading features.
What you'll be doing:
Design and implement solutions throughout all layers from high level application, OS and driver subsystem to firmware
Work on impactful projects involving state-of-the-art high-performance computing hardware and software
Provide insight and technical guidance and collaborate with peers from across the company - including software architecture, chip architecture, and engineering departments to improve our future technology
Collaborate with our company partners and customers.
Requirements:
B.Sc. in Computer Science, Electrical Engineering, Computer Engineering, or a related field
Understanding of multi core hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers, real-time
Programming skills
Ability to learn complex concepts in a fast-paced environment.
A teammate with a can-do attitude, high energy and excellent interpersonal skills
Ways to stand out from a crowd:
Familiarity with networking protocols
Experience with open-source projects (coursework, personal, or contributions)
Working in a fast-paced and dynamic environment.
This position is open to all candidates.
 
8465199
Posted 5 hours ago
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We seek a highly motivated Network Performance Exploration Engineer to join our team of experts and help shape the foundational infrastructure for the AI revolution. Our next-generation networking systems are at the forefront of connecting and powering the world's most advanced AI clusters. As a key member of our architecture team, you will be responsible for exploring and identifying critical network optimization opportunities across our entire hardware and software stack, analyzing how system-level changes impact application-level performance.
What You'll Be Doing:
Explore and validate end-to-end application performance, defining comprehensive test plans and critical metrics to identify optimization opportunities in both hardware and software.
Establish and maintain a comprehensive database of benchmark results, tracking performance across releases to drive data-informed decisions.
Conduct deep-dive analysis into communication libraries (like NCCL), system software, and hardware configurations to investigate performance characteristics, validate architectural theories, and identify bottlenecks.
Provide critical performance data to correlate and enhance simulation tools, ensuring our models accurately predict real-world hardware behavior.
Analyze application-level traffic patterns (e.g., LLMs) on our advanced networking fabrics to identify hardware and software optimization opportunities and tune system parameters.
Lead Proof-of-Concept (POC) projects to prototype and evaluate potential hardware and software optimizations and their impact on application performance.
Requirements:
B.Sc. or M.Sc. degree in Computer Science, Computer Engineering, or Electrical Engineering, or equivalent experience.
5+ years of relevant industry or research experience in high-performance computing, computer architecture, or computer networks.
Hands-on programming skills in Python and/or C/C++ for system analysis, automation, and customizing benchmarks.
Excellent understanding of large-scale system behavior and the effect of distributed computing workloads on network and system performance.
Proven experience in performance analysis, benchmarking, and identifying system bottlenecks.
Exceptional analytical, problem-solving, and systems-thinking skills, with the ability to dive deep into complex software and hardware interactions.
Ability to thrive in a fast-paced, dynamic environment and work concurrently with multiple cross-functional teams.
Ways To Stand Out From The Crowd:
Deep understanding of and hands-on experience with communication libraries such as NCCL, UCX, or MPI.
Direct experience debugging or modifying the source code of a major communication library.
Expertise in the architecture and system-level requirements of large-scale, distributed Deep Learning workloads (e.g., LLMs).
Expertise in high-performance network protocols (Ethernet, InfiniBand, RoCE) and interconnect technologies like NVLink.
Familiarity with the PyTorch ecosystem, especially for distributed workloads.
This position is open to all candidates.
 
8465097
Posted 50 minutes ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly skilled Senior Networking AI Platform Engineer to join our Applied Networking AI group. In this role, you will help design and develop cutting-edge AI solutions, integrating them seamlessly into a variety of products. You'll collaborate closely with multi-functional teams of data scientists, software engineers, and DevOps professionals to ensure the efficient deployment, monitoring, and optimization of machine learning (ML) models.
As a key contributor, you will drive the entire software development lifecycle, from conceptualization and architecture to implementation and production, while working closely with engineering teams to solve complex problems and help build a successful company practice.
What you'll be doing:
Lead the design, development, and deployment of robust software systems across different platforms and environments
Architect, design, and implement scalable and high-performance software solutions, handling complex requirements and integrating various subsystems
Ensure systems are maintainable, flexible, and well-documented, with an emphasis on performance and security
Adapt to new tools, technologies, and frameworks, and be capable of taking ownership of the development process from conception to deployment
Supply innovative ideas and solutions, driving continuous improvement in both code quality and system efficiency
Develop and maintain scalable infrastructure for handling and deploying security and networking ML models in production, ensuring high availability, scalability, and performance.
Design and implement data pipelines to efficiently process and transform large volumes of data for training and inference purposes.
Optimize and fine-tune ML models for performance, scalability, and resource utilization, considering factors such as latency, efficiency, and cost.
Collaborate with data scientists and software engineers to operationalize and deploy ML models, including model versioning, packaging, and integration with existing systems.
Requirements:
Bachelor's or master's degree in Computer Science, Data Science, or a closely related discipline.
Over 5 years of experience in software development and/or MLOps.
Strong proficiency in programming languages such as Python, Java, C++.
Deep understanding of cloud services architecture and the ability to create real-world applications that include telemetry, authentication, authorization, and security standard methodologies.
Proven track record of leading complex software projects from concept to delivery.
A "can do" attitude with exceptional problem-solving skills and the ability to thrive in fast-paced environments.
Strong problem-solving skills and ability to solve and resolve sophisticated issues in a timely manner.
Excellent communication and collaboration skills, with the ability to work effectively in multi-functional teams.
Attention to detail and a focus on quality, ensuring robustness and reliability in production ML systems.
Experience with Kubernetes architecture and management is a plus.
Ways to stand out from the crowd:
Exude high energy and a positive attitude.
Stellar verbal and written communication skills.
Passionate about data science and implementation.
Have data science and GPU performance experience.
Want to make what was impossible possible!
This position is open to all candidates.
 
8465950
Posted 3 hours ago
Location: Tel Aviv-Yafo
Job Type: Full Time
Our company has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. We seek a Senior SW Performance Engineer to join our performance verification team. As a Performance Engineer at our company, you will work closely with our company's development and architecture teams responsible for the Ethernet AI solution and gain a deep understanding of our company's products and technologies.
What you'll be doing:
Participate in an international team of software engineers working on products for testing our company products
Build an automated verification environment for high-end hardware and software at the forefront of innovation
Identify, analyze, and report software defects, inconsistencies, and other quality issues.
Drive improvements for performance, quality, stability around SW acceleration solutions.
Stay up to date with industry standard methodologies, new technologies, and emerging trends in software verification.
Requirements:
B.Sc. degree or equivalent experience in Engineering/Computer Science/related field
4+ years of experience as a Software Engineer
Strong programming skills in Python
Expertise in networking & compute infrastructure (servers, switches, routers, TCP/UDP).
Knowledge of how to tune an environment for the best performance and deploy infrastructure based on innovative technologies and high-end hardware.
Strong technical abilities, problem-solving skills, coding, and design skills
Ability to lead feature development, take full ownership and deliver independently
Linux knowledge: a general understanding of Linux operating system concepts
Ways to stand out from the crowd:
Knowledge in performance testing scenarios and creation of performance reports.
Proven experience in a leadership role, with a track record of successfully leading scrums and projects
Strong communication and interpersonal skills, with the ability to motivate and inspire others.
Knowledge in one or more Networking areas: Ethernet, VLANs, TCP/UDP/IP, QoS, L2-L3 protocols
Prior software testing experience, with an understanding of Software Testing Tools and Methodologies and Python expertise.
This position is open to all candidates.
 
8465384
Location: Tel Aviv-Yafo
Job Type: Full Time
Our company's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to our company's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
In this role, you will work with system teams and the CPU Architecture team to develop an understanding of the Central Processing Unit (CPU), System on a Chip (SoC), performance metrics, benchmarks/measuring tools, and available optimization knobs. You will define methods and technologies to model CPU performance at different accuracy levels by supporting architectural explorations and decision making. You will correlate performance projections with measured post-silicon data.
The AI and Infrastructure team is redefining what's possible. We empower our company customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers, our company Cloud customers, and billions of our company users worldwide.
We're the driving force behind our company's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for our company Cloud, our company Global Networking, Data Center operations, systems research, and much more.
Responsibilities
Write product or system development code.
Design, develop, test, deploy, maintain, and improve Central Processing Unit (CPU) software modeling and other software tools.
Manage project priorities, deadlines, and deliverables.
Collaborate with hardware and software CPU architecture teams, SoC performance modeling team, and other our company Software teams.
Requirements:
Minimum qualifications:
Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or equivalent practical experience.
2 years of experience with software development in C++ programming language or 1 year of experience with an advanced degree.
Preferred qualifications:
Master's degree or PhD in Engineering, Computer Science, or a related technical field.
2 years of experience with data structures or algorithms.
Experience in modern CPU/Machine Learning (ML) architecture and micro-architecture.
Ability to learn coding languages.
Excellent object-oriented database design and SQL skills.
This position is open to all candidates.
 
8413477
18/11/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We have improved AI infrastructure by merging GPU virtualization with Kubernetes-native tech to power innovative AI factories. We aim to speed up enterprise AI projects with smart orchestration and scalability for AI workloads. We are seeking a skilled Senior Software Engineer for our Infrastructure Group to innovate AI technology. The Infrastructure Group is tasked with composing and evolving the core systems responsible for thousands of GPUs and nodes driving enterprise AI. We invent the foundation that facilitates elastic, secure, and observable AI operations at extensive scale. We are seeking engineers who are passionate about distributed systems, modern cloud-native infrastructure, and AI performance optimization.

What you'll be doing:

Crafting and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.

Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.

Collaborating closely with backend engineers, product managers, and other partners to deliver impactful solutions.

Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.

Conducting code and build reviews to uphold high-quality standards and mentor team members.

Leading the development and implementation of advanced identity management systems that secure our innovative AI and GPU cloud.

Developing scalable multi-tenant solutions that allow our diverse clientele to harness the power of our platforms securely and efficiently.

Collaborating with multi-functional teams to integrate identity and access management features seamlessly into our products, from cloud services to edge computing devices.
Requirements:
What we need to see:

B.Sc. in Computer Science or a related field (or equivalent experience).

5+ years of experience

Experience in backend software development, including system design and architecture.

Proficiency in at least one backend programming language (Go preferred).

Strong knowledge in microservices architecture, RESTful APIs, and relational databases.

Proficient knowledge of security guidelines and experience applying them in large-scale systems.

Expertise in implementing OAuth, OIDC, SAML, and other modern authentication protocols (an advantage).

Ways to stand out from the crowd:

Expertise in Kubernetes internals and advanced cloud-native technologies.

Experience working in Linux environments with knowledge of networking, security, and virtualization.

Contributions to open-source projects or active participation in tech communities.

Agile approach and familiarity with standard methodologies.
This position is open to all candidates.
 
8418975
Posted 3 hours ago
Location: More than one
Job Type: Full Time
Our company has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an employee, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are seeking a highly skilled and modern software engineer to develop and prototype brand new advancements in distributed training and inference using our company's Spectrum-X AI fabric. This role offers a rare chance to pioneer AI and networking technology, contributing to ground-breaking projects that will define the landscape of large-scale AI systems. You will improve the connection between AI applications and the network by refining communication, crafting congestion control, coding NIC firmware, and expanding switch SDK features for enhanced AI factory efficiency. Your work impacts large AI system development, scaling, and speed.
What you'll be doing:
Prototype end-to-end solutions to improve distributed training and disaggregated inference performance.
Analyze and optimize communication flows across application, transport, and network layers.
Develop system software spanning communication libraries, drivers, and firmware integrations.
Collaborate with hardware, firmware, and SDK teams to co-design network features.
Validate and integrate prototypes into our company's AI infrastructure and products.
Requirements:
BSc/MSc/PhD in Computer Science or Electrical Engineering
5+ years of relevant experience and/or knowledge
Deep understanding of networking and communication internals (NCCL, RDMA/RoCE, congestion control).
Hands-on experience with HW/SW/FW integration and low-level programming (C/C++, kernel, drivers).
Some background in distributed training systems (such as PyTorch DDP, Megatron-LM, DeepSpeed).
Ways to stand out from the crowd:
Demonstrated innovation and leadership turning prototypes into impactful product features.
Experience with programmable data planes (P4, eBPF, DOCA SDK, or switch SDKs).
Familiarity with NIC firmware scheduling, in-network compute, or congestion management.
Contributions to open-source projects, academic papers, or performance benchmarking tools.
Strong background in AI factory architectures, distributed inference, or network telemetry.
This position is open to all candidates.
 
8465368
11/11/2025
Location: More than one
Job Type: Full Time
We are seeking an AI Networking Exploration Architect for our Networking Insights Group to bridge the gap between cutting-edge, hyper-scale AI workloads and the datacenter infrastructure that enables them. You will join a small, focused team of multidisciplinary engineers driving AI workload optimization through deep application understanding and end-to-end systems thinking. Your insights will directly shape our products across the full stack, from applications and software libraries to hardware architecture and physical design.

What You'll Be Doing:

Model the performance of complex AI workloads to identify bottlenecks and recommend system-level optimizations.

Translate state-of-the-art research into actionable infrastructure, software, and hardware features in partnership with architecture teams.

Rapidly master new AI domains (LLMs, generative models, multimodal systems) and distill key findings for product teams.

Incorporate your deep knowledge of AI applications into our hardware and software roadmaps.

Conduct independent research by formulating hypotheses about workload behavior and validating them through rigorous analysis.

Drive architectural innovation and network optimization by applying your domain expertise to exploratory analysis of real-world Deep Learning (DL) workloads.
Requirements:
What we need to see:

M.Sc. or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.

5+ years of experience.

Strong ML/Data Science background with hands-on experience in LLMs or generative AI.

A systems-level mindset with the ability to estimate end-to-end requirements across the entire AI stack.

Proven ability to translate research and product requirements into clear software/hardware specifications.

Exceptional research skills: you can digest academic papers, self-learn new domains, and independently test hypotheses.

Advanced Python programming skills for performance modeling and data analysis.

Excellent communication skills, with the ability to present complex findings with clarity and conviction.

A pragmatic approach: you are detail-oriented but can prioritize effectively to focus on the most critical issues.

Ways to Stand Out from the Crowd:

Deep understanding of datacenter infrastructure, network topologies, and protocols.

Expertise in distributed training methods and their impact on infrastructure.

Knowledge of AI performance metrics and the impact of different deployment strategies.

Experience extrapolating academic research into tangible hardware architecture requirements.

A track record of leading complex, multidisciplinary research projects that result in production impact.
This position is open to all candidates.
 
8409343