Senior System Software Engineer, NCCL - Partner Enablement

27/08/2025
Location: Yokne'am and Tel Aviv-Yafo
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.
Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (InfiniBand, RoCE, Ethernet). This is an outstanding opportunity to get an end-to-end understanding of the AI networking stack. Are you ready to contribute to the development of innovative technologies and help realize our company's vision?
What you will be doing:
Engage with our partners and customers to root cause functional and performance issues reported with NCCL
Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters
Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.)
Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters
Document and conduct trainings/webinars for NCCL
Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)
Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
Experience working with engineering or academic research community supporting HPC or AI
Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control
Expert in Linux fundamentals and a scripting language, preferably Python
Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible)
Adaptability and passion to learn new areas and tools
Flexibility to work and communicate effectively across different teams and timezones
Ways to stand out from the crowd:
Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, especially for large clusters. Experience debugging network configuration issues in large scale deployments
Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning Frameworks such as PyTorch and TensorFlow
Deep understanding of technology and passionate about what you do.
This position is open to all candidates.
 
8321595
Similar jobs that may interest you
27/08/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.
Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance Engineer to influence the roadmap of our communication libraries. The DL and HPC applications of today have a huge compute demand and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (e.g. NVLink, PCIe) within a node and with high-speed networking (e.g. InfiniBand, Ethernet) across the nodes. Communication performance between the GPUs has a direct impact on the end-to-end application performance, and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize our company's vision?
What you will be doing:
Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.
Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack
Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available
Triage and root-cause performance issues reported by our customers
Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information
Collaborate with a very dynamic team across multiple time zones.
Requirements:
M.S. (or equivalent experience) or Ph.D. in Computer Science or a related field, with relevant performance engineering and HPC experience
3+ years of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)
Experience conducting performance benchmarking and triage on large scale HPC clusters
Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals)
Implement micro-benchmarks in C/C++, read and modify the code base when required
Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python
Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker)
Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones
Ways to stand out from the crowd:
Practical experience with Infiniband/Ethernet networks in areas like RDMA, topologies, congestion control
Experience debugging network issues in large scale deployments
Familiarity with CUDA programming and/or GPUs
Experience with Deep Learning Frameworks such as PyTorch and TensorFlow.
This position is open to all candidates.
 
8321604
27/08/2025
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions that were once considered science fiction, from artificial intelligence to autonomous cars. Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable communications software.
DL and HPC applications have huge compute demands and already run at scales of up to tens of thousands of GPUs. GPUs are connected with high-speed interconnects (e.g. NVLink, PCIe) within a node and with high-speed networking (e.g. InfiniBand, Ethernet) across nodes. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the state of the art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize our company's vision?
What you will be doing:
Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.
Design and implement new communication technologies to accelerate AI and HPC workloads.
Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.
Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations.
Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs).
Requirements:
M.S./Ph.D. degree in CS/CE or equivalent experience.
5+ years of relevant experience.
Excellent C/C++ programming and debugging skills.
Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
Deep understanding of operating systems, computer and system architecture.
Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.
Strong experience with Linux.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Expertise in related technology and passion for what you do. Experience with CUDA programming and our company GPUs. Knowledge of high-performance networks like InfiniBand, RoCE, NVLink, etc.
Experience with Deep Learning Frameworks such as PyTorch, TensorFlow, etc. Knowledge of deep learning parallelisms and mapping to the communication subsystem. Experience with HPC applications.
Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.
This position is open to all candidates.
 
8321599
04/09/2025
Location: More than one
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. We are looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
We are looking for highly motivated Senior Software Engineers to work on our GPU NVLink Fabric Networking team. You'll be part of a team responsible for defining the next generation of communications standards and products, building on our current NVLink and NVSwitch technology.
What you will be doing:
Design, develop, and maintain system-level software to enable high-performance GPU-to-GPU communication.
Collaborate closely with cross-functional teams including hardware, firmware, system software to build and deliver next-generation GPU networking solutions.
Contribute to scalable and reliable GPU fabric architecture for large compute clusters.
Align software development with customer needs and real-world deployment environments.
Requirements:
B.S./M.S./Ph.D. in computer science or a related field with 5+ years of relevant experience.
Excellent C/C++ programming and debugging skills, with some familiarity with Python.
Experience writing software applications that interface with device drivers and expose associated hardware functionality.
Solid understanding of computer system architecture, operating system and kernel internals.
Experience with Linux development; familiarity with Windows is a plus.
Background in multi-core / multi-process / multi-threaded programming environment.
Strong understanding of networking fundamentals and high-performance interconnection (e.g., InfiniBand, Ethernet)
Familiarity with OS virtualization technologies like KVM/QEMU/Hyper-V, etc.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Understanding of CUDA programming model and our company GPUs.
Knowledge of memory coherence and consistency models.
Familiarity with static and dynamic code analysis, fuzzing, negative testing, and other techniques.
This position is open to all candidates.
 
8333391
27/08/2025
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large scale performance platforms.
What you'll be doing:
Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.
Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.
Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.
Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.
Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.
Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.
Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.
Drive Innovation: Support R&D activities and engage in proof of concepts (POCs) and proof of values (POVs) for future improvements.
Requirements:
B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.
Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.
Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.
Familiarity with Jenkins, Ansible, Puppet/Chef.
Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu), networking and OS-level security.
Deep understanding of networking protocols such as InfiniBand and Ethernet.
Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.
Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.
Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).
Familiarity with cloud platforms (AWS, Azure, Google Cloud).
Ways to stand out from the crowd:
Architectural Insight: Knowledge of CPU and/or GPU architecture.
Container Expertise: Understanding of Kubernetes and container-related microservice technologies.
GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).
RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8321669
28/08/2025
Location: Yokne'am
Job Type: Full Time
Our company's Networking Performance team develops advanced software tools that drive the optimization of the world's fastest networking technologies. Our mission spans benchmarking, telemetry capture, and performance analysis, enabling both our company's R&D teams and our company's customers across the industry to root-cause bottlenecks, maximize throughput, and achieve world-class performance for AI and HPC workloads.
We are looking for a Senior Software Engineer to lead the design and development of next-generation performance engineering frameworks. In this role, you will focus on Python-based benchmarking and analysis systems, while also contributing to high-performance components in C/C++. You'll work on challenges at the intersection of networking, distributed computing, and AI, building tools that run at scale on clusters, clouds, and data centers.
What you'll be doing:
Design and implement performance benchmarking frameworks for next-generation AI and HPC workloads
Take full technical ownership of our core telemetry engine
Work across Python (primary) and C/C++ (for performance-critical modules) to deliver reliable and scalable tools
Collaborate with experts in networking, AI, and systems to translate performance engineering needs into powerful software solutions
Enhance our DevOps, owning the CI/CD pipelines and release processes for your projects
Drive technical innovation in the performance engineering ecosystem, including taking part in building our next-gen agentic AI assistant.
Requirements:
B.Sc. in Computer Science, or a related engineering field
5+ years of professional software development experience
A proven track record of technical ownership, making key architectural decisions, driving a technical agenda, and problem solving
Expert-level Python development skills, building robust, well-structured, production-grade applications
C/C++ experience, especially for performance-critical or low-level components
Experience with modern CI/CD pipelines and DevOps practices
Ways to stand out from the crowd:
Linux systems knowledge, including software packaging (RPM, DEB), and an understanding of the complexities of software distribution and dependencies
Experience with the Python data analysis and visualization frameworks (e.g., h5py, pandas, NumPy, Matplotlib/Plotly)
Experience with Slurm, Kubernetes, MPI, or other distributed job orchestration and cluster management systems
Familiarity with agentic AI concepts or frameworks (e.g., RAG techniques, LangChain, LangGraph, LlamaIndex, etc.)
Experience contributing to open-source projects.
This position is open to all candidates.
 
8324017
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Realize your potential by joining the leading performance-driven advertising company!
As a Senior MLOps Engineer in the Infra group, you'll play a vital role in developing, enhancing, and maintaining highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How you'll make an impact:
As a Senior MLOps Engineer, you'll:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Directly influence the way billions of people discover the internet.
Requirements:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years of experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environment
Excellent coding skills in Java & Python
Experience with TensorFlow is a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of Computer Science fundamentals: object-oriented design, data structures, systems, applications programming and multi-threaded programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.
This position is open to all candidates.
 
8335971
27/08/2025
Location: Yokne'am
Job Type: Full Time
Our company's networking performance team is developing tools for benchmarking and profiling our company's cutting-edge networking technologies. Our products are used by engineers within our company and across the industry to optimize networking performance quickly and accurately. We are evolving our suite of expert-driven tools to create an AI-powered assistant that can autonomously benchmark and analyze complex networking performance issues in the world's most advanced data centers. This is an opportunity to work on an innovative strategic project, combining systems engineering with cutting-edge generative AI, directly impacting how our company's top customers optimize infrastructure.
What You'll Be Doing:
Build the core components of our AI assistant, translating a visionary architecture into a robust, production-ready system
Enhance and extend our core telemetry capture engine (Python and C++) to provide the high-fidelity data needed for analysis
Develop intelligent agents and workflows using modern AI frameworks to automate the 'benchmark-analyze-remediate' lifecycle
Design and implement sophisticated techniques for grounding LLMs with private data (e.g., RAG), ensuring the assistant's outputs are factually accurate and reliable
Partner closely with our domain experts in performance benchmarking and data analysis to create a cohesive, end-to-end platform.
Requirements:
B.Sc. in Computer Science or a related engineering field
5+ years of professional software development experience; an advanced degree with a strong portfolio of peer-reviewed research, conference presentations, significant open-source contributions, or other demonstrated expertise may be considered equivalent
Experience developing applications with Large Language Models (LLMs), demonstrated through professional work, significant open-source contributions, or advanced academic research projects
Familiarity with modern AI development frameworks (e.g., LangChain, LangGraph, LlamaIndex) and concepts (Agentic AI, MCP servers, etc.)
A collaborative mindset with excellent communication skills, and a passion for mentoring and learning from talented peers
Ways To Stand Out From The Crowd:
M.Sc. or Ph.D. in a relevant field, particularly with a focus on AI or distributed systems
Proven proficiency in both C++ and Python
Experience connecting AI agents to unstructured data sources (e.g., databases, Confluence APIs, knowledge graphs)
A background in data analysis and visualization (e.g., pandas, Jupyter).
This position is open to all candidates.
 
8321836
04/09/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
Run:ai, now part of our company, has evolved AI infrastructure by merging GPU virtualization with Kubernetes-native capabilities. Our world class AI platform allows organizations to improve productivity and efficiency for data scientists and machine learning engineers. With deep Kubernetes expertise and a focus on innovation, we are dedicated to developing cutting-edge technologies, delivering the best user experience for our customers, and providing deep visibility into workload performance through rich metrics that help users optimize their AI workloads. We are looking for highly skilled software engineers to join our Platform Group and help shape the future of AI infrastructure.
The role of a Senior Software Engineer in the Platform Group is to design and develop scalable, high-performance systems that support the next generation of AI workloads. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What you'll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other collaborators to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go
Conducting code and design reviews to uphold high-quality standards and mentor team members.
Requirements:
B.Sc. in Computer Science or a related field.
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (We write in Go).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC or AI/ML platforms.
Familiarity with AI inference workloads and performance optimization.
Proficiency in Linux, with knowledge in networking, security, storage, and virtualization.
This position is open to all candidates.
 
8333587
26/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
Our company has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, our company has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.
We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack, from server-level request batching to GPU kernel fusion, and collaborate with teams across diverse disciplines to push our company's hardware to its full potential.
What you'll be doing:
Cooperate with research teams to onboard new LLMs and VLMs into our company's open-source AI runtimes
Optimize inference workloads using sophisticated profiling and simulation tools
Build SOLID, extendable inference software systems, and refine robust APIs
Implement and debug low-level GPU code to harness the latest HW features
Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products.
Requirements:
B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering
5+ years of relevant hands-on software engineering experience
Profound knowledge of software design principles
Strong proficiency in at least one system and one scripting language
Strong grasp of machine learning concepts
People person with excellent communication skills that enjoys collaboration and teamwork.
Ways to stand out from the crowd:
Familiarity with our company's DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model Optimizer
Proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with our company's accelerators.
Familiarity with LLM quantization, fine-tuning, and caching algorithms
Proficiency in GPU kernel programming (CUDA or OpenCL)
Prior experience working on a large software project with 50+ contributors.
This position is open to all candidates.
 
8320244
26/08/2025
Confidential company
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
Our company has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an employee, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Are you ready to help build next-generation solutions for high-performance web servers and proxy servers, content delivery networks, the financial services market's high-frequency trading (HFT) platforms, and high-performance disaggregated storage? We are looking for an excellent Senior Software Engineer to develop groundbreaking networking acceleration solutions for a variety of markets. Development is done in an exciting, Agile environment on widely deployed products that need constant improvements.
What you'll be doing:
Design and develop high performance networking solutions based on our company's outstanding ConnectX and BlueField networking card hardware.
Work in a startup mode/group developing groundbreaking networking solutions.
Development of the entire solutions stack, from application level to networking card hardware access.
Development of kernel bypassing user space TCP/IP stack on top of our outstanding networking card hardware.
Invent and implement creative ways to improve performance and scalability.
Large scale and high quality deliveries.
Work closely with customers and partners.
Collaborate with multiple teams in our multi-functional environment on developing new features/improvements.
Requirements:
B.Sc. or M.Sc. in Computer Science or Electrical Engineering or equivalent experience.
5+ years of experience in each of the following areas: Software development in C/C++, Networking protocols, Linux environment.
Strong design, coding, analytical, debugging and problem-solving skills.
Ability to quickly adapt to new technology and go deep into new areas.
Independence and agility.
Good social and interpersonal skills.
Ways to stand out from the crowd:
Experience with low latency acceleration and performance improvement.
Experience with Linux user space/driver/kernel development.
Deep knowledge and understanding of TCP/IP stack.
Good view of system architecture and performance.
This position is open to all candidates.
 
8319787