Senior Software Engineer, CPU Performance Modeling Engineer

8 hours ago
Location: Tel Aviv-Yafo
Job Type: Full Time
In this role, you will work with system teams and the CPU Architecture team to develop an understanding of the CPU, SoC, performance metrics, benchmarks/measuring tools, and available optimization knobs. You will define methods and technologies to model CPU performance at different accuracy levels by supporting architectural explorations and decision-making. In addition, you will correlate performance projections with measured post-silicon data.

Responsibilities
Design, develop, test, deploy, maintain, and improve CPU software modeling and other software tools.
Manage individual project priorities, deadlines, and deliverables.
Collaborate with hardware and software CPU architecture teams, the SoC performance modeling team, and other Google Software teams.
Requirements:
Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or equivalent practical experience.
5 years of experience with software development in one or more programming languages, and with data structures/algorithms.
4 years of experience with performance, systems data analysis, visualization tools, or debugging.
Experience in performance modeling, performance analysis, and workload characterization.

Preferred qualifications:
Master's degree or PhD in Engineering, Computer Science, or a related technical field.
Experience in modern, high-performance CPU/ML architecture and micro-architecture.
Ability and interest to learn other coding languages as needed.
Excellent object-oriented, database design, and SQL skills.
This position is open to all candidates.
 
8255913
Similar jobs that may interest you
9 hours ago
Location: Tel Aviv-Yafo and Haifa
Job Type: Full Time
In this role, you will be responsible for ensuring that the Systems-on-Chip (SoCs) meet their power, thermal, and performance goals. You will play a critical role in the pre- and post-silicon validation phases, collaborating with cross-functional teams to identify, debug, and optimize SoC behavior in use cases, as well as validating the IPs' protection mechanism.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities
Contribute to developing and improving post-silicon test content that exercises both IP-level and SoC-level workloads and other compute-intensive scenarios.
Collaborate with architecture, design, and firmware teams to define comprehensive validation plans for SoC features, focusing on power consumption, thermal management, sensor behavior, and performance metrics.
Work with design and firmware teams to propose and implement solutions for optimizing SoC power efficiency and performance, including tuning core-to-memory latencies and bandwidth, power control loops, thermal control loops, etc.
Develop and maintain automated test scripts and frameworks (e.g., Python) to improve validation efficiency and coverage.
Participate in early silicon bring-up and platform bring-up activities, ensuring the stability and functionality of high-power multi-core SoCs.
Requirements:
Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
5 years of experience in post-silicon validation, with power and performance characterization.
Experience validating multi-core CPU/GPU/APU architectures.
Experience in programming languages such as C, C++, and Python for scripting and automation.
Experience with lab equipment such as oscilloscopes, logic analyzers, power meters, and thermal chambers.
Experience with SoC architecture, including interconnects, memory hierarchy, cache coherency, and power management concepts.

Preferred qualifications:
Experience with embedded systems programming (e.g., bare-metal, RTOS, kernel, driver programming).
Experience with version control systems (e.g., Git).
Excellent debugging and root-causing skills for hardware/software issues.
Excellent analytical, problem-solving, and communication skills.
This position is open to all candidates.
 
8255672
04/06/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance Engineer to influence the roadmap of our communication libraries. Today's DL and HPC applications have huge compute demands and run at scales of up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (e.g., NVLink, PCIe) within a node and with high-speed networking (e.g., InfiniBand, Ethernet) across nodes. Communication performance between the GPUs has a direct impact on end-to-end application performance, and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.

Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack.

Evaluate proofs of concept and conduct trade-off analysis when multiple solutions are available.

Triage and root-cause performance issues reported by our customers.

Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information.

Collaborate with a very dynamic team across multiple time zones.
Requirements:
What we need to see:

M.S. (or equivalent experience) or PhD in Computer Science or a related field, with relevant performance engineering and HPC experience.

3+ years of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Experience conducting performance benchmarking and triage on large scale HPC clusters.

Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals).

Ability to implement micro-benchmarks in C/C++ and to read and modify the code base when required.

Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker).

Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Practical experience with InfiniBand/Ethernet networks in areas like RDMA, topologies, and congestion control.

Experience debugging network issues in large scale deployments.

Familiarity with CUDA programming and/or GPUs.

Experience with Deep Learning frameworks such as PyTorch and TensorFlow.
This position is open to all candidates.
 
8203543
8 hours ago
Location: Tel Aviv-Yafo and Haifa
Job Type: Full Time
As a Technical Program Manager for Silicon Development, you will use your technical and management experience to lead the development and execution of complex, multidisciplinary SoC projects. You will plan programs and manage their execution from early concepts through development to tape-out and production. You will collaborate closely with architecture, design, verification, physical implementation and manufacturing teams throughout the SoC execution life cycle. This includes making technical decisions for the chip designs and methodology, driving project schedules, identifying risks and communicating them to all stakeholders, and managing partner teams.

Responsibilities
Plan, coordinate, and deliver custom silicon products.
Assess complexity and scope out the project, generate task lists, build a project timeline, and work with the teams to make it a reality.
Lead the data-driven schedules and milestones, track the progress, proactively identify potential future issues, and identify mitigations with the team leaders.
Drive technical, budgetary, and schedule trade-off discussions with cross-functional teams.
Manage project execution and issues through design, development, test, manufacturing, deployment and sustaining activities for silicon and hardware products.
Requirements:
Bachelor's degree in Computer Science, Electrical Engineering or equivalent practical experience.
8 years of experience in program management.
Experience in program management on technical cross-functional projects.
Experience in one or more areas like architecture, design, verification, implementation, or validation with seven or more cycles of chip development.
Experience in leading, developing and growing teams.

Preferred qualifications:
Master's degree or PhD in Engineering, or in a related technical field.
Experience as an engineer or manager in developing hardware or software systems around the chips.
Experience with two or more chip cycles in a project management role with execution within resource and schedule constraints.
Knowledge of data centers and cloud markets, technological and business trends, requirements, and ecosystem partners.
Ability to motivate and focus a large collaboration to reach goals.
Excellent communication and facilitation skills.
This position is open to all candidates.
 
8255750
04/06/2025
Location: More than one
Job Type: Full Time
We are looking for a versatile Senior Software Engineer for the DPU Platform team. This position offers the opportunity to have real impact in a multifaceted, technology-focused company affecting product lines that empower the most advanced data centers in the world. Using your deep knowledge of embedded platforms, operating systems, and software distribution technologies, you will work with a world-wide development team to solve the unique challenges of delivering the world's most powerful platforms.

What you'll be doing:

Develop system software components including processor firmware and bootloaders, kernel drivers/modules, and user space applications and libraries.

Collaborate with hardware and product design teams to develop software for sophisticated SoC platform designs.

Assist worldwide teams with various customer and internal DPU projects.

Tackle complex system-level optimization and resource utilization challenges.

Participate across all levels of a product development lifecycle that values high standards for clear requirements, software quality, and performance.

Collaborate within a worldwide matrixed software development team, and have broad impact within our highly-dynamic and technology-focused company.
Requirements:
What we need to see:

Bachelor's degree in Computer Science/Engineering or equivalent experience.

5+ years developing software for embedded systems (C is required, Python).

Proven understanding of the system software stack, with a focus on software/hardware interaction, including platform firmware, device drivers, Linux kernel, and how user-space applications utilize system services to achieve high performance.

A deep knowledge of high-performance processor architecture including CPU and cache coherency concepts, as well as hardware accelerators.

Well-rounded engineering skills, including technical investigation, design, testing, and agile software engineering process.

Outstanding written and oral communication skills.

Must be proficient in the C programming language.

Experienced with build environment tools (gcc, Git, GitHub, make, BitBake, shell scripts, Gerrit, Jenkins, etc.).

Ways to stand out from the crowd:

Background with ARMv8 microarchitecture, ATF and/or UEFI software is a strong plus.

Experience with multiple Linux distributions, with the ability to compare and contrast them.

Experience developing security key management solutions is very desirable.

Exposure to secure boot flows and/or trusted computing environments.
This position is open to all candidates.
 
8203438
8 hours ago
Location: Tel Aviv-Yafo and Haifa
Job Type: Full Time
In this role, you will be part of a team developing Application-Specific Integrated Circuits (ASICs) used to accelerate networking in data centers. You will have responsibilities in areas such as project definition, design, and implementation. You will participate in the design, architecture, documentation, and implementation of the next generation of data center accelerators. You will be responsible for performance analysis of a networking stack, using your knowledge of Remote Direct Memory Access (RDMA) based transports.

Responsibilities
Participate in evaluation of future ASIC designs and general architecture for executing Google's data center networking roadmap, off-the-shelf components, vendor co-developments, custom designs, and chiplets.
Collaborate in developing new layer protocols for data center networking.
Understand how everything interacts with software and other ASIC subsystems to implement groundbreaking data center networks.
Define performance hardware/software interfaces. Write micro-architecture and design specifications.
Define efficient micro-architecture and block partitioning/interfaces and flows.
Requirements:
Bachelor's degree or equivalent practical experience.
8 years of experience architecting networking ASICs from specification to production.
Experience with networking design, such as RDMA and/or packet processing, and with system design principles for low latency, high throughput, security, and reliability.
Experience developing RTL for ASIC subsystems.
Experience in cross-functional, micro-architecture, design, verification, logic synthesis, and timing closure.

Preferred qualifications:
Experience with Transmission Control Protocol (TCP), IP, Ethernet, PCIe, Dynamic Random-Access Memory (DRAM), and Network on Chip (NoC) principles and protocols.
Experience working with software teams optimizing the hardware/software interface.
Experience architecting networking switches, end points, and hardware offloads.
Experience in a procedural programming language (e.g., C++, Python, Go).
Experience in estimating performance by analysis, modeling, and network simulation. Ability to define and drive performance test plans.
Understanding of packet classification, processing, queueing, scheduling, switching, routing, traffic conditioning, and telemetry.
This position is open to all candidates.
 
8255921
03/06/2025
Confidential company
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are looking for an excellent Software Engineer to join the InfiniBand Switch and NVLink FW group in Tel Aviv / Ra'anana.
As a team member, you will be part of a major development effort for next-generation networking products. The verification team develops cutting-edge networking features for cloud, HPC and storage. We drive the data growth of the world's biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced. Are you ready for the challenge?

What you will be doing:

As a Senior Software Engineer, you will be part of a group of firmware verification engineers, using your expertise in C++ and Python in an Agile methodology.

Develop the verification process, aligning with quality standards and project timelines.

Design and implement new features in the core of our Switch and GPUs.

Drive and facilitate the planning, scheduling, and execution of the project and activities of the team.

Collaborate with architecture and different software design teams as part of the software development lifecycle.

Work in pre- and post-silicon development environments of next-generation networking products.

Gain a deep understanding of networking technology, system debugging and stacks, as well as the HW/FW/SW relationship.

Innovate! Make our FW switching products shine in customers' eyes.
Requirements:
What we need to see:

B.Sc. degree or equivalent experience in Computer Science, Computer Engineering, or Electrical Engineering.

5+ years of relevant overall professional experience.

Proficient knowledge of C++, Python.

Experience with Embedded SW development.

Strong analytical, creative, debugging, and problem-solving skills.

Detail-oriented and comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

An excellent teammate with good interpersonal skills.


Ways to stand out from the crowd:

Knowledge of network protocols.

Background with CI methodology & tools (Git, Gerrit, Jenkins, etc.).

Background in Linux internals.
This position is open to all candidates.
 
8201438
19/06/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As part of our software group, you will play a pivotal role in developing high-performance software solutions that enable seamless interaction with our company's cutting-edge AI processors.
This role demands technical expertise, a passion for performance optimization, and an eagerness to solve complex runtime challenges. If you're excited about AI, runtime design, and performance-oriented software development, we'd love to meet you!
You will have a direct impact on the efficiency and usability of our company's AI solutions, working on innovative technologies that serve customers across diverse industries.
Responsibilities
Design and implement high-performance software for interacting with our company's AI processors.
Optimize software solutions to achieve minimal CPU usage, memory footprint, and runtime overhead.
Collaborate with cross-functional teams, including hardware engineers and ML experts, to debug and integrate complex hardware/software systems.
Contribute to the development of APIs, frameworks, and infrastructure for efficient AI processing on edge devices.
Solve challenges in multi-threaded and resource-constrained environments.
Requirements:
B.Sc. in Computer Science, Electrical Engineering, or a related field.
5+ years of hands-on experience in C++ development.
Strong understanding of performance optimization techniques in resource-constrained systems.
Proficiency in user-space application development on Linux.
Excellent problem-solving skills and a commitment to high-quality code.
Strong interpersonal skills, team-oriented mindset, and fluent English.
Advantages
Familiarity with runtime libraries and low-level system programming.
Python scripting on Linux.
Experience in open-source projects.
Experience with CMake.
Background in AI accelerators or hardware-software integration.
This position is open to all candidates.
 
8223211
04/06/2025
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are seeking a talented and driven Senior Software Verification Engineer to join our innovative team and tackle SW verification challenges in the domains of high-speed networking, virtualization, and security. You will play a key role in validating and testing complex software products that support Ethernet and InfiniBand protocols, delivering advanced networking, storage, and security services for cloud, compute, and AI workloads.

What You'll Be Doing:

Develop and Automate Testing: Design, implement, and maintain automated test scripts and frameworks (primarily in Python) to verify the correct functionality of our software products.

End-to-End Feature Ownership: Deep dive into feature sets, taking responsibility from test planning through to final implementation and full automation.

System & Integration Validation: Validate software functionality and performance through system-level and integration testing, utilizing Linux-based environments and virtualization tools.

Test Environment Management: Set up, maintain, and optimize test environments using Linux, Docker, virtual machines, and other modern tools.

Collaboration & Communication: Work closely with software, DevOps, architecture, and product teams to define test requirements, coordinate releases, and ensure high-quality product delivery.

Continuous Improvement: Drive design verification flows, contribute to methodology improvements, and leverage planning/tracking systems to manage release progress and build release indicators.

Defect Analysis: Analyze test results, file defects, and track issues to closure, ensuring robust and scalable solutions.
Requirements:
What We Need to See:

Bachelor's/Master's degree in Computer Science or Computer Engineering, or equivalent experience.

5+ years of experience in software testing, QA automation, or software engineering.

Strong proficiency in Python and scripting for automation.

Solid experience with Linux-based environments, including system tools and command-line utilities.

Proven understanding of computer networking and modern Linux operating systems.

Familiarity with software testing, integration, and system validation practices.

Excellent problem-solving, critical thinking, and communication skills.

Ability to work independently, manage multiple tasks, and drive technical initiatives.

Great interpersonal skills, agility, and determination for success.

Fluent English; strong presentation and public speaking abilities.

Ways to Stand Out from the Crowd:

Deep technical know-how and familiarity with networking protocols or low-level system tools.

Experience with Docker, KVM, or other virtualization technologies.

Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI) and test reporting tools (e.g., Allure, Grafana, Kibana).

Experience with large HW+SW systems and advanced Linux OS technologies.

Proficiency with Git, Bash, and other scripting languages.
This position is open to all candidates.
 
8203394
Location: Tel Aviv-Yafo
Job Type: Full Time
Key Responsibilities
Software Development: Design, develop, test, and maintain robust, scalable, and high-quality software applications.
System Architecture: Contribute to architectural decisions, ensuring efficient system design and implementation.
Data Engineering & Analytics: Design and optimize data pipelines, integrating structured and unstructured data sources into data lakes.
Code Quality & Best Practices: Write clean, maintainable, and well-documented code while enforcing coding standards and best practices (SOLID principles, TDD, CI/CD).
Performance Optimization: Identify bottlenecks and optimize application performance, scalability, and security.
Mentorship & Leadership: Mentor junior developers, conduct code reviews, and promote knowledge sharing within the team.
Cross-Functional Collaboration: Work closely with product managers, designers, DevOps, and QA teams to deliver high-quality software solutions.
Problem-Solving: Troubleshoot and resolve complex technical issues across different components of the software stack.
Agile Development: Participate in Agile methodologies, including sprint planning, daily stand-ups, and retrospectives.
Requirements:
Required Skills & Qualifications
Experience: 5+ years of professional software development experience.
Programming Languages: Proficiency in one or more languages such as C#, JavaScript/TypeScript, or Go.
Frameworks & Technologies: Experience with frameworks such as .NET Core & React.
Databases: Strong understanding of relational (PostgreSQL, MySQL) and NoSQL (MongoDB, Redis) databases.
Microservices & APIs: Strong experience in building RESTful APIs and microservices architectures.
Data Lakes & Warehouses (Advantage): Experience working with one of the leading vendors for big data processing, analytics, and storage.
Cloud Platforms (Advantage): Experience with AWS, Azure, or Google Cloud Platform (GCP).
Security & Compliance (Advantage): Understanding of secure coding practices and data protection regulations.
Testing (Advantage): Experience with unit testing, integration testing, and automated testing frameworks.
CI/CD & DevOps (Advantage): Experience with Docker, Kubernetes, GitLab, or other CI/CD tools.
Education (Advantage): Bachelor's or Master's degree in Computer Science, Software Engineering, or relevant army training.
Preferred Skills & Qualifications
Experience working with one of the leading Data Lake & Warehouse vendors for big data processing, analytics, and storage.
Experience with any of the leading cloud platforms: AWS, Azure, or Google Cloud Platform (GCP).
Understanding of secure coding practices and data protection regulations.
Experience with unit testing, integration testing, and automated testing frameworks.
Experience with Docker, Kubernetes, GitLab, or other CI/CD tools.
Experience in distributed systems, event-driven architecture, or message queues (Kafka, RabbitMQ).
Knowledge of front-end development (React, Vue.js, or Angular).
Bachelor's or Master's degree in Computer Science, Software Engineering, or relevant army training.
Contributions to open-source projects or personal coding projects.
Familiarity with observability and monitoring tools like Prometheus, Grafana, or Datadog.
Soft Skills
Strong problem-solving and analytical skills.
Excellent communication and collaboration abilities.
Ability to work in a fast-paced, Agile development environment.
Passion for continuous learning and staying updated with industry trends.
Additional Skills:
Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX).
This position is open to all candidates.
 
8221380
04/06/2025
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (InfiniBand, RoCE, Ethernet). This is an outstanding opportunity to get an end-to-end understanding of the AI networking stack. Are you ready to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Engage with our partners and customers to root cause functional and performance issues reported with NCCL.

Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters.

Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.).

Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters.

Document and conduct trainings/webinars for NCCL.

Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
What we need to see:

B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.

Experience working with engineering or academic research community supporting HPC or AI.

Practical experience with high-performance networking: InfiniBand/RoCE/Ethernet networks, RDMA, topologies, congestion control.

Expert in Linux fundamentals and a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible).

Adaptability and passion to learn new areas and tools.

Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, especially for large clusters. Experience debugging network configuration issues in large-scale deployments.

Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning frameworks such as PyTorch and TensorFlow.

Deep understanding of technology and passionate about what you do.
This position is open to all candidates.
 
8203558