דרושים » חשמל ואלקטרוניקה » Senior Software Architect, AI Networking

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
06/10/2024
משרה זו סומנה ע"י המעסיק כלא אקטואלית יותר
מיקום המשרה: יקנעם ותל אביב יפו
סוג משרה: משרה מלאה
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Job Type: Full Time
We are looking for Senior HPC/AI Solutions Architect to join its our Infrastructure Specialists Team. Academic and commercial groups around the world are using our products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are looking for someone with the ability to work on a dynamic customer focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams, to analyze, define and implement large scale AI/HPC projects. The scope of these efforts includes a combination of Networking, System Design and Automation and being the face to the customer!

What Youll Be Doing:

Primary responsibilities will include building robust AI/HPC infrastructure for new and existing customers.

Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting.

Engage in and improve the whole lifecycle of services from inception and design through deployment, operation, and refinement.

Your primary focus would be on understanding the AI workload and how it interacts with other parts of the system like networking, storage, deep learning frameworks, data cleaning tools, etc.

Help maintain services once they are live by measuring and monitoring progress of AI jobs and helping engineering design solutions for more robust training at scale.

Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

Worldwide travel is required for on-site visits with customers.
דרישות:
What We Need to See:

BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience with Python/ C++ / other software development.

Track record of medium to large scale AI training and understanding of key libraries used for NLP/LLM/VLA training (NeMo Framework, DeepSpeed etc.)

Experience with integration and deployment of software products in production enterprise environments, and microservices software architecture.

You are excited to work with multiple levels and teams across organisations (Engineering, Product, Sales and Marketing team) Capable of working in a constantly evolving environment without losing focus. Ability to multitask in a fast-paced environment.

Driven with strong analytical and problem-solving skills. Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects.

You are a self-starter with demeanour for growth, passion for continuous learning and sharing findings across the team.

Technical leadership and strong understanding of NVIDIA technologies, and success in working with customers.

Excellent verbal, written communication, and technical presentation skills in English.

Ways to Stand Out from The Crowd:

Experience working with large transformer-based architectures for NLP, CV, ASR or other. Experience running large scale distributed DL training.

Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.

Proven experience with one or more Tier-1 Clouds (AWS, Azure, GCP or OCI) and cloud-native architectures and software.

Expertise with parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni Path, and Gig-E).

Strong coding and debugging skills, and demonstrated expertise in one or more of the following areas: Machine Learning, Deep Learning, Slurm, Docker/Kubernetes, Kubernetes, Singularity, MPI, MLOps, LLMOps, Ansible, Terraform, and other high-performance AI cluster המשרה מיועדת לנשים ולגברים כאחד.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917855
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Job Type: Full Time
We are looking for Senior Networking (ETH/IB) Solutions Architect to join its Infrastructure Specialst Team. Academic and commercial groups around the world are using our products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are looking for someone with the ability to work on a dynamic customer focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams, to analyze, define and implement large scale Networking projects. The scope of these efforts includes a combination of Networking, System Design and Automation and being the face to the customer!

What you'll be doing:

Primary responsibilities will include building AI/HPC infrastructure for new and existing customers.

Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, real-time monitoring, logging, and alerting.

Engage in and improve the whole lifecycle of servicesfrom inception and design through deployment, operation, and refinement.

Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.

Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

Worldwide travel is required for on-site visits with customers.
Requirements:
What we need to see:

BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience in networking fundamentals, TCP/IP stack, and data center architecture.

8+ years of experience with configuring, testing, validating, and issue resolution of LAN and InfiniBand networking, including use of validation tools for InfiniBand health and performance including medium to large scale HPC/AI network environments.

Knowledge and experience with Linux system administration/dev ops, process management, package management, task scheduling, kernel management, boot procedures, troubleshooting, performance reporting/optimization/logging, and network-routing/advanced networking (tuning and monitoring).

Driven focus on customer needs and satisfaction. Self-motivated with excellent leadership skills including working with customers.

Extensive knowledge of automation, delivering fully automated network provisioning solutions using Ansible, Salt, and Python.

Strong written, verbal, and listening skills in English are essential.

Ways to stand out from the crowd:

Linux or Networking Certifications.

Experience with High-performance computing architectures. Understanding of how job schedulers(Slurm, PBS) work.

Proven knowledge of Python or Bash. Infrastructure Specialist's delivery experience.

luster management technologies knowledge (bonus credit for BCM (Base Command Manager).)

Experience with GPU (Graphics Processing Unit) focused hardware/software.

Experience with MPI (Message Passing Interface).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917769
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Job Type: Full Time
We are looking for Senior Cloud Infrastructure/DevOps Solutions Architect to join its our Infrastructure Specialist Team. Academic and commercial groups around the world are using our products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are looking for someone with the ability to work on a dynamic customer focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams, to analyze, define and implement large scale Networking projects. The scope of these efforts includes a combination of Networking, System Design and Automation and being the face to the customer!

What you'll be doing:

Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting Manage Linux job/workload schedulers and orchestration tools.

Develop and maintain continuous integration and delivery pipelines .

Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.

Deploy monitoring solutions for the servers, network and storage.

Perform troubleshooting bottom up from bare metal, operating system, software stack and application level.

Being a technical resource, develop, re-define and document standard methodologies to share with internal teams Support Research & Development activities and engage in POCs/POVs for future improvements.

Worldwide travel is required for on-site visits with customers.
Requirements:
What we need to see:

BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience in networking fundamentals, TCP/IP stack, and data center architecture.

Knowledge of HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software.

Direct design, implementation and management experience with cloud computing platforms (e.g. AWS, Azure, Google Cloud).

Experience with job scheduling workloads and orchestration technologies such as Slurm, Kubernetes and Singularity.

Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, wireshark, etc.) and internals, ACLs and OS level security protection and common protocols e.g. TCP, DHCP, DNS, etc.

Experience with multiple storage solutions such as Lustre, GPFS, zfs and xfs. Familiarity with newer and emerging storage technologies.

Python programming and bash scripting experience.

Comfortable with automation and configuration management tools including Jenkins, Ansible, Puppet/Chef, etc.

Deep knowledge of Networking Protocols like InfiniBand, Ethernet Deep understanding and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix).

Strong written, verbal, and listening skills in English are critical.

Ways to stand out from the crowd:

Knowledge of CPU and/or GPU architecture .

Knowledge of Kubernetes, container related microservice technologies.

Experience with GPU-focused hardware/software (DGX, CUDA.)

Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917834
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Job Type: Full Time
We are looking for Senior Industry SA/Customer Success/Partnership Solutions Architect to join its our Infrastructure Specialist Team. Academic and commercial groups around the world are using our products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are looking for someone with the ability to work on a dynamic customer focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams, to analyze, define and implement large scale Networking projects. The scope of these efforts includes a combination of Networking, System Design and Automation and being the face to the customer!

What you'll be doing:

Engage with our Cloud Partners (NCP) to drive initiatives, shape new business opportunities, and cultivate collaborations in the field of Artificial Intelligence (AI), contributing to the advancement of our cloud solutions.

Identify and pursue new business opportunities for our products and technology solutions in datacenters and artificial intelligence applications, closely collaborating with Engineering, Product Management, and Sales teams.

Serve as a technical specialist for GPU and networking products, collaborating closely with sales account managers to secure design wins and actively engaging with customer engineers, management, and architects at key accounts.

Conduct regular technical customer meetings to discuss project and product roadmaps, features, and introduce new technology solutions.

Develop custom product demonstrations and Proof of Concepts (POCs) addressing critical business needs, supporting sales efforts.

Strong technical presentation skills in English, confidence in developing Proofs-of-Concept, and a customer-focused mentality, coupled with good organization skills, a logical approach to problem-solving and effective time management for handling concurrent requests.

Manage technical project aspects of complex data center deployments, including design-in opportunities and responding to RFP/RFI proposals.

Worldwide travel is required for on-site visits with customers.
Requirements:
What we need to see:

BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience in networking fundamentals, TCP/IP stack, and data center architecture.

Ideal candidate possesses 8+ years of Solution Architect or similar Sales Engineering experience, demonstrating motivation and skills to drive the technical pre-sales process.

Deep expertise in datacenter engineering, GPU, networking, including a solid understanding of network topologies, server and storage architecture.

Proficiency in system-level aspects, encompassing Operating Systems, Linux kernel drivers, GPUs, NICs, and hardware architecture.

Demonstrated expertise in cloud orchestration software and job schedulers, including platforms like Kubernetes, Docker Swarm, and HPC-specific schedulers such as Slurm.

Familiarity with cloud-native technologies and their integration with traditional infrastructure is essential.

​Ways to stand out from the crowd:

Knowledge in InfiniBand and Artificial Intelligence infrastructure.

Demonstrated hands-on experience with our systems/SDKs (e.g., CUDA), our Networking technologies (e.g., DPU, RoCE, InfiniBand), ARM CPU solutions, coupled with proficiency in C/C++ programming, parallel programming, and GPU development.

Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center compute/network/storage deployments.

Large scale systems management experience.

Experience with Python programming and AI workflow development and deployment (training/inference) would be advantageous.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917695
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
31/10/2024
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
Are you ready to build innovative, next-generation infrastructure for AI supercomputers and data-centers?

We are looking for an excellent Senior Software Developer to work on our next generation cloud platforms. We are seeking an experienced engineer who is deeply technical, hands-on, and has a wide system view. You will craft, build, and deploy high-performance and scalable clouds based on our outstanding GPU/NVLink, ConnectX NICs and Bluefield DPUs.

The team is responsible for developing high-performance computing and cloud infrastructure, for the worlds largest supercomputers and data-centers. The work environment is educational, dynamic, and challenging as our employees are currently working on innovative, next-generation products at the forefront of technology in terms of performance, scalability, and features.

What you'll be doing:
Design and build innovative features for High-Performance Networking of IaaS in both private and public cloud environments, enhancing functionality and performance.
Develop a high speed networking solution that accelerates HPC and AI workloads using our advanced technologies in cloud environments, e.g. DPU, ConnectX and GPU/NVLink.
Take part in developing our pioneering AI supercomputer.
Work closely with other teams on new products or features/improvements of existing products.
Support, maintain and document software functionality.
Requirements:
What we need to see:
BSc in Computer Science or equivalent program.
5+ years of hands-on experience in software development, preferably with C, Python, Rust and Golang.
Wide hands-on experience with high speed network, e.g. IB, RoCE and NVLink.
Experience with Jenkins, GitLab and/or GitHub.
Strong background in designing, implementing, and debugging sophisticated software.
Highly motivated with strong interpersonal skills, ability to work successfully with multi-functional teams, developers, and architects.
Coordinate effectively across organizational boundaries and geographies.
Strong self-initiative, independence, and flexibility to a new technology.

Ways to stand out from the crowd:
R&D background with OpenStack or IaaS of Cloud
Experience with working on open-source projects
Understanding of HPC/AI systems and related technologies
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7921941
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Job Type: Full Time
We are looking for Senior NIC/DPU Solutions Architect to join its our Infrastructure Specialist Team. Academic and commercial groups around the world are using our products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are looking for someone with the ability to work on a dynamic customer focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams, to analyze, define and implement large scale Networking projects. The scope of these efforts includes a combination of Networking, System Design and Automation and being the face to the customer!

What you'll be doing:

Support GPU, NIC, and networking applications on the converged GPU/DPU/NIC and x86 platforms work on customer production activities, introducing and integrating our networking products to new and existing customers.

Gain customers trust and understand their needs.

Work closely with support cross-functional teams, optimize customer environment, and maintain resiliency.

Help with customer production requirements alongside engineering and product teams.

Address sophisticated and obvious customer issues.

Worldwide travel is required for on-site visits with customers.
Requirements:
What we need to see:

BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience in networking fundamentals, TCP/IP stack, and data center architecture.

8+ years of experience with configuring, testing, validating, and issue resolution of LAN and InfiniBand networking, including use of validation tools for InfiniBand health and performance including medium to large scale HPC/AI network environments.

Knowledge and experience with Linux system administration/dev ops, process management, package management, task scheduling, kernel management, boot procedures, solving, performance reporting/optimization/logging, and network-routing/advanced networking (tuning and monitoring).

Driven focus on customer needs and satisfaction. Self-motivated with excellent leadership skills including working with customers.

Strong written, verbal, and listening skills in English are critical.

Ways to stand out from the crowd:

Familiarity with the InfiniBand protocol and RDMA concepts.

Having experience with GPUs, CUDA, GPUDirect or NVIDIAS'a Bluefield Data Processing Unit (DPU).

Experience with high-performance computing architectures. Understanding of how job schedulers(Slurm, PBS) work.

Coding development experience with multiple programming languages (from low-level C programming language to high-level languages such as Python/Bash.)

Cluster management technologies knowledge and bonus credit for BCM (Base Command Manager.)
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917721
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
31/10/2024
Location: Yokne`am
Job Type: Full Time
We are looking for an outstanding candidate for a Senior System Networking Engineer role in HPC/AI E2E Verification team. Be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in InfiniBand networking technologies and High-performance computing. You will work with the latest InfiniBand based Switches, HCAs, AI servers and Software, together with many researchers, Architects and developers leading differentiated InfiniBand HPC/AI solutions.

What You Will Be Doing:

As a Senior System InfiniBand Networking Engineer, you will play a crucial role in crafting and implementing innovative architectures for high-performance computing systems, enabling efficient and scalable computation for AI/ML applications and HPC Benchmarks.

Collaborating closely with multi-functional teams, including hardware engineers, software developers, and domain experts, to deliver optimized solutions that meet the demanding requirements of HPC/AI workloads.

Planning, Reviewing, and Executing complexed End-to-End scenarios with strong emphasis on Scalability, Performance, and Functionality of our InfiniBand HPC/AI solutions ensuring alignment with our Networking & AI specifications.

Analyzing test results and generating detailed reports for stakeholders to facilitate informed decision-making.

Drive continuous improvement initiatives, identifying opportunities to enhance verification processes and methodologies in the context of our Networking & AI solutions.
Requirements:
What We Need To See:

Bachelor's/Masters degree in electrical engineering, Computer Science, or equivalent experience in Networking/System field.

8+ years experience driving large-scale complexed solutions with strong emphasis on networking troubleshooting and Performance analysis.

In depth experience and understanding of Linux based networking systems.

Strong analytical and problem-solving skills.

Excellent communication and interpersonal skills.

Ability to work effectively in a collaborative, fast-paced environment.

Ways To Stand Out From The Crowd:

In-depth knowledge of InfiniBand XDR/NDR technology, our Networking, and AI architectures, protocols, and standards.

Expertise in High-performance computing and Machine learning.

Experience in AI Application benchmarks and Distributed job scheduling.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7921710
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
31/10/2024
Location: Ra'anana and Yokne`am
Job Type: Full Time
Our Networking BU is looking for Senior Software Program Manager that will be responsible for software programs and projects. The PM should drive planning and execution of FW/SW projects while aligning with corporate priorities and constraints.

Our Mellanox Networking division is a leading supplier of innovative end-to-end InfiniBand and Ethernet connectivity solutions and services for servers and storage. We offer best-in-class solutions that include adapter cards, switches, cables, and software to support networking technologies. Our products optimize data center performance and deliver industry-leading bandwidth and scalability. In addition, we serve a wide range of markets including high-performance computing, enterprise, data centers, cloud computing, big data and Web 2.0. We are constantly reinventing ourselves to stay ahead of the market and bring groundbreaking products and services to the industry. Our product line is focusing on delivering the most optimized Ethernet solutions for industries like Media and Entertainment as well as any other industry that can benefit from our Datastream and TCP/IP acceleration.

What you'll be doing:
You will manage the networking software programs for our next generation AI Data centers.
Responsible to coordinate between all project stakeholders such as marketing, engineering teams in IL and around the world, operations, etc. from initial requirements definition through Architectural stage, execution, and delivery.
Develop and execute feature planning and prioritization of perception capabilities to meet the software programs' needs.
Identify risks, gaps, and bottlenecks in time, and find resolution with technical leaders and project management.
Work with product managers, architects, and engineers to ensure consistency with company strategy, commitments, and goals.
Requirements:
What we need to see:
B.Sc. or M.Sc. in Computer Science, Electrical Engineering, or related field.
Expert with software project management methodologies and tools.
8+ years experience in software project management or leadership.
Experience in software development over hardware/Silicon products.
Teammate, independent, responsible, capable of multi-tasking, ability to drive people and tasks.
Excellent verbal and written communication skills with English proficiency.
Ability and willingness to work in a dynamic environment and flexible hours, with teams all over the world.

Ways to stand out from the crowd:
Technical orientation, including the ability to conduct technical discussions.
Experience with tools such as MS Excel, MS Project, Power BI.
Networking background.
Experience in multiple groups coordination.
Familiarity with SW Agile concept.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7921310
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
29/10/2024
Location: Yokne`am
Job Type: Full Time
We are looking for an AI & HPC Clusters's group manager to join Cloud Solutions group. In this role, you will build, manage, and maintain the biggest cluster in NVIDIA Networking R&D to validate and test next-generation networking cloud technology and Reference Architecture that are being released to our customers. We are currently working on next generation BlackWell GPU Platform AI clouds with our XDR (800G InfiniBand) and SpectrumX800 next generation technology. Come join the team and see how you can make a lasting impact on the world.

What youll be doing:

Lead a group that is responsible for building, managing, and maintaining SW R&D clusters composed of Linux, Windows, and VMware systems, x86 and ARM CPU, GPU, Ethernet, and InfiniBand technologies.

Work closely with the engineering and architecture teams to understand, plan and build new clusters for validating and testing new NVIDIA Networking technology solutions.

Drive the design and implementation of automatic systems to deploy, configure, maintain, and monitor these clusters.

Drive the design and implementation of resource management systems for multiuser environments with different needs on these clusters.

Manage R&D lab including inventory, power, space, and cooling.

Build, expand, and mentor the team to address growing demands and requirements.

Innovate! Influence on NVIDIA Networking cluster management tools to shine in customers view.
Requirements:
What We Need to See:

A degree in Computer Science, Engineering, or a related field.

5+ years of managerial experience including managers management.

10+ years of relevant overall professional experience.

Experience in Data center management from a multidisciplinary company, including handling power, cooling, and space.

Experience in managing HPC/AI clusters.

Deep understanding of operating systems, computer networks, and high-performance hardware.

Deep knowledge of distributed resource scheduling systems and orchestration tools such as Slurm, K8s.

Strong organizational and project management skills, comfortable with multitasking in a dynamic environment with shifting priorities and changing requirements.

Enthusiastic and ambitious personality, encouraging a positive and productive work environment.

Ways to Stand Out From the Crowd:

Knowledge of HPC and AI solution technologies from CPUs and GPUs to high-speed interconnects and supporting software.

Familiarity with CUDA and managing GPU-accelerated computing systems.

Experience and knowledge of InfiniBand.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7917642
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
 
נאספה מאתר אינטרנט
20/11/2024
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Associate Solutions Architect - Early Career Program 2025
DESCRIPTION
The role starts on March or September 2025.
Are you passionate about collaborating with technology and business leaders to deliver cloud-based solutions?
We are seeking recent graduates, early career professionals, and advanced career professionals with limited technical experience interested in jump-starting a career as an Associate Solutions Architect and participating in the AWS Tech U program to advance their technical and professional skills.
AWS Tech U is 12 month accelerated workforce development program focused on helping recent graduates and early career professionals build technical and professional skills to jump-start technical careers at AWS. The program features a six-month cohort-based training followed by six months of on-the-job-training. You will complete specialized curriculum, shadow AWS experts, and receive coaching and on-the-job training to help you succeed in your new role and work towards.
Residents will also prepare to take the certification exams for AWS Certified Cloud Practitioner and AWS Certified Solutions Architect Associate. During the programs On-the-Job training phase, you will prepare for additional certifications that align to your role.
Are you ready to embrace the challenge? Come build the future with us.
Associate Solutions Architect
As an Associate Solutions Architect, youll help your customers successfully implement cloud technologies. Youll solve complex, technical challenges so your customers can focus on their business. This includes using your knowledge to craft scalable, flexible, and resilient cloud architectures. Youll drive technical solutions discussions, diving deep into the details with customer teams.
Building relationships to understand our customers is key. As a trusted technical advisor, youll use your interpersonal skills to influence a variety of stakeholders from technical teams to executives. Youll help ensure their short-term technology decisions are aligned with their long-term goals. Speeding up the adoption of our services will be part of your day to day.
Youll also act as an evangelist in the wider community. This includes taking part in educating, sharing best practices, presenting at events, writing white papers, blogs, and running workshops. You wont just want to be part of an industry movement; youll want to be leading it. As a Builder, youll also have the chance to shape the direction of our products and services. This is through gathering feedback from customers whilst collaborating with our engineering and service teams.
Requirements:
BASIC QUALIFICATIONS
- A degree in Computer Science / Engineering / Mathematics / Technology / Related science/technical field OR equivalent training, certifications, and/or experience.
- Interest and aptitude to learn about and deliver cloud-based solutions to customers.
- Experience with one of the programming languages like Java, Python, Ruby, Node.js, C#, or C++ OR the interest and technical ability to learn a programming language.
- Written and verbal communication skills and ability to effectively articulate technical challenges and solutions to both large and small audiences.
- Fluent proficiency in English & Hebrew
PREFERRED QUALIFICATIONS
- Demonstrated ability to adapt to new technologies and learn quickly.
- Experience with Networking fundamentals including Security, Storage or Databases (Relational and/or NoSQL), Operating Systems (Unix, Linux, and/or Windows)
- Experience with one or more of the following domains: systems administration (Linux/Window), network administration (DNS, IPsec, BGP, VPN, Load Balancing), or programming (Node.JS, Java, Ruby, C#, Python, or PHP).
- Experience around implementing cloud-based technology solutions - in a school project or while working for a company.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
7947821
סגור
שירות זה פתוח ללקוחות VIP בלבד