7 days
Confidential company
Location: Yokne'am and Tel Aviv-Yafo
Job Type: Full Time
We are seeking an experienced and motivated CAD Engineer to join our CAD development team. We are looking for candidates with a strong hardware background and robust software skills, passionate about advancing the development and application of non-EDA tools for circuit analysis. This role involves deep studies of circuit behavior at both the standard cell (STDCELL) and block levels, and offers the opportunity to collaborate with multiple teams across the company.

What You'll Be Doing:

Develop and maintain CAD solutions for both transistor-level and block-level circuit analysis.

Conduct studies of circuit behavior at both STDCELL and block levels, informing tool development and design improvements.

Work with our quality teams on transistor-level analysis to enhance circuit reliability and performance.

Work with other teams to identify, develop, and deploy advanced CAD solutions addressing design challenges.

Engage directly with design projects, leveraging CAD tools to solve and improve complex circuit design challenges.
Requirements:
What We Need to See:

Bachelor's degree in Computer Science, Engineering, or equivalent experience.

Experience in CAD development for circuit analysis.

Hands-on experience with SPICE simulations.

2+ years of experience in VLSI Design Automation.

Strong knowledge of Python programming.

Ways to Stand Out from the Crowd:

Deep knowledge in the circuit domain.

Proven track record of developing innovative CAD solutions.

Understanding of STDCELL internal design and architecture.
This position is open to all candidates.
 
8204163
Similar jobs that may interest you
7 days
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are at the forefront of AI-driven innovation in VLSI design automation. Join us to shape the future of semiconductor design with cutting-edge AI tools and make a significant impact in a collaborative, high-performance environment. Are you ready to push the boundaries of what's possible in VLSI CAD? Come be part of our pioneering team!

What you'll be doing:
You will be responsible for developing and integrating advanced CAD solutions and automation flows using AI and machine learning for VLSI design, verification, and implementation.
Work closely with design, verification, and CAD teams to identify areas for improving VLSI workflows using advanced tools and methods.
Research, prototype, and deploy AI-based algorithms.
Develop and maintain scripts and automation infrastructure to enable seamless adoption of AI tools in the VLSI design process.
Continuously review emerging AI technologies and methodologies to keep our CAD environment up-to-date.
Provide technical support and training to engineering teams on AI-enabled CAD flows and best practices.
Requirements:
What we need to see:
B.Sc./M.Sc. in Electrical Engineering, Computer Engineering, Computer Science, or equivalent experience.
5+ years of experience in VLSI CAD tool development, with a strong focus on integrating AI/ML techniques into EDA workflows.
Proficiency in Python and at least one AI/ML framework (such as TensorFlow, PyTorch, or scikit-learn).
Hands-on experience with VLSI physical design and familiarity with industry-standard EDA tools (e.g., Synopsys, Cadence).
Knowledge of data preprocessing, feature engineering, and model deployment as applied to VLSI design challenges.
Experience developing and maintaining automation scripts (Python, Perl, Tcl, Make).
Strong analytical skills in evaluating the impact of AI solutions on design quality, performance, and productivity.
Excellent communication skills and the ability to work cross-functionally in a fast-paced environment.
Self-motivation, attention to detail, and a track record of delivering robust solutions to production.

Ways to stand out from the crowd:
Demonstrated experience deploying AI/ML models in production VLSI CAD environments.
Contributions to open-source AI/EDA projects or publications in relevant conferences/journals.
Deep understanding of VLSI design challenges (such as timing closure, power optimization, or yield enhancement) and how AI can address them.
Experience with cloud-based or distributed compute environments for large-scale AI training and inference.
Strong ownership, curiosity, and a passion for continuous learning and innovation.
This position is open to all candidates.
 
8203338
26/05/2025
Confidential company
Job Type: Full Time
We are looking for a top ASIC Engineer with a curiosity about SOC design automation, RTL integration, chip build and assembly, and padring design and verification. You should have real passion for methodologies and automation solutions that enable SOC creation in the most optimized way.

In this position, you will get the opportunity to build complex networking chips and interact directly with unit-level ASIC, Physical Design, CAD, Package Design, Software, DFT and other teams.

What you'll be doing:

Implement chip level design through collaboration with cross-functional teams (Functional Design, DFT, Design Verification, System Verification, STA, and Physical Design).

Work on a variety of functional and structural challenges, including functional debug, physical design readiness, emulation, and resolving design quality issues.

Daily work involves aspects of chip-level design, including partitioning, CDC, RDC, trial synthesis, and design quality checks.

Take part in flow development and deployment.
Requirements:
What we need to see:

B.Sc./M.Sc. in Electrical Engineering/Computer Engineering.

2+ years of proven experience in chip design.

Solid hands-on RTL design skills in SystemVerilog.

Proficiency in at least one scripting language, such as Python, Bash, or Tcl.

Great teammate.

Ways to stand out from the crowd:

Passion for quality. Experience with delivery to physical design, emulation, firmware and other customers.
This position is open to all candidates.
 
8193584
28/05/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are looking for a best-in-class Physical Design CAD Engineer to join our outstanding Networking Silicon engineering team, developing the industry's best high-speed communication devices, delivering the highest throughput and lowest latency! Come and take part in designing our groundbreaking and innovative chips, and enjoy working in a meaningful, growing, and highly professional environment where you make a significant impact in a technology-focused company.

What you'll be doing:

You will be in charge of developing physical design, synthesis, STA, and logic equivalence methodologies for the implementation of networking chips and SOCs.

Work closely with block owners, full-chip STA engineers, and project managers to ensure high quality and timely convergence.

Come up with unique and creative solutions to the state-of-the-art physical design problems that our chips pose.

Additional responsibilities include participating in and developing flow and tool methodologies for chip floorplan, power and clock distribution, P&R, timing analysis and closure, power and noise analysis, and back-end verification across multiple projects.
Requirements:
What we need to see:

B.Sc./M.Sc. in Electrical Engineering/Computer Engineering (or equivalent experience).

At least 2 years of relevant experience.

Proficiency using Python, Perl, Tcl, Make scripting.

Expertise in analysing and converging crosstalk delay, noise glitch, and electrical/manufacturing rules in deep-submicron processes.

Knowledge of physical design optimization techniques (e.g., placement, routing, cell sizing, buffering, logic restructuring) for improving timing and power, and of implementing them through ECOs, is required.

Knowledge of process variation effect modelling and experience in design convergence that takes variations into account.

Successful track record of delivering designs to production is necessary.

Self-motivation, attention to detail, and good written, verbal, and presentation skills are critical to success in this role.

Ways to stand out from the crowd:

Familiarity with synthesis, place-and-route, and STA EDA tools from Synopsys (DC/FC/PT) and Cadence (Innovus/Tempus).

Experience in methodology definition, or as flow owner of synthesis, place-and-route, or STA steps, is an advantage.

Great teammate.

Ownership, self-learning skills, and ability to work autonomously.
This position is open to all candidates.
 
8196280
7 days
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are seeking a talented and driven Senior Software Verification Engineer to join our innovative team and tackle SW verification challenges in the domains of high-speed networking, virtualization, and security. You will play a key role in validating and testing complex software products that support Ethernet and InfiniBand protocols, delivering advanced networking, storage, and security services for cloud, compute, and AI workloads.

What You'll Be Doing:

Develop and Automate Testing: Design, implement, and maintain automated test scripts and frameworks (primarily in Python) to verify the correct functionality of our software products.

End-to-End Feature Ownership: Deep dive into feature sets, taking responsibility from test planning through to final implementation and full automation.

System & Integration Validation: Validate software functionality and performance through system-level and integration testing, utilizing Linux-based environments and virtualization tools.

Test Environment Management: Set up, maintain, and optimize test environments using Linux, Docker, virtual machines, and other modern tools.

Collaboration & Communication: Work closely with software, DevOps, architecture, and product teams to define test requirements, coordinate releases, and ensure high-quality product delivery.

Continuous Improvement: Drive design verification flows, contribute to methodology improvements, and leverage planning/tracking systems to manage release progress and build release indicators.

Defect Analysis: Analyze test results, file defects, and track issues to closure, ensuring robust and scalable solutions.
Requirements:
What We Need to See:

Bachelor's/Master's degree in Computer Science or Computer Engineering, or equivalent experience.

5+ years of experience in software testing, QA automation, or software engineering.

Strong proficiency in Python and scripting for automation.

Solid experience with Linux-based environments, including system tools and command-line utilities.

Proven understanding of computer networking and modern Linux operating systems.

Familiarity with software testing, integration, and system validation practices.

Excellent problem-solving, critical thinking, and communication skills.

Ability to work independently, manage multiple tasks, and drive technical initiatives.

Great interpersonal skills, agility, and determination for success.

Fluent English; strong presentation and public speaking abilities.

Ways to Stand Out from the Crowd:

Deep technical know-how and familiarity with networking protocols or low-level system tools.

Experience with Docker, KVM, or other virtualization technologies.

Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI) and test reporting tools (e.g., Allure, Grafana, Kibana).

Experience with large HW+SW systems and advanced Linux OS technologies.

Proficiency with Git, Bash, and other scripting languages.
This position is open to all candidates.
 
8203394
28/05/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are looking for a passionate Senior Software Engineer to join our Chip Design CAD team. In this role, you will work at the interesting junction of electrical and software engineering, developing SW tools and solutions used to design the next generation of Smart Adapters, Switches, and GPUs for advanced data centers across the world.

What you'll be doing:

Develop innovative SW solutions.

Take part in defining methodologies affecting design and verification engineers' day-to-day work.

Improve existing tools, which are used in-house by electrical engineers for designing the next generation SoCs and Switches.

Collaborate with designers, verification specialists and architects to accomplish your tasks.
Requirements:
What we need to see:

Computer Science degree with excellent grades.

5+ years of practical experience in C++ development in a Linux environment.

Motivated, responsive, and keen on process improvement.

Strong analytical, debugging and problem-solving skills.

Strong programming skills.


Ways to stand out from the crowd:

Experience in a DA/CAD team.

Knowledge of FE/BE chip design or dynamic verification.

Strong C++/Python programming skills.
This position is open to all candidates.
 
8196237
7 days
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (InfiniBand, RoCE, Ethernet). This is an outstanding opportunity to get an end-to-end understanding of the AI networking stack. Are you ready to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Engage with our partners and customers to root cause functional and performance issues reported with NCCL.

Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters.

Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.).

Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters.

Document and conduct trainings/webinars for NCCL.

Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
What we need to see:

B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.

Experience working with engineering or academic research community supporting HPC or AI.

Practical experience with high performance networking: InfiniBand/RoCE/Ethernet networks, RDMA, topologies, congestion control.

Expert in Linux fundamentals and a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible).

Adaptability and passion to learn new areas and tools.

Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, especially for large clusters. Experience debugging network configuration issues in large-scale deployments.

Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning frameworks such as PyTorch and TensorFlow.

Deep understanding of technology and passion for what you do.
This position is open to all candidates.
 
8203558
7 days
Location: More than one
Job Type: Full Time
We are looking for a versatile Senior Software Engineer for the DPU Platform team. This position offers the opportunity to have real impact in a multifaceted, technology-focused company affecting product lines that empower the most advanced data centers in the world. Using your deep knowledge of embedded platforms, operating systems, and software distribution technologies, you will work with a world-wide development team to solve the unique challenges of delivering the world's most powerful platforms.

What you'll be doing:

Develop system software components including processor firmware and bootloaders, kernel drivers/modules, and user space applications and libraries.

Collaborate with hardware and product design teams to develop software for sophisticated SOC platform designs.

Assist world-wide teams with various customer and internal DPU projects.

Tackle complex system-level optimization and resource utilization challenges.

Participate across all levels of a product development lifecycle that values high standards for clear requirements, software quality, and performance.

Collaborate within a worldwide matrixed software development team, and have broad impact within our highly-dynamic and technology-focused company.
Requirements:
What we need to see:

Bachelor's degree in Computer Science/Engineering or equivalent experience.

5+ years developing software for embedded systems (C is required, Python).

Proven understanding of the system software stack, with a focus on software/hardware interaction, including platform firmware, device drivers, Linux kernel, and how user-space applications utilize system services to achieve high performance.

A deep knowledge of high-performance processor architecture including CPU and cache coherency concepts, as well as hardware accelerators.

Well-rounded engineering skills, including technical investigation, design, testing, and agile software engineering process.

Outstanding written and oral communication skills.

Must be proficient in the C programming language.

Experienced with build environment tools (GCC, Git, GitHub, Make, BitBake, shell scripts, Gerrit, Jenkins, etc.).

Ways to stand out from the crowd:

Background with ARMv8 microarchitecture, ATF and/or UEFI software is a strong plus.

Experience with multiple Linux distributions, with the ability to compare and contrast them.

Experience developing security key management solutions is very desirable.

Exposure to secure boot flows and/or trusted computing environments.
This position is open to all candidates.
 
8203438
28/05/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
Our Networking Clock design team is looking for an experienced, top-notch ASIC design engineer to work on the next generation of our networking chips. We're looking for a profound, multidisciplinary background in clock design domains to lead clock micro-architecture activities. This role requires working with multiple teams, such as Architecture, IP, Physical Design, Timing, and Post-Si. The complexity of clocking schemes has grown substantially over recent chip generations, with increased focus on performance, power, and quality. Modern clocking design needs to balance high-frequency clocks with power, DFx, noise, circuit, and physical design constraints.

What you will be doing:
Work on the next generation of Networking Switch, NIC, and SoC products.
Micro-architect and design next-generation clock topologies and modules.
Define ASIC clock schemes.
Improve Power, Performance, and Area (PPA) of state-of-the-art chips by evaluating trade-offs across DFx, physical implementation, power optimization, and ease of timing closure to innovate and implement new clocking topologies in RTL.
Collaborate with the physical design and timing teams to evaluate clocking concerns and come up with solutions for supporting high-speed clocking.
Understand physical aspects of the chip and develop enhanced clock distribution techniques.
Get involved in the end-to-end cycle of ASIC execution, from micro-architecture, design implementation, design fixes, and sign-off checks all the way to silicon bring-up.
Support Post-Si debug, characterization and productization activities.
Requirements:
What we need to see:
B.Sc. or M.Sc. degree in EE from a recognized university, or equivalent experience.
At least 5 years of work experience in RTL design, gate-level and circuit design optimization.
Deep understanding of logic optimization techniques and PPA trade-offs.
Excellent interpersonal skills and ability to collaborate with multiple teams.
Excellent problem solving and debugging skills.

Ways to stand out from the crowd:
Prior experience in RTL design (Verilog), verification and synthesis.
Profound knowledge of clock IPs: PLL, DLL, compensator.
Understanding of sub-micron silicon issues like noise, cross-talk, and OCV effects is a bonus. Prior experience in implementing on-chip clocking networks.
This position is open to all candidates.
 
8196107
26/05/2025
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms.

What you'll be doing:

Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.

Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.

Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.

Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.

Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.

Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.

Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.

Drive Innovation: Support R&D activities and engage in proof of concepts (POCs) and proof of values (POVs) for future improvements.
Requirements:
What we need to see:

B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.

Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.

Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.

Familiarity with Jenkins, Ansible, Puppet/Chef.

Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu), networking and OS-level security.

Deep understanding of networking protocols such as InfiniBand and Ethernet.

Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.

Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.

Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).

Familiarity with cloud platforms (AWS, Azure, Google Cloud).

Ways to stand out from the crowd:

Architectural Insight: Knowledge of CPU and/or GPU architecture.

Container Expertise: Understanding of Kubernetes and container-related microservice technologies.

GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).

RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8193596