Senior Software Configuration Management Engineer - Hardware

Posted 2 days ago
Location: Yokne`am and Tel Aviv-Yafo
Job Type: Full Time
What you'll be doing:

Responsible for the full SCM environment including application, OS, and server hardware components, developing the continued automation and innovation needed for our large environment.

Create new solutions to improve the reliability and performance of our ever-growing infrastructure, and work with automated orchestration tools to deploy those improvements to hundreds of systems worldwide.

Be part of a global team: evaluate technology alternatives, work closely with other project members to specify solutions, craft schedules, and lead ongoing enhancements and support.

Lead or contribute to GitLab upgrades and migrations, architectural reviews and design docs, root cause analysis and systemic fixes.

Learn and greatly improve the daily productivity of the world's top chip designers and software engineers.
Requirements:
What we need to see:

MS (preferred) or BS in Computer Science or a related field (or equivalent experience), with at least 5 years of experience.

Deep understanding of SCM processes and large-scale, multi-site GitLab environments (experience with other SCM tools such as Perforce, Subversion, or ClearCase is a plus).

You've configured/deployed Continuous Integration (CI) and Continuous Deployment (CD) systems in your past experience.

Excellent interpreted-language skills highly desired (object-oriented Perl or Python preferred); strong software engineering process skills required.

Strong skills in scripting and object-oriented languages such as Python or Ruby, with solid software engineering practices and familiarity with design patterns.

Hands-on experience with relational databases (PostgreSQL preferred).

Experience with DevOps or system administration on Linux systems required (Rocky Linux 8, CentOS/RHEL, and Ubuntu preferred).

Strong experience with automation required (Ansible or Puppet preferred), along with excellent interpersonal skills, including written and verbal communication.

You are comfortable with, and enjoy working in, dynamic and ever-evolving environments.

Ways to stand out from the crowd:

Meticulous and proactive problem-solver with a positive, can-do attitude.

Demonstrated out-of-the-box thinking for creative solutions to highly sticky problems.

Fun and enthusiastic teammate who enjoys a challenge and celebrates success.
This position is open to all candidates.
 
Job ID: 8586941

Similar jobs that may interest you

Posted 2 days ago
Confidential company
Location: Yokne`am
Job Type: Full Time
We are seeking a DevOps Manager to lead a team of engineers in developing and supporting scalable CI/CD processes for our Networking products, enhancing global R&D efficiency. The role requires over five years of DevOps experience, leadership skills, proficiency with Jenkins and Git SCM, and programming in Python, Java, or Groovy. It involves collaborating with distributed teams to improve development methodologies and infrastructure. Preferred qualifications include experience with Linux system administration, containerization tools like Docker and Kubernetes, and managing complex infrastructure projects in fast-paced environments. Join us and help us take our CI/CD infrastructure to the next level! You will have a unique opportunity to have a real impact in a dynamic, technology-focused company shaping the future of networking and data-center technologies. If you bring excellence and creative thinking to solving issues as they arise, and if you enjoy working with distributed teams, your place is with us!

What you'll be doing:

Supervise a team of 4+ experienced and highly motivated DevOps engineers.

Architect, implement and support scalable CI processes for multiple NVIDIA Networking products.

Closely collaborate with DevOps, IT and security teams across the globe to share, promote and improve CI practices.

Build infrastructure for effective, high quality, reliable development and release processes for our products.
Requirements:
What we need to see:

Bachelor's/Master's degree in Computer Science, Engineering, or equivalent experience.

4+ years leading DevOps organizations at a mid-size or enterprise company.

8+ years of overall hands-on experience in DevOps, with a focus on CI/CD and infrastructure development.

Familiarity with Linux OS distributions.

Strong knowledge of build systems: Jenkins or similar.

Proven knowledge of Git SCM administration.

Proficiency in Python, Java, Groovy or related programming languages.

Proven success building and scaling teams in fast-paced, high-growth environments.

Agility, excellent communication skills, customer-facing approach.

Ways to stand out from the crowd:

Solid understanding of Linux system administration.

Demonstrated history of building robust and scalable CI/CD infrastructures for complex enterprise products.

Familiarity with containerization and orchestration tools such as Docker and Kubernetes.
This position is open to all candidates.
 
Job ID: 8586879

Posted 2 days ago
Location: Yokne`am and Tel Hai
Job Type: Full Time
We are looking for a Senior Software Engineer to join the NSV tools (Network Solutions Validation) group. As a senior team member, you will be part of the effort to develop high-performing software automation systems for our data center environments. You will interact with NIC, OS, Switch, HCA, CPU and GPU compute as well as architects, network engineers, and developers. We drive the data growth of the world's biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced. Are you ready for the challenge?

What you'll be doing:

Design and develop an automation platform used to provision, configure, and monitor HPC data centers.

Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.

Collaborate closely with internal and external stakeholders to understand requirements and deliver robust full-cycle solutions.

Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.

Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.

Work cross-functionally with architects, DevOps engineers, product managers and stakeholders to ensure high-quality releases.

Participate in code reviews, technical design discussions, and continuous improvement activities within the team.
Requirements:
What we need to see:

B.Sc. in Computer Science, Engineering, or a related field (or equivalent practical experience).

5+ years of strong hands-on experience on Linux-based platforms.

Proficient scripting and automation skills (Bash, Python, Ansible).

Background in DevOps and Network Engineering practices.

Hands-on experience with large-scale network architectures, switches/routers, OVS, SR-IOV, and network operating/management systems.

Networking expertise: Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN, and common network topologies.

Practical experience with containers and cloud-native technologies (Docker, Kubernetes) and networking performance.

Experience with version control systems (Git) and CI/CD pipelines.

Independent, fast learner with strong ownership mindset, excellent debugging and problem-solving skills, and effective communication abilities.

Ways to stand out from the crowd:

Experience as a Team Lead / Scrum Master or in a similar leadership role.

Experience in planning, tracking, and delivering projects.

Familiarity with DevOps methodologies and tools (e.g., Jenkins, Ansible).

Hands-on experience with Docker and containerized environments.

Experience with agentic AI development.
This position is open to all candidates.
 
Job ID: 8586566

Posted 6 days ago
Confidential company
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are now looking for an HPC Operations Engineer to join our mission and continue improving our HPC infrastructure. A meaningful part of our strength is our unique and advanced development tools and environments that enable our incredible pace of innovation. We are looking for architects to help us evolve the way our private compute cloud is architected and optimized.

What you'll be doing:

Troubleshoot incoming support requests in a large-scale HPC environment.

Contribute enhancements to existing deployment automation, configuration management, observability, and operational monitoring, and streamline day-to-day operations through automation.

Ensure compute servers are running the correct operating system and configuration.

Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.

Collaborate with specialist teams to drive issues to closure.

Collaborate with domain experts to improve how our chip development process utilizes our infrastructure.

Directly contribute to the overall quality and improve time to market for our next generation chips.
Requirements:
What we need to see:

BS in Computer Science or a similar degree, or equivalent experience.

2+ years of experience administering CentOS/RHEL Linux distributions.

Understanding of container technologies like Docker.

Proficiency in Python and UNIX scripting languages such as bash.

Excellent problem-solving skills, with the ability to analyze complex systems, identify bottlenecks, and implement scalable solutions.

Excellent communication and teamwork skills, with the ability to work effectively with diverse teams and individuals.

Solid understanding of cluster configuration management tools such as Ansible.

Ways to stand out from the crowd:

Understanding of key Linux technologies such as NFS, automounter, LDAP, DNS, and TCP/IP networking in Red Hat Linux distribution flavors.

Familiarity with job scheduler administration (e.g. IBM Spectrum LSF or SLURM) and experience building/operating large-scale compute infrastructure.

Knowledge of the FlexLM license management system.

Proficiency in Perl for maintaining legacy automation scripts.

Familiarity with High-Speed Networking (InfiniBand, RDMA, RoCE etc.) and fast, distributed storage systems (Lustre, GPFS, etc.).
This position is open to all candidates.
 
Job ID: 8583522

Posted 5 days ago
Location: Yokne`am
Job Type: Full Time
We are looking for a Senior networking test engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What you'll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
Job ID: 8584095

Posted 2 days ago
Location: Yokne`am
Job Type: Full Time
We are looking for a Senior networking test engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What you'll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with our networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
Job ID: 8586994

Posted 5 days ago
Confidential company
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are looking for a creative and experienced Senior Firmware Engineer to join our PCIe Firmware team, someone passionate about using artificial intelligence to engineer the foundational hardware of the AI revolution.

As an integral part of our team, you'll architect and implement the core of our next-generation devices. This senior role places you at the center of innovation, where you will have a direct impact on our business and technology by solving sophisticated technical challenges. It's a unique opportunity to shape our technology and empower customers to build the supercomputers and AI fabrics of tomorrow.

What You'll Be Doing:

Lead the architectural design, development, and optimization of cutting-edge PCIe firmware, using AI-driven modeling and insights to deliver exceptional performance.

Serve as a trusted technical expert by investigating, debugging, and resolving challenging PCIe firmware issues for our most important customers.

Collaborate closely with our Chip Design, Verification, Software, and Architecture engineers to find root causes and develop robust, long-term solutions.

Champion the integration of AI-assisted diagnostics and generative AI tools across the entire development lifecycle to boost team productivity and innovation.

Translate customer needs and field data into actionable feedback that directly shapes the future of our products.
Requirements:
What We Need to See:

A degree in Electrical Engineering, Computer Science, Computer Engineering, or equivalent practical experience.

8+ years of significant professional experience in embedded firmware development, with a deep understanding of PCIe.

A strong foundation in computer architecture, operating systems, and object-oriented programming.

Proficiency in scripting languages like Python to automate tasks and workflows.

An innovative approach with a genuine desire to apply AI and machine learning to accelerate firmware development.

Ways to Stand Out from the Crowd:

Track record of applying AI-powered tools like Cursor to accelerate the development lifecycle.

Previous experience in a customer-facing or application engineering role.

Direct, hands-on experience with PCIe switch architecture and its firmware in high-performance applications.

Deep knowledge of hardware verification concepts and tools (e.g., C++, Python, Jenkins).

Extensive knowledge of networking protocols and the Linux operating system.
This position is open to all candidates.
 
Job ID: 8584100

Posted 02/03/2026
Location: Yokne`am
Job Type: Full Time
We are looking for a skilled Software Developer with strong hardware knowledge to join our diverse team passionate about developing low-level software and test infrastructure for our networking and Interconnect products. As a technical focal point, you will work at the intersection of hardware and software, taking ownership of driver development, hardware bring-up, and test system architecture. You will be instrumental in driving innovation by developing low-level software that directly controls hardware, debugging sophisticated hardware-software interactions, and creating robust testing solutions. If you're passionate about code development, automation, innovation, reading schematics, debugging hardware with software tools, and becoming a key technical contributor, we'd love to have you on board!


What You'll Be Doing:

Collaborate with multi-functional teams including hardware, electrical, thermal, and mechanical engineers to understand hardware specifications and develop corresponding test requirements.

Take the lead in debugging hardware-software integration issues using instruments and software tools to pinpoint root causes.

Develop low-level drivers in C/C++ for hardware components and build DLL wrappers for integration with higher-level C# applications.

Review and interpret hardware schematics and datasheets to understand signal flows, timing requirements, and implement appropriate software control mechanisms.

Lead all aspects of hardware bring-up, validation, and deployment of test solutions to production environments, ensuring accurate integration.
Requirements:
What We Need to See:

Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or Software Engineering with significant hardware/electronics coursework or hands-on experience.

5+ years of hands-on experience in hardware-software integration, low-level driver development, or firmware development with proven ability to lead technical projects.

Strong proficiency in one or more programming languages such as Python, Java, C#, with additional experience in C/C++ for low-level programming.

Demonstrated ability to read and interpret electrical schematics, block diagrams, and hardware datasheets.

Experience with hardware debugging tools such as oscilloscopes, logic analyzers, JTAG debuggers, or similar instruments.

Excellent problem-solving skills in developing software solutions for sophisticated hardware-software interactions. Ability to collaborate with hardware teams, demonstrating deep technical ownership of hardware-software integration projects.


Ways To Stand Out from the Crowd:

Hands-on experience with PCIe, I2C, SPI, UART, or other hardware communication protocols. Experience developing device drivers for Windows or Linux environments.

Background in embedded systems, microcontrollers, DSPs, FPGAs, or custom ASIC integration. Experience wrapping native C/C++ libraries into managed DLLs for .NET/C# applications.

Knowledge of hardware validation methodologies and experience with automated hardware test equipment. Previous work in networking hardware, high-speed interconnects, or semiconductor validation environments.

Strong ability to bridge communication between hardware and software teams, translating hardware requirements into software solutions.
This position is open to all candidates.
 
Job ID: 8566021

Posted 2 days ago
Confidential company
Location: Yokne`am
Job Type: Full Time
We are looking for an outstanding, experienced manager to join our leading AI team. The team is developing cutting-edge AI applications, including Retrieval-Augmented Generation (RAG) systems, Large Language Model (LLM) implementations, large-scale AI agents, MCPs, recommendation engines, classical AI models such as clustering, and more. You will have the opportunity to shape how AI transforms our products and internal processes worldwide. In this role, you will manage a team of AI/ML engineers, lead the innovation and development of the different solutions, and collaborate with cross-company teams. We are looking for an experienced, hard-working manager who can collaborate effectively and has deep technical capabilities in both AI/ML and software development.

What you'll be doing:

Manage a team of 6-8 engineers.

Formalize the team's work methodologies, collaboration practices, and so on.

Provide technical leadership, mentorship, and guidance to foster team growth.

Supervise and build best-in-class AI solutions, including data ingestion, RAG, MCPs, agents, self-service, and more.

Be a part of an experienced team with a great atmosphere.

Collaborate with cross-functional teams to identify and implement AI opportunities.
Requirements:
What we need to see:

B.Sc. (or equivalent experience) in Computer Science, AI, Machine Learning or related field.

4+ years of experience in a managerial role.

3+ years of experience implementing production AI/ML systems.

8+ overall years of experience in software development.

Strong understanding of modern Machine Learning domains, known algorithms, architectures and techniques.

Proficiency in Python.

Capable of multitasking, with the ability to drive both people and tasks.

Effective communication and collaboration skills.

Demonstrated leadership with the ability to inspire and motivate team members.

Ways to stand out from the crowd:

Track record of delivering high-quality work on time and meeting project timelines.

Experience developing large-scale software; understanding of SOLID principles and design patterns.

Knowledge of LLM techniques.

Background implementing RAG systems in production environments.

Experience with vector databases and embedding technologies.
This position is open to all candidates.
 
Job ID: 8586448

Posted 2 days ago
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are now looking for a Senior HPC Site Reliability Engineer to join our mission and continue improving our HPC infrastructure. A meaningful part of NVIDIA's strength is our unique and advanced development tools and environments that enable our incredible pace of innovation. We are looking for architects to help us evolve the way our private compute cloud is architected and optimized.

What you will be doing:
Provide leadership in the design and implementation of our large-scale compute cloud that enables the world's top chip modelers, designers, and deep learning experts to invent groundbreaking technology.
Identify architectural changes or completely innovative approaches in our cloud architecture and design.
Help with strategic challenges we encounter, including: effective resource utilization in a heterogeneous compute environment, evolving our private/public cloud strategy, capacity modeling, and planning for multi-year growth and scaling across our global computing environment!
Requirements:
What we need to see:
B.Sc. in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
8+ years of experience designing and operating large-scale compute infrastructure.
Experience with job schedulers such as IBM/Platform LSF, SGE, SLURM, Marathon, Chronos.
Solid understanding of cluster configuration management tools - Ansible, Puppet, Chef, Salt.
Good experience providing compute services using a public cloud (AWS, Azure, Google Cloud).
Strong script-writing skills: Python, Bash, Perl.
Knowledge of and/or experience deploying PaaS microservices - Docker, Docker Swarm, Kubernetes.
Understanding of fast distributed and network attached storage solutions and Linux file systems, ability to recommend and implement solutions to improve OS performance and reliability.

Ways to stand out from the crowd:
Linux certification from a well-known vendor - RedHat, Oracle, etc.
Prior experience managing large-scale Kubernetes deployment in production.
Strong skills in modern container networking and storage architecture.
Well-known Cloud Certification(s).
This position is open to all candidates.
 
Job ID: 8586553

Posted 6 days ago
Location: Yokne`am
Job Type: Full Time
Our System Product Engineering (SPE) organization is looking for a Security Software Engineer to join the TDE (Test Development Engineering) Security Team. This role focuses on designing, developing, and deploying security-critical software that protects our next-generation products throughout development, validation, and production. You will work on the most sensitive and business-critical portions of the SPE delivery and production pipelines, taking ownership of production-line security and collaborating closely with DFT, architecture, test development, and validation teams to ensure security is built in end-to-end.

What you'll be doing:

Design and develop security-critical backend services, libraries, and tooling that protect SPE systems, validation flows, and production delivery pipelines.

Own and implement production-line security mechanisms, ensuring secure bring-up, test, validation, and manufacturing handoff.

Develop and integrate security features such as authentication, attestation, secrets and key management, integrity checks, and audit mechanisms.

Build secure automation frameworks and tooling embedded into test execution, validation, and manufacturing workflows.

Collaborate closely with DFT (Design for Test), Architecture, test writers, and validation teams to define security requirements and translate them into robust, scalable software solutions.

Participate in secure design reviews, threat modeling, and security feature development for SPE board components and production flows.

Improve the security posture, reliability, observability, and maintainability of SPE systems and services.

Support and influence secure CI/CD and release pipelines, including vulnerability detection, policy enforcement, and controlled deployments.

Investigate, debug, and remediate security vulnerabilities and systemic weaknesses across SPE systems spanning development through production.
Requirements:
What we need to see:

Bachelor's or Master's degree in Electrical Engineering, Computer Science, Software Engineering, or a related field.

5+ years of professional software engineering experience, with strong ownership of backend systems in Python.

Proven experience developing production-quality software and a passion for automation.

Solid understanding of OOP, software concepts and system design principles.

Strong familiarity with Linux environments, system services, and system-level troubleshooting.

Excellent debugging, problem-solving, and code review skills.

Experience working in cross-functional engineering environments.

Proficiency with version control systems and collaborative development workflows.

Ways to stand out from the crowd:

Experience securing production, manufacturing, or product bring-up pipelines.

Background in product security, platform security, or DevSecOps.

Experience working with or alongside DFT, hardware architecture, or validation teams.

Knowledge of cryptography concepts, secure provisioning, and key management systems.

Experience securing CI/CD pipelines for large-scale engineering organizations.
This position is open to all candidates.
 
Job ID: 8583556