Senior Software Engineer - Infrastructure & Network Automation

18 hours ago
Location: Tel Hai and Yokne'am
Job Type: Full Time
We are looking for a Senior Software Engineer to join the NSV Tools (Network Solutions Validation) group. As a senior team member, you will be part of the effort to develop high-performing software automation systems for our Data Center environments. You will interact with NIC, OS, Switch, HCA, CPU and GPU compute as well as architects, network engineers, and developers. We drive the data growth of the world's biggest companies. With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced. Are you ready for the challenge?

What you'll be doing:

Design and develop an automation platform used to provision, configure, and monitor HPC data centers.

Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.

Collaborate closely with internal and external stakeholders to understand requirements and deliver robust full-cycle solutions.

Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.

Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.

Work cross-functionally with architects, DevOps engineers, product managers and stakeholders to ensure high-quality releases.

Participate in code reviews, technical design discussions, and continuous improvement activities within the team.
Requirements:
What we need to see:

B.Sc. in Computer Science, Engineering, or a related field (or equivalent practical experience).

5+ years of strong hands-on experience on Linux-based platforms.

Proficient scripting and automation skills (Bash, Python, Ansible).

Background in DevOps and Network Engineering practices.

Hands-on experience with large-scale network architectures, switches/routers, OVS, SR-IOV, and network operating/management systems.

Networking expertise: Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN, and common network topologies.

Practical experience with containers and cloud-native technologies (Docker, Kubernetes) and networking performance.

Experience with version control systems (Git) and CI/CD pipelines.

Independent, fast learner with strong ownership mindset, excellent debugging and problem-solving skills, and effective communication abilities.

Ways to stand out from the crowd:

Experience as a Team Lead, Scrum Master, or in a similar leadership role.

Experience in planning, tracking, and delivering projects.

Familiarity with DevOps methodologies and tools (e.g., Jenkins, Ansible).

Hands-on experience with Docker and containerized environments.

Experience with agentic AI development.
This position is open to all candidates.
 
8586566
Similar jobs that may interest you
11/02/2026
Location: Yokne'am
Job Type: Full Time
We are looking for a Senior Networking Test Engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What you'll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
8541388
4 days ago
Location: Yokne'am
Job Type: Full Time
We are looking for a Senior Networking Test Engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What you'll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
8584095
16 hours ago
Location: Yokne'am
Job Type: Full Time
We are looking for a Senior Networking Test Engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What you'll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with our networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
8586994
11/02/2026
Confidential company
Location: Yokne'am
Job Type: Full Time
We are looking for an AI Test Architect to join the E2E Verification group and profile innovative large-scale distributed training on our AI End-to-End solutions in large-scale supercomputing clusters. You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated Computing and Deep Learning software and hardware platforms, and with researchers, developers, and customers, to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, Switch, HCA, CPU and GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms.

What you'll be doing:

Profiling, benchmarking, and analyzing deep learning models to identify areas for optimization and improvement in terms of performance, efficiency, and accuracy, with a strong emphasis on networking aspects.

Collaborating closely with data scientists, researchers, and development and automation teams to design and implement scalable training pipelines and frameworks that demonstrate large-scale, high-performance networking capabilities.

Staying up-to-date with the latest advancements in deep learning algorithms, architectures, our GPU technologies, and high-performance networking solutions.

Optimizing deep learning models for performance, memory usage, and power efficiency while maximizing high-performance networking features on our supercomputers.

Providing insights and recommendations based on the analysis of large-scale training results, specifically focusing on networking bottlenecks and optimizations, to improve model outcomes and achieve business objectives.

Collaborating with hardware engineers to guide the development and integration of efficient networking solutions for deep learning, including exploring network architecture optimizations and bringing to bear technologies such as RDMA or InfiniBand.
Requirements:
What we need to see:

B.Sc. in Computer Science, Software Engineering, or equivalent experience.

Strong understanding and practical experience with machine learning algorithms and techniques, with a specialization in deep learning and expertise in high-performance networking.

8+ years of overall experience, including CUDA programming for deep learning frameworks such as TensorFlow and PyTorch, combined with expertise in networking libraries and protocols.

Ability to profile and optimize deep learning workflows, focusing on networking-related bottlenecks and optimizations, to improve overall performance and efficiency.

Exceptional analytical and problem-solving skills, with keen attention to detail, particularly in identifying and resolving networking performance issues.

Excellent communication and collaboration skills, enabling effective teamwork and cooperation.

Familiarity with supercomputers, parallel computing, distributed systems, and high-performance networking technologies like RDMA or InfiniBand.

Ways to stand out from the crowd:

Demonstrated experience in successfully profiling and optimizing large-scale deep learning training on NVIDIA supercomputers, with a significant focus on high-performance networking enhancements.

Experience with distributed deep learning, distributed training frameworks, or large-scale data pipelines enhanced by high-performance networking solutions.

Expertise in optimizing networking parameters, such as bandwidth, latency, or congestion control, for deep learning workloads.

Familiarity with NVIDIA's networking technologies, such as Mellanox InfiniBand, and their integration with deep learning workflows.

Strong understanding of high-performance networking protocols and standards and their application to deep learning.
This position is open to all candidates.
 
8541318
3 days ago
Location: Ra'anana and Yokne'am
Job Type: Full Time
We are transforming the world of networking for massively scaled AI factories through Spectrum-X: our Ethernet Networking solution. We need automation engineers with strong coding skills to join our Spectrum-X team and create the products that will drive the next phase of AI growth. As a Software Engineer you will ensure that the solution's software features are developed, tested and delivered to meet the high standards of our customers. Do you want to help create the most exciting solutions in the world of networking today? If so, read on.

What you'll be doing:

Participating in an international team of software engineers working on products for testing our Ethernet networking products.

Designing and implementing robust, maintainable, and efficient automation test suites for end-to-end testing of AI Ethernet networks.

Engaging directly with our customers to ensure we are aligned on their networking requirements and solve their problems.

Debugging and diagnosing complex networking problems.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science or equivalent experience.

5+ years of experience of networking product delivery (e.g. switches, routers, NICs) and the software development lifecycle.

Strong programming skills in Python.

Experience of creating and maintaining robust test automation systems, and a desire to automate everything.

Exceptional analytical and problem-solving skills, with a keen attention to detail, particularly in identifying and resolving networking issues.

Good English communication and interpersonal skills.

Good Linux familiarity.

Ways to stand out from the crowd:

Experience of data center network engineering spanning servers, NICs, cabling and switches.

Strong knowledge of Ethernet networking, from switch pipelines (e.g. ACLs, forwarding rules) through networking protocols (like TCP/IP and BGP/EVPN-VXLAN) to network operating systems and their automation.

A passion for AI and networking.

Experience with NVIDIA networking technology such as BlueField/ConnectX NICs, Spectrum switches, and the Cumulus Linux and SONiC operating systems.

Networking certification (e.g. CCIE, JNCIE, ACE-E).
This position is open to all candidates.
 
8585102
18 hours ago
Job Type: Full Time
We're looking for a Senior AI/MLOps Engineer to join a group that specializes in Security and Networking, and specifically ML, AI and agent development. As a Senior AI/MLOps Engineer, you'll build and maintain the infrastructure, tools and processes necessary to support the AI lifecycle in a production environment. You will collaborate closely with data scientists, software engineers, security architects and DevOps teams to ensure smooth deployment, modeling and optimization of AI models. This role involves creative problem solving alongside engineering teams, and is pivotal for the continued success of AI networking security.

What you'll be doing:

Developing, improving and optimizing scalable infrastructure for handling and deploying security and networking AI models and agents in production, ensuring high availability, scalability, reproducibility, and performance.

Optimizing AI models and agents for performance, scalability, and resource utilization, considering factors such as latency, efficiency, and cost.

Monitoring and deploying agentic systems, LLMs, and ML models in production.

Designing and implementing frameworks/pipelines for AI training, inference, and experimentation.

Collaborating closely with data scientists, security architects and software engineers to operationalize and deploy AI models and agents, including packaging and integration with existing systems. Participating in developing and reviewing code, design documents, use case reviews, and test plan reviews.

Collaborating with DevOps teams to integrate pipelines and workflows into the CI/CD process, ensuring flawless deployments and rollbacks.

Building and maintaining monitoring and alerting systems to proactively identify and resolve issues relating to quality, performance and infrastructure.

Implementing access controls, authentication mechanisms, and encryption standards for AI models and data.

Documenting guidelines and standard operating procedures for MLOps/AI processes and sharing knowledge with the wider team.

Developing proofs of concept for new features.
Requirements:
What we need to see:

BSc/MSc in CS/CE or related field (or equivalent experience).

Strong background in AI with experience deploying and monitoring AI/ML models, LLMs and agents to production systems at scale, including distributed and multi-node environments - at least 5 years of experience.

Proficiency in programming languages such as Python, Java, or Scala, along with experience in using ML/AI frameworks and libraries (e.g. TensorFlow, PyTorch).

Proficiency in microservices architecture, container orchestration, cloud platforms, and scalable infrastructure for training and inference workloads.

Knowledge of inference optimization techniques.

Understanding of build infrastructure and CI/CD tools and practices (e.g. GitLab, GitHub Actions, Jenkins).

You are detail-oriented and care deeply about robust, well tested, high-performance code in production environments.

You are proactive, take full ownership of your deliverables, have a can-do approach, and excellent communication and collaboration skills, able to work effectively in multifunctional teams.

Ways to stand out from the crowd:

Knowledge of network protocols and Linux internals.

Security and networking background, with knowledge of security protocols, network architectures, firewalls, intrusion detection systems, and other relevant security and networking concepts.

Experience deploying and optimizing generative models and agents.

Knowledge of network security principles and practices.
This position is open to all candidates.
 
8586605
11/02/2026
Location: Yokne'am
Job Type: Full Time
Our DOCA Verification team is seeking a highly motivated and hardworking Software Engineer with hands-on technical experience to verify the design and implementation of the next-generation Data Processing Unit software, with a wide range of features related to cyber security and embedded systems. This position offers the opportunity to have real impact in a dynamic, technology-focused company impacting data centers across the world. We're united in our quest to transform the way Smart Adapters are used.

What you'll be doing:

Collaboration & Communication: Work closely with software, architecture, product and DevOps teams to define test requirements, coordinate releases, and ensure high-quality product delivery.

End-to-End Feature Ownership: Deep dive into feature sets, taking responsibility from test plan development to final implementation and full automation.

Develop and Automate Testing: Design, implement, and maintain automated test scripts and frameworks (primarily in Python) to verify the correct functionality of our software products.

System & Integration Validation: Validate software functionality and performance through system-level and integration testing, utilizing Linux-based environments and virtualization tools.

Defect Analysis: Analyze test results, open bugs and track issues to closure, ensuring robust and scalable solutions.

Continuous Improvement: Drive design verification flows, contribute to methodology improvements, and leverage planning/tracking systems to manage release progress and build release indicators.

Test Environment Management: Set up, maintain, and optimize test environments using Linux, Docker, virtual machines, and other modern tools.

Regression Monitoring: Operate, monitor, and analyze failures in the nightly regression system, providing methodical root-cause analysis across hardware, OS, and software layers.
Requirements:
What we need to see:

Bachelor's degree in EE, CS or CE, or equivalent work experience.

5+ years of experience in software testing or software engineering.

Strong programming skills in C/C++.

Solid experience with Linux-based environments, including system tools and command-line utilities.

Methodical troubleshooting skills in Linux environments with a disciplined approach to evidence-based failure analysis.

Detail oriented and comfortable multitasking in a dynamic environment with shifting priorities and changing requirements.

Ability to work with various teams, with strong analytical, debugging and problem-solving skills and attention to detail.

Excellent communication skills, self-motivated and well organized.

Knowledge in operating systems and specifically with Linux.

Ways to stand out from the crowd:

Prior software testing experience, with an understanding of Software Testing Tools and Methodologies.

Python or other scripting languages (such as shell): an advantage.

Experience in CI methodology & servers (e.g. Gerrit, Jenkins etc.).

Knowledge of NVIDIA DPU products.
This position is open to all candidates.
 
8541330
3 days ago
Location: Yokne'am
Job Type: Full Time
We are looking for a System/Network Test Engineer to join our End-to-End Cloud solution team. You will be part of the E2E Verification team, whose main goal is to test and help define the most sophisticated Ethernet/InfiniBand NIC and Switch features and topologies, enabling products to meet growing market demands quickly. The ideal candidate will engage in testing and defining industry-leading networking systems and will bring the ability to learn new features, technologies, and protocols quickly.

What you'll be doing:

Contribute to design reviews and product feature requirements across the whole Ethernet/InfiniBand NIC & Switch portfolio and AI network.

Design and build setup topologies for appropriate product coverage, with an emphasis on emulating large-scale, complex customer environments.

Define requirements for the test automation team and implement tests for new features, as part of our growing network switch and adapter division.

Lead innovation by preparing and deploying POC activities based on growing field demands.

Generate comprehensive test reports during release execution, assist with reproducing and debugging complex customer use cases and determining their root cause, and serve as the engineering PIC for the full verification cycle of customer use case fixes provided by the R&D team.

Execute end-to-end test scenarios in different scopes (Regression, Performance, Functional and Scale), report testing progress, and provide summary reports of testing activity.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science or Electrical Engineering or equivalent experience as IT/Network Engineer.

2+ years of practical experience.

Strong hands-on experience with Linux-based platforms.

Experience with L2 & L3 network protocols.

Fast, self-directed learner with outstanding technical skills.

Independent, responsible worker, able to plan and complete tasks.

Effective troubleshooting and problem-solving skills.

Standout colleague with good communication and interpersonal skills.

Ways to stand out from the crowd:

Experience with virtualization technologies (KVM, Hyper-V, VMware, OpenStack, Kubernetes).

Experience with congestion control (DCQCN) and switches, and knowledge of collective communication libraries such as NCCL and MPI.

Scripting skills and experience: Bash / Python.
This position is open to all candidates.
 
8585200
17 hours ago
Confidential company
Location: Yokne'am
Job Type: Full Time
We are seeking a DevOps Manager to lead a team of engineers in developing and supporting scalable CI/CD processes for our Networking products, enhancing global R&D efficiency. The role requires over five years of DevOps experience, leadership skills, proficiency with Jenkins, Git SCM, and programming in Python, Java, or Groovy. It involves collaborating with distributed teams to improve development methodologies and infrastructure. Preferred qualifications include experience with Linux system administration, containerization tools like Docker and Kubernetes, and managing complex infrastructure projects in fast-paced environments. Join us and help us to take our CI/CD infrastructure to the next level! You will have a unique opportunity to have a real impact in a dynamic, technology-focused company shaping the future of networking and data-center technologies. If you bring with your excellence and creative thinking while solving issues as they arise, and if you enjoy working with distributed teams - your place is with us!

What you'll be doing:

Supervise a team of 4+ experienced and highly motivated DevOps engineers.

Architect, implement and support scalable CI processes for multiple NVIDIA Networking products.

Closely collaborate with DevOps, IT and security teams across the globe to share, promote and improve CI practices.

Build infrastructure for effective, high-quality, reliable development and release processes for our products.
Requirements:
What we need to see:

Bachelor's/Master's degree in Computer Science, Engineering, or equivalent experience.

4+ years leading DevOps organizations at a mid-range or enterprise company.

8+ years of overall hands-on experience in DevOps with a focus on CI/CD and infrastructure development.

Familiarity with Linux OS distributions.

Strong knowledge of build systems: Jenkins or similar.

Proven knowledge of Git SCM administration.

Proficiency in Python, Java, Groovy or related programming languages.

Proven success building and scaling teams in fast-paced, high-growth environments.

Agility, excellent communication skills, customer-facing approach.

Ways to stand out from the crowd:

Solid understanding of Linux system administration.

Demonstrated history of building robust and scalable CI/CD infrastructures for complex enterprise products.

Familiarity with containerization and orchestration tools such as Docker and Kubernetes.
This position is open to all candidates.
 
Posted 18 hours ago
Location: Yokne`am
Job Type: Full Time
The Enterprise Network Support and SRE team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a focus on user experience. This role will offer several opportunities to solve problems by being hands-on with troubleshooting, focused on network automation, observability, documentation, and excellence in operations. This Network SRE will be dedicated to enhancing network operations, diligently working to minimize manual labor and achieve Service Level Objectives (SLOs) amongst many goals. Our approach includes minimizing reactive operational tasks, documenting KB articles that can be used by Bots, following through on RCAs, conducting blameless postmortems, and proactively identifying and mitigating network risks to foster continuous improvement.

This crucial role will be focused on user satisfaction and brilliance in Network Operations. This SRE engineer will focus on tackling significant projects and is committed to fostering a supportive atmosphere that offers the mentorship necessary for professional development and growth. They will bring a wealth of skills and experience to be a sought-after mentor who leads by example.

What you'll be doing:

Owning the operational aspect of the network infrastructure, ensuring its high availability and reliability.

Partnering with architecture and deployment teams to guarantee that new implementations are supportable and align with production standards.

Advocating for and implementing automation to reduce toil and enhance operational efficiency.

Monitoring network performance, identifying areas for improvement, and coordinating with relevant teams to execute enhancements.

Collaborating with SMEs to resolve production issues swiftly and effectively, maintaining customer satisfaction.

Identifying opportunities for operational improvements and partnering with teams to develop solutions that drive excellence and sustainability in network operations.
Requirements:
What we need to see:

BS degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience.

Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas. Experience with both campus and data center networks.

Familiarity with network management tools such as Prometheus, Grafana, Alertmanager, Nautobot/NetBox, BigPanda.

Expertise in automating networks using frameworks such as Salt, Ansible, or similar.

In-depth experience with one or more of the following: Python, Go.

Knowledge of network technologies such as TCP/UDP, IPv4/IPv6, Wireless, BGP, VPN, L2 switching, Firewalls, Load Balancers, EVPN, VXLAN, Segment Routing. Proven track record in network operations.

Skills with ServiceNow and Jira.

Knowledge of Linux system fundamentals is a plus.

Systematic problem-solving approach, coupled with excellent communication skills and a sense of ownership and drive.

Ways to stand out from the crowd:

Track record of using operational signals from sources such as SNMP, Syslog, and Streaming Telemetry to solve operational challenges.

History of debugging and optimizing code; automating routine tasks.

Experience with Mellanox/Cumulus Linux, Palo Alto firewalls, and NetScaler and F5 load balancers.

Previous SRE experience.
This position is open to all candidates.
 