Senior DevOps and Automation Engineer, Fabric Networking - GPU

1 day ago
Location: More than one
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. We are looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
We are looking for a highly motivated DevOps and Automation Engineer to join our software infrastructure team. In this role, you'll build and enhance the systems that support large-scale GPU clusters, interconnected via NVLink and InfiniBand, that run today's most fast-paced HPC and AI workloads.
What you will be doing:
Build and maintain CI/CD pipelines that support fast, reliable integration and deployment across complex systems.
Design tools and automation workflows that simplify software releases, manage dependencies, and increase reliability.
Accelerate development by modularizing systems and enabling independent release cycles.
Build infrastructure automation for provisioning, scaling, and maintaining GPU clusters.
Automate software updates and monitor system health to improve reliability and availability.
Troubleshoot and resolve operational issues across distributed infrastructure.
Manage firmware and software rollouts to minimize downtime and ensure consistency.
Work with global engineering teams to align infrastructure tools and support project achievements.
Requirements:
BS or MS in Computer Science, Computer Engineering, or a related field
5+ years of experience managing infrastructure or systems in high-performance or distributed environments.
Expertise in scripting and automation using Python, Ansible, and Shell.
Practical experience with modern CI/CD tools and infrastructure-as-code frameworks.
Strong understanding of Linux, networking, and distributed system design.
Proven ability to break down monolithic systems into scalable, loosely coupled components.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Experience with cluster management tools like Slurm.
Familiarity with our company's DGX/HGX systems and GPU-based clusters.
Knowledge of observability tools such as Prometheus and Grafana.
Proven ability to lead DevOps process improvements and drive team efficiency.
This position is open to all candidates.
 
8333379
Similar jobs that may interest you
1 day ago
Location: More than one
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. We are looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
Our company's data center systems, such as DGX and HGX, have become core to our company's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of our company's GPUs, NVLink, InfiniBand networking, Grace CPUs, and a fully optimized AI and HPC software stack. We are seeking a highly motivated System Software Manager to lead a team driving the development of next-generation communication software and products, building on our industry-leading NVLink and NVSwitch technologies. This role combines technical leadership with hands-on development activities, driving innovation at the core of our company's data center ecosystem.
What you will be doing:
Lead, mentor, and grow your engineering team while overseeing project planning, execution, and ensuring the quality and performance of all deliverables.
Own major features and drive the team's roadmap that aligns with ambitious goals and dynamic customer requirements.
Grow the local team while collaborating with a wide range of cross-business-unit teams both on-site and locally.
Continuously evaluate and identify opportunities to improve processes, infrastructure, and practices to ensure efficient, transparent, and high-quality team execution.
This is a technical leadership role, with active involvement in feature design and implementation.
Requirements:
BS or MS degree in Computer Engineering, Computer Science, or related field or equivalent experience.
8+ years of overall technical experience in system software and networking product development.
3+ years of experience managing and leading engineering teams.
Experience balancing multiple projects with competing priorities.
Solid understanding of computer system architecture, operating system and kernel internals.
Strong understanding of networking fundamentals and high-performance interconnection (e.g., InfiniBand, Ethernet)
Experience with Linux development, familiarity with OS virtualization technologies.
Background in multi-core / multi-process / multi-threaded programming environment.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Prior experience working with distributed teams, demonstrating your ability to manage and coordinate effectively across different locations.
Direct exposure to high-profile customers, demonstrating strong communication and relationship management skills.
Understanding of CUDA programming models, our company's GPUs, and high-performance computing (HPC) environments.
This position is open to all candidates.
 
8333374
27/08/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars.
Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance Engineer to influence the roadmap of our communication libraries. The DL and HPC applications of today have a huge compute demand and run at scales of up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (e.g., NVLink, PCIe) within a node and with high-speed networking (e.g., InfiniBand, Ethernet) across nodes. Communication performance between the GPUs has a direct impact on end-to-end application performance, and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize our company's vision?
What you will be doing:
Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.
Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack
Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available
Triage and root-cause performance issues reported by our customers
Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information
Collaborate with a very dynamic team across multiple time zones.
Requirements:
M.S. (or equivalent experience) or PhD in Computer Science or a related field, with relevant performance engineering and HPC experience
3+ years of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)
Experience conducting performance benchmarking and triage on large scale HPC clusters
Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals)
Implement micro-benchmarks in C/C++, read and modify the code base when required
Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python
Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, Slurm, Ansible, Docker)
Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones
Ways to stand out from the crowd:
Practical experience with InfiniBand/Ethernet networks in areas like RDMA, topologies, congestion control
Experience debugging network issues in large scale deployments
Familiarity with CUDA programming and/or GPUs
Experience with Deep Learning frameworks such as PyTorch and TensorFlow.
This position is open to all candidates.
 
8321604
27/08/2025
Confidential company
Location: Yokne`am
Job Type: Full Time
We are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms.
What you'll be doing:
Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.
Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.
Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.
Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.
Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.
Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.
Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.
Drive Innovation: Support R&D activities and engage in proof of concepts (POCs) and proof of values (POVs) for future improvements.
Requirements:
B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.
Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.
Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.
Familiarity with Jenkins, Ansible, Puppet/Chef.
Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu), networking, and OS-level security.
Deep understanding of networking protocols such as InfiniBand and Ethernet.
Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.
Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.
Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).
Familiarity with cloud platforms (AWS, Azure, Google Cloud).
Ways to stand out from the crowd:
Architectural Insight: Knowledge of CPU and/or GPU architecture.
Container Expertise: Understanding of Kubernetes and container-related microservice technologies.
GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).
RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8321669
28/08/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are at the forefront of the AI revolution, delivering cutting-edge accelerated compute platforms for global impact. Our Network Insights group is seeking a talented and motivated Sr. DevOps Engineer to architect, scale, and optimize the DevOps infrastructure supporting our advanced networking simulation services. In this high-impact role, you will lay the foundations to scale a key insight product to reach 10 to 100 times more users, design robust CI/CD pipelines, drive automation, and ensure the reliability, scalability, and security of our cloud-based and on-prem platforms. If you are passionate about solving complex infrastructure challenges and enabling world-class software delivery, we want to hear from you.
What You'll Be Doing:
Architect and optimize CI/CD pipelines for large-scale, high-availability simulation services, ensuring fast, reliable, and secure deployments.
Drive automation across infrastructure provisioning, configuration management, and monitoring to support rapid development cycles and minimize manual intervention.
Collaborate with software engineering and product teams to design and implement scalable, cloud-native solutions that meet evolving business needs.
Promote standard processes in infrastructure as code, containerization, and cloud security, ensuring compliance and resilience across environments.
Monitor, troubleshoot, and resolve infrastructure and deployment issues, maximizing uptime and ensuring efficient performance for internal and external customers.
Evaluate and integrate new tools and technologies to continually enhance the reliability, observability, and efficiency of our DevOps ecosystem.
Participate in incident response and post-mortem processes, driving root cause analysis and systemic improvements.
Requirements:
BSc or above in Computer Science, Computer Engineering, or a related field, or equivalent experience.
5+ years of overall hands-on experience in DevOps or Site Reliability Engineering roles.
Proven expertise in designing, building, and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions, or similar).
Deep knowledge of cloud platforms (AWS, preferably), On-Prem deployment, container orchestration (Kubernetes, Docker), and infrastructure as code.
Strong scripting and automation skills (Python, Bash, or similar).
Experience with monitoring, logging, and observability tools (Prometheus, Grafana, ELK, etc.).
Proven understanding of security standard methodologies in cloud & on-prem DevOps environments.
Excellent communication and interpersonal skills, with a track record of multi-functional collaboration.
Experience supporting large-scale, high-availability production systems.
Ways to Stand Out From the Crowd:
Prior background in networking or simulation environments.
Prior experience with building a new team from the ground up.
Familiarity with performance tuning and cost optimization in cloud and on-prem environments.
Experience with building CI/CD pipelines from the ground up.
This position is open to all candidates.
 
8322880
20/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Senior DevOps Engineer to join our newly formed Foundations Team, a small, high-impact group responsible for the infrastructure, tools, and shared services that power our entire R&D organization.
In this role, you'll design, build, and evolve internal platform infrastructure, CI/CD systems, and developer enablement tooling. Your mission is to empower developers across the company to work autonomously by creating self-service tools, automation, and clear standards that reduce friction and increase reliability.
You'll collaborate closely with engineers across disciplines and partner with the Foundations Team Lead to shape DevOps practices that scale. This is a hands-on role for someone who thrives in high-velocity, mission-critical environments and is passionate about building tools that make developers faster, more productive, and confident in running their own services.
What You'll Do
Design and maintain scalable, developer-friendly CI/CD pipelines and deployment workflows.
Build self-service tooling and automation that enables teams to manage deployments, environments, secrets, and observability independently
Be responsible for cloud infrastructure and operations foundations
Implement and promote best practices for monitoring, logging, and alerting across services.
Operate and optimize Kubernetes-based production environments, ensuring performance, security, and stability.
Manage infrastructure using Infrastructure as Code (IaC) and ensure repeatability and traceability through tools like Terraform.
Collaborate with R&D teams to support onboarding to internal tooling and promote a culture of enablement over dependency.
Monitor cloud cost, ensuring our cloud operates efficiently.
Requirements:
4+ years of hands-on experience in DevOps or infrastructure engineering, ideally in high-velocity, mission-critical production environments.
Deep expertise in Kubernetes and containerized infrastructure, with experience deploying and managing workloads at scale.
Strong understanding of cloud infrastructure and operations, including networking, storage, compute, and security; GCP experience preferred.
Proficiency with Infrastructure as Code tools, especially Terraform, with a focus on automation and operational excellence.
Experience developing and managing CI/CD processes and tools, with a passion for improving developer workflows and release quality.
Strong debugging and problem-solving skills, with the ability to troubleshoot complex systems across the stack.
Highly self-motivated and organized, able to work independently in a fast-paced, collaborative environment.
This position is open to all candidates.
 
8311657
28/08/2025
Location: Yokne`am
Job Type: Full Time
We are seeking an exceptional DevOps & Software Engineer to join our innovative development team. This team is at the heart of our company's software infrastructure, building and maintaining a wide range of solutions including an internal cloud environment, scalable build systems, automation frameworks and AI-driven tools. You will have a direct impact on our company's products and development workflows. This role blends hands-on development with deep DevOps practices. It's ideal for engineers passionate about scalability, infrastructure, automation, and modern development platforms.
What You'll Be Doing:
Develop and maintain internal cloud solutions based on a microservices architecture to enable efficient, high-quality software development and delivery.
Design and implement automation tools, infrastructure services, and advanced build systems.
Work across a wide variety of operating systems, building virtualization and system-level capabilities.
Provide resilient solutions for a sophisticated infrastructure stack with the latest network devices.
Contribute to a positive and collaborative team culture that values creativity and agility.
Partner with engineering teams across our company to deliver scalable and reliable infrastructure solutions.
Requirements:
B.Sc. in Computer Science, Computer Engineering, or equivalent technical field.
5+ years of hands-on experience in software development or DevOps roles.
Proficiency in Python and familiarity with Linux environments.
Solid understanding of software design, implementation, and debugging.
Strong analytical skills, ability to troubleshoot complex systems.
Self-driven, quick learner, comfortable with multitasking and dynamic environments.
Ways to Stand Out from the Crowd:
Experience with virtualization and operating system internals.
Familiarity with tools like Kubernetes, Rancher, MongoDB, Redis, Docker, Vagrant, Ansible, or similar.
Strong background in CI/CD practices, particularly using Jenkins and infrastructure-as-code.
Knowledge of networking fundamentals and protocols.
Prior experience with cloud infrastructure or grid computing environments.
This position is open to all candidates.
 
8322800
25/08/2025
Confidential company
Location: More than one
Job Type: Full Time
We are looking for an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. We are looking for an outstanding senior HPC architect to be a key player, working with the most exciting computing hardware and software and contributing to the latest breakthroughs in artificial intelligence and GPU computing. You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms.
What you will be doing:
Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting
Manage Linux job/workload schedulers and orchestration tools
Develop and maintain continuous integration and delivery pipelines
Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources
Deploy monitoring solutions for the servers, network and storage
Perform troubleshooting bottom up from bare metal, operating system, software stack and application level
As a technical resource, develop, redefine and document standard methodologies to share with internal teams
Support Research & Development activities and engage in POCs/POVs for future improvements.
Requirements:
A degree in Computer Science, Engineering, or a related field
5+ years of experience
Knowledge of HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software
Experience with job scheduling workloads and orchestration tools such as Slurm, K8s
Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, Wireshark, etc.) and internals, ACLs, OS-level security protection, and common protocols (e.g., TCP, DHCP, DNS).
Experience with multiple storage solutions such as Lustre, GPFS, ZFS and XFS. Familiarity with newer and emerging storage technologies.
Python programming and Bash scripting experience.
Comfortable with automation and configuration management tools such as Jenkins, Ansible, Puppet/Chef
Deep knowledge of Networking Protocols like InfiniBand, Ethernet
Deep understanding and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix)
Ways to stand out from the crowd:
Familiarity with cloud computing platforms (e.g. AWS, Azure, Google Cloud)
Knowledge of CPU and/or GPU architecture
Knowledge of Kubernetes, container related microservice technologies
Experience with GPU-focused hardware/software (DGX, CUDA)
Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8317649
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
This is a great opportunity to be part of one of the fastest-growing infrastructure companies in history, an organization that is in the center of the hurricane being created by the revolution in artificial intelligence.
"Our company's data management vision is the future of the market." - Forbes
We are the data platform company for the AI era. We are building the enterprise software infrastructure to capture, catalog, refine, enrich, and protect massive datasets and make them available for real-time data analysis and AI training and inference. Designed from the ground up to make AI simple to deploy and manage, our company takes the cost and complexity out of deploying enterprise and AI infrastructure across data center, edge, and cloud.
Our success has been built through intense innovation, a customer-first mentality and a team of fearless company ronauts who leverage their skills & experiences to make real market impact. This is an opportunity to be a key contributor at a pivotal time in our company's growth and at a pivotal point in computing history.
The DevOps Engineer position is an operational engineering role and is an integrated part of our development team. You will be responsible for improving the efficiency of our processes, software, and infrastructure, and will assist the R&D team with product development. If you are a DevOps Engineer who is passionate about automating and scaling everything, this job is for you.
Responsibilities
Monitor and optimize cloud infrastructure for performance, scalability, and cost-efficiency.
Manage and maintain CI infrastructure (GitLab CI and Jenkins).
Manage, maintain, and improve our release and development environments.
Support critical production infrastructure deployed in multiple clouds (AWS, Azure, and GCP).
Develop and support the R&D toolchain and implement best practices for code deployment, testing, and maintenance.
Automate on-premises lab infrastructure by adopting IaC practices.
Lead and develop monitoring, telemetry, alerting, and logging production services.
Requirements:
Desired Qualifications:
Proven hands-on experience with Docker and Kubernetes in production. Hands-on experience deploying and managing complex Kubernetes environments, including services, ingresses, load balancers, and Helm charts
Solid understanding of Linux/Unix Internals and experience with handling complex performance and configuration problems in Linux/Unix environment.
Multi-Cloud Expertise: Deep familiarity with both GCP and AWS for provisioning, networking, and cost-optimization strategies
Experience in DSL Configuration tools like Ansible, Chef, or Puppet.
Experienced with programming languages (Python is preferred).
Shell scripting experience.
Proficient in SRE/monitoring methodologies (monitoring stacks with emphasis on Prometheus).
Nice To Have Skills
Experienced with CI/CD tools and frameworks.
Experience with managing binary repositories (RPM, PyPI, npm, etc.).
Experience with developing Ansible collections, roles, and modules.
Experience with managing GitLab and GitLab CI.
Experience with HashiCorp products: Terraform, Packer, Consul, Vault, and Vagrant.
Experience with automating configuration and deployment of On-Premises Lab Hardware.
This position is open to all candidates.
 
8325791
28/08/2025
Confidential company
Location: Yokne`am
Job Type: Full Time
We are seeking a highly skilled DevOps Engineer to join our Networking IC Product Engineering Group (ICPE). This is a unique opportunity to become a cornerstone of our DevOps practice, owning the critical systems that power our engineering innovation. You'll be responsible for the entire DevOps lifecycle, from robust CI/CD pipelines to production line package releases, driving efficiency, scalability, and reliability across the organization. You will work with a high degree of autonomy and be expected to independently lead initiatives, design and implement optimal solutions, and collaborate with both internal stakeholders and external partners. If you're a self-motivated engineer who thrives in dynamic environments, takes initiative without waiting for direction, and enjoys improving and scaling engineering ecosystems, we want you with us.
What You'll Be Doing:
Develop and maintain robust, scalable CI/CD pipelines to ensure seamless software integration and delivery.
Collaborate with cross-functional teams to enhance build system reliability and efficiency.
Monitor, troubleshoot, and optimize system performance to ensure continuous, reliable operation.
Diagnose and resolve complex issues affecting the stability and performance of development and production environments.
Requirements:
Bachelor's degree in computer science, computer engineering, or equivalent experience.
5+ years of hands-on experience in CI/CD pipeline development and automation (e.g., Jenkins, GitLab CI/CD).
5+ years of experience in Python development.
5+ years of working with Linux distributions (e.g., RedHat, Ubuntu).
Proficiency in scripting languages (e.g., Bash, Ruby, Groovy) in a Unix/Linux environment.
Strong background in configuration and deployment management.
Expertise in version control systems (e.g., GitLab, Gerrit).
Exceptional problem-solving skills, with a focus on identifying root causes and implementing long-term fixes.
Excellent communication and interpersonal skills; strong team spirit and cross-team collaboration mindset.
Proven ability to work independently, prioritize tasks, and drive initiatives without constant supervision.
Ways To Stand Out From The Crowd:
Experience with PyTest or other testing frameworks.
Previous leadership experience or a track record of mentoring/team-leading.
Familiarity with databases (e.g., MongoDB or similar).
Hands-on experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
This position is open to all candidates.
 
8322916
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior DevOps Engineer to join our R&D team in developing the next rising product in the health tech landscape. If you are looking for a challenging, influential position and are passionate about making an impact, this might be the role for you.
As a Senior DevOps Engineer, you'll play a key role in the design, development, testing, deployment, and monitoring of our infrastructure and products. In this position, you'll make significant contributions to our observability stack, helping build and maintain robust systems for logs, metrics, traces, and alerting.
Our ideal candidate is passionate about DevOps and observability, has strong communication skills, and thrives on constant improvement for both technology and processes. If you enjoy working on multiple projects in parallel and are a proactive team player, you'll fit right in.
This is a unique opportunity to join the core team of a fast-growing startup, where your contributions will have a direct impact on our product and success.
Responsibilities:
Support and collaborate with cross-functional engineering teams using cutting-edge technologies.
Contribute to the design, implementation, and maintenance of monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, Loki)
Secure, scale, and manage our cloud environments (AWS and GCP)
Design and implement automation solutions for both development and production
Manage and improve our CI/CD pipelines for fast and safe delivery
Lead best practices in infrastructure, observability, configuration management, and system hardening
Continuously assess and improve existing infrastructure in line with industry standards
Requirements:
BSc in Computer Science, Engineering, or equivalent experience
5+ years of experience as a DevOps Engineer or similar software engineering role
Proven experience with Docker and Kubernetes (EKS preferred)
Hands-on experience with monitoring and observability tools, including Prometheus, Grafana, Datadog, or similar.
Expertise in Terraform for AWS infrastructure-as-code deployments
Strong collaboration and interpersonal communication skills
Excellent analytical thinking and problem-solving mindset
Proficiency with relational databases
Solid knowledge of Python and Bash scripting
Experience with test automation is an advantage
This position is open to all candidates.
 
8320472