דרושים » תוכנה » AI Infrastructure Architect

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 10 שעות
חברה חסויה
Location: Caesarea
Job Type: Full Time
we are seeking a Lead System Architect to join our system architecture team and help define NR-NEXUS, our next-generation AI inference platform.
Responsibilities:
Lead the software architecture and technical roadmap for NR-Nexus
Write system specifications for NR-Nexus product
Research AI infrastructure, SaaS platforms, model serving, and inference trends
Work with engineering to translate technical capabilities into product value
Work closely with engineering teams to optimize performance, scalability, and feature delivery.
Define performance goals and lead profiling, benchmarking, and optimization efforts for GenAI and distributed AI workloads.
Collaborate with customers, partners, and open-source communities to ensure ecosystem compatibility and adoption.
Mentor software engineers and provide technical leadership
Requirements:
7+ years of software engineering experience, including 3+ years in software architecture or technical leadership.
Strong experience with Kubernetes-based platforms and cloud-native architecture.
Deep understanding of Gen AI/LLM infrastructure and distributed workloads
Experience designing management software or SaaS platforms for production systems.
Strong background in distributed systems, microservices, APIs, and automation.
Hands-on experience with observability stacks, monitoring, logging, alerting, and SLA/SLO tracking.
Experience with CI/CD, deployment automation, upgrades, and rollback mechanisms.
Good understanding of security, authentication, authorization, and integration with customer data center environments.
Nice to have:
Deep understanding of GenAI / LLM inference infrastructure, including model serving, scaling, batching, latency, throughput, and resource utilization.
Experience with production AI inference clusters using GPUs, AI accelerators, or other specialized compute infrastructure.
Understanding of how distributed inference systems operate, including scheduling, load balancing, autoscaling, failover, and cluster-level observability.
Experience with LLM serving frameworks such as vLLM, Triton Inference Server, TensorRT-LLM, or similar.
Familiarity with GPU/accelerator orchestration, device plugins, resource scheduling, and cluster capacity planning.
Familiarity with GPU communication technologies such as GPUDirect RDMA, NCCL, NVLink, or UALink.
Experience optimizing communication for distributed AI/ML workloads.
Knowledge of Prometheus, Grafana, OpenTelemetry, Helm, Argo CD, Istio, KServe, Kubeflow, or similar tools.
Experience deploying software in on-prem, edge, private cloud, or hybrid environments.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720593
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Job Type: Full Time and Hybrid work
Lead the software architecture and technical roadmap for next-generation ultra low-latency AI-SuperNIC software stack, including drivers, firmware, libfabric, and libibverbs.
Define the partitioning and interfaces between hardware, firmware, Kernel drivers, user-space libraries, and AI frameworks.
Lead the design and implementation of high-performance networking, RDMA, and GPU-direct communication capabilities.
Drive software support for emerging technologies and standards such as UEC, UALink, MRC and RoCEv2 ecosystems.
Work closely with hardware, system architecture, and VLSI teams to optimize performance, scalability, and feature delivery.
Define performance goals and lead profiling, benchmarking, and optimization efforts for GenAI and distributed AI workloads.
Collaborate with customers, partners, and open-source communities to ensure ecosystem compatibility and adopti
Requirements:
BSc/MSc in Computer Science, Electrical Engineering, or a related field.
7+ years of experience in software architecture, networking software, or system software development.
Strong experience developing Linux Kernel drivers, firmware, and user-space networking software.
Deep understanding of data center networking, including Ethernet, TCP/IP, routing, switching and congestion management
Proven experience defining software architectures that span hardware, firmware, Kernel, and user-space components.
Strong programming skills in C / C ++ and experience with Linux -based development environments.
Experience leading cross-functional technical initiatives and collaborating with hardware and system architecture teams.
Excellent analytical, debugging, and performance optimization skills.
Nice to Have:
Experience with RDMA technologies and low-latency networking architectures.
Experience with lib
This position is open to all candidates.
 
Show more...
הגשת מועמדות
עדכון קורות החיים לפני שליחה
8718880
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 10 שעות
חברה חסויה
Location: Caesarea
Job Type: Full Time
we are seeking a Lead System Architect to join our system architecture team and help define the next generation of our AI-SuperNIC scale-out chip.
AI scale-out communication is a critical element in modern data centers, and emerging standards such as Ultra Ethernet aim to address this challenge. This role focuses on defining a high-performance Smart NIC architecture optimized for GPU-centric AI workloads, with emphasis on low-latency, high-bandwidth data movement.
You will work across hardware and software domains, collaborating closely with AI, platform, driver, and VLSI teams to design a competitive scale-out networking solution.
Responsibilities:
Lead the software architecture and technical roadmap for or next-generation ultra low-latency AI-SuperNIC software stack, including drivers, firmware, libfabric, and libibverbs.
Define the partitioning and interfaces between hardware, firmware, kernel drivers, user-space libraries, and AI frameworks.
Lead the design and implementation of high-performance networking, RDMA, and GPU-direct communication capabilities.
Drive software support for emerging technologies and standards such as UEC, UALink, MRC and RoCEv2 ecosystems.
Work closely with hardware, system architecture, and VLSI teams to optimize performance, scalability, and feature delivery.
Define performance goals and lead profiling, benchmarking, and optimization efforts for GenAI and distributed AI workloads.
Collaborate with customers, partners, and open-source communities to ensure ecosystem compatibility and adoption.
Mentor software engineers and provide technical leadership across firmware, driver, and networking software development
Requirements:
BSc/MSc in Computer Science, Electrical Engineering, or a related field.
7+ years of experience in software architecture, networking software, or system software development.
Strong experience developing Linux kernel drivers, firmware, and user-space networking software.
Deep understanding of data center networking, including Ethernet, TCP/IP, routing, switching and congestion management
Proven experience defining software architectures that span hardware, firmware, kernel, and user-space components.
Strong programming skills in C/C++ and experience with Linux-based development environments.
Experience leading cross-functional technical initiatives and collaborating with hardware and system architecture teams.
Excellent analytical, debugging, and performance optimization skills.
Nice to Have:
Experience with RDMA technologies and low-latency networking architectures.
Experience with libfabric, libibverbs, RDMA-core, DPDK, SPDK, or similar infrastructure software.
Familiarity with GPU communication technologies such as GPUDirect RDMA, NCCL, NVLink, or UALink.
Experience optimizing communication for distributed AI/ML workloads.
Contributions to open-source networking or Linux kernel projects.
Experience working on SmartNICs, DPUs, NICs, or networking ASICs.
Deep understanding of GenAI/ML infrastructure and distributed workloads
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720570
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 10 שעות
Location: Caesarea
Job Type: Full Time
We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes.
As AI workloads increasingly move toward Kubernetes-native infrastructure, we are building systems that support distributed inference, performance optimization, reliability, observability, and production-grade deployment at scale.
This role is ideal for an engineer who can reason deeply about systems, performance, tradeoffs, and reliability, and who is comfortable owning difficult technical decisions end-to-end.
You will work across inference serving, distributed systems, optimization, and Kubernetes-native AI infrastructure.
What Youll Do:
Build and optimize high-performance Kubernetes-native GenAI inference systems
Work with modern inference stacks such as vLLM, SGLang, TensorRT-LLM, and related tooling
Work with Kubernetes-native distributed LLM inference frameworks such as llm-d and NVIDIA Dynamo
Design and implement optimization algorithms and performance improvements
Improve reliability, observability, deployment, and operational maturity of AI systems
Make architectural decisions and take ownership of technical outcomes
Collaborate with a small, senior engineering team focused on performance and production quality
Requirements:
Minimum 5 years of experience as a Software Engineer, with strong software engineering and system design skills.
Programming experience in Go and Python
Hands-on experience with the Kubernetes ecosystem, including Operators, service meshes, GitOps, Gateway API, and OpenTelemetry
Experience with cloud platforms
Strong understanding of optimization algorithms and performance engineering
Ability to independently drive technical initiatives from concept to production
Strong systems thinking and debugging skills
Comfort operating in environments with high autonomy and responsibility
Nice to Have:
Experience with modern LLM inference frameworks such as vLLM, SGLang, or TensorRT-LLM
Experience with distributed LLM inference frameworks such as llm-d or NVIDIA Dynamo
Contributions to open-source Kubernetes or ML infrastructure projects
GPU performance optimization and profiling experience
Familiarity with CUDA, NCCL, or Triton kernels
Experience running GenAI systems at scale in production
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720579
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Caesarea
Job Type: Full Time and Hybrid work
Performance Architect
What You'll Do:
Join our Silicon One architecture team, the core of silicon development. Our architects manage all aspects of system chip development and provide specifications to various development teams. In this role you will:
Define the features and specifications of future devices.
Utilize a data-driven approach to model and analyze networks of tomorrow, providing optimal solutions for our customers.
Model, analyze, and present simulation results for cutting-edge networking solutions across various use cases.
Apply strong networking research skills and a robust theoretical background to your work.
Who You'll Work With
You will be part of our Silicon One architecture team, which is central to our ASIC group.
Our team, which operates with a startup mentality within a stable and leading corporation, drives the development of next-generation networking devices.
Our design center is unique, hosting all silicon hardware and software development disciplines at one site. We are revolutionizing the industry by building a new internet for AI networks and the 5G era, with a unified, programmable silicon architecture that will underpin all of our future routing and switching products.
Our devices are engineered to be adaptable across service providers and web-scale markets, designed for both fixed and modular platforms. They deliver high speed without sacrificing programmability, buffering, power efficiency, scale, or feature flexibility. We are set to be a transformative technology for decades to come.
Requirements:
Minimum Requirements
Software Development Skills: Proficiency in C++ and Python.
Research Skills: Experience researching networking solutions.
Self-Learning Ability: Capability to quickly grasp new concepts and technologies from papers and specifications.
Presentation Skills: Effective in communicating and presenting complex technical concepts.
Curiosity & Innovation: A passion for innovation, with strong analytical skills and meticulous attention to detail.
Team Player: Proven ability to collaborate and contribute to team goals.
Technical Documentation: Strong writing skills for creating technical documents.
Preferred/Advantageous Qualifications
Versatility: Adaptable to diverse tasks within the networking architecture domain.
Network Modeling Experience: Familiarity with tools like ns-3 or OMNeT++.
AI Knowledge: Familiarity with AI concepts.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8717196
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Caesarea
Job Type: Full Time and Hybrid work
We are seeking a highly motivated and experienced Senior Software Engineer for a position focused on developing innovative network device monitoring and visualization solution.
Meet the Team:
This position is part of our Silicon One organization, based in Israel. Silicon One is the foundation of our industry-leading networking hardware products, pushing the boundaries of technology and driving the next generation of high-performance, scalable solutions.
Your Impact:
Design and develop visualization capabilities within our SDK environment
Lead the development of a brand-new comprehensive self-monitoring system for network devices from the ground up
Implement a comprehensive model for capturing and managing network state information
Drive cross-team collaboration and technical strategy
Work closely with senior technical leadership
Create and refine technical requirements and system designs
Build new systems and frameworks from the ground up.
Requirements:
Minimum Qualifications:
Bachelor's degree in Computer Science, Software Engineering, or related field
Minimum 7 years of software development experience
Proficiency in C++ and low-level programming
Demonstrated ability to design and implement complex software solutions
Proven track record of technical leadership
Preferred Qualifications:
Experience with ASIC and Network technologies
Background in large-scale distributed systems
Expertise in developing high-performance software that handles billions of packets
Advanced system design and architectural skills
Deep understanding of network state modeling and monitoring systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8717211
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Caesarea
Job Type: Full Time
As a growing startup, this role will continue to evolve, and were looking for team members who are comfortable building operational processes that scale with the company.

The Operations Lead is responsible for end-to-end execution of customer deployments, owning all non-engineering aspects of delivery across pilots, initial deployments, and scaled rollouts. This role operates within a cross-functional scrum led by a Field Deployment Engineer (FDE), ensuring projects are delivered on time, at quality, and at scale.

Responsibilities:

Own end-to-end deployment execution, including planning, timelines, and rollout strategy
Drive project plans, milestones, and execution tracking across multiple workstreams
Manage field operations, installers, vendors, and site readiness
Coordinate cross-functional teams (Engineering, Product, Sales, Customer Success)
Act as the operational point of contact for customers during deployments
Identify and mitigate risks, ensuring on-time delivery and high-quality execution
Own logistics and onsite readiness for pilots, deployments, and rollouts
Build scalable processes, templates, and best practices for deployment operations
Requirements:
5+ years in operations, program management, deployment, or delivery roles
Up to 30% travel
Fluent English - a must
Proven experience managing complex, cross-functional projects
Strong organizational and execution skills with attention to detail
Experience working with field operations or external vendors
Excellent communication and stakeholder management skills
Ability to thrive in fast-paced, dynamic startup environments
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8701262
סגור
שירות זה פתוח ללקוחות VIP בלבד