דרושים » תוכנה » Senior SW Engineer - AI Infrastructure & Optimization

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
Location: Caesarea
Job Type: Full Time
We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes.
As AI workloads increasingly move toward Kubernetes-native infrastructure, we are building systems that support distributed inference, performance optimization, reliability, observability, and production-grade deployment at scale.
This role is ideal for an engineer who can reason deeply about systems, performance, tradeoffs, and reliability, and who is comfortable owning difficult technical decisions end-to-end.
You will work across inference serving, distributed systems, optimization, and Kubernetes-native AI infrastructure.
What Youll Do:
Build and optimize high-performance Kubernetes-native GenAI inference systems
Work with modern inference stacks such as vLLM, SGLang, TensorRT-LLM, and related tooling
Work with Kubernetes-native distributed LLM inference frameworks such as llm-d and NVIDIA Dynamo
Design and implement optimization algorithms and performance improvements
Improve reliability, observability, deployment, and operational maturity of AI systems
Make architectural decisions and take ownership of technical outcomes
Collaborate with a small, senior engineering team focused on performance and production quality
Requirements:
Minimum 5 years of experience as a Software Engineer, with strong software engineering and system design skills.
Programming experience in Go and Python
Hands-on experience with the Kubernetes ecosystem, including Operators, service meshes, GitOps, Gateway API, and OpenTelemetry
Experience with cloud platforms
Strong understanding of optimization algorithms and performance engineering
Ability to independently drive technical initiatives from concept to production
Strong systems thinking and debugging skills
Comfort operating in environments with high autonomy and responsibility
Nice to Have:
Experience with modern LLM inference frameworks such as vLLM, SGLang, or TensorRT-LLM
Experience with distributed LLM inference frameworks such as llm-d or NVIDIA Dynamo
Contributions to open-source Kubernetes or ML infrastructure projects
GPU performance optimization and profiling experience
Familiarity with CUDA, NCCL, or Triton kernels
Experience running GenAI systems at scale in production
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720579
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
חברה חסויה
Location: Caesarea
Job Type: Full Time
we are seeking a Lead System Architect to join our system architecture team and help define NR-NEXUS, our next-generation AI inference platform.
Responsibilities:
Lead the software architecture and technical roadmap for NR-Nexus
Write system specifications for NR-Nexus product
Research AI infrastructure, SaaS platforms, model serving, and inference trends
Work with engineering to translate technical capabilities into product value
Work closely with engineering teams to optimize performance, scalability, and feature delivery.
Define performance goals and lead profiling, benchmarking, and optimization efforts for GenAI and distributed AI workloads.
Collaborate with customers, partners, and open-source communities to ensure ecosystem compatibility and adoption.
Mentor software engineers and provide technical leadership
Requirements:
7+ years of software engineering experience, including 3+ years in software architecture or technical leadership.
Strong experience with Kubernetes-based platforms and cloud-native architecture.
Deep understanding of Gen AI/LLM infrastructure and distributed workloads
Experience designing management software or SaaS platforms for production systems.
Strong background in distributed systems, microservices, APIs, and automation.
Hands-on experience with observability stacks, monitoring, logging, alerting, and SLA/SLO tracking.
Experience with CI/CD, deployment automation, upgrades, and rollback mechanisms.
Good understanding of security, authentication, authorization, and integration with customer data center environments.
Nice to have:
Deep understanding of GenAI / LLM inference infrastructure, including model serving, scaling, batching, latency, throughput, and resource utilization.
Experience with production AI inference clusters using GPUs, AI accelerators, or other specialized compute infrastructure.
Understanding of how distributed inference systems operate, including scheduling, load balancing, autoscaling, failover, and cluster-level observability.
Experience with LLM serving frameworks such as vLLM, Triton Inference Server, TensorRT-LLM, or similar.
Familiarity with GPU/accelerator orchestration, device plugins, resource scheduling, and cluster capacity planning.
Familiarity with GPU communication technologies such as GPUDirect RDMA, NCCL, NVLink, or UALink.
Experience optimizing communication for distributed AI/ML workloads.
Knowledge of Prometheus, Grafana, OpenTelemetry, Helm, Argo CD, Istio, KServe, Kubeflow, or similar tools.
Experience deploying software in on-prem, edge, private cloud, or hybrid environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720593
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
Location: Caesarea
Job Type: Full Time
we are looking for a Senior Machine Learning Engineer to work on LLM-based systems with a strong emphasis on integration, end-to-end workflows, and production readiness. This role combines performance optimization (prefill/decode, throughput, latency) with hands-on work integrating components across the AI platform.
A significant part of the role involves building end-to-end integration flows and tests, particularly around token generation pipelines and system orchestration, as well as contributing to intelligent system behavior such as hardware selection and execution strategies. The role also includes developing system-level logic in Python for multi-tenant management, caching strategies, and service lifecycle management across the platform.
***This is not a Data Science position***
Responsibilities:
Build and maintain end-to-end integration flows across the AI inference pipeline (serving, orchestration, APIs, and infrastructure)
Design, implement, and optimize LLM inference workflows, including prefill and decode stages
Improve system performance with focus on throughput, latency, and interactivity
Write production-grade components in Python and integrate them into the broader system
Contribute to system-level logic such as smart hardware selection and execution strategies
Integrate models (open source and custom), services, and APIs into cohesive, reliable end-to-end application pipelines
Requirements:
4+ years of experience in software engineering or machine learning engineering
Strong proficiency in Python
Strong experience with LLM inference systems and performance optimization
Hands-on experience with system integration and end-to-end workflows
Experience with inference frameworks such as vLLM, TensorRT, SGLang etc
Experience working with GPU/accelerator-based systems
Preferred Qualifications:
Hands-on experience with Dynamo and LLM-D for LLM inference and serving
Familiarity with Kubernetes and cloud environments
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8720590
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Caesarea
Job Type: Full Time and Hybrid work
We are seeking a highly motivated and experienced Senior Software Engineer for a position focused on developing innovative network device monitoring and visualization solution.
Meet the Team:
This position is part of our Silicon One organization, based in Israel. Silicon One is the foundation of our industry-leading networking hardware products, pushing the boundaries of technology and driving the next generation of high-performance, scalable solutions.
Your Impact:
Design and develop visualization capabilities within our SDK environment
Lead the development of a brand-new comprehensive self-monitoring system for network devices from the ground up
Implement a comprehensive model for capturing and managing network state information
Drive cross-team collaboration and technical strategy
Work closely with senior technical leadership
Create and refine technical requirements and system designs
Build new systems and frameworks from the ground up.
Requirements:
Minimum Qualifications:
Bachelor's degree in Computer Science, Software Engineering, or related field
Minimum 7 years of software development experience
Proficiency in C++ and low-level programming
Demonstrated ability to design and implement complex software solutions
Proven track record of technical leadership
Preferred Qualifications:
Experience with ASIC and Network technologies
Background in large-scale distributed systems
Expertise in developing high-performance software that handles billions of packets
Advanced system design and architectural skills
Deep understanding of network state modeling and monitoring systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8717211
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Caesarea
Job Type: Full Time
As a growing startup, this role will continue to evolve, and were looking for team members who are comfortable building operational processes that scale with the company.

The Operations Lead is responsible for end-to-end execution of customer deployments, owning all non-engineering aspects of delivery across pilots, initial deployments, and scaled rollouts. This role operates within a cross-functional scrum led by a Field Deployment Engineer (FDE), ensuring projects are delivered on time, at quality, and at scale.

Responsibilities:

Own end-to-end deployment execution, including planning, timelines, and rollout strategy
Drive project plans, milestones, and execution tracking across multiple workstreams
Manage field operations, installers, vendors, and site readiness
Coordinate cross-functional teams (Engineering, Product, Sales, Customer Success)
Act as the operational point of contact for customers during deployments
Identify and mitigate risks, ensuring on-time delivery and high-quality execution
Own logistics and onsite readiness for pilots, deployments, and rollouts
Build scalable processes, templates, and best practices for deployment operations
Requirements:
5+ years in operations, program management, deployment, or delivery roles
Up to 30% travel
Fluent English - a must
Proven experience managing complex, cross-functional projects
Strong organizational and execution skills with attention to detail
Experience working with field operations or external vendors
Excellent communication and stakeholder management skills
Ability to thrive in fast-paced, dynamic startup environments
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8701262
סגור
שירות זה פתוח ללקוחות VIP בלבד