דרושים » תוכנה » Senior AI and MLOps Engineer - Security and Networking Research

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/03/2026
Job Type: Full Time
We're looking for a Senior AI/MLOps Engineer to join a group that specializes in Security and Networking, and specifically ML, AI and agent development. As a Senior AI/MLOps Engineer, youll build and maintain the infrastructure, tools and processes necessary to support the AI lifecycle in a production environment. You will collaborate closely with data scientists, software engineers, security architects and DevOps teams to ensure smooth deployment, modeling and optimization of AI models. This role involves creative problem solving alongside engineering teams, and is pivotal for the continued success of AI networking security.

What youll be doing:

Developing, improving and optimizing scalable infrastructure for handling and deploying security and networking AI models and agents in production, ensuring high availability, scalability, reproducibility, and performance.

Optimizing AI models and agents for performance, scalability, and resource utilization, considering factors such as latency, efficiency, and cost.

Monitoring and deploying agentic systems, LLMs, and ML models in production.

Designing and implementing frameworks/pipelines for AI training, inference, and experimentation.

Collaborating closely with data scientists, security architects and software engineers to operationalize and deploy AI models and agents, including packaging and integration with existing systems. Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

Collaborating with DevOps teams to integrate pipelines and workflows into the CI/CD process, ensuring flawless deployments and rollbacks.

Building and maintaining monitoring and alerting systems to proactively identify and resolve issues relating to quality, performance and infrastructure.

Implementing access controls, authentication mechanisms, and encryption standards for AI models and data.

Documenting guidelines, and standard operating procedures for MLOps/AI processes and sharing knowledge with the wider team.

Develop proof-of-concepts for new features.
Requirements:
What we need to see:

BSc/MSc in CS/CE or related field (or equivalent experience).

Strong background in AI with experience deploying and monitoring AI/ML models, LLMs and agents to production systems at scale, including distributed and multi-node environments - at least 5 years of experience.

Proficiency in programming languages such as Python, Java, or Scala, along with experience in using ML/AI frameworks and libraries (e.g. TensorFlow, PyTorch).

Proficiency in microservices architecture, container orchestration, cloud platforms, and scalable infrastructure for training and inference workloads.

Knowledge of inference optimization techniques.

Understanding of build infrastructure and CI/CD tools and practices (e.g. GitLab, GitHub Actions, Jenkins).

You are detail-oriented and care deeply about robust, well tested, high-performance code in production environments.

You are proactive, take full ownership of your deliverables, have a can-do approach, and excellent communication and collaboration skills, able to work effectively in multifunctional teams.

Ways to stand out from the crowd:

Knowledge of network protocols and Linux internals.

Security and networking background, with knowledge of security protocols, network architectures, firewalls, intrusion detection systems, and other relevant security and networking concepts.

Experience deploying and optimizing generative models and agents.

Knowledge of network security principles and practices.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8586605
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/02/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
Were growing fast, and our team is passionate about pushing AI engineering to new heights - solving complex problems in LLM training, inference optimization, reasoning, and agent orchestration at scale.
About the Role:
As a Machine Learning Engineer, youll work on cutting-edge
code-focused LLMs and AI agent systems
that power next-generation developer platform. Youll be at the center of research, model training, and productionization of intelligent systems that understand software deeply, collaborate with developers, and help automate engineering workflows end-to-end. Your work will immediately impact millions of engineers worldwide.
Responsibilities:
Push LLM Innovation: Research, design, and fine-tune domain-specific LLMs for code generation, refactoring, debugging, and multi-turn reasoning.
Agent-Oriented Development: Build multi-agent coding systems that integrate retrieval-augmented generation (RAG), code execution, testing, and tool use to create autonomous, context-aware coding workflows.
Production-Grade AI: Own the training-to-inference pipeline for large code models-optimize inference with quantization, distillation, and caching techniques.
Rapid Experimentation: Prototype and validate ideas quickly; leverage reinforcement learning, human feedback, and synthetic data generation to push accuracy and reasoning.
Cross-Functional Collaboration: Partner with product, engineering, and design teams to ship AI-powered features that help developers focus on high-impact work.
Scale the Platform: Contribute to distributed training, scalable serving systems, and GPU/TPU-efficient architectures for ultra-low-latency developer tools.
Requirements:
2+ years of hands-on experience designing, training, and deploying machine-learning models
M.Sc. or higher in Computer Science / Mathematics / Statistics or equivalent from a university, or B.Sc. with strong hands-on ML experience
Practical experience with Natural Language Processing (NLP) and LLMs
Experience with data acquisition, data cleaning, and data pipelines
A passion for building products and helping people, both customers and colleagues
All-around team player, fast, self-learning individual
Nice to have:
3+ years of development experience with a passion for excellence
Experience building AI coding assistants, code reasoning models, or dev-focused LLM agents.
Familiarity with RAG, function-calling, and tool-using LLMs.
Knowledge of model optimizations (quantization, distillation, LoRA, pruning).
Startup or product-driven ML experience, especially in high-scale, latency-sensitive environments.
Contributions to open-source AI or developer tools.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8556109
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
18/03/2026
Location: Yokne`am
Job Type: Full Time
Our System Product Engineering (SPE) organization is looking for a Security Software Engineer to join the TDE (Test Development Engineering) Security Team. This role focuses on designing, developing, and deploying security-critical software that protects our next-generation products throughout development, validation, and production. You will work on the most sensitive and business-critical portions of the SPE delivery and production pipelines, taking ownership of production-line security and collaborating closely with DFT, architecture, test development, and validation teams to ensure security is built in end-to-end.

What youll be doing:

Design and develop security-critical backend services, libraries, and tooling that protect SPE systems, validation flows, and production delivery pipelines.

Own and implement production-line security mechanisms, ensuring secure bring-up, test, validation, and manufacturing handoff.

Develop and integrate security features such as authentication, attestation, secrets and key management, integrity checks, and audit mechanisms.

Build secure automation frameworks and tooling embedded into test execution, validation, and manufacturing workflows.

Collaborate closely with DFT (Design for Test), Architecture, test writers, and validation teams to define security requirements and translate them into robust, scalable software solutions.

Participate in secure design reviews, threat modeling, security features for SPE board components and production flows.

Improve the security posture, reliability, observability, and maintainability of SPE systems and services.

Support and influence secure CI/CD and release pipelines, including vulnerability detection, policy enforcement, and controlled deployments.

Investigate, debug, and remediate security vulnerabilities and systemic weaknesses across SPE systems spanning development through production.
Requirements:
What we need to see:

Bachelors or Masters degree in Electrical engineering, Computer Science, Software Engineering, or a related field.

5+ years of professional software engineering experience, with strong ownership of backend systems in Python.

Proven experience developing production-quality software and automation passion.

Solid understanding of OOP, software concepts and system design principles.

Strong familiarity with Linux environments, system services, and system-level troubleshooting.

Excellent debugging, problem-solving, and code review skills.

Experience working in cross-functional engineering environments.

Proficiency with version control systems and collaborative development workflows.

Ways to stand out from the crowd:

Experience securing production, manufacturing, or product bring-up pipelines.

Background in product security, platform security, or DevSecOps.

Experience working with or alongside DFT, hardware architecture, or validation teams.

Knowledge of cryptography concepts, secure provisioning, and key management systems.

Experience securing CI/CD pipelines for large-scale engineering organizations.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8583556
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/03/2026
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Site Reliability Engineer on the SASE Platform team, you will play a critical role in building and operating highly available, secure, and globally distributed services. Your mission is to ensure our cloud-native security and networking platform is reliable, scalable, and performant from day one, protecting the users, applications, and data for the world's largest enterprises as they adopt cloud, remote work, and AI
Your Impact:
Proactively collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages.
Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance.
Build and operate automation for provisioning, deploying, and managing global infrastructure using Infrastructure as Code (IaC).
Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments.
Drive observability best practices, including metrics, logs, traces, and SLIs/SLOs to enable data-driven incident analysis.
Participate in on-call rotations, reducing mean time to resolution (MTTR) through automation and proactive reliability improvements.
Challenge existing processes by championing reliability, security, and operational maturity across the organization.
Requirements:
Your Experience
5+ years of experience working with Unix/Linux systems, including shell, tools, networking, and kernel concepts.
2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms.
Proven experience operating workloads in public cloud environments (e.g., AWS, GCP, Azure) at scale.
Proficiency in building automation and tools in at least one scripting or programming language (e.g., Python, Go, Java).
Strong experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Bachelors degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience.
Nice to have:
Deep expertise in designing and operating monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK Stack).
Advanced networking expertise, including TCP/IP, DNS, BGP, routing, and cloud networking concepts relevant to SASE architectures.
Prior experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms.
Familiarity with using AI/LLM technologies to improve operational workflows (e.g., incident analysis, automation).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8587419
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
At our company youll be part of a global team that breaks barriers to redefine cybersecurity guided by our BRAVE core values:

Bold in how we dream and innovate

Responsive to feedback, challenges and opportunities

Accountable for results and best in class outcomes

Visionary in future focused problem-solving

Exceptional in execution and impact

we are a global leader in human- and agent-centric cybersecurity. We protect how people, data, and AI agents connect across email, cloud, and collaboration tools. Over 80 of the Fortune 100, 10,000 large enterprises, and millions of smaller organizations trust to stop threats, prevent data loss, and build resilience across their people and AI workflows. Our mission is simple safeguard the digital world and empower people to work securely and confidently. Join us in our pursuit to defend data and protect people.

How We Work
At our company youll be part of a global team that breaks barriers to redefine cybersecurity guided by our BRAVE core values

Bold in how we dream and innovate

Responsive to feedback, challenges and opportunities

Accountable for results and best in class outcomes

Visionary in future focused problem-solving

Exceptional in execution and impact

Location: Tel Aviv

The Role
Were looking for a hands-on Technical Lead / Architect to design, build, and guide the development of scalable, secure, and resilient systems. As a key member of our engineering team, youll own architectural decisions and lead the delivery of a large-scale SaaS platform that powers mission-critical experiences for our customers. Youll collaborate with seasoned engineers and product leaders to shape and implement our technology vision - spanning everything from REST APIs and distributed microservices to data pipelines and analytics infrastructure. Youll work with modern open-source tools, cloud-native frameworks, and DevOps practices to deliver solutions that scale efficiently and securely.

Your Day-to-Day

Lead the design and implementation of core backend systems and microservice architectures.

Drive technical decision-making, ensuring performance, scalability, and security-by-design.

Collaborate cross-functionally with product, DevOps, and data teams to align architecture with business goals.

Champion code quality, automation, observability, and engineering excellence.

Mentor engineers, conduct design and architecture reviews, and raise the teams technical bar.

Work closely with data scientists to integrate machine learning / AI models into the system.

Stay ahead of emerging technologies, frameworks, and cybersecurity best practices relevant to SaaS architecture.
Requirements:
10+ years of experience building and scaling distributed systems or SaaS platforms.

Proven technical leadership and a track record of delivering high-quality, impactful products.

Intellectual curiosity and commitment to continuous learning.

Deep customer focus: you start with the user and work backward to create value.

Proficiency in Python (FastAPI, Flask).

Experience with cloud platforms (AWS preferred) and containerized deployments (Docker, Kubernetes).

Solid Linux fundamentals and CI/CD experience.

Understanding of lean software development and data-informed decision-making.

A mindset of continuous improvement, questioning the status quo and driving innovation.

BSc in Computer Science or equivalent experience.

Bonus: exposure to security-by-design, data protection, and regulatory compliance (SOC 2, ISO 27001, etc.).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8559707
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
3 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
The DevOps Engineer builds, automates, and operates cloud‑native infrastructure across AWS and Red Hat OpenShift, enabling scalable, secure, and reliable application delivery. This role combines hands‑on platform engineering, CI/CD automation, container orchestration, and the integration of AI‑powered tools for observability, anomaly detection, and operational efficiency. The engineer collaborates closely with development, security, and SRE teams to streamline deployments and improve system resilience.
Core Responsibilities
Cloud & Platform Engineering
Design, deploy, and maintain cloud‑native infrastructure on AWS (EC2, VPC, IAM, EKS, S3, RDS, Lambda).
Operate and optimize Red Hat OpenShift clusters, including cluster upgrades, operator management, and workload orchestration.
Implement Infrastructure‑as‑Code using Terraform, CloudFormation, or Ansible.
Build secure, scalable network architectures including VPC design, load balancing, service mesh, and ingress/egress controls.
CI/CD & Automation
Develop and maintain CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, or Argo Workflows.
Automate build, test, and deployment workflows for microservices and containerized applications.
Implement GitOps practices using Argo CD or Flux.
Create reusable automation modules and scripts in Python, Bash, or Go.
Containers & Kubernetes
Manage containerized workloads using Docker, Kubernetes, and OpenShift Operators.
Configure namespaces, RBAC, secrets, ConfigMaps, and resource quotas.
Troubleshoot cluster performance, networking, and scheduling issues.
Support service mesh technologies (Istio, Linkerd) when applicable.
AI‑Driven Operations
Integrate AI/ML‑based tools for monitoring, anomaly detection, predictive scaling, and automated remediation.
Work with data and platform teams to operationalize AI/ML pipelines on Kubernetes or OpenShift.
Evaluate emerging AI‑Ops platforms and contribute to automation strategies.
Observability & Reliability
Implement monitoring, logging, and tracing using Prometheus, Grafana, ELK, Loki, CloudWatch, or Datadog.
Build alerting, dashboards, and SLO‑based reliability metrics.
Participate in on‑call rotations and incident response, driving root‑cause analysis and long‑term fixes.
Security & Compliance
Apply DevSecOps practices including image scanning, secrets management, and policy enforcement.
Work with security teams to implement IAM best practices, encryption, and compliance controls.
Integrate tools such as Vault, OPA/Gatekeeper, or Kyverno.
Requirements:
2-5 years of experience in DevOps, cloud engineering, or platform operations.
Strong hands‑on experience with AWS services and cloud architecture fundamentals.
Practical experience with Kubernetes and Red Hat OpenShift.
Proficiency with Terraform, Ansible, or similar IaC tools.
Experience building CI/CD pipelines and automating deployments.
Solid Linux administration and networking fundamentals.
Scripting skills in Python, Bash, or Go.
Understanding of container security, cloud security, and DevSecOps practices.
Preferred Qualifications
Certifications: AWS Solutions Architect, CKA/CKAD, Red Hat OpenShift, Terraform Associate.
Experience with AI‑Ops platforms or ML pipeline orchestration.
Familiarity with service mesh, API gateways, or event‑driven architectures.
Experience with multi‑cluster or hybrid cloud environments.
Background in SRE practices (SLOs, error budgets, chaos engineering).
What Success Looks Like
Reliable, automated, and secure cloud‑native infrastructure supporting rapid development cycles.
Stable and observable Kubernetes/OpenShift environments with clear operational metrics.
Reduced manual work through automation and AI‑driven insights.
Strong collaboration with engineering teams and continuous improvement of DevOps practices.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8597223
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
6 ימים
חברה חסויה
Location: Yokne`am
Job Type: Full Time
in this role, you will help build and evolve systems that support performance analysis, telemetry, and optimization for large-scale gpu- and cpu-based clusters used in ai and high-performance computing environments. you will work closely with hardware, networking, firmware, and software teams to collect, analyze, and interpret performance data from live systems. this is a fast-paced r&d environment where system behavior and requirements evolve rapidly, requiring adaptable engineering solutions and strong analytical thinking.
what youll be doing:
profile, benchmark, and analyze ai and hpc workloads on gpu and cpu clusters
explore performance characteristics of high-performance networking and collective communications (e.g., nccl, rdma, mpi, roce)
identify performance bottlenecks across networking, compute, memory, and system architecture
develop and enhance performance analysis, benchmarking, and diagnostic tools
define performance TEST plans and establish expectations for new technologies and platforms
collaborate across hardware, firmware, networking, systems, and software teams to provide actionable performance insights
support telemetry collection and data refinement efforts to enable accurate performance analysis
maintain high standards for  data quality, reproducibility, and traceability of performance results
Requirements:
what we need to see:
b.sc. or m.sc. in Computer Science, computer engineering, software engineering, or equivalent experience
5+ years of experience in performance analysis, systems engineering, or hpc/ai infrastructure
demonstrated expertise in performance analysis skills and methodologies
hands-on experience with high-performance networking (rdma, mpi, nccl, congestion control)
strong understanding of  system performance metrics (latency, throughput, resource utilization)
exposure to hardware, firmware, or Embedded telemetry environments
strong analytical, problem-solving, and communication skills
ability to work effectively in cross-functional, fast-paced r&d teams
ways to stand out from the crowd:
knowledge of cuda, nccl internals, and congestion control algorithms
deep system -level understanding of cpu architectures, gpus, hcas, memory, and pcie
experience with nvidia gpus, cuda, and deep learning frameworks such as pytorch or tensorflow
experience with cloud platforms 
proficiency in  Python ; experience with bash and C / C ++ is a plus as well as a strong experience working in  Linux environments
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8594112
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
18/03/2026
Location: Yokne`am
Job Type: Full Time
We are looking for a Senior networking test engineer with strong system‑level debugging skills to join our End‑to‑End Verification team. You will work on cutting‑edge Ethernet‑based AI clusters, owning complex issues across hardware, system software and AI workloads. NVIDIA is widely considered to be one of the technology worlds most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

What youll be doing:

Design and review test and product requirements across the Ethernet / NIC / DPU / Switch portfolio, focusing on large‑scale AI cluster behavior.

Build and maintain realistic customer‑like testbeds, including heterogeneous hardware, OS / driver combinations and complex network fabrics.

Own end‑to‑end cluster troubleshooting: reproduce customer scenarios, triage across the stack and drive issues to root cause and fix.

Read and understand relevant source code to identify defects, validate fixes and improve logging and instrumentation.

Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments.

Define tests and guide the automation team to implement robust suites that produce actionable logs, metrics and traces.

Run Regression, Performance, Functional and Scale testing, analyze results and provide clear, data‑driven reports to stakeholders.

Profile and benchmark deep learning training and inference workloads, correlating model‑level metrics with system and network telemetry to uncover bottlenecks.
Requirements:
What we need to see:

B.A./B.Sc. in Computer Science, Electrical Engineering, or equivalent IT/Network/Systems experience.

5+ years of hands‑on networking or system‑level testing and debugging on Linux.

Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2).

Proven production‑grade debugging experience: forming hypotheses, running experiments, and driving issues to root cause under pressure.

Expertise in host‑side NIC validation and tuning (offloads, queues, interrupts, firmware/driver interactions).

Strong knowledge of AI networking libraries (such as NCCL) and protocols (such as RoCE and RDMA), including performance and correctness debugging.

Ability to read and reason about source code (C/C++/Python or similar) and collaborate closely with developers on fixes.

Solid scripting and automation skills with Bash / Python / Ansible for setup, log collection, and experiment orchestration.

Fast learner, familiar with modern AI tools and workflows, able to adapt quickly.

Excellent analytical, problem‑solving and communication skills, with strong ownership and a collaborative mindset.

Ways to stand out from the crowd:

Hands‑on debugging of collective communication libraries (for example NCCL) or large‑scale LLM training / inference clusters.

Experience with large cluster environments (tens to thousands of GPUs or nodes), including incident response and post‑mortem analysis.

Deep expertise in tuning and debugging congestion control and lossless Ethernet for AI workloads (for example DCQCN, ECN, PFC).

Familiarity with NVIDIA networking technologies (for example BlueField / BF3, ConnectX NICs) and their software stack and diagnostics.

Experience debugging issues that span multiple layers (L2/L3, transport, AI frameworks) or contributing to open‑source networking / AI systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8584095
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
6 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
looking for a strong technical senior architect to join us in shaping the future. senior architects are innovators who can translate business needs into workable technology solutions. their expertise is deep and broad. they are hands on, producing both detailed technical work and high-level architectural designs.
as a senior architect in the ai networking research team, you will explore technological challenges on accelerate networking and building ai data centers. research new transport functions and semantics for optimizing ai workloads, ai systems communication and accelerations and much more. you will also be leading architectural and development efforts across numerous technological fields, related to the modern ai data center, such as distributed ai and deep learning solutions, data analytics, high performance computing (hpc), software defined networking (sdn), virtualization, Storage, and more.
what youll be doing:
co-design hardware features (e.g., in gpus, dpus, or interconnects) that accelerate data movement and enable new capabilities for inference and model serving. 
identify and evaluate new technologies, innovations and partner relationships for alignment with our technology roadmap and business value.
lead architecture and design of new technologies and innovations such as runtime systems, communication libraries, ai-specific technologies.
lead proof-of-concept development to evaluate and drive such technologies.
Requirements:
what we need to see:
hold a m.sc. or ph.d. in Computer Science, electrical or computer engineering from a leading university (or equivalent experience).
5+ years of industry experience (or equivalent) in system architecture, ai systems architecture, scaling of ai, parallelism of ai frameworks, or deep learning training workloads.
experienced in algorithm design, system programming, computer architecture and operating systems.
experienced in virtualization, networking and Storage.
deep understanding of performance profiling and optimization techniques, together with defining and using hardware features.
strong programming and software development skills.
ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
ways to stand out from the crowd:
shown research track record.
have experience and passion for system architecture, cpu/gpu/memory/ Storage /networking.
stellar communication skills.
knowledge in deep learning frameworks and ai communication libraries (nccl, ucx, mpi and equivalents).
deep understanding of inference and training workloads and optimizations, like prefill/decode, data parallelism, tensor parallelism, fdsp and others.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593803
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
7 ימים
Location: Yokne`am
Job Type: Full Time
we are looking for a data center network deployment engineer to join the networking clusters solutions hpc/ai infrastructure team. we are building supercomputers and ai clusters based on groundbreaking technologies. we are looking for a network/ system Engineer to be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in artificial intelligence and gpu computing.
you will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. you will interact with hpc, os, gpu compute, and systems specialist to architect, develop and bring up large scale performance platforms. does this sound like you? if so, we would love to hear from you!
what you'll be doing:
deploy, manage and maintain large scale ai data centers - control, network and Storage stack
work with multiple software and hardware teams to optimize the clusters networking health and performance
develop and implement automation scripts for network, compute and Storage operations and deployments
supporting research & development activities and engaging in pocs/povs for future improvements.
Requirements:
what we need to see:
b.sc. in engineering or ccnp certificate
3+ years of proficiency in networking fundamentals, configuring ethernet switches, understanding the tcp/ip stack, and data center architecture.
excellent knowledge of windows and Linux (redhat/centos and ubuntu) networking (sockets, firewalls, iptables, wireshark, etc.) and internals, acls and os level security protection and common protocols e.g. tcp, dhcp, dns, etc.
proactive individual with the ability to work independently, prioritizing tasks to optimize technology and enhance Customer Experience.
provides ad-hoc knowledge transfers, develops handover materials, and offers deployment support for engagements.
ways to stand out from the crowd:
combination of interpersonal skills and technical competence
knowledge of hpc and ai solution technologies from cpus and gpus to high speed interconnects and supporting software
experience with multiple Storage solutions such as lustre, gpfs, and newer and emerging Storage technologies.
automation tooling background (ansible, salt, puppet etc.).
we are widely considered to be one of the technology worlds most desirable employers! we have some of the most forward-thinking and hardworking individuals in the world working for us. if you're creative and autonomous, we want to hear from you!
#il-hybrid
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593381
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
19/03/2026
חברה חסויה
Location: Ra'anana
Job Type: Full Time
We are seeking an experienced IT/Lab Manager to lead the planning, deployment, and operations of our physical lab environment and IT systems. This role will focus on building and maintaining scalable, reliable, and secure environments to support engineering teams involved in research, quality assurance, validation, and related activities. It will also support internal collaborators. You will have an outstanding opportunity to drive innovation in a multidimensional, technology-focused company that is crafting the future of data-center and lab technologies. If you bring perfection and creative thinking while solving issues as they arise, and enjoy working with distributed teams - your place is with us!


What Youll Be Doing:
Own day-to-day operations, planning, and roadmap for the engineering lab and IT infrastructure (servers, storage, networking, and related services).
Lead and mentor an IT/Lab team, driving guidelines, standards, and a culture of ownership, partnership, and continuous improvement.
Collaborate closely with R&D, QE, Verification, and other engineering teams to design, provision, and maintain environments that meet their performance, reliability, and security needs.
Lead all aspects of running data center and lab operations, including rack layout, cabling, power and cooling, hardware lifecycle, and resource availability.
Lead procurement and vendor management for hardware, software, and services, including evaluation, negotiation, and ongoing relationship management.
Implement and maintain automation for system provisioning, configuration, and operations using tools such as shell/Perl/Ansible.
Design and maintain monitoring, logging, and alerting for servers, network, and storage systems to ensure high availability and rapid incident response.
Investigate and resolve sophisticated infrastructure issues across OS, networking, storage, virtualization, and application layers.
Requirements:
What we need to see:
B.Sc. or BA in Computer Science, Engineering, or a related field, or equivalent practical experience.
At least 10 years of overall experience in IT / systems administration, including extensive hands-on work with Linux/Unix environments.
At least 3 years of experience in a managerial or team-lead position within IT, lab, or infrastructure teams.
Vast experience with Linux/Unix system administration, including installation, configuration, troubleshooting, and performance tuning.
Demonstrable experience collaborating with engineering organizations (R&D, QE, Verification, etc.) and supporting their infrastructure needs.
Solid experience with data center and lab management, including server, network, and storage equipment deployment and lifecycle.
Demonstrated experience in procurement and vendor management for infrastructure hardware and software.
Proficiency in automation and scripting (e.g., shell, Perl, Ansible) for provisioning, configuration, and operational tasks.
Hands-on experience with monitoring and alerting solutions for infrastructure and services.
Strong debugging skills and experience resolving complex, cross-domain technical issues.

Ways To Stand Out From The Crowd:
Experience with Kubernetes (K8s) in on-prem or hybrid environments.
Hands-on work with Slurm, HPC clusters, and large-scale compute environments.
Background in HPC, large-scale Linux clusters, or performance-sensitive engineering environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8585125
סגור
שירות זה פתוח ללקוחות VIP בלבד