דרושים » AI » Team Leader AI Datacenter Tools & Automation

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly skilled and motivated Team Leader to build and lead a new team dedicated to developing orchestration tools and software solutions for AI datacenters.
The main goal of this team is to design and deliver customer-focused orchestration platforms that simplify the deployment, management, and monitoring of large-scale AI workloads.
This role combines technical leadership with hands-on development, covering the entire AI datacenter ecosystem including switches, hosts, smart NICs, GPUs, ROCm, and RCCL. The team will primarily develop in Python, complemented by modern full-stack technologies for user interfaces and control systems.
Key Responsibilities:
Lead and mentor a team of engineers building orchestration tools that manage complex AI datacenter infrastructures.
Define the teams vision, roadmap, and architecture for orchestration solutions that enhance customer experience and operational efficiency.
Design and implement distributed control and orchestration systems using Python and full-stack frameworks.
Collaborate with networking, compute, and AI acceleration teams to integrate orchestration capabilities across all datacenter components (switches, NICs, GPUs, and software stacks).
Work closely with product, QA, and DevOps teams to identify customer requirements and translate them into scalable, production-grade orchestration platforms.
Ensure software reliability, scalability, and maintainability through strong design principles, testing, and CI/CD practices.
Foster a culture of innovation, technical excellence, and cross-functional collaboration.
Requirements:
5+ years of software development experience, including 2+ years in a team leadership or technical lead role.
Strong proficiency in Python for backend, orchestration, and systems integration.
Proven experience in designing and implementing orchestration or control-plane systems for datacenter or cloud environments.
Deep understanding of datacenter infrastructure networking, compute, storage, or GPU acceleration.
Hands-on experience with containers, orchestration frameworks, and CI/CD pipelines (Kubernetes, Docker, etc.).
Excellent problem-solving, leadership, and communication skills.
Preferred Qualifications:
Experience with AI workloads and GPU software stacks (ROCm, RCCL, PyTorch, TensorFlow).
Familiarity with control-plane architectures, distributed systems, or cluster management frameworks.
Background in telemetry, resource scheduling, or performance optimization for large-scale systems.
Knowledge of microservices, REST/gRPC APIs, and cloud-native architectures.
Practical experience with full-stack development (React, Angular, Node.js, or similar).
Experience with testing frameworks (pytest, Robot Framework, etc.).
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8423027
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a talented and motivated Software Engineer to join our newly formed team developing orchestration tools and platforms for AI datacenters.
The main goal of this team is to create customer-focused orchestration solutions that simplify the deployment, management, and optimization of large-scale AI workloads across a full datacenter stack including switches, hosts, smart NICs, GPUs, ROCm, and RCCL.
You will work on the design and development of orchestration systems that bridge compute, networking, and AI acceleration domains, primarily using Python and modern full-stack technologies.
Key Responsibilities
* Design and develop software components for orchestration platforms managing AI datacenter infrastructure.
* Implement control and coordination mechanisms for compute, network, and AI acceleration resources.
* Develop backend services, APIs, and UI components using Python and modern full-stack frameworks.
* Collaborate with cross-functional teams including networking, GPU, and system software to integrate orchestration capabilities across multiple layers.
* Participate in architecture discussions, code reviews, and continuous integration processes.
* Contribute to testing, validation, and performance improvements of orchestration systems.
* Engage with product and customer teams to translate operational needs into effective software solutions.
Requirements:
Required Qualifications
3+ years of experience in software development, preferably in infrastructure, orchestration, or systems software.
Strong proficiency in Python, including experience with backend or orchestration frameworks.
Familiarity with datacenter or cloud infrastructure, including networking, compute, or storage systems.
Experience with containers and orchestration platforms (Docker, Kubernetes).
Solid understanding of software engineering principles, including design patterns, testing, and CI/CD.
Strong collaboration and communication skills, with the ability to work in a multidisciplinary environment.
Preferred Qualifications
Exposure to AI workloads and GPU ecosystems (ROCm, RCCL, PyTorch, TensorFlow).
Experience with distributed systems, control-plane software, or cluster management frameworks.
Familiarity with REST/gRPC APIs, microservices, and cloud-native architectures.
Background in monitoring, telemetry, or resource scheduling systems.
Practical experience in full-stack development (React, Angular, Node.js, or equivalent).
Experience with test automation frameworks (pytest, Robot Framework, etc.).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8423042
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Team Lead to join our Network Orchestration group. Our group is responsible for developing scalable, high-performance distributed systems that support complex network infrastructures.
As a Team Leader, you will drive the architecture, design, and development of critical components of the system. You will lead a team of frontend engineers, define best practices, and ensure the scalability and maintainability of the system
Our group is growing, and you will play a key role in building and expanding the team. You'll have the opportunity to optimize and enhance key parts of our system, leveraging the latest tools and infrastructure to drive innovation and efficiency.
Responsibilities
Lead & mentor a high-performing development team in building advanced, scalable software solutions.
Collaborate closely with development teams to provide seamless integration across applications.
Work closely with QA to implement best practices for testing and automation, ensuring high software quality and reliability.
Coordinate with Product Managers to align development with business priorities and roadmap planning.
Engage with UI/UX Designers to ensure an intuitive and high-quality user experience.
Define and enforce best practices, coding standards, and architectural guidelines for frontend development.
Stay updated with industry trends and emerging technologies to drive innovation in UI infrastructures.
Work with CI/CD pipelines to automate testing, deployment, and monitoring of UI libraries.
Requirements:
Technical Expertise: At least 6+ years of hands-on experience in frontend development. Strong knowledge of JavaScript/TypeScript and React framework, along with state management solutions (Redux, MobX, etc.).
Good knowledge of at least one backend language
Leadership & Team Management: Minimum 3+ years of experience leading a development team of engineers, including hiring, mentoring, and guiding technical decisions.
Testing: Experience with unit, UI component, and end-to-end testing using frameworks like Jest, React Testing Library, or Playwright.
Component-Driven Development: Hands-on experience with Storybook or other UI component documentation tools.
Project Delivery: Proven ability to lead projects to production, prioritize tasks effectively, and deliver high-quality results within set timelines.
Cross-Team Collaboration: Experience working closely with QA, Product Managers, Project Managers, and UI/UX Designers to drive the development and delivery of features.
Hands-On Mentality: A strong technical orientation with active involvement in code reviews, architecture discussions, and debugging.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8423033
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
As the Team Leader, you will manage and mentor a team of software engineers developing advanced networking components and services in Rust and Python. Youll combine hands-on technical leadership with people management, setting technical direction, reviewing code, driving design discussions, and ensuring the teams delivery aligns with our companys architectural and product vision. This is a highly technical leadership role: you will contribute to design and architecture while cultivating best practices in modern systems programming, asynchronous networking, observability, and performance optimization.
Key Responsibilities
Lead, mentor, and grow a team of software engineers specializing in Rust and Python for AI datacenter networking.
Define technical roadmap, design architecture, and own the teams deliverables.
Lead implementation of networking and distributed-systems components.
Collaborate closely with architects, product management, QA, and platform teams to deliver features supporting large-scale AI network deployments.
Perform technical design reviews and deep-dive debugging; ensure code quality, maintainability, and observability.
Encourage a culture of innovation, continuous learning, and technical excellence.
Manage planning, prioritization, and reporting; communicate progress and risks to leadership.
Recruit, onboard, and mentor engineers to expand the teams capabilities.
Requirements:
Required Qualifications
B.Sc. or higher in Computer Science, Computer Engineering, or equivalent experience.
7 + years of software development experience, with at least 2 years leading or mentoring engineers.
Strong proficiency in Rust including async programming, memory safety, concurrency, and performance profiling.
Deep understanding of networking fundamentals (L2/L3, routing, load balancing, network monitoring).
Experience with Linux networking, system-level APIs, or distributed systems.
Strong communication skills and a collaborative mindset.
Preferred Qualifications
Experience with SONiC, SAI, or open-source network operating systems.
Knowledge of DPDK, RDMA, P4, or smart NIC/DPU architectures.
Familiarity with AI cluster topologies, high-throughput interconnects, and performance tuning for large-scale workloads.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8423031
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Staff Engineer, you will be the technical lead and driving force behind the groups most complex initiatives. You will work closely with engineers, tech leads, architects, and product managers to solve high-scale distributed systems challenges, improve performance, and design robust, future-proof systems.
This role is ideal for experienced software architects and senior developers who are passionate about system architecture, performance at scale, and leading cross-team engineering efforts without formal management duties.

Key Responsibilities:
Act as the technical authority for large-scale backend systems within the Execution group.
Gain deep understanding of the Orchestration groups services, the campaign targeting flow, and how the product works as a whole, in order to make architectural decisions in the broader product context.
Champion the groups strategic adoption of AI and Vibe Coding practices, becoming a key enabler for increasing developer efficiency through the use of cutting-edge AI development tools.
Lead the design and implementation of distributed, high-throughput, low-latency services that support billions of message executions monthly.
Partner with Engineering Managers and Architects to shape the groups long-term technical vision and architecture roadmap.
Define and enforce engineering standards and best practices across services.
Conduct in-depth design and code reviews, mentoring other engineers and elevating technical excellence.
Proactively identify cross-cutting concerns and drive group-wide engineering initiatives (e.g., observability, resiliency, fault tolerance).
Analyze and improve system bottlenecks in data flow, message queuing, storage, and processing pipelines.
Take ownership of non-functional requirements such as reliability, scalability, maintainability, and security.
Collaborate with Product and Data Science teams to ensure engineering plans align with business priorities.
דרישות:
Technical Skills and Experience:
10+ years of software engineering experience, with at least 3 years in senior or staff-level roles involving architectural decision-making.
Proven experience designing and building scalable, distributed systems and services in .NET/C# (preferred) or other modern languages (Java, Go, etc.).
Expertise in designing event-driven architectures using Kafka or equivalent messaging systems.
Deep understanding of data pipelines, message queues, batch and stream processing at scale.
Strong experience with cloud-native development, container orchestration, and infrastructure-as-code (e.g., GCP, Docker, Kubernetes, Terraform).
Experience with relational and NoSQL databases and an understanding of their tradeoffs.
Strong familiarity with performance monitoring, alerting, and observability tools.
Experience driving technical design documents, evaluating new technologies, and communicating decisions effectively to varied audiences.
Curiosity and hands-on experience with AI-powered development workflows, LLM tools, and productivity boosters is a strong plus.
Leadership & Impact
Recognized as a go-to expert and trusted advisor by engineers across the group.
Strong mentoring skills-willing and able to guide others through design challenges and deep technical problems.
Comfortable operating in ambiguity, proposing solutions, and reducing complexity.
Influences architecture, priorities, and processes beyond their immediate team.
Passionate about creating a culture of engineering excellence, ownership, and continuous improvement.
Leads cross-functional technical initiatives that span multiple teams and disciplines.

Preferred Qualifications:
Experience in a high-growth SaaS company or one with high-throughput systems.
Background in campaign orchestration, marketing automation, or messaging systems.
Experience working with data engineering tools and pipelines (e.g., Airflow, BigQuery, dbt) is a plus.
Contributor to open-source or internal developer communities.#E המשרה מיועדת לנשים ולגברים כאחד.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8386300
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a proactive and skilled Software Infrastructure Developer to join our team, focusing on building robust and scalable infrastructure to support seamless automation and reliability across systems. This role is essential in ensuring efficient integration within our CI/CD pipelines, creating tools and frameworks for end-to-end automation, and working with engineering teams to continuously enhance system stability. The ideal candidate will bring strong technical expertise in automation, CI/CD (with hands-on knowledge of Jenkins), containerization, and cloud infrastructure, along with proficiency in Python and Node.js.

Key Responsibilities:

Software Infrastructure Design: Design, build, and maintain scalable infrastructure and frameworks to ensure consistency and reliability on various systems.
Tool and Library Development: Develop libraries and tools to simplify the creation and management of end-to-end automation, enabling efficiency and ease of use.
CI/CD Integration: Integrate and optimize CI/CD pipelines to streamline deployment and operational workflows.
Feature Ownership: Take ownership of new features and products from design to production, ensuring high-quality, reliable releases.
Cross-Team Collaboration: Collaborate with cross-functional engineering teams to gather insights and requirements, driving continuous improvement of automation tools and frameworks.
Documentation and Knowledge Sharing: Document best practices and usage guidelines for libraries and tools to promote effective adoption and knowledge sharing across teams.
Technology Evaluation: Continuously research, evaluate, and implement new technologies to maximize development efficiency and innovation.
Requirements:
Experience: Minimum of 3 years hands-on experience with Python / TypeScript
Programming Skills: Strong understanding of software development principles, including writing clean, maintainable, and scalable code.
CI/CD Expertise: Proven experience designing, implementing, and managing CI/CD pipelines, particularly in cloud environments (e.g., AWS, Azure).
Containerization & Orchestration: Hands-on experience with Docker and Kubernetes.
Version Control: Strong knowledge of Git and best practices for collaboration, branching, and code management.
Problem-Solving: Excellent analytical and troubleshooting skills, with the ability to resolve complex infrastructure and system issues.
Web Technologies: Familiarity with web protocols and technologies, including HTTP, JSON, HTML, and JavaScript.
Adaptability: Ability to work effectively in a fast-paced, multi-tasking environment and rapidly adopt new tools and technologies.
Big Advantage: Experience with AI/GenAI technologies using Python libraries (e.g., LangChain) and familiarity with orchestration tools such as LangGraph, n8n, or similar frameworks.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8382001
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
2 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
We have improved AI infrastructure by merging GPU virtualization with Kubernetes-native tech to power innovative AI factories. We aim to speed up enterprise AI projects with smart orchestration, and scalability for AI workloads. Seeking a skilled Senior Software Engineer for our Infrastructure Group to innovate AI technology. The Infrastructure Group is tasked with composing and evolving the core systems responsible for thousands of GPUs and nodes driving enterprise AI. We invent the foundation that facilitates elastic, secure, and observable AI operations at extensive scale. We are seeking engineers who are passionate about distributed systems, modern cloud-native infrastructure, and AI performance optimization.

What youll be doing:

Crafting and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.

Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.

Collaborating closely with backend engineers, product managers, and other partners to deliver impactful solutions.

Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.

Conducting code and build reviews to uphold high-quality standards and mentor team members.

Leading the development and implementation of advanced identity management systems that secure our innovative AI and GPU cloud.

Developing scalable multi-tenant solutions that allow our diverse clientele to harness the power of our platforms securely and efficiently.

Collaborating with multi-functional teams to integrate identity and access management features seamlessly into our products, from cloud services to edge computing devices.
Requirements:
What we need to see:

B.Sc. in Computer Science or a related field (or equivalent experience).

5+ years of experience

Experience in backend software development, including system design and architecture.

Proficiency in at least one backend programming language (Go preferred).

Strong knowledge in microservices architecture, RESTful APIs, and relational databases.

Proficient knowledge of security guidelines and experience applying them in large-scale systems.

Expertise in implementing OAuth, OIDC, SAML, and other modern authentication protocols - Advantage

Ways to stand out from the crowd:

Expertise in Kubernetes internals and advanced cloud-native technologies.

Experience working in Linux environments with knowledge of networking, security, and virtualization.

Contributions to open-source projects or active participation in tech communities.

Agile approach and familiarity with standard methodologies.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8418975
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
11/11/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
The AI Transformation Architect plays a central role in driving our companys enterprise-wide AI transformation. Reporting to the VP of AI Transformation, this role defines and builds the technical and architectural foundations that power our AI-driven software organization.
The Architect leads the design and implementation of scalable Generative AI platforms and central services for AI, creating shared capabilities, tools, and frameworks that transform how our company builds, delivers, and operates products.
Beyond architecture, this role drives strategic, high-impact projects with powerful ROI that positively disrupt operations and elevate customer outcomes, improving efficiency, innovation, and product value across the company.
Strategic Objectives
Accelerate AI Transformation across the SDLC by designing and operationalizing central AI services and a unified platform framework that powers key transformation initiatives, including Agentic Workflows.
Institutionalize AI Engineering and Governance Standards through common architecture, services, and enablement frameworks ensuring scalability, security, quality, and compliance.
Deliver measurable transformation outcomes through strategic, ROI-driven projects that reshape how our companys products are built and operated, directly benefiting customers and business performance.
Drive sustainable transformation at scale, balancing rapid innovation with operational excellence and long-term maintainability.
Key Responsibilities
Architect the AI Transformation Framework
Design and own the end-to-end architecture that enables the companys AI Transformation vision - spanning data, models, pipelines, APIs, and governance layers.
Define blueprints and integration standards to embed AI and automation throughout the SDLC, from ideation to delivery.
Build Generative AI Platforms and Central Services
Lead the design and implementation of the AI Enablement Platform and central AI services powering transformation initiatives such as Agentic Workflows and the Knowledge Hub.
Create scalable, secure, and modular AI components that can be leveraged across product lines and engineering domains.
Partner with infrastructure, platform, and security teams to operationalize LLM-based services, retrieval systems, and agentic automation flows.
Drive Strategic Transformation Projects
Lead cross-functional initiatives that deliver tangible operational and business impact - improving speed, quality, and cost-efficiency across the product organization.
Develop strong business cases and ROI frameworks to prioritize and communicate transformation value at the executive level.
Translate successful outcomes into reusable playbooks and AI cookbooks that scale across teams, roles, and product domains.
Technical Leadership and Enablement
Act as the senior technical authority and coach for AI transformation initiatives, guiding teams through architecture reviews, proofs of concept, and scale-up phases.
Mentor internal architects and tech leads to develop AI-native architectural competencies.
Requirements:
10+ years in software or platform architecture, including experience leading cross-organizational transformations.
Proven track record of designing large-scale, data-driven, or AI-enabled systems.
Experience delivering projects with measurable business outcomes and ROI.
Expertise
Enterprise and cloud-native architecture (multi-tenant, microservices, API-first, distributed systems).
Deep understanding of AI/ML infrastructure, LLM integration patterns, agent frameworks, and MLOps.
Familiarity with data engineering, retrieval-augmented generation (RAG), and workflow orchestration (e.g., n8n, LangChain).
Background in cyber security or secure software design - strong advantage.
Mindset & Skills
Transformation-oriented thinker who connects technology to organizational capability and business strategy.
Excellent communicator and influencer at multiple levels (executive to technical).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8409655
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
09/11/2025
Location: More than one
Job Type: Full Time
We are seeking a Technical Marketing Engineer to join our Ethernet Networking team to keep improving our performance leadership in AI. In this pivotal role, you will be the hands-on expert for our Spectrum-X Ethernet platform, showcasing its superiority for emerging AI use cases. You will develop and implement rigorous benchmarks on various GPU clusters, analyzing everything from LLM training to groundbreaking inference workloads. Your primary mission is to translate these performance results into compelling technical content, including white papers, blogs, and presentations, that clearly articulates why our Spectrum-X Ethernet solutions are the definitive choice for modern AI infrastructure.

What you'll be doing:

Design and execute performance benchmarks using industry-standard tools (e.g., MLPerf, UCX, our Collective Communications Library - NCCL and CloudAI) and customer-representative AI workloads on our state-of-the-art GPU clusters.

Translate your benchmark data and technical insights into compelling, high-impact marketing assets and performance-driven sales enablement materials.

Collaborate closely with Product Management, ASIC and Software architecture and Sales teams, provide feedback on product features, and ensure our performance results are technically accurate and impactful.
Requirements:
What we need to see:

B.Sc in Computer Science or Software Engineering or equivalent experience

5+ years of experience benchmarking and analyzing high‑performance networking solutions, including RDMA, MPI, and large‑scale collective communication frameworks.

Hands‑on expertise in testing and benchmarking deep learning workloads on our GPUs with CUDA, TensorFlow, and PyTorch, focused on validating and demonstrating distributed training and inference performance over NCCL, RoCE, and RDMA.

Shown proficiency in Performance Analysis methodologies and techniques.

Understanding of Ethernet and high-performance networking.

Programming experience with Python, Bash and C languages.

Experience with distributed job orchestration (Slurm, Kubernetes).

Experience with Linux OS distros.

Fast and self-learning capabilities with strong analytical and problem-solving skills.

In-depth knowledge and experience with AI workloads and benchmarking for large-scale distributed training/inference systems.

Ways to stand out from the crowd:

Strong Performance Analysis skills and methodologies using modern tools.

Deep knowledge in AI/Data Center Ethernet networks protocols and best-practices (Clos fabrics, BGP, VXLAN, etc.).

Hands-on experience with automation, CI/CD pipelines and DevOps practices.

Expertise in AI fabrics telemetry including metrics capturing and analysis as well as telemetry tools such as Prometheus and Grafana.

In-depth System knowledge and understanding (Intel / AMD / ARM CPUs, NVIDIA GPUs, NIC, Memory, PCI).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8406080
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
our company's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to our companys needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
The Waze Engineering Productivity (EngProd) team is the multiplier for all of Waze engineering.
In this role, you will design, build, and own the foundational systems that empower developers to ship with speed, quality, and confidence. You will be maintaining systems and leading the charge on Waze's technical initiatives.
Waze is where people and technology meet to solve transportation challenges. It's a platform that empowers users to contribute road data and edit Waze maps to improve the way we move about the world. As the social navigation pioneer, Waze leverages mobile technology and a passionate global community to redefine expectations of todays maps.
Responsibilities
Develop and maintain back-end services and libraries, in the Java ecosystem, that form the support of the development environment.
Take ownership of components within technical initiatives, such as the company-wide migration to our company3 or the rollout of new Artificial Intelligence (AI)-powered developer tools.
Deploy and manage mission-critical services on our company Cloud Platform (GCP) using technologies like Kubernetes and Docker, ensuring high availability and performance.
Collaborate with engineering teams across Waze to understand their issues, gather requirements, and deliver solutions that make their workflows efficient.
Work with Java, Python and our company's internal tooling to select the right technology for the job.
Requirements:
Minimum qualifications:
Bachelors degree or equivalent practical experience.
2 years of experience with software development or 1 year of experience with an advanced degree in an industry setting.
2 years of experience with developing large-scale infrastructure, distributed systems or networks, or with compute technologies, storage or hardware architecture.
2 years of experience in software development with software design, architecture, and shipping production-grade systems.
Preferred qualifications:
Experience in Java and building production-grade back-end services and distributed systems.
Experience with public cloud platforms and container orchestration technologies like Kubernetes.
Experience with developer productivity, tooling, and building infrastructure.
Experience with build systems (e.g., Gradle, Bazel), Continuous Integration/Continuous Deployment (CI/CD) pipelines, and other developer-focused tooling.
Experience with leading technical projects from design to completion, with excellent architectural and system design skills.
Excellent problem-solving, communication, and collaboration skills, with the ability to work across multiple engineering teams.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8412759
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 8 שעות
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced Engineering Team Leader to drive and oversee the development of our infrastructure platform.
You'll lead a team focused on building Python-based microservices, infrastructure automation, and cloud-native solutions.
This role combines technical leadership with hands-on development, emphasizing strong Linux expertise and distributed systems knowledge.
The Infrastructure group focuses on developing scalable microservices architectures and automation systems that power our platform. As the team leader, you'll guide the development of Python-based services while ensuring robust infrastructure practices.
Requirements:
BSc in Computer Science or related field, or equivalent practical experience
6+ years of software engineering experience, including 2+ years in a technical leadership role
Strong expertise in Python microservices development
Advanced Linux system administration and development skills
Extensive experience with Kubernetes and container orchestration
Proven experience with infrastructure automation and cloud platforms
Strong understanding of Azure services and cloud architecture
Experience with modern DevOps practices and CI/CD implementation
Ability to mentor engineers and drive technical decisions
Deep understanding of Linux internals and system optimization
Advantages
Experience with workflow engines and their implementation
Background in infrastructure as code (IaC) and automation
Experience with high-scale distributed systems
Track record of successful cross-team collaboration
Experience with Agile methodologies and team management
Strong security background in Linux environments
Experience with service mesh and microservices patterns
Background in performance optimization and troubleshooting
This leadership role will be instrumental in:
Leading the technical vision and architecture of our Python microservices platform
Managing and mentoring a team of engineers focused on infrastructure development
Ensuring robust and scalable Kubernetes-based solutions
Driving technical decisions and ensuring best practices in development
Building highly available and performant distributed systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8423038
סגור
שירות זה פתוח ללקוחות VIP בלבד