דרושים » תוכנה » Senior Software Architect, AI Networking

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 1 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
we are seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture group, where we build cutting-edge systems powering the next generation of generative AI workloads. In this role, you will work across software and hardware domains to design and optimize inference infrastructure for large language models running on some of the most advanced GPU clusters in the world.
Youll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across our company and leave a mark on the way generative AI reaches real-world applications.
What Youll Be Doing:
Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.
Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.
Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.
Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.
Evaluate and integrate new software and hardware technologies relevant to model inference (e.g., memory hierarchy, network topology, modern inference architectures).
Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.
Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
Requirements:
Bachelors, Masters, or PhD in Computer Science, Electrical Engineering, or equivalent experience.
5+ years of experience building large-scale distributed systems or performance-critical software.
Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows.
Solid software engineering skills in C++ and/or Python, with strong familiarity with CUDA or similar platforms.
Strong system-level thinking across memory, networking, scheduling, and compute orchestration.
Excellent communication skills and ability to collaborate across diverse technical domains.
Ways to Stand Out from the Crowd:
Experience working on LLM inference pipelines, transformer model optimization, or model-parallel deployments.
Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.
Familiarity with data center-scale orchestration, cluster schedulers, or AI service deployment pipelines.
Passion for solving tough technical problems and shipping high-impact solutions.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465917
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
16/11/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture group, where we build cutting-edge systems powering the next generation of generative AI workloads. In this role, you will work across software and hardware domains to design and optimize inference infrastructure for large language models running on some of the most advanced GPU clusters in the world.

Youll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across us and leave a mark on the way generative AI reaches real-world applications.

What Youll Be Doing:

Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.

Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.

Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.

Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.

Evaluate and integrate new software and hardware technologies relevant to Core Spectrum-X technologies, such as load balancing, telemetry, congestion control, vertical application integration.

Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.

Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
Requirements:
What We Need to See:

Bachelors, Masters, or PhD in Computer Science, Electrical Engineering, or equivalent experience.

8+ years of experience building large-scale distributed systems or performance-critical software.

Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows and/or high performance networking.

Solid software engineering skills in C++ and/or Python, preferably demonstrate strong familiarity with CUDA or similar platforms.

Strong system-level thinking across memory, networking, scheduling, and compute orchestration.

Excellent communication skills and ability to collaborate across diverse technical domains.

Ways to Stand Out from the Crowd:

Experience working on LLM - training or inference pipelines, transformer model optimization, or model-parallel deployments.

Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.

AI Accelerators and distributed communication patterns, congestion control and/or load balancing.

Proven optimization process for complex systems, deployed at scale to make impact.

Passion for solving tough technical problems and finding high-impact solutions.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8415674
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 4 שעות
Location: More than one
Job Type: Full Time
our company has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Its a unique legacy of innovation thats fueled by great technologyand amazing people. Today, were tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing whats never been done before takes vision, innovation, and the worlds best talent. As a worker, youll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
we are seeking a highly skilled and modern software engineer to develop and prototype brand new advancements in distributed training and inference using our companys Spectrum-X AI fabric. This role offers a rare chance to pioneer AI and networking technology, contributing to ground-breaking projects that will define the landscape of large-scale AI systems. Improve AI app-networking connection by refining communication, crafting congestion control, coding NIC firmware, and expanding switch SDK features for enhanced AI factory efficiency. Your work impacts large AI system development, scaling, and speed.
What youll be doing:
Prototype end-to-end solutions to improve distributed training and disaggregated inference performance.
Analyze and optimize communication flows across application, transport, and network layers.
Develop system software spanning communication libraries, drivers, and firmware integrations.
Collaborate with hardware, firmware, and SDK teams to co-design network features.
Validate and integrate prototypes into our companys AI infrastructure and products.
Requirements:
BSc/MSc/PhD in Computer Science or Electrical Engineering
5+ years of relevant experience and/or knowledge
Deep understanding of networking and communication internals NCCL, RDMA/RoCE, congestion control.
Hands-on experience with HW/SW/FW integration and low-level programming (C/C++, kernel, drivers).
Some background in distributed training systems (such as PyTorch DDP, Megatron-LM, DeepSpeed).
Ways to stand out from the crowd:
Demonstrated innovation and leadership turning prototypes into impactful product features.
Experience with programmable data planes (P4, eBPF, DOCA SDK, or switch SDKs).
Familiarity with NIC firmware scheduling, in-network compute, or congestion management.
Contributions to open-source projects, academic papers, or performance benchmarking tools.
Strong background in AI factory architectures, distributed inference, or network telemetry.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465368
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 1 שעות
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
our company has been defining computer graphics, PC gaming, and accelerated computing for more than 25 years. With an outstanding legacy of innovation, driven by phenomenal technology, and extraordinary people, we are looking for a strong technical senior architect to join us in shaping the future. Senior Architects are innovators who can translate business needs into workable technology solutions. Their expertise is deep and broad. They are hands on, producing both detailed technical work and high-level architectural designs.
As a Senior architect in the Advanced Development team, you will explore technological challenges on accelerate networking and building AI data centers. Research new transport functions and semantics for optimizing AI workloads. You will also be leading architectural and development efforts across numerous technological fields, related to the modern data center, such as distributed AI and deep learning solutions, data analytics, High Performance Computing (HPC), Software Defined Networking (SDN), virtualization, storage, and more.
What youll be doing:
Enhance our company's GPU Networking offerings for accelerating AI workloads, such as our company Dynamo or our company NIXL.
Identify and evaluate new technologies, innovations and partner relationships for alignment with our technology roadmap and business value.
Lead architecture and design of such technologies.
Lead proof-of-concept development to evaluate and drive such technologies.
Requirements:
Hold a M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a leading university (or equivalent experience).
12+ years of industry experience (or equivalent) in systems architecture or related fields.
Experienced in virtualization, networking and storage.
Experienced in either Windows or Linux drivers, with a very good background of the other OS.
Deep understanding of performance profiling and optimization techniques, together with defining and using HW offloads.
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Shown research track record.
Have experience and passion for system architecture, CPU/GPU/memory/storage/networking.
Stellar communication skills.
Knowledge in Deep Learning frameworks and AI communication libraries (NCCL, UCX, MPI and equivalents).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465872
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 3 שעות
חברה חסויה
Location: More than one
Job Type: Full Time
We are seeking a highly motivated SoC Architect to join our team and define the next generation of our companys high-performance networking SoCs. Our Ethernet and NVL switch silicon powers the world's most advanced AI compute clusters - from hyperscale GPU systems used to train and inference massive foundation models, to the AI factories shaping the future of computing.
As an SoC Architect at our company, you will drive end-to-end SoC definition, connecting system-level requirements with chip-level implementation across multiple domains. You will work closely with cross-functional teams to craft scalable, power-efficient, and feature-rich SoCs that enable the next leap in networking and AI infrastructure.
What You'll Be Doing:
Lead SoC architecture across multiple teams and disciplines - including firmware, security, debug, power management, and peripheral/IP owners - ensuring holistic architectural alignment and system coherence.
Ensure next-generation architectures meet the requirements and constraints of all stakeholder teams, and drive clear specification and communication of those requirements.
Architect and analyze multi-chip solutions, including die-to-die connectivity, chip partitioning, package/board constraints, system requirements, chip fabric, PCIe subsystem and how SoC subsystems must support them.
Define top-level SoC structure: subsystem partitioning, interconnect, memory subsystem, coherency, clocking, power architecture, and system integration.
Define system flows: power up sequences, boot sequences, software update.
Own the SoC architecture specification and guide it throughout the entire product lifecycle - concept, modeling, implementation, and silicon bring-up.
Perform trade-off analyses across performance, area, power, and feature complexity to drive architectural decisions.
Collaborate deeply with chip architects, logic design, verification, physical design, firmware, and system software to ensure seamless integration of all SoC components.
Contribute to innovation and long-term architectural direction, including patent development.
Requirements:
BSc or MSc in Electrical Engineering, Computer Engineering, or related field
6+ years of experience in SoC or chip architecture, microarchitecture, or complex ASIC design
Strong understanding of SoC fundamentals - interconnects, memory systems, coherency, clock/power architecture, security, and HW/SW integration
Ability to work across hardware, firmware, and system software boundaries with strong system-level reasoning
Hands-on experience writing and owning architecture specifications
Proven ability to collaborate across many teams and drive alignment in complex technical environments
Ways to Stand Out from the Crowd:
Expertise in networking, switch silicon, high-speed IO, or data-path acceleration
Experience defining multi-chip or disaggregated architectures (e.g., chiplets, advanced packaging, die-to-die protocols)
Experience with fabric and memory subsystem.
Strong background in system modeling, performance analysis, or traffic simulation
Experience with security architecture, power management, or debug infrastructure.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465552
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 4 שעות
Location: More than one
Job Type: Full Time
our company has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Its a unique legacy of innovation thats fueled by great technologyand amazing people. Today, were tapping into the unlimited potential of AI to define the next era of computing.
we are looking for a passionate, modern software engineer or junior architect early in their career. The role involves developing and prototyping new scalable training and inference advancements using our companys Spectrum-X AI fabric.
This role offers a rare opportunity to work on innovative AI and networking technologies, building prototypes that influence the development of large-scale AI systems. You will help improve AI applicationnetwork interaction by refining communication, crafting congestion control, contributing to NIC and switch capabilities, and enhancing AI factory performance at scale.
What youll be doing:
Prototype end-to-end solutions to improve distributed training and disaggregated inference performance.
Analyze and optimize communication flows across application, transport, and network layers.
Develop system software spanning communication libraries, drivers, and firmware integrations.
Collaborate with hardware, firmware, and SDK teams to co-design network features.
Validate and integrate prototypes into our companys AI infrastructure and products.
Requirements:
Bachelor's or Master's Degree in Computer Science or Electrical Engineering
0-2 years of experience in relevant fields.
Programming knowledge in C/C++
Ability to work closely with architects and R&D teams.
Passion to learn and innovate independently.
Ways to stand out from the crowd:
Demonstrated innovation and leadership turning prototypes into impactful product features.
Understanding of Networking Protocols Ethernet, InfiniBand is an advantage.
Ability to quickly adapt to new technology and go deep into new areas.
Contributions to open-source projects, academic papers, or performance benchmarking tools.
Background in AI factory architectures, distributed inference, or network telemetry.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8465419
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking an experienced Solutions Data Engineer who possess both technical depth and strong interpersonal skills to partner with internal and external teams to develop scalable, flexible, and cutting-edge solutions. Solutions Engineers collaborate with operations and business development to help craft solutions to meet customer business problems.
A Solutions Engineer works to balance various aspects of the project, from safety to design. Additionally, a Solutions Engineer researches advanced technology regarding best practices in the field and seek to find cost-effective solutions.
Job Description:
Were looking for a Solutions Engineer with deep experience in Big Data technologies, real-time data pipelines, and scalable infrastructuresomeone whos been delivering critical systems under pressure, and knows what it takes to bring complex data architectures to life. This isnt just about checking boxes on tech stacksits about solving real-world data problems, collaborating with smart people, and building robust, future-proof solutions.
In this role, youll partner closely with engineering, product, and customers to design and deliver high-impact systems that move, transform, and serve data at scale. Youll help customers architect pipelines that are not only performant and cost-efficient but also easy to operate and evolve.
We want someone whos comfortable switching hats between low-level debugging, high-level architecture, and communicating clearly with stakeholders of all technical levels.
Key Responsibilities:
Build distributed data pipelines using technologies like Kafka, Spark (batch & streaming), Python, Trino, Airflow, and S3-compatible data lakesdesigned for scale, modularity, and seamless integration across real-time and batch workloads.
Design, deploy, and troubleshoot hybrid cloud/on-prem environments using Terraform, Docker, Kubernetes, and CI/CD automation tools.
Implement event-driven and serverless workflows with precise control over latency, throughput, and fault tolerance trade-offs.
Create technical guides, architecture docs, and demo pipelines to support onboarding, evangelize best practices, and accelerate adoption across engineering, product, and customer-facing teams.
Integrate data validation, observability tools, and governance directly into the pipeline lifecycle.
Own end-to-end platform lifecycle: ingestion → transformation → storage (Parquet/ORC on S3) → compute layer (Trino/Spark).
Benchmark and tune storage backends (S3/NFS/SMB) and compute layers for throughput, latency, and scalability using production datasets.
Work cross-functionally with R&D to push performance limits across interactive, streaming, and ML-ready analytics workloads.
Operate and debug object storebacked data lake infrastructure, enabling schema-on-read access, high-throughput ingestion, advanced searching strategies, and performance tuning for large-scale workloads.
Requirements:
24 years in software / solution or infrastructure engineering, with 24 years focused on building / maintaining large-scale data pipelines / storage & database solutions.
Proficiency in Trino, Spark (Structured Streaming & batch) and solid working knowledge of Apache Kafka.
Coding background in Python (must-have); familiarity with Bash and scripting tools is a plus.
Deep understanding of data storage architectures including SQL, NoSQL, and HDFS.
Solid grasp of DevOps practices, including containerization (Docker), orchestration (Kubernetes), and infrastructure provisioning (Terraform).
Experience with distributed systems, stream processing, and event-driven architecture.
Hands-on familiarity with benchmarking and performance profiling for storage systems, databases, and analytics engines.
Excellent communication skillsyoull be expected to explain your thinking clearly, guide customer conversations, and collaborate across engineering and product teams.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8442983
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
11/12/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
A leading large-scale ad network at the forefront of advertising technology. We are looking for a highly skilled and experienced Senior Software Engineer to join our backend team.
In this role, you will leverage your expertise in distributed systems, data engineering, and software development to design, build, and deploy high-performance solutions. You will take a leading role in developing and maintaining a cutting-edge online machine learning system powered by PyTorch/Tensorflow models and Triton inference server.
This is an opportunity to work on complex, large-scale systems that serve billions of requests, shaping the future of ad tech and ML-driven optimization.
What you'll be doing
Architect and Build: Design, develop, and deploy robust, scalable, and high-performance distributed systems that form the backbone of ironSource's next-generation ML ad network.
Real-Time Ad Serving: Engineer and optimize critical systems for real-time ad serving, enabling machine learning models to make intelligent, low-latency decisions for optimal ad selection.
ML Infrastructure Innovation: Drive the evolution of our ML capabilities by researching, evaluating, and implementing cutting-edge techniques for feature stores, data aggregation, and model serving infrastructure.
Data Pipeline Engineering: Collaborate closely with data scientists and product managers to design, build, and maintain efficient and reliable data lakes and data pipelines, ensuring high-quality data for ML training and analytics.
Operational Excellence: Take ownership of key system components, ensuring their reliability, performance, and scalability in a production environment through proactive monitoring and continuous improvement.
Requirements:
5+ years of backend development experience with strong skills and a genuine passion for server-side technologies.
Proven experience building and maintaining large-scale, low-latency, distributed systems.
Solid understanding of service lifecycle management and efficient resource utilization.
Hands-on experience with machine learning integration in production systems.
Proficiency in backend programming languages such as Java and Scala
Familiarity with cloud platforms (AWS, GCP, or Azure) and container orchestration (Docker, Kubernetes).
Strong problem-solving skills, ownership mindset, and ability to thrive in high-impact environments.
You might also have
Experience with ML frameworks such as TensorFlow, PyTorch.
Experience with inference servers such as Triton, TensorFlow Serving.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8454306
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
09/12/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Were seeking a strategic and hands-on Head of Engineering to lead and scale our technical organization.
In this role, you will turn our product vision into a strong, scalable engineering foundation, build efficient processes, and drive technical excellence. You should be passionate about technology, user value, and high-quality execution. Success means challenging assumptions, diving into details, and shaping both the product and the team that builds it.
Responsibilities:
Lead and scale of 20-30 engineering organizations (Frontend + Backend) through two Group Leads.
Own the technical strategy and roadmap, ensuring strong alignment with product goals and long-term company vision.
Design and oversee scalable, reliable system architecture across image processing, AI model serving, and data pipelines.
Remove bottlenecks to improve development velocity and overall engineering efficiency.
Drive hiring, mentoring, career development, and a culture of ownership and excellence.
Roll up your sleeves when needed and set the standard for hands-on technical leadership.
Enforce engineering best practices across code quality, CI/CD, testing, deployment, and observability.
Partner closely with Product, AI/ML, and Design teams to translate business needs into technical solutions.
Make key technology decisions and evaluate new tools, frameworks, and services.
Define and track engineering KPIs to ensure system performance, uptime, and team productivity.
Requirements:
At least 10 years in sw and a minimum of 5+ years in an engineering leadership role, including managing team leads in large groups (20+ )
Proven background as a Full-Stack Developer with a strong, hands-on command of both frontend and backend development.
Deep understanding and practical experience with modern system architecture (e.g., microservices, distributed systems).
Experience with cloud platforms (AWS, GCP, or Azure), containerization (Docker/Kubernetes), and CI/CD pipelines.
Strong product sense and the proven ability to translate product vision into a viable technology strategy.
Excellent communication skills, demonstrating the ability to work effectively across all functions.
Proven ability to successfully recruit, develop, and retain top engineering talent.
Bachelors or Masters degree from a leading university in Computer Science, Engineering, or a related field
Preferred Skills (Nice to Have):
Prior experience with Software as a Service (SaaS) platforms targeting Small to Medium Businesses (SMBs) or independent Professionals.
Background in productionizing Machine Learning or AI models, with specific experience in areas like computer vision or image processing.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8450466
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
11/11/2025
Location: More than one
Job Type: Full Time
We are seeking an AI Networking Exploration Architect for our Networking Insights Group to bridge the gap between cutting-edge, hyper-scale AI workloads and the datacenter infrastructure that enables them. You will join a small, focused team of multidisciplinary engineers driving AI workload optimization through deep application understanding and end-to-end systems thinking. Your insights will directly shape our products across the full stackfrom applications and software libraries to hardware architecture and physical design.

What You'll Be Doing:

Model the performance of complex AI workloads to identify bottlenecks and recommend system-level optimizations.

Translate state-of-the-art research into actionable infrastructure, software, and hardware features in partnership with architecture teams.

Rapidly master new AI domains (LLMs, generative models, multimodal systems) and distill key findings for product teams.

Incorporate your deep knowledge of AI applications into our hardware and software roadmaps.

Conduct independent research by formulating hypotheses about workload behavior and validating them through rigorous analysis.

Drive architectural innovation and network optimization by applying your domain expertise to exploratory analysis of real-world Deep Learning (DL) workloads.
Requirements:
What we need to see:

M.Sc. or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.

+5 years of experience.

Strong ML/Data Science background with hands-on experience in LLMs or generative AI.

A systems-level mindset with the ability to estimate end-to-end requirements across the entire AI stack.

Proven ability to translate research and product requirements into clear software/hardware specifications.

Exceptional research skills: you can digest academic papers, self-learn new domains, and independently test hypotheses.

Advanced Python programming skills for performance modeling and data analysis.

Excellent communication skills, with the ability to present complex findings with clarity and conviction.

A pragmatic approach: you are detail-oriented but can prioritize effectively to focus on the most critical issues.

Ways to Stand Out from the Crowd:

Deep understanding of datacenter infrastructure, network topologies, and protocols.

Expertise in distributed training methods and their impact on infrastructure.

Knowledge of AI performance metrics and the impact of different deployment strategies.

Experience extrapolating academic research into tangible hardware architecture requirements.

A track record of leading complex, multidisciplinary research projects that result in production impact.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8409343
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
As Senior Machine Learning Engineer, youll work with top notch engineers and data scientists from the team on bringing it to the next level and enabling optimal user experience. The work will focus on building, deploying and serving GenAI capabilities (Agents, Tools and the orchestration between them) using the most advanced technologies and models.
Key Job Responsibilities and Duties:
Deploying machine learning models: Design, develop and deploy in collaboration with scientists, scalable machine learning models and algorithms that provide content related insights and generative AI applications, ensuring scalability, efficiency, and accuracy.
Evaluating possible architecture solutions by taking into account cost, business requirements, emerging technologies, and technology requirements, like latency, throughput, and scale.
Generative AI Development: Contribute to the development of generative models such as GPT (Generative Pre-trained Transformer) variants or similar architectures for creative content generation, Q&A, translation or other innovative applications.
Deployment and integration: Work closely with software engineers to integrate machine learning models into production systems. Ensure seamless deployment and efficient model inference in real-time environments. Collaborate with DevOps to implement effective monitoring and maintenance strategies.
Owning a service end to end by actively monitoring application health and performance, setting and monitoring relevant metrics and acting accordingly when violated.
Maintain clean, scalable code, ensuring reproducibility and easy integration of models into production environments, including CI/CD.
Collaborate with multidisciplinary teams: Collaborate with product managers, data scientists, and analysts to understand business requirements and translate them into machine learning solutions.
Requirements:
Bachelors or masters degree in computer science, Engineering, Statistics, or a related field.
Minimum of 6 years of experience as a Machine Learning Engineer or a similar role, with a consistent record of successfully delivering ML solutions.
Strong programming skills in languages such as Python and Java.
Experience with cloud frameworks like AWS sagemaker for training, evaluation and serving models using TensorFlow, PyTorch, or scikit-learn.
Experience with LLMs, Agents and MCP in production environments.
Experience with big data processing frameworks such, Pyspark, Apache Flink, Snowflake or similar frameworks.
Experience with data at scale using MySQL, Pyspark, Snowflake and similar frameworks.
Demonstrable experience with MySQL, Cassandra, DynamoDB or similar relational/NoSQL database systems.
Deep understanding of machine learning algorithms, statistical models, and data structures.
Experience in deploying large-scale language models like GPT, BERT, or similar architectures - an advantage.
Proficiency in data manipulation, analysis, and visualization using tools like NumPy, pandas, and matplotlib - an advantage.
Experience with experimental design, A/B testing, and evaluation metrics for ML models - an advantage.
Experience of working on products that impact a large customer base - an advantage.
Excellent communication in English; written and spoken.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8430189
סגור
שירות זה פתוח ללקוחות VIP בלבד