AI\ML Architect

25/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are a world-leading sports data provider, trusted by sportsbooks worldwide to deliver real-time data with unmatched accuracy and reliability. With technology that drives smarter trading and deeper engagement, we empower bookmakers to grow, innovate, and stay ahead of the game.
If you're passionate about sports and technology and want to make your mark in a fast-moving industry, don't miss this chance! Step onto the field with us and help build the future of sports data. We are looking for a talented AI\ML Architect.

What You'll Do:
Architect, build, and scale production-grade data pipelines and services.
Develop and implement machine learning systems, overseeing the complete lifecycle from conceptualization to deployment.
Collaborate with internal teams and external partners to deliver innovative solutions that enhance market leadership.
Take ownership of significant projects, guiding them from inception through to successful deployment.
Requirements:
5+ years of industry experience, with a bachelor's degree or higher in Computer Science, Information Systems, or a related technical field, or equivalent experience.
Demonstrated experience with large language models (LLMs) and a strong background in Natural Language Processing (NLP) and Machine Learning models, including deployment, monitoring, and performance evaluation.
Familiarity with the software development life cycle and Agile methodologies.
3+ years of experience in delivering production-grade data pipelines and backend services.
Expertise in data pipeline construction, distributed architectures, SQL and NoSQL databases, and data lake/warehouse design and implementation.
Proven ability to thrive in a fast-paced, high-pressure environment, with impeccable attention to detail and a strong decision-making and problem-solving skill set.
Excellent communication skills, capable of effectively articulating technical concepts to both technical and non-technical stakeholders.
Bonus points if you have:
Knowledge of modern CI environments (e.g., Git, Docker, Kubernetes), ETL tools (e.g., AWS Glue, Apache Airflow), and messaging systems (e.g., Kafka, RabbitMQ).
This position is open to all candidates.
 
8318204
Similar jobs that may interest you
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Staff Engineer
The Campaign-Orchestration group is responsible for the reliable, scalable, and timely delivery of billions of personalized messages and campaigns across multiple channels (email, SMS, push, web, and more). As a Staff Engineer, you will be the technical lead and driving force behind the group's most complex initiatives. You will work closely with engineers, tech leads, architects, and product managers to solve high-scale distributed systems challenges, improve performance, and design robust, future-proof systems.
This role is ideal for experienced software architects and senior developers who are passionate about system architecture, performance at scale, and leading cross-team engineering efforts without formal management duties.
Key Responsibilities
Act as the technical authority for large-scale backend systems within the Execution group.
Gain a deep understanding of the Orchestration group's services, the campaign targeting flow, and how the product works as a whole, in order to make architectural decisions in the broader product context.
Champion the group's strategic adoption of AI and Vibe Coding practices, becoming a key enabler for increasing developer efficiency through the use of cutting-edge AI development tools.
Lead the design and implementation of distributed, high-throughput, low-latency services that support billions of message executions monthly.
Partner with Engineering Managers and Architects to shape the group's long-term technical vision and architecture roadmap.
Define and enforce engineering standards and best practices across services.
Conduct in-depth design and code reviews, mentoring other engineers and elevating technical excellence.
Proactively identify cross-cutting concerns and drive group-wide engineering initiatives (e.g., observability, resiliency, fault tolerance).
Analyze and improve system bottlenecks in data flow, message queuing, storage, and processing pipelines.
Take ownership of non-functional requirements such as reliability, scalability, maintainability, and security.
Collaborate with Product and Data Science teams to ensure engineering plans align with business priorities.
Requirements:
Technical Skills and Experience
10+ years of software engineering experience, with at least 3 years in senior or staff-level roles involving architectural decision-making.
Proven experience designing and building scalable, distributed systems and services in .NET/C# (preferred) or other modern languages (Java, Go, etc.).
Expertise in designing event-driven architectures using Kafka or equivalent messaging systems.
Deep understanding of data pipelines, message queues, batch and stream processing at scale.
Strong experience with cloud-native development, container orchestration, and infrastructure-as-code (e.g., GCP, Docker, Kubernetes, Terraform).
Experience with relational and NoSQL databases and an understanding of their tradeoffs.
Strong familiarity with performance monitoring, alerting, and observability tools.
Experience driving technical design documents, evaluating new technologies, and communicating decisions effectively to varied audiences.
Curiosity and hands-on experience with AI-powered development workflows, LLM tools, and productivity boosters is a strong plus.
Leadership & Impact
Recognized as a go-to expert and trusted advisor by engineers across the group.
Strong mentoring skills: willing and able to guide others through design challenges and deep technical problems.
Comfortable operating in ambiguity, proposing solutions, and reducing complexity.
Influences architecture, priorities, and processes beyond their immediate team.
Passionate about creating a culture of engineering excellence, ownership, and continuous improvement.
Leads cross-functional technical initiatives that span multiple teams and disciplines.
Preferred Qualifications
Experience in a high-growth SaaS company or one with high-throughput systems.
This position is open to all candidates.
 
8327906
01/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're seeking an innovative Software Architect to design and implement robust, scalable solutions that power our AI-driven platform. You'll play a crucial role in shaping the technical foundation of Apollo and its ecosystem.

**What You'll Do:**
- Lead the architectural design and implementation of infrastructure and tools for AI research and deployment within the Google Cloud Platform (GCP) ecosystem.
- Oversee the implementation and management of infrastructure stacks for various company applications.
- Design and implement automation processes to enhance operational efficiency across our systems.
- Manage integrations between company systems and third-party services.
- Administer and optimize third-party applications.
- Oversee the administration of both relational and non-relational databases.
- Monitor all infrastructure resources and applications to ensure 24/7 availability, optimal performance, and robust security.
- Collaborate with cross-functional teams to align architectural decisions with business goals and AI capabilities.
Requirements:
**Who We're Looking For:**
- A visionary architect with a passion for creating cutting-edge software solutions at scale.
- An experienced technologist with a deep understanding of AI-driven systems and cloud architecture.
- A strategic thinker who can balance innovation with practicality in system design.

**Requirements:**
- 3+ years of experience in a similar role, with a proven track record of architecting complex software systems.
- Strong knowledge of microservices-based architecture and cloud-native application design.
- Extensive experience in UNIX/Linux system administration and deep understanding of system-level concepts.
- 2+ years of experience with CI/CD processes and tools (e.g., Jenkins), focusing on automating software delivery pipelines.
- 1+ years of experience with container technologies (Docker, Kubernetes) for building and deploying scalable microservices.
- 1+ years of experience in NoSQL database administration, particularly MongoDB, and proficiency in database management.
- Familiarity with system monitoring tools like Datadog for ensuring infrastructure health and performance.
- Demonstrated experience working with AI products, including design, implementation, and deployment of AI technologies.
- Strong knowledge of cloud security practices and data security protocols.
- Proficiency in at least one programming language commonly used in system architecture (e.g., Python, Go, Java).
- Experience with GCP services and best practices for cloud architecture.
- Excellent communication skills to collaborate with both technical and non-technical stakeholders.
- Ability to mentor team members and contribute to the overall technical strategy of the company.
This position is open to all candidates.
 
8327755
25/08/2025
Location: Tel Aviv-Yafo and Netanya
Job Type: Full Time
We're looking for a hands-on AI Architect to join our Platform infrastructure (Gen AI applications team), where you'll have the unique opportunity to collaborate with our strategic technical leadership and contribute to the development of next-generation technologies for modern software release management, driving the industry forward.
As an AI Architect you will...
Design, develop, and implement various ML and AI-based solutions to address business needs and objectives
Collaborate with key organizational stakeholders to understand AI requirements and design end-to-end AI solutions
Explore and experiment with novel ML and AI techniques and architectures to drive innovation
Evaluate and recommend ML and AI tools and frameworks to enhance productivity and effectiveness
Ensure the scalability and reliability of LLM-based system architectures
Provide technical guidance and mentorship to development teams on AI and ML technologies and practices
Influence the AI priorities of the company. Your decisions will shape our future direction in AI, and have a significant impact on our AI initiatives.
Requirements:
A bachelor's degree or higher in Computer Science, Data Science, or a related field
Proven experience in developing LLM-based application architectures
Proficiency in ML along with relevant tools, processes, and frameworks, such as TensorFlow, PyTorch, Keras, natural language processing (NLP), and reinforcement learning from human feedback (RLHF)
Proficiency in LLM-related tools, processes, and frameworks, including OpenAI Models and APIs, Hugging Face Transformers, LangChain, vector databases, and prompt management tools like PromptPerfect/PromptBase and Guardrails
Experience with cloud platforms, such as AWS, Google Cloud, or Azure
Proficiency in Python programming
Experience deploying LLM-based applications in a production environment
Excellent problem-solving and analytical skills
Strong communication skills and the ability to collaborate effectively in a team.
This position is open to all candidates.
 
8318715
27/08/2025
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions that were once considered science fiction, from artificial intelligence to autonomous cars. Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable communications software.
DL and HPC applications have huge compute demands and already run at scales of up to tens of thousands of GPUs. GPUs are connected with high-speed interconnects (e.g. NVLink, PCIe) within a node and with high-speed networking (e.g. InfiniBand, Ethernet) across nodes. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the state-of-the-art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize our company's vision?
What you will be doing:
Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.
Design and implement new communication technologies to accelerate AI and HPC workloads.
Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.
Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations.
Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs).
Requirements:
M.S./Ph.D. degree in CS/CE or equivalent experience.
5+ years of relevant experience.
Excellent C/C++ programming and debugging skills.
Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
Deep understanding of operating systems, computer and system architecture.
Solid grasp of the fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.
Strong experience with Linux.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Expertise in related technology and passion for what you do. Experience with CUDA programming and our company's GPUs. Knowledge of high-performance networks like InfiniBand, RoCE, NVLink, etc.
Experience with Deep Learning Frameworks such as PyTorch, TensorFlow, etc. Knowledge of deep learning parallelisms and mapping to the communication subsystem. Experience with HPC applications.
Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.
This position is open to all candidates.
 
8321599
12/08/2025
Location: Tel Aviv-Yafo
Job Type: More than one
We are seeking a talented Software Architect. This is a unique opportunity to work with US clients and engage in global projects, leveraging your extensive skills in generative AI tools.
As a pivotal member of our team, you will be responsible for translating complex business problems into data-driven solutions, drafting methodologies, and executing innovative projects.
Key Responsibilities:
Advanced Backend Development: Lead the development of scalable, secure, and maintainable backend systems using Python and frameworks like FastAPI.
System Design & Architecture: Architect robust backend solutions and design efficient RESTful APIs with asynchronous programming support.
Core Technical Strengths: Utilize deep knowledge of Object-Oriented Programming (OOP), asynchronous design patterns, and software engineering best practices.
DevOps & Containerization: Implement Docker for containerizing applications and Kubernetes for orchestrating microservices-based deployments in scalable, production-grade environments. Manage CI/CD pipelines and deployment automation.
Database Expertise: Design schemas, optimize queries, and manage data consistency across distributed systems using PostgreSQL and CosmosDB.
Quality Engineering & Testing: Write comprehensive unit tests, enforce code quality through peer reviews, and integrate test automation in CI workflows.
Technical Mentorship & Team Collaboration: Mentor junior developers, drive code quality, scalable design, and agile delivery. Collaborate effectively across DevOps, frontend, and product teams.
Agile Development & Ownership: Participate in sprint planning, backlog management, and stakeholder communication. Take full ownership of modules, ensuring timely and reliable delivery in Agile/Scrum environments.
Exposure to Agentic Flows: Leverage experience with agentic flows to enhance system performance and reliability.
Requirements:
Extensive experience in backend development with Python and FastAPI.
Proven expertise in system design and architecture.
Strong understanding of OOP, asynchronous design patterns, and software engineering best practices.
Hands-on experience with Docker, Kubernetes, CI/CD pipelines, and deployment automation.
Proficiency in PostgreSQL and CosmosDB.
Experience in writing unit tests, conducting peer reviews, and integrating test automation.
Demonstrated ability to mentor junior developers and collaborate across teams.
Proactive in agile development and ownership of deliverables.
Familiarity with agentic flows is a plus.
This position is open to all candidates.
 
8300897
17/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're searching for a passionate and experienced hands-on AI Architect to join our team.
With a wide array of responsibilities and corresponding impact, you'll help spearhead our product offering and wear multiple hats at a fast-growing, dynamic, early-stage company.
Responsibilities:
Design, develop, and drive design processes and products that incorporate Generative AI (LLMs), models, and knowledge retrieval in order to help patients across the US.
Design and define data pipelines and evaluation pipelines.
Coach teams on best practices for building and working with datasets.
Build applicative AI in production on a large scale.
Build solutions for different mediums including voice products with minimal latency and high accuracy.
Build an internal tool system to allow scientific evaluation.
Implement and lead new AI strategies by designing and implementing complex functionality and/or integrating with 3rd parties.
Continuously improve our existing NLP capabilities to help patients across the US answer any question, schedule appointments with their doctor, refill their prescriptions, and much more.
Use best practices and SOTA from the AI industry but also push the envelope to new heights.
Requirements:
At least 3 years of experience as an AI Architect/Principal Engineer.
At least 2 years of experience with building ML or LLM-based applications in production.
At least 5 years of Python or equivalent programming experience.
Bachelor's or master's degree in computer science, data science, or an equivalent field.
Knowledge in MLOps - bonus.
Team player with a positive and fun attitude.
Strong interpersonal, leadership, and communication skills.
Capacity to work in a fast-changing environment.
Experience designing, building, and operating large systems with scalability, availability, testing, and performance requirements.
This position is open to all candidates.
 
8306373
01/09/2025
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
Our technology has no boundaries! We are building the world's most groundbreaking and state-of-the-art accelerated computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas. We pioneered a supercharged form of computing loved by the fastest-paced computer users in the world: scientists, designers, artists, and gamers.
We seek a highly motivated Senior AI Network System Architect to join our team of experts and help shape the foundational infrastructure for the AI revolution. Our next-generation networking systems are at the forefront of connecting and powering the world's most advanced AI clusters. As a key member of our architecture team, you will be responsible for a wide range of critical activities, from deep technical analysis and performance modeling to strategic architectural studies, ensuring our company continues to innovate and lead.
What You'll Be Doing:
Define, develop, and execute cutting-edge benchmarks and workloads to analyze system performance, identify bottlenecks, and drive optimizations across our hardware and software stack.
Drive the direction of our future products by performing deep-dive analysis of system architectures and solutions to assess their performance, efficiency, and value proposition.
Develop and validate sophisticated performance and network simulation models, correlating them with real-world hardware to predict and analyze the behavior of future systems.
Analyze and optimize the entire AI stack, from communication libraries (like NCCL) and system software to the underlying network fabric, developing Proof-of-Concepts (POCs) for new features and improvements.
Conceptualize next-generation networking architectures driven by emerging DL and AI technologies.
Collaborate with multi-functional teams, including other architecture teams, logic design, system software, firmware, and DL research teams, to ensure the successful execution of our vision.
Requirements:
M.Sc. or Ph.D. degree in Computer Science, Computer Engineering, or Electrical Engineering, or equivalent experience.
6+ years of relevant industry or research experience in high-performance computing, computer architecture, or computer networks.
Excellent understanding of large-scale system behavior and the effect of distributed computing workloads on network and system performance.
Proven experience in simulative performance analysis or benchmarking.
Exceptional analytical, problem-solving, and systems-thinking skills, with the ability to translate complex technical data into strategic architectural insights.
Hands-on programming skills in Python and/or AI frameworks for system analysis, automation, and modeling.
Ability to thrive in a fast-paced, dynamic environment and work concurrently with multiple groups across the organization.
Ways To Stand Out From The Crowd:
Expertise in the architecture and system-level requirements of large-scale, distributed DL workloads (e.g., LLMs, Generative AI for vision).
Deep understanding of communication libraries such as NCCL, UCX, or UCC.
Expertise in network protocols (Ethernet, InfiniBand, RoCE) and large-scale network topologies.
Experience with industry-standard AI benchmarks (e.g., MLPerf) and our company's frameworks (e.g., NeMo) on large-scale clusters.
This position is open to all candidates.
 
8327544
Location: Tel Aviv-Yafo
Job Type: Full Time and English Speakers
We are looking for a Senior Product Security Architect.
What your day will look like:
As a Senior Product Security Architect (Application Security), you will play a critical role in supporting our company's Secure Software Development Lifecycle (SSDLC).
You'll join an expanding group where your contributions can make a strong impact.
You'll have the chance to be part of the team that is in charge of the security of the whole Platform, from design to implementation and deployment.
Responsibilities:
Collaborate with key stakeholders to identify and prioritize essential security requirements; inspire new security initiatives; advocate and ensure the allocation of resources for security during R&D planning sessions.
Partner with technical R&D experts to implement robust security requirements and develop effective mitigation strategies for identified risks at all stages of development, from design to implementation and deployment.
Develop and maintain deep expertise in the security aspects of technologies utilized by Axonius and share your knowledge across teams through documentation, presentations, and other learning initiatives, elevating our collective security awareness.
Requirements:
Have 5+ years of experience in software development, with at least 3 years in dedicated security-related roles.
Know the field of Application Security and Web Application Security, including hands-on experience with threat modeling and performing security reviews of software architectures.
Know web application and cloud design patterns.
Strong in technical writing and able to produce clear, concise, and comprehensive security architecture documents, design specifications, and guidelines.
Have a grasp of core technical knowledge: operating systems, networking, cloud computing (preferably AWS), and cryptography.
Strong written and verbal communication skills in English and Hebrew, including experience collaborating with distributed teams and across multiple business functions.
Have general knowledge of relevant data protection regulations (e.g., GDPR).
Ready to work independently, take ownership of tasks, and drive them to completion.
Advantages:
Hands-on software development / DevOps / DevSecOps experience
Solid knowledge of EU and USA data protection regulations.
Professional certifications like Certified Information Systems Security Professional (CISSP), Offensive Security Certified Professional (OSCP), Cloud Architect or Cloud Security Professional.
Understanding of contemporary AI and GPT-like technologies applications for software development and their influence on product security.
Security Research and Leadership: Demonstrated security research activities (e.g., participation in bug bounties or credit for reporting CVEs), publications (e.g. blog posts or conference talks).
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
This position is open to all candidates.
 
8313407
26/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture group, where we build cutting-edge systems powering the next generation of generative AI workloads. In this role, you will work across software and hardware domains to design and optimize inference infrastructure for large language models running on some of the most advanced GPU clusters in the world.
You'll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across our company and leave a mark on the way generative AI reaches real-world applications.
What You'll Be Doing:
Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.
Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.
Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.
Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.
Evaluate and integrate new software and hardware technologies relevant to model inference (e.g., memory hierarchy, network topology, modern inference architectures).
Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.
Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
Requirements:
Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or equivalent experience.
5+ years of experience building large-scale distributed systems or performance-critical software.
Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows.
Solid software engineering skills in C++ and/or Python, with strong familiarity with CUDA or similar platforms.
Strong system-level thinking across memory, networking, scheduling, and compute orchestration.
Excellent communication skills and ability to collaborate across diverse technical domains.
Ways to Stand Out from the Crowd:
Experience working on LLM inference pipelines, transformer model optimization, or model-parallel deployments.
Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.
Familiarity with data center-scale orchestration, cluster schedulers, or AI service deployment pipelines.
Passion for solving tough technical problems and shipping high-impact solutions.
This position is open to all candidates.
 
8319687
25/08/2025
Location: More than one
Job Type: Full Time
Our company's SoC Architecture team is looking for a Senior Data Scientist with SW development skills and HW-System architecture experience. Do you want to be part of the Artificial Intelligence Revolution? Would you like to work with world-class systems architects and deep learning experts to define the next generation SoC? We are developing processor and system architectures that are at the forefront of accelerating machine learning, automotive and high-performance computing applications. We are building the most advanced SoCs in the world for these applications.
In this position, you will develop datasets and AI models, and train those models, for advanced system architecture power and performance features. You will have the chance to explore and define future aspects of our architectures that bring together our company's GPUs, custom processors, and accelerators into a single chip. As part of the development flow, you will be involved in building performance models, automating analysis workflows, and data generation and characterization for pre- and post-silicon. You will work with our HW & System architects, software, ASIC design, verification, physical design, VLSI and platform teams. Our SoC architects excel at pushing the state of the art, while making the best engineering trade-offs.
What you'll be doing:
Dataset generation, preparation, AI model training and algorithm tuning.
Perform performance, performance-per-watt, and power modelling and analysis to optimize our architecture. Develop and validate System C/C++ models for use as a reference in early SW development and verification.
Drive automation & tools for architectural exploration and analysis work.
Post-Silicon production support with silicon debug, analysis for performance and power optimization.
Collaborate closely with our SoC Architects to understand and initiate efficiency improvements.
Understand and analyze the interplay of hardware and software architectures on future algorithms and applications.
Requirements:
B.Sc., M.Sc. or Ph.D. in Computer Science, Electrical Engineering, or a related field.
5+ Years of relevant experience
Programming experience and proficiency in Python, System C/C++ development skills, and familiarity with the Linux development environment.
Strong interpersonal and organizational skills and the ability and desire to work as a great teammate.
Excellent analytical and verbal skills and the ability to work as part of a team.
Independence and drive to lead initiatives to enhance our workflows and architectural development process.
Ways to stand out from the crowd:
Data scientist experience
Background in Power and Performance model development
Experience in architecture workflow automation and tooling
Theoretical and practical knowledge in accelerated computing, machine learning, NLP, AIGC, LLM, AI4S, etc.
Experience in CUDA and deep learning frameworks (e.g., TensorFlow or PyTorch), and proficiency with Python and data analysis packages such as Pandas and NumPy.
This position is open to all candidates.
 
8318032