דרושים » הנדסה » Senior Performance Analysis Engineer

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/06/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines to learn, perceive, reason and solve problems. Today, visual computing is a crucial tool in helping people get along with technology, and we have extended its technology into datacenters, mobile devices and cars. There has never been a more exciting time to join our team - if this role sounds like a fit for you, we'd love to hear from you
we are seeking a Senior Performance Analysis Engineer to join our Performance group. In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters for distributed Deep Learning LLM training focused on collectives communication and networking. You will interact with many types of hardware and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies to dive deeply into the details and understand performance expectations, limitations, and bottlenecks.
What you'll be doing:
Exploring and researching AI workloads and DL models specifically tailored for large-scale deep learning LLM training on our supercomputers and distributed systems focusing on high-performance networking and Nvidia Collective Communications Library (NCCL).
Benchmarking, Profiling, and Analyzing the performance to find bottlenecks and identify areas of improvement and optimizations, with a strong emphasis on networking aspects.
Implementing performance analysis tools.
Collaborating with many teams from hardware to software to provide performance analysis insights.
Defining performance test planning, setting performance expectations for new technologies and solutions, and working to reach the performance targets limits.
Requirements:
B.Sc. in Computer Science or Software Engineering or equivalent experience
5+ years of experience with high-performance Networking (RDMA, MPI, NCCL, Congestion Control Algorithms)
Demonstrated Performance Analysis skills and methodologies.
Experience with NVIDIA GPUs, CUDA library, deep learning frameworks like TensorFlow or PyTorch, combined with expertise in networking collective communication libraries (such as NCCL) and protocols (such as RoCE and RDMA).
Fast and self-learning capabilities with strong analytical and problem-solving skills.
Programming Languages: Python, Bash and C languages
Experience with Linux OS distros.
Great teammate with good communication and interpersonal skills
Ways to stand out from the crowd:
In-depth knowledge and experience with AI workloads and benchmarking for distributed LLM training.
Knowledge in CUDA, and NCCL libraries.
Knowledge in Congestion Control algorithms.
In-depth System knowledge and understanding (Intel / AMD / ARM CPUs, NVIDIA GPUs, HCA, Memory, PCI).
Strong Performance Analysis skills and methodologies using modern tools.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8224849
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (Infiniband, RoCE, Ethernet). This is an outstanding opportunity to get an end to end understanding of the AI networking stack. Are you ready for to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Engage with our partners and customers to root cause functional and performance issues reported with NCCL.

Conduct performance characterization and analysis of NCCL and DL applications on groundbreaking GPU clusters.

Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.).

Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters.

Document and conduct trainings/webinars for NCCL.

Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.
Requirements:
What we need to see:

B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.

Experience working with engineering or academic research community supporting HPC or AI.

Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control.

Expert in Linux fundamentals and a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible).

Adaptability and passion to learn new areas and tools.

Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp for large clusters. Experience debugging network configuration issues in large scale deployments.

Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning Frameworks such PyTorch, TensorFlow.

Deep understanding of technology and passionate about what you do.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203558
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
חברה חסויה
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC applications of today have a huge compute demand and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (eg. NVLink, PCIe) within a node and with high-speed networking (eg. Infiniband, Ethernet) across the nodes. Communication performance between the GPUs has a direct impact on the end-to-end application performance; and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are you ready for to contribute to the development of innovative technologies and help realize our vision?

What you will be doing:

Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.

Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack.

Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available.

Triage and root-cause performance issues reported by our customers.

Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information.

Collaborate with a very dynamic team across multiple time zones.
Requirements:
What we need to see:

M.S. (or equivalent experience) or PHD in Computer Science, or related field with relevant performance engineering and HPC experience.

3+ yrs of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).

Experience conducting performance benchmarking and triage on large scale HPC clusters.

Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals).

Implement micro-benchmarks in C/C++, read and modify the code base when required.

Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python.

Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker).

Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

Practical experience with Infiniband/Ethernet networks in areas like RDMA, topologies, congestion control.

Experience debugging network issues in large scale deployments.

Familiarity with CUDA programming and/or GPUs.

Experience with Deep Learning Frameworks such PyTorch, TensorFlow.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203543
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Ra'anana
Job Type: Full Time
We are seeking a talented and driven Senior Software Verification Engineer to join our innovative team and tackle SW verification challenges in the domains of high-speed networking, virtualization, and security. You will play a key role in validating and testing complex software products that support Ethernet and InfiniBand protocols, delivering advanced networking, storage, and security services for cloud, compute, and AI workloads.

What Youll Be Doing:

Develop and Automate Testing: Design, implement, and maintain automated test scripts and frameworks (primarily in Python) to verify the correct functionality of our software products.

End-to-End Feature Ownership: Deep dive into feature sets, taking responsibility from test planning through to final implementation and full automation.

System & Integration Validation: Validate software functionality and performance through system-level and integration testing, utilizing Linux-based environments and virtualization tools.

Test Environment Management: Set up, maintain, and optimize test environments using Linux, Docker, virtual machines, and other modern tools.

Collaboration & Communication: Work closely with software, DevOps, architecture, and product teams to define test requirements, coordinate releases, and ensure high-quality product delivery.

Continuous Improvement: Drive design verification flows, contribute to methodology improvements, and leverage planning/tracking systems to manage release progress and build release indicators.

Defect Analysis: Analyze test results, file defects, and track issues to closure, ensuring robust and scalable solutions.
Requirements:
What We Need to See:

Bachelors/masters degree in computer science or computer engineering, or equivalent experience

5+ years of experience in software testing, QA automation, or software engineering.

Strong proficiency in Python and scripting for automation.

Solid experience with Linux-based environments, including system tools and command-line utilities.

Proven understanding of computer networking and modern Linux operating systems.

Familiarity with software testing, integration, and system validation practices.

Excellent problem-solving, critical thinking, and communication skills.

Ability to work independently, manage multiple tasks, and drive technical initiatives.

Great interpersonal skills, agility, and determination for success.

Fluent English; strong presentation and public speaking abilities.

Ways to Stand Out from the Crowd:

Deep technical know-how and familiarity with networking protocols or low-level system tools.

Experience with Docker, KVM, or other virtualization technologies.

Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI) and test reporting tools (e.g., Allure, Grafana, Kibana).

Experience with large HW+SW systems and advanced Linux OS technologies.

Proficiency with GIT, Bash, and other scripting languages.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203394
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
04/06/2025
Location: Tel Aviv-Yafo and Yokne`am
Job Type: Full Time
We are seeking a Senior Networking Software Engineer to join our RDMA Transport Software team, driving the development of next-generation RDMA solutions for AI, cloud, HPC, and storage. You will research and develop innovative transport algorithms that push the limits of performance and scalability. You will work in a fast-paced, collaborative environment alongside talented engineers from around the world, supporting the data needs of the worlds largest enterprises

What you'll be doing:

Take part in research, design, and development of advanced RDMA transport mechanisms and algorithms, enhancing performance, reliability, and scalability.

Collaborate closely with hardware engineers, software developers, and system architects to align on project objectives and requirements.

Keep up with industry trends and emerging technologies, integrating new ideas and innovations into the development process.
Requirements:
What we need to see:

Bachelor's or Master's degree in Electrical Engineering or Computer Science fields from a known institute.

5+ years of development experience.

Knowledge with RoCE and/or InfiniBand, along with a background in RDMA development across software, firmware, or hardware.

Strong problem-solving skills with a hands-on approach, able to dive deep into the RDMA stack and solve complex issues.

Proficiency in C/C++ and embedded systems programming.

Fast learner possessing the ability to learn complex concepts in a fast-paced environment.

A can-do attitude and high energy with excellent collaboration, and social skills.

Ways to stand out from the crowd:

Background with data centers networking & storage workloads (advantage).

Familiar with RDMA, InfiniBand, or Ethernet technologies.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8203482
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
19/06/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
A leader in computer graphics, PC gaming, accelerated computing, and AI. the Networking Business Unit is building chips, systems, and software that power the most advanced data center and hyper-converged networks of today and tomorrow. The data processing unit (DPU) and series of SmartNICs ignite outstanding innovation for modern data centers by offloading, accelerating, and isolating a broad range of sophisticated networking, storage, and security services.

We are looking for a highly motivated software engineer with experience in data forwarding technologies, Linux kernel and container networking, network functions virtualization (NFV), and related areas to join our team and work on innovative offload solutions. You will develop software for various networking offload and virtualization use cases as well as data forwarding on Ethernet switching platforms. You will use the latest software development tools and techniques and gain extensive knowledge of modern data center architectures and workload acceleration.

What youll be doing:
Design, develop, test, and maintain new functionality and improvements to existing functionality related to offloading various networking services.
Design, develop, test, and maintain system software components related to networking.
Work on data forwarding functionality on Ethernet switching platforms.
Lead and guide cross-functional teams on large feature development activities.
Collaborate with team members, architects, QA, and Support teams on feature definition, development, release, and bug fixing.
Requirements:
BS or MS degree in Computer Engineering, Computer Science, or a related field (or equivalent experience).
A minimum of 5+ years of software development experience in areas such as data forwarding, NFV, SDN, kernel and container networking, SmartNICs, and offload solutions.
Strong and validated experience in C programming.
Strong technical abilities, problem-solving, design, coding, and debugging skills.
Lead feature development, take full ownership of tasks from A-Z, and deliver independently with minimal supervision.
Ability to quickly understand new requirements and technologies and swiftly prototype and implement solutions.
Ways to stand out from the crowd:
Experience in virtualized networking and SRIOV, and packet processing using Openvswitch.
Background in Linux kernel networking internals.
Knowledge of routing and control plane technologies such as EVPN, Segment Routing, PIC, etc.
Participation in the open-source community.
Python and C++ programming skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8224046
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
19/06/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Research Engineer.
As a Research Engineer, you will design, develop, and implement algorithms and models that will enhance the products capabilities, performance, reliability, and scalability. Together with our algo and engineering team members, you will work closely with our product team members to implement algorithms that can be used to drive our business forward as well as the frontier of software development AI technologies.
Responsibilities:
Design, develop, implement and test algorithms and AI-empowered solutions for various product features and applications
Collaborate with cross-functional teams to understand business needs and translate them into algorithmic solutions
Perform research on emerging trends, algorithms, and cutting-edge technologies, and identify ways to incorporate them into our products pragmatically
Propose new and innovative solutions to complex problems and lead the development of algorithms in those areas
Design, cleanse, and utilize benchmarks throughout the process of the algorithm development
Requirements:
Bachelors or Masters degree in Computer Science, Mathematics, or related field
5+ of experience in programming languages such as Python and experience with putting machine learning code into production
Experience in training LLMs, such as Llama, DeepSeek or similar
Experience in designing benchmarks and evaluating LLM applications
Passion for exploring new technologies and techniques to enhance improve algorithm performance and product features
Strong communication, collaboration, and problem- solving skills
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8223172
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
03/06/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
the brand grabs attention like nothing else in cybersecurity. And were growing like crazy, with $70M in Series C funding, 200% employee growth, and 300% revenue growth in 2024. Fueling growth are our game changing agentic AI security solutions, backed by a team and culture that makes one of Forbes Best Startup Employers in America, and a Business Insider startup to bet your career on.

Life at is all gas, no brakes. Were a team of relentless, collaborative go-getters pushing the boundaries of whats possible for security automation. Every role is an essential driver of success as the AI-native autonomous SecOps platform of choice for security teams across the Fortune 500. Excited about our vision and ready to make an impact as we grow? Wed love to see what you can bring to the team.

We are looking for an experienced LLM Engineer to join our fast-growing company during an exciting stage of innovation. In this role, you'll design and develop Agent and LLM-based features, craft and optimize high-quality prompts, and build datasets and benchmarks to evaluate and monitor applications effectively. If you're passionate about staying at the forefront of Large Language Model advancements and collaborating with a team of experts, wed love to hear from you!

What Will You Do?
Take part in all development aspects, from design to production, of Agent and LLM-based features and applications.
Craft and optimize high-quality prompts to enhance application functionality and output quality.
Develop datasets and benchmarks to evaluate and monitor LLM-based applications effectively.
Stay up to date with state-of-the-art research in the field of Large Language Models and apply relevant advancements to improve our applications.
Collaborate closely with other team members and contribute to a culture of innovation and continuous improvement.
Mentor new team members and share your expertise to strengthen the team's collective knowledge.
Requirements:
Significant real-world experience developing software for production systems, with at least 5 years of experience
Proficiency in LLM application concepts and techniques such as Retrieval-Augmented Generation (RAG), embeddings, Chain-of-Thought (CoT) prompting, and more.
Hands-on experience with developing datasets, evaluation benchmarks, and performance monitoring systems for LLM applications.
A solid understanding of state-of-the-art advancements in the field of Large Language Models and natural language processing.
Ability to take initiative, scope work iteratively, and make meaningful design decisions to drive projects to completion.
Preferred Qualifications
A strong focus on data and data quality, with the ability to optimize data flows for developing cutting-edge LLM-based applications.
Familiarity with LangGraph or other LLM-based frameworks
The ideal candidate possesses a data science mindset combined with an engineer's agility and focus on swiftly bringing solutions into production.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8202235
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
11/06/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
At UVeye, we're on a mission to redefine vehicle safety and reliability on a global scale. Founded in 2016, we have pioneered the world's first fully automated suite of vehicle inspection systems. At the heart of this innovation lies our advanced AI-driven technology, representing the pinnacle of machine learning, GenAI, and computer vision within the automotive sector. With close to $400M in funding and strategic partnerships with industry giants such as Amazon, General Motors, Volvo, and CarMax, UVeye stands at the forefront of automotive technological advancement. Our growing global team of over 200 employees is committed to creating a workplace that celebrates diversity and encourages teamwork. Our drive for innovation and pursuit of excellence are deeply embedded in our vibrant company culture, ensuring that each individual's efforts are recognized and valued as we unite to build a safer automotive world.
We are seeking a highly motivated and skilled Release Engineer to join our AIOps group. In this role, you'll play a critical part in bridging the gap between development and operations, ensuring the seamless qualification, deployment, and monitoring of our AI algorithms and infrastructure, and be responsible for the end-to-end operationalization of our core technology.
A day in the life and how you’ll make an impact:
* Manage the end-to-end release process of machine learning algorithms and infrastructure components, from qualification through deployment.
* Validate and test new algorithm releases to ensure they meet performance, stability, and compliance standards.
* Create and execute deployment plans across various environments (staging, production), ensuring minimal risk and downtime.
* Collaborate closely with AI researchers, MLOps, and software engineers to understand release requirements, share feedback, and resolve pre-release issues.
* Identify and drive automation opportunities within the release pipeline to improve efficiency, reliability, and traceability.
* Oversee updates to infrastructure components, ensuring compatibility and performance across systems.
* Monitor deployments, proactively identify issues related to model behavior or infrastructure anomalies, and drive resolution with relevant teams.
* Maintain clear and accurate release documentation, including version history, deployment notes, and incident reports.
Requirements:
* Bachelor's degree in Computer Science, Software Engineering, or industry equivalent.
* 2+ years of experience in QA & Automation
* Proficiency in scripting languages (e.g., Python, Bash).
* Experience with containerization technologies (e.g., Docker, Kubernetes).
* Familiarity with CI/CD pipelines (e.g., GitLab CI/CD, Jenkins).
* Experience with cloud platforms (e.g., AWS, GCP).
* Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
* Excellent problem-solving skills and attention to detail.
* Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams.
Bonus if you have: Strong understanding of the machine learning lifecycle, from experimentation to deployment and monitoring.
* Experience with specific MLOps platforms or tools.
* Experience in a fast-paced startup environment.

Why UVeye: Pioneer Advanced Solutions: Harness cutting-edge technologies in AI, machine learning, and computer vision to revolutionize vehicle inspections. Drive Global Impact: Your innovations will play a crucial role in enhancing automotive safety and reliability, impacting lives and businesses on an international scale. Career Growth Opportunities: Participate in a journey of rapid development, surrounded by groundbreaking advancements and strategic industry partnerships.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8214831
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Your role will involve leading features from inception and analysis, through design, implementation, testing, and maintenance.
You will also participate in product-changing architectural choices, take ownership of various components/tools, and continuously expand your skill sets.
Expect to immerse yourself deeply in complex sub-systems that support serving of AI models at scale (for example model-monitoring and feature-store).
Recognized for helping enterprises deliver enterprise-wide analytics at scale, advanced Machine Learning Operations (MLOps) pipeline enables clients to streamline and manage their AI, from the initial concept all the way to production, in a simplified and automated manner.
Your Growth
Driving lasting impact and building long-term capabilities with our clients is not easy work. You are the kind of person who thrives in a high performance/high reward culture - doing hard things, picking yourself up when you stumble, and having the resilience to try another way forward.
In return for your drive, determination, and curiosity, we'll provide the resources, mentorship, and opportunities you need to become a stronger leader faster than you ever thought possible. Your colleaguesat all levelswill invest deeply in your development, just as much as they invest in delivering exceptional results for clients. Every day, you'll receive apprenticeship, coaching, and exposure that will accelerate your growth in ways you wont find anywhere else.
When you join us, you will have:
Continuous learning: Our learning and apprenticeship culture, backed by structured programs, is all about helping you grow while creating an environment where feedback is clear, actionable, and focused on your development. The real magic happens when you take the input from others to heart and embrace the fast-paced learning experience, owning your journey.
A voice that matters: From day one, we value your ideas and contributions. Youll make a tangible impact by offering innovative ideas and practical solutions. We not only encourage diverse perspectives, but they are critical in driving us toward the best possible outcomes.
Global community: With colleagues across 65+ countries and over 100 different nationalities, our firms diversity fuels creativity and helps us come up with the best solutions for our clients. Plus, youll have the opportunity to learn from exceptional colleagues with diverse backgrounds and experiences.
World-class benefits: On top of a competitive salary (based on your location, experience, and skills), we provide a comprehensive benefits package, which includes medical, dental, mental health, and vision coverage for you, your spouse/partner, and children.
Requirements:
BSc in Computer Science or related subject 5+ years of development experience, with extensive Software Engineering skills and system understanding Demonstrated passion for crafting meticulously designed and elegantly structured software solutions Proven ability to navigate uncharted territories in both code and theory autonomously, spanning the entire technology stack Solid proficiency in Python, including asynchronous and parallel programming Experience in developing software from the ground up, showcasing innovation and problem-solving skills Experience with big data frameworks (e.g. Spark) and data-engineering packages (Pandas, Pyarrow) Experience with ML / AI tooling (for training and/or serving) highly desired Proficiency in distributed systems; familiarity with Kubernetes management and its ecosystem is an advantage
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8223161
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
22/06/2025
Job Type: Full Time
A highly motivated senior AI cloud infrastructure Software architect to join our team in Israel. Work on innovative technologies shaping the future of AI and cloud infrastructure. We are developing RDMA Transport protocols within the Networking software architecture team. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the state-of-the-art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize vision?

What you'll be doing:
Translate requirements into vision, architecture, and roadmap
Design infrastructure to support programmable improvements to RDMA protocols for E/W AI networking stack
Collaborate with multi-functional teams to innovate and deliver networking solutions
Explore innovative solutions in HW and SW for our next-generation platforms as part of programmable RDMA architecture
Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations
Requirements:
Bachelor's or Masters degree in Computer Science or equivalent experience
8+ years of proven experience in the field
Background in networking stack and protocols such as RDMA, TCP/IP, and InfiniBand
Strong articulation skills for crafting and improving technical documents and the ability to engage with a globally distributed engineering team
Eagerness to learn new technologies and constantly improve your expertise
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8224793
סגור
שירות זה פתוח ללקוחות VIP בלבד