Senior Software Developer

עדכון קורות החיים לפני שליחה

8321816

שירות זה פתוח ללקוחות VIP בלבד

משרות דומות שיכולות לעניין אותך

דיווח על תוכן לא הולם או מפלה

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

28/08/2025

Senior Software Engineer, Chip Simulation

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

we are widely considered to be one of the technology worlds most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
we are looking for a phenomenal Nvlink HW Simulation developer for the ChipSim Group. You will join the ChipSim growing team and take our product to next level, working closely with HW design and architect teams and gaining a deep understanding of our companys products and technologies.
What Youll Be Doing:
You will be responsible for learning, developing, and delivering new Networking features in simulation.
Collaborate with chip architects, firmware developers, and hardware design teams to accurately simulate complex physical-layer behaviors in software.
Extend and optimize the simulation infrastructure by contributing to CI pipelines, automated test frameworks, and regression tools.
Support internal users by debugging simulation flows and collaborating on bug resolution.
Take part in future-facing innovation by enabling simulation for next-generation devices and features.

Requirements:
Bachelor's Degree or equivalent experience in Computer Science / Software Engineering / Computer Engineering / Electrical Engineering / Communication Engineering.
5+ years of experience in Python programming, with strong object-oriented design skills.
Experience with C and/or C++, especially in systems or performance-sensitive environments.
Experience debugging using debuggers (gdb), including concurrency issues (races, deadlocks...).
Solid understanding of Linux, containerized environments (e.g., Docker), and command-line tools.
Familiarity with Inter-Process Communication (IPC) mechanisms (sockets, message queues, shared memory...).
Ability to communicate complex technical ideas in simple terms.
Well-organized, proactive and capable of leading your own tasks.
Collaborative personality with a love for teamwork.
Ways to stand out from the crowd:
Experience building complex simulation or emulation systems, especially those simulating hardware behaviors.
Background with multi-platform systems spanning HW, FW, and SW.
Experience with low-level networking protocols and applications.
Knowledge of physical layer concepts.
Experience contributing to CI/CD systems and tooling (e.g., Git, Jenkins, Gerrit).

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8322872

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

חברת השמה / כח אדם

07/09/2025

Senior MLOps Engineer

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Realize your potential by joining the leading performance-driven advertising company!
As a Senior MLOps Engineer on the Infra group, youll play a vital role in develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How youll make an impact:
As a Senior MLOps Engineer Engineer, youll bring value by:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Influence directly on the way billions of people discover the internet.

Requirements:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environment
Excellent coding skills in Java & Python
Experience with TensorFlow a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of strong Computer Science fundamentals: object-oriented design, data structures systems, applications programming and multi threading programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8335971

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

27/08/2025

Senior Software Architect - Deep Learning and HPC Communications

חברה חסויה

Location: Tel Aviv-Yafo and Yokne`am

Job Type: Full Time

we are leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU -- our invention -- serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions that were once considered science fiction, including artificial intelligence to autonomous cars. Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable communications software.
DL and HPC applications have a huge compute demands and already run at scales of up to tens of thousands of GPUs. GPUs are connected with high-speed interconnects (e.g. NVLink, PCIe) within a node and with high-speed networking (e.g. InfiniBand, Ethernet) across nodes. Efficient and fast communication between GPUs directly impacts end-to-end application performance. This impact continues to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the state-of-the-art, break performance barriers, and deliver platforms the world has never seen before. Are you ready to build the new and innovative technologies that will help realize our company's vision?
What you will be doing:
Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.
Design and implement new communication technologies to accelerate AI and HPC workloads.
Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.
Build proofs-of-concept, conduct experiments, and perform quantitive modeling to evaluate and drive new innovations.
Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs).

Requirements:
M.S./Ph.D. degree in CS/CE or equivalent experience.
5+ years of relevant experience.
Excellent C/C++ programming and debugging skills.
Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
Deep understanding of operating systems, computer and system architecture.
Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.
Strong experience with Linux.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Expertise in related technology and passion for what you do. Experience with CUDA programming and our company GPUs. Knowledge of high-performance networks like InfiniBand, RoCE, NVLink, etc.
Experience with Deep Learning Frameworks such PyTorch, TensorFlow, etc. Knowledge of deep learning parallelisms and mapping to the communication subsystem. Experience with HPC applications.
Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8321599

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

חברת השמה / כח אדם

3 ימים

Senior Software Engineer, Generative AI

חברה חסויה

Location: Tel Aviv-Yafo and Haifa

Job Type: Full Time

our company's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to our companys needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
our company Research addresses challenges that define the technology of today and tomorrow. From conducting fundamental research to influencing product development, our research teams have the opportunity to impact technology used by billions of people every day.
Our teams aspire to make discoveries that impact everyone, and core to our approach is sharing our research and tools to fuel progress in the field -- we publish regularly in academic journals, release projects as open source, and apply research to our company products.
Responsibilities
Write and test product or system development code.
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.

Requirements:
Minimum qualifications:
Bachelors degree or equivalent practical experience.
5 years of experience with software development in one or more programming languages.
3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.
3 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
1 year of experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).
Preferred qualifications:
Master's degree or PhD in Computer Science or related technical field.
5 years of experience with data structures or algorithms.
3 years of experience working in a complex, matrixed organization involving cross-functional, or cross-business projects.
1 year of experience in a technical leadership role.
Experience developing accessible technologies.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8345219

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

04/09/2025

Senior Software Engineer, AI Platform

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Run:ai, now part of our company, has evolved AI infrastructure by merging GPU virtualization with Kubernetes-native capabilities. Our world class AI platform allows organizations to improve productivity and efficiency for data scientists and machine learning engineers. With deep Kubernetes expertise and a focus on innovation, we are dedicated to developing cutting-edge technologies, delivering the best user experience for our customers, and providing deep visibility into workload performance through rich metrics that help users optimize their AI workloads. We are looking for highly skilled software engineers to join our Platform Group and help shape the future of AI infrastructure.
The role of a Senior Software Engineer in the Platform Group is to design and develop scalable, high-performance systems that support the next generation of AI workloads. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What youll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other collaborators to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go
Conducting code and design reviews to uphold high-quality standards and mentor team members.

Requirements:
B.Sc. in Computer Science or a related field.
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (We write in Go).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC or AI/ML platforms.
Familiarity with AI inference workloads and performance optimization.
Proficiency in Linux, with knowledge in networking, security, storage, and virtualization.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8333587

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

01/09/2025

Senior Software and Automation Engineer ICPE

חברה חסויה

Location: Yokne`am

Job Type: Full Time

We are looking for a creative and independent Software Engineer for tools, infrastructure, and workflows development for the IC test and product engineering group in our company Networking Business Unit.
our company Networking Business Unit has continuously reinvented itself over two decades. Our high-speed buses & network products are leading in the markets with innovative ways to improve speed and bandwidth from one generation to another and today we are known as the go-to place for End-to-End High-Speed Ethernet and InfiniBand Solutions.
We're looking to grow our company and build our teams with smart people who can join us at the cutting-edge technology. We need a creative individual, who will help move Network Silicon ICs products (Switch, NIC, SmartNic) from design to mass production. You will work with test engineers, test house, Design, IT and many other professionals in the organization for the development of tools and test infrastructure for speeding time to market and enabling next generation test capabilities, characterization and data analysis.
If you are passionate about enabling of the highest quality Network products in the market, we want to hear from you!
What you'll be doing:
Design, develop, and maintain mission-critical engineering applications and automation tools.
Build systems that automate test program validation, execution, and release processes.
Architect infrastructure for scalable test and data workflows targeting next-generation network silicon.
Collaborate with cross-functional teams to enhance HW/SW automation flows and characterization pipelines.
Support integration and deployment in manufacturing environments and Contract Manufacturers (CM).
Enable new capabilities in the CM
Leverage DevOps best practices (CI/CD, version control, infrastructure automation) to accelerate internal development cycles.
Work with various teams at our company to improve and automate data analysis capabilities for all engineering and characterization test results.

Requirements:
BSc or higher in Computer Science or related field, with 7+ years of hands-on software development experience.
Proficiency in C# and Python; C/C++ experience is a strong plus.
Proven experience in GUI, application development, and tool integration; web/cloud background is advantageous.
GIT high proficiency.
Outstanding customer orientation
Hands-on experience with CI/CD (Jenkins, GitLab pipelines), Git-based workflows, Linux environments, shell scripting, and virtualized infrastructure.
Passion for it just works automation and no repetitive tasks.
Excellent communication skills with diverse teams and functional groups
Agile, self-learning and high execution quality standards
Innovative approach for problem solving.
Ways to stand out from the crowd:
VBA or VB6 experience is a huge plus.
Semiconductor test knowledge or hands-on experience with ATE/DFT workflows.
Experience with HW/SW interfaces.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8328340

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

27/08/2025

Senior HPC Performance Engineer

חברה חסויה

Location: Tel Aviv-Yafo and Yokne`am

Job Type: Full Time

we are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.
Come work for the team that brought to you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC applications of today have a huge compute demand and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (eg. NVLink, PCIe) within a node and with high-speed networking (eg. Infiniband, Ethernet) across the nodes. Communication performance between the GPUs has a direct impact on the end-to-end application performance; and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are you ready for to contribute to the development of innovative technologies and help realize our company's vision?
What you will be doing:
Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.
Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack
Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available
Triage and root-cause performance issues reported by our customers
Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information
Collaborate with a very dynamic team across multiple time zones.

Requirements:
M.S. (or equivalent experience) or PHD in Computer Science, or related field with relevant performance engineering and HPC experience
3+ yrs of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)
Experience conducting performance benchmarking and triage on large scale HPC clusters
Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals)
Implement micro-benchmarks in C/C++, read and modify the code base when required
Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python
Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker)
Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones
Ways to stand out from the crowd:
Practical experience with Infiniband/Ethernet networks in areas like RDMA, topologies, congestion control
Experience debugging network issues in large scale deployments
Familiarity with CUDA programming and/or GPUs
Experience with Deep Learning Frameworks such PyTorch, TensorFlow.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8321604

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

02/09/2025

Senior Software Engineer

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Run:ai, now part of our company, has evolved AI infrastructure by merging GPU virtualization with Kubernetes-native capabilities. Our world class AI platform allows organizations to improve productivity and efficiency for data scientists and machine learning engineers. With deep Kubernetes expertise and a team of extraordinary individuals, we are looking for a highly skilled Senior Software Engineer to join the team and help shape the future of AI technology. The role of a Senior Software Engineer in the Run:ai group is to design and develop scalable, high-performance systems that support the next generation of AI workloads and infrastructure. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What youll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other stakeholders to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.
Conducting code and design reviews to uphold high-quality standards and mentor team members.

Requirements:
B.Sc. in Computer Science or a related field (or equivalent experience).
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (Go preferred).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC, GPU virtualization, or AI/ML platforms.
Experience working in Linux environments with knowledge of networking, security, and virtualization.
Contributions to open-source projects or active participation in tech communities.
Agile approach and familiarity with standard methodologies.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8329770

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

27/08/2025

Senior HPC DevOps Engineer

חברה חסויה

Location: Yokne`am

Job Type: Full Time

we are looking for an experienced HPC DevOps Engineer to help us build the supercomputers and HPC clusters of the future. As a Senior HPC DevOps Engineer, you'll be a key player in groundbreaking advancements in artificial intelligence and GPU computing. Your expertise will drive the latest breakthroughs, providing insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms.
What youll be doing:
Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems.
Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments.
Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes.
Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring.
Enhance Monitoring: Deploy advanced monitoring solutions for servers, networks, and storage to ensure seamless operations.
Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.
Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams.
Drive Innovation: Support R&D activities and engage in proof of concepts (POCs) and proof of values (POVs) for future improvements.

Requirements:
B.Sc. in Computer Science, Engineering, or a related field with 5+ years of experience.
Deep knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.
Advanced proficiency in programming and scripting languages, with a solid understanding of object-oriented programming principles.
Familiarity with Jenkins, Ansible, Puppet/Chef.
Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu), networking and OS-level security.
Deep understanding of networking protocols such as InfiniBand and Ethernet.
Experience with job scheduling workloads and orchestration tools such as Slurm and Kubernetes.
Experience with multiple storage solutions like Lustre, GPFS, ZFS, and XFS.
Expertise with virtual systems (VMware, Hyper-V, KVM, Citrix).
Familiarity with cloud platforms (AWS, Azure, Google Cloud).
Ways to stand out from the crowd:
Architectural Insight: Knowledge of CPU and/or GPU architecture.
Container Expertise: Understanding of Kubernetes and container-related microservice technologies.
GPU Focus: Experience with GPU-focused hardware/software (DGX, CUDA).
RDMA Fabrics: Background with RDMA (InfiniBand or RoCE) fabrics.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8321669

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

31/08/2025

Senior Software Engineer

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

We are looking for a senior backend engineer who is truly cross functional possesses strong engineering foundations for developing highly performant services. The ideal candidate will have a deep understanding of algorithms and Python fundamentals, as well as an understanding of the capabilities and limitations of LLMs.
They will be able to quickly collaborate with their product counterparts to iterate over new, cutting edge features and capabilities and get them to the hands of our users. Collaborating closely with product teams, they will quickly ideate, develop and deliver innovative features and capabilities to our users.
If you have the skills and experience we are looking for, we encourage you to apply for this exciting opportunity to join our team and make a significant impact on the future of AI
Role and Responsibilities:
Build the framework used to supercharge our algorithm teams to write scalable, performant algorithms
E2E ownership of production services, databases and infrastructure
Work in fast iteration loops with business/product managers to architect and deliver new products and capabilities
Drive the architectural design, including dependent services and service interactions (APIs & SDKs)
Apply judgment and experience to balance trade-offs between competing interests and optional solutions, considering profiling data to inform decisions on performance and resource utilization.
Help with coaching and mentoring team members, and maintaining high standards of software quality within the team.

Requirements:
5+ years of programming experience. Proficiency with Python is a huge plus.
Experience with building SaaS applications from conception to production.
Strong hands-on experience with production systems, continuous integration and deployment and testing best practices.
Performance engineering experience to ensure applications are built to scale, run, and perform for varying demands
Able to clearly articulate architecture patterns of complex systems, with business and technical implications, to executive and customer stakeholders
Collaborate with engineers across the organization to champion standard software patterns and the reuse of shared libraries and services
Advantage: Experience working with Large Language Models (LLMs) and cutting-edge AI technologies.
Advantage: Experience with the following technologies is a plus: Celery, Databases such as postgres, redis,and pgvector support such as Aurora & alloydb), docker containers, etc.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8326410

שירות זה פתוח ללקוחות VIP בלבד