Senior Software Engineer - CTO Office

25/08/2025
Location: Netanya and Tel Aviv-Yafo
Job Type: Full Time
We are seeking a Senior Software Engineer to join a new initiative within a newly formed group, taking part in the design and implementation of an innovative solution in the world of software development management. With a roadmap brimming with innovations, we are on the lookout for a Senior Golang Software Engineer to contribute to this new and exciting venture. This role is ideal for someone who thrives in a dynamic environment, is passionate about delivering high-quality code, and is adept at working in a startup-like environment within an industry-leading company.
As a Senior Software Engineer, you will:
Contribute to the architecture and development of a new SaaS product using cloud-native approaches
Write high-quality, testable, and efficient code for software development management
Initiate and promote new ideas for continuous product functionality improvement, emphasizing the use of micro-service architecture approaches.
Requirements:
5+ years of experience in software development
Go proficiency: at least 2 years of experience with Go as your primary programming language
Hands-on experience building scalable cloud-native microservices in SaaS
Understanding of Kubernetes, message-bus, caching, and key cloud-native tools for scalable systems
Experience with PostgreSQL and query optimization
Familiarity with API Gateway concepts (e.g., routing, data translation, rewrites, and access control) is an advantage
Experience in modern CI practices and tools.
This position is open to all candidates.
 
8318732
Similar jobs that may interest you
04/09/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
Run:ai, now part of our company, has evolved AI infrastructure by merging GPU virtualization with Kubernetes-native capabilities. Our world-class AI platform allows organizations to improve productivity and efficiency for data scientists and machine learning engineers. With deep Kubernetes expertise and a focus on innovation, we are dedicated to developing cutting-edge technologies, delivering the best user experience for our customers, and providing deep visibility into workload performance through rich metrics that help users optimize their AI workloads. We are looking for highly skilled software engineers to join our Platform Group and help shape the future of AI infrastructure.
The role of a Senior Software Engineer in the Platform Group is to design and develop scalable, high-performance systems that support the next generation of AI workloads. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What you'll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other collaborators to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go.
Conducting code and design reviews to uphold high-quality standards and mentor team members.
Requirements:
B.Sc. in Computer Science or a related field.
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (we write in Go).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC or AI/ML platforms.
Familiarity with AI inference workloads and performance optimization.
Proficiency in Linux, with knowledge in networking, security, storage, and virtualization.
This position is open to all candidates.
 
8333587
03/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Senior Software Engineer to join our growing R&D team. In this role, you will play a critical part in designing, building, and optimizing complex systems that power our AI-driven platform. You'll work across the stack, primarily on backend services, with opportunities to influence architectural decisions and build highly scalable and performant systems. You'll collaborate closely with AI, product, and frontend teams to bring advanced features to life and ensure a seamless, intelligent experience for our users.

This is a high-impact role for someone who is passionate about engineering excellence, eager to shape systems end-to-end, and ready to grow with a fast-moving, AI-first company.



Key Responsibilities:

Design, develop, and maintain robust backend systems and services.
Ensure the scalability, performance, and security of backend components.
Collaborate with front-end developers and data teams to integrate user-facing elements with server-side logic.
Optimize the platform's infrastructure to handle large-scale data processing and analysis.
Troubleshoot and debug complex issues, identifying and implementing the most effective solutions.
Contribute to the architecture and system design decisions for the backend infrastructure.
Stay up to date with industry trends and new technologies to continuously improve backend performance.
Requirements:
7+ years of software development experience in a fast-paced SaaS environment.
Strong experience with server-side technologies, particularly Node.js, Python and SQL.
In-depth knowledge of databases; experience in schema design and optimization.
Expertise in API development and microservices architecture.
Familiarity with cloud platforms such as Google Cloud/AWS.
Understanding of containerization and orchestration tools (Docker, Kubernetes).
Experience with message queues (e.g., RabbitMQ, Kafka, or cloud alternatives such as SQS or Pub/Sub) and data processing.
Experience with client-side technologies (e.g., React) is a plus.
Applied AI or video editing knowledge is a big plus.
Excellent problem-solving skills with a focus on scalability and performance.
Ability to work independently while also thriving in a collaborative team environment.
This position is open to all candidates.
 
8331691
1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior Software Engineer to join the ride as we spearhead the next revolution in electronics!

Responsibilities:
Develop and maintain robust, scalable, and secure Java-based software solutions.
Collaborate with product managers, architects, and other engineers to design and implement new features.
Build and optimize data processing pipelines for high-volume analytics applications.
Ensure software quality through code reviews, unit testing, and integration testing.
Participate in architectural decisions, contributing to the design of cloud-based systems.
Monitor and optimize system performance to meet scalability and reliability goals.
Troubleshoot, debug and resolve issues in development, staging, and production environments.
Requirements:
B.A. or B.Sc. in Computer Science or an equivalent field.
5+ years of hands-on experience in software development.
Strong proficiency in at least one backend programming language (Java, Python).
Strong understanding of object-oriented programming, design patterns, and clean code principles.
Familiarity with database systems (SQL/NoSQL) and query optimization techniques.
Proven experience with cloud platforms (AWS, Azure, GCP) and microservices architecture.
Strong understanding of REST APIs.
Excellent problem-solving skills and a proactive attitude.
Strong communication skills and the ability to collaborate in a team environment.

Advantages:
Experience with Spring Boot and the Spring Framework ecosystem
Experience with JPA (Hibernate is an advantage)
Experience with streaming or messaging services (Kafka, RabbitMQ)
Knowledge of monitoring tools such as Grafana, Prometheus, or ELK Stack
Hands-on experience with containerization and orchestration (Docker, Kubernetes)
Familiarity with big data technologies like Apache Flink or Spark
Experience in performance optimization and distributed systems
This position is open to all candidates.
 
8351645
02/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Run:ai, now part of our company, has evolved AI infrastructure by merging GPU virtualization with Kubernetes-native capabilities. Our world-class AI platform allows organizations to improve productivity and efficiency for data scientists and machine learning engineers. With deep Kubernetes expertise and a team of extraordinary individuals, we are looking for a highly skilled Senior Software Engineer to join the team and help shape the future of AI technology. The role of a Senior Software Engineer in the Run:ai group is to design and develop scalable, high-performance systems that support the next generation of AI workloads and infrastructure. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What you'll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other stakeholders to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.
Conducting code and design reviews to uphold high-quality standards and mentor team members.
Requirements:
B.Sc. in Computer Science or a related field (or equivalent experience).
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (Go preferred).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC, GPU virtualization, or AI/ML platforms.
Experience working in Linux environments with knowledge of networking, security, and virtualization.
Contributions to open-source projects or active participation in tech communities.
Agile approach and familiarity with standard methodologies.
This position is open to all candidates.
 
8329770
20/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a talented, highly motivated, experienced software engineer to join one of our growing, inspiring development teams.
You will work on a multi-tenant, high-scale, distributed SaaS ecosystem built on top of a Kubernetes platform, which is used for managing the cloud security services infrastructure, customers' self-service configuration, monitoring and reporting, analytics, and more.

As a software engineer, you will manage and work with the different engineering teams and architects to design, develop, monitor, scale, and optimize the large-scale architecture of a winning SaaS security service.

What will you do?

Implement our next-generation back-end infrastructure to help us scale our SaaS-based infrastructure.

Be part of a team building tools to make our infrastructure scalable and robust.

Leverage Generative AI tools for code generation, optimization, debugging, documentation, and prototyping.

Continuously research and integrate new AI-driven developer productivity tools.

Design and develop an always-available cloud-based SaaS platform in AWS.

Lead and design the development of robust CI/CD pipelines for containerized applications running on Kubernetes.

Design and build strong application and system monitoring and automated self-healing procedures.

Maintain and support application deployments, building new systems and upgrading existing ones.

Work closely with all the Engineering and DevOps teams, taking full responsibility and ownership from conception to post-deployment in a collaborative, fast-paced environment.
Requirements:
6+ years of experience in infrastructure and backend software development roles.

Experience managing infrastructure on AWS.

Experience with architecture methodologies and paradigms like micro-services, distributed systems and more.

Experience integrating and actively using GenAI tools (e.g., GitHub Copilot, Claude, ChatGPT) in daily development is a must.

Open-minded to new workflows and AI-driven innovation.

An agile/DevOps way of thinking.

Experience with CI/CD tools (Jenkins, Argo CD, Nexus, and similar).

Experience with the Kubernetes platform and tools (Helm charts and similar).

Experience with the following technologies/tools/fields: Elasticsearch, ClickHouse, messaging (Kafka, NATS, Redis, etc.), and monitoring and visibility (Prometheus, Grafana, Loki, etc.).

Programming languages: Go/Java.

Functioning well under pressure.

Strong problem-solving ability and a "can-do" approach.

Working in an agile environment.

Excellent communication and interpersonal skills.
This position is open to all candidates.
 
8312368
31/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a senior backend engineer who is truly cross-functional and possesses strong engineering foundations for developing highly performant services. The ideal candidate will have a deep understanding of algorithms and Python fundamentals, as well as an understanding of the capabilities and limitations of LLMs.
Collaborating closely with their product counterparts, they will quickly ideate, iterate on, and deliver new, cutting-edge features and capabilities to our users.
If you have the skills and experience we are looking for, we encourage you to apply for this exciting opportunity to join our team and make a significant impact on the future of AI.
Role and Responsibilities:
Build the framework used to supercharge our algorithm teams to write scalable, performant algorithms
E2E ownership of production services, databases and infrastructure
Work in fast iteration loops with business/product managers to architect and deliver new products and capabilities
Drive the architectural design, including dependent services and service interactions (APIs & SDKs)
Apply judgment and experience to balance trade-offs between competing interests and optional solutions, considering profiling data to inform decisions on performance and resource utilization.
Help with coaching and mentoring team members, and maintaining high standards of software quality within the team.
Requirements:
5+ years of programming experience. Proficiency with Python is a huge plus.
Experience with building SaaS applications from conception to production.
Strong hands-on experience with production systems, continuous integration and deployment and testing best practices.
Performance engineering experience to ensure applications are built to scale, run, and perform for varying demands
Able to clearly articulate architecture patterns of complex systems, with business and technical implications, to executive and customer stakeholders
Collaborate with engineers across the organization to champion standard software patterns and the reuse of shared libraries and services
Advantage: Experience working with Large Language Models (LLMs) and cutting-edge AI technologies.
Advantage: Experience with the following technologies: Celery, databases such as PostgreSQL and Redis (and pgvector-capable services such as Aurora and AlloyDB), Docker containers, etc.
This position is open to all candidates.
 
8326410
27/08/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are spearheading the AI revolution and the creation of state-of-the-art accelerated compute platforms for global utilization. Our Network Modeling and Performance Insights group is seeking a skilled and driven Software Developer to design and develop our infrastructure for a complex networking simulation as a service. In this role, you will be responsible for developing and optimizing our network simulation software and enhancing its performance and quality. You will work on integrating this infrastructure with cloud computation services for various use cases and ensure the simulation is available as a service for internal and external customers. If you're passionate about tackling intricate challenges and contributing to comprehensive software solutions, we want to hear from you.
What you'll be doing:
Enhance simulation runtime and memory consumption through innovative optimization techniques.
Improve the quality of the simulation as a software product, ensuring robustness and reliability.
Expand the simulation's versatility to accommodate new, varied, and complex use cases and bleeding-edge requirements.
Design and expose the simulation as a service to facilitate easier access for different stakeholders.
Integrate a new simulation management system, making simulated experiments data accessible to all users.
Design and develop a CI/CD infrastructure for our complex networking simulation tool, ensuring efficient deployment and smooth integration processes.
Requirements:
BSc or above in Computer Science, Computer Engineering, or a related field, or equivalent experience.
5+ years of relevant practical experience in software development, including working on a large-scale software product, preferably with strict performance considerations.
Proficiency in C++ and optimization techniques for improving code performance.
In-depth knowledge of computer science fundamentals and computer architecture.
Strong communication skills.
Experience with simulation environments (specifically, network related) - a significant advantage.
Prior experience with multi-core computation and parallel code acceleration.
Familiarity with cloud computing and parallelization of computational workloads - an advantage.
Experience in developing CI/CD pipelines and integrating services - an advantage.
This position is open to all candidates.
 
8321816
27/08/2025
Location: Tel Aviv-Yafo and Netanya
Job Type: Full Time
Senior Software Developer - Core Team
Are you a software engineer who thrives on designing and building scalable, high-quality services and infrastructure? Do you want to work on cutting-edge services while shaping the backbone of an ML platform? If so, we'd love to have you join us.
As a Senior Software Engineer, you will play a critical role in architecting and developing cloud-based core services and infrastructure with a strong emphasis on clean code, performance, observability, monitoring, and security. This role also includes developing core Go/Java libraries, cross-cutting services like authentication and authorization, Kubernetes deployments, billing, etc. You will collaborate closely with engineering teams, DevOps, architects, and product managers to ensure a scalable, stable, and high-performing system.
As a Senior Software Engineer, you will:
Write clean, maintainable, and efficient code following best practices.
Design and develop robust backend services with scalability, performance, and security in mind.
Conduct high-quality code reviews and architecture discussions, ensuring best practices are followed.
Take full ownership of projects - from ideation and design to production deployment and maintenance.
Collaborate with cross-functional teams to define, design, and ship new features.
Stay current with industry trends, technologies, and best practices in software development and cybersecurity.
Be involved in multiple aspects of an ML platform - from data sources to inference pipelines.
Requirements:
5+ years of proven experience in software development.
Experience in designing, developing, and debugging complex, distributed systems.
Proven hands-on experience in Kubernetes, containerized environments and microservices.
Hands-on experience with cloud services, observability tools, and automation.
Experience with at least one of the major cloud platforms (e.g., AWS, GCP).
Ability to lead discussions, mentor engineers, and drive technical decisions.
A collaborative mindset - we value engineers who can communicate effectively across teams.
Programming in Go or Java - advantage.
This position is open to all candidates.
 
8321497
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Realize your potential by joining the leading performance-driven advertising company!
As a Senior MLOps Engineer in the Infra group, you'll play a vital role in developing, enhancing, and maintaining highly scalable machine-learning infrastructure and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How you'll make an impact:
As a Senior MLOps Engineer, you'll bring value by:
Developing, enhancing, and maintaining highly scalable machine-learning infrastructure and tools, including CI/CD, monitoring, alerting, and more.
Taking end-to-end ownership: designing, developing, deploying, measuring, and maintaining our machine learning platform, ensuring high availability, high scalability, and efficient resource utilization.
Identifying and evaluating new technologies to improve the performance, maintainability, and reliability of our machine learning systems.
Working in tandem with the engineering-focused and algorithm-focused teams to improve our platform and optimize performance.
Optimizing machine learning systems to scale and utilize modern compute environments (e.g., distributed clusters, CPU and GPU) and continuously seeking optimization opportunities.
Building and maintaining tools for automation, deployment, monitoring, and operations.
Troubleshooting issues in our development, production, and test environments.
Directly influencing the way billions of people discover the internet.
Requirements:
Experience developing large-scale systems. Experience with filesystems, server architectures, distributed systems, SQL, and NoSQL. Experience with Spark and Airflow or other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years of experience.
Passion for ML engineering and for creating and improving platforms.
Experience with designing and supporting ML pipelines and models in a production environment.
Excellent coding skills in Java and Python.
Experience with TensorFlow is a big plus.
Strong problem-solving and critical-thinking skills.
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of computer science fundamentals: object-oriented design, data structures, systems, application programming, and multithreaded programming.
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.
This position is open to all candidates.
 
8335971
27/08/2025
Confidential company
Location: Tel Aviv-Yafo and Yokne'am
Job Type: Full Time
We are leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars.
Come work for the team that brought you NCCL, NVSHMEM & GPUDirect. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applications! We are looking for a motivated performance engineer to influence the roadmap of our communication libraries. Today's DL and HPC applications have huge compute demands and run at scales of up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (e.g., NVLink, PCIe) within a node and with high-speed networking (e.g., InfiniBand, Ethernet) across nodes. Communication performance between the GPUs has a direct impact on end-to-end application performance, and the stakes are even higher at huge scales! This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize our company's vision?
What you will be doing:
Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters.
Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack.
Evaluate proofs of concept and conduct trade-off analysis when multiple solutions are available.
Triage and root-cause performance issues reported by our customers.
Collect large amounts of performance data; build tools and infrastructure to visualize and analyze the information.
Collaborate with a very dynamic team across multiple time zones.
Requirements:
M.S. (or equivalent experience) or Ph.D. in Computer Science or a related field, with relevant performance engineering and HPC experience.
3+ years of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM).
Experience conducting performance benchmarking and triage on large-scale HPC clusters.
Good understanding of computer system architecture, HW-SW interactions, and operating systems principles (aka systems software fundamentals).
Ability to implement micro-benchmarks in C/C++ and to read and modify the code base when required.
Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python.
Familiar with containers, cloud provisioning, and scheduling tools (Kubernetes, SLURM, Ansible, Docker).
Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and time zones.
Ways to stand out from the crowd:
Practical experience with Infiniband/Ethernet networks in areas like RDMA, topologies, congestion control
Experience debugging network issues in large scale deployments
Familiarity with CUDA programming and/or GPUs
Experience with Deep Learning frameworks such as PyTorch and TensorFlow.
This position is open to all candidates.
 
8321604