Senior Autonomous Driving Software Engineer

5 hours ago
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior Software Engineer, you will design, develop, and integrate core infrastructure components that power our perception and planning pipelines. Your work will ensure seamless collaboration between hardware, algorithms, and platform modules - delivering production-grade software that runs in real autonomous vehicles.

Responsibilities:
Design and develop infrastructure components to be used across multiple application groups.
Develop and maintain production-grade code for deployment on real vehicles.
Profile, debug, and optimize platform infrastructure modules for performance and reliability.
Integrate software components with hardware and other software modules to ensure seamless system operation.
Work closely with cross-functional teams (hardware, algorithms, and verification) to ensure system-level functionality.
Create and maintain clear technical documentation, including design specifications and system integration details.
Support thorough testing and validation in collaboration with verification engineers.
Requirements:
B.Sc or M.Sc in Computer Science, Electrical Engineering, or a related field.
5+ years of hands-on experience in SW development for real-time, high-performance systems.
Strong expertise in Python/C++ development.
Hands-on experience with ROS/ROS 2.
Deep understanding of memory management, concurrency, synchronization, and low-level performance optimization.
Strong problem-solving and debugging skills at the system level.
Prior experience in system-level development at companies like NVIDIA, Intel, Qualcomm, Broadcom, Marvell, etc. is a strong advantage.
Background in AI model deployment on edge platforms (TensorRT, ONNX Runtime) is an advantage.
Experience with camera/LiDAR/IMU integration and synchronization is an advantage.
This position is open to all candidates.
 
8334159
Similar jobs that may interest you
1 day ago
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a meticulous and experienced Chipset System Validation Engineer to ensure the reliability, performance, and quality of our advanced radar chipset systems. This critical role will be responsible for designing, implementing, and executing comprehensive validation strategies that guarantee our chipsets meet the highest standards before delivery to Tier-1 automotive partners. The ideal candidate will combine deep technical expertise with systematic validation methodologies to validate complex radar systems.
Responsibilities
Design and develop comprehensive validation test plans and procedures for radar chipset systems and associated software components
Execute end-to-end system validation testing, including functional, performance, stress, and regression testing
Collaborate with hardware, software, and system engineering teams to define validation requirements and acceptance criteria
Develop and maintain automated test frameworks and validation tools to enhance testing efficiency and coverage
Perform system-level integration testing to ensure seamless interaction between chipset hardware and software components
Conduct environmental and reliability testing to validate performance under various operating conditions
Analyze test results, identify issues, and work with development teams to implement corrective actions
Create detailed validation reports and documentation to support product certification and customer delivery
Establish and maintain validation lab infrastructure, including test equipment and measurement systems
Support field testing activities and customer validation efforts
Continuously improve validation processes and methodologies based on industry best practices.
Requirements:
Bachelor's degree in Electrical Engineering, Computer Engineering, or related field (Master's degree preferred)
4+ years of experience in system validation, verification, or test engineering, preferably with embedded systems or automotive electronics
Strong background in radar systems, RF/microwave testing, or signal processing validation
Experience with validation of complex hardware-software integrated systems
Proficiency in test automation tools and scripting languages (Python, MATLAB, or similar)
Knowledge of automotive industry standards and validation requirements (ISO 26262, ASPICE, etc.)
Experience with test equipment such as oscilloscopes, spectrum analyzers, signal generators, and network analyzers
Preferred Qualifications
Experience with automotive radar systems or ADAS technology validation
Knowledge of automotive functional safety requirements and testing
Familiarity with HIL (Hardware-in-the-Loop) and SIL (Software-in-the-Loop) testing
Experience with continuous integration and automated testing pipelines
Understanding of EMC/EMI testing and automotive compliance requirements.
This position is open to all candidates.
 
8331655
06/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
You will play a critical role in designing, developing, and optimizing the software components that drive our hardware. Your expertise will be instrumental in ensuring the reliability, performance, and efficiency of our products.

We're looking for a Senior Embedded Software Engineer to own and evolve our production‑grade, multi‑threaded C++ application that drives thermal management, ASIC operation, and hardware interfaces in Bitcoin mining hardware, ensuring its ongoing productization, reliability, and performance while accelerating time‑to‑market.

Roles and Responsibilities:
Take complete ownership of the embedded control‑system code for Bitcoin mining.
Transform prototype code into clean, modular, production-ready software.
Resolve technical debt, improve system architecture, and manage dependencies.
Design and enhance robust unit and integration test strategies.
Optimize performance and resource utilization.
Debug and resolve complex issues in a multi‑threaded system.
Document APIs, software designs, and perform code reviews.
Collaborate with ASIC and board‑design teams during bring‑up and validation.
Requirements:
Bachelor's or Master's degree in Computer Engineering, Computer Science, or equivalent.
5+ years of experience developing modern C++ (14/17/20) applications in embedded/Linux environments.
Proven track record of owning large, complex codebases.
Expertise in multithreading and real‑time systems.
Strong background in software architecture and design.
Excellent problem‑solving skills, with clear written and verbal communication.
Hands-on, proactive, and comfortable working independently and cross-functionally.

Advantages:
Proficiency with Git workflows, build systems (CMake), and debugging tools.
Prior work on ASIC‑based products.
Familiarity with hardware communication protocols: SPI, I²C, GPIO.
Experience with safety‑critical systems, system profiling, performance tuning or thermal optimization.
This position is open to all candidates.
 
8292531
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
The Senior Manufacturing Test Engineer is responsible for developing, implementing, and sustaining test processes and equipment used in the manufacturing of hardware products. This role ensures that products meet quality, reliability, and performance standards through robust test strategies, automation, and continuous improvement. The engineer acts as a bridge between design engineering, manufacturing operations, and quality assurance, supporting new product introductions (NPI) through to mass production and sustaining phases.
This is a great opportunity to be part of one of the fastest-growing AI infrastructure companies in history, an organization that is in the center of the hurricane being created by the revolution in artificial intelligence.
We are the data platform company for the AI era. We are building the enterprise software infrastructure to capture, catalog, refine, enrich, and protect massive datasets and make them available for real-time data analysis and AI training and inference. Designed from the ground up to make AI simple to deploy and manage, our company takes the cost and complexity out of deploying enterprise and AI infrastructure across data center, edge, and cloud.
Our success has been built through intense innovation, a customer-first mentality and a team of fearless company ronauts who leverage their skills & experiences to make real market impact. This is an opportunity to be a key contributor at a pivotal time in our company's growth and at a pivotal point in computing history.
Role and Responsibilities:
Test Development & Validation
Design, develop, and implement test plans, test fixtures and infrastructure.
Collaborate with R&D to define test requirements early in the product lifecycle.
Develop test scripts and automation software (Python, LabVIEW, C#, etc.) to improve coverage and efficiency.
Validate test coverage, yield, and reliability through statistical analysis (GR&R, Cpk, SPC).
New Product Introduction (NPI)
Support EVT, DVT, and PVT phases with test readiness and execution.
Lead test process transfer to contract manufacturers (CMs) or ODM partners.
Train CM engineers/technicians on test systems and procedures.
Ensure compliance with safety, regulatory, and customer requirements.
Manufacturing Support & Continuous Improvement
Monitor production test yields, debug failures, and drive root cause analysis (RCA).
Implement corrective actions and continuous improvements to reduce test time, cost, and false failures.
Maintain and calibrate test equipment and fixtures.
Support ECO (Engineering Change Orders) by updating test plans and equipment accordingly.
Cross-Functional Collaboration
Work closely with hardware, firmware, and reliability engineers to improve product testability and robustness.
Partner with Quality and Operations to ensure smooth scaling into mass production.
Engage with suppliers and CM partners on test strategy alignment.
Requirements:
Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or related field.
5-8+ years of experience in manufacturing test engineering, preferably in electronics/hardware products.
Proficiency in test automation tools (e.g., LabVIEW, Python, C#, TestStand).
Familiarity with manufacturing processes.
Strong problem-solving and analytical mindset.
Excellent communication and collaboration across cross-functional teams.
Ability to lead projects, mentor junior engineers, and work with global teams.
Desired Qualifications
Good understanding and experience of server systems including test methodology for CPU, memory and motherboards
Experience with IPMI and testing BMC functionality
Familiarity with networking and testing networking infrastructure
Experience with storage architecture, including testing SSDs
Experience with PCIe debugging and testing
Bench-top electrical debug tool experience as well as electrical design of test circuitry
Knowledge of programming devices such as CPLDs.
This position is open to all candidates.
 
8325649
4 days ago
Job Type: Full Time
We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
We are looking for a phenomenal Senior Verification Engineer for HW Simulation for the ChipSim Group. You will join the growing ChipSim team and take our product to the next level, working closely with HW design and architecture teams and gaining a deep understanding of our company's products and technologies.
What You'll Be Doing:
Develop and maintain robust test environments, verification flows, and infrastructure.
Define and execute comprehensive test plans for existing and next-generation networking features.
Take ownership of functional, integration, and regression testing, from planning through execution.
Build automated test suites and integrate them into CI pipelines to ensure quality at scale.
Collaborate across architecture, firmware, and HW teams to drive quality and early bug detection.
Analyze complex system behaviors and drive debugging efforts across hardware and software boundaries.
Requirements:
Bachelor's degree or higher in Electrical Engineering, Computer Engineering, or equivalent experience.
5+ years of hands-on experience in functional verification and automation.
Strong programming skills in Python and C/C++.
Solid understanding of system-level debugging, failure analysis, and test methodology.
Experience with Linux environments and scripting.
Familiarity with networking concepts and communication protocols.
Ways To Stand Out From The Crowd:
Background in verifying complex HW/FW/SW systems.
Experience with CI tools and methodologies (Git, Jenkins, Gerrit, etc.).
Knowledge of networking stacks and low-level protocols.
Familiarity with UVM or other verification methodologies.
Strong interpersonal and communication skills; comfortable working in a fast-paced, team-oriented environment.
This position is open to all candidates.
 
8326212
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're on a mission to revolutionize public safety. As the global leader in emergency collaboration technology, we're building a cutting-edge platform that helps save lives through live video streaming, real-time chat, and precise location tracking.
Our tools empower emergency teams to respond faster and smarter, ensuring help reaches those in need ASAP! With partnerships with tech giants like Amazon, Microsoft, and AT&T, we're innovating life-saving solutions for over 400 million people worldwide.

About the Role
We are seeking a talented Senior Backend and Realtime Software Engineer to play a critical role in designing, implementing, and maintaining our complex and (life-saving level) critical communication systems. You'll be responsible for ensuring seamless, super-reliable, and scalable communication solutions that support our rapid expansion.

What You'll Do:
Design and implement robust VoIP/unified communication architectures and systems.
Scale and develop highly-available and super-resilient systems and deployments.
Overcome difficult resiliency, scale, load and performance system level challenges.
Collaborate with cross-functional teams to ensure seamless integration.
Stay up-to-date with the latest VoIP technologies and trends.
Contribute to the scalability and reliability of our platform.
Requirements:
Bachelor's degree in Software Engineering or Computer Science
7+ years of experience in Backend Development, with focus on real-time and networking systems
Expertise in cloud services (AWS-like), focusing on backend services like databases, compute services, and API management
Experience with distributed architectures, large scale stateful systems and high scale production workloads
A proactive "can-do" approach and exceptional interpersonal abilities
Experience as VoIP/Unified Communication developer - an advantage.
Understanding of SIP, VoIP, and related technologies - an advantage.
This position is open to all candidates.
 
8288279
Location: Tel Aviv-Yafo
Job Type: Full Time
Now we're looking for a bright and passionate Senior Software Engineer to join our Access Clients team. In this critical role, you will design, develop and maintain core components of our Access Clients, which are used by hundreds of thousands of users worldwide in various household and enterprise environments. You will help take the Windows Client to the next level of functionality and stability, developing core Service functionality as well as user-facing components.
Responsibilities:
Developing core components for C++ Windows Service and applications.
Working on the cross-platform C SDK library which runs on all major operating systems.
Researching new technologies.
Taking ownership of new features and leading collaborative efforts.
Support the entire application lifecycle, including release management and supporting production.
Collaborate with quality assurance, technical support, product-management and other engineering teams to deliver integrated features.
Requirements:
BSc. degree in Computer Science from a known university or equivalent experience
Excellent understanding of software engineering principles and best practices
At least 5 years of development experience in C++
Experience developing applications or Services on Windows
Windows driver development - major advantage
Good understanding of multithreaded development, synchronization and data structures
Extensive experience with networking, network protocols, and networking environment
Experience in developing low-level code in C or Rust - Advantage
Passionate about delivering a great user experience
Motivated and independent, a fast learner who is great at problem-solving
A team player with excellent communication and collaboration skills.
This position is open to all candidates.
 
8276874
10/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Staff Engineer.
What You'll Do:
Lead architecture and system design for critical components of the Developer Experience platform, ensuring scalability, resilience, and long-term maintainability.
Own end-to-end delivery of complex initiatives, from requirements gathering and design to implementation, rollout, and observability.
Design, implement, and maintain robust microservices supporting high-throughput and low-latency operations.
Define and uphold API design standards, including gateway configuration, versioning strategy, and long-term lifecycle management.
Build and optimize backend systems that enable developer-facing products such as SDKs, APIs, and webhooks.
Work with both relational and NoSQL databases to ensure data consistency, scalability, and performance.
Collaborate with cross-functional teams to design systems that meet operational and business requirements.
Research and implement cloud-native architectures to support growth and scalability.
Contribute to the creation of developer tools and standards that improve the usability of our APIs and SDKs.
Requirements:
10+ years of experience in backend development, with a strong focus on scalable infrastructure.
Proficiency in Node.js and TypeScript; additional experience with other backend languages is a plus.
Strong expertise in relational and NoSQL databases, including schema design, query optimization, and troubleshooting.
Experience designing and managing RESTful APIs, including versioning strategies, API gateway integration, and developer-first design.
Proven experience designing and deploying microservices-based architectures in production environments.
Hands-on experience with cloud providers (AWS, GCP, Azure) and container orchestration tools (e.g., Kubernetes, Docker).
Solid understanding of system design principles, distributed systems, and scalability.
Experience with monitoring and logging frameworks (e.g. Datadog, Prometheus, Grafana, ELK stack).
Deep understanding of REST APIs and event-driven architectures.
Advantage - Familiarity with AWS and serverless architectures.
Strong problem-solving skills, with the ability to troubleshoot production issues effectively.
Ability to manage multiple priorities and thrive in a service-oriented, fast-paced environment.
Bonus Points:
Experience designing developer-centric SDKs, tools, or CLI utilities.
Track record of contributing to internal platform teams or DX-focused initiatives.
Knowledge of OpenAPI/Swagger specifications and API documentation best practices.
Passion for elevating developer experience and usability across engineering platforms.
Hands-on experience in designing developer-friendly SDKs and APIs.
Knowledge of CI/CD pipelines and best practices for automated testing and deployment.
This position is open to all candidates.
 
8296063
26/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture group, where we build cutting-edge systems powering the next generation of generative AI workloads. In this role, you will work across software and hardware domains to design and optimize inference infrastructure for large language models running on some of the most advanced GPU clusters in the world.
You'll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across our company and leave a mark on the way generative AI reaches real-world applications.
What You'll Be Doing:
Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.
Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.
Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.
Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.
Evaluate and integrate new software and hardware technologies relevant to model inference (e.g., memory hierarchy, network topology, modern inference architectures).
Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.
Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
Requirements:
Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or equivalent experience.
5+ years of experience building large-scale distributed systems or performance-critical software.
Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows.
Solid software engineering skills in C++ and/or Python, with strong familiarity with CUDA or similar platforms.
Strong system-level thinking across memory, networking, scheduling, and compute orchestration.
Excellent communication skills and ability to collaborate across diverse technical domains.
Ways to Stand Out from the Crowd:
Experience working on LLM inference pipelines, transformer model optimization, or model-parallel deployments.
Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.
Familiarity with data center-scale orchestration, cluster schedulers, or AI service deployment pipelines.
Passion for solving tough technical problems and shipping high-impact solutions.
This position is open to all candidates.
 
8319687
1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Senior Software Engineer to join our growing R&D team. In this role, you will play a critical part in designing, building, and optimizing complex systems that power our AI-driven platform. You'll work across the stack, primarily on backend services, with opportunities to influence architectural decisions and build highly scalable and performant systems. You'll collaborate closely with AI, product, and frontend teams to bring advanced features to life and ensure a seamless, intelligent experience for our users.

This is a high-impact role for someone who is passionate about engineering excellence, eager to shape systems end-to-end, and ready to grow with a fast-moving, AI-first company.



Key Responsibilities:

Design, develop, and maintain robust backend systems and services.
Ensure the scalability, performance, and security of backend components.
Collaborate with front-end developers and data teams to integrate user-facing elements with server-side logic.
Optimize the platform's infrastructure to handle large-scale data processing and analysis.
Troubleshoot and debug complex issues, identifying and implementing the most effective solutions.
Contribute to the architecture and system design decisions for the backend infrastructure.
Stay up to date with industry trends and new technologies to continuously improve backend performance.
Requirements:
7+ years of software development experience in a fast-paced SaaS environment.
Strong experience with server-side technologies, particularly Node.js, Python and SQL.
In-depth knowledge of databases; experience in schema design and optimization.
Expertise in API development and microservices architecture.
Familiarity with cloud platforms such as Google Cloud/AWS.
Understanding of containerization and orchestration tools (Docker, Kubernetes).
Experience with message queues (e.g., RabbitMQ, Kafka or their cloud alternatives such as SQS/pubsub) and data processing.
Experience with client-side technologies (e.g., React) is a plus.
Applied AI or video editing knowledge is a big plus.
Excellent problem-solving skills with a focus on scalability and performance.
Ability to work independently while also thriving in a collaborative team environment.
This position is open to all candidates.
 
8331691
25/08/2025
Confidential company
Location: More than one
Job Type: Full Time
We are looking for an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. We are looking for an outstanding senior HPC architect to be a key player in working with the most exciting computing hardware and software, and to contribute to the latest breakthroughs in artificial intelligence and GPU computing. You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers, to craft improved workflows and develop new, leading, differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms.
What you will be doing:
Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting
Manage Linux job/workload schedules and orchestration tools
Develop and maintain continuous integration and delivery pipelines
Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources
Deploy monitoring solutions for the servers, network and storage
Perform troubleshooting bottom up from bare metal, operating system, software stack and application level
Being a technical resource, develop, re-define and document standard methodologies to share with internal teams
Support Research & Development activities and engage in POCs/POVs for future improvements.
Requirements:
A degree in Computer Science, Engineering, or a related field
5+ years of experience
Knowledge of HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software
Experience with job scheduling workloads and orchestration tools such as Slurm, K8s
Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, Wireshark, etc.) and internals, ACLs and OS-level security protection, and common protocols (e.g., TCP, DHCP, DNS).
Experience with multiple storage solutions such as Lustre, GPFS, ZFS and XFS. Familiarity with newer and emerging storage technologies.
Python programming and bash scripting experience.
Comfortable with automation and configuration management tools such as Jenkins, Ansible, Puppet/Chef
Deep knowledge of Networking Protocols like InfiniBand, Ethernet
Deep understanding and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix)
Ways to stand out from the crowd:
Familiarity with cloud computing platforms (e.g. AWS, Azure, Google Cloud)
Knowledge of CPU and/or GPU architecture
Knowledge of Kubernetes, container related microservice technologies
Experience with GPU-focused hardware/software (DGX, CUDA)
Background with RDMA (InfiniBand or RoCE) fabrics.
This position is open to all candidates.
 
8317649