דרושים » ניהול ביניים » senior software engineer, profiling services

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 11 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
help build an always-on, low-overhead gpu profiling service that runs in production, scales across cluster environments, and delivers actionable insights for ml workloads. you will be hands-on delivering our profiling solutions across system software, drivers, and cuda to make profiling continuously available and reliable.
what youll be doing:
develop low-overhead, high-reliability implementations in C / C ++, with bounded cpu/memory budgets.
lead end-to-end feature delivery spanning user-mode components, driver/platform layers, and performance counter/trace providers.
establish profiling models that integrate with existing ml/ai workflows (e.g., pytorch/xla) to turn low-level signals into actionable insights.
Requirements:
what we need to see:
bs or ms degree or equivalent experience in computer engineering, Computer Science, or related degree.
5+ years of system -level C / C ++ development, including concurrency, memory management, and performance engineering.
familiarity with system software design, operating systems fundamentals, computer architectures, performance analysis, and delivering production-quality software.
strong interpersonal, verbal, and written communication; able to influence across organizations and build trust with external collaborators.
ways to stand out from the crowd:
extensive experience with profiling/tracing stacks for cpu/gpu (e.g., cupti, nsight, performance counters, event correlation) and debugging highly concurrent systems.
deep hands-on knowledge of cuda and gpu architecture, including runtime/driver apis, cuda streams/graphs, and Kernel behavior.
track record building continuous, always-on, or multi-client profiling systems designed for predictable overhead at scale.
hands-on experience tuning ml training/inference loops based on deep profiling analysis, with familiarity in ml ecosystems (e.g., pytorch, jax) and correlating application events with gpu metrics to translate data into actionable performance insights (e.g., bottleneck triage, compute vs. memory bound).
experience with user-mode driver development and integration within platform security and permissions models.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593763
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
4 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
Help build an Always-On, low-overhead GPU profiling service that runs in production, scales across cluster environments, and delivers actionable insights for ML workloads. You will be hands-on delivering our profiling solutions across system software, drivers, and CUDA to make profiling continuously available and reliable.

What youll be doing:

Develop low-overhead, high-reliability implementations in C/C++, with bounded CPU/memory budgets.

Lead end-to-end feature delivery spanning user-mode components, driver/platform layers, and performance counter/trace providers.

Establish profiling models that integrate with existing ML/AI workflows (e.g., PyTorch/XLA) to turn low-level signals into actionable insights.
Requirements:
What we need to see:

BS or MS degree or equivalent experience in Computer Engineering, Computer Science, or related degree.

5+ years of system-level C/C++ development, including concurrency, memory management, and performance engineering.

Familiarity with system software design, operating systems fundamentals, computer architectures, performance analysis, and delivering production-quality software.

Strong interpersonal, verbal, and written communication; able to influence across organizations and build trust with external collaborators.

Ways to stand out from the crowd:

Extensive experience with profiling/tracing stacks for CPU/GPU (e.g., CUPTI, Nsight, performance counters, event correlation) and debugging highly concurrent systems.

Deep hands-on knowledge of CUDA and GPU architecture, including runtime/driver APIs, CUDA streams/graphs, and kernel behavior.

Track record building continuous, always-on, or multi-client profiling systems designed for predictable overhead at scale.

Hands-on experience tuning ML training/inference loops based on deep profiling analysis, with familiarity in ML ecosystems (e.g., PyTorch, JAX) and correlating application events with GPU metrics to translate data into actionable performance insights (e.g., bottleneck triage, compute vs. memory bound).

Experience with user-mode driver development and integration within platform security and permissions models.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8586600
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
3 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior Software Engineer, Filesystem
About The Position:
We are architecting a new approach to the enterprise data stack built for the age of reasoning. We set the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.
We are a pre-IPO, growth-stage company on a hyper-growth trajectory. Weve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the worlds largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. Were passionate about solving our customers most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.
What you'll be doing:
The filesystem group is a high-powered team responsible for implementing algorithms at scales of 100s of PBs. The team also manages the core filesystem components, including blocks and metadata management, snapshots, RAID logic, object-store tiering, unique cloud disaster recovery features, and more. And most importantly, they skillfully handle the most delicate part of the solution - our customers data.
As a Senior Software Engineer, youll:
Design and develop distributed file system components to support data management features such as snapshots, replication, tiering, and advanced data reduction algorithms;
Participate in the design, architecture, and implementation of next-generation storage architecture;
Assist in technically managing initial storage implementations including proofs-of-concept;
Diagnose bottlenecks and implement clean and performant solutions to achieve unbeatable performance;
Design algorithms and data structures to make sure customer data is safe and coherent across our solution in a wide variety of failure modes; and
Constantly revisit the architecture, algorithms, and methodologies to improve productivity, reliability, and maintainability.
Requirements:
Mastery of low-level and performant programming in C or C++/ Rust
A thorough understanding of concurrency, inter-process communication, threading models, and synchronization concepts, including significant experience with complex multithreaded software design
Experience identifying, reproducing, and resolving complex software defects, including root cause isolation, tracing through large source codebases, and implementing long-term fixes as well as short-term workarounds
5+ years of hands-on experience with Linux development and debugging, along with a broad knowledge and understanding of Linux internals
It's nice if you have:
Experience in data-path design and development
Experience with development of highly-distributed systems
Deep familiarity with concepts and features from the storage industry, including snapshots, replication, transparent data migration, and data reduction techniques
Experience with ZFS, XFS, or other file systems or with enterprise storage solutions
Experience working with the Linux filesystem community
Contribution, upstreaming, or maintaining of filesystem code
Experience playing a significant role in the implementation of a concurrent, long-running performant server.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8588364
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 10 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
we seek a versatile senior software engineer who is passionate about performance optimization and generative ai. our team builds software solutions that enable efficient inference on the latest and greatest generative ai models. we tackle problems on all levels of the stack-from server-level request batching to gpu Kernel fusion-and collaborate with teams across diverse disciplines to push nvidia's hardware to its full potential.
what youll be doing:
cooperate with research teams to onboard new llms and vlms into nvidia's opensource ai runtimes
optimize inference workloads using sophisticated profiling and simulation tools
build solid, extendable inference software systems, and refine robust apis
implement and debug low-level gpu code to harness the latest hw features
own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products
Requirements:
what we need to see:
b.sc., m.sc. or equivalent experience in Computer Science or computer engineering
5+ years of relevant hands-on software engineering experience
profound knowledge of software design principles
strong proficiency in at least one system and one scripting language
strong grasp of Machine Learning concepts
people person with excellent communication skills that enjoys collaboration and teamwork.
ways to stand out from the crowd:
familiarity with nvidia's DL software stack, e.g. triton inference server, tensorrt-llm, and model optimizer
proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with nvidia's accelerators.
familiarity with llm quantization, fine-tunning, and caching algorithms
proficiency in gpu Kernel programming (cuda or opencl)
prior experience working on a large software project with 50+ contributors
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593825
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a skilled software engineer to join our NPU software stack development team. This role involves developing high-performance GPU programming frameworks, runtime systems, and libraries for AI/ML workloads. You will be responsible for implementing, optimizing, and maintaining GPU software stack components to support distributed AI training and inference.
Key Responsibilities
Identify bottlenecks, analysis and optimize in distributed NPU eco-system
Design and develop NPU memory management system
Design and develop optimized NPU development framework, execution path and debugging
Develop compatibility with AI frameworks (Triton, PyTorch, JAX)
Write high-quality, well-tested code with comprehensive documentation
Collaborate with other teams (Hardware, Network, QA, AI Framework Integration)
Participate in code reviews and technical design discussions.
Requirements:
5+ years of experience in distributed system programming
3+ years of experience with NPU programming (Triton, CUDA, HIP, OpenCL)
Expert-level C/C++ programming with focus on performance optimization
Expert-level Python programming with focus on DL/ML frameworks (PyTorch/JAX/etc)
Deep understanding of NPU architecture, memory tiering, and programming models
Knowledge of NPU runtime systems
Experience with performance profiling and optimization tools
Strong problem-solving and debugging skills
Experience with version control systems, Ticking system and collaborative development
Team player with excellent communication skills
Fast learner, highly organized, detail-oriented with high motivation
Preferred Qualifications
Experience with NPU software stack development
Experience with large-scale NPU systems (100+ NPUs)
Experience with DL/ML workloads (oriented AI) and distributed training / inferencing
Familiarity with containerization and orchestration.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8550014
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a top-talented software engineer with experience in the low-level development and\or security domain, to be a founding member of our new Data Protection team. In this role, you will take end-to-end ownership of building next-generation data protection capabilities from the ground up, spanning multiple technologies, services, and layers of the product.
This is a true builder role for someone who thrives with an owners mindset, embraces ambiguity, and enjoys close collaboration, and can bridge between deep kernel/system internals into a production-grade product, integrating multiple components into a cohesive, high-impact security solution.
What will you do?
Design and implement low-level agent modules (using C++ or Rust), capable of monitoring data access and movement with minimal performance overhead.
Research and evaluate technologies for building high-fidelity sensors that track data access.
Develop robust, scalable, and performant code that operates reliably across multiple operating systems and environments.
Collaborate closely with Core Agent, Backend, and Frontend teams to deliver a unified, user-facing, next-generation data protection product.
Requirements:
4+ years of experience as a low-level software engineer, building complex systems in modern C++\C.
Hands-on experience with system-level development, debugging tools, and performance profiling.
Deep OS Expertise, with strong knowledge of operating system architecture and internals (Windows, Linux, and/or macOS).
A proven track record of shipping production-quality code to large-scale deployments, ensuring reliability across diverse environments and collaborating with multiple stakeholders.
Itd be great if you have experience with / or Youd learn & gain on our team:
Security Domain Expertise
Technical leadership experience
Exposure to a multi-stack environment, working across agent, backend, and frontend systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8553811
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
3 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior Software Engineer, Platform
About The Position:
We are architecting a new approach to the enterprise data stack built for the age of reasoning. We set the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.
We are a pre-IPO, growth-stage company on a hyper-growth trajectory. Weve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the worlds largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. Were passionate about solving our customers most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.
What youll be doing:
As a Senior Software Engineer, you'll be an integral part of our Platform Group-a hands-on team of seasoned engineers responsible for building and optimizing the critical low-level components that power our infrastructure. This includes the networking stack, storage systems, task scheduling framework, and other foundational systems.
As a Senior Software Engineer, youll:
Play an active role in creating jaw-dropping designs, writing impressively efficient code, and conducting collaborative code review;
Share fresh ideas and architectural guidance for our core areas of distributed computing, high-performance storage, and cloud computing; and
Challenge our benchmarks with performance testing around IO and storage throughput.
Requirements:
Mastery of low-level C/C++ development in Linux user space or kernel-space with a vast experience in performance-sensitive code
Extensive experience with networking concepts and protocols, including UDP, TCP, InfiniBand, Ethernet, and RDMA.
10+ years of hands-on experience with software development on Linux based systems
Experience working on complex and large-scale and/or distributed systems
It's nice if you have:
Experience with DPDK, eBPF/XDP and libfabric
Knowledge of storage systems and SSDs (SPDK)
Prior involvement with deep networking (congestion control, bonding, VLAN, InfiniBand)
Kernel driver development know-how
Familiarity with storage concepts (SMB, NFS, S3, SSD, NVMe, Linux filesystems).
Experience with the development of highly-distributed systems.
Experience with memory management concepts and entities in a multiprocessing system (cache, shared memory, numa, huge pages etc.).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8588378
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 11 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
our company is leading the way in groundbreaking developments in artificial intelligence, high performance computing and visualization. the gpu, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.
come work for the team that brought to you nccl, nvshmem & gpudirect. our gpu communication libraries are crucial for scaling deep learning and hpc applications! we are looking for a motivated partner enablement engineer to guide our key partners and customers with nccl. most DL /hpc applications run on large clusters with high-speed networking (infiniband, roce, ethernet). this is an outstanding opportunity to get an end to end understanding of the ai networking stack. are you ready for to contribute to the development of innovative technologies and help realize our vision?
what you will be doing:
engage with our partners and customers to root cause functional and performance issues reported with nccl
conduct performance characterization and analysis of nccl and DL applications on groundbreaking gpu clusters
develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (azure, aws, gcp, etc.)
guide our customers and support teams on hpc knowledge and standard methodologies for running applications on multi-node clusters
document and conduct trainings/webinars for nccl
engage with internal teams in different time zones on networking, gpus, Storage, infrastructure and support.
Requirements:
what we need to see:
b.s./m.s. degree in cs/ce or equivalent experience with 5+ years of relevant experience. experience with parallel programming and at least one communication runtime (mpi, nccl, ucx, nvshmem)
excellent C / C ++ programming skills, including debugging, profiling, code optimization, performance analysis, and TEST design
experience working with engineering or academic research community supporting hpc or ai
practical experience with high performance networking: infiniband/roce/ethernet networks, rdma, topologies, congestion control
expert in Linux fundamentals and a scripting language, preferably Python
familiar with containers, cloud provisioning and scheduling tools (docker, docker swarm, kubernetes, slurm, ansible)
adaptability and passion to learn new areas and tools
flexibility to work and communicate effectively across different teams and timezones
ways to stand out from the crowd:
experience conducting performance benchmarking and developing infrastructure on hpc clusters. prior system administration experience, esp for large clusters. experience debugging network configuration issues in large scale deployments
familiarity with cuda programming and/or gpus. good understanding of Machine Learning concepts and experience with deep learning frameworks such pytorch, tensorflow
deep understanding of technology and passionate about what you do
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593743
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 13 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
our company has evolved ai infrastructure by merging gpu virtualization with kubernetes-native capabilities. our world class ai platform allows organizations to improve productivity and efficiency for data scientists and Machine Learning engineers. with deep kubernetes expertise and a focus on innovation, we are dedicated to developing cutting-edge technologies, delivering the best User Experience for our customers, and providing deep visibility into workload performance through rich metrics that help users optimize their ai workloads. we are looking for highly skilled software engineers to join our platform group and help shape the future of ai infrastructure.
the role of a senior software engineer in the platform group is to design and develop scalable, high-performance systems that support the next generation of ai workloads. you will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of ai capabilities.
what youll be doing:
designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
building and optimizing microservices-based architectures using kubernetes and cloud-native technologies.
collaborating closely with backend engineers, product managers, and other collaborators to deliver impactful solutions.
writing clean, maintainable, and testable code in go
conducting code and design reviews to uphold high-quality standards and mentor team members.
Requirements:
what we need to see:
b.sc. in Computer Science or a related field.
5+ years proven experience in backend software development, including system design and architecture.
proficiency in at least one backend programming language (we write in go).
strong understanding of microservices architecture, restful apis, and relational databases.
deep familiarity with kubernetes and the cloud-native ecosystem.
demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
ways to stand out from the crowd:
expertise in kubernetes internals and advanced cloud-native technologies.
hands-on experience with hpc or ai/ml platforms.
familiarity with ai inference workloads and performance optimization.
proficiency in Linux, with knowledge in networking, security, Storage, and virtualization.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8593557
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
3 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior Software Engineer, Platform
About The Position:
We are architecting a new approach to the enterprise data stack built for the age of reasoning. We set the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.
We are pre-IPO, growth-stage company on a hyper-growth trajectory. Weve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the worlds largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. Were passionate about solving our customers most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.
What youll be doing:
As our new Senior Software Engineer, youll be joining the Platform group. This group of highly-experienced and detail-oriented engineers proudly owns the network stack, storage stack, task scheduling infrastructure and more.
As a Senior Software Engineer, youll:
Play an active role in creating jaw-dropping designs, writing impressively efficient code, and conducting collaborative code review;
Share fresh ideas and architectural guidance for our core areas of distributed computing, high-performance storage, and cloud computing; and
Challenge our benchmarks with performance testing around IO and storage throughput.
Requirements:
Mastery of low-level C/C++ development in Linux user space or kernel-space with a vast experience in performance-sensitive code
5+ years of hands-on experience with software development on Linux based systems
Familiarity with network concepts and protocols (UDP, TCP, InfiniBand, Ethernet, RDMA).
It's nice if you have:
Experience with DPDK and SPDK
Knowledge of storage systems and SSDs
Kernel driver development know-how
Previous experience with hardware interfaces
Prior involvement with deep networking (congestion control, bonding, VLAN, InfiniBand)
Familiarity with storage concepts (SMB, NFS, S3, SSD, NVMe, Linux filesystems).
Experience with the development of highly-distributed systems.
Experience with memory management concepts and entities in a multiprocessing system (cache, shared memory, numa, etc.)
Experience working on complex and large-scale and/or distributed systems, databases, or others.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8588385
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior Algo Data Engineer
Realize your potential by joining the leading performance-driven advertising company!
As a Senior Algo Data Engineer on the Infra group, youll play a vital role in develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How youll make an impact:
As a Senior Algo Data Engineer, youll bring value by:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Influence directly on the way billions of people discover the internet
Our tech stack:
Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, ElasticSearch, AirFlow, BigQuery, Google Cloud Platform, Kubernetes, Docker, git and Jenkins.
Requirements:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environment
Excellent coding skills - in Java & Python
Experience with TensorFlow - a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of strong Computer Science fundamentals: object-oriented design, data structures systems, applications programming and multi threading programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8559383
סגור
שירות זה פתוח ללקוחות VIP בלבד