דרושים » הנדסה » Senior Software Platform Engineer

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
2 ימים
Location: Tel Aviv-Yafo
Job Type: Full Time
The Infrastructure and Developer Platform (IDP) team within the Microsoft Threat Protection (MTP) organization builds and maintains the infrastructure and developer platform that all Defender products rely on, in an unmtachable scale. This platform allows engineers across MTP to more easily deploy their services, lower costs, and increase security and reliability across our fleet.



Responsibilities


Serve as a hands-on Kubernetes subject matter expert, driving architecture, design, and implementation of scalable, secure, and resilient AKS-based solutions.
Design and implement cloud-native security solutions using Azure technologies, with a focus on container runtime protection, policy enforcement, and threat detection.
Own and deliver production-grade services with high availability, reliability, and performance across global AKS deployments.
Develop and maintain CI/CD pipelines, secure build systems, and automated testing frameworks tailored for Kubernetes workloads.
Drive observability and telemetry improvements, including logging, monitoring, alerting, and incident response for services.
Identify and implement innovative approaches to secure Kubernetes workloads at scale, including leveraging AI/ML for anomaly detection.
Contribute to strategic initiatives that shape Microsofts container security roadmap and influence industry best practices.
Mentor junior engineers and contribute to engineering culture through code reviews, design discussions, and knowledge sharing.
Demonstrate ownership and accountability for end-to-end delivery of features and services.
Exhibit growth mindset by continuously learning and adapting to new technologies, threats, and customer needs.
Influence engineering decisions and drive impact beyond the immediate team.
Requirements:
Qualifications
Bachelors or Masters degree in Computer Science, Engineering, or a related technical field.
8+ years of professional software engineering experience, with a strong track record of delivering production-grade distributed systems.
Deep expertise in Kubernetes, including cluster architecture, workload orchestration, and security hardening e.g. RBAC, Workload Identity, Container Runtime Security.
Hands-on experience with containers (Docker) and container orchestration platforms (AKS, EKS, GKE).

Preferred:
Proficiency in one or more programming languages such as Go, C++, C, Python, or Java.
Experience building and operating cloud-native services on Azure or other public cloud platforms.
Strong understanding of site reliability engineering (SRE) principles, including observability, incident response, and automation.
Ability to work effectively with cross-functional teams and manage multiple priorities.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8312276
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
05/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Principal DevOps Engineer in our Platform Engineering team, you will lead the design and implementation of cutting-edge CI/CD pipelines and cloud architecture that powers our development environment. You'll drive initiatives to enhance developer productivity through automation, tooling, and infrastructure improvements, working with a modern tech stack including Kubernetes, Python, cloud-native and high-scale technologies.
Your Impact
Architect and implement scalable, resilient CI/CD pipelines and cloud infrastructure that supports our engineering organization's evolving needs
Design and develop internal developer tools and platforms that significantly improve developer experience and productivity
Drive the evolution of our Kubernetes-based deployment infrastructure in Google Cloud Platform, ensuring security, reliability and performance
Optimize and scale our CI/CD infrastructure including Jenkins, GitLab, TeamCity, and artifact management systems
Mentor and guide other engineers on DevOps best practices, infrastructure design, and implementation strategies
Drive adoption of infrastructure-as-code, automated testing, and deployment methodologies
Collaborate with development teams to understand their needs and implement solutions that accelerate their workflow
Establish standards and best practices for infrastructure reliability, observability, and performance.
Requirements:
7+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering roles
Extensive experience with CI/CD pipeline design and implementation in complex environments
Advanced knowledge of Kubernetes administration, deployment patterns, and ecosystem tools
Strong programming skills in Python with solid understanding of OOP principles and design patterns
Deep understanding of cloud architecture, specifically with Google Cloud Platform services
Proven track record designing and implementing developer tooling and automation
Experience managing containerized applications and services in production environments
Strong system design skills with focus on scalability, reliability, and security
Knowledge of GitOps workflows and infrastructure-as-code using tools like Terraform, Pulumi, or equivalent
Familiarity with GitLab CI administration and pipeline development
participate in an on call rotation for working and non-working hours
Nice-to-Have
Knowledge of observability platforms and practices (Prometheus, Grafana, distributed tracing)
Familiarity with TeamCity administration and pipeline development
Experience implementing security best practices in CI/CD pipelines
Understanding of compliance requirements in software delivery pipelines
Experience with Infrastructure as Code testing frameworks
Knowledge of software architecture patterns and microservices design.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8290390
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Site Reliability Engineer- Infra
Realize your potential by joining the leading performance-driven advertising company!
As a Site Reliability Engineer- infra, on our Infrastructure team at the TLV office, you will play a key role in ensuring the reliability, scalability, and performance of our critical systems. You will be responsible for managing and improving our core infrastructure, with a focus on automation, monitoring, and incident response. You will work with a wide range of technologies, including Kubernetes, monitoring and observability tools, configuration management systems, and core networking services.
How youll make an impact:
As a Site Reliability Engineer, youll bring value by:
Ensure the reliability, availability, and performance of our infrastructure services.
Manage and maintain our Kubernetes infrastructure, including KubeVirt.
Design, implement, and maintain our monitoring and observability stack (SensuGo, VictoriaMetrics, Prometheus, ELK).
Automate infrastructure provisioning, configuration, and deployment processes using Puppet and Ansible.
Manage and maintain core services such as DNS and networking.
Troubleshoot and resolve complex infrastructure issues in a timely and efficient manner.
Participate in on-call rotations and incident response.
Develop and maintain infrastructure-as-code (IaC).
Identify and implement proactive measures to prevent incidents and improve system reliability.
Collaborate with development teams to ensure smooth and reliable deployments.
Contribute to the design and implementation of new infrastructure solutions.
Drive improvements in system architecture, processes, and tools.
Mentor and coach other team members.
Requirements:
5+ years of experience in a Site Reliability Engineering, Systems Engineering, or similar role.
Deep understanding of Site Reliability Engineering principles and practices.
Extensive experience with Kubernetes, including deployment, management, and troubleshooting.
Strong experience with monitoring and observability tools such as SensuGo, Zabbix, VictoriaMetrics, Prometheus, and ELK.
Proficiency in configuration management tools such as Puppet and Ansible.
Solid understanding of Linux internals and networking.
Experience with managing and maintaining core services such as DNS and networking.
Strong programming skills in Python and/or Go.
Experience with both on-premises and cloud environments.
Experience with KubeVirt.
Excellent troubleshooting and problem-solving skills.
Strong communication and collaboration skills.
Ability to work in a fast-paced, dynamic environment.
Ability to participate in on-call rotations including weekends.
Preferred Qualifications:
Experience with large-scale, distributed systems.
Experience with other cloud providers (e.g., AWS, Azure, GCP).
Contributions to open-source projects.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8272676
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
10/08/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Staff Engineer.
What Youll Do:
Lead architecture and system design for critical components of the Developer Experience platform, ensuring scalability, resilience, and long-term maintainability.
Own end-to-end delivery of complex initiatives, from requirements gathering and design to implementation, rollout, and observability.
Design, implement, and maintain robust microservices supporting high-throughput and low-latency operations.
Define and uphold API design standards, including gateway configuration, versioning strategy, and long-term lifecycle management.
Build and optimize backend systems that enable developer-facing products such as SDKs, APIs, and webhooks.
Work with both relational and NoSQL databases to ensure data consistency, scalability, and performance.
Collaborate with cross-functional teams to design systems that meet operational and business requirements.
Research and implement cloud-native architectures to support growth and scalability.
Contribute to the creation of developer tools and standards that improve the usability of our APIs and SDKs.
Requirements:
10+ years of experience in backend development, with a strong focus on scalable infrastructure.
Proficiency in Node.js and TypeScript; additional experience with other backend languages is a plus.
Strong expertise in relational and NoSQL databases, including schema design, query optimization, and troubleshooting.
Experience designing and managing RESTful APIs, including versioning strategies, API gateway integration, and developer-first design.
Proven experience designing and deploying microservices-based architectures in production environments.
Hands-on experience with cloud providers (AWS, GCP, Azure) and container orchestration tools (e.g., Kubernetes, Docker).
Solid understanding of system design principles, distributed systems, and scalability.
Experience with monitoring and logging frameworks (e.g. Datadog, Prometheus, Grafana, ELK stack).
Deep understanding of REST APIs and event-driven architectures.
Advantage - Familiarity with AWS, Servers-less
Strong problem-solving skills, with the ability to troubleshoot production issues effectively.
Ability to manage multiple priorities and thrive in a service-oriented, fast-paced environment.
Bonus Points:
Experience designing developer-centric SDKs, tools, or CLI utilities.
Track record of contributing to internal platform teams or DX-focused initiatives.
Knowledge of OpenAPI/Swagger specifications and API documentation best practices.
Passion for elevating developer experience and usability across engineering platforms.
Hands-on experience in designing developer-friendly SDKs and APIs.
Knowledge of CI/CD pipelines and best practices for automated testing and deployment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8296063
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
24/07/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a Senior Platform Engineer, Observability to join our Observability team. This role offers the opportunity to work at the intersection of software development and platform engineering, contributing to the tools, systems, and practices that improve visibility, reliability, and operational excellence across our engineering organization.

This position is ideally suited for experienced software engineers who are passionate about building high-quality systems and are interested in expanding their expertise in observability, distributed systems, and developer experience. You will help design, build and maintain systems that empower engineers across us to monitor, understand, and troubleshoot their services more effectively.

Our observability team is responsible for delivering scalable and user-friendly solutions to over 150 engineers working across more than 20 teams. Were focused on enabling rapid incident detection and resolution, improving our reliability posture, and supporting a culture of continuous improvement.

What you'll be doing:
Design, build, and maintain observability tools and infrastructure that help our engineers provide actionable insights into the performance and reliability of our systems.
Collaborate with other engineers and teams to enhance the developer experience around monitoring, logging, alerting, and tracing.
Develop and evolve our internal tooling to simplify the process of instrumenting and observing services.
Partner with engineering teams to improve incident response and recovery workflows, and ensure systems meet internal SLOs/SLAs and reliability targets.
Support the migration from our legacy ELK stack to a modern observability platform using Prometheus, Mimir, Grafana, Honeycomb, Loki, Quickwit, and OpenTelemetry.
Contribute to knowledge sharing and the ongoing development of best practices in observability across the organisation.
Requirements:
What you'll need:
4+ years of professional experience as a software engineer, with a strong foundation in building and maintaining production systems.
Proficiency in one or more modern programming languages such as Python, Java, JavaScript, or Ruby.
Familiarity with Kubernetes, AWS, and infrastructure-as-code tools such as Terraform.
Experience working with observability tools and platforms (e.g. Prometheus, Grafana, ELK, Honeycomb, Loki, or similar).
A strong interest in developer experience and platform tooling, with the ability to empathise with engineering teams as internal customers.
Excellent communication skills, with the ability to collaborate effectively across teams and explain complex technical concepts clearly.
A proactive mindset focused on long-term impact, sustainable engineering practices, and continuous improvement.

Preferred Qualifications:
Experience with OpenTelemetry or distributed tracing systems.
Understanding of observability-driven development and service reliability principles (e.g. SRE, MTTR, SLIs/SLOs).
Experience optimising observability systems for cost and performance at scale.
Knowledge of microservices architectures and how to monitor and debug distributed systems.
Contributions to open-source projects in the observability or monitoring space
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8274690
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
2 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Were looking for a Senior DevOps Engineer to join our newly formed Foundations Teama small, high-impact group responsible for the infrastructure, tools, and shared services that power our entire R&D organization.
In this role, youll design, build, and evolve internal platform infrastructure, CI/CD systems, and developer enablement tooling. Your mission is to empower developers across the company to work autonomously, by creating self-service tools, automation, and clear standards that reduce friction and increase reliability.
Youll collaborate closely with engineers across disciplines and partner with the Foundations Team Lead to shape DevOps practices that scale. This is a hands-on role for someone who thrives in high-velocity, mission-critical environments and is passionate about building tools that make developers faster, more productive, and confident in running their own services.
What Youll Do
Design and maintain scalable, developer-friendly CI/CD pipelines and deployment workflows.
Build self-service tooling and automation that enables teams to manage deployments, environments, secrets, and observability independently
Be responsible for cloud infrastructure and operations foundations
Implement and promote best practices for monitoring, logging, and alerting across services.
Operate and optimize Kubernetes-based production environments, ensuring performance, security, and stability.
Manage infrastructure using Infrastructure as Code (IaC) and ensure repeatability and traceability through tools like Terraform.
Collaborate with R&D teams to support onboarding to internal tooling and promote a culture of enablement over dependency.
Monitor cloud cost, ensuring our cloud operates efficiently.
Requirements:
4+ years of hands-on experience in DevOps or infrastructure engineering, ideally in high-velocity, mission-critical production environments.
Deep expertise in Kubernetes and containerized infrastructure, with experience deploying and managing workloads at scale.
Strong understanding of cloud infrastructure and operations, including networking, storage, compute, and securityGCP experience preferred.
Proficiency with Infrastructure as Code tools, especially Terraform, with a focus on automation and operational excellence.
Experience developing and managing CI/CD processes and tools, with a passion for improving developer workflows and release quality.
Strong debugging and problem-solving skills, with the ability to troubleshoot complex systems across the stack.
Highly self-motivated and organized, able to work independently in a fast-paced, collaborative environment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8311657
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
15/07/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
our company's Infinity External Risk Management, otherwise known as Cyberint, continuously reduces external cyber risk by managing and mitigating an array of digital threats with a unified solution.
At Cyberint, we help organizations protect their digital presence by delivering cutting-edge Attack Surface Management (ASM) and Threat Intelligence (TI) solutions. As a member of our R&D organization, youll play a key role in ensuring the scalability, reliability, and performance of our cloud-native SaaS platform operating at scale.
Key Responsibilities
As a DevOps Engineer, you will be a core member of our DevOps & Infrastructure team, focused on building and maintaining distributed, scalable, and highly available systems in a dynamic SaaS environment. You will collaborate closely with development, QA, and support teams to enhance automation, improve CI/CD pipelines, and drive operational excellence across the board.
Key Responsibilities:
Design, build, and maintain infrastructure in a modern cloud-native SaaS ecosystem (primarily AWS).
Contribute to the scalability and reliability of distributed systems supporting high-volume data processing and real-time operations.
Develop and enhance CI/CD pipelines to support rapid and reliable deployments across multiple environments.
Implement and manage Infrastructure as Code (IaC) using Terraform for consistent, scalable infrastructure.
Operate and optimize Kubernetes (EKS) clusters to support distributed microservices architectures.
Monitor and respond to system alerts, troubleshoot issues, and contribute to incident prevention and response strategies.
Build self-service tools and automation frameworks to empower R&D teams and enhance delivery velocity.
Work cross-functionally with developers, QA, and support to ensure infrastructure meets evolving product needs.
Write and maintain scripts (Python, Bash) to automate recurring tasks and streamline operations.
Continuously identify and execute improvements in system performance, availability, and cost-efficiency.
Requirements:
Experience:
25 years of experience in DevOps, SRE, or infrastructure engineering roles, working with distributed systems and SaaS applications.
Hands-on experience with public cloud providers (AWS strongly preferred).
Production experience with tools such as Kubernetes, Terraform, CI/CD platforms (Jenkins, ArgoCD), and monitoring systems (Prometheus, Grafana).
Skills:
Solid grasp of Infrastructure as Code principles and best practices.
Strong knowledge of distributed systems, microservices, and orchestration technologies.
Proficiency in scripting (Python, Bash) for automation and tooling.
Familiarity with logging and monitoring stacks (e.g., Elasticsearch, Redis, CloudWatch, Grafana, Prometheus).
Awareness of DevOps security practices and cloud cost optimization strategies.
Mindset & Traits:
A strong sense of ownership and accountability for system health and performance.
Passion for automation, self-service, and continuous improvement.
Excellent communication and collaboration skills.
Comfortable working in fast-paced SaaS environments with cross-functional teams.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8259928
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
14/07/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are looking for an experienced Technology Architect / Technical Leader to drive the design, scalability, and evolution of our Harmony SaaS security platform. You will play a pivotal role in shaping our SaaS cloud security architecture and capabilities to protect organizations from emerging threats in real-time. This is a strategic, cross-functional, and hands-on position, suitable for a builder who thrives on ownership and technical depth.
Key Responsibilities
Architect and drive end-to-end design of large-scale, secure, and high-performance SaaS systems for cloud security, detection, and response.
Lead cross-functional projects across data, detection, and platform teams to deliver end-to-end features and capabilities.
Collaborate closely with Product Managers, Engineering Managers, and UX Designers to translate business requirements into scalable solutions.
Mentor engineers and guide the technology roadmap for the Harmony SaaS product.
Ensure operational excellence and full lifecycle ownership: design, implementation, deployment, observability, and continuous improvement.
Contribute to codebases where needed to unlock complex or foundational technical challenges.
Set architectural and technical standards for data pipelines, microservices, and cloud infrastructure.
Bring technical leadership and innovation to areas such as:
High-scale detection engines and analytics
Cloud-native threat protection
Real-time event streaming and processing
Unified SaaS security policies and visibility.
Requirements:
10+ years of experience in software engineering, including significant hands-on coding in Python and/or Golang.
5+ years in technical leadership or architecture roles in cloud security or large-scale SaaS environments.
Experience with cloud platforms
Deep understanding of data infrastructure, real-time processing pipelines (Kafka, streaming), and microservices.
Proven experience leading cross-team projects and aligning engineering execution with strategic goals.
Familiarity with Terraform, Docker, Kubernetes, and modern DevOps practices.
Strong belief in you build it, you own it with operational accountability.
Passion for working hands-on and mentoring teams.
Advantages
Prior experience working in a security-focused company
Exposure to cloud detection & response mechanisms and incident workflows.
Experience with hybrid data systems (SQL/NoSQL/Redis/SingleStore).
Background in working in Agile squads with Product/UX collaboration.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8258415
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Design, develop, and maintain complex, scalable, and high-performance systems with a focus on writing clean, efficient, and maintainable code.

Collaborate with cross-functional teams, including product managers and architects, to implement technical solutions aligned with business objectives.

Provide input on architectural design and participate in technical planning to ensure long-term maintainability and compliance with standards.

Identify and troubleshoot production issues, conducting root cause analysis and implementing fixes to ensure system reliability.

Contribute to code reviews and technical discussions, sharing knowledge and fostering a culture of collaboration and continuous improvement.
Requirements:
BSc in Computer Science or a related degree from a recognized institution, or a strong track record in server-side development with advanced technical skills.

6+ years of experience in software engineering with a demonstrated ability to work on large-scale projects and solve complex technical problems.

Proficiency in one or more programming languages such as C/C++, C#, Java, Go, or Python, with the ability to adapt to new tools and technologies.

Solid experience building scalable, distributed systems with an understanding of microservices architecture and API design.

Experience with cloud platforms such as AWS, Azure, or GCP, with knowledge of best practices for deploying and maintaining cloud-based services.

Strong problem-solving skills, with experience debugging and resolving production issues in complex systems.

Ability to prioritize tasks, manage workload efficiently, and contribute to the technical growth of the team.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8268731
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Staff MLOps Engineer
Apply
Realize your potential by joining the leading performance-driven advertising company!
As a Staff MLOps Engineer on the Infra group, youll play a vital role in develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How youll make an impact:
As a Staff MLOps Engineer Engineer, youll bring value by:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Influence directly on the way billions of people discover the internet
Our tech stack:
Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, ElasticSearch, AirFlow, BigQuery, Google Cloud Platform, Kubernetes, Docker, git and Jenkins.
Requirements:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environment
Excellent coding skills in Java & Python
Experience with TensorFlow a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of strong Computer Science fundamentals: object-oriented design, data structures systems, applications programming and multi threading programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8272669
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Senior Algo Data Engineer
Realize your potential by joining the leading performance-driven advertising company!
As a Senior Algo Data Engineer on the Infra group, youll play a vital role in develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools.
About Algo platform:
The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.
The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.
The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.
How youll make an impact:
As a Senior Algo Data Engineer, youll bring value by:
Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments
Influence directly on the way billions of people discover the internet
Our tech stack:
Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, ElasticSearch, AirFlow, BigQuery, Google Cloud Platform, Kubernetes, Docker, git and Jenkins.
Requirements:
To thrive in this role, youll need:
Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 5+ years experience.
Passion for ML engineering and for creating and improving platforms
Experience with designing and supporting ML pipelines and models in production environment
Excellent coding skills in Java & Python
Experience with TensorFlow a big plus
Possess strong problem solving and critical thinking skills
BSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
Deep understanding of strong Computer Science fundamentals: object-oriented design, data structures systems, applications programming and multi threading programming
Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.
Bonus points if you have:
Experience in leading Algorithms projects or teams.
Experience in developing models using deep learning techniques and tools
Experience in developing software within a distributed computation framework.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8274042
סגור
שירות זה פתוח ללקוחות VIP בלבד