דרושים » תוכנה » Senior DevOps Engineer

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 3 שעות
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior DevOps Engineer to join our R&D team in developing the next rising product in the health tech landscape. If you are looking for a challenging, influential position and are passionate about making an impact, this might be the role for you.
As a Senior DevOps Engineer , youll play a key role in the design, development, testing, deployment, and monitoring of our infrastructure and products. In this position, you'll make significant contributions to our observability stack, helping build and maintain robust systems for logs, metrics, traces, and alerting.
Our ideal candidate is passionate about DevOps and observability, has strong communication skills, and thrives on constant improvement for both technology and processes. If you enjoy working on multiple projects in parallel and are a proactive team player, youll fit right in.
This is a unique opportunity to join the core team of a fast-growing startup, where your contributions will have a direct impact on our product and success.
Responsibilities:
Support and collaborate with cross-functional engineering teams using cutting-edge technologies.
Contribute to the design, implementation, and maintenance of monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, Loki)
Secure, scale, and manage our cloud environments (AWS and GCP)
Design and implement automation solutions for both development and production
Manage and improve our CI/CD pipelines for fast and safe delivery
Lead best practices in infrastructure, observability, configuration management, and system hardening
Continuously assess and improve existing infrastructure in line with industry standards
Requirements:
BSc in Computer Science, Engineering, or equivalent experience
5+ years of experience as a DevOps Engineer or similar software engineering role
Proven experience with Docker and Kubernetes (EKS preferred)
Hands-on experience with monitoring and observability tools, including Prometheus, Grafana, Datadog, or similar.
Expertise in Terraform for AWS infrastructure-as-code deployments
Strong collaboration and interpersonal communication skills
Excellent analytical thinking and problem-solving mindset
Proficiency with relational databases
Solid knowledge of Python and Bash scripting
Experience with test automation an advantage
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8320472
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Required Site Reliability Engineer- Infra
Realize your potential by joining the leading performance-driven advertising company!
As a Site Reliability Engineer- infra, on our Infrastructure team at the TLV office, you will play a key role in ensuring the reliability, scalability, and performance of our critical systems. You will be responsible for managing and improving our core infrastructure, with a focus on automation, monitoring, and incident response. You will work with a wide range of technologies, including Kubernetes, monitoring and observability tools, configuration management systems, and core networking services.
How youll make an impact:
As a Site Reliability Engineer, youll bring value by:
Ensure the reliability, availability, and performance of our infrastructure services.
Manage and maintain our Kubernetes infrastructure, including KubeVirt.
Design, implement, and maintain our monitoring and observability stack (SensuGo, VictoriaMetrics, Prometheus, ELK).
Automate infrastructure provisioning, configuration, and deployment processes using Puppet and Ansible.
Manage and maintain core services such as DNS and networking.
Troubleshoot and resolve complex infrastructure issues in a timely and efficient manner.
Participate in on-call rotations and incident response.
Develop and maintain infrastructure-as-code (IaC).
Identify and implement proactive measures to prevent incidents and improve system reliability.
Collaborate with development teams to ensure smooth and reliable deployments.
Contribute to the design and implementation of new infrastructure solutions.
Drive improvements in system architecture, processes, and tools.
Mentor and coach other team members.
Requirements:
5+ years of experience in a Site Reliability Engineering, Systems Engineering, or similar role.
Deep understanding of Site Reliability Engineering principles and practices.
Extensive experience with Kubernetes, including deployment, management, and troubleshooting.
Strong experience with monitoring and observability tools such as SensuGo, Zabbix, VictoriaMetrics, Prometheus, and ELK.
Proficiency in configuration management tools such as Puppet and Ansible.
Solid understanding of Linux internals and networking.
Experience with managing and maintaining core services such as DNS and networking.
Strong programming skills in Python and/or Go.
Experience with both on-premises and cloud environments.
Experience with KubeVirt.
Excellent troubleshooting and problem-solving skills.
Strong communication and collaboration skills.
Ability to work in a fast-paced, dynamic environment.
Ability to participate in on-call rotations including weekends.
Preferred Qualifications:
Experience with large-scale, distributed systems.
Experience with other cloud providers (e.g., AWS, Azure, GCP).
Contributions to open-source projects.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8272676
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
6 ימים
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Were looking for a Senior DevOps Engineer to join our newly formed Foundations Teama small, high-impact group responsible for the infrastructure, tools, and shared services that power our entire R&D organization.
In this role, youll design, build, and evolve internal platform infrastructure, CI/CD systems, and developer enablement tooling. Your mission is to empower developers across the company to work autonomously, by creating self-service tools, automation, and clear standards that reduce friction and increase reliability.
Youll collaborate closely with engineers across disciplines and partner with the Foundations Team Lead to shape DevOps practices that scale. This is a hands-on role for someone who thrives in high-velocity, mission-critical environments and is passionate about building tools that make developers faster, more productive, and confident in running their own services.
What Youll Do
Design and maintain scalable, developer-friendly CI/CD pipelines and deployment workflows.
Build self-service tooling and automation that enables teams to manage deployments, environments, secrets, and observability independently
Be responsible for cloud infrastructure and operations foundations
Implement and promote best practices for monitoring, logging, and alerting across services.
Operate and optimize Kubernetes-based production environments, ensuring performance, security, and stability.
Manage infrastructure using Infrastructure as Code (IaC) and ensure repeatability and traceability through tools like Terraform.
Collaborate with R&D teams to support onboarding to internal tooling and promote a culture of enablement over dependency.
Monitor cloud cost, ensuring our cloud operates efficiently.
Requirements:
4+ years of hands-on experience in DevOps or infrastructure engineering, ideally in high-velocity, mission-critical production environments.
Deep expertise in Kubernetes and containerized infrastructure, with experience deploying and managing workloads at scale.
Strong understanding of cloud infrastructure and operations, including networking, storage, compute, and securityGCP experience preferred.
Proficiency with Infrastructure as Code tools, especially Terraform, with a focus on automation and operational excellence.
Experience developing and managing CI/CD processes and tools, with a passion for improving developer workflows and release quality.
Strong debugging and problem-solving skills, with the ability to troubleshoot complex systems across the stack.
Highly self-motivated and organized, able to work independently in a fast-paced, collaborative environment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8311657
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
24/07/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a Senior Platform Engineer, Observability to join our Observability team. This role offers the opportunity to work at the intersection of software development and platform engineering, contributing to the tools, systems, and practices that improve visibility, reliability, and operational excellence across our engineering organization.

This position is ideally suited for experienced software engineers who are passionate about building high-quality systems and are interested in expanding their expertise in observability, distributed systems, and developer experience. You will help design, build and maintain systems that empower engineers across us to monitor, understand, and troubleshoot their services more effectively.

Our observability team is responsible for delivering scalable and user-friendly solutions to over 150 engineers working across more than 20 teams. Were focused on enabling rapid incident detection and resolution, improving our reliability posture, and supporting a culture of continuous improvement.

What you'll be doing:
Design, build, and maintain observability tools and infrastructure that help our engineers provide actionable insights into the performance and reliability of our systems.
Collaborate with other engineers and teams to enhance the developer experience around monitoring, logging, alerting, and tracing.
Develop and evolve our internal tooling to simplify the process of instrumenting and observing services.
Partner with engineering teams to improve incident response and recovery workflows, and ensure systems meet internal SLOs/SLAs and reliability targets.
Support the migration from our legacy ELK stack to a modern observability platform using Prometheus, Mimir, Grafana, Honeycomb, Loki, Quickwit, and OpenTelemetry.
Contribute to knowledge sharing and the ongoing development of best practices in observability across the organisation.
Requirements:
What you'll need:
4+ years of professional experience as a software engineer, with a strong foundation in building and maintaining production systems.
Proficiency in one or more modern programming languages such as Python, Java, JavaScript, or Ruby.
Familiarity with Kubernetes, AWS, and infrastructure-as-code tools such as Terraform.
Experience working with observability tools and platforms (e.g. Prometheus, Grafana, ELK, Honeycomb, Loki, or similar).
A strong interest in developer experience and platform tooling, with the ability to empathise with engineering teams as internal customers.
Excellent communication skills, with the ability to collaborate effectively across teams and explain complex technical concepts clearly.
A proactive mindset focused on long-term impact, sustainable engineering practices, and continuous improvement.

Preferred Qualifications:
Experience with OpenTelemetry or distributed tracing systems.
Understanding of observability-driven development and service reliability principles (e.g. SRE, MTTR, SLIs/SLOs).
Experience optimising observability systems for cost and performance at scale.
Knowledge of microservices architectures and how to monitor and debug distributed systems.
Contributions to open-source projects in the observability or monitoring space
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8274690
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
20/07/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
Were looking for a Platform Engineer (DevOps) to join our Core R&Ds Platform Engineering team.
We believe that our development platform is not just a means to an end but a product in its own right. As a Platform Engineer, you will be at the forefront of crafting, enhancing, and optimizing this critical foundation, to support our Core R&Ds innovation and growth.
From enabling our product teams, to maintaining our build system, to owning both our internal and open-source infrastructure, this team plays a central role in our business operations.
Why would you love this job?
As a Platform Engineer, you will play a critical role in shaping and optimizing our development platform, ensuring a seamless developer experience, fostering a culture of efficiency and innovation. You will have a large impact on how we deliver and provides solutions that will impact Databases for thousands of customers and some of the worlds largest companies.
What will you do:
Collaborate with software engineering teams to optimize development pipelines and improve the overall developer experience.
Actively champion and drive innovative solutions and processes within the platform engineering team.
Own and continuously improve the build and CI systems of our Enterprise.
Continuously improve the security posture.
Contribute to the design and architecture of distributed systems, ensuring scalability, security, and maintainability.
Optimize alerting and monitoring systems to maintain high availability and quick response times.
Requirements:
At least 5 or more years of experience working on infrastructure/devops/cloud related domains.
Experience with a high level programming language.
Experience with cloud infrastructure (AWS / GCP / Azure).
Experience with containerization technologies and concepts (Docker, or equivalent technologies).
Experience using and maintaining CI/CD tools (Github actions, Jenkins, or equivalent technologies).
Experience with infrastructure as code tools (Terraform, Pulumi, or equivalent technologies) and configuration management (Ansible, Chef, or equivalent technologies).
Extra great if you have:
B.S. in Computer Science, Software Engineering or a related field.
Experience with build systems (CMake, make, etc.).
Experience with container orchestration (Docker, Kubernetes, GKE/EKS/AKS, Docker Swarm, etc.) and related technologies (Helm, kustomize, ArgoCD, etc.).
Experience with alerting and monitoring systems (Prometheus, ELK, Splunk, etc.).
​Experience working with large-scale distributed systems.
Experience with NoSQL databases.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8265766
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
05/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior DevOps Engineer on our Production Engineering team, you will be at the forefront of ensuring the stability, scalability, and performance of our production systems. Youll be responsible for the health of large-scale cloud environments, investigating incidents, driving root cause analysis, and implementing long-term solutions that improve system reliability. Youll also own and continuously improve the production release process, ensuring deployments are safe, automated, and well-orchestrated. Youll collaborate closely with engineering, platform, and SRE teams to ensure world-class operational excellence for our customer-facing services.
Your Impact
Own the end-to-end release process: plan, coordinate, and execute deployments across environments with a strong focus on safety, reliability, and automation
Ensure stability and performance of all production systems, maintaining high availability through proactive monitoring and incident management
Investigate and resolve complex production issues, driving post-incident reviews and implementing long-term fixes
Respond to critical incidents and customer escalations with a calm, structured approach and clear communication
Define and uphold best practices for change management, observability, and system reliability
Manage infrastructure-as-code using Terraform for scalable cloud deployments
Improve monitoring, alerting, and recovery mechanisms to detect and resolve issues faster
Automate repetitive operational tasks through scripting and tooling
Collaborate with development teams to ensure smooth delivery and stable operation of new features
Participate in an on-call rotation to support production systems.
Requirements:
5+ years of experience supporting large-scale production systems in cloud environments
Strong hands-on experience with Linux systems and networking fundamentals
Solid experience with cloud platforms (GCP preferred)
Hands-on experience running large-scale Kubernetes in production
Expertise with Terraform and infrastructure-as-code principles
Strong scripting skills (e.g., Python, Bash) to build automation and tooling
Proven experience owning or contributing to release and deployment processes
Familiarity with observability tools like Prometheus, Grafana, or similar
Ability to lead incident investigations and drive root cause resolution
Excellent communication and collaboration skills.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8290783
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time and English Speakers
We are growing and are looking for a Senior DevOps Engineer
who value personal and career growth, team-work, and winning!
What your day will look like:
This role is critical in supporting our companys short and long-term SaaS product expansion plans. Youll join an expanding group where your contributions can make a strong impact.
You will have the chance to design and build our customer and internal production systems using up-to-date technologies and tools.
As a DevOps Eng, you will have a chance to play a key role in all our mission critical services, working with multiple teams and groups and having a very wide view of how all of this comes together, from the request to the design, up to the build , deploy and whole lifecycle of what we build and manage in our cloud .
You will collaborate with our R&D, Security, and Corporate IT teams to deliver safe, scalable, and high-performing solutions. Members of the SaaS DevOps team act as trusted technical architects, and play a key role in determining the future of how builds and delivers its cybersecurity asset management services.
Responsibilities:
Evaluate , Design & Implement new cloud infrastructure technologies & Architecture to support our ever growing SaaS solution
Build a SaaS product to serve thousands of global customers in a modern and scalable way
Design, deploy, and operate cloud infrastructure and services for various internal (corporate) applications on behalf of other teams
Requirements:
5+ years of experience managing production environments
2+ years of experience managing cloud-based environments
Experience with AWS
At least 4 years of experience with Linux based OS administration
Extensive experience with modern DevOps tooling, including configuration management and Infrastructure as Code
Professional software development experience with any language (eg Python, Ruby, GoLang, Javascript)
Proven experience and understanding of architecture principles across infrastructure platforms, security, data, integration, and application layers
Experience deploying and operating services based on Linux containers and virtualization (Docker, etc.)
Monitoring and operational metrics gathering (e.g., CloudWatch, Prometheus, Grafana, Datadog, etc)
Building and managing infrastructure that requires high availability and high security standards
Strong written and verbal communication skills in English and Hebrew
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8313495
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
05/08/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a motivated Senior DevOps Engineer to join our Cortex Devops Production group in our Tel Aviv R&D center. The group is responsible for the reliability and availability of the production environment hosting Cortex XDR and the enablement of the entire XDR RnD group using CI tools, infrastructure, and automation.
In this role you will a part of a DevOps group that is responsible for planning, executing, and reporting the various infrastructure and code projects, as well as managing and executing high-pressure production maintenance work and issues
More information about the Cortex-XDR product can be found here.
Your Impact
You will take full end-to-end responsibility for the production environment of our SaaS product deployed on GCP
You will build tools for the automatic remediation of known issues
You will develop Infrastructure-as-code which will be used to orchestrate production and dev environments
You will design, build, maintain, and scale production services with thousands of Kubernetes clusters
You will secure the production environments and add in new security tools and features both internal our company's and other market-leading technologies
You will work closely with development teams to design and enhance software architecture to improve scalability, service reliability, cost, and performance
You will build CI pipelines and automation processes
Participate in the on-call rotation supporting the applications and infrastructure
You will research cutting-edge technologies and deploy them to production.
Requirements:
4+ years as DevOps Engineer (or equal role) with a passion for technology and strong motivation and responsibility for high reliability and service level
Proficiency with code language (Python / Go - preferred)
High proficiency with Linux
Proficiency in the cloud (GCP - preferred)
Proficiency with Terraform and HashiCorp tools
High proficiency with virtualized and containerized environments (Kubernetes and Docker)
Proficiency with CI/CD and Configuration Management (Jenkins preferred)
Proficiency with DB such as Cassandra, ScyllaDB, MemSQL, MySQL - An advantage
Experience with working with internal and external customers and stakeholders
Managing a high-scale production environment
Excellent communication and interpersonal skills, ability to work and coordinate between multiple teams
Ability to grasp new technologies quickly and prioritize and multitask on multiple responsibilities.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8290432
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
31/07/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Fiverr is looking for an experienced DevOps Engineer, who will work closely with the developers teams, design and implement improved development processes and tools. Team up with the DevOps team to design and implement scalable systems that will keep Fiverr running smoothly and support our significant business growth. You will join an innovative, high-performance team and work with cutting-edge technologies in a dynamic and agile environment. Fiverr’s Technology Stack sample: AWS, Kubernetes, Terragrunt, Ansible, Jenkins, ArgoCD, Service Mesh, Kong & Nginx, CloudFlare, Hashicorp Vault/Consul, Kafka, RabbitMQ, Prometheus, Grafana, VictoriaMetrics Programming languages: Python, NodeJS, Go, Kotlin

What am I going to do?:

* Maintain and build a large-scale, highly available cloud infrastructure focusing on K8S.
* Improve resiliency and cost efficiency of our cloud infrastructure.
* Automate tasks and error-handling scenarios.
* Develop and adopt new tools to make Development and Operations processes at Fiverr more efficient.
* Collaborate with developers to optimize service performance, reliability, and scale.
* Evolve and maintain Fiverr’s AWS infrastructure by improving and adopting new services.
* Maintain Fiverr availability by participating in DevOps on-call shifts.
* Mentor DevOps engineers.

Equal opportunities:
At Fiverr, we’re not about checklists. If you don’t meet 100% of the requirements for this role but still feel passionate about the position and think you have the right skills and qualifications to excel at it, we want to hear from you. At Fiverr, we prioritize diversity. We celebrate difference and embed it into every aspect of our workplace and product, as well as our community. Fiverr is proud and committed to providing equal opportunity employment to all individuals regardless of race, color, religion, sex, sexual orientation, citizenship, national origin, disability, Veteran status, or any other characteristic protected by law. In addition, Fiverr will provide accommodation to individuals with disabilities or a special need.
Requirements:
* 5+ years of experience as DevOps
* Working in a Linux environment
* Writing scripts in Python
* Production experience with AWS & Kubernetes.
* 2+ years of experience with CI/CD processes.
* Good knowledge of networking concepts (Load Balancers, DNS, VPC)
* Experience in designing and maintaining high-availability solutions for large-scale
* Experience with monitoring tools and log analytics (Grafana, Prometheus, Graphite)
* Experience with IaC tools (Terraform, Terragrunt - advantage )
* Development experience - Advantage
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8283381
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
21/07/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
we are seeking a Site Reliability Engineer who excels at bridging the gap between infrastructure and development. In this role, you will work closely with engineering teams to ensure the reliability, scalability, and performance of our systems. A strong emphasis will be placed on observability - designing and implementing effective monitoring, logging, tracing and alerting solutions to provide deep visibility into system behavior. You should be comfortable collaborating with developers, presenting technical insights, and helping shape best practices. Your responsibilities will include incident management, automation and improvement of our observability solutions, and continuous performance tuning to ensure our platform can scale and evolve with our business needs.

Role:
Ensure production systems meet or exceed established SLAs and SLOs by actively maintaining and enhancing system performance and uptime.
Design and maintain end-to-end observability systemsincluding monitoring, logging, and distributed tracingto detect anomalies and enable proactive issue resolution.
Work closely with engineering teams to improve how their applications are monitored and alerted on. Help define meaningful alerts, reduce noise, and ensure developers are accountable for the operational health of their services.
Optimize application performance on Kubernetes through resource tuning, scaling strategies, and deep performance analysis.
* Provide guidance on reliability-first design, instrumenting code for observability, and using Grafana dashboards to drive decision-making and incident response.
Requirements:
5+ years in SRE, DevOps, or Production Engineering roles
Deep expertise in AWS, Kubernetes, Linux
Being responsible of deploying and tuning monitoring tools like Prometheus, Thanos and any time-series databases for storing metrics.
Logging responsibilities with ELK stack, Loki, Grafana or any alternatives.
Experience with tracing opentelemetry, tempo, jaeger
Strong understanding of incident management processes and best practices.
Experience with automation tools and practices for deployment and infrastructure management.
Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
Ownership mindset, proactive and reliable
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8268431
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
10/08/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Staff Engineer.
What Youll Do:
Lead architecture and system design for critical components of the Developer Experience platform, ensuring scalability, resilience, and long-term maintainability.
Own end-to-end delivery of complex initiatives, from requirements gathering and design to implementation, rollout, and observability.
Design, implement, and maintain robust microservices supporting high-throughput and low-latency operations.
Define and uphold API design standards, including gateway configuration, versioning strategy, and long-term lifecycle management.
Build and optimize backend systems that enable developer-facing products such as SDKs, APIs, and webhooks.
Work with both relational and NoSQL databases to ensure data consistency, scalability, and performance.
Collaborate with cross-functional teams to design systems that meet operational and business requirements.
Research and implement cloud-native architectures to support growth and scalability.
Contribute to the creation of developer tools and standards that improve the usability of our APIs and SDKs.
Requirements:
10+ years of experience in backend development, with a strong focus on scalable infrastructure.
Proficiency in Node.js and TypeScript; additional experience with other backend languages is a plus.
Strong expertise in relational and NoSQL databases, including schema design, query optimization, and troubleshooting.
Experience designing and managing RESTful APIs, including versioning strategies, API gateway integration, and developer-first design.
Proven experience designing and deploying microservices-based architectures in production environments.
Hands-on experience with cloud providers (AWS, GCP, Azure) and container orchestration tools (e.g., Kubernetes, Docker).
Solid understanding of system design principles, distributed systems, and scalability.
Experience with monitoring and logging frameworks (e.g. Datadog, Prometheus, Grafana, ELK stack).
Deep understanding of REST APIs and event-driven architectures.
Advantage - Familiarity with AWS, Servers-less
Strong problem-solving skills, with the ability to troubleshoot production issues effectively.
Ability to manage multiple priorities and thrive in a service-oriented, fast-paced environment.
Bonus Points:
Experience designing developer-centric SDKs, tools, or CLI utilities.
Track record of contributing to internal platform teams or DX-focused initiatives.
Knowledge of OpenAPI/Swagger specifications and API documentation best practices.
Passion for elevating developer experience and usability across engineering platforms.
Hands-on experience in designing developer-friendly SDKs and APIs.
Knowledge of CI/CD pipelines and best practices for automated testing and deployment.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8296063
סגור
שירות זה פתוח ללקוחות VIP בלבד