Backend & DevOps Technical Lead

עדכון קורות החיים לפני שליחה

8125381

שירות זה פתוח ללקוחות VIP בלבד

משרות דומות שיכולות לעניין אותך

דיווח על תוכן לא הולם או מפלה

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

02/04/2025

Engineering Backend Tech Lead (SRE team)

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

We are seeking an experienced and motivated Engineering Backend Tech Lead to join our dynamic Site Reliability Engineering (SRE) team. As an Engineering Backend Tech Lead you will play a crucial role in enhancing the reliability, performance, and scalability of our systems and services. You will be a part of a global commando team of highly skilled SREs, driving best practices and innovations for optimal system operations, while protecting critical companies systems in a real time.
In this role, you will be responsible for:
Drive incident response and post-mortem processes, fostering a culture of continuous improvement.
Design, build and improve internal tools and automation software to make maintaining production services easier and safer.
Lead reliability-focused practices such as SLO (Service Level Objective) design and implementation, Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others.
Participate in the on-call rotation, providing expertise and support during critical system incidents and ensuring timely resolution.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8125295

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

02/04/2025

SRE Tech Lead

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time and Hybrid work

We are seeking an experienced and motivated SRE Tech Lead to join our dynamic Site Reliability Engineering (SRE) team. As a Tech Lead you will play a crucial role in enhancing the reliability, performance, and scalability of our systems and services. You will be a part of a global commando team of highly skilled SREs, driving best practices and innovations for optimal system operations, while protecting critical companies systems in a real time.
In this role, you will be responsible for:
Drive incident response and post-mortem processes, fostering a culture of continuous improvement.
Design, build and improve internal tools and automation software to make maintaining production services easier and safer.
Lead reliability-focused practices such as SLO (Service Level Objective) design and implementation, Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others.
Participate in the on-call rotation, providing expertise and support during critical system incidents and ensuring timely resolution.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8125103

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

23/04/2025

Senior Software Team Manager - Core Mission-Critical Backend Systems

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time and Hybrid work

We are seeking a Senior Team Manager to lead the team responsible for the core of our mission-critical communications platform. This platform handles real-time communications with 99.999% availability requirements, supporting emergency services globally. The ideal candidate will combine deep software engineering expertise and architectural skills with outstanding management capabilities to drive the development and maintenance of large-scale, high-availability systems.

This role requires extensive experience in architecting and scaling mission-critical systems in cloud environments, following best practices for high availability and reliability. The candidate must also have proven experience in managing complex environments, including coordinating with technical support and professional services teams across different time zones, particularly in the U.S. Experience with real-time communications (VoIP or WebRTC) is a significant advantage.

Reporting to: Director of Core Engineering Group.

Heres What Youll Be Doing:
1. Software Engineering Leadership:
Lead the architecture, design, and implementation of our core communications platform, ensuring scalability, high availability (99.999%), and fault tolerance.
Drive the adoption of software engineering best practices such as service resiliency, failover mechanisms, load balancing, and distributed systems design.
Oversee the development of a mix of legacy components and modern microservices using various programming languages and frameworks.
Provide technical leadership in architectural decisions, focusing on performance optimization, security, and reliability.
2. Strategic System Architecture:
Design and implement systems for growth and scale, ensuring they can handle increasing loads while maintaining strict availability and performance standards.
Establish and enforce best practices for monitoring, alerting, and incident response to minimize downtime and ensure rapid issue resolution.
Continuously evaluate and integrate emerging technologies and cloud-native architectures to future-proof the core platform.
3. Team Leadership & Management
Lead, mentor, and manage a team of engineers distributed across multiple regions, including Israel, Europe, and other international locations.
Establish clear goals, KPIs, and growth paths for team members, focusing on both individual development and team performance.
Foster a collaborative, direct, and informal communication culture aligned with our values.

Requirements:
1. Experience & Skills:
12+ years of software engineering experience, with at least 5 years in a management role leading engineering teams.
Proven track record in designing, architecting, and scaling mission-critical systems in cloud environments (AWS, GCP, or Azure).
Extensive experience in implementing high availability best practices such as:
- Distributed systems design and microservices architecture.

- Automated failover, load balancing, and disaster recovery strategies.

- Real-time monitoring and alerting systems.

Strong technical background in cloud-native architectures and a mix of programming languages commonly used for high-performance backend systems.
Experience with real-time communications platforms (VoIP, WebRTC, or similar) is a significant advantage.
Demonstrated ability to work effectively with technical support and professional services teams.

2. Management Skills:
Exceptional leadership and people management skills, with the ability to inspire and lead a distributed team across different geographies.
Experience in high-pressure environments requiring rapid decision-making and problem-solving.

3. Communication & Collaboration:
Fluent in English, with excellent written and verbal communication skills.
Familiarity with the direct and informal communication style is an advantage.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8149363

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

03/04/2025

Site Reliability Engineer

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Are you passionate about ensuring system reliability, scalability, and performance? Do you thrive in a dynamic environment where automation and operational excellence are key?
We are looking for a Site Reliability Engineer (SRE) to join our team and play a crucial role in designing, implementing, and maintaining our cloud-based infrastructure. In this role, you will collaborate across teams to drive automation, improve system resilience, and optimize performance while fostering a culture of reliability.

Responsibilities:
System Reliability Ensure high availability and performance of services through effective monitoring, incident management, and root cause analysis.
Automation & Tooling Develop and maintain automation for infrastructure provisioning, configuration management, and application deployment.
Performance Optimization Analyze and enhance system performance, including load balancing, caching, and database tuning. Conduct regular capacity planning.
Incident Response & Troubleshooting Lead incident response efforts, participate in on-call rotations, and troubleshoot complex infrastructure issues.
Security & Compliance Collaborate with security teams to implement best practices and ensure compliance with relevant standards (ISO 27001, SOC 2, etc.).
Collaboration & Mentorship Work closely with developers, DevOps, Support, and product teams to enhance application reliability and implement SRE best practices.

Requirements:
Requirements:
5+ years in site reliability engineering, DevOps, or related roles.
Proven experience managing large-scale, cloud-based infrastructure in GCP, AWS, or Azure.
Expertise in container orchestration (Kubernetes, Docker) and microservices architecture.
Strong proficiency in scripting and programming languages (Python, Go, Bash, etc.).
Experience with CI/CD pipelines, infrastructure as code (Terraform, CloudFormation), and configuration management (Ansible, Puppet, Chef).
Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK Stack).
Deep understanding of networking concepts, DNS, load balancing, and distributed systems.
Strong problem-solving skills, excellent communication, and a proactive mindset.

Advantages:
Certifications AWS Certified Solutions Architect, GCP Professional Cloud Architect, or Kubernetes certifications (CKA, CKAD).

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8127121

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

23/04/2025

Senior Software Team Manager - Core VoIP Platform

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time and Hybrid work

We are seeking a Senior Team Manager to lead the team responsible for our Core VoIP Platformthe backbone of our mission-critical communications system. This platform handles real-time communications with 99.999% availability requirements, supporting emergency services globally.

The ideal candidate will bring extensive experience in VoIP systems and a proven track record in designing, architecting, and scaling high-availability, mission-critical platforms. Expertise in Kamailio, FreeSWITCH, or similar VoIP technologies is a strong advantage. Alongside VoIP expertise, we are looking for a leader with deep software engineering skills and the ability to manage and scale complex systems in cloud environments.

Reporting to: Director of Core Engineering Group.

Heres What Youll Be Doing
1. VoIP Platform Leadership:
Lead the architecture, design, and implementation of our Core VoIP Platform, ensuring scalability, high availability (99.999%), and fault tolerance.
Oversee the development and maintenance of VoIP components using technologies such as Kamailio, FreeSWITCH, and other SIP-based systems.
Drive the adoption of best practices in VoIP protocols (SIP, RTP), network security, and service resiliency.
Provide technical leadership in architectural decisions related to call routing, load balancing, failover mechanisms, and codec management.
2. Software Engineering Leadership:
Manage and guide a team responsible for the backend control plane, built with microservices architecture using various programming languages.
Champion software engineering best practices, including automated testing, CI/CD pipelines, and real-time monitoring.
Ensure the platform's compliance with industry standards for latency, quality of service (QoS), and security.
3. Strategic System Architecture:
Design and implement systems for growth and scale, focusing on distributed architecture and cloud-native services.
Establish and enforce best practices for monitoring, alerting, and incident response to minimize downtime and ensure rapid issue resolution.
Continuously evaluate and integrate emerging VoIP technologies and cloud architectures to enhance platform reliability and performance.
4. Team Leadership & Management:
Lead, mentor, and manage a geographically distributed team of engineers across multiple regions, including Israel, Europe, and other international locations.
Establish clear goals, KPIs, and growth paths for team members, focusing on both individual development and team performance.
Foster a collaborative, direct/

דרישות:
What You Bring:
VoIP Expertise (Must-Have):
5+ years of hands-on experience in VoIP technologies and protocols (SIP, RTP, WebRTC).
Proven expertise in Kamailio, FreeSWITCH, or similar open-source VoIP platforms is a strong advantage.
Experience with Session Border Controllers (SBCs), SIP proxies, and media servers.

Software Engineering Skills:
12+ years of software engineering experience, with at least 5 years in a management role leading engineering teams.
Proven track record in designing, architecting, and scaling mission-critical systems in cloud environments (AWS, GCP, or Azure).
Extensive experience in implementing high availability best practices such as:
Distributed systems design and microservices architecture.
Automated failover, load balancing, and disaster recovery strategies.
Real-time monitoring and alerting systems.
Strong technical background in cloud-native architectures and a mix of programming languages commonly used for high-performance backend systems.

Management Skills:
Exceptional leadership and people management skills, with the ability to inspire and lead a distributed team across different geographies.
Experience in high-pressure environments requiring rapid decision-making and problem-solving.

Communication & Collaboration:
Fluent in English, with excellent written and verbal communication skills.
Familiarity with the direct and informal communication styl המשרה מיועדת לנשים ולגברים כאחד.

עדכון קורות החיים לפני שליחה

8149369

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

16/04/2025

Backend Team Lead

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Our technology eases the stress of paying for lifes expenses by giving people more options on how and when they pay. Founded in 2016, offers a next-generation, no-fee credit card that can be managed through a powerful mobile app, as well as a point-of-sale payment option available at more than 25,000 service locations, including auto dealership service centers, optical practices, dental offices, veterinary clinics, and specialty healthcare services. included on the 2022 Inc. 5000 list. The financial technology company has also been named as a Most Loved Workplace, Best Point of Sale Company, and a Top Fintech Startup by CB Insights.

We use cutting-edge innovations in financial technology to bring leading data and features that allow individuals to be qualified instantly, making purchases at the point-of-sale fast, fair, and easy for consumers from all walks of life. We create value focused on our core values; we work tirelessly to ensure that becomes available to everyone, everywhere.
Feel free to reach out with any questions

What Youll Do:
looking for a Squad Lead to join and lead our Purchase Apps Squada multidisciplinary team responsible for exposing purchase process to users and partners. This squad owns a variety of products that are at the heart of how customers and partners engage with our platform.

As a Purchase Apps Squad Lead, your primary focus will be on architectural design, technical leadership, and cross-functional collaboration. You'll lead a team of frontend and backend engineers, work closely with product and business teams, and ensure alignment with business goals, system scalability, and development best practices.

Responsibilities

Among your key responsibilities, you will lead a highly skilled team that architects and builds scalable systems that serve our customers and partners.

Lead the technical architecture and system design for customer- and partner-facing purchase products
Guide a multidisciplinary team of frontend and backend engineers through implementation, delivery, and iteration
Work closely with product managers and business stakeholders to translate requirements into scalable technical solutions
Ensure development aligns with business objectives, long-term system health, and engineering standards
Conduct and lead design reviews, code reviews, and performance optimization efforts
Drive a culture of technical excellence, ownership, and continuous improvement
Oversee system reliability, monitoring, and incident response within your domain

Requirements:
2+ years of leadership experience, guiding engineering teams and making architectural decisions
8+ years of software engineering experience, with a strong focus on scalable architecture and system design
Proficiency in modern system architecture with large-scale and high performance.
Impact driven - use data to make sure every task moves the needle and utilize your resources to maximize the team's impact.
Team player with strong communication skills and someone who thrives working in a fast-paced environment.
Creative, solution-oriented, and able to maintain a can do attitude, with the ability to work efficiently under pressure and uncertainty. Adapt quickly to changing business needs.
Familiarity with microservices architecture and distributed systems.
B.Sc in computer science or equivalent.
Fluent in English.
Preferred Experience:
Experience with software development in Kotlin or Java.
Experience with Frontend technologies.
Leading multidisciplinary team.
Familiarity with microservices architecture and distributed systems.
Solid understanding of AWS or similar cloud platforms.
Expertise in the Spring Framework.
Proficient in both relational (e.g., MySQL) and non-relational databases (e.g., MongoDB).
Familiarity with message queue technologies such as Kafka, SQS, RabbitMQ, or simil

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8140432

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

31/03/2025

DevOps Team Leader

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

seeking a results-driven DevOps Team Lead to head the DevOps and Infrastructure team within our R&D organization. This role requires strategic vision, technical expertise, and a proactive approach to drive operational excellence, empower teams with robust tools and automation, and ensure high system reliability and scalability.

As the DevOps Team Lead, you will be instrumental in delivering critical KPIs, including system uptime, automation, incident management, and collaboration with development and QA teams to enable self-sufficiency. Additionally, you will serve as the leader for strategic projects, identifying opportunities to improve infrastructure and operational processes, setting long-term goals, and executing initiatives that align with Cyberints business objectives and growth.

Key Responsibilities
Strategic Leadership
System Reliability & Uptime

Lead initiatives to ensure system reliability, minimize disruptions, and maintain high availability for Cyberints SaaS platform.
Establish and manage proactive monitoring, alerting, and preventive maintenance strategies.
Drive incident prevention efforts, ensuring robust failover and disaster recovery mechanisms.
Develop and maintain playbooks to enable rapid diagnosis and resolution of issues.
Automation, Infrastructure as Code (IaC), & Self-Service Enablement

Champion the adoption of automation and IaC to streamline infrastructure management and deployments.
Build and enhance self-service tools and frameworks, empowering R&D teams to operate independently with minimal reliance on DevOps.
Continuously improve CI/CD pipelines to optimize deployment speed and reliability.
Collaboration & Support for Self-Sufficiency

Collaborate closely with development, QA, and support teams to deliver tools and frameworks that promote team autonomy and efficiency.
Advocate for cross-functional engagement to align operational processes with R&D objectives.
Provide training and mentorship to teams on using DevOps tools effectively.
Accountability, Ownership, & Scalability

Take ownership of all systems and infrastructure, ensuring solutions are scalable, resilient, and aligned with Cyberints growth objectives.
Establish clear accountability frameworks for maintaining infrastructure and delivering on key projects.
Design and execute a roadmap to support self-service-oriented and scalable solutions.

Identify and lead strategic projects to enhance Cyberints platform scalability, reliability, and operational efficiency.
Develop and execute a roadmap for critical infrastructure and DevOps initiatives that drive business success.
Collaborate with senior stakeholders to align projects with organizational priorities and deliver measurable outcomes.

Requirements:
5+ years of experience in DevOps or SRE roles, with 2+ years in a leadership capacity.
Proven expertise in building and maintaining highly available, cloud-native environments (AWS preferred).
Experience with Kubernetes, Terraform, CI/CD pipelines, and monitoring technology and tools (Prometheus, Grafana, Jenkins, ArgoCD, Terraform, Elasticsearch, Redis, EKS, etc.).
Skills & Expertise

Strong understanding of automation, Infrastructure as Code (IaC), and self-service enablement.
Expertise in incident management and a track record of delivering reliable, scalable systems.
Hands-on experience with scripting and automation tools (Python, Bash).
Deep understanding of containerization, orchestration, and cloud-native architectures.
Familiarity with cost monitoring and optimization strategies to ensure infrastructure is both efficient and cost-effective.
Knowledge of security best practices for infrastructure and DevOps environments.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8121466

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

20/04/2025

Software Engineer - Cloud Security

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time

Ask a member of our team and theyll answer, Our people! We work together to build and innovate best-in-class cybersecurity solutions for our customers; all while creating a culture of belonging, respect, and excellence where we can be our best selves. When youre part of our team, you can expect to partner with some of the most talented and passionate people in the industry, and have the support and resources you need to do work that truly matters. We deliver results that exceed expectations and we win together!
Cloud Security was born out of the acquisition of Ermetic, an innovative cloud-native application protection platform (CNAPP) company, and a leading provider of cloud infrastructure entitlement management (CIEM). The acquisition combines two cybersecurity innovators and marks an important milestone in mission to shift organizations to proactive security. The combination of and Ermetic offerings will add capabilities to deliver market-leading contextual risk visibility, prioritization and remediation across infrastructure and identities, both on-premises and in the cloud.
Your Role:
Design, develop, and maintain complex, scalable, and high-performance systems with a focus on writing clean, efficient, and maintainable code.
Collaborate with cross-functional teams, including product managers and architects, to implement technical solutions aligned with business objectives.
Provide input on architectural design and participate in technical planning to ensure long-term maintainability and compliance with standards.
Identify and troubleshoot production issues, conducting root cause analysis and implementing fixes to ensure system reliability.
Contribute to code reviews and technical discussions, sharing knowledge and fostering a culture of collaboration and continuous improvement.
Your Opportunity:
Develop critical components and systems that drive business outcomes, while maintaining and enhancing core infrastructure for scalability and reliability.
Participate in the design and development of new features, integrations, and enhancements to software applications, databases, and interfaces.
Support and improve automated testing and deployment processes to ensure smooth delivery of new features and system updates.
Collaborate with other senior engineers to drive the technical direction of projects and ensure high-quality software delivery.

Requirements:
BSc in Computer Science or a related degree from a recognized institution, or a strong track record in server-side development with advanced technical skills.
6+ years of experience in software engineering with a demonstrated ability to work on large-scale projects and solve complex technical problems.
Proficiency in one or more programming languages such as C/C++, C#, Java, Go, or Python, with the ability to adapt to new tools and technologies.
Solid experience building scalable, distributed systems with an understanding of microservices architecture and API design.
Experience with cloud platforms such as AWS, Azure, or GCP, with knowledge of best practices for deploying and maintaining cloud-based services.
Strong problem-solving skills, with experience debugging and resolving production issues in complex systems.
Ability to prioritize tasks, manage workload efficiently, and contribute to the technical growth of the team.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8142855

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

27/03/2025

Site Reliability Engineering Manager

חברה חסויה

Location: Tel Aviv-Yafo and Netanya

Job Type: Full Time

We are looking for a Site Reliability Engineering Manager to lead our Israel SRE team. In this role, you'll drive best practices in reliability engineering, ensuring the stability, availability, and performance of our SaaS services. You'll collaborate with global SRE leaders, refine processes, and foster a culture of accountability and continuous improvement.
As a Site Reliability Engineering Manager you will
Lead, mentor, and develop a high-performing SRE Israel team, fostering collaboration, innovation, and accountability
Ensure SaaS reliability, performance, and availability, meeting or exceeding service-level objectives
Drive SRE best practices, including capacity planning, incident management, chaos engineering, and disaster recovery
Implement proactive monitoring, alerting, and anomaly detection aligned with SaaS standards
Collaborate with P&E and Cloud engineering teams to embed reliability into the SDLC
Oversee incident management, ensuring swift identification, escalation, and resolution
Maintain comprehensive SRE documentation, including processes, incident reports, and system architecture
Evaluate and adopt tools, technologies, and methodologies to enhance uptime and reliability.

Requirements:
3+ years of management experience leading a team of SRE, DevOps, or a similar SaaS role
Bachelors degree in Computer Science, Engineering, or related field (or equivalent experience)
Strong expertise in cloud platforms (AWS, GCP, or Azure), containers (Kubernetes, Docker), and configuration management (Terraform, Ansible)
Proficiency in Python or Go for automation and system optimization, as well as GitOps experience with SCM tools (e.g., Git, Bitbucket)
Strong leadership, communication, and collaboration skills, working across globally distributed teams
Familiarity with Agile methodologies, CI/CD pipelines, and orchestration tools (Jenkins, ArgoCD, StackStorm)
Familiarity with Chaos Engineering (e.g., Gremlin, Litmus, Chaos Toolkit)
Hands-on with alerting & observability tools (e.g., PagerDuty, OpsGenie, New Relic, Coralogix)
Strong understanding of scalability, high availability, and security best practices in cloud & Kubernetes environments.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8118211

שירות זה פתוח ללקוחות VIP בלבד

שמך המלאמה השם שלך?

מייל

תיאור

שליחה

תודה על שיתוף הפעולה

מודים לך שלקחת חלק בשיפור התוכן שלנו :)

המשרה נמחקה

תוכל לצפות בה בדף המשרות שלי

המשרה הוחזרה לרשימת תוצאות החיפוש

האם תרצה להסיר את המשרה מרשימת

המשרות השמורות שלך?

כן לא

אירעה שגיאה בשליחת פרטיך למשרה

02/04/2025

SRE Team Leader

חברה חסויה

Location: Tel Aviv-Yafo

Job Type: Full Time and Hybrid work

Were growing and looking to hire SRE Team Leader who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.
Responsibilities
As an SRE Team Leader, Your impact will be:
Site Reliability Engineering (SRE)
Production Gatekeeper: Design and enforce the rollout strategy for new technologies and oversee their execution to ensure minimal disruption to existing systems.
Production On-Call: Act as the first line of response for critical incidents, assessing issues, triaging, and coordinating with the team to prevent further issues and swiftly restore services.
Monitor Production Performance and Degradation: Keep a close eye on system performance metrics and detect any degradation early to prevent outages and disruptions.
Production Maintenance: Conduct regular infrastructure upgrades to accommodate changes, developments, and advancements in the technological landscape.
Manage Release Flow: Oversee the release of updates and new functionalities, ensuring a seamless transition while handling any potential negative impacts on production.
Staging Management: Oversee the management of the staging environment, ensuring that it accurately represents the production environment for effective testing and simulation.
Network Operations Center (NOC)
Build Playbooks: Develop and maintain comprehensive playbooks for managing system issues and incidents, setting guidelines for troubleshooting, escalation, and resolution processes.
Build Monitoring Dashboards: Design, set up, and maintain monitoring dashboards to visualize and track system performance and incidents in real-time.
Alerts and Incident Management: Establish protocols for issuing alerts in the event of system issues or anomalies and lead the team in incident resolution.

Requirements:
What do you need to succeed in this role?
Proven experience in SRE/DevOps roles (NOC role - advantage) and team management experience
Strong leadership qualities and team management skills.
Tech stack - Jenkins, TF, Ansible, Bash, Python, AWS, Argo
Expertise in system monitoring and incident management tools
Exceptional problem-solving and analytical skills
Excellent written and verbal communication abilities.
A Bachelor's degree in Computer Science, Information Technology, or a related field - Advantage
Familiarity with Agile methodologies.

This position is open to all candidates.

עדכון קורות החיים לפני שליחה

8125434

שירות זה פתוח ללקוחות VIP בלבד