דרושים » הנדסה » Systems Engineer III, Site Reliability Engineering

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Improve the life-cycle of services from inception and design, through deployment, operation, and refinement.
Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health, and lead sustainable incident response.
Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.
Requirements:
Bachelors degree in Computer Science, a related field, or equivalent practical experience.
2 years of experience working with one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
2 years of experience working with administration (e.g., filesystems, inodes, system calls) or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN).

Preferred qualifications:
Master's degree in Computer Science or Engineering.
Experience in managing and operating global-scale production systems in cloud environments.
Experience architecting, developing, and troubleshooting systems.
Experience designing, analyzing, and troubleshooting distributed systems.
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8135331
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Set and communicate team priorities that support the broader organization's goals. Align strategy, processes, and decision-making across teams.
Set clear expectations with individuals based on their level and role and aligned to the broader organization's goals. Meet regularly with individuals to discuss performance and development and provide feedback and coaching.
Develop the mid-term technical goal and roadmap within the scope of our often multiple teams. Evolve the roadmap to meet anticipated future requirements and infrastructure needs.
Design, guide and vet systems designs within the scope of the broader area, and write product or system development code to solve ambiguous problems.
Review code developed by other engineers and provide feedback to ensure practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
Requirements:
Bachelor's degree or equivalent practical experience.
8 years of experience building and developing infrastructure or distributed systems.
Experience with software development in one or more programming languages (e.g., Python, C, C++, Java, Javascript).
Experience in a technical leadership role.

Preferred qualifications:
Masters degree or PhD in Engineering, Computer Science, or a related technical field.
3 years of experience working in a matrixed organization.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8135162
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Write product or system development code.
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
Requirements:
Bachelors degree or equivalent practical experience.
1 year of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
1 year of experience with data structures or algorithms.
1 year of experience building and developing large-scale infrastructure or distributed systems.

Preferred qualifications:
Experience in User Space Networking, Software-Defined Networking and Low Latency Networking.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8135377
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
Required Production Engineer
Production Engineers are hybrid software/systems engineers who ensure that our services run smoothly and have the capacity for future growth. They are embedded in every one of Facebook's product and infrastructure teams, and are core participants in every significant engineering effort underway in the company.
Our team members come with varying levels of experience and backgrounds. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network Engineer, Database Administrator or similar role), but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted waters every day in Production Engineering, and we are always learning.
This position is full-time.
Production Engineer Responsibilities
Own the end-to-end reliability and scalability of the platforms, services, and products built on top of our underlying infrastructure and network.
Write and review code, develop documentation and capacity plans, and debug the hardest problems, on some of the largest and most complex systems in the world.
Together with your engineering team, you will share an on-call rotation and be an escalation contact for live service incidents.
Partner alongside the best engineers in the industry on the coolest stuff around, the code and systems you work on will be in production and used by billions of users all around the world.
Requirements:
Minimum Qualifications
4+ years experience coding in higher-level languages (e.g., PHP, Python, C++, Rust or Java).
4+ years experience building, maintaining, and debugging production services/platforms such as cloud infrastructure, load balancers, relational databases, and messaging systems.
4+ years experience with software development, frameworks and APIs.
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
Preferred Qualifications
Depth of understanding in an areas such as operating systems or TCP/IP network fundamentals
Experience with distributed web-scale & Data systems.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8142866
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
Lead, mentor, and manage multiple software engineering teams to deliver high-quality, scalable solutions.
Define and drive the software development roadmap, ensuring alignment with business objectives.
Oversee the architecture, design, and implementation of data streaming and processing solutions.
Collaborate with cross-functional teams (e.g., product, operations, and hardware) to deliver end-to-end IoT solutions.
Implement best practices for coding, testing, and deployment to maintain high standards of software quality.
Monitor project progress, manage resources, and ensure timely delivery of milestones.
Stay up-to-date with the latest industry trends, tools, and technologies to drive innovation.
Requirements:
Proven experience as a Software Director or similar leadership role, with at least two managerial tenures.
Experience in the AI & MLOps ecosystems.
Strong expertise in data engineering, data streaming, and related technologies (e.g., Kafka, Apache Flink, Spark, or similar).
A solid background in software development, with proficiency in modern programming languages (e.g., Python, Java, or C++).
Experience in designing and deploying scalable, cloud-based systems (AWS, Azure, or GCP preferred).
Excellent problem-solving skills and the ability to navigate complex technical challenges.
Strong interpersonal and communication skills, with a focus on building and maintaining a positive team culture.
Bachelors or Masters degree in Computer Science, Engineering, or a related field.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8142146
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Staff Infrastructure Software Engineer, you'll join our R&D center in Tel Aviv. You'll be a pivotal member of a cross-functional group working with multiple development teams to build the infrastructure, tooling, and standards for infrastructure and infra-related coding across our company's R&D. This role emphasizes research, programming, building scalable systems, and hands-on operational work in production environments. You'll play a critical role in shaping the future of transportation, affecting millions of riders daily worldwide.

What Youll Do:
Develop, drive, execute, and lead a long-term vision and strategy for our infrastructure, and tooling.
Create scalable, reliable, and efficient systems that enhance our technological foundation and support our growing scale and stability requirements.
Engage in hands-on coding and development, writing, building, deploying, and maintaining code in production environments.
Actively participate in day-to-day production operations alongside research and development, implementing real changes in production systems and directly addressing operational challenges.
Work closely with various development teams, moving between the four teams in our group as needed to address business objectives and tackle the biggest R&D challenges.
Work across a broad spectrum of technological disciplines, from network and operating system internals to high-level architecture planning.
Requirements:
BSc or MSc in Computer Science.
Minimum of 8 years of experience in backend and infrastructure development.
At least 3 years of hands-on experience with both a high-level language (e.g., Python, Go, Java, C#) and a low-level language (e.g., C, C++, Rust).
Experience in large-scale production microservices environments and devops operations.
Deep understanding of distributed systems and microservices architectures.
Proficiency with cloud platforms such as AWS, GCP, or Azure, and at least 2 years of experience with containerization technologies like Docker and Kubernetes.
Ability to move between teams and projects, working effectively in a dynamic environment.
Strong interpersonal skills, enjoys working with people, and capable of articulating complex technical concepts.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8126889
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are seeking a Senior Team Manager to lead the team responsible for the core of our mission-critical communications platform. This platform handles real-time communications with 99.999% availability requirements, supporting emergency services globally. The ideal candidate will combine deep software engineering expertise and architectural skills with outstanding management capabilities to drive the development and maintenance of large-scale, high-availability systems.

This role requires extensive experience in architecting and scaling mission-critical systems in cloud environments, following best practices for high availability and reliability. The candidate must also have proven experience in managing complex environments, including coordinating with technical support and professional services teams across different time zones, particularly in the U.S. Experience with real-time communications (VoIP or WebRTC) is a significant advantage.

Reporting to: Director of Core Engineering Group.

Heres What Youll Be Doing:
1. Software Engineering Leadership:
Lead the architecture, design, and implementation of our core communications platform, ensuring scalability, high availability (99.999%), and fault tolerance.
Drive the adoption of software engineering best practices such as service resiliency, failover mechanisms, load balancing, and distributed systems design.
Oversee the development of a mix of legacy components and modern microservices using various programming languages and frameworks.
Provide technical leadership in architectural decisions, focusing on performance optimization, security, and reliability.
2. Strategic System Architecture:
Design and implement systems for growth and scale, ensuring they can handle increasing loads while maintaining strict availability and performance standards.
Establish and enforce best practices for monitoring, alerting, and incident response to minimize downtime and ensure rapid issue resolution.
Continuously evaluate and integrate emerging technologies and cloud-native architectures to future-proof the core platform.
3. Team Leadership & Management
Lead, mentor, and manage a team of engineers distributed across multiple regions, including Israel, Europe, and other international locations.
Establish clear goals, KPIs, and growth paths for team members, focusing on both individual development and team performance.
Foster a collaborative, direct, and informal communication culture aligned with our values.
Requirements:
1. Experience & Skills:
12+ years of software engineering experience, with at least 5 years in a management role leading engineering teams.
Proven track record in designing, architecting, and scaling mission-critical systems in cloud environments (AWS, GCP, or Azure).
Extensive experience in implementing high availability best practices such as:
- Distributed systems design and microservices architecture.

- Automated failover, load balancing, and disaster recovery strategies.

- Real-time monitoring and alerting systems.

Strong technical background in cloud-native architectures and a mix of programming languages commonly used for high-performance backend systems.
Experience with real-time communications platforms (VoIP, WebRTC, or similar) is a significant advantage.
Demonstrated ability to work effectively with technical support and professional services teams.

2. Management Skills:
Exceptional leadership and people management skills, with the ability to inspire and lead a distributed team across different geographies.
Experience in high-pressure environments requiring rapid decision-making and problem-solving.

3. Communication & Collaboration:
Fluent in English, with excellent written and verbal communication skills.
Familiarity with the direct and informal communication style is an advantage.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8149363
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
02/04/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking an experienced and motivated Engineering Backend Tech Lead to join our dynamic Site Reliability Engineering (SRE) team. As an Engineering Backend Tech Lead you will play a crucial role in enhancing the reliability, performance, and scalability of our systems and services. You will be a part of a global commando team of highly skilled SREs, driving best practices and innovations for optimal system operations, while protecting critical companies systems in a real time.
In this role, you will be responsible for:
Drive incident response and post-mortem processes, fostering a culture of continuous improvement.
Design, build and improve internal tools and automation software to make maintaining production services easier and safer.
Lead reliability-focused practices such as SLO (Service Level Objective) design and implementation, Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others.
Participate in the on-call rotation, providing expertise and support during critical system incidents and ensuring timely resolution.
Requirements:
Minimum 5 years of Software Engineering experience with .Net, NodeJs or other object-oriented languages.
Knowledge of architecture and application design experience.
Excellent troubleshooting and debugging skills.
Excellent verbal and written communication skills in English.
Basic knowledge of AWS or other cloud platforms on the infrastructure level
Preferred:
Experience with building AzureDevops CI/CD pipelines
Experience working on large-scale, high-traffic platforms.
Distributed monitoring experience with logging, metrics and tracing using OpenTelemetry and Prometheus.
Additional scripting languages: bash, powershell, python
Previous experience working as SRE
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8125295
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
02/04/2025
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are seeking an experienced and motivated SRE Tech Lead to join our dynamic Site Reliability Engineering (SRE) team. As a Tech Lead you will play a crucial role in enhancing the reliability, performance, and scalability of our systems and services. You will be a part of a global commando team of highly skilled SREs, driving best practices and innovations for optimal system operations, while protecting critical companies systems in a real time.
In this role, you will be responsible for:
Drive incident response and post-mortem processes, fostering a culture of continuous improvement.
Design, build and improve internal tools and automation software to make maintaining production services easier and safer.
Lead reliability-focused practices such as SLO (Service Level Objective) design and implementation, Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others.
Participate in the on-call rotation, providing expertise and support during critical system incidents and ensuring timely resolution.
Requirements:
Minimum 5 years of Software Engineering experience with .Net, NodeJs or other object-oriented languages.
Knowledge of architecture and application design experience.
Excellent troubleshooting and debugging skills.
Excellent verbal and written communication skills in English.
Basic knowledge of AWS or other cloud platforms on the infrastructure level
Preferred:
Experience with building AzureDevops CI/CD pipelines
Experience working on large-scale, high-traffic platforms.
Distributed monitoring experience with logging, metrics and tracing using OpenTelemetry and Prometheus.
Additional scripting languages: bash, powershell, python
Previous experience working as SRE
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8125103
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
30/03/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an innovative, multi-disciplinary cloud developer, to participate in product improvement efforts, ongoing operational tasks and customer facing technical challenges.

You will be required to demonstrate deep technological capabilities as well as great analytical skills.

You will be responsible for developing, maintaining, and improving the quality of our production-critical solutions:

Own, maintain and improve our cloud image creation system
Take part in the ongoing improvement of a wide range of solutions and technologies
Monitor, examine and remediate quality issues in all product aspects
Collaborate with the product focal points on quality, release, and content targets
Adapt and develop technical expertise in specific product aspect
Work closely with customers and support to deliver end-to-end high quality solutions to field issues and requests
Own the certification process of new cloud platforms
Manage our cloud offerings, product release and infrastructure components
Research and integrate new technologies/principles/patterns into our operational stacks
Requirements:
B.Sc. degree in Computer Engineering / Computer Science (or equivalent field of study) Must.
Working knowledge in Python, Java, C or similar
Analytic and problem solving skills, fast learner
Superb hands on development & debugging skills
Self-driven and extremely motivated
Great communication skills
Analytic, problem solving skills and fast learner
Ability to work in a multi-tasked and dynamic environment
Advantages:

Technical experience working with customers
Proficiency in Bash/Python/C/C+ programming languages
Comfortable working in Linux environment
Networking knowledge (TCP/IP stack, OSI model & routing)
Familiarity with Network Security
Familiarity of Virtualization concepts
Experience with cloud environments
Familiarity with SDLC and/or CI/CD
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8120001
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
Location: Tel Aviv-Yafo
Job Type: Full Time
Your Role:

Design, develop, and maintain complex, scalable, and high-performance systems with a focus on writing clean, efficient, and maintainable code.

Collaborate with cross-functional teams, including product managers and architects, to implement technical solutions aligned with business objectives.

Provide input on architectural design and participate in technical planning to ensure long-term maintainability and compliance with standards.

Identify and troubleshoot production issues, conducting root cause analysis and implementing fixes to ensure system reliability.

Contribute to code reviews and technical discussions, sharing knowledge and fostering a culture of collaboration and continuous improvement.

Your Opportunity:

Develop critical components and systems that drive business outcomes, while maintaining and enhancing core infrastructure for scalability and reliability.

Participate in the design and development of new features, integrations, and enhancements to software applications, databases, and interfaces.

Support and improve automated testing and deployment processes to ensure smooth delivery of new features and system updates.

Collaborate with other senior engineers to drive the technical direction of projects and ensure high-quality software delivery.
Requirements:
BSc in Computer Science or a related degree from a recognized institution, or a strong track record in server-side development with advanced technical skills.

6+ years of experience in software engineering with a demonstrated ability to work on large-scale projects and solve complex technical problems.

Proficiency in one or more programming languages such as C/C++, C#, Java, Go, or Python, with the ability to adapt to new tools and technologies.

Solid experience building scalable, distributed systems with an understanding of microservices architecture and API design.

Experience with cloud platforms such as AWS, Azure, or GCP, with knowledge of best practices for deploying and maintaining cloud-based services.

Strong problem-solving skills, with experience debugging and resolving production issues in complex systems.

Ability to prioritize tasks, manage workload efficiently, and contribute to the technical growth of the team.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8140445
סגור
שירות זה פתוח ללקוחות VIP בלבד