Site Reliability Engineer (SRE)

Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We're growing and looking to hire a Site Reliability Engineer (SRE) who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.
We are looking for a skilled and motivated Site Reliability Engineer (SRE) to join our team and help ensure our production cloud environment's reliability, performance, and scalability. As an SRE, you will work at the intersection of software engineering and operations, taking ownership of system stability, incident response, automation, and continuous improvement of our infrastructure.
This role is ideal for engineers who thrive in dynamic environments, value reliability, and enjoy building resilient and scalable systems.
As an SRE, your impact will be:
Production Reliability: Ensure system uptime and performance by identifying and addressing potential issues before they affect end users.
Incident Response: Serve as part of the on-call rotation, rapidly diagnosing and resolving incidents, and conducting root cause analysis and postmortems.
Monitoring and Alerting: Build and maintain monitoring dashboards and alerting systems to detect and respond to anomalies in real time.
Automation and Tooling: Develop and maintain automation tools for deployments, scaling, and operational efficiency using Terraform, Ansible, Bash, or Python.
Infrastructure Maintenance: Perform regular maintenance and upgrades of production infrastructure to ensure security, stability, and performance.
Release Engineering: Support and optimize the rollout of new features and updates, minimizing risk and impact on production environments.
Staging Environment Management: Ensure staging environments accurately reflect production for robust testing and validation of changes.
Requirements:
Experience in SRE, DevOps, or production engineering roles
Strong skills in system troubleshooting, incident response, and root cause analysis
Proficiency with tools such as:
Jenkins, Terraform, Ansible, Git, GitHub
Bash, Python
AWS, ArgoCD, or similar CI/CD and cloud platforms
Familiarity with observability tools and practices (metrics, logging, tracing)
Ability to work effectively in cross-functional teams
Strong communication and documentation skills
Bachelor's degree in Computer Science, Information Technology, or a related field (preferred)
Familiarity with Agile development methodologies
This position is open to all candidates.
 
8198455
Similar jobs that may interest you
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are seeking a skilled Site Reliability Engineer (SRE) to join our team and help build, maintain, and improve the reliability, scalability, and performance of our systems. As an SRE, you will be responsible for owning observability tools, driving incident management processes, and implementing automation to enhance our infrastructure. This role involves collaborating across teams to ensure a robust and efficient technology stack supporting mission-critical systems.

You will:
Proactively enhance system reliability, scalability, and performance through automation, monitoring, and capacity planning.
Develop and maintain observability systems, including distributed tracing, logging, and metrics platforms.
Establish and maintain organizational standards for monitoring, leveraging tools like Prometheus, Grafana, and OpenTelemetry.
Drive incident management, root cause analysis, and continuous improvement initiatives.
Partner with development teams to integrate reliability best practices into the software development lifecycle.
Manage infrastructure at scale in cloud services (AWS is an advantage) and on platforms like Kubernetes or ECS.
Optimize resource utilization to reduce costs while maintaining service quality.
Requirements:
At least 5 years of experience as an SRE.
Strong experience with Observability Tools: Proficiency with OpenTelemetry, Grafana, Prometheus, and ELK stack (Elasticsearch, Logstash, Kibana).
Experience with Cloud Platforms: In-depth knowledge of AWS services, including EC2, S3, RDS, and CloudFormation/Terraform for infrastructure-as-code.
Proficiency in scripting and/or development languages like Bash or Python.
Thorough understanding of CI/CD pipelines and automation tools.
Understanding of Infrastructure as Code, and strong experience with automation tools like Terraform and/or Ansible.
Solid troubleshooting and debugging skills.
A team player with a strong can-do mentality.
This position is open to all candidates.
 
8163101
1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Site Reliability Engineer (SRE) to join our Engineering team: someone who has a passion for observability, monitoring, automation, and high-availability systems, and a desire to solve complex technological challenges with a proactive approach to continuous improvement.

We use an interesting and mixed technology stack: Kubernetes, Terraform, CI/CD pipelines, Datadog, Prometheus, and cloud-native architectures.

In this position, you will use your expertise in building and scaling SRE operations, and will design, implement, and operate a world-class reliability strategy.


Key Responsibilities
Develop and maintain our monitoring, alerting, and logging systems, ensuring high visibility into production environments.
Implement automation to improve system reliability, scalability, and efficiency.
Troubleshoot and resolve production incidents, leading root cause analyses and implementing permanent fixes.
Collaborate with software engineers and DevOps teams to enhance application performance and resilience.
Continuously improve operational processes, focusing on reducing toil and improving reliability.
Requirements:
3+ years of experience as an SRE, DevOps Engineer, or in a similar role.
Hands-on experience with monitoring and observability tools like Datadog, Prometheus, and Grafana.
Strong understanding of Linux systems, networking, and cloud-native architectures.
Experience with Kubernetes, Terraform, and CI/CD pipelines.
A problem solver, capable of finding creative solutions and getting things done.
Fluent in incident management, RCA processes, and operational best practices.
This position is open to all candidates.
 
8200136
05/05/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking an experienced Senior Site Reliability Engineer to join our SRE team as part of our Platform Engineering group. This role involves taking ownership of monitoring, deploying, and ensuring the reliability of production-grade modern SaaS platforms across Cloud and On-Premise environments.
Responsibilities:
Lead initiatives to enhance product reliability and system readiness.
Design and implement sophisticated monitoring solutions to ensure high availability and performance of our production platform.
Oversee and refine the entire product reliability pipeline.
Proactively troubleshoot and resolve issues across production environments.
Champion an "Everything as Code" approach using a wide range of technologies including Ansible, Terraform, Helm, Python and more.
Develop advanced tools for automation, deployment, monitoring, and operations.
Exhibit excellent communication and interpersonal skills to effectively collaborate within the team and across departments.
Promote best practices in reliability and system operations.
Requirements:
At least 4-5 years of experience as a DevOps or Site Reliability Engineer.
In-depth knowledge of microservices architectures and technologies such as Kubernetes.
Comprehensive understanding of cloud & on-prem environments and hybrid solutions.
Proficiency with one or more major cloud providers (AWS experience is an advantage).
Advanced experience with CI/CD technologies including Jenkins, GitHub Actions, and ArgoCD.
Proficient coding and scripting capabilities in Python, Bash, or similar languages.
Strong team player with proven ability to lead and inspire.
Advantages:
Prior experience with endpoint security products (agents, sensors, collectors).
Background in working with AI components (training, inference, serving).
Tech Stack: AWS, Kubernetes, EKS, RKE2, ECS, SageMaker, Jenkins, GitHub, Terraform, Python, Ansible, Docker + Compose, ArgoCD, MongoDB, RabbitMQ, Redis, Go, Neo4J, AI, and more.
This position is open to all candidates.
 
8162480
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior Site Reliability Engineer, you'll play a key role in shaping our new Production Reliability domain. You'll drive reliability initiatives, lead cross-team projects, and make sure our SaaS platform stays robust, scalable, and efficient. This is a high-impact, hands-on role that demands technical expertise and a proactive approach.

As a Senior SRE, you will:

Design, build, and maintain scalable, fault-tolerant systems.
Define and enforce SLOs, SLIs, and SLAs, and drive improvements based on real data.
Build automation and tooling to enhance observability, testing, and deployments.
Lead complex incident responses, including on-call rotations and postmortems.
Collaborate closely with engineering, product, and support teams to embed reliability into everything we do.
Mentor engineers and promote operational excellence across the organization.
Requirements:
Have 7+ years of experience in SRE, DevOps, or Production Engineering roles, ideally in SaaS environments.
Bring deep expertise in resilience engineering, monitoring, and building fault-tolerant systems.
Are hands-on with monitoring tools like Datadog, Dynatrace, OpenSearch, Coralogix, or Sentry.
Are experienced with CI/CD tools like Jenkins or ArgoCD.
Are proficient with infrastructure-as-code tools like Terraform or Crossplane.
Have strong knowledge of Linux systems and networking fundamentals.
Have solid experience with cloud platforms (AWS preferred).
Are an advanced coder in Java (Python or Go is a plus).
Know Kubernetes and the broader CNCF ecosystem inside out.
Excel at debugging and root cause analysis.
Are fluent in Hebrew and English.
Bring a high sense of ownership and accountability to everything you do.
This position is open to all candidates.
 
8199501
1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Site Reliability Engineering (SRE) & Production Team Leader to join our Engineering team: someone who has a passion for observability, monitoring, automation, and high-availability systems, and a desire to solve complex technological challenges with a proactive approach to continuous improvement.

We use an interesting and mixed technology stack: Kubernetes, Terraform, CI/CD pipelines, Datadog, Prometheus, and cloud-native architectures.

In this position, you will use your expertise in building and scaling SRE operations, and will design, implement, and operate a world-class reliability strategy.

Key Responsibilities
Design, build, and manage our SRE framework to ensure observability, resilience, and high availability.
Develop and automate solutions for proactive monitoring, incident response, and performance optimization.
Improve and maintain our alerting and monitoring stack, leveraging tools like Datadog, Prometheus, and Grafana.
Lead post-mortem analysis and implement continuous improvement initiatives.
Collaborate with DevOps, Engineering, and Product teams to ensure smooth and efficient delivery of reliable services.
Requirements:
SRE & Production Manager with 5+ years of experience in SRE, Production Engineering, or DevOps, including 2+ years in a leadership role.
Experience with monitoring and observability tools like Datadog, Prometheus, and Grafana.
A problem solver, capable of finding creative solutions and getting things done.
Fluent in incident management, RCA processes, and operational best practices.
Experience with AWS (EKS, EC2, RDS, S3, networking configurations).
This position is open to all candidates.
 
8200138
20/05/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a results-driven DevOps Team Lead to head the DevOps and Infrastructure team within our R&D organization. This role requires strategic vision, technical expertise, and a proactive approach to drive operational excellence, empower teams with robust tools and automation, and ensure high system reliability and scalability.

As the DevOps Team Lead, you will be instrumental in delivering critical KPIs, including system uptime, automation, incident management, and collaboration with development and QA teams to enable self-sufficiency. Additionally, you will serve as the leader for strategic projects, identifying opportunities to improve infrastructure and operational processes, setting long-term goals, and executing initiatives that align with Cyberint's business objectives and growth.

Key Responsibilities
Strategic Leadership

Identify and lead strategic projects to enhance Cyberint's platform scalability, reliability, and operational efficiency.
Develop and execute a roadmap for critical infrastructure and DevOps initiatives that drive business success.
Collaborate with senior stakeholders to align projects with organizational priorities and deliver measurable outcomes.
System Reliability & Uptime

Lead initiatives to ensure system reliability, minimize disruptions, and maintain high availability for Cyberint's SaaS platform.
Establish and manage proactive monitoring, alerting, and preventive maintenance strategies.
Drive incident prevention efforts, ensuring robust failover and disaster recovery mechanisms.
Develop and maintain playbooks to enable rapid diagnosis and resolution of issues.
Automation, Infrastructure as Code (IaC), & Self-Service Enablement

Champion the adoption of automation and IaC to streamline infrastructure management and deployments.
Build and enhance self-service tools and frameworks, empowering R&D teams to operate independently with minimal reliance on DevOps.
Continuously improve CI/CD pipelines to optimize deployment speed and reliability.
Collaboration & Support for Self-Sufficiency

Collaborate closely with development, QA, and support teams to deliver tools and frameworks that promote team autonomy and efficiency.
Advocate for cross-functional engagement to align operational processes with R&D objectives.
Provide training and mentorship to teams on using DevOps tools effectively.
Accountability, Ownership, & Scalability

Take ownership of all systems and infrastructure, ensuring solutions are scalable, resilient, and aligned with Cyberint's growth objectives.
Establish clear accountability frameworks for maintaining infrastructure and delivering on key projects.
Design and execute a roadmap to support self-service-oriented and scalable solutions.
Requirements:
5+ years of experience in DevOps or SRE roles, with 2+ years in a leadership capacity.
Proven expertise in building and maintaining highly available, cloud-native environments (AWS preferred).
Experience with Kubernetes, Terraform, CI/CD pipelines, and monitoring technologies and tools (Prometheus, Grafana, Jenkins, ArgoCD, Elasticsearch, Redis, EKS, etc.).
Skills & Expertise

Strong understanding of automation, Infrastructure as Code (IaC), and self-service enablement.
Expertise in incident management and a track record of delivering reliable, scalable systems.
Hands-on experience with scripting and automation tools (Python, Bash).
Deep understanding of containerization, orchestration, and cloud-native architectures.
Familiarity with cost monitoring and optimization strategies to ensure infrastructure is both efficient and cost-effective.
Knowledge of security best practices for infrastructure and DevOps environments.
This position is open to all candidates.
 
8185042
06/05/2025
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly skilled and experienced Architecture & Operations Lead to drive the development of infrastructure for automation testing, internal DevOps, CI/CD, and deployment.

This role is critical in designing and maintaining scalable and high-performance infrastructure for software development, testing, and production environments. The ideal candidate has a strong background in cloud infrastructure, automation, microservices architecture, performance monitoring, and software best practices.

Key Responsibilities:

Infrastructure & Automation:
Design and implement infrastructure for automation testing and internal DevOps processes.
Develop and manage CI/CD pipelines, ensuring smooth and automated software deployment.
Architect and maintain scalable infrastructure on AWS, leveraging Terraform and infrastructure-as-code (IaC) best practices.
Define and enforce software best practices, ensuring reliability, maintainability, and security.

Operations & Performance Monitoring:
Lead performance monitoring and optimization efforts using APM (Application Performance Monitoring) tools such as New Relic.
Implement Site Reliability Engineering (SRE) principles to enhance system reliability and scalability.
Monitor and improve system performance, ensuring high availability and fault tolerance.

Collaboration Across Teams:
Work closely with development, product, and DevOps teams to align infrastructure strategies with system architecture.
Conduct design reviews and provide recommendations to optimize software and infrastructure performance.
Oversee GitHub Actions workflows for efficient automation and deployment processes.

Security & Compliance:
Ensure infrastructure meets industry security and compliance standards.
Collaborate with security teams to perform vulnerability assessments and implement secure deployment strategies.

Software Development & Best Practices:
Define and enforce best practices for software development and deployment.
Ensure backward compatibility compliance, preventing API breakages.
Drive automation initiatives to reduce manual effort and increase efficiency.
Requirements:
Key Experience and Qualifications Required:
Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
8+ years of experience in software infrastructure, DevOps, or cloud architecture, including leadership roles.
Expertise in designing and managing CI/CD pipelines using GitHub Actions.
Strong experience with AWS, Terraform, and infrastructure-as-code (IaC) principles.
Proficiency in Python for automation and infrastructure management.
Strong understanding of microservices architecture and distributed systems.
Experience with performance monitoring tools such as New Relic and APM solutions.
Familiarity with containerization and orchestration (Docker, Kubernetes).
Hands-on experience with SRE methodologies and best practices.
Strong problem-solving skills with a focus on scalability and system-wide impact.

Preferred Skills:
Experience in high-availability system design and cloud-based infrastructure optimization.
Knowledge of compliance and security frameworks for cloud environments.
Strong analytical skills for performance tuning and optimization.
This position is open to all candidates.
 
8164868
24/04/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Team Lead to lead our geographically diverse team and take ownership of our Cloud Infrastructure and Platform Engineering strategy, enabling high-scale, cutting-edge GenAI products running across 40+ Kubernetes clusters on GCP and AWS.

This role combines technical leadership, team management, and hands-on engineering, requiring solid expertise in cloud-native technologies, Kubernetes at scale, and modern DevOps principles. You will collaborate closely with engineering teams to design scalable infrastructure solutions, optimize developer workflows, and ensure platform reliability and efficiency.

Role and Responsibilities
Team Leadership & Mentorship: Lead and manage a geographically distributed team, fostering growth, engagement, and professional development. Mentor engineers, conduct performance reviews and career growth planning, and encourage knowledge-sharing across R&D teams.
Cloud & Kubernetes Management: Guide the design and implementation of scalable multi-cluster Kubernetes environments across GCP & AWS.
Developer Experience & Enablement: Oversee the development of self-service tools and automation to improve efficiency for R&D teams.
Incident & Reliability Engineering: Collaborate with engineering teams to optimize cost, performance, and reliability of production infrastructure through monitoring, capacity planning, and scaling strategies.
Security & Governance: Drive best practices for RBAC, IAM, cloud security, and compliance, ensuring robust infrastructure security.
Automation & Infrastructure as Code: Promote adoption of GitOps workflows and Infrastructure as Code (Terraform, Helm, Crossplane) for improved automation and consistency.
Cross-Team Collaboration: Align cloud infrastructure goals with business needs by working closely with engineering, security, and product teams.
Requirements:
7+ years of DevOps, SRE, or Platform Engineering experience.
5+ years working with public cloud platforms (AWS/GCP) at scale.
Senior-level Kubernetes expertise, including experience managing enterprise-grade, multi-cluster environments.
Experience with Infrastructure as Code (Terraform, Helm) and familiarity with GitOps principles (ArgoCD, FluxCD, etc.).
Familiarity with observability and monitoring tools (Prometheus, Grafana, Datadog, OpenTelemetry, etc.).
Proficiency in scripting and automation (Python, Go, Bash) for infrastructure management.
Knowledge of cloud networking (VPC, load balancers, service meshes) and security best practices (RBAC, IAM, security groups, network policies).
Experience with CI/CD pipelines, optimizing for performance, security, and developer velocity.
This position is open to all candidates.
 
8152212
4 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a highly skilled and versatile Senior DevOps Engineer to join our development team. As a Senior DevOps Engineer, you'll play a vital role in ensuring the reliability, scalability, and performance of our systems while collaborating with cross-functional teams to deliver high-quality software products.


Responsibilities:
Take an active part in all DevOps areas: develop and maintain monitoring and alerting infrastructure to ensure system reliability and performance
Oversee building and maintaining tools and procedures for monitoring, deployment, and alerting for our SaaS multi-tenant product family
Design and implement CI/CD processes for continuous integration and deployment of software applications
Develop and manage a containerized production environment using technologies such as Kubernetes, Docker, and Helm
Define and enforce DevOps standards, best practices, and procedures across the organization
Provide ad-hoc custom solutions to meet the technical needs of other teams
Act as a resource and mentor for engineers with less DevOps experience, providing guidance and support
Requirements:
5+ years of experience as a DevOps Engineer
Hands-on experience with any public cloud provider (such as GCP)
Hands-on experience with Kubernetes and Docker containers
Strong knowledge of IaC tools such as Terraform and Helm charts
Hands-on experience with CI/CD automation using GitHub Actions and ArgoCD
Experience with ELK Stack (Elasticsearch, Logstash, Kibana) for log analysis and monitoring
Experience in architecting and scaling in cloud environments
Extensive experience with the Linux operating system and proficiency in Bash scripting
This position is open to all candidates.
 
8199545
20/05/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior DevOps Engineer to join our Cloud Network Security group.

Key Responsibilities
As a DevOps Engineer at Check Point, you will design, implement, and manage CI/CD pipelines, collaborate with cross-functional teams, and ensure the high availability and reliability of our cloud-based services and solutions.

Responsibilities:

Design, implement, and manage CI/CD pipelines to automate the deployment of SaaS
Collaborate with development, QA, and operations teams to ensure smooth and reliable software releases.
Monitor system performance and troubleshoot issues to ensure high availability and reliability of our services.
Implement and manage infrastructure as code (IaC) using tools like Terraform, CloudFormation and ARM.
Optimize system performance, scalability, and security.
Develop and maintain documentation for infrastructure and deployment processes.
Requirements:
2-4 years of experience in DevOps or a related role, working with distributed systems and SaaS applications.
Proficiency with CI/CD tools such as Gerrit, GitLab CI, and GitHub.
Experience with cloud providers such as AWS, Azure, and GCP.
Solid foundation in cloud account user management and cost optimization (FinOps principles).
Solid understanding of networking, security, and system administration.
Familiarity with logging and monitoring stacks (e.g., Elasticsearch, CloudWatch, Grafana, Prometheus).
Proficiency in scripting (Python, Bash) for automation and tooling.
Solid grasp of IaC & GitOps principles and best practices (Terraform, Helm, ArgoCD, Crossplane).
Knowledge of agile methodologies and practices
Strong knowledge of distributed systems, microservices, and orchestration technologies
Expertise in containerization and orchestration tools like Docker and Kubernetes
This position is open to all candidates.
 
8185035