Sr. Site Reliability Engineer

Posted 1 day ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
What You'll Do:
Design, build, and support tooling, automation, and infrastructure to maximize the reliability, scalability, and performance of our Cognition.
Proactively identify, mitigate, and resolve issues, leveraging AI-driven insights and automation where possible.
Develop robust monitoring, alerting, and incident response strategies; ensure actionable observability across all critical systems.
Drive best practices in CI/CD, Infrastructure-as-Code, environment provisioning, and disaster recovery.
Collaborate closely with engineering teams to build, deploy, and maintain highly available services in production.
Take responsibility for uptime, reliability, and the operational excellence of our Cognition.
Help define and measure SLOs/SLAs to ensure world-class service delivery.
Requirements:
3+ years in Site Reliability, DevOps, or related Infrastructure Engineering roles in 24/7 production environments.
Deep experience operating, automating, and supporting distributed systems on AWS or similar clouds.
Experience with Infrastructure-as-Code (e.g., Terraform, CloudFormation) and CI/CD tooling (e.g., Jenkins, GitHub Actions).
Strong skills in Python, Bash, or comparable scripting languages for automation.
Hands-on experience with observability stacks (e.g., New Relic, Grafana, CloudWatch, Datadog) and incident response.
Familiarity with microservices architectures and patterns for resilience/scalability (e.g., throttling, retries, circuit breakers).
Experience with common data stores (MySQL/RDS, DocumentDB, Elasticsearch, Redis).
Working knowledge of Node.js/TypeScript backends (bonus: performance optimization and monitoring); experience with Java, Python, or Go is a plus.
Interest or experience in applying AI for infrastructure automation, monitoring, or optimization (a strong plus).
A collaborative mindset with strong communication skills, able to work independently and comfortably across teams and disciplines.
Thrives in a fast-paced, high-growth environment and ready to tackle complex system challenges at scale.
Data-driven, analytical thinker with the ability to dive into metrics, identify insights, and drive product improvements.
Startup-ready: thrive in fast-paced, ambiguous environments; bias for learning, action, and innovation.
This position is open to all candidates.
 
Job ID: 8366060
Similar jobs that may interest you
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
DevOps Engineer
As a key member of our engineering team, you'll work at the intersection of development, operations, and reliability. You'll automate cloud infrastructure, ensure system performance, and maintain secure, scalable deployments in a regulated fintech environment.
Responsibilities:
Manage and enhance cloud infrastructure (AWS, GCP, Azure, or similar).
Develop, maintain, and automate CI/CD pipelines to streamline application delivery.
Implement Infrastructure as Code (e.g., Terraform, Ansible, CloudFormation) for provisioning and managing environments.
Set up and maintain monitoring, observability, and alerting systems using tools like Prometheus, Grafana, Splunk, New Relic, ELK, etc.
Define, track, and act upon SRE metrics (SLIs, SLOs, error budgets) to balance reliability and development velocity.
Participate in incident response, including root cause analysis and remediation.
Automate repetitive tasks to reduce toil and increase system resiliency and uptime.
Collaborate with developers and security teams to embed security and compliance best practices (e.g., PCI DSS, DevSecOps).
Support on-call rotation and continuously improve operational processes.
Requirements:
5-8 years of experience in DevOps, SRE, or related engineering roles.
Proven experience working with at least one cloud provider (AWS, GCP, Azure).
Proven experience with containerization and orchestration (Docker, Kubernetes, GKE).
Proficiency in CI/CD tooling (e.g., GitLab CI, Jenkins, GitHub Actions).
Hands-on experience with Infrastructure as Code tools (Terraform, Ansible, CloudFormation).
Strong command of monitoring and observability tools (Prometheus, Grafana, ELK stack, Splunk, New Relic).
Solid scripting ability in Python, Bash, or similar.
Familiarity with Linux/Unix systems, networking, and basic system administration.
Comfortable working in fast-paced, collaborative environments and able to handle operational incidents effectively.
Excellent communication skills and a mindset geared toward continuous learning and improvement.
Nice to Have:
Exposure to containerization and orchestration (Docker, Kubernetes, GKE).
Understanding of SLA/SLI/SLO frameworks, error budgets, and reliability engineering principles.
Awareness of financial compliance standards like PCI DSS.
Knowledge of DevSecOps practices (security-as-code, shifting security left).
Familiarity with incident management and on-call culture.
This position is open to all candidates.
 
Job ID: 8365930
07/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Site Reliability Engineer to join our DevOps team.
You will ensure the reliability, performance, and scalability of our back-office solutions, which serve as the foundation for the entire purchasing process. This role will lead the development of SRE capabilities, meeting SLI/SLO/SLA targets, and establishing effective monitoring systems.
You will enhance our Software Development Lifecycle by integrating reliability and scalability, working with cross-functional teams, and supporting production environments.
Additionally, you will implement incident management processes and conduct post-mortem analyses to drive continuous improvement. If you have a strong engineering and automation background and are passionate about the E-commerce field, then we would love to hear from you.
Roles and Responsibilities:
Develop and implement SRE capabilities to enhance the reliability, availability, and performance of Admin solutions.
Design and maintain proactive monitoring and alerting systems for deep visibility into critical business flows, beyond simple statuses, to identify functional issues.
Drive improvements in the Software Development Lifecycle (SDLC) for reliability and scalability from design to deployment.
Collaborate with development and operations teams to troubleshoot production incidents affecting the purchase flow through root cause analysis.
Lead SRE initiatives to boost system resilience and operational efficiency.
Implement best practices for incident management and conduct blameless post-mortems, contributing to capacity planning and performance testing to ensure scalability.
Requirements:
5+ years of experience as a Site Reliability/DevOps Engineer
Deep understanding of E-commerce flows, specifically with back-office operations and order processing - must
Experience as an Automation/Software Engineer with a strong understanding of software development principles and in building, testing, and deploying distributed systems - must
Experience in designing, implementing, and utilizing monitoring and observability platforms such as Datadog, New Relic, Prometheus/Grafana, or ELK stack - must
Proficiency in scripting and automation using languages such as Python, Java, etc. - must
Ability to create dashboards, alerts, and insightful queries - must
Experience with AWS services to build and operate scalable and resilient applications (e.g., EC2, ECS/EKS, RDS, S3, Lambda, CloudWatch) - plus
Experience in automating infrastructure provisioning, application deployments, and repetitive operational tasks - plus
Proactive approach with excellent problem-solving skills
Strong collaborator, with an ability to work with cross-functional teams
Proficient in English
This position is open to all candidates.
 
Job ID: 8336258
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a DevOps engineer to take the lead on one of the most critical technical challenges in our business: how we deploy software at scale.
Today, deploying our core system into each store is a complex, multi-stage process. Tomorrow, it needs to be seamless, automated, and capable of onboarding dozens of stores per week without sacrificing versatility and quality.
This role isn't just about CI/CD or scripting. It's about refining the automation infrastructure that enables repeatable, self-service deployments across hundreds of live, mission-critical environments. You'll sit at the heart of our Ops Technology team, working at the intersection of system engineering and in-store execution, and serving as the technical backbone for deployment scale.
You'll help shape and evolve our DevOps toolset, working hands-on with cutting-edge technologies to streamline deployments, boost reliability, and scale our platform with speed and confidence. For the right person, this role is a stepping stone toward technical leadership in one of our most strategic teams.
A day in the life:
Design and build the software deployment framework that powers in-store systems at scale
Lead remote deployments using Octopus Deploy, driving repeatability and automation
Set up and manage Kubernetes clusters for scalable microservices in diverse environments
Deploy and monitor services with a focus on resilience, observability, and recovery
Troubleshoot complex issues across CI/CD, environments, services, and infrastructure
Collaborate with System Engineering and SRE peers to ensure smooth end-to-end deployment flows
Continuously evolve our CI/CD pipelines, deployment logic, and infrastructure-as-code practices
Build tooling, templates, and documentation to enable fast, low-touch deployments by others
Serve as a technical leader and force multiplier within the Ops Tech team
Requirements:
3+ years in DevOps, Infrastructure, or SRE roles with deep end-to-end ownership
Solid background in Kubernetes, container orchestration, and microservices
Experience deploying and supporting systems in live production environments
Strong CI/CD skills with tools like GitHub Actions, GitLab CI, or Jenkins
Scripting proficiency in Bash, PowerShell, or Python
Familiarity with monitoring, alerting, and diagnostics (Prometheus, Grafana, etc.)
Experience with infrastructure-as-code tools like Terraform, Helm, etc.
Excellent troubleshooting skills and a bias toward automation and scale
Strong communication and the ability to work cross-functionally and independently
Nice to have:
Experience with Azure and GCP
Understanding of K8s networking, service meshes, ingress controllers
Passion for enabling others through tools, documentation, or mentorship
Proven ability to drive change in fast-moving, operationally complex environments
Expertise with Octopus Deploy and experience deploying software remotely
This position is open to all candidates.
 
Job ID: 8325803
28/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a talented and motivated DevOps Engineer to join our Cloud Engineering team. You'll play a key role in developing, maintaining, and scaling our SaaS product across AWS, Azure, and GCP. This includes managing our deployment packages (Terraform, CloudFormation, Azure Bicep), ensuring seamless integrations with customer environments, and enabling secure, reliable data scanning at scale.
As part of our DevOps team, you'll not only drive automation and infrastructure management, but also participate in customer-facing installation meetings, helping customers deploy and configure the platform successfully.
Responsibilities:
Design, develop, and maintain cloud infrastructure on AWS, Azure, and GCP.
Manage Infrastructure as Code using Terraform, CloudFormation, and Azure Bicep.
Build, scale, and maintain Kubernetes clusters and containerized applications.
Implement automation for deployment, monitoring, and incident response.
Write and maintain Python and Bash scripts for automation, integrations, and tooling.
Troubleshoot networking, connectivity, and security issues (TCP/IP, UDP, VPNs).
Collaborate with engineering and product teams to optimize deployments.
Support customer onboarding by assisting with setup and deployment meetings.
Continuously improve CI/CD pipelines and operational processes.
Requirements:
3-5+ years of experience in DevOps, Cloud Engineering, or related roles.
Hands-on expertise with at least two major cloud providers (AWS, Azure, GCP; experience with all three is a plus).
Strong programming skills in Python and Bash (automation, tooling, scripts).
Proficiency in Linux systems administration.
Strong experience with Kubernetes, Docker, and container orchestration.
Deep understanding of networking fundamentals (TCP/IP, UDP, DNS, VPNs, routing, firewalls).
Experience with IaC tools: Terraform, CloudFormation, Azure Bicep.
Familiarity with CI/CD tools (GitHub Actions, GitLab CI, or similar).
Excellent problem-solving and troubleshooting skills.
Excellent communication skills, with the ability to work directly with customers.
Observability stack experience (Datadog, Prometheus, Grafana, ELK, etc.).
Nice to Have:
Experience in SaaS environments or multi-cloud deployments.
Security best practices and compliance knowledge (IAM, RBAC, data protection).
This position is open to all candidates.
 
Job ID: 8358649
10/09/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We are looking for a Site Reliability Engineer (SRE) to join our Engineering team. Someone who has a passion for observability, monitoring, automation, and high-availability systems, and who has a desire to solve complex technological challenges with a proactive approach to continuous improvement.
We use an interesting and mixed technology stack: Kubernetes, Terraform, CI/CD pipelines, Datadog, Prometheus, and cloud-native architectures.
In this position, you will use your expertise in building and scaling SRE operations, and will design, implement, and operate a world-class reliability strategy.
About Us
We are a key player in the network security field, striving to provide the leading SASE platform in the market. Our innovative approach, merging cloud and on-device protection, redefines how businesses connect in the era of cloud and remote work.
Key Responsibilities
Develop and maintain our monitoring, alerting, and logging systems, ensuring high visibility into production environments.
Implement automation to improve system reliability, scalability, and efficiency.
Troubleshoot and resolve production incidents, leading root cause analyses and implementing permanent fixes.
Collaborate with software engineers and DevOps teams to enhance application performance and resilience.
Continuously improve operational processes, focusing on reducing toil and improving reliability.
Requirements:
3+ years of experience as an SRE, DevOps Engineer, or in a similar role.
Hands-on experience with monitoring and observability tools like Datadog, Prometheus, and Grafana.
Strong understanding of Linux systems, networking, and cloud-native architectures.
Experience with Kubernetes, Terraform, and CI/CD pipelines.
A problem solver, capable of finding creative solutions and getting things done.
Fluent with incident management, RCA processes, and operational best practices.
It would be great if you also have:
Experience in high-scale distributed systems.
Background in security and compliance for cloud infrastructure.
Familiarity with AWS (EKS, EC2, RDS, S3, networking configurations).
Proficiency in Python, Go, or Bash for automation and scripting.
Understanding of cost optimization and resource management in cloud environments.
Familiarity with machine learning or predictive analytics for proactive reliability management.
This position is open to all candidates.
 
Job ID: 8341627
31/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a DevOps Team Leader to lead our DevOps efforts as we scale. This is a hands-on leadership role requiring deep technical expertise, strong project and people management skills, and the ability to navigate complex stakeholder environments.

The ideal candidate will lead the day-to-day operations of the DevOps team, ensure operational excellence, drive platform improvements, and manage cross-functional alignment with Dev, Finance, Security, and senior management.

Responsibilities
Team Leadership & Execution

Lead, mentor, and grow a high-performing DevOps team.
Manage day-to-day operations including incident management and task prioritization.
Ensure SLAs and compliance requirements are met.
Balance proactive platform improvements with reactive issue resolution.
Platform Ownership

Oversee the design, implementation, and maintenance of a secure, scalable, multi-region AWS infrastructure.
Own CI/CD pipelines, infrastructure-as-code (IaC), observability (logs, metrics, tracing), and automation tooling.
Ensure robust disaster recovery (DR) and business continuity practices are in place and regularly tested.
Stakeholder Collaboration

Act as the main point of contact between DevOps and external/internal stakeholders: Banks, Regulators, Security teams, NOC/SOC, and Development teams.
Communicate clearly on priorities, incidents, risks, timelines, and platform status.
Represent DevOps in cross-functional planning and reviews.
Process & Standards

Define and evolve the DevOps team's SDLC, deployment standards, and incident response processes.
Drive best practices in monitoring, alerting, and reliability engineering.
Champion a culture of ownership, transparency, and continuous improvement.
Requirements:
7+ years of DevOps / SRE experience, including at least 2 years in a leadership or tech lead role.
Proven experience managing multi-region AWS production environments.
Strong skills in Terraform, Kubernetes, CI/CD (e.g., GitHub Actions), observability tools (e.g., Datadog, Prometheus, OTLP).
Hands-on experience with high-availability systems, disaster recovery, and compliance-driven environments.
Ability to balance short-term firefighting with long-term vision.
Excellent communication and stakeholder management skills.
This position is open to all candidates.
 
Job ID: 8326597
Posted 6 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As part of the DevOps org, you'll play a key role in shaping the production infrastructure that supports our engineering platforms: containerized, microservice-based, and cloud-native. You'll work side-by-side with R&D to evolve our CI/CD pipelines, improve system reliability, and scale production infrastructure that supports our global growth.

What you'll do:

Level up our CI/CD architecture using Argo Workflows, GitHub Actions, and Kubernetes-native deployment strategies
Enhance our build & release pipelines for scalable, multi-arch Docker images, with quality gates and dynamic testing
Lead efforts to evolve our test automation framework across microservices (REST APIs, async flows, cloud services)
Collaborate with R&D on implementing progressive delivery, canary deployments, and real-time alerting with Datadog and OpenTelemetry.
Build and scale production infrastructure that works across multi-region cloud environments (AWS/GCP), using Terraform and Helm
Drive a DevEx-first culture: integrate quality tooling (e.g., SonarQube, Trivy, Checkov) into developer workflows
Requirements:
5+ years of hands-on experience in DevOps / Automation / Platform Engineering
2+ years of hands-on experience with at least one cloud provider (AWS/GCP/Azure)
Experience with one or more: Java, Python, or Bash
Strong background in CI/CD, containers, Kubernetes, and modern SDLC
Experience with tools like GitHub Actions, ArgoCD, Jenkins, Groovy, Shell/Bash
Docker, Helm, Terraform, Vault, Elastic Stack, GitOps, Kustomize, Open Policy Agent (OPA) a plus!
Comfortable working with REST APIs, asynchronous testing, and cloud-native test strategies.
Deep understanding of agile software delivery, including shift-left practices and test automation
Fluent in Hebrew and English
This position is open to all candidates.
 
Job ID: 8363412
31/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Engineer to join our DevOps R&D team.

In this position, you will be responsible for integrating developers and operations teams to improve collaboration and productivity by automating infrastructure, automating workflows, and continuously measuring application performance.

A day in the life and how you'll make an impact:
Establish, maintain, and evolve concepts in continuous integration and deployment (CI/CD) pipelines for existing and new services.
Collaborate with Engineering and Operations teams to improve automation of workflows, infrastructure, code testing, and deployment of on-premise and cloud services.
Remain up-to-date on industry trends, share knowledge among teams, and abide by industry best practices for configuration management and automation.
Implement effective monitoring and increase the sophistication of our alerting and escalation mechanisms
Identify and resolve performance and scalability issues in products and infrastructure.
Requirements:
5+ years of experience in systems and production engineering and 3+ years of DevOps experience in a Linux environment.
Experience maintaining and deploying highly available, fault-tolerant systems at scale.
Experience developing in Python and scripting in Bash.
Practical experience with Docker containerization and clustering (Kubernetes).
Experience with configuration management tools (e.g., Ansible, Terraform).
Experience implementing CI/CD (e.g., Jenkins, GitHub Actions, Bitbucket Pipelines).
Experience with cloud providers (e.g., AWS, GCP).

Ideally, we're looking for:
Bachelor's or master's degree in CS.
AWS Certification.
Experience working in and advocating for agile environments.
Knowledge of Linux Kernel fundamentals, including job management, memory management, file systems, networking & debugging.
This position is open to all candidates.
 
Job ID: 8325256
11/09/2025
Location: Tel Aviv-Yafo
Job Type: Full Time and Hybrid work
We're looking for a Technical Operations Team Lead to join our platform group and drive execution across technical operations, infrastructure, and delivery processes. This role blends leadership, technical depth, and cross-functional collaboration and is perfect for someone who thrives in fast-paced, high-impact environments.
You'll lead a hybrid team (on-site and offshore), work with global stakeholders, and partner with our Program Managers to track delivery in Jira and bring structure to rapidly evolving workflows. This is a key role in helping us scale and operate more efficiently as we build the future of secure cloud connectivity.
About Us
We are a key player in the network security field, striving to provide the leading SASE platform in the market. Our innovative approach, merging cloud and on-device protection, redefines how businesses connect in the era of cloud and remote work.
Key Responsibilities
Lead and mentor a cross-functional team (including offshore members)
Own execution planning, delivery tracking, and team velocity
Collaborate with Program Managers to define, track, and report work in Jira
Implement and optimize working procedures for technical teams
Manage Cloud IT systems and user lifecycle (e.g., Okta/Azure, our SaaS access).
Oversee 3rd party service providers, procurement, and budgeting.
Design and implement automation (Python preferred), including AI-driven workflows.
Support R&D and QA teams with cloud infrastructure, internal developer tools, and dev environments.
Partner with Customer Success, Sales Engineering, and Field Ops.
Ensure compliance with internal security policies by working closely with the CISO and applying SecOps requirements across Cloud IT systems and operations.
Requirements:
2+ years of experience in technical operations, IT, infrastructure, or DevOps roles.
1+ years leading a team of engineers
Proven ability to define and roll out scalable processes in technical teams
Proficiency with Jira and cross-team collaboration with Program Managers
Strong scripting and automation skills (preferably in Python)
Familiarity with AI tools
Hands-on experience with identity and access management (Okta or similar)
Experience managing external vendors, purchasing, and cloud budgets
Cloud-native mindset; experience working with cloud-native architectures, especially in AWS, and with containerized environments (Docker, Kubernetes).
Familiarity with security and compliance frameworks; experience working with SecOps and applying security requirements in collaboration with CISO
Experience working in enterprise environments and navigating cross-functional collaboration with multiple stakeholders and peer organizations
A strong communicator who can lead by example, align technical execution with product vision, and support a growing team.
It would be great if you also have:
B.Sc. in Computer Science, Software Engineering, or a related field.
Field enablement exposure or FinOps experience.
Knowledge of performance optimization, caching, and distributed system design.
Familiarity with authentication, authorization, and security best practices.
A passion for mentoring and building a high-performance, collaborative team culture.
This position is open to all candidates.
 
Job ID: 8342642
31/08/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Engineer to take ownership of our Cloud Infrastructure and Platform Engineering strategy, enabling high-scale, cutting-edge GenAI products running across 40+ Kubernetes clusters on GCP and AWS.
This is a hands-on engineering role requiring deep expertise in cloud-native technologies, Kubernetes at scale, and modern DevOps principles. You will work closely with engineering teams to design and implement scalable infrastructure solutions, optimize developer workflows, and ensure reliability and efficiency across our platform.
Role and Responsibilities:
Cloud & Kubernetes Expertise: Design and implement highly scalable multi-cluster Kubernetes environments across GCP & AWS.
Developer Experience & Enablement: Lead the development of self-service tools and automation that improve efficiency for R&D teams.
Incident & Reliability Engineering: Work with engineering teams to optimize cost, performance, and reliability of production infrastructure through monitoring, capacity planning, and scaling strategies.
Security & Governance: Contribute to best practices for RBAC, IAM, cloud security, and compliance while ensuring infrastructure reliability.
Automation & Infrastructure as Code: Drive adoption of GitOps workflows and Infrastructure as Code (Terraform, Helm, Crossplane) to enhance automation and consistency.
Mentorship & Team Growth: Provide technical mentorship within the platform engineering team and contribute to knowledge-sharing across R&D.
Cross-Team Collaboration: Work closely with engineering teams to align cloud infrastructure goals with business needs and reliability requirements.
Requirements:
5+ years of DevOps, or SRE experience
3+ years working with public cloud platforms (AWS, GCP) at scale
Deep Kubernetes expertise, including managing large-scale, multi-cluster enterprise-grade Kubernetes environments
Experience designing and managing Custom Resource Definitions (CRDs) and custom controllers
Strong background in Infrastructure as Code (Terraform, Helm) and GitOps principles (ArgoCD, Crossplane, FluxCD, etc.)
Hands-on experience in observability & monitoring (Prometheus, Grafana, Datadog, OpenTelemetry, etc.)
Proficiency in scripting & automation (Python, Go, Bash) for infrastructure automation
Expertise in cloud networking (VPC, load balancers, service meshes) and security best practices (RBAC, IAM, security groups, network policies, etc.)
Experience with CI/CD pipelines, optimizing for performance, security, and developer velocity
Nice-to-Have:
Experience with self-hosted on-prem deployments and managed private VPC deployments (Bring Your Own Cloud models)
Advanced expertise in Helm and Crossplane for Kubernetes resource management.
Other cloud provider experience
Experience in GenAI or large-scale SaaS platforms
Familiarity with SQL/NoSQL databases and distributed systems
DevSecOps experience, with a strong understanding of security automation and compliance frameworks
This position is open to all candidates.
 
Job ID: 8326421