דרושים » תוכנה » Sr Staff Site Reliability Engineer

משרות על המפה
 
בדיקת קורות חיים
VIP
הפוך ללקוח VIP
רגע, משהו חסר!
נשאר לך להשלים רק עוד פרט אחד:
 
שירות זה פתוח ללקוחות VIP בלבד
AllJObs VIP
כל החברות >
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 2 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Site Reliability Engineer on the SASE Platform team, you will play a critical role in building and operating highly available, secure, and globally distributed services. Your mission is to ensure our cloud-native security and networking platform is reliable, scalable, and performant from day one, protecting the users, applications, and data for the world's largest enterprises as they adopt cloud, remote work, and AI
Your Impact:
Proactively collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages.
Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance.
Build and operate automation for provisioning, deploying, and managing global infrastructure using Infrastructure as Code (IaC).
Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments.
Drive observability best practices, including metrics, logs, traces, and SLIs/SLOs to enable data-driven incident analysis.
Participate in on-call rotations, reducing mean time to resolution (MTTR) through automation and proactive reliability improvements.
Challenge existing processes by championing reliability, security, and operational maturity across the organization.
Requirements:
Your Experience
5+ years of experience working with Unix/Linux systems, including shell, tools, networking, and kernel concepts.
2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms.
Proven experience operating workloads in public cloud environments (e.g., AWS, GCP, Azure) at scale.
Proficiency in building automation and tools in at least one scripting or programming language (e.g., Python, Go, Java).
Strong experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Bachelors degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience.
Nice to have:
Deep expertise in designing and operating monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK Stack).
Advanced networking expertise, including TCP/IP, DNS, BGP, routing, and cloud networking concepts relevant to SASE architectures.
Prior experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms.
Familiarity with using AI/LLM technologies to improve operational workflows (e.g., incident analysis, automation).
This position is open to all candidates.
 
Hide
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8638041
סגור
שירות זה פתוח ללקוחות VIP בלבד
משרות דומות שיכולות לעניין אותך
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 12 שעות
דרושים בCrowdStrike
Location: Tel Aviv-Yafo
Job Type: Full Time
CrowdStrike's Data Science Studio is seeking a pioneering Senior MLOps Engineer to establish and lead our MLOps function from the ground up. As the first MLOps engineer in the studio, you will play a foundational role in shaping how we build, deploy, and scale machine learning systems that protect thousands of organizations worldwide.

This is a unique opportunity to define the technical strategy, influence the technology stack, and architect the infrastructure that will power our AI/ML-driven security solutions for years to come.

This role combines strategic vision with hands-on execution. You'll work at the intersection of data science, engineering, and production operations - building production-grade systems that operate at immense scale while collaborating closely with highly technical data scientists and ML engineering teams across CrowdStrike.

What You'll Do:
- Architect MLOps infrastructure from the ground up: Design and implement the foundational MLOps platform, establishing best practices, tooling, and workflows that will scale with our growing data science initiatives
- Define technology strategy: Evaluate, select, and integrate MLOps technologies and platforms that best serve our needs - from experiment tracking and model versioning to deployment pipelines and monitoring systems
- Build production-grade ML pipelines: Develop robust, scalable pipelines for model training, validation, deployment, and monitoring that handle massive data volumes and ensure reliability in production
- Enable data scientist productivity: Create tools, frameworks, and automation that empower data scientists to move quickly from research to production while maintaining high quality and reliability standards
- Establish monitoring and observability: Implement comprehensive monitoring, logging, and alerting systems to ensure ML models perform optimally in production and issues are detected proactively
- Drive MLOps culture and practices: Champion best practices in ML engineering, CI/CD for ML, model governance, and reproducibility across the data science organization
- Collaborate cross-functionally: Partner closely with data scientists to understand their workflows and pain points, and work with ML engineering teams to ensure seamless integration with broader platform capabilities
 -Scale for the future: Design systems with scalability, security, and maintainability in mind, anticipating the needs of a rapidly growing ML portfolio
Requirements:
- 6+ years of experience in MLOps, ML engineering, DevOps, or related infrastructure roles with focus on machine learning systems
- Production ML systems expertise: Proven track record of building and operating ML systems at scale in production environments
- Strong infrastructure and automation skills: Deep knowledge of cloud platforms (AWS, Azure, or GCP), containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, CloudFormation)
- ML pipeline proficiency: Hands-on experience with ML workflow orchestration tools (e.g., Airflow, Kubeflow, MLflow, Metaflow) and building end-to-end ML pipelines
- Programming excellence: Strong coding skills in Python; experience with additional languages is a plus
- CI/CD and DevOps practices: Expertise in building automated deployment pipelines, version control, and modern DevOps methodologies
- Strategic and hands-on balance: Ability to think architecturally about long-term solutions while rolling up your sleeves to implement them
- Collaborative mindset: Excellent communication skills and ability to work effectively with data scientists, engineers, and stakeholders with varying technical backgrounds
- Startup mentality: Comfort with ambiguity and ability to build from scratch in a fast-paced environment
This position is open to all candidates.
 
Show more...
הגשת מועמדות
עדכון קורות החיים לפני שליחה
8611396
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 1 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Site Reliability Engineer on the SASE Platform team, you will play a critical role in building and operating highly available, secure, and globally distributed services. Your mission is to ensure our cloud-native security and networking platform is reliable, scalable, and performant from day one, protecting the users, applications, and data for the world's largest enterprises as they adopt cloud, remote work, and AI.
Key Responsibilities
Proactively collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages.
Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance.
Build and operate automation for provisioning, deploying, and managing global infrastructure using Infrastructure as Code (IaC).
Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments.
Drive observability best practices, including metrics, logs, traces, and SLIs/SLOs to enable data-driven incident analysis.
Participate in on-call rotations, reducing mean time to resolution (MTTR) through automation and proactive reliability improvements.
Challenge existing processes by championing reliability, security, and operational maturity across the organization.
Requirements:
Your experience:
5+ years of experience working with Unix/Linux systems, including shell, tools, networking, and kernel concepts.
2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms.
Proven experience operating workloads in public cloud environments (e.g., AWS, GCP, Azure) at scale.
Proficiency in building automation and tools in at least one scripting or programming language (e.g., Python, Go, Java).
Strong experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Bachelors degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience.
Preferred Qualifications
Deep expertise in designing and operating monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK Stack).
Advanced networking expertise, including TCP/IP, DNS, BGP, routing, and cloud networking concepts relevant to SASE architectures.
Prior experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms.
Familiarity with using AI/LLM technologies to improve operational workflows (e.g., incident analysis, automation).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8638178
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
At our company, we believe people are capable of more than a single job description. Youre not hired just to fill a position- youre empowered to shape it, grow it, and make it your own.
We call this being Positionless.
And Positionless isnt just our culture. Its our product.
we are the creator of Positionless Marketing, an AI-powered platform that gives every marketer the power to analyze, create, launch, and optimize independently. The result is faster execution, deeper personalization, and 88% greater campaign efficiency.
Recognized as a Visionary in Gartners Magic Quadrant, we partner with leading brands like Sephora, Staples, and Entain. Today, more than 550 our company's across NYC, London, Tel Aviv, Scotland, Brazil, Estonia, and beyond are building the future of marketing together, in an environment that actively encourages ownership and growth, with two out of every three managers promoted from within.
If youre looking for a place where you can do more, be more, come grow with us.
Are you passionate about ensuring system reliability, scalability, and performance? Do you thrive in a dynamic environment where automation and operational excellence are key?
we are looking for a Site Reliability Engineer (SRE) to join our team and play a crucial role in designing, implementing, and maintaining our cloud-based infrastructure. In this role, you will collaborate across teams to drive automation, improve system resilience, and optimize performance while fostering a culture of reliability.
Responsibilities:
System Reliability- Ensure high availability and performance of services through effective monitoring, incident management, and root cause analysis.
Automation & Tooling- Develop and maintain automation for infrastructure provisioning, configuration management, and application deployment.
Performance Optimization- Analyze and enhance system performance, including load balancing, caching, and database tuning. Conduct regular capacity planning.
Incident Response & Troubleshooting- Lead incident response efforts, participate in on-call rotations, and troubleshoot complex infrastructure issues.
Security & Compliance- Collaborate with security teams to implement best practices and ensure compliance with relevant standards (ISO 27001, SOC 2, etc.).
Collaboration & Mentorship- Work closely with developers, DevOps, Support, and product teams to enhance application reliability and implement SRE best practices.
Requirements:
4+ years in Site Reliability Engineering, DevOps, or related roles.
Proven experience managing large-scale, cloud-based infrastructure in GCP, AWS, or Azure.
Expertise in container orchestration (Kubernetes, Docker) and microservices architecture.
Strong proficiency in scripting and programming languages (Python, Go, Bash, etc.).
Experience with CI/CD pipelines, infrastructure as code (Terraform, CloudFormation), and configuration management (Ansible, Puppet, Chef).
Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK Stack).
Experience using AI tools to enhance SRE processes, such as intelligent monitoring, incident prediction, and automation of incident response.
Deep understanding of networking concepts, DNS, load balancing, and distributed systems.
Strong problem-solving skills, excellent communication, and a proactive mindset.
Advantages:
Certifications- AWS Certified Solutions Architect, GCP Professional Cloud Architect, or Kubernetes certifications (CKA, CKAD).
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8594736
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
30/03/2026
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
The DevOps Engineer builds, automates, and operates cloud‑native infrastructure across AWS and Red Hat OpenShift, enabling scalable, secure, and reliable application delivery. This role combines hands‑on platform engineering, CI/CD automation, container orchestration, and the integration of AI‑powered tools for observability, anomaly detection, and operational efficiency. The engineer collaborates closely with development, security, and SRE teams to streamline deployments and improve system resilience.
Core Responsibilities
Cloud & Platform Engineering
Design, deploy, and maintain cloud‑native infrastructure on AWS (EC2, VPC, IAM, EKS, S3, RDS, Lambda).
Operate and optimize Red Hat OpenShift clusters, including cluster upgrades, operator management, and workload orchestration.
Implement Infrastructure‑as‑Code using Terraform, CloudFormation, or Ansible.
Build secure, scalable network architectures including VPC design, load balancing, service mesh, and ingress/egress controls.
CI/CD & Automation
Develop and maintain CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, or Argo Workflows.
Automate build, test, and deployment workflows for microservices and containerized applications.
Implement GitOps practices using Argo CD or Flux.
Create reusable automation modules and scripts in Python, Bash, or Go.
Containers & Kubernetes
Manage containerized workloads using Docker, Kubernetes, and OpenShift Operators.
Configure namespaces, RBAC, secrets, ConfigMaps, and resource quotas.
Troubleshoot cluster performance, networking, and scheduling issues.
Support service mesh technologies (Istio, Linkerd) when applicable.
AI‑Driven Operations
Integrate AI/ML‑based tools for monitoring, anomaly detection, predictive scaling, and automated remediation.
Work with data and platform teams to operationalize AI/ML pipelines on Kubernetes or OpenShift.
Evaluate emerging AI‑Ops platforms and contribute to automation strategies.
Observability & Reliability
Implement monitoring, logging, and tracing using Prometheus, Grafana, ELK, Loki, CloudWatch, or Datadog.
Build alerting, dashboards, and SLO‑based reliability metrics.
Participate in on‑call rotations and incident response, driving root‑cause analysis and long‑term fixes.
Security & Compliance
Apply DevSecOps practices including image scanning, secrets management, and policy enforcement.
Work with security teams to implement IAM best practices, encryption, and compliance controls.
Integrate tools such as Vault, OPA/Gatekeeper, or Kyverno.
Requirements:
2-5 years of experience in DevOps, cloud engineering, or platform operations.
Strong hands‑on experience with AWS services and cloud architecture fundamentals.
Practical experience with Kubernetes and Red Hat OpenShift.
Proficiency with Terraform, Ansible, or similar IaC tools.
Experience building CI/CD pipelines and automating deployments.
Solid Linux administration and networking fundamentals.
Scripting skills in Python, Bash, or Go.
Understanding of container security, cloud security, and DevSecOps practices.
Preferred Qualifications
Certifications: AWS Solutions Architect, CKA/CKAD, Red Hat OpenShift, Terraform Associate.
Experience with AI‑Ops platforms or ML pipeline orchestration.
Familiarity with service mesh, API gateways, or event‑driven architectures.
Experience with multi‑cluster or hybrid cloud environments.
Background in SRE practices (SLOs, error budgets, chaos engineering).
What Success Looks Like
Reliable, automated, and secure cloud‑native infrastructure supporting rapid development cycles.
Stable and observable Kubernetes/OpenShift environments with clear operational metrics.
Reduced manual work through automation and AI‑driven insights.
Strong collaboration with engineering teams and continuous improvement of DevOps practices.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8597223
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
1 ימים
חברה חסויה
מיקום המשרה: תל אביב יפו
סוג משרה: משרה מלאה
We are seeking a skilled and motivated DevOps Infrastructure Engineer to join our DevOps Infra team. Our team is responsible for managing and evolving the cloud-native infrastructure that powers our microservices architecture. Core responsibilities span our EKS-based Kubernetes platform, ArgoCD-driven GitOps pipelines, infrastructure observability, Helm-based deployments, and mission-critical web services running on AWS.
We are looking for a DevOps engineer who can hit the ground running, take ownership of critical infrastructure components, and contribute meaningfully from day one. The ideal candidate brings deep Kubernetes expertise, strong hands-on experience with observability tooling, and the maturity to work independently.
In this role, you will be responsible for:
Managing and evolving our EKS-based Kubernetes platform and Helm-based deployment pipelines
Owning and maintaining GitOps workflows using ArgoCD, including troubleshooting sync and rollout issues
Designing, building, and maintaining observability solutions using Prometheus, VictoriaMetrics, and Grafana
Writing and maintaining infrastructure as code using Terraform, including modules, remote state, and CI/CD automation
Taking full ownership of AWS infrastructure components - including networking, compute, IAM, and storage - ensuring reliability, security, and operational excellence across environments
Collaborating with developers and SREs to support reliable, scalable, and secure AWS infrastructure
דרישות:
1-3 years of hands-on experience in DevOps or infrastructure engineering roles.
Deep expertise in Kubernetes and Helm, including production-grade deployments and live incident troubleshooting.
Strong proficiency in Terraform or equivalent IaC tooling
Solid working knowledge of AWS core services (EC2, IAM, S3, VPC, CloudWatch, EKS).
Practical experience with Prometheus, VictoriaMetrics, Grafana, and alerting stack design.
Proven ability to work independently, take ownership end-to-end, and communicate effectively across engineering teams.
Agentic DevOps experience working with common AI assistant tools, MCPs and Agents.
Advantages:
Experience with cloud cost optimization strategies and tooling.
Background in cloud-native security practices (RBAC, policy enforcement,SSL, MTLS etc).
Prior involvement in designing or operating high-availability, fault-tolerant systems.
Experience with nginx and IIS web servers. המשרה מיועדת לנשים ולגברים כאחד.
 
עוד...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8636122
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a hands-on Senior DevOps Engineer with a strong cloud-native mindset to build, maintain, and evolve our highly scalable, highly-available cloud infrastructure. This role is pivotal in driving operational excellence, security, and automation across our entire engineering organization. You will promote communication, integration, and collaboration to significantly enhance our software development productivity and reliability. You'll work closely with engineering and product teams to streamline delivery, enforce platform standards, and enable a high-velocity development environment-all while keeping reliability and security top of mind.
Responsibilities:
Design, Automate, and Manage complex cloud infrastructure on AWS using best-in-class Infrastructure as Code (IaC) practices.
Lead the operation and enhancement of our production Kubernetes environments (EKS), focusing on automation, security, observability, and seamless CI/CD integration.
Drive continuous improvement across platform tooling, developer experience, and operational processes to meet our ambitious performance and uptime goals.
Implement and enforce security-first infrastructure patterns, including strong IAM, network segmentation, and secure secrets management.
Actively contribute to high-level technical design discussions and cross-functional architectural decision-making, ensuring solutions align with long-term platform strategy.
Requirements:
7+ years of experience as a DevOps Engineer, Platform Engineer, or in a similar infrastructure-focused role.
Strong hands-on expertise across the AWS Stack (e.g. EC2, EKS, RDS, VPC, IAM, S3, Lambda).
Mastery of Infrastructure as Code - Terraform or equivalent.
Deep operational knowledge of Kubernetes, including architecture, cluster management, networking, and advanced debugging in production environments.
Strong expertise in designing and managing CI/CD methodologies and platforms (e.g. Jenkins, Github Actions).
Experience with monitoring tools such as Prometheus, DataDog, Coralogix (OTEL), Grafana etc.
Proven prior experience building and maintaining highly-available, production-grade, and service-oriented systems.
Strong scripting and automation background in languages such as Python or Bash.
Exceptional communication and collaboration skills with the ability to articulate complex technical needs and influence cross-functional teams.
Strong knowledge of AWS Networking - an advantage.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8625782
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 2 שעות
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior DevOps Engineer supporting our Cortex Research Group, you will lead all DevOps and infrastructure initiatives that empower our researchers to move quickly, securely, and reliably. You will be responsible for designing, building, and maintaining the groups cloud environments, ensuring scalability, stability, and performance across a wide range of experimental and production workloads. Youll serve as the primary point of contact between the Research Group and other critical stakeholders-including Security, Networking, and Compliance teams-ensuring that research projects align with organizational standards while still enabling rapid innovation.
Key Responsibilities
Own and evolve the Research Groups cloud infrastructure and CI/CD pipelines to enable reproducible, automated, and scalable experimentation.
Define and implement standards for infrastructure-as-code, observability, monitoring, and resource optimization tailored to research use cases.
Proactively collaborate with security and compliance teams to enforce best practices for data governance, access controls, and regulatory requirements.
Partner with networking and platform engineers to integrate research workloads into the broader company ecosystem, ensuring seamless operation.
Serve as the primary technical liaison between the Research Group and stakeholders like Security, Networking, and Platform teams.
Mentor engineers and researchers on DevOps best practices, helping to instill a culture of operational excellence and applied learning.
Requirements:
Your Experience:
5+ years of demonstrated experience in a DevOps, Site Reliability Engineering (SRE), or cloud infrastructure role.
Strong proficiency with infrastructure-as-code (IaC) tools such as Terraform or Ansible.
Hands-on experience building and maintaining CI/CD pipelines using tools like Jenkins, GitLab CI, or GitHub Actions.
In-depth knowledge of at least one major cloud provider (GCP, AWS, Azure).
Preferred Qualifications
Experience with containerization and orchestration technologies, particularly Docker and Kubernetes.
Proficiency in a scripting or programming language such as Python or Go.
Familiarity with monitoring and observability tools like Prometheus, Grafana, or the ELK stack.
Experience supporting machine learning or research-focused environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8638096
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
05/04/2026
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
we are on a mission to bring identity security everywhere - to every human, machine, and AI agent, both on-prem and in the cloud. Our unique technology secures identities & access at runtime, in ways that werent possible before. With the broadest identity security platform in the market, trusted by more than 1,000 customers including many Fortune 100 companies, we are uniquely positioned to lead the fast-growing identity security category.
Joining our company means becoming part of a fast-moving team with a culture of innovation and collaboration, that goes above and beyond to help our customers and each other, on a journey to reshape the future of identity security.
As a DevOps Engineer, youll design and implement a full CI/CD solution on AWS and Azure with Kubernetes, while supporting and scaling our cloud infrastructure using Infrastructure as Code.
Responsibilities
Design and implement robust, scalable, and highly available cloud solutions to ensure seamless service delivery and support rapid business growth
Focus on cross-service integrations and components within the SaaS stack, adhering to DevOps best practices
Develop, enhance, and maintain CI/CD pipelines using a GitOps-driven approach, ensuring efficient and secure deployment workflows
Streamline processes through automation, focusing on scalability, security, metric collection, and enhanced visibility across environments
Partner with developers to optimize service reliability, performance, failover strategies, and scalability
Work as part of an innovative and high-performing team, leveraging modern tools and technologies.
Requirements:
5+ years of experience in DevOps roles, with a proven track record of managing large-scale systems
3+ years of hands-on experience with cloud platforms, preferably AWS or Azure
Proficiency in Kubernetes for container orchestration- must
Strong experience with Infrastructure as Code (IaC) tools, particularly Terraform- must
Advanced expertise in enterprise Linux administration in production environments, including deployment, configuration, and lifecycle management
In-depth knowledge of Continuous Delivery (CI/CD) and GitOps methodologies, with tools such as Jenkins, GitHub Actions, and ArgoCD
Expertise in configuration management tools such as Ansible
Familiarity with message bus technologies (e.g., RabbitMQ, Kafka, or similar)
Hands-on experience with monitoring and logging solutions
Proficiency in automation scripting using Bash and at least one programming language such as Python or Go
Solid understanding of networking concepts and information security, including firewalls, VPNs, LDAP, identity management, and access control
Strong collaboration skills, with the ability to work across teams, communicate complex issues, and support technical decision-making
Analytical and proactive, with high ownership and accountability for end-to-end system reliability in dynamic environments.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8600847
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
לפני 59 דקות
Location: Tel Aviv-Yafo
Job Type: Full Time
Your Career:
Join a team of senior engineers operating in a large-scale, multi-cloud production environment supporting tens of thousands of enterprise customers worldwide. This is not a typical SRE role - youll work at the core of a complex, high-impact system alongside experienced DevOps professionals in a fast-paced, cybersecurity-focused organization.
Your Impact:
Own and operate large-scale, global production environments across multiple cloud providers (GCP, AWS, Azure)
Actively monitor, investigate, and resolve incidents triggered by automated alerting systems (PagerDuty / Incident Response)
Drive end-to-end troubleshooting across complex, distributed systems with high context switching
Design, deploy, and improve monitoring and observability systems (e.g., Prometheus, Grafana) - not just react to alerts
Collaborate closely with internal teams (CX, CS, Engineering) to ensure system reliability and performance
Work hands-on with modern DevOps and infrastructure tools including Kubernetes, Terraform, CI/CD pipelines, and GitOps workflows
Develop and maintain automation and tooling (primarily in Python)
Gain deep understanding of system architecture and interconnected services
Contribute to a culture of operational excellence in a high-scale, high-availability environment
On call responsibilities:
Daytime hours (12:00-20:00)
Occasional weekends and holidays (rotation-based).
Requirements:
Your experience:
5+ years of experience in SRE roles in production environments at scale
Strong hands-on experience with Kubernetes and Terraform
Strong hands-on experience with at least one major cloud platform (GCP or AWS required)
Experience building and configuring monitoring systems (e.g., Prometheus, Grafana)
Familiarity with CI/CD and GitOps tools (GitLab CI, GitHub Actions, Jenkins, Flux)
Proficiency in Python for scripting and automation
Strong troubleshooting and problem-solving skills with a passion for incident handling
Ability to work in fast-paced environments with high context switching
Highly responsive, proactive, and ownership-driven
Strong collaboration and communication skills
Curious mindset and eagerness to learn.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8638182
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking an exceptional Backend Engineer to join our Platform Group, where you will architect and develop shared infrastructure components that power Semperis's enterprise security and identity products. This is a highly collaborative role requiring both technical excellence and strong cross-functional partnership skills.
What You'll Do:
As a Backend Engineer on our Core Platform team, you will:
Design and build shared platform components used across multiple product teams, ensuring scalability, reliability, and maintainability
Architect distributed systems using microservices and event-driven patterns that support enterprise-scale workloads
Own critical backend services from conception through deployment, including code quality, performance optimization, and operational excellence
Collaborate extensively with product engineers, DevOps, and architects to define platform capabilities and technical standards
Drive technical initiatives that improve developer productivity, system observability, and platform resilience
or backend development across the organization
Analyze system performance and data patterns to identify optimization opportunities and inform future platform investments
Requirements:
5+ years of professional software engineering experience, with strong expertise in backend development
Deep proficiency in C# and .NET Core, including modern framework features and performance optimization
Production experience with Kubernetes and container orchestration in cloud environments (Azure/AWS/GCP preferred)
Proven track record designing and implementing microservices architectures and event-driven systems at scale
Experience developing shared libraries, frameworks, or platform components consumed by multiple teams
Demonstrated ability building enterprise SaaS applications serving igh-volume, multi-tenant environments
Strong collaborative mindset with excellent communication skills and experience working across teams to drive consensus
Bachelor's degree in Computer Science or equivalent practical experience
Fluency in English (written and verbal)
Preferred Qualifications:
Experience in the Enterprise Identity and Access Management (IAM) domain
Hands-on expertise with Kafka, RabbitMQ, or similar message brokers
Knowledge of ElasticSearch or other distributed search/analytics platforms
Open-source contributions or experience maintaining shared component libraries
Experience with observability tools (Prometheus, Grafana, distributed tracing)
Background in API design and governance for platform services
Familiarity with CI/CD pipelines and infrastructure-as-code practices
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8636235
סגור
שירות זה פתוח ללקוחות VIP בלבד