8 hours ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a talented DevOps engineer to join our team.
As a DevOps Engineer, you will architect and operate scalable, resilient cloud environments, using advanced automation and AI-based tools to improve system health, reduce manual intervention, and optimize infrastructure costs.
Responsibilities
Own the full lifecycle of the Coin Master platform. Ensure high availability, performance, and cost-efficiency across our multi-region AWS environment
Design and maintain robust, high-performance infrastructure using IaC. You will focus on building self-healing environments where AI agents continuously monitor and remediate issues in real-time
Embed security into the infrastructure layer. Leverage AI-assisted tooling to proactively identify and patch vulnerabilities across our Kubernetes clusters and AWS stack
Build tools and services that empower our engineering team to ship code faster. You are the architect of the platform, ensuring it is intuitive, scalable, and fully automated.
Requirements:
3+ years of DevOps experience. Proven track record in managing production-ready Kubernetes clusters, AWS infrastructure, and high-traffic distributed systems
Experience integrating AI tools into daily workflows, including hands-on use of AI agents (Cursor, Claude Code or similar) for infrastructure automation, planning, and debugging
Extensive experience managing multiple AWS accounts across various regions, with proficiency in services like Lambda, CloudFront, SQS, VPC, and IAM
Strong understanding of cloud cost optimization at scale through architectural design and automated resource management
Deep proficiency with modern CI/CD platforms (GitHub Actions, Jenkins, ArgoCD)
Advantage:
Experience with LLM orchestration (LangChain, LangGraph) or building internal tools/agents that interact with cloud APIs to automate operational tasks
Hands-on experience with Kafka (MSK/Apache Kafka) and high-performance caching layers like Redis/ElastiCache
Experience with Cloudflare management and complex edge-computing configurations.
This position is open to all candidates.
 
8721139
Similar jobs that may interest you
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Required DevOps Engineer
About the role:
Our DevOps team operates the infrastructure that powers our AI and Computer Vision platform across construction sites in 15+ countries. From data pipelines and ML workloads to backend services - you'll work with a diverse, modern, Kubernetes-based stack and have real influence on how we build, deploy, and operate.
What you'll do:
Own Multi-Cloud Infrastructure: Work alongside the team to design, scale, and operate our high-scale, multi-region production infrastructure across AWS and GCP, powering construction sites globally.
Drive Kubernetes at Scale: Manage and evolve our Kubernetes platform on EKS, leveraging GitOps practices with ArgoCD and Helm to enable safe, fast, and reliable deployments.
Build Robust CI/CD: Design and maintain CI/CD pipelines that empower dozens of engineers to ship confidently - with automation, testing, and progressive delivery built in.
Tackle Diverse Infrastructure Challenges: Work hands-on with a wide variety of workloads - from heavy data processing and Computer Vision pipelines to backend services and ML inference - each with unique scaling, performance, and reliability requirements.
Ensure Reliability & Observability: Build and maintain world-class observability (metrics, logs, tracing, alerting) so that issues are caught early and resolved fast. Performance, reliability, and scalability are at the core of what you do.
Security & Cost: Partner with the team to strengthen our security posture, identity and access management, compliance, and cloud cost optimization across both clouds.
Ownership from 0 to 1: You will have real influence over our architecture and tooling. We want engineers who care about shaping what we build and how we build it, ensuring performance, security, and observability are baked in from day one.
Requirements:
A seasoned DevOps / Infrastructure engineer (5+ years) with strong hands-on experience in production cloud environments.
Proven expertise operating large-scale, distributed systems - with deep understanding of Kubernetes, networking, and cloud-native architecture.
Strong experience with multi-cloud environments (AWS and/or GCP), Infrastructure-as-Code (Terraform), and GitOps workflows (ArgoCD, Flux, or similar).
Hands-on experience with CI/CD systems (Jenkins, GitHub Actions, etc.).
Solid scripting and automation skills (Python, Bash, or Go).
Proven track record of being a collaborative team player who partners closely with developers, ML engineers, and cross-functional stakeholders across the organization.
Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Logz.io, or similar).
Experience with databases (relational and/or NoSQL) - including operational aspects like backups, migrations, and performance tuning.
AI-Native Engineering: You are an AI-native engineer who leverages LLMs and agentic tools (like Cursor, Copilot, or Claude) not just for command completion, but as a core operational partner - automating diagnostics, runbooks, and infrastructure workflows so you can focus on the critical things.
This position is open to all candidates.
 
8670484
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced DevOps Manager to lead and grow our DevOps function. This role combines people leadership, technical direction, and ownership of the infrastructure, tooling, automation, and operational practices that power Stampli's production environment.

You will manage DevOps Engineers, hire and onboard an additional team member, and drive the strategy, execution, and evolution of Stampli's internal DevOps platform. You will work closely with Engineering, Data, AI, Product, and Security teams to improve developer experience, enable fast and safe delivery, and keep production stable.

This role requires a strong hands-on DevOps / Platform Engineering background, combined with proven leadership capabilities. If you believe DevOps should operate as a self-service platform, love automation, and think in systems and end-to-end flows, keep reading.


What You Will Do
Lead, mentor, and manage a DevOps team, fostering ownership, excellence, collaboration, and continuous improvement.
Own the DevOps roadmap, priorities, execution, and delivery, aligned with Engineering, Data, AI, Security, and business goals.
Provide technical and architectural guidance across infrastructure, CI/CD, cloud operations, automation, observability, security, and platform engineering initiatives.
Build and evolve our internal DevOps platform, creating self-service capabilities, internal services, and golden paths that scale across teams.
Own CI/CD end-to-end, including Jenkins, GitHub, and GitHub Actions pipelines from commit to production.
Oversee and evolve our AWS stack, including ECS, EKS, Lambda, DynamoDB, Redshift, S3, DocumentDB, networking, IAM, observability, and deployment patterns.
Enable MLOps and data workflows using tools such as Airflow, MLflow, and Jupyter Notebooks.
Drive an automation-first mindset through Infrastructure-as-Code, scripting, internal tooling, and reusable components.
Lead cost optimization efforts with a FinOps mindset, including visibility, budgets, rightsizing, and workload efficiency.
Ensure security is embedded into DevOps practices, including least privilege, secrets management, vulnerability scanning, secure SDLC, and incident readiness.
Leverage AI-assisted development tools such as Cursor, GitHub Copilot, Claude Code, and ChatGPT Enterprise to improve team productivity and delivery speed.
Collaborate closely with cross-functional stakeholders to unblock delivery, improve developer experience, and maintain production stability.
Requirements:
7+ years of experience in DevOps, SRE, Platform Engineering, or Infrastructure Engineering, ideally in a SaaS production environment.
2+ years of managerial or team leadership experience, including mentoring engineers, driving execution, and owning team delivery.
Strong hands-on technical background with the ability to guide architecture, review technical decisions, and stay close to execution when needed.
Strong development background: you write code comfortably, build internal tools, and approach infrastructure work with software engineering discipline.
Proven experience with AWS services such as ECS, EKS, Lambda, DynamoDB, Redshift, S3, and DocumentDB.
Strong CI/CD experience with Jenkins, GitHub, and GitHub Actions.
Experience with Infrastructure-as-Code, automation, observability, cloud networking, IAM, and production operations.
Experience with ML/data tooling such as Airflow, MLflow, and Jupyter Notebooks - an advantage.
Hands-on experience with AI-assisted development tools such as Cursor, GitHub Copilot, Claude Code, or similar.
Demonstrated experience in cost optimization, cloud security, secure SDLC, and operational security.
A wide-angle thinker who sees the whole system, understands dependencies, and builds solutions that scale across teams.
Strong people leadership, communication, collaboration, and prioritization skills.
Strong communication skills in English. Hebrew is an advantage.
This position is open to all candidates.
 
8709491
31/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a DevOps Engineer, you will be responsible for the reliability, scalability, and efficiency of our SaaS products. Your success will be measured by your ability to achieve the following:

First 3 Months: Master our GitOps-based deployment pipelines. You will be expected to independently manage and troubleshoot deployments using ArgoCD and Kargo, and contribute to the team's on-call rotation.

First 6 Months: Enhance our CI/CD processes and workflow efficiency. You will lead the project to reduce average build and deployment times by 20% by optimizing GitHub Actions, Helm charts, and introducing initial AI-assisted automation.

First 12 Months: Improve system scalability and reliability. You will design and implement infrastructure enhancements using Terraform to support a 25% increase in customer workload while maintaining 99.9% uptime.

Core Responsibilities

Deployment Pipeline Management: Build and maintain our GitOps-based deployment pipelines to ensure a 99% success rate for all deployments and reduce manual intervention by 30% within the first year.

Infrastructure Management: Manage and scale our Kubernetes infrastructure on GCP, with a goal of optimizing resource utilization to achieve a 15% cost reduction in our GCP spending over the next 18 months.

Automation and CI/CD: Enhance and maintain our GitHub Actions CI/CD pipelines to decrease the lead time for changes to production by 25% within the first year.

AI-Assisted Workflow Integration: Integrate AI-assisted tooling into day-to-day DevOps and engineering workflows to improve productivity, scalability, and operational efficiency. You will leverage AI tools to generate initial configuration drafts, validate infrastructure code, and utilize AI-driven automation to reduce repetitive manual tasks by 20% within the first 6 months, accelerating engineering execution while maintaining high-quality standards.

System Reliability: Proactively improve system reliability and availability, with the objective of reducing the number of critical production incidents by 50% through improved monitoring, logging, and alerting within 12 months.
Requirements:
What We're Looking For

3+ years in DevOps/SRE: You have proven experience in a high-growth SaaS environment and can hit the ground running to help us scale our platform.

Google Cloud Platform (GCP): You possess a deep understanding of GCP services, particularly GKE, which is essential as our entire infrastructure is on GCP.

ArgoCD and Kargo: You have hands-on experience with GitOps and progressive delivery, which is key to our goal of achieving faster, more reliable deployments.

Kubernetes and Helm: You bring strong experience in managing and deploying applications on Kubernetes, as you will be responsible for the container orchestration of our microservices.

Terraform: You have expertise in infrastructure as code, which will be crucial for our project to scale our infrastructure and reduce costs.

Forward-Thinking Automation: You have a strong interest in or experience with leveraging emerging technologies, including AI tools, to modernize workflows, validate code, and eliminate repetitive manual tasks.
This position is open to all candidates.
 
Show more...
הגשת מועמדותהגש מועמדות
עדכון קורות החיים לפני שליחה
עדכון קורות החיים לפני שליחה
8672407
סגור
שירות זה פתוח ללקוחות VIP בלבד
סגור
דיווח על תוכן לא הולם או מפלה
מה השם שלך?
תיאור
שליחה
סגור
v נשלח
תודה על שיתוף הפעולה
מודים לך שלקחת חלק בשיפור התוכן שלנו :)
חברה חסויה
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Senior DevOps Engineer, you will be a key member of a high-performing team focused on building resilient, highly available production infrastructure. Your role is to bridge the gap between development and operations by applying software engineering principles to automate, scale, and optimize our cloud ecosystem.



Utilizing Terraform for lifecycle management and Amazon EKS for orchestration, you will build and scale robust, self-healing environments.



We are looking for an engineer who doesn't just use tools, but understands the Linux foundation they run on. You will work to ensure our systems are transparent, measurable, and elastic through reliable GitOps practices and full-stack observability. You will also implement and maintain essential DevSecOps tools to keep our environments secure.


Key Responsibilities:
Linux Systems Engineering: Maintain a deep-dive understanding of Linux internals to identify bottlenecks and optimize configurations.
Container Orchestration: Manage production-grade Amazon EKS clusters and implement modern autoscaling solutions like Karpenter.
Full-Stack Observability: Own the migration and maintenance of the Grafana-based observability suite (Grafana, Tempo, Mimir, Loki).
GitOps & IaC: Lead GitOps initiatives with ArgoCD and architect modular Terraform or Crossplane codebases.
Python Tooling: Develop custom tools and AI agents using Python to automate research and development cycles.
Vulnerability Management: Integrate automated image scanning and Software Composition Analysis (SCA) tools into the SDLC.
Requirements:
Required Qualifications:
7+ years of hands-on experience with a cloud provider, preferably AWS.
Strong experience with Infrastructure as Code (IaC) using Terraform.
Deep understanding of Kubernetes (EKS), including architecture, networking, and troubleshooting.
Solid Linux system administration skills.
Proficiency in Python or similar for writing production-grade code and tools.
This position is open to all candidates.
 
8712982
15/06/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a passionate Senior DevOps Engineer to join our DevOps Core Team. In this role, you will be responsible for the design, implementation, and maintenance of cloud-native infrastructure on AWS and Kubernetes. You will work closely with development, operations, and quality assurance teams to streamline processes, own our Infrastructure as Code practices, and help evolve our platform reliability at scale.

How Will You Make an Impact?

Design, implement, and maintain Kubernetes clusters in production environments, ensuring high availability and scalability.

Build and manage Infrastructure as Code using CloudFormation and Crossplane as our primary IaC tools.

Own and operate cloud infrastructure primarily on AWS, with working knowledge of GCP environments.

Identify and implement process improvements to increase the efficiency and reliability of the DevOps Core team.

Provide technical leadership and mentoring to team members, fostering a culture of engineering excellence.

Work closely with engineering teams to define infrastructure needs and provide DevOps support and guidance.

Research, evaluate, and integrate new technologies into our stack.

Manage, monitor, scale, and troubleshoot a distributed, highly available, customer-facing software platform.

Create and maintain technical documentation for infrastructure, processes, and runbooks.
Requirements:
Strong, hands-on Kubernetes experience: 5+ years of proven experience running and operating clusters in production at scale is a must.

Deep expertise with Infrastructure as Code - primary experience with AWS CloudFormation and Crossplane.

Comprehensive knowledge of AWS cloud services (compute, networking, storage, IAM, observability) - with 5+ years of proven, hands-on AWS experience.

Experience working with GCP as well - ability to operate, troubleshoot, and deploy in GCP environments.

Hands-on experience with ArgoCD and GitOps workflows - managing application delivery through Git as the source of truth.

Experience with CI/CD pipelines and automation tooling (Jenkins, CircleCI, or similar).

4+ years of scripting or coding experience (Python, Bash, or GoLang) for automation and tooling.

Advanced knowledge of Linux OS and networking fundamentals.
This position is open to all candidates.
 
8695424
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a skilled and motivated DevOps engineer with deep familiarity in the streaming ecosystem to join our elite infrastructure team. If you're excited by the challenge of operating mission-critical systems at scale and optimizing the developer experience through automation and tooling, we'd love to hear from you.

What you will do:

Automate Deployment and Operation
Oversee deployment of Kafka and RabbitMQ clusters (including Confluent Cloud & CFK). Build automation pipelines to ensure repeatability and resiliency across environments.

Monitor and Support Production Systems
Own production stability of global Kafka clusters. Handle on-call rotations, incident management, troubleshooting, and scaling challenges.

Improve Infrastructure Observability
Build and maintain observability systems: dashboards, alerting pipelines, metrics collection (Prometheus, Grafana, etc.).

Optimize System Performance
Collaborate with peers on benchmarking and optimization initiatives. Work on tuning Kafka brokers, cluster configurations, and runtime parameters.

Provide Developer Support and Training (Infra-focused)
Help developers configure topics, quotas, and consumers appropriately. Train service owners to interpret monitoring data and avoid pitfalls.

Develop and Maintain Infrastructure
Contribute to building infrastructure tools and scripts (IaC, Helm charts, etc.) that make provisioning and managing clusters reliable and efficient.

Secure Infrastructure Access
Configure and maintain secure access patterns across streaming infrastructure, ensuring proper authentication and role-based access controls are enforced for both developers and services.
Requirements:
What we expect:

8+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.

Deep hands-on Kafka experience, including deploying, maintaining, scaling, and monitoring clusters.

Experience with RabbitMQ.

Extensive experience with Docker, Kubernetes, Helm, and GitOps-style deployments.

Infrastructure as Code experience (Terraform, Pulumi, etc.).

Strong skills in scripting and automation (Python, Bash, etc.).

Familiarity with Confluent Cloud, Confluent for Kubernetes, and similar tools.

Solid understanding of authentication and authorization mechanisms in distributed systems.

Production support mindset - with proven troubleshooting and incident resolution history.

Collaboration and communication skills - especially with dev teams depending on platform support.

Experience with Istio Service Mesh (bonus).

Experience with GovCloud (bonus).


Bonus Qualities:

Mentorship and leadership experience in infrastructure or SRE teams.

Contributions to automation or monitoring open-source tooling.

Active participant in SRE or DevOps communities.

Conference speaker or internal tech trainer.

Technical writing about infrastructure automation or reliability.
This position is open to all candidates.
 
8695015
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Engineer to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will be part of a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.

What will you do?

Automation & Infrastructure
- Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.
- Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.
- Drive the adoption of infrastructure-as-code practices across the organization.
- Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.

Monitoring & Observability
- Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.
- Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.
- Define and track SLIs, SLOs, and error budgets across key services.
- Partner with development teams to embed observability earlier in the software development lifecycle.

Database & Platform Support
- Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.
- Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

4+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.


Ability to support the following:
Experience with cloud providers - AWS, GCP, or Azure.
Exposure to containerization technologies such as Docker and Kubernetes.
Familiarity with infrastructure provisioning using Terraform.
Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.
Exposure to and experience with migrating or building AI tools to improve processes.
This position is open to all candidates.
 
8662378
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Cloud FinOps Engineer to join our growing team!
This is a great opportunity to be part of one of the fastest-growing infrastructure companies in history, an organization that is in the center of the hurricane being created by the revolution in artificial intelligence.
"Our company's data management vision is the future of the market." - Forbes
We are the data platform company for the AI era. We are building the enterprise software infrastructure to capture, catalog, refine, enrich, and protect massive datasets and make them available for real-time data analysis and AI training and inference. Designed from the ground up to make AI simple to deploy and manage, our company takes the cost and complexity out of deploying enterprise and AI infrastructure across data center, edge, and cloud.
Our success has been built through intense innovation, a customer-first mentality and a team of fearless workers who leverage their skills & experiences to make real market impact. This is an opportunity to be a key contributor at a pivotal time in our company's growth and at a pivotal point in computing history.
Proactively identify and execute cost-saving opportunities through rightsizing, optimized scaling strategies, lifecycle management, and architectural improvements without impacting performance or velocity.
Collaborate with R&D and DevOps to evaluate the cost-impact of architectural decisions, ensuring "cost-by-design" is a core part of our infrastructure evolution.
Design and maintain granular visibility into cloud spend, transforming complex data into actionable insights that inform relevant teams and empower them to take ownership of their usage.
Take ownership of cost operations across AWS, Azure, GCP, and OCI, establishing consistent tagging policies, automated guardrails, and governance frameworks.
Manage and optimize our portfolio of RIs, Savings Plans, and Spot instances to maximize our cloud ROI.
Requirements:
3+ years of experience managing cloud costs in complex, large-scale environments with a proven track record of moving the needle on cloud spend.
A proactive, hands-on executor: You don't just report on costs; you actively identify overspend, propose technical solutions, and drive them to completion to deliver tangible savings.
Strong technical understanding of cloud-native architectures and experience in building or scaling efficient infrastructure.
Ability to analyze cloud usage economics to find the "sweet spot" between performance, scalability, and cost-efficiency.
Experience leading end-to-end cost optimization programs, from identifying inefficiencies to implementing company-wide governance.
Excellent communication skills and the ability to translate complex technical data into clear business insights for stakeholders at all levels.
B.Sc. in Computer Science, Software Engineering, or a related technical field.
Technical Stack & Tooling:
Deep, hands-on experience with Azure, AWS, GCP, OCI
Understanding of Kubernetes cost allocation and experience with tools to manage shared resource environments (Keda, Kubecost, etc.).
Proficiency with platforms like Umbrella, Azure Cost Management, or similar.
Experience with Terraform/Ansible/Python or implementing automated cost-governance and tagging policies.
Proficiency with platforms like Umbrella, Cloud Native cost management tools, or similar.
This position is open to all candidates.
 
8683114
3 days
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior Infrastructure Engineer to work on our dedicated engineering team building processing pipelines, information storage systems, and presentation layers in support of intelligence analysts.

The collection, processing, and exploration of malware samples and other information at a large scale is at the core of the Intelligence mission. The Intelligence Automation team is responsible for prototyping, building, and operating the systems that enable this mission, and we'd like you to join us!

Your job will be to build, maintain, and improve infrastructure to support the entire breadth of our team's activities. You will work on classical datacenter and cloud infrastructure as well as environments for malware sandboxing or world-wide threat monitoring and hunting.

Advancing our fast-paced intelligence mission, requirements sometimes shift rapidly, and projects can live anything from weeks to years depending on changes in the surrounding ecosystem. It will be your responsibility to provide an infrastructure that can keep up with and adapt to these changes.

Occasionally things inside or outside of your control break and you will use your debugging skills to pinpoint the issue no matter whether it is on a hardware, network, cloud, kernel, or user space level.

You will be responsible for all aspects of the infrastructure you design, build, and maintain. This includes gathering requirements, making technical choices, creating documentation, securing workloads, upskilling colleagues, proactively monitoring operations, and gathering feedback from stakeholders.

You will join a team of very experienced infrastructure engineers who will always have your back. However, as a remote employee on a team distributed across many regions and time zones, you will not have direct access to all of your co-workers for the entire workday. Thus, the ability to work unsupervised, communicate asynchronously, and take the initiative in maintaining lines of communication is crucial. Additionally, we are looking for someone who would like to be part of a team who are passionate about their work and go the extra mile to exceed expectations. We love enthusiastic individuals who bring a positive attitude to their work and really care about what they produce for our stakeholders.

What You'll Do:

Maintain a can-do attitude and be solution-oriented

Deliver on ambiguous assignments and quickly evolving requirements in a fast-paced environment

Design, implement, document, and maintain our multi-cloud infrastructure

Be a consultant to development teams to ensure smooth deployment, monitoring, and maintenance of applications and services

Develop and maintain infrastructure-as-code (IaC) and management automation tools

Secure traditional and AI workloads

Ensure high availability, scalability, and performance of our systems

Troubleshoot and resolve complex infrastructure issues

Deploy, monitor, and troubleshoot relational and NoSQL databases

Mentor junior engineers and contribute to knowledge sharing within the team

Stay up-to-date with industry trends, best practices, and emerging technologies

Judge security and compliance risk
Requirements:
5+ years of experience in a DevOps or SRE role, with a focus on cloud-native technologies

Experience running Kubernetes clusters

Solid understanding of the risks and limitations of AI-based tooling

Ability to work on a geographically distributed and diverse team

Ability to independently make sound, justifiable decisions and take action

Proficiency in Go or Python, with experience developing automation scripts and tools

Proficiency in Linux, networking fundamentals and Kubernetes

Experience with monitoring and logging tools like Prometheus, Grafana, Splunk, and LogScale

Experience with cloud providers such as AWS, GCP, or Azure

Experience with infrastructure-as-code tools like Helm and Terraform
This position is open to all candidates.
 
8715577
24/05/2026
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for an experienced SRE Team Lead to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will lead a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 3-4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.



What will you do?

Leadership & Team Management

Lead, mentor, and grow a team of SREs, providing technical direction, career development guidance, and day-to-day management.

Own the team roadmap for reliability, observability, and automation initiatives - prioritizing work, removing blockers, and driving delivery.

Conduct regular 1:1s, performance reviews, and hiring processes to build and sustain a high-performing team.

Foster a culture of operational excellence, blameless post-mortems, and continuous improvement.

Act as an escalation point for complex incidents and reliability issues, leading post-incident reviews and ensuring follow-through on action items.


Automation & Infrastructure

Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.

Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.

Drive the adoption of infrastructure-as-code practices across the organization.

Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.


Monitoring & Observability

Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.

Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.

Define and track SLIs, SLOs, and error budgets across key services.

Partner with development teams to embed observability earlier in the software development lifecycle.


Database & Platform Support

Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.

Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
Requirements:
What you need:

Experience & Leadership

3-4+ years of experience in a people management or team lead capacity within SRE, DevOps, or infrastructure engineering.

5-8+ years of overall experience in SRE, DevOps, or infrastructure automation roles.

Proven track record of building, coaching, and retaining high-performing engineering teams.

Experience owning an engineering roadmap and driving cross-functional reliability initiatives.



Technical Skills

Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.

Hands-on experience with infrastructure automation tools, particularly Ansible.

Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.

Good understanding of CI/CD pipelines and related tooling, including Jenkins.

Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.

Comfortable working in Linux-based environments.

Excellent problem-solving skills and strong written and verbal communication.



Ability to support the following:

Experience with cloud providers - AWS, GCP, or Azure.

Exposure to containerization technologies such as Docker and Kubernetes.

Familiarity with infrastructure provisioning using Terraform.

Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.

Exposure to and experience with migrating or building AI tools to improve processes.
This position is open to all candidates.
 
8662300