Lead Software Engineer, DevOps

This position has been marked by the employer as no longer active.
Job location: Tel Aviv-Yafo
Job type: Full time
Similar jobs that may interest you
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
As a Lead DevOps Engineer, your role involves the design and development of robust, scalable, and resilient distributed systems. You'll define product specifications, leveraging your technical expertise to create optimal solutions hosted in Kubernetes on AWS Cloud. This position requires extensive collaboration with various teams throughout the software development lifecycle. You will lead design discussions and code reviews, contributing to the overall quality of engineering within the organization.

Your responsibilities also include creating and supporting reusable application components and patterns, considering both business and technology perspectives. You'll utilize developer tools and a range of AWS services for task management, source code handling, building, deployment, operations, and real-time communication. You are expected to demonstrate advanced skills in application design, implementation, and maintenance, often with minimal supervision.

Beyond technical tasks, you will mentor other engineers, sharing your knowledge and actively contributing to the enhancement of best practices and processes within and across teams.

Responsibilities:

Design, build, and maintain the scalable cloud infrastructure and CI/CD pipelines necessary to support our cutting-edge AI and optimization services.

Champion Infrastructure as Code (IaC) practices using tools like Terraform and Kubernetes to automate the deployment, scaling, and management of our production environments.

Implement robust monitoring, logging, and alerting systems to ensure the high availability, performance, and reliability of all services.

Partner with development teams to streamline the software development lifecycle, improve deployment velocity, and embed best practices for security and operational excellence.


JR314438
Requirements:
4+ years of hands-on experience in DevOps Concepts and Cloud Architecture.

4+ years of experience with AWS (knowledge of S3, SQS, DynamoDB, IAM, and KMS concepts is mandatory); experience with equivalent concepts on other cloud providers, e.g., GCP and Azure, is optional.

4+ years of experience deploying and managing CI/CD pipelines, e.g., Jenkins and/or Spinnaker.

Advanced programming experience with at least two modern languages such as Go, Java, C++, or Python, including object-oriented design.

Proven understanding of micro-services-oriented architecture and extensible REST and gRPC APIs. Experience building the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems.

Knowledge and experience in Kubernetes cluster management, ensuring that workloads in Deployments and StatefulSets remain reliable, available, and secure and meet performance expectations.

Experience with Kubernetes packaging technologies such as Helm, and experience administering Kubernetes ConfigMaps, Services, Deployments, and StatefulSets.

Experience with monitoring production and staging of test and development environments for a number of applications in a dynamic organization.

Good command of version control tools, including but not limited to Git.

Strong expertise in troubleshooting complex production issues. Excellent problem-solving, critical thinking, and communication skills.

Degree or equivalent relevant experience required. Experience will be evaluated based on the core competencies for the role (e.g. extracurricular leadership roles, military experience, volunteer roles, work experience, etc.).
This position is open to all candidates.
 
8431996
18/11/2025
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
In this role, you will be responsible for:
Technical Leadership: Provide technical guidance and mentorship to the DevOps engineering team, overseeing daily operations and long-term strategic initiatives.
Cloud Architecture: Design, implement, and manage highly available, scalable, and secure cloud infrastructure on Amazon Web Services (AWS), adhering to industry best practices.
Container Orchestration: Oversee the management, optimization, and security of Kubernetes clusters and containerized environments.
Infrastructure as Code (IaC): Lead the adoption of IaC principles, directing the development and maintenance of configurations using Terraform/Tofu and Terragrunt.
System Reliability: Establish and refine Site Reliability Engineering (SRE) and monitoring methodologies, using tools like Prometheus to ensure system health, performance, and reliability.
Automation & CI/CD: Drive the continuous improvement of CI/CD pipelines to enhance automation, efficiency, and deployment speed.
Collaboration: Work closely with software development and operations teams to integrate DevOps principles throughout the software development lifecycle (SDLC).
Requirements:
A minimum of five years of experience in a DevOps, SRE, or a related technical leadership role.
Demonstrated expertise in containerization and orchestration, with extensive hands-on experience managing Kubernetes in production environments.
Proven proficiency in designing and deploying both microservice and monolithic architectures on Amazon Web Services (AWS).
Advanced skills in Infrastructure as Code (IaC) with Terraform/Tofu and Terragrunt.
Comprehensive systems administration experience across both Linux and Windows environments.
Strong command of Site Reliability Engineering (SRE) principles and monitoring stacks, with a particular emphasis on Prometheus.
In-depth knowledge of CI/CD pipeline implementation and automation best practices.
Exceptional analytical, problem-solving, and communication abilities.
Preferred Qualifications:
Professional certifications such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or related credentials.
Hands-on experience with comprehensive monitoring and logging solutions and various vendor tools.
Familiarity with managed data services such as Mongo Atlas, Confluent Kafka, and Elastic Cloud.
Deep understanding of cloud security best practices, particularly for AWS and containerized workloads.
Proficiency with networking principles and web server technologies, including Nginx and IIS.
Strong understanding of leveraging AI and AI-powered IDEs to enhance development and operational efficiency.
This position is open to all candidates.
 
8418855
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a skilled DevOps Engineer to join our R&D infrastructure team and play a key role in building and scaling cloud-based platforms. In this position, you will design, implement, and maintain modern CI/CD pipelines, manage multi-cloud environments, and support microservices-based architectures. You will work closely with developers, QA, and product teams to streamline delivery processes, improve system reliability, and ensure smooth deployments. This is a hands-on role where you will directly influence the stability, scalability, and efficiency of our production systems while leveraging cutting-edge technologies across AWS and Azure.

Responsibilities
Infrastructure as Code (IaC): Develop and maintain infrastructure using tools like Terraform and Ansible.
Cloud Infrastructure Management: Deploy, manage, and monitor applications in cloud environments (AWS and Azure).
Collaboration & Support: Work closely with developers, QA, and product teams to streamline releases and improve productivity. Provide technical support for development and operations teams during incidents and deployments.
CI/CD Pipeline Management: Design, implement, and maintain continuous integration and delivery pipelines. Automate build, test, and deployment processes to improve speed and reliability.
Requirements:
3-5 years of experience as a DevOps Engineer, SRE, or Platform Engineer
Strong problem-solving skills
Microservices architecture & container orchestration (Docker and Kubernetes)
Experience with IaC tools (e.g. Terraform)
Strong knowledge of CI/CD tools such as Jenkins and GitHub
Experience with configuration management tools (e.g. Chef, Ansible, or Puppet)
Experience with GitOps (e.g. ArgoCD)
Proven scripting capabilities: PowerShell/Bash/Python
Hands-on experience with cloud platforms AWS/Azure/GCP
Strong troubleshooting skills
Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, etc.)
Excellent collaboration and communication skills for working across development, QA, and operations teams
BSc degree in computer science, computer engineering, relevant technical discipline, or equivalent practical experience
This position is open to all candidates.
 
8423258
Location: Tel Aviv-Yafo
Job Type: Full Time
Wanted: Site Reliability Engineer, Panda team
Realize your potential by joining the leading performance-driven advertising company!
As a Site Reliability Engineer on the IT Production team in our Tel Aviv office, you'll play a vital role in building robust services and solving infrastructure challenges with automation, working with cutting-edge technologies and pushing them to their limits on our mostly on-prem, cloud-like infrastructure.
As a Site Reliability Engineer, you'll bring value through the following:
Ensure Reliability & Scalability: Design, implement and manage highly reliable and scalable distributed systems across our on-premise, cloud and AI/ML environments. Proactively optimize performance, efficiency, resource utilization and cloud cost.
Drive Automation: Automate repetitive tasks, infrastructure provisioning, configuration and deployments using IaC and scripting languages (e.g., Python, Go, Rust).
Develop Observability & Capacity: Implement comprehensive monitoring and alerting systems to ensure system health. Collaborate on capacity planning to meet future growth.
Maintain Security & Compliance: Integrate security best practices and ensure compliance with industry standards.
Lead Incident Management: Participate in on-call rotations, lead incident responses and conduct root cause analysis to minimize downtime.
Foster Collaboration & Improvement: Work closely with development, operations and security teams to drive shared responsibility and continuous improvement in SRE practices.
Our Tech Stack:
Linux, Kubernetes, nginx, Istio, AWS, GCP, Azure, Alicloud, Fastly, Terraform, Consul, Prometheus, Loki, Grafana, Airflow, Redis, Kafka, Vector, Hadoop, Cassandra, Vertica, MySQL, HDFS, ELK.
Requirements:
4+ years of experience in software development with a proven track record of designing and developing internal tools, automation frameworks, and platform components in large-scale distributed production environments, with a focus on Linux operating systems.
Deep, demonstrable expertise in one of the following programming languages: Golang, C, Rust, Python, or Java.
Experience in observability tooling development, specifically implementing custom metrics, tracing, and logging within application code.
Practical understanding of the HTTP protocol (including HTTP methods, status codes, and headers). Proven ability to design, implement, and instrument robust internal APIs (e.g., using REST or gRPC).
Understanding of Linux operating system internals: kernel configuration, system calls, process management, memory, and I/O.
Proven ability to troubleshoot and optimize performance bottlenecks under heavy load using advanced monitoring and profiling tools for high-throughput and low-latency applications.
Bonus points if you have:
Experience as an SRE, DevOps Engineer, or System Administrator in a large distributed environment with a focus on Linux operating systems.
This position is open to all candidates.
 
8439403
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Engineer.
As a key member of our engineering team, you'll work at the intersection of development, operations, and reliability. You'll automate cloud infrastructure, ensure system performance, and maintain secure, scalable deployments in a regulated fintech environment.
Responsibilities:
Manage and enhance cloud infrastructure (AWS, GCP, Azure, or similar).
Develop, maintain, and automate CI/CD pipelines to streamline application delivery.
Implement Infrastructure as Code (e.g., Terraform, Ansible, CloudFormation) for provisioning and managing environments.
Set up and maintain monitoring, observability, and alerting systems using tools like Prometheus, Grafana, Splunk, New Relic, ELK, etc.
Define, track, and act upon SRE metrics (SLIs, SLOs, error budgets) to balance reliability and development velocity.
Participate in incident response, including root cause analysis and remediation.
Automate repetitive tasks to reduce toil and increase system resiliency and uptime.
Collaborate with developers and security teams to embed security and compliance best practices (e.g., PCI DSS, DevSecOps).
Support on-call rotation and continuously improve operational processes.
Requirements:
5-8 years of experience in DevOps, SRE, or related engineering roles.
Proven experience working with at least one cloud provider (AWS, GCP, Azure).
Proven experience with containerization and orchestration (Docker, Kubernetes, GKE).
Proficiency in CI/CD tooling (e.g., GitLab CI, Jenkins, GitHub Actions).
Hands-on experience with Infrastructure as Code tools (Terraform, Ansible, CloudFormation).
Strong command of monitoring and observability tools (Prometheus, Grafana, ELK stack, Splunk, New Relic).
Solid scripting ability in Python, Bash, or similar.
Familiarity with Linux/Unix systems, networking, and basic system administration.
Comfortable working in fast-paced, collaborative environments and able to handle operational incidents effectively.
Excellent communication skills and a mindset geared toward continuous learning and improvement.
Nice to Have:
Exposure to containerization and orchestration (Docker, Kubernetes, GKE).
Understanding of SLA/SLI/SLO frameworks, error budgets, and reliability engineering principles.
Awareness of financial compliance standards like PCI DSS.
Knowledge of DevSecOps practices (security-as-code, shifting security left).
Familiarity with incident management and on-call culture.
This position is open to all candidates.
 
8441385
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a Senior DevOps Engineer to join our R&D team in developing the next rising product in the health tech landscape. If you are looking for a challenging, influential position and are passionate about making an impact, this might be the role for you.

As a Senior DevOps Engineer, you'll play a key role in the design, development, testing, deployment, and monitoring of our infrastructure and products. In this position, you'll make significant contributions to our observability stack, helping build and maintain robust systems for logs, metrics, traces, and alerting.

Our ideal candidate is passionate about DevOps and observability, has strong communication skills, and thrives on constant improvement for both technology and processes. If you enjoy working on multiple projects in parallel and are a proactive team player, you'll fit right in.

This is a unique opportunity to join the core team of a fast-growing startup, where your contributions will have a direct impact on our product and success.

Responsibilities

Support and collaborate with cross-functional engineering teams using cutting-edge technologies.
Contribute to the design, implementation, and maintenance of monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, Loki)
Secure, scale, and manage our cloud environments (AWS and GCP)
Design and implement automation solutions for both development and production
Manage and improve our CI/CD pipelines for fast and safe delivery
Lead best practices in infrastructure, observability, configuration management, and system hardening
Continuously assess and improve existing infrastructure in line with industry standards
Requirements:
BSc in Computer Science, Engineering, or equivalent experience
5+ years of experience as a DevOps Engineer or similar software engineering role
Proven experience with Docker and Kubernetes (EKS preferred)
Hands-on experience with monitoring and observability tools, including Prometheus, Grafana, Datadog, or similar.
Expertise in Terraform for AWS infrastructure-as-code deployments
Strong collaboration and interpersonal communication skills
Excellent analytical thinking and problem-solving mindset
Proficiency with relational databases
Solid knowledge of Python and Bash scripting
Experience with test automation is an advantage
This position is open to all candidates.
 
8398069
Location: Tel Aviv-Yafo
Job Type: Full Time
We are seeking a highly skilled and motivated Team Leader to build and lead a new team dedicated to developing orchestration tools and software solutions for AI datacenters.
The main goal of this team is to design and deliver customer-focused orchestration platforms that simplify the deployment, management, and monitoring of large-scale AI workloads.
This role combines technical leadership with hands-on development, covering the entire AI datacenter ecosystem including switches, hosts, smart NICs, GPUs, ROCm, and RCCL. The team will primarily develop in Python, complemented by modern full-stack technologies for user interfaces and control systems.
Key Responsibilities:
Lead and mentor a team of engineers building orchestration tools that manage complex AI datacenter infrastructures.
Define the team's vision, roadmap, and architecture for orchestration solutions that enhance customer experience and operational efficiency.
Design and implement distributed control and orchestration systems using Python and full-stack frameworks.
Collaborate with networking, compute, and AI acceleration teams to integrate orchestration capabilities across all datacenter components (switches, NICs, GPUs, and software stacks).
Work closely with product, QA, and DevOps teams to identify customer requirements and translate them into scalable, production-grade orchestration platforms.
Ensure software reliability, scalability, and maintainability through strong design principles, testing, and CI/CD practices.
Foster a culture of innovation, technical excellence, and cross-functional collaboration.
Requirements:
5+ years of software development experience, including 2+ years in a team leadership or technical lead role.
Strong proficiency in Python for backend, orchestration, and systems integration.
Proven experience in designing and implementing orchestration or control-plane systems for datacenter or cloud environments.
Deep understanding of datacenter infrastructure: networking, compute, storage, or GPU acceleration.
Hands-on experience with containers, orchestration frameworks, and CI/CD pipelines (Kubernetes, Docker, etc.).
Excellent problem-solving, leadership, and communication skills.
Preferred Qualifications:
Experience with AI workloads and GPU software stacks (ROCm, RCCL, PyTorch, TensorFlow).
Familiarity with control-plane architectures, distributed systems, or cluster management frameworks.
Background in telemetry, resource scheduling, or performance optimization for large-scale systems.
Knowledge of microservices, REST/gRPC APIs, and cloud-native architectures.
Practical experience with full-stack development (React, Angular, Node.js, or similar).
Experience with testing frameworks (pytest, Robot Framework, etc.).
This position is open to all candidates.
 
8423027
4 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
Our team establishes best practices, creates guidelines, and promotes a strong DevOps culture across R&D. Design, implement, and operate large-scale, complex environments, ensuring high availability, seamless scalability, and optimal performance across diverse systems.

Job Responsibilities:

Lead and contribute to cross-organization DevOps projects.
Establish and enforce best practices for DevOps and R&D teams.
Research and evaluate new tools, technologies, and methodologies to improve DevOps processes.
Develop and maintain scalable and resilient infrastructure solutions.
Build and optimize CI/CD pipelines and deployment strategies.
Work closely with R&D teams to enhance service reliability, scalability, and performance.
Responsible for Kubernetes multi-cluster lifecycle management.
Requirements:
6+ years of hands-on experience in DevOps roles.
Experience with the full application release cycle and best practices.
Experience managing and developing CI/CD process tools.
Strong understanding of infrastructure & operations, including storage, networking, computing, and cloud security (Azure preferred).
Solid knowledge of Infrastructure as Code (IaC) tools such as Terraform.
Proficiency in Linux scripting and other programming languages.
Strong Python development skills beyond scripting.
Hands-on experience with Kubernetes and containerized environments.
Proven ability to work in a production environment, handling live system challenges such as performance, scalability, and security.
Strong debugging skills with the ability to identify and resolve complex technical issues.
Excellent self-management skills, with the ability to work independently in a dynamic environment.
This position is open to all candidates.
 
8439505
4 days ago
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We're looking for a Senior SRE Engineer who combines strong infrastructure expertise with solid programming skills to help scale our platform, and who can balance operational excellence with software development.
This is an exciting opportunity to build SRE processes from the ground up - creating new reliability pipelines, monitoring frameworks, and foundational practices that will scale with our rapid growth.
You'll lead our infrastructure and reliability efforts while writing code to automate, optimize, and enhance our systems. This role requires both deep technical expertise and the ability to mentor team members as we scale.
Stack: AWS, Python, EKS, K8s, Kafka, RabbitMQ, Pulumi, PostgreSQL, Databricks, GitHub Actions
Core Responsibilities:
Design and implement scalable, reliable infrastructure solutions on AWS using Infrastructure as Code (Terraform/Pulumi).
Build and maintain sophisticated CI/CD pipelines with GitOps methodologies.
Develop custom tooling and automation scripts in Python/Go/similar languages to improve operational efficiency.
Architect and implement comprehensive observability solutions (metrics, logging, tracing, alerting).
Define and track SLIs/SLOs/Error Budgets to ensure system reliability.
Lead incident response, conduct thorough post-mortems, and drive systemic improvements.
Optimize cloud costs through data-driven analysis and architectural improvements.
Collaborate with development teams to improve application reliability and performance.
Mentor team members on SRE best practices and infrastructure design patterns.
Requirements:
5+ years of DevOps/SRE experience in production environments.
Solid programming skills in at least one language (Python, Go, Java, or similar) with ability to write production-quality code.
Strong understanding of SRE principles: reliability engineering, capacity planning, chaos engineering.
Deep expertise with Kubernetes (EKS preferred) including operators, CRDs, and advanced networking.
Proven experience implementing Infrastructure as Code at scale.
Hands-on experience with observability stacks (Prometheus, Grafana, ELK, Datadog, or similar).
Experience with distributed systems concepts and troubleshooting.
Excellent problem-solving skills with a systematic approach to debugging.
Strong communication skills and ability to work across teams.
What Sets You Apart:
You write code to solve operational problems, not just configure existing tools.
You think in systems and can identify root causes across complex architectures.
You're passionate about automation and eliminating toil.
You balance perfectionism with pragmatism to deliver reliable solutions quickly.
You stay current with cloud-native technologies and best practices.
You can translate technical concepts for various audiences.
This position is open to all candidates.
 
8439435
Confidential company
Location: Tel Aviv-Yafo
Job Type: Full Time
We are looking for a DevOps Team Lead to join our R&D team and help shape the future of metabolic health by building the scalable infrastructure powering globally used devices and apps.

If you are an experienced DevOps professional with strong leadership skills, passionate about making an impact, and excited about working in dynamic, production-scale environments, this might be the role for you.

As a DevOps Lead at Lumen, you will lead our DevOps team of professionals, guiding the design, development, deployment, and monitoring of our infrastructure and products. You will play a crucial role in building robust, scalable systems, ensuring production stability, and implementing best practices.

Our ideal candidate is a strong people leader with deep technical expertise in Kubernetes and cloud environments, thrives in collaborative settings, and continuously seeks to improve technology and processes.

Responsibilities

Grow and guide a high-performing DevOps team, fostering technical excellence and collaboration
Partner with product and engineering teams to deliver resilient, high-performing features at scale
Drive the design, implementation, and maintenance of monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, Loki, Alertmanager)
Secure, scale, and manage our production cloud environments
Establish and promote best practices in infrastructure, observability, configuration management, and system hardening
Continuously assess and enhance our infrastructure in line with industry standards and emerging technologies
Lead ongoing cloud cost optimization efforts, ensuring infrastructure efficiency
Manage and improve CI/CD pipelines for fast, safe, and reliable delivery
Requirements:
At least 3 years of experience leading a DevOps team in a production environment
8+ years of hands-on DevOps engineering experience or similar software engineering role
Excellent interpersonal and communication skills, with the ability to collaborate effectively across teams
Strong analytical thinking and problem-solving mindset
Proven expertise in Kubernetes in production-scale environments
Strong experience with Docker, Terraform, and cloud infrastructure (AWS/GCP)
Solid scripting skills in Python and Bash
BSc in Computer Science, Engineering, or equivalent experience
Experience with ALM (Application Lifecycle Management) and automated testing is a strong advantage
This position is open to all candidates.
 
8398071