As a DevOps Infra Engineer, you will:
Own and manage the companys internal AI infrastructure, including GPU-based environments
Manage and maintain Kubernetes-based infrastructure, owning the platform end to end
Design, develop and maintain in-house infrastructure, automation tools and systems
Manage the DevOps tools including Bitbucket, Jenkins, Artifactory, Monitoring tools and databases
Implement and maintain infrastructure as code using Terraform and Ansible
Take part in solving DevOps challenges in order to support and improve R&D Services and Delivery time
Own end-to-end responsibility for DEV environments
Maintaining R&D systems and issue, build self-services tooling and improve developer experience
Test and integrate new technologies and tools (POC)
Be in charge of integration with the CI team and workflows.
Requirements: At least 3 years of DevOps experience
At least 2 years of experience with Linux
Hands-on production experience with Kubernetes (deployment, operations, troubleshooting)
Experience with Infrastructure as Code and configuration management tools (e.g., Terraform, Ansible, Helm)
Experience with Jenkins and Groovy
Experience with Docker and Docker compose
Experience with monitoring and logging stacksuch as Prometheus, Grafana, ELK
Expertise in Source control infrastructure (Git)
Strong programming skills in one or more languages such as Python, Ruby, Go, Java or PowerShell or bash
Hands-on in virtualization environments or Cloud platforms
Strong problem-solving skills with a proactive, independent and hands-on approach
It would be great if you also have:
Experience with GitOps workflows such as Argo CD
Experience managing on-prem GPU-based infrastructure for AI/ML environments
Experience managing and maintaining artifact repositories (Artifactory)
Knowledge of networking fundamentals and methodology
Sysadmin background.
This position is open to all candidates.