The Ecosystems Engineering group is seeking a Senior Quality Engineer to join our growing team.
In this role, you will work on our OpenShift Container Platform, built on container technology and the Kubernetes cluster management system. You'll be responsible for all aspects of quality for OpenShift, including designing test plans, extending automation frameworks, and developing new automated tests and infrastructure. You will make sure our solutions fulfill the highest possible enterprise quality. As a Senior Quality Engineer, you will gain a deep technical understanding of a variety of popular technologies, including containers, Kubernetes, OpenShift, focused on deep integration of AI stack and hardware accelerators.
You will also be working closely with the product management, other engineering groups within, and with our partners and lighthouse customers.
What you will do:
Work in a fast-paced Agile global environment of talented Engineers
Execute manual and automated tests and deliver clear status in a timely manner.
Conduct failure analysis, explore, identify and document new bugs, advocate for the resolution of bugs and communicate the impact on customers to developers and support;
Coordinate cross components teams to be responsible for bug verification, regression testing.
Work with the product release related teams to be responsible for the product delivery related testing;
Design, develop and maintain automation frameworks, CI and scripts with Golang for OpenShift testing and implementation
Develop and implement tests for AI tools that leverages hardware accelerators (e.g., GPUs)
Collaborate with hardware and software engineers to ensure optimal integration between AI and accelerators
Write high-quality, maintainable, and efficient code on open source repositories
Participate in code reviews and provide technical mentorship to junior engineers
Stay up-to-date on the latest advancements in AI frameworks, and hardware accelerators.
Requirements: 4+ years of professional experience working as Quality Engineer or similar roles with testing experience working with globally distributed QE team members
Programming skills in shell, Golang, Python or another programming language
Experience with a container ecosystem like Docker, Kubernetes, or OpenShift.
Knowledge of CI/CD and Jenkins
Strong experience with RHEL or other Linux distributions
Excellent problem-solving and debugging skills
Strong communication and collaboration skills
Ability to work independently and as part of a team
Ability to work in a fast-paced environment and learn quickly
The following are considered a plus:
Experience with Ansible
Experience with hardware accelerators (e.g., GPUs) for AI workloads
Knowledge of AI frameworks (e.g., TensorFlow, PyTorch)
Background in DevOps
Experience with Linux kernel development
Knowledge of Amazon Web Services (AWS) , Google Compute Engine (GCE), or Microsoft Azure.
This position is open to all candidates.