Are you an ambitious and skilled technical leader who enjoys building innovative solutions? Join our E2E team as a Senior Software Automation Engineer and lead the development of powerful software automation systems for NVIDIAs Data Center environments. This is a chance to lead a distributed team, make strategic architectural choices, and craft the future of NVIDIAs software automation platforms used in various.
fields like Machine Learning, Cloud, Storage, and High-Performance Computing.
What youll be doing:
Lead and mentor a team of engineers, providing technical direction and promoting a culture of innovation and collaboration.
Craft and design long-term architectures, ensuring scalability, maintainability, and technical innovation.
Architect and design algorithms to establish quality standards for future data center technologies, ensuring scalability and high performance.
Collaborate with cross-functional teams, including product managers, architects, networking engineers, and research teams, to define strategic technical targets and deliver impactful solutions.
Lead code reviews, offer technical mentorship, and follow standard processes to ensure high-quality deliverables and alignment with business goals.
Drive the automation strategy, setting priorities, timelines, and identifying potential risks to ensure smooth project execution.
Requirements: What we need to see:
B.Sc. in Computer Science or a related field, or equivalent experience
10+ years of relevant software development experience, with at least 3-5 years in a leadership or technical lead role
Extensive understanding of software design patterns, architectural principles and standard methodologies in complex distributed systems.
Experience crafting and developing microservices architectures.
Extensive experience with version control systems (e.g., Git) and CI/CD pipelines, ensuring seamless integration and deployment processes.
Strong background in Linux and Windows operating systems.
Established track record of driving automation programming initiatives, optimizing workflows, and improving efficiency.
Excellent problem-solving, debugging, and analytical skills to address complex technical challenges.
Leadership and communication skills, with the ability to mentor and encourage a team of engineers.
Experience in developing secure, compliant systems for highly regulated environments.
Ways to stand out from the crowd:
M.Sc. in Computer Science or a related discipline.
Proven experience with cluster management tools (e.g., Docker, Kubernetes, Slurm, Ansible) and familiarity with managing large-scale data center solutions.
Proficiency in Python programming.
Track record of delivering large-scale, distributed systems with an emphasis on reliability, scalability, and performance.
Experience with Big Data technologies (e.g., Hadoop, Spark, Kafka) and handling large-scale data processing pipelines.
This position is open to all candidates.