We are looking for a Data Center Team Lead with strong systems and networking knowledge, to lead the team that build and support the supercomputers and HPC-AI clusters of the networking clusters solutions group.
What youll be doing:
Lead and coordinate the planning and build of complex clusters and supercomputers across multiple data centers and labs
Manage for rack-and-stack, cabling, and space optimization efforts to ensure efficiency, maintainability, and standard processes
Lead all aspects of power and cooling efficiency strategies while ensuring optimal rack space utilization
Coordinate daily functions and maintenance of data facilities and test environments, ensuring seamless operations and timely problem resolution
Installation and integration of diverse infrastructure and solutions including Cloud, VMs, Storage, Network, HPC, and AI
Manage debugging activities network, optical cabling, bare metal, and operating systems
Collaborate closely with Research & Development teams to support evolving project needs and experimental setups
Mentor and develop team members, ensuring knowledge sharing, standard methodologies, and professional growth
Requirements: What we need to see:
MCSE or MCITP / CCNA certification
3+ years of experience as a team lead in large and complex data center environments, overall experience of 8+ years
Demonstrated practical experience in operating systems with strong problem identification and resolution skills
In-depth knowledge of Linux & Windows Core Services: DHCP, DNS, NIS, AD, etc.
Strong leadership skills with ability to organize, prioritize, and guide a team
Passionate about delivering excellent service with strong collaboration and interpersonal skills
Ways to stand out from the crowd:
Hands-on with Python and configuration management tools (e.g., Ansible, Puppet).
Experience with CI tools and job schedulers (e.g., Jenkins, SLURM).
Knowledge of virtualization technologies: KVM, VMware, Hyper-V.
Experience with storage solutions like Netapp, Lustre, GPFS, ZFS.
Skilled in L2 & L3 network protocols and resolving technical issues.
This position is open to all candidates.