our company is searching for a strong technical leader to own the backbone of our networking research capabilities. we are looking for an engineering manager to lead the development of our high-fidelity network simulation platform and the extensive on-premise infrastructure that powers it.
in this role, you will lead a team of performance simulation software engineers and DevOps /infrastructure specialists. you will own the "simulation-as-a-service" product-a critical platform used by internal researchers to model next-generation data center architectures. your mission is to ensure our simulations are accurate, performant, and accessible, while managing the large-scale compute clusters required to run them.
what you'll be doing:
team leadership: manage and mentor a team of C ++ software engineers and DevOps infrastructure engineers, fostering a culture of performance, reliability, and code quality.
product ownership (sim-as-a-service): treat the internal simulation platform as a product. work with research partners to define the roadmap, prioritize features, and ensure high availability for users.
high-performance simulation: be responsible for the architecture and optimization of complex network simulation engines ( C ++ based), ensuring they can scale to model extensive data center topologies with high fidelity.
infrastructure management: own the lifecycle of our on-premise compute clusters and servers. drive decisions on hardware upgrades, prioritisation, and managing system resources.
DevOps & automation: lead the strategy for ci/cd pipelines, automated testing, and containerized deployments to ensure rapid iteration and stability of the simulation platform.
multi-functional collaboration: partner with the ai agents team to expose simulation apis, enabling agents to run experiments and gather data autonomously.
Requirements: what we need to see:
msc, ph.d. or equivalent experience in Computer Science, electrical engineering, or a related field.
8+ years of hands-on software engineering experience, with a proven track record of leading technical teams in systems or infrastructure domains for 3+ years.
3+ years of managerial experience.
C ++ expertise: strong background in C ++ development for high-performance applications ( system -level programming, concurrent programming).
infrastructure & DevOps : practical experience managing on-premise servers, Linux environments, and modern DevOps tools (kubernetes, slurm, docker, ansible).
operational rigor: ability to manage "heavy" operations-ensuring uptime, monitoring system health, and optimizing hardware utilization.
ways to stand out from the crowd:
networking knowledge: deep understanding of computer networking fundamentals (tcp/ip, ethernet, infiniband, congestion control) and data center architectures.
simulation/modeling: experience with discrete event simulation (des) or modeling complex systems.
hpc background: experience working with mpi, cuda, or other high-performance computing frameworks.
specific simulators: familiarity with standard network simulators like omnet++, ns-3, or similar proprietary tools.
hardware knowledge: understanding of switch micro-architecture or nic design is a significant plus.
This position is open to all candidates.