We are seeking an experienced Network Engineer to join our Network & Cloud Operations team. This role is responsible for investigating complex network and service issues, implementing and improving monitoring solutions, leading technical escalations, contributing to RCAs, and building workflows that streamline investigation processes for lower NOC and Support tiers.
As part of the Network & Cloud Operations team, the engineer will play a key role in ensuring the performance, reliability, and scalability of our companys network services. The role requires close collaboration with Engineering throughout the lifecycle of new versions and network features, from design review and operational readiness to production rollout, validation, and continuous improvement. This includes providing operational feedback on new network capabilities, helping define production requirements, and ensuring smooth adoption of new technologies across our companys global network.
Key Responsibilities:
Implement and improve network monitoring and alerting systems to proactively detect service-impacting issues.
Lead escalations of critical network and service incidents, coordinating with Engineering, Support, Security, and Operations teams.
Investigate and resolve complex network issues beyond standard NOC and Support tiers.
Contribute to RCAs by providing technical analysis, impact assessment, mitigation actions, and prevention recommendations.
Work closely with Engineering on new versions, features, and network capabilities, including design feedback, production readiness, rollout validation, and post-release monitoring.
Participate in CHNG management processes, service-component deployments, release pipelines, and production rollouts.
Create runbooks, workflows, and investigation procedures to improve troubleshooting efficiency across NOC team.
Identify recurring issues and operational gaps to drive improvements in network stability, monitoring, alerting, and automation.
Requirements: Minimum of 3 years as Network engineer / Production engineer / T3 support engineer or similar role.
Strong understanding of network protocols (i.e: BGP, OSPF, DNS, TCP/IP).
Proficiency in tools like Grafana, Sensu, Zabbix, or similar platforms.
Advanced troubleshooting and problem-solving skills to diagnose and resolve network issues.
Ability to lead efforts during service incidents.
Experience with scripting languages (Python, Bash) to automate tasks and streamline workflows (Advantage).
Networking related certificates are an advantage.
This position is open to all candidates.