we are looking for a Senior Infrastructure Engineer to work on our dedicated engineering team building processing pipelines, information storage systems, and presentation layers in support of intelligence analysts.
The collection, processing, and exploration of malware samples and other information at a large scale is at the core of the Intelligence mission The Intelligence Automation team is responsible for prototyping, building, and operating the systems that enable this mission, and we'd like you to join us!
Your job will be to build, maintain, and improve infrastructure to support the entire breadth of our teams activities. You will work on classical datacenter and cloud infrastructure as well as environments for malware sandboxing or world-wide threat monitoring and hunting.
Advancing our fast-paced intelligence mission, requirements sometimes shift rapidly, and projects can live anything from weeks to years depending on changes in the surrounding ecosystem. It will be your responsibility to provide an infrastructure that can keep up with and adapt to these changes.
Occasionally things inside or outside of your control break and you will use your debugging skills to pinpoint the issue no matter whether it is on a hardware, network, cloud, kernel, or user space level.
You will be responsible for all aspects of the infrastructure you design, build, and maintain. This includes gathering requirements, making technical choices, creating documentation, securing workloads, upskilling colleagues, proactively monitoring operations, and gathering feedback from stakeholders.
You will join a team of very experienced infrastructure engineers who will always have your back. However, as a remote employee on a team distributed across many regions and time zones, you will not have direct access to all of your co-workers for the entire workday. Thus, the ability to work unsupervised, communicate asynchronously, and take the initiative in maintaining lines of communication is crucial. Additionally, we are looking for someone who would like to be part of a team who are passionate about their work and go the extra mile to exceed expectations. We love enthusiastic individuals who bring a positive attitude to their work and really care about what they produce for our stakeholders.
What You'll Do:
Maintain a can-do attitude and be solution-oriented
Deliver on ambiguous assignments and quickly evolving requirements in a fast-paced environment
Design, implement, document, and maintain our multi-cloud infrastructure
Be a consultant to development teams to ensure smooth deployment, monitoring, and maintenance of applications and services
Develop and maintain infrastructure-as-code (IaC) and management automation tools
Secure traditional and AI workloads
Ensure high availability, scalability, and performance of our systems
Troubleshoot and resolve complex infrastructure issues
Deploy, monitor, and troubleshoot relational and NoSQL databases
Mentor junior engineers and contribute to knowledge sharing within the team
Stay up-to-date with industry trends, best practices, and emerging technologies
Judge security and compliance risk
Requirements: 5+ years of experience in a DevOps or SRE role, with a focus on cloud-native technologies
Experience running Kubernetes clusters
Solid understanding of the risks and limitations of AI-based tooling
Ability to work on a geographically distributed and diverse team
Ability to independently make sound, justifiable decisions and take action
Proficiency in Go or Python, with experience developing automation scripts and tools
Proficiency in Linux, networking fundamentals and Kubernetes
Experience with monitoring and logging tools like Prometheus, Grafana, Splunk, LogScale
Experience with cloud providers such as AWS, GCP, or Azure
Experience with infrastructure-as-code tools like Helm, terraform
This position is open to all candidates.