We are looking for a Machine Learning Tech Lead to promote machine learning engineering excellence: someone who is passionate about building scalable, high-quality data products and processes, while ensuring production systems maintain strong real-time performance observability.
As a Tech Lead on the ML team, you will focus on designing and maintaining the core infrastructure that empowers the Machine Learning Engineers working within Data Science product teams. You'll collaborate closely with stakeholders across data science, product, and engineering, playing a pivotal role in driving the business by architecting and enabling the infrastructure for machine learning model development, serving, and lifecycle management, the foundation of our product.
Responsibilities:
Partner with MLEs in Data Science product teams and key stakeholders to design and maintain infrastructure for:
Data wrangling: supporting and enabling data requirements for research, training, validation, and testing.
End-to-end ML delivery: enabling model performance development, training, validation, testing, and version control.
Drive engineering best practices including code and model versioning, CI/CD pipelines, rollout strategies, and disaster recovery procedures.
Build and support monitoring and observability tools: dashboards, alerts, and performance tracking of models in production.
Lead architecture projects such as:
Feature Store: centralizing feature engineering and serving across teams.
Vector Databases: enabling large-scale embedding storage and retrieval for advanced ML applications.
GPU Cluster Scaling: optimizing distributed training and inference infrastructure.
Collaborate with product, data science, and engineering teams to solve complex problems, identify trends, and create opportunities through robust ML infrastructure.
Requirements:
3+ years of experience as an ML Engineer
2+ years of experience in a technical leadership role (leading engineers or data scientists)
Strong programming skills in Python and SQL
Hands-on experience with MPP frameworks such as Spark, Flink, Ray, Dask, or equivalent
Strong analytical and critical thinking skills
Experience in a similar role is a big advantage
Experience as a backend or DevOps engineer is an advantage
This position is open to all candidates.