The ML Engine team is dedicated to driving innovation in machine learning (ML) by enhancing model efficiency and accelerating development cycles. This work is essential to keeping our company at the forefront of the competitive generative AI landscape. The team focuses on developing and optimizing backend systems that support ML inference, directly impacting our products' speed and scalability.
What you will be doing
As ML Engine SW Engineer, you will play a crucial role in shaping the system design, backend architecture, and overall performance of our companys ML inference systems. We collaborate with cross-functional teams, including designers, developers, and researchers. The role offers the chance to make a significant impact on our companys core production systems. Youll have the opportunity to develop and optimize critical backend features and work across diverse technical areas, contributing directly to the company's success. This hands-on role involves working extensively with Python, ComfyUI, system optimizations, and cloud infrastructure.
This position is perfect for a skilled backend engineer passionate about driving efficiency and innovation in machine learning. Join us to make an impactful difference in our companys AI journey!
Core Responsibilities
Develop and maintain our company's ML inference engine
Drive improvements in system efficiency, monitoring, and performance
Work directly with internal engine users to gather requirements and feedback
Implement enhancements and optimizations within ComfyUI and other backend frameworks
Day-to-Day Responsibilities
Design and optimize backend systems to support efficient ML model inference
Work with cloud infrastructure, networking, and both CPU and GPU architecture
Enhance the efficiency and observability of our inference engine, supporting various high-demand applications
Collaborate closely with engine users, designers, developers, and researchers to refine and improve system capabilities
Contribute to the ComfyUI codebase, ensuring a scalable, user-friendly interface for machine learning operations
Navigating and contributing to a large, complex existing codebase
Addressing complex issues with creativity and efficiency
Requirements: 5+ years of experience with Python in production environments
Strong understanding of system architecture principles
Proven familiarity with machine learning frameworks and methodologies
Independent leadership with a proactive approach to problem-solving
Strong analytical skills for diagnosing and addressing complex technical challenges
A collaborative, team-player mindset that thrives in cross-functional environments
Preferred Requirements
Knowledge of CPU/GPU architecture and experience with low-level optimizations
Experience with image processing and computer vision
Experience contributing to large, open-source projects
This position is open to all candidates.