What you will be doing
Pre-train and fine-tune video and image generative models to pursue state-of-the-art results.
Publish papers and open-source models to benefit the research community and advance the field.
Design and implement machine learning models for text-to-video generation.
Collaborate with data engineers to curate and preprocess text and video data.
Optimize models for high performance, ensuring efficiency in training and inference phases.
Build new controls and capabilities into generative text-to-video models.
Stay updated with the latest developments in Generative AI, particularly in the fields of image, video, and audio.
Work closely with product teams to integrate AI models into applications and services.
Conduct experiments and prototype new concepts to advance the capabilities of our AI tools.
Requirements: PhD or equivalent experience in the field of generative AI
Track record of coming up with new ideas or improving upon existing ideas in generative AI, demonstrated by accomplishments such as first author publications or projects.
Excellence in engineering as well as research with strong programming skills in Python, and deep familiarity with machine learning frameworks.
Experience in training large diffusion transformer models from scratch.
Proven track record of handling large-scale datasets to train neural networks effectively.
This position is open to all candidates.