Our Machine Learning team is at the forefront of enabling and optimizing high-performance deep learning applications on edge devices. We are looking for a versatile and highly motivated Machine Learning Engineer with a strong R&D background to join our team.
This role requires a solid mathematical foundation, a passion for AI research, and the ability to quickly adapt to new technologies. You should feel comfortable reading and implementing academic papers while also being deeply system-oriented, able to dive into hardware architecture, software toolchains, and customer applications as needed.
At our company, developing and deploying deep learning models is software-intensive, and candidates are expected to have strong programming skills in scientific computing. As an AI chip company, we highly value software-hardware integration skills, as our ultimate goal is to ensure seamless optimization for our processors.
Beyond technical expertise, we are looking for candidates who demonstrate:
Ownership and accountability: taking initiative and responsibility for driving projects to completion.
Professionalism: delivering high-quality work while staying up to date with the latest advancements in AI.
Teamwork: collaborating effectively across teams to integrate AI solutions into real-world applications.
We encourage intellectual curiosity and a keen interest in exploring AI trends, discussing future directions, and staying informed about market developments. Candidates should enjoy working in an open, well-documented manner and be eager to present their work to peers and stakeholders.
Responsibilities
Develop and optimize Transformer-based and Generative AI models for deployment on our edge AI processors.
Stay at the forefront of Generative AI and neural-network computer vision research, driving innovation in edge AI applications.
Benchmark and enhance vision and generative models, ensuring seamless integration with our AI toolchain through our Model Zoo.
Collaborate with architecture and other software teams to co-optimize models for our unique processing architecture.
Support and engage with tier-1 customers, guiding them in deploying and optimizing their AI models on our platform.
Requirements
M.Sc. in CS, EE, Mathematics, or Physics.
5+ years of R&D experience with a strong track record of technical excellence and leadership.
Expertise in deep learning model development and optimization, with a preference for experience in Transformer-based (Generative AI) models.
Strong programming skills and experience designing simulation environments for algorithm validation.
Ability to analyze complex systems, read academic literature, and apply research insights.
Experience in system architecture or hardware-aware AI model optimization.
Excellent problem-solving, communication, and collaboration skills.
Advantages
Experience in Generative AI model development and inference.
Knowledge of model optimization techniques such as quantization and pruning.
Familiarity with hardware-aware neural network design and deployment.
This position is open to all candidates.