The group is the professional authority on ML engineering and ML ops. It serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers, and works with the most senior talent in the algo craft to achieve ML excellence.
What are some of the things you do on a day-to-day basis?
Develop, enhance and maintain highly scalable machine learning infrastructure and tools, including CI/CD, monitoring, alerting and more.
Own our machine learning platform end to end: design, develop, deploy, measure and maintain it, ensuring high availability, high scalability and efficient resource utilization.
Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems.
Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance.
Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
Build and maintain tools for automation, deployment, monitoring, and operations.
Troubleshoot issues in our development, production and test environments.
Directly influence the way billions of people discover the internet.
Requirements: What are the skills a good Machine Learning Ops Engineer needs to have?
Experience developing large-scale systems. Experience with filesystems, server architectures, distributed systems, SQL and NoSQL. Experience with Spark and Airflow or other orchestration platforms is a big plus.
Highly skilled in software engineering methods. 8+ years of experience.
Passion for ML engineering and for creating and improving platforms.
Knowledge of algorithms and machine learning. Good familiarity with machine learning concepts. 3+ years of experience working closely with algorithm engineering teams.
Experience designing and supporting ML pipelines and models in a production environment.
Excellent coding skills in Java & Python.
Experience with TensorFlow is a big plus.
Strong problem-solving and critical-thinking skills.
MSc in Computer Science or related field.
Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries.
Deep understanding of computer science fundamentals: object-oriented design, data structures, systems, application programming and multithreaded programming.
Strong communication skills for presenting insights and ideas, and excellent English for communicating with our global teams.
It would be great if you also have:
Experience leading algorithm projects or teams.
Experience in developing models using deep learning techniques and tools.
Experience in developing software within a distributed computation framework.
This position is open to all candidates.