We seek an experienced Software Engineer with a strong background to become an integral member of our Data-Core team, tasked with the mission of processing, structuring, and analyzing hundreds of millions of data sources.
Your role will be pivotal in creating a unified, up-to-date, and accurate utilities map, services, and applications for accelerating our mapping operations. Your contributions will directly impact our core product's success.
Responsibilities:
Collaborate with cross-functional teams to design, build, and maintain data processing pipelines while contributing to our common codebase.
Contribute to designing and implementing data architecture, ensuring effective data storage and retrieval.
Develop and optimize complex Python-based applications and services to allow more efficient data processing and orchestration, enhancing the quality and accuracy of our datasets.
Implement geospatial data processing techniques and contribute to the creation of our unified utilities map, enhancing the product's geospatial features.
Drive the scalability and performance optimization of data systems, addressing infrastructure challenges as data volume and complexity grow.
Create and manage data infrastructure components, including ETL workflows, data warehouses and databases, supporting seamless data flow and accessibility.
Design and implement CI/CD processes for data processing, model training, releasing, testing and monitoring, ensuring robustness and consistency.
Requirements: 5+ years of proven experience as a backend/software engineer with a strong Python background.
Experience in deploying a diverse range of cloud-based technologies to support mission-critical projects, including expertise in writing, testing, and deploying code within a Kubernetes environment.
A proven experience in building scalable online services.
Experience with frameworks like Airflow, Docker, and K8S to build data processing and exploration pipelines along with ML infrastructure to power our intelligence.
Experience in AWS/Google cloud environments.
Experience working with both SQL and NoSQL databases such as Postgres, MySQL, Redis, or DynamoDB.
Experience as a Data Infrastructure Engineer or in a similar role in managing and processing large-scale datasets - a significant advantage
This position is open to all candidates.