Enable job alerts via email!

Founding Senior Data/ML Infrastructure Engineer

Coco

San Francisco (CA)

On-site

USD 90,000 - 150,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Founding Data & ML Infrastructure Engineer to join their autonomy team. In this pivotal role, you will develop and maintain the infrastructure that supports large-scale datasets for autonomous robots. Your expertise will drive massive improvements in delivery efficiency, allowing urban residents to benefit from cutting-edge robotic solutions. This position offers a unique opportunity to work alongside the CTO and shape the future of urban logistics through advanced AI technology. If you're passionate about building impactful systems and thrive in a collaborative environment, this role is perfect for you.

Qualifications

5+ years of experience in software or data engineering with a focus on ML/AI systems.
Strong programming skills and familiarity with ML frameworks like TensorFlow or PyTorch.
Experience managing cloud infrastructure for large-scale data processing.

Responsibilities

Design and implement a high-performance data engine for model training.
Build tools for automatically extracting and cleaning data from various sources.
Collaborate with AI engineers to develop workflows for training and testing models.

Skills

Software Engineering

Data Engineering

Infrastructure Engineering

Machine Learning

AI Systems

Cloud Infrastructure

Programming Skills

Data Pipelines

Containerization

Leadership

Tools

AWS

GCP

Azure

TensorFlow

PyTorch

Docker

Kubernetes

Terraform

CloudFormation

DVC

Delta Lake

At Coco, our mission is to revolutionize urban logistics by empowering cities, boosting local economies, and delivering delightful customer experiences. We connect people with local restaurants through our fleet of on-demand delivery robots, helping merchants reach their customers faster and more efficiently. By building innovative robotic systems that seamlessly navigate city sidewalks, Coco plays a key role in reshaping the future of last-mile delivery and enhancing local businesses.

To deliver on our mission, we are building an autonomy team to develop the AI technology that will enable our robot pilots to scale efficiently, sustainably, and safely. The involves building an autonomy stack ground-up based on our millions of miles of last-mile delivery routes, proprietary video streams, and LiDAR data.

What is the scope of this role?

As a Founding Data & ML Infrastructure Engineer, you will be responsible to stand up Coco’s autonomy stack alongside the CTO and fellow team members in the autonomy team. You will be responsible for developing and maintaining the infrastructure that supports the collection, processing, management, and training of large-scale datasets for our autonomous robots. The impact of this will be massive improvements to our robot-to-pilot ratio thereby allowing every person living in an urban area to benefit from last-mile delivery. In this role, you must accomplish the following:

Design and implement a high-performance data engine to mine and identify valuable data samples that enhance model training.
Build tools and pipelines for automatically extracting, cleaning, and curating data from various sources (sensors, logs, real-world interactions).
Enable seamless interaction with large-scale datasets, ensuring that the team can quickly retrieve and analyze data to drive insights.
Collaborate with the autonomy and AI engineers to develop the query layer and workflows for training and testing models
Build and maintain tools for dataset management, including data exploration, versioning, and interaction tools.
Architect and manage the infrastructure for model training and experimentation. This includes continuously optimizing data pipelines and infra for cost, scalability, and speed.
Create and maintain systems for dataset tracking and governance to ensure consistent and reproducible experiments.

Must have competencies:

5+ years of experience in software engineering, data engineering, or infrastructure engineering, with a focus on machine learning or AI systems.
Extremely well versed in building and managing cloud infrastructure for large-scale data processing and model training (AWS, GCP, Azure).
Excellent programming skills. Familiarity with ML frameworks i.e. TensorFlow, PyTorch.
Strong understanding of data pipelines, versioning, and data management best practices.
Experience working with containerization and orchestration tools (Docker, Kubernetes).
Strong experience with cloud platforms and infrastructure as code (Terraform, CloudFormation).
Familiarity with distributed systems, high-performance computing, and optimization for training large models.
Hands-on experience with tools for data management and interaction (e.g., DVC, Delta Lake, or similar tools).
Strong leadership and communication skills.