Enable job alerts via email!

Staff Data Infrastructure Engineer

Dyna Robotics

Redwood City (CA)

On-site

USD 120,000 - 180,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Staff Infrastructure Engineer to lead the design and optimization of distributed storage systems that enhance robotic manipulation capabilities. This role involves building scalable, fault-tolerant storage solutions and intelligent caching strategies to ensure high-throughput access to vast datasets. With a focus on solving complex data challenges in high-performance environments, you'll collaborate with a talented team to drive advancements in AI-driven robotics. This is a unique opportunity to contribute to the future of intelligent automation in a dynamic and supportive work environment.

Benefits

Comprehensive health, dental, and vision insurance
Daily catered lunches and dinners
Fully stocked kitchen
Professional growth and development opportunities
Equity in a seed-stage venture-backed startup

Qualifications

  • 7+ years of experience in infrastructure systems, focusing on storage.
  • Deep experience with distributed filesystems and caching layers.
  • Hands-on experience with cloud storage systems and performance tuning.

Responsibilities

  • Architect and maintain scalable distributed file systems.
  • Optimize read/write I/O performance for large-scale datasets.
  • Develop intelligent caching systems to reduce latency.

Skills

Distributed Filesystems
Caching Layers
Systems Programming (C++, Go, Rust)
Data Locality Optimization
Cloud Storage Systems
High-Performance Computing
Communication Skills

Education

Bachelor’s or Master’s in Computer Science

Tools

Alluxio
Lustre
CephFS
Memcached
Redis
Kubernetes

Job description

Dyna Robotics is at the forefront of revolutionizing robotic manipulation with cutting-edge foundation models. Our mission is to empower businesses by automating repetitive, stationary tasks with affordable, intelligent robotic arms. Leveraging the latest advancements in foundation models, we're driving the future of general-purpose robotics—one manipulation skill at a time.

Dyna Robotics was founded by industry leaders who previously achieved a $350 million exit in grocery deep tech as well as top robotics researchers from DeepMind and Nvidia. Our team blends world-class research, engineering, and product innovation to drive the future of robotic manipulation. With $20mil+ in funding, we're positioned to redefine the landscape of robotic automation. Join us to shape the next frontier of AI-driven robotics.

Position Overview

We are seeking a Staff Infrastructure Engineer to lead efforts in designing and optimizing distributed storage systems and caching layers that power our large-scale training and data processing pipelines. This role is critical for ensuring high-throughput, low-latency access to vast datasets across a growing fleet of cloud and on-prem GPUs.

You will focus on building scalable, fault-tolerant storage solutions and intelligent caching strategies to accelerate model iteration and enable real-time data streaming across the ML training stack. While ML experience is not required, you should be passionate about solving complex data movement and storage challenges in high-performance computing environments.

Key Responsibilities
  • Architect and maintain high-throughput, scalable distributed file systems (e.g., Lustre, Alluxio, CephFS, or similar).
  • Optimize read/write I/O performance for large-scale ML and robotic sensor datasets.
  • Design systems that ensure high availability, data integrity, and low-latency access across nodes and regions.
  • Develop intelligent caching systems to reduce latency and cloud storage costs (e.g., tiered caching across RAM, NVMe, object stores).
  • Implement prefetching and eviction strategies based on workload patterns.
  • Work with researchers to identify data bottlenecks and optimize throughput for common access patterns.
  • Lead the design and deployment of data infrastructure that scales to petabytes of logs, video, and training data.
  • Evaluate tradeoffs across object storage (e.g., S3, GCS), network-attached storage, and local disk solutions.
  • Build robust monitoring and alerting systems for storage health, throughput, and latency.
  • Design for failure recovery, redundancy, and data consistency in distributed environments.
  • Partner with ML engineers, data engineers, and platform teams to align infrastructure with evolving training and data needs.
  • Serve as a domain expert in storage, caching, and I/O optimization across the engineering organization.
Required Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field.
  • 7+ years of experience building and maintaining infrastructure systems, with 3+ years focused on storage or distributed systems.
  • Deep experience with distributed filesystems (e.g., Alluxio, Lustre, CephFS, HDFS) or caching layers (e.g., Memcached, Redis, custom).
  • Strong understanding of data locality, throughput optimization, and system bottlenecks in high-performance computing environments.
  • Hands-on experience with cloud storage systems (e.g., S3, GCS) and data access performance tuning.
  • Solid systems programming skills in C++, Go, or Rust.
  • Familiarity with job scheduling, Kubernetes, or HPC cluster management is a plus.
  • Clear communication skills and the ability to mentor junior engineers or collaborate across teams.
Preferred Qualifications
  • Prior experience working on large-scale data platforms, ML infrastructure, or robotics systems.
  • Contributions to open-source storage or caching systems.
  • Experience with infrastructure-as-code and container orchestration frameworks.
  • Competitive salary and equity in a seed-stage venture-backed startup
  • Comprehensive health, dental, and vision insurance
  • Daily catered lunches and dinners with a fully stocked kitchen
  • Professional growth and development through training, mentorship, and challenging projects

If you’re passionate about building data infrastructure that powers the next generation of intelligent robots, we’d love to hear from you.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Data Infrastructure Engineer

Dyna Robotics Inc.

Redwood City

On-site

USD 120,000 - 180,000

6 days ago
Be an early applicant

Staff DevOps Infrastructure Engineer

NMI

Schaumburg

Remote

USD 155,000 - 165,000

9 days ago

[Hiring] Staff Infrastructure Engineer @Sotheby's

Sotheby's

Remote

USD 100,000 - 160,000

15 days ago

Staff AI Infrastructure Engineer

WEX, Inc.

San Francisco

On-site

USD 147,000 - 195,000

8 days ago

Senior Staff Infrastructure Engineer - IaC

Advanced Micro Devices, Inc.

California

On-site

USD 150,000 - 200,000

8 days ago

Senior Staff Infrastructure Engineer - IaC

Advanced Micro Devices

San Jose

Hybrid

USD 120,000 - 180,000

8 days ago

Staff Infrastructure Engineer

Pendo

San Francisco

On-site

USD 177,000 - 222,000

9 days ago

Member of Technical Staff, Backend/Infrastructure Engineer

Coframe

Remote

USD 160,000 - 220,000

21 days ago

Staff Machine Learning Infrastructure Engineer

Dyna Robotics Inc.

Redwood City

On-site

USD 120,000 - 180,000

15 days ago