
Enable job alerts via email!
A leading robotics research firm in Singapore is seeking a Data Infrastructure Engineer to architect and maintain the data platform supporting their robot learning stack. You'll ensure high-quality data is captured and made available for training large-scale models. Candidates should have skills in distributed systems and proficiency in programming languages like Python, Go, or C++. Experience with Kubernetes and data pipelines is also required.
Our robots generate massive multi-modal data streams, from video, audio, proprioception, to control trajectories. To learn from this at scale, we're building a robot data engine that turns real world experiences into structured training data for our foundation models. This role sits at the core of that system, creating the data and compute infrastructure that makes large-scale embodied learning possible.
You will architect and maintain the data platform powering our robot learning stack, ensuring high-quality fleet data is captured, synchronized, labeled, and available for large-scale training. You will work across edge devices, on-prem clusters, and cloud infrastructure to build robust, automated, and scalable data flows.