Job Search and Career Advice Platform

Enable job alerts via email!

Distributed System Engineer

Second Talent

Singapore

On-site

SGD 100,000 - 130,000

Full time

10 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A pioneering company in humanoid robotics is seeking a Distributed Systems Engineer to architect systems that will power fleets of robots. This role requires over 7 years of experience in software engineering and expertise in distributed systems and data infrastructure. The engineer will design systems capable of processing petabytes of data and collaborating with engineers to enhance robot autonomy. Competitive compensation and a full-time position are available.

Qualifications

  • 7+ years of professional software engineering experience in distributed systems, networking, or data infrastructure.
  • History of building and operating production-grade distributed systems at massive scale.
  • High proficiency in at least one programming language like Go, Rust, C++, or Python.

Responsibilities

  • Architect and scale distributed systems for handling petabytes of data.
  • Design high-throughput data ingestion and streaming pipelines for real-time connectivity.
  • Build large-scale training and inference platforms for multimodal models.

Skills

Distributed systems
Networking
Data infrastructure
Go
Rust
C++
Python
Kubernetes
gRPC
Kafka
S3
Ray
Job description

Job Title: Distributed Systems Engineer

Location: Singapore

Job Type: Full-time

About the Opportunity

Our client is a pioneering company in the humanoid robotics space. They are seeking a highly experienced Distributed Systems Engineer to architect and scale the foundational infrastructure that will power fleets of robots operating globally.

This is a career-defining role to work across the entire stack of robotics infrastructure from low-latency streaming and cloud simulation to large-scale training and telemetry pipelines. You will work directly with the company's founders and technical leadership to design the core systems that enable hundreds of robots to learn, share data, and operate as a unified fleet.

Key Responsibilities
  • Architect and scale distributed systems capable of handling petabytes of sensory, telemetry, and control data across both cloud and edge environments.
  • Design and build high-throughput data ingestion and streaming pipelines that connect the robot fleet to the cloud in real-time (handling video, LiDAR, joint states, and audio data).
  • Build the large-scale training and inference platforms for the multimodal foundation models that power robot autonomy and teleoperation.
  • Collaborate closely with ML and Robotics engineers to support hardware-in-the-loop (HIL) simulation, policy rollout, and continuous learning systems.
  • Develop the internal observability systems for fleet monitoring, ensuring reliability, and performance tuning at scale.
  • Lead critical infrastructure decisions, from distributed storage and consensus protocols to GPU orchestration and network reliability.
Required Qualifications
  • 7+ years of professional software engineering experience, with deep, proven expertise in distributed systems, networking, or data infrastructure.
  • A demonstrable history of building and operating production-grade distributed systems that handle massive scale and mission-critical workloads.
  • High proficiency in Go, Rust, C++, or Python, strong fundamentals in concurrency, networking, and systems performance.
  • Hands-on experience with cloud-native architectures and tools (e.g., Kubernetes, gRPC, Kafka, S3, Ray, or similar frameworks).
  • A strong, practical understanding of data consistency, replication, and fault tolerance in complex, heterogeneous environments.
  • An analytical mindset with a focus on building fast, measurable, and reliable systems.
  • Strong Plus: Experience with GPU-based workloads, model training, or edge compute orchestration.
Preferred Qualifications (Bonus Points)
  • Experience building distributed training or large-scale simulation systems.
  • Familiarity with the unique demands of real-time robotics workloads, including streaming from physical sensors and actuators.
  • Prior work with telemetry, observability, or in a production environment.
  • Contributions to open-source infrastructure, AI frameworks, or robotics middleware (e.g., ROS, gRPC, Mediasoup).
How to Apply

Interested candidates are invited to submit their resume, detailing their experience in building and scaling large-scale distributed systems.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.