Perception Data Engineer (Visual Data Pipeline)

ANYbotics

Barcelona

On-site

EUR 45,000 - 70,000

Full-time

Posted today

Job summary

A leading robotics company in Barcelona is seeking an experienced data engineer to design and maintain scalable data pipelines for autonomous systems. The role involves collaborating with machine learning engineers and supporting data lifecycle processes. Ideal candidates will have over 3 years of experience in production data pipelines, strong skills in Python, and familiarity with cloud storage solutions. This position does not allow remote work, and candidates must have the legal right to work in Spain.

Qualifications

  • 3+ years engineering experience in data pipelines or ETL systems.
  • Strong engineering skills in Python, including libraries like pandas.
  • Experience with dataset versioning and cloud storage.

Responsibilities

  • Design and maintain scalable data pipelines and ETL workflows.
  • Implement dataset versioning and schema management.
  • Integrate annotation tools and support labeling workflows.

Skills

Python scripting
Building production data pipelines
ETL systems
Dataset versioning
Large-file management
Cloud object storage

Tools

CVAT
Label Studio
Spark
Airflow
AWS
GCP
Job description
The Opportunity

You’ll build and operate the data plumbing that our perception models need: ingestion, versioned storage, ETL, labeling integration, and reliable production pipelines for training and inference.

Market & Technology

ANYbotics transforms industrial plants in the (renewable) energy, process, and utility sectors by introducing robotics to a wide range of novel applications that were previously out of reach. Our mobile robot ANYmal uses legs for extreme mobility in complex environments and camera- and LIDAR-based sensing for full autonomy and obstacle avoidance, allowing it to perform jobs and deliver high-quality, consistent inspection results. We develop numerous customized hardware systems, including the entire robotic platform, actuators, sensors, inspection payloads, charging systems, and all related ANYbotics electrical hardware.

About Us

ANYbotics is a leading robotics company specializing in advanced autonomous systems. With a successful Series B financing round recently closed, we are poised for rapid growth and international expansion. Our mission is to revolutionize the robotics industry through cutting‑edge technology and innovation. As we embark on this exciting journey, we are seeking a dynamic and experienced person to join our team and help us shape the future of autonomous robotic inspections.

Your contributions
  • Design, build and maintain scalable data pipelines and ETL workflows that ingest raw images, sensor metadata, and labels (both real and synthetic).
  • Implement dataset versioning, schema management, and reproducible data snapshots to support experiments and audits.
  • Integrate annotation tools (CVAT / Label Studio), manage labeling workflows and quality‑control tooling, and support label QA processes.
  • Build data validation and monitoring checks (file integrity, label sanity, distribution drift alerts) and automate remediation where possible.
  • Provide clean, ready-to-use datasets and data loaders for ML engineers; optimize data access patterns for training (sharding, caching, prefetching).
  • Collaborate with MLOps to automate scheduled retraining triggers and with the Synthetic Data Engineer to merge synthetic data streams.
Your profile
  • 3+ years engineering experience building production data pipelines or ETL systems.
  • Strong Python scripting and engineering skills (pandas, pyarrow, boto3 or equivalent).
  • Experience with dataset versioning or large‑file management (DVC, Git‑LFS, or similar) and cloud object storage (S3).
  • Familiarity with annotation tooling and workflows for image data (CVAT / Label Studio).
  • Basic understanding of ML training data needs (batching, sharding, augmentation integration).
  • Prior work supporting computer‑vision teams (image pipelines, preprocessing, TFRecord or custom dataset formats).
Bonus points
  • Experience with big‑data tooling (Spark, Airflow/Prefect) or columnar formats (Parquet).
  • Knowledge of data privacy/compliance practices and tooling.
  • Cloud infra know‑how (AWS/GCP) and experience setting up reproducible data pipelines.

We’re an international robotics company with the A‑team spread across the globe. This role gives you the opportunity to be part of growing our EU presence while staying connected to our global team. To be eligible, you’ll need to have the legal right to live and work in Spain. Ideally you reside in Barcelona or are open to relocating. This is not a remote position.