Job Search and Career Advice Platform

Aktiviere Job-Benachrichtigungen per E-Mail!

(Senior) Data Engineer - Applied ML & Distributed Compute (m/f/d)

ECDB GmbH

Hamburg

Hybrid

EUR 60.000 - 80.000

Vollzeit

Heute
Sei unter den ersten Bewerbenden

Erstelle in nur wenigen Minuten einen maßgeschneiderten Lebenslauf

Überzeuge Recruiter und verdiene mehr Geld. Mehr erfahren

Zusammenfassung

A data-driven eCommerce company in Hamburg is seeking a skilled Data Engineer to own and optimize data processing pipelines. This role involves working with large-scale data and implementing machine learning models. Candidates should have over 4 years of experience in Python and distributed computing frameworks. The company offers attractive benefits including flexible working hours and opportunities for personal growth. The modern office in Hamburg’s historic Speicherstadt fosters a unique working atmosphere.

Leistungen

Attractive career opportunities
Flexible working hours
Continuous learning and development
Modern office ambiance

Qualifikationen

  • 4+ years of relevant professional experience.
  • Proven track record in python-heavy data processing.
  • Experience with distributed compute frameworks on object-storage datasets.
  • Practical ML experience including training and deployment.
  • Able to handle messy, large-scale data.

Aufgaben

  • Own large-scale data processing pipelines and batch processing.
  • Design and optimize distributed compute workloads.
  • Train, deploy and monitor ML models at scale.
  • Productionize models with batch inference and retraining.
  • Implement AI-assisted pipelines for classification or extraction.

Kenntnisse

Python
Machine Learning
Data Processing
Distributed Computing
Data Analysis

Tools

Spark
Dask
Ray
Jobbeschreibung
About us

ECDB – Shaping the Future of eCommerce with Data!
At ECDB, we firmly believe that data determines success in eCommerce. That’s why we provide leading companies like Amazon, Google, and PayPal with the most precise analyses and market insights. With billions of transactions as our foundation, we are developing one of the most comprehensive eCommerce data platforms worldwide.
Our team of over 50 experts combines cutting-edge technology with deep industry knowledge – and this is where you come in! If you're eager to shape the future of eCommerce through data-driven insights, ECDB is the perfect place for you.

Tasks
  • Own large-scale data processing pipelines, including batch processing of raw, unstructured data
  • Design and optimize distributed compute workloads to transform large-scale web and natural language data into structured, production-ready datasets
  • Train, deploy and monitor ML-models at scale (e.g., NLP models, classifiers and enrichment use-cases)
  • Productionize models: Batch inference & retraining pipelines
  • Implement AI-assisted pipelines (e.g. LLM-based classification or extraction)
Requirements
  • Several years of relevant professional experience (4+ years)
  • Proven track record in python-heavy data processing
  • Prior experience with distributed compute frameworks (Spark / Dask / Ray) on object-storage based datasets (e.g., Parquet on S3-compatible storage)
  • Practical ML experience (training, evaluation, deployment, retraining)
  • Ability to work with messy, large-scale data and turn it into reliable outputs
Benefits
  • Attractive career opportunities in a rapidly growing company
  • Short decision-making processes and plenty of room for personal responsibility
  • An ambitious, open-minded team with a passion for smart solutions
  • A strong focus on continuous learning and development
  • Flexible working hours, the option to work from home, and a healthy work–life balance
  • A modern office in Hamburg’s historic Speicherstadt, offering a unique atmosphere
Hol dir deinen kostenlosen, vertraulichen Lebenslauf-Check.
eine PDF-, DOC-, DOCX-, ODT- oder PAGES-Datei bis zu 5 MB per Drag & Drop ablegen.