Member of Technical Staff, Data Engineering

Ring Inc

Tremblay-en-France

Hybrid

EUR 70,000 - 90,000

Full-time

Posted today

Job summary

A leading technology company is seeking a Member of Technical Staff, Data Engineering, to shape AI systems by managing and optimizing data pipelines. The role requires strong Python skills and familiarity with data processing frameworks. Candidates will collaborate in a fast-paced environment, directly impacting AI model performance. Benefits include a supportive work culture and comprehensive health coverage. Flexible remote options are available.

Benefits

Open and inclusive work culture
Weekly lunch stipends
Comprehensive health benefits
100% parental leave top-up
Personal enrichment budget
Remote-flexible work options
6 weeks of vacation

Qualifications

  • Strong software engineering skills, particularly in Python.
  • Experience with data processing frameworks such as Apache Spark, Apache Beam, or Pandas.
  • Passion for combining research and engineering to solve complex data challenges in AI.

Responsibilities

  • Design, develop, and maintain scalable data pipelines for diverse datasets.
  • Conduct data ablations and experiments to assess quality.
  • Collaborate with researchers and engineers to meet evolving needs of AI models.

Skills

Strong software engineering skills
Experience building and maintaining large-scale data pipelines
Familiarity with data processing frameworks
Experience working with large-scale web datasets
Excellent collaboration and communication skills

Tools

Apache Spark
Apache Beam
Pandas

Job description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Member of Technical Staff, Data Engineering in the United States, the United Kingdom, France, or Canada.

This role offers the opportunity to shape the foundation of cutting‑edge AI systems by managing and optimizing the data pipelines that power advanced language models. You will design and build scalable pipelines, curate high‑quality datasets, and ensure data is structured for optimal training efficiency. Working with diverse sources like web data, code repositories, and multilingual corpora, you will bridge research and engineering, enabling faster, more reliable model training. This position operates in a collaborative, fast‑paced environment where your contributions directly influence AI model performance and innovation. Flexible remote options are available, and you will interact closely with researchers, engineers, and cross‑functional teams globally.

Accountabilities

  • Design, develop, and maintain scalable data pipelines for ingestion, parsing, filtering, and optimization of diverse datasets.
  • Conduct data ablations and experiments to assess quality and improve model performance.
  • Implement robust data modeling techniques to structure and format datasets for efficient training.
  • Research and apply innovative data curation strategies to support advancements in natural language processing.
  • Collaborate with researchers, engineers, and cross‑functional teams to meet the evolving needs of AI models.
  • Ensure datasets are diverse, reliable, and optimized for throughput and accelerator utilization.

Requirements

  • Strong software engineering skills, particularly in Python.
  • Experience building and maintaining large‑scale data pipelines.
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or equivalent.
  • Experience working with large‑scale web datasets (e.g., CommonCrawl).
  • Passion for combining research and engineering to solve complex data challenges in AI.
  • Excellent collaboration and communication skills to work effectively across global teams.

Nice to Have

  • Publications at top‑tier AI and ML venues (NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, COLING, ACL, EMNLP).
  • Experience with multilingual corpora and diverse data sources.
  • Background in NLP or generative AI research.

Benefits

  • Open and inclusive work culture with global collaboration opportunities.
  • Weekly lunch stipends, in‑office meals, and snacks.
  • Comprehensive health, dental, and mental health benefits.
  • 100% parental leave top‑up for up to six months.
  • Personal enrichment budget for arts, culture, fitness, well‑being, and workspace improvements.
  • Remote‑flexible work options with offices in Toronto, New York, San Francisco, London, and Paris, including co‑working stipends.
  • 6 weeks (30 working days) of vacation.