Job Search and Career Advice Platform

Activez les alertes d’offres d’emploi par e-mail !

Member of Technical Staff, Data Engineering

Ring Inc

À distance

EUR 40 000 - 60 000

Plein temps

Hier
Soyez parmi les premiers à postuler

Générez un CV personnalisé en quelques minutes

Décrochez un entretien et gagnez plus. En savoir plus

Résumé du poste

A data engineering firm is seeking a Member of Technical Staff for Data Engineering, located in Tremblay-en-France. This role focuses on designing and maintaining scalable data pipelines for AI systems, requiring strong Python skills and experience with data processing frameworks. With an open work culture, the company offers remote flexibility, benefits like weekly lunch stipends, comprehensive health coverage, and generous vacation days. Ideal candidates should excel in collaboration and have a passion for solving complex data challenges.

Prestations

Weekly lunch stipends
Comprehensive health benefits
6 weeks of vacation
Personal enrichment budget
Remote-flexible work options

Qualifications

  • Strong software engineering skills, particularly in Python.
  • Experience building and maintaining large‑scale data pipelines.
  • Familiarity with data processing frameworks like Apache Spark, Apache Beam, or Pandas.

Responsabilités

  • Design, develop, and maintain scalable data pipelines.
  • Conduct data ablations and experiments to assess quality.
  • Implement robust data modeling techniques for efficient training.

Connaissances

Software engineering skills in Python
Experience building large-scale data pipelines
Familiarity with data processing frameworks
Experience with large-scale web datasets
Excellent collaboration and communication skills

Outils

Apache Spark
Apache Beam
Pandas
Description du poste

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Member of Technical Staff, Data Engineering in United States, United Kingdom, France, Canada.

This role offers the opportunity to shape the foundation of cutting‑edge AI systems by managing and optimizing the data pipelines that power advanced language models. You will design and build scalable pipelines, curate high‑quality datasets, and ensure data is structured for optimal training efficiency. Working with diverse sources like web data, code repositories, and multilingual corpora, you will bridge research and engineering, enabling faster, more reliable model training. This position operates in a collaborative, fast‑paced environment where your contributions directly influence AI model performance and innovation. Flexible remote options are available, and you will interact closely with researchers, engineers, and cross‑functional teams globally.

Accountabilities
  • Design, develop, and maintain scalable data pipelines for ingestion, parsing, filtering, and optimization of diverse datasets.
  • Conduct data ablations and experiments to assess quality and improve model performance.
  • Implement robust data modeling techniques to structure and format datasets for efficient training.
  • Research and apply innovative data curation strategies to support advancements in natural language processing.
  • Collaborate with researchers, engineers, and cross‑functional teams to meet the evolving needs of AI models.
  • Ensure datasets are diverse, reliable, and optimized for throughput and accelerator utilization.
Requirements
  • Strong software engineering skills, particularly in Python.
  • Experience building and maintaining large‑scale data pipelines.
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or equivalent.
  • Experience working with large‑scale web datasets (e.g., CommonCrawl).
  • Passion for combining research and engineering to solve complex data challenges in AI.
  • Excellent collaboration and communication skills to work effectively across global teams.
Nice to Have
  • Publications at top‑tier AI and ML venues (NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, COLING, ACL, EMNLP).
  • Experience with multilingual corpora and diverse data sources.
  • Background in NLP or generative AI research.
Benefits
  • Open and inclusive work culture with global collaboration opportunities.
  • Weekly lunch stipends, in‑office meals, and snacks.
  • Comprehensive health, dental, and mental health benefits.
  • 100% parental leave top‑up for up to six months.
  • Personal enrichment budget for arts, culture, fitness, well‑being, and workspace improvements.
  • Remote‑flexible work options with offices in Toronto, New York, San Francisco, London, and Paris, including co‑working stipends.
  • 6 weeks (30 working days) of vacation.

We use an AI‑powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top‑fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre‑contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

Obtenez votre examen gratuit et confidentiel de votre CV.
ou faites glisser et déposez un fichier PDF, DOC, DOCX, ODT ou PAGES jusqu’à 5 Mo.