Data Engineer – AI Compliance

All Cares

France

Hybrid

EUR 45 000 - 65 000

Full-time

Posted 21 days ago

Job Summary

A technology company in France is seeking a skilled Data Engineer to build and scale systems for text and voice analysis. You will develop production-grade machine learning pipelines and collaborate with data scientists. Ideal candidates will have 3+ years of experience in data engineering, strong programming skills in Python or Scala, and familiarity with ML frameworks. This role offers the opportunity to work in a cutting-edge AI field in a remote, collaborative environment.

Benefits

Cutting-edge AI technology
Collaborative remote team
Career growth opportunities

Qualifications

  • 3+ years in data engineering or ML engineering roles.
  • Proven experience building ML pipelines from scratch.
  • Experience with text classification or voice analysis is a strong plus.

Responsibilities

  • Build and maintain end-to-end ML pipelines.
  • Design and maintain data storage and ETL workflows.
  • Work with data scientists to productionize models.
  • Build scalable and fault-tolerant pipelines.

Skills

Python
Scala
Java
Spark
Machine Learning
Data Engineering
Voice Analysis

Education

Degree in Data Engineering, Computer Science, or Machine Learning

Tools

PyTorch
TensorFlow
scikit-learn
Beam
Kafka

Job Description

About the Company

Cephalgo is a Strasbourg-based technology company founded in 2020, focused on developing AI solutions that ensure safety, compliance, and trust in human-AI interactions. Originally rooted in healthcare innovation, Cephalgo’s platform helps organizations securely analyze and monitor voice and emotion data while meeting privacy, security, and regulatory standards.

Backed by over €3 million in funding, Cephalgo combines deep expertise in voice AI, data protection, and compliance frameworks to help enterprises build and deploy responsible AI systems. The company collaborates with leading European partners in AI ethics, healthcare, and regulatory technology.

About the Role

We are seeking a Data Engineer to build and scale systems that support text and voice analysis, risk detection, and classifier training workflows. You will be responsible for production-grade machine learning pipelines (0 → 1) and collaborate closely with data scientists and AI engineers to deliver compliant, reliable data infrastructure and services.

What You’ll Do

Pipeline Development

  • Build and maintain end-to-end ML pipelines: data ingestion, preprocessing, feature extraction, model training, evaluation and deployment.
  • Develop reliable workflows specifically for voice and text analysis models.

Data Infrastructure

  • Design and maintain data storage, ETL workflows, and streaming/batch systems.
  • Implement data-quality, data-labeling, versioning and governance practices.

ML Collaboration

  • Work with data scientists and AI engineers to productionize models (e.g., text classifiers, anomaly-detection models, compliance-scoring models).
  • Support model monitoring and performance tracking once models are live.

Scalability & Reliability

  • Build robust, scalable, fault-tolerant pipelines.
  • Add observability layers: logging, monitoring, alerting for data and model pipelines.

Documentation & Governance

  • Document ETL processes, schemas, architecture and workflows.
  • Support compliance, data governance, and security standards in data pipelines and infrastructure.

You Might Be a Fit If You Have:

Experience

  • 3+ years in data engineering or ML engineering roles.
  • Proven experience building ML pipelines from scratch.
  • Experience with text classification, voice analysis or similar ML tasks is a strong plus.

Technical Skills

  • Strong programming skills (Python, Scala or Java).
  • Experience with big-data/streaming frameworks (Spark, Beam, Kafka or similar).
  • Familiarity with ML frameworks (PyTorch, TensorFlow, scikit-learn).
  • Experience with cloud data infrastructure and production deployment.

Soft Skills

  • Strong analytical and problem-solving skills.
  • Excellent collaborator and communicator, capable of working with data scientists, engineers and product/compliance stakeholders.
  • Detail-oriented, documentation-focused and comfortable in a fast-paced environment.

Education

  • Degree in Data Engineering, Computer Science, Machine Learning or related field (or equivalent experience).

Why Join Cephalgo?

  • Be at the intersection of cutting-edge AI/voice technology and compliance.
  • Make an impact by shaping a growing brand in a high-growth market.
  • Work with a collaborative, high-energy remote team driving forward-thinking solutions.
  • Grow your career and influence across product, marketing and business domains.