Senior Data Engineer @ Entalpic

Breega

Paris

Hybrid

EUR 60,000 - 100,000

Full time

9 days ago

Job summary

Join a forward-thinking startup at the forefront of AI and chemistry, dedicated to accelerating the energy transition. As a key team member, you will design and maintain scalable data infrastructure, enabling data-driven decisions that promote sustainability. This innovative firm values simplicity and clarity, and you will collaborate with scientists to enhance data access and governance. With a focus on a supportive work culture, this role offers flexibility and meaningful rewards, including an equity package and paid time off aligned with French standards. If you're passionate about making a difference, this opportunity is perfect for you.

Benefits

Equity package (BSPCE)
Paid time off aligned with French standards
Dynamic and supportive work environment
Flexibility for remote work

Qualifications

  • 7+ years of experience in data engineering with diverse data types.
  • Expertise in data modeling, ETL, and data warehousing.

Responsibilities

  • Build and optimize scalable data pipelines for various data sources.
  • Implement secure data storage systems supporting analytics and ML workflows.

Skills

Python
Data Engineering
SQL
NoSQL
Data Modeling
ETL
Data Warehousing
Cloud Experience
Communication Skills

Education

Master’s or PhD in Computer Science or Data Engineering

Tools

Terraform
AWS
GCP
Git
Docker
Kubernetes

Job description

We are a dedicated team at the forefront of AI and chemistry, working to accelerate the energy transition. Our focus is on discovering new chemicals and materials that enable more sustainable practices in sectors with urgent decarbonization needs.

Specifically, we are developing a modern generative AI platform to discover new catalysts that optimize chemical reactions, significantly reduce CO₂ emissions, and help transform carbon-intensive industries.

As an early-stage, AI-driven startup with over €5M in funding, our approach is grounded in state-of-the-art academic research, with a strong focus on simplicity, clarity, and constant optimization.

Join Entalpic to be part of a passionate, fast-growing team united by the belief that technology can drive meaningful impact toward a more sustainable future.

Entalpic is committed to equal opportunity employment and a diverse, inclusive workplace. We encourage applications from all backgrounds—even if you don’t meet every requirement. If you’re passionate about our mission and think you can contribute, we want to hear from you.

Reporting & Job Location

You will report to the CTO of Entalpic and be based in our Paris office.

Mission Highlights

As a key team member, you will contribute to two main areas:

  1. Data Infrastructure Development: Design, build, and maintain scalable data infrastructure to integrate diverse data sources (text, simulations, experiments) in support of ML and LLM applications.
  2. Lead the development of internal tools to enable efficient, AI-enhanced access to data and promote a data-centric culture across the organization.

Role & Responsibilities

  • Data Engineering: Build and optimize scalable data pipelines for simulation (e.g. DFT), textual (e.g. patents, papers), and experimental data (e.g. time series, imagery).
  • Data Storage Solutions: Implement and manage secure, scalable data storage systems supporting analytics and ML workflows.
  • Automation and Scripting: Create tools and scripts to automate data ingestion, transformation, and processing.
  • Data Governance and Lineage: Establish policies for data quality, lineage tracking, and regulatory compliance.
  • Infrastructure Support: Work closely with DevOps to integrate solutions with system architecture (AWS/GCP).
  • Collaboration and Support: Partner with scientists and experts to meet data needs and enable data-driven decisions.
  • Open Source Engagement: Contribute tools and learnings to open-source projects to support the broader community.

Profile

  • Master’s or PhD in Computer Science, Data Engineering, or a related field
  • 7+ years of experience in data engineering, with proven experience managing diverse data types and building scalable architectures
  • Proficiency in at least two programming languages (e.g., Python, Rust, Scala, Go)
  • Strong experience with both SQL (MySQL, PostgreSQL) and NoSQL (MongoDB)
  • Deep understanding of data modeling, ETL, and data warehousing
  • Cloud experience (AWS or GCP) and infrastructure-as-code tools (e.g., Terraform)
  • Strong communication skills in English
  • Ability to thrive in a fast-paced startup environment

Bonus Skills

  • Experience with ML pipelines and AI infrastructure
  • Contributions to open-source projects
  • Familiarity with scientific data, especially in materials science

Expertise

  • Programming: Strong in Python and at least one other language, with best practices in version control (Git)
  • Data Management: Expertise in both SQL and NoSQL for large-scale data processing
  • Cloud Platforms: Proficient with AWS or GCP and infrastructure-as-code (Terraform)
  • DevOps Collaboration: Comfortable with CI/CD and containerization (Docker, Kubernetes)
  • Open Source: Experience in contributing to and maintaining open-source libraries and communities

We are a no-nonsense startup focused on sustainable work culture and meaningful rewards. We offer:

  • Equity package (BSPCE)
  • Paid time off aligned with French standards
  • A dynamic and supportive work environment with flexibility for remote work