Activez les alertes d’offres d’emploi par e-mail !

Staff SRE - Observability (x/f/m)

Doctolib

Paris

Hybride

EUR 70 000 - 95 000

Plein temps

Il y a 27 jours

Résumé du poste

Une entreprise leader dans le secteur de la santé recherche un Staff Site Reliability Engineer pour rejoindre son équipe Core Reliability & Observability. Le candidat idéal aura une solide expérience en ingénierie fiabilité et sera capable de diriger la stratégie d'observabilité tout en mentorant d'autres ingénieurs. Cette opportunité offre un environnement de travail flexible et de nombreux avantages, y compris un programme de bien-être et un soutien pour la mobilité internationale.

Prestations

Assurance santé gratuite
Jusqu'à 14 jours de RTT
Politique de travail flexible
Programme de bien-être
Chèques repas
Subvention de transport public
Soutien à la relocalisation

Qualifications

  • Expérience extensive (8+ ans) dans des rôles SRE ou plat-form engineering.
  • Compétences en outils d'observabilité et en architectures cloud.
  • Capacité à diriger et à influencer des équipes techniques.

Responsabilités

  • Diriger la stratégie d'observabilité à travers la plateforme.
  • Mentorat et coaching technique pour les ingénieurs seniors.
  • Améliorer l'expérience d'alerte et contribuer aux initiatives de fiabilité.

Connaissances

Observabilité
Communications claires
Mentorat

Outils

Fluent Bit
OpenTelemetry
Prometheus

Description du poste

About the role

As a Staff Site Reliability Engineer within the Core Reliability & Observability team, you will play a pivotal role in shaping the company’s observability strategy and ensuring our platform remains reliable, debuggable, and scalable. This role sits at the intersection of infrastructure, developer experience, and product engineering, with a particular focus on building and evolving the foundations of logging, metrics, tracing, and alerting across the organization.

You’ll act as a technical leader and strategic partner to SREs, software engineers, and product teams, guiding decisions, mentoring engineers, and driving cross-cutting initiatives that elevate our operational maturity.

What you will do
  • Lead the observability strategy across the platform, with an emphasis on building scalable, developer-friendly logging and tracing capabilities.
  • Identify and lead large-scale cross-cutting reliability initiatives, including improvements to our incident detection, response, and postmortem analysis capabilities.
  • Take part in the on-call rotation, and actively contribute to improving our on-call experience by refining alerting, reducing noise, and ensuring actionable telemetry.
  • Serve as a mentor and technical coach to senior engineers, helping elevate the craft of reliability engineering across the company.
  • Influence strategic decisions by providing technical guidance to leadership and representing the observability discipline in architectural reviews and platform discussions.
Who you are

If you don’t meet all the requirements below but believe this opportunity matches your expectations and experience, we still encourage you to apply!

  • Extensive experience (8+ years) in SRE, platform engineering, or infrastructure roles within cloud-native environments (preferably AWS, GCP, or Kubernetes-based).
  • Deep expertise in observability tooling and architecture, such as:
  • Logging: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
  • Tracing: OpenTelemetry or proprietary APMs
  • Metrics: Prometheus, Thanos, Datadog, or equivalent
  • Strong systems engineering background with fluency in at least one backend programming language (e.g., Go, Python, Ruby).
  • Proven ability to lead through influence: setting technical direction, driving consensus, and mentoring engineers across teams.
  • Experience designing and operating high-scale telemetry pipelines and working with developers to improve instrumentation quality.
  • Comfortable balancing long-term architecture work with fast, iterative improvements.
  • Clear, concise communication skills—both written and verbal—with the ability to drive alignment in ambiguous environments.
What we offer
  • Free Health Insurance for you
  • Up to 14 days of RTT
  • A flexible workplace policy offering both hybrid and office-based modes
  • Flexibility days allowing to work in EU countries and the UK 10 days per year
  • Wellbeing program with free mental health and coaching through moka.care
  • Special support package for caregivers and workers with disabilities
  • Lunch voucher with Swile card
  • Work Council subsidy for sport club membership or creative activities
  • Bicycle subsidy
  • Public transportation reimbursement
  • Relocation support for international mobility
The interview process
  • 30min Phone screen with a Tech Recruiter
  • 1h30 Technical interview (SRE)
  • 1h30 System design interview
  • 1h15 Manager interview
Obtenez votre examen gratuit et confidentiel de votre CV.
ou faites glisser et déposez un fichier PDF, DOC, DOCX, ODT ou PAGES jusqu’à 5 Mo.