Activez les alertes d’offres d’emploi par e-mail !

Site Reliability Engineer (SRE)

Blackfluo.ai

Paris

À distance

EUR 80 000 - 100 000

Plein temps

Aujourd’hui
Soyez parmi les premiers à postuler

Résumé du poste

A tech company is seeking a Site Reliability Engineer (SRE) to scale and secure their infrastructure. The role involves maintaining AWS systems, developing CI/CD pipelines, and optimizing monitoring processes. Candidates should have over 5 years of experience in a similar role, with deep AWS knowledge. Enjoy 100% remote work, flexible hours, and being part of an international team focused on reliability and automation.

Prestations

100% remote work
Flexible hours
High-impact role with autonomy

Qualifications

  • 5+ years of experience as an SRE or similar role.
  • Deep knowledge of AWS services (EC2, ECS, RDS, Lambda, S3, etc.).
  • Proficient in infrastructure-as-code tools (Terraform, CloudFormation, etc.).

Responsabilités

  • Design, implement, and maintain scalable, resilient AWS infrastructure.
  • Develop and manage CI/CD pipelines and infrastructure-as-code.
  • Set up and optimize monitoring, alerting, and incident response processes.

Connaissances

AWS services
Infrastructure as code
Linux systems administration
CI/CD tools
Networking concepts

Outils

Terraform
GitLab CI
Prometheus
Grafana
Description du poste
About the job Site Reliability Engineer (SRE)

Job Description

Location: Full remote, EU timezone (CET +/- 2 hours)
Start Date: As soon as possible
Languages: English required

We are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of our production systems. Youll work closely with engineering teams to automate operations, improve monitoring, and design resilient systems.

Responsabilities:

  • Design, implement, and maintain scalable, resilient AWS infrastructure
  • Develop and manage CI/CD pipelines and infrastructure-as-code (Terraform or similar)
  • Set up and optimize monitoring, alerting, and incident response processes
  • Proactively identify and resolve performance, reliability, and security issues
  • Collaborate with development teams to integrate SRE best practices into their workflows
  • Conduct post-mortems and root cause analyses on incidents
  • Participate in on-call rotations to support 24/7 system reliability

Requirements:

  • 5+ years of experience as an SRE or similar role
  • Deep knowledge of AWS services (EC2, ECS, RDS, Lambda, S3, etc.)
  • Proficient in infrastructure-as-code tools (Terraform, CloudFormation, etc.)
  • Solid experience with Linux systems administration and networking concepts
  • Experience with CI/CD tools (GitLab CI, Jenkins, etc.)
  • Familiarity with observability tools (Prometheus, Grafana, Datadog, etc.)

Nice To Have:

  • Experience with container orchestration (ECS, EKS, or Kubernetes)
  • Understanding of security best practices in cloud environments
  • Exposure to incident management frameworks (SRE handbook, etc.)

Why Join Us:

  • 100% remote work with flexible hours
  • High-impact role with autonomy and ownership
  • Collaborative and international engineering team
  • Cutting-edge tech stack with strong focus on reliability and automation.
Obtenez votre examen gratuit et confidentiel de votre CV.
ou faites glisser et déposez un fichier PDF, DOC, DOCX, ODT ou PAGES jusqu’à 5 Mo.