Enable job alerts via email!

Site reliability engineer

The Rundown AI, Inc.

London

On-site

GBP 70,000 - 100,000

Full time

3 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Une entreprise innovante recherche un ingénieur en fiabilité des sites pour rejoindre son équipe d'infrastructure cloud. Vous serez responsable de la conception et de la mise en œuvre de solutions pour garantir la fiabilité, la performance et la sécurité des systèmes critiques. Ce rôle offre l'opportunité de travailler avec des technologies de pointe et de mentor junior engineers dans un environnement dynamique.

Benefits

Generous PTO and company holidays
Comprehensive medical and dental insurance
Paid parental leave (12 weeks)
Fertility and family planning support
Early-detection cancer testing through Galleri
Competitive pension scheme and company contributions
Annual stipends for home office setup, wellness, and learning
Company and team off-sites
Competitive salary and stock options

Qualifications

  • 7 ans d'expérience en Site Reliability Engineering.
  • Diplôme en informatique ou domaine connexe.
  • Expertise en cloud (AWS, Azure, GCP) et outils de conteneurisation.

Responsibilities

  • Concevoir et maintenir l'infrastructure cloud pour assurer la haute disponibilité.
  • Automatiser la gestion de l'infrastructure avec Terraform et Python.
  • Développer des systèmes de surveillance pour identifier et résoudre les problèmes.

Skills

Site Reliability Engineering
Automation
Cloud Infrastructure
Communication

Education

Bachelor’s degree in Computer Science

Tools

Terraform
Python
AWS
Kubernetes
Prometheus
Grafana

Job description

About this role

We are seeking a foundational member for the Cloud infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer’s critical systems, proactively ensuring our high-ROI products reach customers seamlessly.

Your responsibilities:

  • Lead the design, implementation, and maintenance of Writer, Inc.’s cloud infrastructure to ensure high availability and performance.
  • Design and implement scalable cloud automation to support seamless deployment for enterprise customers.
  • Automate infrastructure provisioning and management using Terraform & Python.
  • Collaborate with development teams to optimize cloud resources and enhance system reliability.
  • Develop and maintain monitoring and alerting systems to proactively identify and resolve issues.
  • Conduct post-mortem analyses of system failures to identify root causes and implement preventive measures.
  • Optimize and scale cloud infrastructure to support growth and ensure cost efficiency.
  • Ensure security and compliance of systems, adhering to industry standards and regulations.
  • Mentor and guide junior engineers, fostering a culture of reliability and continuous improvement.
  • Stay current with emerging technologies and industry trends to improve SRE practices.

Is this you?

  • Proven expertise in Site Reliability Engineering with at least 7 years of experience.
  • Deep understanding of system architecture and infrastructure design for high availability and performance.
  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring.
  • Experience with cloud platforms like AWS, Azure, or GCP and their services for scalable systems.
  • Expertise in containerization (Docker, Kubernetes) and orchestration tools.
  • Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK Stack).
  • Ability to lead and mentor junior engineers in reliability best practices.
  • Excellent communication skills for effective collaboration.
  • Proactive in identifying and mitigating system failures and bottlenecks.

Preferred skills & experience:

  • Software engineering expertise
  • Terraform
  • Python
  • Kubernetes
  • Scala
  • AWS/GCP

Benefits & perks (UK full-time employees):

  • Generous PTO and company holidays
  • Comprehensive medical and dental insurance
  • Paid parental leave (12 weeks)
  • Fertility and family planning support
  • Early-detection cancer testing through Galleri
  • Competitive pension scheme and company contributions
  • Annual stipends for home office setup, wellness, and learning
  • Company and team off-sites
  • Competitive salary and stock options
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

TieTalent

London

Remote

GBP 70,000 - 85,000

Yesterday
Be an early applicant

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

JR United Kingdom

London Fields

Remote

GBP 50,000 - 80,000

2 days ago
Be an early applicant

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

JR United Kingdom

City Of London

Remote

GBP 60,000 - 90,000

2 days ago
Be an early applicant

Senior Site Reliability Engineer London, United Kingdom

NinjaOne, LLC

London

Remote

GBP 70,000 - 100,000

2 days ago
Be an early applicant

Senior Site Reliability Engineer

NinjaOne

London

Remote

GBP 70,000 - 100,000

2 days ago
Be an early applicant

Site Reliability Engineer

Stratospherec Limited

Greater London

Remote

GBP 70,000 - 85,000

7 days ago
Be an early applicant

Site Reliability Engineer (Home-based)

JR United Kingdom

London

Remote

GBP 60,000 - 80,000

13 days ago

Site Reliability Engineer

Attio Ltd

London

Remote

GBP 80,000 - 100,000

8 days ago

Sr. Software Engineer - Reliability, Ireland (Remote)

CrowdStrike Holdings, Inc.

London

Remote

GBP 70,000 - 100,000

5 days ago
Be an early applicant