Enable job alerts via email!

Senior Site Reliability Engineer

TN United Kingdom

London

On-site

GBP 60,000 - 100,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is on the lookout for a Senior Site Reliability Engineer to enhance their SaaS platform's reliability and scalability. This hybrid role involves collaborating with a dynamic team to tackle complex technical challenges while ensuring optimal performance across various environments. The ideal candidate will possess a robust background in cloud technologies and distributed systems, with a passion for innovation in a high-velocity setting. Join a forward-thinking company that values diversity and fosters a culture of inclusion, where your contributions will directly impact customer engagement and satisfaction.

Qualifications

  • 5+ years in engineering for scaled SaaS platforms in cloud environments.
  • Strong communication skills to explain complex solutions simply.

Responsibilities

  • Support the SRE team with backlog management and code reviews.
  • Investigate and resolve technical problems in production environments.

Skills

Cloud Technologies
PHP
MySQL
GoLang
Opentelemetry
Prometheus
Python
Distributed Systems Design
Networking Concepts
Big Data

Tools

AWS
Terraform
CI/CD
Clickhouse
Kafka
Pulsar

Job description

Social network you want to login/join with:

Senior Site Reliability Engineer, London
Client:

Xtremepush

Location:

London, United Kingdom

Job Category:

-

EU work permit required:

Yes

Job Reference:

45acbbf04385

Job Views:

6

Posted:

02.04.2025

Expiry Date:

17.05.2025

Job Description:
About the Role

We are seeking a Senior SRE with experience of working with scaled SaaS production infrastructure. The successful candidate will work as part of a team focused on site reliability, security, and scalability, as we manage our rapid growth.

The ideal candidate will be a proactive and driven individual, who excels at understanding and working on complex technical solutions requiring performance and optimisation at scale. Our core technologies include PHP, MySQL, Vue.js and AWS. Participating in an on-call roster is required as part of this role.

This is a hybrid role (2 days in the office). #LI-Hybrid

Key Responsibilities
  • Act as a senior member of the SRE team, supporting activities including the backlog and workload of the team, scoping requirements, peer review of code, providing feedback to the rest of the team.
  • Represent the team in management and stakeholder meetings. Ensure best practices are kept, and suggest improvements to our development processes where you see gaps.
  • Investigate, test, and resolve technical problems, working closely with other engineers to deliver core product functionality.
  • Defining SLOs, SLIs, and SLAs for key metrics that indicate the health, security, stability and uptime of production, staging and development environments.
  • Monitoring the above environments and reacting to alerts and issues that may arise in day-to-day operation of their product line.
  • Participate in an on-call rota for priority-1 level alarms with the rest of the Platform teams.
  • Ongoing upgrades and improvements to operational processes to optimise performance, stability and cost.
  • Working with the platform engineering team to contribute to the planning of how we carry application/infrastructure releases and configuration changes.
  • Interact with internal teams and external 3rd party vendors to troubleshoot and resolve complex problems.
Your Experience and Qualifications
  • 5+ years experience in an engineering role responsible for supporting a scaled SaaS platform running on Linux in a cloud environment.
  • Experience working with high-performance systems, and solving complex engineering problems at scale (our platform processes ~100 Billion messages per year).
  • Understanding of distributed systems design – including asynchronous tasks, event driven architecture, scheduling, caching and queue processing.
  • Ability to apply distributed systems design knowledge to resolve scaling constraints. The capability to carry out performance tuning from the API to Application to Database layer of the platform.
  • Strong communication skills and ability to explain complex technical solutions simply to others.
  • Strong understanding of PHP, GoLang, MySQL, Opentelemetry, Prometheus.
  • Experience with Cloud and DevOps technologies (AWS, Terraform, CI/CD etc.).
  • Experience with specific technologies in our stack: Clickhouse, Kafka, Pulsar, Python.
  • Experience with networking and security concepts.
  • Interest or experience with marketing technologies.
  • Interest or experience with big data, data analytics, AI and machine learning.
Location

Ireland (Dublin) or UK (London or Milton Keynes)

About us

Headquartered in Ireland with offices in the UK and US, Xtremepush is an Omnichannel Customer Engagement Platform powered by a built-in CDP. It enables high-velocity companies to build, grow, and retain strong customer relationships through personalised, relevant, and timely communication. With a true single customer view at its core, Xtremepush provides actionable customer intelligence that drives engagement, conversion, and revenue across all channels, while putting customer retention first.

At Xtremepush, we believe that diversity adds incredible value to our teams, our products, and our culture. We don’t just accept differences, we celebrate it, we support it, and we thrive on it for the benefit of our employees, our products and our community. As an equal opportunity employer, we stay true to our mission by ensuring that our place can be anyone’s place regardless of race, religion, gender, sexual orientation, national origin, disability or age.

Please note that if you are NOT a passport holder of the country for the vacancy you might need a work permit.

Bank or payment details should not be provided when applying for a job. Eurojobs.com is not responsible for any external website content. All applications should be made via the 'Apply now' button.

Created on 02/04/2025 by TN United Kingdom

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.