Enable job alerts via email!

Senior Site Reliability Engineer

Xtremepush

London

Hybrid

GBP 60,000 - 100,000

Full time

8 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Senior Site Reliability Engineer to enhance its scaled SaaS infrastructure. In this hybrid role, you'll collaborate with a dynamic team to ensure reliability, security, and scalability while managing rapid growth. Your expertise in distributed systems, cloud technologies, and performance tuning will be crucial in defining key metrics and resolving complex technical challenges. Join a forward-thinking organization that values diversity and empowers you to make a significant impact on customer engagement through cutting-edge technology.

Qualifications

  • 5+ years experience in SaaS platform support on Linux in cloud environments.
  • Proficient in PHP, GoLang, MySQL, Opentelemetry, and Prometheus.

Responsibilities

  • Act as a senior member of the SRE team, managing backlog and code reviews.
  • Monitor environments and respond to alerts during daily operations.

Skills

SaaS Platform Support
Linux
Distributed Systems
PHP
MySQL
AWS
GoLang
Opentelemetry
Prometheus
CI/CD

Tools

Terraform
Clickhouse
Kafka
Pulsar
Python

Job description

Join to apply for the Senior Site Reliability Engineer role at Xtremepush

4 weeks ago Be among the first 25 applicants

About The Role

We are seeking a Senior SRE with experience working with scaled SaaS production infrastructure. The successful candidate will work as part of a team focused on site reliability, security, and scalability, as we manage our rapid growth.

The ideal candidate will be proactive and driven, excelling at understanding and working on complex technical solutions requiring performance and optimization at scale. Our core technologies include PHP, MySQL, Vue.js, and AWS. Participating in an on-call roster is required.

This is a hybrid role (2 days in the office).

Key Responsibilities
  1. Act as a senior member of the SRE team, supporting activities including backlog management, scoping requirements, peer review of code, and providing feedback.
  2. Represent the team in management and stakeholder meetings, ensuring best practices and suggesting process improvements.
  3. Investigate, test, and resolve technical problems, collaborating with engineers to deliver core product functionality.
  4. Define SLOs, SLIs, and SLAs for key metrics indicating health, security, stability, and uptime of environments.
  5. Monitor environments and respond to alerts and issues during daily operations.
  6. Participate in an on-call rota for priority-1 alarms.
  7. Upgrade and improve operational processes for performance, stability, and cost efficiency.
  8. Work with platform engineering to plan application and infrastructure releases and configuration changes.
  9. Collaborate with internal teams and external vendors to troubleshoot and resolve complex problems.
Your Experience And Qualifications
  1. 5+ years of experience supporting a scaled SaaS platform on Linux in a cloud environment.
  2. Experience with high-performance systems and solving complex engineering problems at scale.
  3. Understanding of distributed systems design, including asynchronous tasks, event-driven architecture, caching, and queue processing.
  4. Ability to apply distributed systems knowledge to resolve scaling constraints and perform performance tuning.
  5. Strong communication skills to explain complex solutions simply.
  6. Proficiency in PHP, GoLang, MySQL, Opentelemetry, Prometheus.
  7. Experience with AWS, Terraform, CI/CD.
  8. Experience with technologies like Clickhouse, Kafka, Pulsar, Python.
  9. Knowledge of networking and security concepts.
  10. Interest or experience in marketing technologies, big data, analytics, AI, or machine learning.
Location

Ireland (Dublin) or UK (London or Milton Keynes).

About Us

Headquartered in Ireland with offices in the UK and US, Xtremepush is an Omnichannel Customer Engagement Platform powered by a built-in CDP. It enables companies to build, grow, and retain customer relationships through personalized communication, providing actionable customer insights to drive engagement, conversion, and revenue across channels.

We value diversity and are an equal opportunity employer, welcoming applicants regardless of race, religion, gender, sexual orientation, nationality, disability, or age.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

Auros

Greater London

Remote

GBP 60,000 - 100,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer | London, UK

Tradeweb LLC

London

Remote

GBP 60,000 - 100,000

27 days ago

Senior Site Reliability Engineer

Xtremepush

London

On-site

GBP 60,000 - 100,000

4 days ago
Be an early applicant

Senior Site Reliability Engineer

Ebury

London

Hybrid

GBP 60,000 - 100,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

Prima

London

On-site

GBP 60,000 - 100,000

7 days ago
Be an early applicant

Remote Senior Site Reliability Engineer Manager (Remote)

Remotestar

London

Remote

GBP 80,000 - 100,000

30+ days ago

Senior Site Reliability Engineer London, United Kingdom

Reddit, Inc.

London

On-site

GBP 60,000 - 100,000

6 days ago
Be an early applicant

Senior Reliability Engineer

Mission Zero Technologies

Greater London

On-site

GBP 50,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

TN United Kingdom

Remote

GBP 60,000 - 100,000

10 days ago