Senior Site Reliability Engineer

Blip

United Kingdom

Remote

GBP 70,000 - 100,000

Full time

3 days ago

Job summary

A leading tech company in the sports entertainment field seeks a Senior Site Reliability Engineer to enhance their operations. This role involves overseeing team performance, service lifecycle management, and ensuring system reliability. Ideal candidates are experienced in team management and technical problem-solving, with expertise in modern cloud technologies.

Qualifications

Deep familiarity with release and build pipelines (e.g., Jenkins, GitHub Actions).
Strong background in monitoring and tracing within distributed systems.
Experience in containerized microservices deployment (e.g., AWS, GCP).

Responsibilities

Oversee service lifecycle from design to deployment and operation.
Engage in root cause investigations and support services pre-launch.
Maintain live services through monitoring health and performance.

Skills

Team Management

Incident Management

Performance Optimization

Programming Languages

Monitoring Distributed Systems

Agile/Scrum Methodologies

Blip is a leading tech company focused on software engineering solutions for sports entertainment.

We operate at scale. As part of Flutter Entertainment, we play an essential role in the Group's goal of becoming the global leader in online sports betting and iGaming, developing innovative products and platforms for over 14 million monthly customers worldwide.

We are serious about Tech. We are problem-solvers with big ambitions, keeping a people-first mindset at the core of our work. We prioritize flexibility as we strive to deliver the best technological products and tackle the greatest industry challenges.

Recognizing that everyone brings their own strengths, backgrounds and new perspectives, we empower you to be yourself. That uniqueness shapes the culture of belonging we are so proud of.

The Role

We are seeking a motivated and experienced senior engineer to join our dynamic organisation. As a Senior Site Reliability Engineer in our UK&I division, you will be responsible for overseeing a group of employees, providing direction and support to ensure goals are met and operations run smoothly. If you have a strong background in team management and are ready to take on a new challenge, we want to hear from you. Come be a part of our team and make a positive impact on our organisation’s success.

What You’ll Be Doing

Engage in and improve the whole lifecycle of services, from design through deployment, operation, and refinement.
Take an active part in investigating, identifying, and (where necessary) resolving the root causes of production problems.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Take an active part in performance and capacity testing.
Optimize reliability monitoring and alerting.
Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
Iteratively audit performance and reliability vulnerabilities.
Define and revise Service Level Indicators (SLIs).
Practice sustainable incident response and blameless postmortems.

What You’ll Bring

Deep familiarity with building and troubleshooting release and build pipelines (e.g. Jenkins, Buildkite, GitHub Actions)
Experience implementing creative approaches to monitoring distributed systems while leveraging industry best practices (e.g. instrumenting a tagging taxonomy across disparate systems)
Experience building, managing, and deploying an application that uses containerized microservices in a distributed infrastructure (e.g. AWS, GCP, self-hosted cloud)
Experience leveraging new technologies when it best serves a business need
Comprehensive understanding of incident management best practices
Opinionated and knowledgeable approach to implementing industry best practices
Demonstrated experience developing teams, encouraging growth, serving as a technical mentor and leader
Strength and comprehension in at least one programming language (e.g. Java, Python, Scala, Kotlin)
Experience making large directional technical decisions (e.g. deciding which technology or pattern to create or leverage)
Experience being “on-call” for a service, and familiarity with incident notification tooling (e.g. PagerDuty, Opsgenie)
Comprehensive understanding of SRE principles (e.g. working knowledge of the Google SRE book)
Demonstrated strength in leading a project in an Agile/Scrum environment
Thrives in a diverse work environment

We'd Like You To Have Mastered

Experience managing complex telemetry solutions which directly contributed to overall reliability
Experience designing greenfield solutions leveraging Configuration Management/Infrastructure as Code tools (e.g. Chef, Puppet, Terraform)
Experience creating automated tooling that contributed to multiple teams' velocity
Demonstrated experience with project management best practices
Shows the ability to break down large technical concepts into effective communication with stakeholders from across the organization
Extensive knowledge of networking best practices, tools, and observability
Experience developing and deploying automated service configuration at the edge (e.g. CDN configuration, certificate renewal)
Experience consulting with teams and advising on their technology, workflows, dev tooling, monitoring, and alerting best practices
Identified the need for and led development of automation that significantly reduced toil (e.g. deployment pipelines, distributed dev environments)
Built and maintained a system and culture that supported and implemented SLOs
Has been a thought leader contributing to the broader industry conversation about SRE principles and topics (e.g. speaking at conferences)

This is what you should have. What do we have, you ask? Well... you can check our amazing perks & benefits right here!

So... are you in?

Equal opportunities

At Blip, we are committed to creating a diverse and inclusive workplace. We strongly encourage people from all backgrounds, ways of thinking, and working to apply.
We are committed to including everyone regardless of their race, disability, age, gender identity, sexual orientation, and religion.
Everyone brings different perspectives and experiences; you don’t have to meet all the requirements listed to apply for this role.

If you need any adjustments to apply for the position and to ensure this role aligns with your needs, please send an email to accommodations@blip.pt.

We will only respond to inquiries related to disabilities.
