Job Search and Career Advice Platform

Enable job alerts via email!

Senior SRE

Spliced

United Kingdom

On-site

GBP 55,000 - 75,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A dynamic gaming studio is seeking a Site Reliability Engineer to operate large-scale services and enhance online functionality. The role involves designing and maintaining distributed web services, monitoring systems, and ensuring reliable operations for millions of players. Ideal candidates have experience in cloud optimization, database management, and continuous deployment processes. The studio offers a collaborative environment, rich benefits, and recognition for hard work, fostering an inclusive atmosphere for all applicants.

Benefits

Sign on bonus
Well-being allowance
Generous pension benefits
Healthcare benefits

Qualifications

  • Experience operating large scale online software in a production environment.
  • Strong knowledge of distributed system architecture.
  • Familiarity with cloud network efficiency in major providers.
  • Strong general knowledge of database systems.
  • Experience with continuous integration/deployment of distributed systems.
  • Experience with monitoring and alerting on distributed systems.
  • Proficiency in at least one major programming language.
  • Familiarity with incident response for large scale systems.

Responsibilities

  • Design and maintain distributed web services, database schemas, and application configuration.
  • Implement and maintain runtime caching layers.
  • Oversee deployment processes and global content distribution infrastructure.
  • Design and maintain monitoring and alerting systems.
  • Collaborate with development teams on reliability.
  • Mitigate outages and regressions to minimize user impact.
  • Be part of the on-call team for first line defense.
  • Automate operations tasks to improve efficiency.

Skills

Operating large scale online software
Distributed system architecture knowledge
Cloud efficiency and optimization
Database system behavior knowledge
Continuous integration/deployment
Monitoring/alerting systems experience
Programming expertise
Incident response best practices

Tools

C++
Rust
CockroachDB
Postgres
Redis
Memcached
Unreal Engine
Job description

Let’s redefine what a game is and how we interact with them. We want to make games that everyone wants to play and invite the whole world into ours.

Together we’ll discover connection and innovation you’ve never experienced before and an amazing world you’ll never leave behind.

We are Spliced Inc. and we’re looking for a Site Reliability Engineer.

The Role

You will play a critical role by operating massively scalable distributed services and storage systems to support online functionality and persistence for millions of players. You will design and implement the necessary systems to provide world-class reliability and efficiency in production, and enable Spliced Inc. to provide a seamless and satisfying online experience to a massive global audience.

This is a unique and clean slate opportunity to design and build effective deployment, monitoring, and operational infrastructure without compromise. The ambitious scope and creative vision for the game provides plenty of room for professional growth through tackling challenging problems, and you will have a chance to apply your unique skills and experience while also having the opportunity to explore and apply cutting edge new technologies.

Key Responsibilities
  • Design, implement, and maintain safe continuous rollout of distributed web services, database schemas, dynamic application configuration, and queue-based and offline batch processing systems
  • Design, implement, and maintain runtime caching layers
  • Design, implement, and maintain deployment process and monitoring for global content distribution infrastructure
  • Design, implement, and maintain monitoring and alerting systems across the stack
  • Cultivate a healthy collaborative relationship with development teams to design and build holistically for reliability
  • Detect, triage, and mitigate outages and regressions to minimize impact to end-user experience and revenue
  • Eventually be part of the on‑call team responsible for the first line of defence
  • Design and build systems to automate operations tasks (deployment, monitoring, alerting, canarying, rollback) rather than solve problems manually.
About You
  • Experience operating large scale online software in a production environment
  • Strong general knowledge of distributed system architecture and potential operational risks considerations
  • Familiarity with cloud network/infrastructure efficiency and optimization in the major cloud providers (AWS, GCP, Azure)
  • Strong general knowledge of database system behavior, risks, and operational considerations
  • Experience in concepts/technologies for safe continuous integration/deployment of large scale distributed systems
  • Experience in concepts/technologies for monitoring/alerting on large scale distributed systems
  • Programming expertise in at least one major language
  • Familiarity with best practices for incident response and disaster recovery for large scale distributed systems
Preferred Qualifications
  • Professional experience with any of the following specific technologies:
  • C++ or Rust
  • CockroachDB, Postgres, Redis, Memcached, other database systems
  • Professional online game development/operational experience (especially Unreal Engine)
  • Experience with relevant compliance obligations for Spliced
Why Spliced Inc?

We’re drawn to passion. We’re intrigued by who you are, not just what you do. We want you to feel at home and be excited to create the most career defining work of your life. Here at Spliced Inc, we care about all of this, and more.

Our benefits package is well thought out and you can expect to be rewarded and recognised for your hard work with components that include sign on bonus, a well‑being allowance, and generous pension and healthcare benefits to name but a few.

As a Netease Games studio, we are committed to fostering an inclusive and welcoming environment for everyone and encourage applications from all races, religions, beliefs, ages, sexual orientations, and gender identities. We believe that genuinely diverse teams working in supportive and safe atmospheres, can create magic that touches the hearts and souls of all.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.