Enable job alerts via email!

Site Reliability Engineer (Devops)

Experis - ManpowerGroup

London

Hybrid

GBP 50,000 - 90,000

Full time

18 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An exceptional opportunity awaits as a Site Reliability Engineer with a leading gaming studio. This role offers the chance to leverage your expertise in automation and engineering principles to enhance operational efficiency and collaboration across teams. You will work on a scaling platform, ensuring the stability and availability of workloads while minimizing downtime. Your contributions will be pivotal in reinventing operational processes and improving development lifecycles. Join a dynamic environment where your skills will make a significant impact on the success of innovative gaming solutions. If you are passionate about technology and thrive in a collaborative setting, this role is perfect for you.

Qualifications

  • Expertise in managing large-scale distributed server systems in Azure.
  • Experience with CI/CD pipelines and automation processes.

Responsibilities

  • Minimize downtime and ensure platform stability and scalability.
  • Support release processes through stable and automated pipelines.

Skills

PowerShell
C#
Azure
CI/CD (Octopus Deploy/Azure DevOps/TeamCity)
Incident Management Process (ITSM)

Job description

Site Reliability Engineer - DevOps Engineer

18 Month Contract PAYE - Fully Remote / or Hybrid based in Midlands if preferred.

The role

We are working with one of the finest gaming studios in the industry and are on the lookout for an exceptional Site Reliability Engineer who can bring their expertise and unique thinking to help make their team even stronger!

As an SRE, the main purpose is solving for scale through collaboration and automation, bringing engineering principles to infrastructure and operational problems.

Work closely with the different teams to help improve manual tasks, operational processes, lower complexities & risks, break down team silos through improved communication, and really get involved with them to reinvent how they work to help them succeed.

Work with a scaling platform, maintaining its programmable infrastructure and maximizing the availability of the workloads that run on it, both at a live production & deliverable lifecycle level.

Duties

  1. Minimizing downtime to products & services
  2. Ensuring the platform is stable, scalable, and completely automated
  3. Helping to improve and shorten development/process lifecycles
  4. Applying effective monitoring & alerting in place
  5. Supporting release through stable and automated pipeline processes

Skills

  1. Knowledge of languages such as PowerShell, C#
  2. Managed/implemented large scale distributed server systems within Azure
  3. Worked on modern release pipelines - CI/CD (Octopus Deploy/Azure DevOps/TeamCity)
  4. Knowledge of Azure monitoring, alerting, message queues
  5. Understand or worked within an Incident Management Process (ITSM)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.