Enable job alerts via email!

Public Cloud Service Availability Lead

Lloyds Banking Group

London, Leeds

Hybrid

GBP 97,000 - 115,000

Full time

15 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading financial services organization is seeking a Public Cloud Service Availability Lead to enhance its cloud services. This role involves ensuring resilience and recovery of services across Microsoft Azure and Google Cloud Platform, while also managing incidents and risks effectively. The ideal candidate will have strong leadership skills and experience in multi-cloud environments, contributing to the continuous improvement of service availability.

Benefits

Generous pension contribution of up to 15%
Annual performance-related bonus
Share schemes including free shares
Discounted shopping benefits
30 days’ holiday plus bank holidays
Wellbeing initiatives and generous parental leave policies

Qualifications

  • Proven leadership in incident and problem management across complex, multi-cloud environments.
  • Direct experience with Azure and GCP environments.
  • Strong capability in influencing third-party CSPs and vendor partners.

Responsibilities

  • Manage the end-to-end availability and incident recovery strategy for Public Cloud products.
  • Drive proactive problem management using incident analytics and service monitoring.
  • Engage with stakeholders at all levels, providing updates on incident status and service health.

Skills

Leadership in incident management
Understanding of SLIs, SLOs, and SLAs
Skilled communicator
Experience with Azure and GCP
Knowledge of IT risk frameworks
Expertise in Power BI and ServiceNow

Job description

JOB TITLE: Public Cloud Service Availability Lead

SALARY:£97,665 - £114,900 in London, or £83,411 - £98,130 outside of London

LOCATION(S):London, Halifax, Leeds

HOURS:Full time

WORKING PATTERN:Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at our locations noted above.

About this Opportunity:

We are a well-established Cloud Platform, who are modernising our next generation technical platform for the bank that continues to be at the core of one of the UK's biggest financial service transformations. Our core technology focus is on Microsoft Azure and Google Cloud Platform.

The Cloud Service Availability Lead is accountable for leading the resilience, recovery, and continuous improvement of Lloyds Banking Group’s Public Cloud services across Microsoft Azure and Google Cloud Platform. The role ensures that robust incident management, problem management and risk governance practices are embedded, with a clear focus on minimising customer impact, reducing service risks, and driving proactive service improvement.

Working closely with Site Reliability Engineers, Technical Recovery Managers, product owners, Cloud Service Providers and risk partners, the postholder will lead the delivery of recovery excellence, embedding resilience at the heart of Public Cloud service operations and ensuring alignment to regulatory expectations.

Key Responsibilities

  • Proactively manage the end-to-end availability and incident recovery strategy for Public Cloud products and services, ensuring efficient execution of incident, problem, and risk management processes.

  • Drive proactive problem management by leveraging incident analytics, service monitoring, and trend identification to mitigate risks before they impact service availability.

  • You will ensure continuous visibility of service health through proactive monitoring and actionable MI, enabling early risk identification and preventative action.

  • With strong communication skills, you will engage confidently with stakeholders at all levels, including executive leadership, providing transparent updates on incident status, risk posture, and service health.

What you’ll need

  • Proven leadership in incident and problem management across complex, multi-cloud environments.

  • Strong understanding of SLIs, SLOs, and SLAs, with the ability to drive action through data insights and performance metrics.

  • Skilled communicator able to manage high-pressure incidents and maintain clear, effective updates to senior leadership and key partners.

  • Demonstrated ability to lead global, cross-functional teams through major incidents, ensuring effective recovery and resolution.

  • Direct experience with Azure and GCP environments.

  • Familiarity with SRE principles, service resilience methodologies and recovery automation.

  • Strong capability in influencing third-party CSPs and vendor partners for timely escalation and issue resolution.

  • Experience managing service recovery as a technical recovery manager, including out-of-hours coverage

  • Deep knowledge of IT risk frameworks (ITIL, COBIT), compliance processes, and regulatory engagement

  • Expertise in Power BI, ServiceNow, and other service reporting and monitoring tools to provide actionable MI and track service recovery

  • Experience with leading risk reduction programs, root cause analysis, and service improvement initiatives within regulated industries

About working for us

Our focus is to ensure we're inclusive every day, building an organisation that reflects modern society and celebrates diversity in all its forms. We want our people to feel that they belong and can be their best, regardless of background, identity, or culture. We were one of the first major organisations to set goals on diversity in senior roles, create a menopause health package, and a dedicated Working with Cancer initiative. And it’s why we especially welcome applications from under-represented groups. We’re disability confident. So, if you’d like reasonable adjustments to be made to our recruitment processes, just let us know

We also offer a wide-ranging benefits package, which includes

  • A generous pension contribution of up to 15%

  • An annual performance-related bonus

  • Share schemes including free shares.

  • Benefits you can adapt to your lifestyle, such as discounted shopping.

  • 30 days’ holiday, with bank holidays on top

  • A range of wellbeing initiatives and generous parental leave policies

Want to do amazing work, that’s interesting and makes a difference to millions of people? Join our journey.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Public Cloud Service Availability Lead

Lloyds Banking Group

London

Hybrid

GBP 83,000 - 99,000

14 days ago