Job Search and Career Advice Platform

Enable job alerts via email!

Lead Site Reliability Engineer | Copperleaf

IFS

Staines-upon-Thames

Hybrid

GBP 70,000 - 90,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading SaaS solutions provider in Staines-upon-Thames is seeking a Lead Site Reliability Engineer specializing in Azure. This role involves designing and optimizing infrastructure for high-availability SaaS solutions, mentoring team members, and leading incident responses. The ideal candidate will have extensive experience with Azure services and strong automation skills. Join as part of a diverse team aiming to make a positive impact in the tech world.

Benefits

Hybrid work opportunities
Inclusive workplace experiences

Qualifications

  • 5+ years in SRE, Cloud Operations, or DevOps, with 3 years on Azure.
  • Deep expertise in Azure services including monitoring and security.
  • Strong scripting skills in PowerShell, Python, or Bash.

Responsibilities

  • Lead design and improvement of Azure infrastructure for SaaS services.
  • Automate deployment pipelines using Azure DevOps and Terraform.
  • Drive root cause analysis and resolution of production incidents.

Skills

Azure services
Automation and scripting skills
Incident response leadership
Excellent communication

Tools

Terraform
Azure DevOps
Kubernetes
Azure Monitor
Job description

IFS is a billion-dollar revenue company with 7000+ employees on all continents. Our leading AI technology is the backbone of our award-winning enterprise software solutions, enabling our customers to be their best when it really matters–at the Moment of Service™. Our commitment to internal AI adoption has allowed us to stay at the forefront of technological advancements, ensuring our colleagues can unlock their creativity and productivity, and our solutions are always cutting-edge.

Copperleaf is the world’s leading AI powered Asset Investment Planning (AIP) solution, enabling organizations to make better decisions – faster, smarter and with more confidence.

Together Copperleaf and IFS offer the first end-to-end asset lifecycle management solution. Underpinned by Industrial AI, the combining of Copperleaf and IFS will allow our asset intensive customers to deliver on their moment of service through strategic allocation and execution of CAPEX and OPEX; balancing expenditure, business objectives, risk and optimal asset performance.

At IFS, we’re flexible, we’re innovative, and we’re focused not only on how we can engage with our customers but on how we can make a real change and have a worldwide impact. We help solve some of society’s greatest challenges, fostering a better future through our agility, collaboration, and trust.

We celebrate diversity and understand our responsibility to reflect the diverse world we work in. We are committed to promoting an inclusive workforce that fully represents the many different cultures, backgrounds, and viewpoints of our customers, our partners, and our communities. As a truly international company serving people from around the globe, we realize that our success is tantamount to the respect we have for those different points of view.

By joining our team, you will have the opportunity to be part of a global, diverse environment; you will be joining a winning team with a commitment to sustainability; and a company where we get things done so that you can make a positive impact on the world.

We’re looking for innovative and original thinkers to work in an environment where you can #MakeYourMoment so that we can help others make theirs. With the power of our AI-driven solutions, we empower our team to change the status quo and make a real difference.

If you want to change the status quo, we’ll help you make your moment. Join Team Purple. Join IFS.

Job Description

CopperleafIFS ’ssoftware helps some of the world’s largest energy firms make better strategic decisions.

Our Cloud Operations Team, a crucial component of our Software as a Service (SaaS) offering, also delivers Infrastructure as a Service (IaaS) to IFS Copperleaf. Built on the foundation of Site Reliability Engineering, we are expanding. Our commitment is to the reliability and uptime of our services, and we consistently aim to automate processes and minimize manual labor. We are currently seeking a mid senior level cloud engineer to contribute to these services and assist in enhancing the operational aspects of each service.

As a Lead Site Reliability Engineer (SRE) specializing in Azure, you will play a pivotal role in architecting, operating, and optimizing our cloud infrastructure. You will lead initiatives to ensure the reliability, scalability, and security of our Azure-based SaaS offerings. You’ll mentor junior engineers, drive automation, and partner with development teams to deliver robust, high‑availability solutions.

Key Responsibilities
  • Lead the design, implementation, and continuous improvement of Azure-based infrastructure for high‑availability, mission‑critical SaaS services.
  • Architect and automate deployment pipelines using Azure DevOps, ARM/Bicep, Terraform, and related tools.
  • Own and enhance monitoring, alerting, and incident response for Azure resources (App Services, AKS, SQL, Storage, Networking, etc.).
  • Drive root cause analysis and resolution of complex production incidents, collaborating across teams.
  • Define and enforce SLOs, SLIs, and SLAs for Azure‑hosted SaaS services.
  • Champion security best practices, including identity, access, secrets, and certificate management in Azure.
  • Mentor and coach junior SREs and CloudOps engineers.
  • Partner with development teams to embed reliability and operational excellence into the SDLC.
  • Evaluate and implement new Azure features and services to improve reliability, performance, and cost efficiency.
  • Document architecture, runbooks, and operational procedures for Azure environments.
Qualified
Required Qualifications
  • 5+ years’ experience in SRE, Cloud Operations, or DevOps roles, with at least 3 years focused on Microsoft Azure.
  • Deep expertise in Azure services (App Services, AKS, Azure SQL, Storage, Networking, Security Center, Monitor, etc.).
  • Strong automation and scripting skills (PowerShell, Python, Bash, or similar).
  • Proven experience with Infrastructure as Code (Terraform, ARM/Bicep).
  • Advanced troubleshooting of distributed systems, networking, and application performance in Azure.
  • Solid understanding of microservices, container orchestration (Kubernetes/AKS), and CI/CD pipelines.
  • Experience with monitoring, logging, and observability tools (Azure Monitor, Log Analytics, Application Insights).
  • Strong grasp of security protocols, certificate and secret management, and compliance in Azure.
  • Demonstrated ability to lead incident response and post‑mortem analysis.
  • Excellent communication skills and a passion for mentoring others.
Preferred Qualifications
  • Azure certifications (e.g., Azure Solutions Architect, Azure DevOps Engineer).
  • Experience with hybrid or multi‑cloud environments, including AWS.
  • Familiarity with cost management and optimization in Azure.
  • Experience supporting large‑scale SaaS platforms.
Additional Information

We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles, while also valuing inclusive workplace experiences. By fostering a sense of community, we drive innovation, strengthen connections, and nurture belonging. Our commitment ensures you can work in a way that suits you best, while also engaging with colleagues to share ideas and build meaningful relationships.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.