Lead Site Reliability Engineer IT · London

Cynergy Bank Limited

London

Hybrid

GBP 60,000 - 90,000

Full time


Job summary

Cynergy Bank Limited is seeking a Lead Site Reliability Engineer to join their innovative team supporting cloud infrastructure. This role involves optimizing system reliability and performance, managing incidents, and leading automation efforts in a hybrid working environment. A competitive salary and comprehensive benefits package are offered.

Benefits

Competitive Salary and Company Bonus
Competitive holiday allowance plus bank holidays
Option to purchase an additional 10 days holiday
Pension contribution and Life Assurance
Income Protection Scheme and Season Ticket Loan
Medical Cover (After Probation)
Electric Car Scheme and Money Coach (After Probation)

Qualifications

  • 5 years’ experience in cloud engineering and support roles.
  • Experience with CI/CD pipelines and automation tools.
  • Strong expertise in scripting using Python or Bash.

Responsibilities

  • Monitor and maintain the performance of production systems.
  • Lead response to critical incidents and conduct root cause analyses.
  • Develop automation tools for deployment and infrastructure management.

Skills

Cloud engineering
Monitoring and observability tools
Scripting and automation
Infrastructure as code
Container orchestration
CI/CD pipelines
Linux/Unix systems

Education

Bachelor’s degree in Computer Science

Tools

Terraform
Kubernetes
Docker
Prometheus
Grafana
Splunk
Datadog

Job description

Application Deadline: Tuesday 10th June

Hybrid Working Pattern: 3 days in the office and 2 days working from home

About us

Cynergy Bank is the UK’s human digital bank, serving the needs of ‘scale-up’ or medium-sized and fast-growing SMEs, professionals, and high net worth and mass affluent individuals: in essence, those market segments that still value human service enabled by great technology.

We recognise that professional and personal lives often overlap and our mission is to help empower our customers to achieve their ambitions by serving all their interdependent banking needs. We provide a comprehensive range of digitally enabled products and services to meet the property finance, business and commercial banking, private banking and personal savings needs of our customers.

Our human and digital model transforms banking for customers who still value a face-to-face relationship that is enabled by the latest digital technology.

We partner with firms such as Google Cloud, Cigniti and Slalom as we continue to innovate in the human digital space.

Cynergy Bank plc is authorised by the Prudential Regulation Authority and regulated by the Financial Conduct Authority and the Prudential Regulation Authority. Eligible deposits with Cynergy Bank plc are protected by the UK Financial Services Compensation Scheme.

For more information on Cynergy Bank visit www.cynergybank.co.uk

The Role:

You will be part of a Cloud Infrastructure Support team carrying out hybrid DevOps/SRE/Business App support.

The Lead Site Reliability Engineer (SRE) is responsible for ensuring the reliability, availability, and performance of the bank’s critical systems and infrastructure. The SRE focuses on building & supporting scalable solutions to improve system resiliency, automating repetitive operational tasks, and collaborating with engineering teams to enhance system reliability. This role balances operational responsibilities with engineering innovation to align with the bank’s strategic goals of delivering seamless and secure services.

Responsibilities:

1. Leadership
• Coach, mentor and support team members.

2. Reliability and Performance

• Monitor and maintain the reliability, uptime, and performance of production systems and services.
• Design and implement tools and frameworks to proactively identify and mitigate potential issues.
• Conduct performance tuning and capacity planning to ensure systems scale with the bank’s needs.

3. Incident Management and Root Cause Analysis

• Lead the response to critical incidents, ensuring swift resolution to minimize business impact.
• Support Application and Business teams triaging incidents and problems.
• Conduct detailed root cause analyses to identify and resolve underlying issues.
• Collaborate with Engineering and IT Operations teams to implement preventive measures.

4. Automation and Efficiency

• Develop and maintain automation tools for deployment, monitoring, and infrastructure management.
• Automate repetitive operational tasks to improve team efficiency and reduce errors.
• Implement CI/CD pipelines to ensure fast, reliable, and secure code deployments.

5. Collaboration and Stakeholder Engagement

• Work closely with Engineering, IT Operations, and Change Management teams to support service delivery goals.
• Collaborate with Information Security to ensure systems meet security and compliance standards.
• Partner with Architecture to ensure reliability is built into system design.

6. Continuous Improvement and Innovation

• Identify and drive initiatives to improve system resiliency, reduce downtime, and enhance performance.
• Stay up-to-date with industry trends, tools, and best practices for site reliability engineering.
• Develop and document operational processes to ensure consistency and knowledge sharing.

Essential Knowledge & Experience:

  • 5 years of experience in Cloud (private/public deployments) engineering and support roles.
  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Splunk, Datadog).
  • Strong expertise in scripting and automation using Python, Bash, or similar languages.
  • Proficiency with infrastructure as code (e.g., Terraform, Ansible) and container orchestration tools (e.g., Kubernetes, Docker).
  • Experience in building and managing CI/CD pipelines.
  • Strong knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud) and Linux/Unix systems.

Desirable Knowledge & Experience:

  • Experience in the banking or financial services industry.
  • Knowledge of security standards and regulatory compliance (e.g., ISO 27001, GDPR).
  • Familiarity with disaster recovery and business continuity planning.
  • Understanding of database performance tuning and optimization.
  • Good working knowledge of ITIL incident, problem and change management.