Enable job alerts via email!

Senior Site Reliability Engineer

Convera

Peterborough

On-site

GBP 50,000 - 90,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Senior Site Reliability Engineer to enhance system stability and resilience. In this pivotal role, you will lead incident management strategies and drive automation to minimize risks. Your expertise in cloud-native systems, particularly within Amazon Web Services, will be crucial in implementing CI/CD pipelines and monitoring service quality metrics. Join a dynamic team that values innovation and diversity, and contribute to creating smarter money movements for a global clientele. This is an exciting opportunity to grow your career in a supportive environment that prioritizes inclusion and professional development.

Benefits

Career growth opportunities
Flexible work approach
Generous insurance (health, disability, life)
Paid holidays and time-off
Paid volunteering opportunities

Qualifications

  • Extensive experience with cloud-native systems and incident management.
  • Proficiency in programming languages like Python, Go, or Rust.

Responsibilities

  • Lead incident management and reduce Mean Time to Resolution (MTTR).
  • Champion an anti-fragility mindset and improve service reliability.

Skills

Incident Management
Cloud-native Systems
Python
Go
Rust
CI/CD Pipelines
Service Level Objectives (SLOs)
Linux
Cloud Networking
Microservices Architecture

Tools

Amazon Web Services (AWS)
Amazon EKS
Grafana

Job description

As a Senior Site Reliability Engineer at Convera, your role is pivotal in ensuring the stability and resilience of our systems. You'll spearhead our incident management strategy, swiftly identifying and mitigating risks to uphold our service reliability.

You will be responsible for:

  • Taking the lead on incident management, orchestrating responses to swiftly identify, mitigate, and minimize risks.
  • Proactively reducing Mean Time to Resolution (MTTR), constantly striving for efficiency gains.
  • Championing an anti-fragility mindset across our architecture, deployment processes, and observability practices.
  • Elevating the customer experience as the ultimate benchmark of our reliability standards.
  • Sharing industry best practices in SRE, ensuring our team remains at the forefront of innovation.
  • Facilitating blameless post-mortems, instituting actionable alerts, and streamlining incident management through automation.

You should apply if you have:

  • Extensive experience navigating complex, multi-region, cloud-native systems within Amazon Web Services.
  • Demonstrable proficiency in modern programming languages such as Python, Go, or Rust.
  • A track record of implementing global, multi-regional Continuous Integration/Continuous Deployment (CI/CD) pipelines, conducting Production Readiness Reviews, and driving automation to eliminate toil.
  • Expertise in defining and monitoring service quality metrics (such as RED, Golden Signals), establishing microservice Service Level Objectives (SLOs), and managing error budgets.
  • Proficiency in Linux, cloud networking, microservices architecture, and Amazon EKS.

Preferred qualifications include:

  • Prior involvement in the Fintech sector or other regulated industries.
  • Familiarity with the Grafana observability stack.
  • Experience in Chaos Engineering methodologies.

About Convera

Convera is the largest non-bank B2B cross-border payments company in the world. Formerly Western Union Business Solutions, we leverage decades of industry expertise and technology-led payment solutions to deliver smarter money movements to our customers – helping them capture more value with every transaction. Convera serves more than 30,000 customers ranging from small business owners to enterprise treasurers to educational institutions to financial institutions to law firms to NGOs.
Our teams care deeply about the value we bring to our customers which makes Convera a rewarding place to work. This is an exciting time for our organization as we build our team with growth-minded, results-oriented people who are looking to move fast in an innovative environment.
As a truly global company with employees in over 20 countries, we are passionate about diversity; we seek and celebrate people from different backgrounds, lifestyles, and unique points of view. We want to work with the best people and ensure we foster a culture of inclusion and belonging.

We offer an abundance of competitive perks and benefits including:
  • Great career growth and development opportunities in a global organization
  • A flexible approach to work
  • Generous insurance (health, disability, life)
  • Paid holidays, time-off, and leave policies for life events (maternity, paternity, adoption)
  • Paid volunteering opportunities (5 days per year)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

JR United Kingdom

England

Remote

GBP 70,000 - 85,000

Today
Be an early applicant

Senior Site Reliability Engineer

NinjaOne

London

Remote

GBP 70,000 - 100,000

Today
Be an early applicant

Senior Site Reliability Engineer London, United Kingdom

NinjaOne, LLC

London

Remote

GBP 70,000 - 100,000

Today
Be an early applicant

Senior Site Reliability Engineer

General Motors

Remote

GBP 60,000 - 90,000

3 days ago
Be an early applicant

Senior Site Reliability Engineer UK - Remote

StarRez, Inc.

Remote

GBP 60,000 - 80,000

10 days ago

Senior Reliability Engineer

Lumentum

Towcester

On-site

GBP 40,000 - 60,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

Auros

Greater London

Remote

GBP 60,000 - 100,000

23 days ago

Senior Site Reliability Engineer

TN United Kingdom

Remote

GBP 60,000 - 100,000

26 days ago

Senior Site Reliability Engineer

Blip

Remote

GBP 80,000 - 100,000

30+ days ago