Principal Site Reliability Engineer

ZipRecruiter

London

On-site

GBP 70,000 - 120,000

Full time

2 days ago

Job summary

A leading high-frequency trading firm seeks an experienced Site Reliability Engineer (SRE) to develop tools and infrastructure that ensure scalability and performance in their trading platform. This in-office role in London offers the chance to work with top engineers on cutting-edge solutions in a fast-paced environment.

Qualifications

  • Proficiency in Python and ability to read C/C++.
  • Experience with AWS infrastructure and on-premise management.
  • Strong understanding of networking fundamentals and system performance.

Responsibilities

  • Design and develop scalable production tools for deployment and monitoring.
  • Ensure the reliability and efficiency of trading systems.
  • Collaborate with developers to support the live trading environment.

Skills

Python
AWS Infrastructure
Linux system internals
Monitoring and logging systems
Networking fundamentals (TCP/IP)

Education

Top-tier engineering program

Tools

Terraform
Ansible
Bash

Job description

Job Title: Site Reliability Engineer (SRE) – High-Frequency Trading Infrastructure

Location: Onsite – New York City, London, or Singapore

Our client, a leading high-frequency trading firm, is seeking a Site Reliability Engineer (SRE) to architect and build next-generation production tools and infrastructure for their ultra-low-latency trading platform. This is a high-impact, mission-critical role focused on reliability, scalability, and performance in one of the most competitive and technologically advanced industries.

About the Role

This opportunity is ideal for an experienced SRE who thrives in production-critical environments. The successful candidate will join a high-caliber team of engineers and work on automating, scaling, and securing systems that drive global trading operations.

Key Responsibilities

  • Design and develop scalable production tools for deployment, monitoring, and infrastructure automation.
  • Ensure the reliability and efficiency of trading systems through proactive automation and tooling.
  • Collaborate with developers and traders to support the live trading environment.
  • Manage and optimize configuration and deployment pipelines across AWS and on-premise infrastructure.
  • Implement observability and monitoring systems to enable rapid detection and resolution of issues.
  • Enhance fault tolerance and high availability for mission-critical systems.
  • Provide infrastructure support for both C++ and Rust-based trading platforms.

Core Qualifications

  • Strong programming skills in Python, with the ability to read and understand C/C++.
  • Deep expertise in Linux system internals and performance tuning.
  • Proven experience with AWS infrastructure and/or on-premise cluster management.
  • Hands-on knowledge of monitoring, logging, and alerting systems for production environments.
  • Solid understanding of networking fundamentals (including TCP/IP) and systems performance.

Experience

  • Familiarity with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
  • Background in low-latency or high-performance computing environments.
  • Proficiency with scripting languages such as Bash for automation tasks.

What Sets You Apart

  • Track record of excellence from a top-tier engineering program or recognized domain expertise.
  • Demonstrated ability to perform in fast-paced, production-critical settings.
  • Strong communication skills and a collaborative mindset to work closely with infrastructure, trading, and development teams.

Why This Role?

This is a rare opportunity to be at the forefront of infrastructure innovation within a high-frequency trading environment. You’ll work alongside some of the brightest minds in the field, delivering systems that operate at massive scale and speed. If you’re passionate about building high-performance infrastructure and solving complex engineering challenges, this role is for you.

Location requirement: This is a full-time, in-office position with openings in New York City, London, and Singapore.
