Enable job alerts via email!

Site Reliability Engineer (SRE)- Crypto High-Frequency Trading

Oxford Knight

London

On-site

GBP 70,000 - 90,000

Full time

6 days ago
Be an early applicant

Job summary

A leading trading firm in London is seeking a Site Reliability Engineer (SRE) to develop and maintain scalable production tools and enhance system reliability for high-frequency trading platforms. The ideal candidate will possess strong programming skills in Python, have a deep understanding of Linux systems, and experience with AWS and configuration management. This role offers the opportunity to work closely with trading and development teams in a high-pressure, real-time environment, ensuring critical trading systems remain fault-tolerant and efficient.

Qualifications

  • Strong programming skills in Python, ability to read C/C++.
  • Deep understanding of Linux systems.
  • Experience managing deployments in AWS/on-premise clusters.

Responsibilities

  • Develop and maintain production tools for automation.
  • Improve reliability and efficiency through tooling.
  • Work closely with trading teams for operations.

Skills

Python programming
C/C++ understanding
Linux systems
AWS and/or on-premise clusters management
Monitoring solutions
Networking fundamentals
Scripting (Python, Bash)

Tools

Terraform
Ansible

Job description

Site Reliability Engineer (SRE) - Crypto High-Frequency Trading, London

Client:

Oxford Knight

Location:

London, United Kingdom

Job Category:

Other

EU work permit required:

Yes

Job Views:

4

Posted:

25.08.2025

Expiry Date:

09.10.2025

Job Description:
Responsibilities
  • Develop and maintain scalable production tools to automate deployment, monitoring, and infrastructure management.
  • Improve system reliability, performance, and efficiency through automation and tooling.
  • Work closely with trading and development teams to ensure seamless operation of live trading systems.
  • Manage configuration and deployment processes across AWS-based infrastructure.
  • Implement observability tools to enhance system monitoring and debugging capabilities.
  • Ensure fault tolerance, redundancy, and high availability for critical trading systems.
  • Support and enhance infrastructure for both C++- and Rust-based trading systems, ensuring seamless integration.
Required Qualifications
  • Strong programming skills in Python, with the ability to read and understand C/C++ code.
  • Deep understanding of Linux systems.
  • Experience managing deployments and configuration management in AWS and/or on-premise clusters.
  • Proficiency in monitoring, logging, and alerting solutions to maintain high system uptime.
  • Strong background in networking fundamentals, including TCP/IP and system performance tuning.
  • Experience with scripting languages (Python, Bash) for automation.
Preferred Skills
  • Familiarity with IaC tools, such as Terraform or Ansible, for infrastructure automation.
  • Experience in low-latency or high-performance environments is a plus but not required.
  • Strong problem-solving skills and the ability to work in a highly collaborative team.
Soft Skills & Culture Fit
  • Candidates from top-tier institutions or recognized as domain experts are preferred.
  • Strong analytical skills and ability to work in high-pressure, real-time environments.
  • Collaborative team player who enjoys solving complex engineering problems.

Whilst we carefully review all applications, due to the high volume of applications we receive, it is not possible to respond to those who have not been successful.

Contact

If this sounds like you, or you'd like more information, please get in touch:

George Hutchinson-Binks

(+44) 07885 545220

linkedin/in/george-hutchinson-binks-a62a69252

Job ID 3zlnB1qehBV1

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs