Enable job alerts via email!

Site Reliability Engineer, Traffic Platform

TN United Kingdom

London

On-site

GBP 50,000 - 90,000

Full time

24 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Site Reliability Engineer to enhance their global traffic platform. This role involves managing large-scale systems in both public and private clouds, ensuring reliability and performance. You will build tools and automations to optimize operations while working in a dynamic environment. Ideal candidates will have a strong background in Linux systems and programming languages such as Go and Python. Join a forward-thinking team that values problem-solving and creativity, and make a significant impact on cutting-edge infrastructure services.

Qualifications

3+ years experience with Linux systems and programming languages like Go and Python.
Master's degree or Bachelor's with 3+ years in relevant fields.

Responsibilities

Build and operate TikTok's global traffic platform in cloud and edge data centers.
Develop tools and automations for optimizing traffic services.

Skills

Linux Systems

Python

Shell Script

Analytical Skills

Education

Master’s Degree in Computer Engineering

Bachelor’s Degree in Computer Science

Tools

GIT

Docker

Kubernetes

AWS

Google Cloud

Azure

Social network you want to login/join with:

Site Reliability Engineer, Traffic Platform, London

Client:

TikTok

Location:

London, United Kingdom

Job Category:

EU work permit required:

Yes

Job Reference:

0c97a198dcc4

Job Views:

Posted:

13.04.2025

Expiry Date:

28.05.2025

Job Description:

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked to ensure the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, global load balancer that handles Tbps of traffic.

Responsibilities:

Build, expand and operate TikTok’s global traffic platform, including large-scale systems in public and private clouds, edge data centers.
Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.
Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
Help improve the whole lifecycle of infrastructure services from inception and design throughout development, to deployment, user support and refinement.

Minimum Qualifications:

Master’s degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science or related major.
3+ years experience working with Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
3+ years experience in one or more programming languages such as Go, Python and Shell script.
Familiar with Cloud and CI/CD framework/Tools, such as GIT, Docker, Kubernetes, etc.
Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
Strong analytical skills and the ability to solve real world problems in a fast moving environment.

Preferred Qualifications:

Experience in designing, analyzing and building automation and tools for large scale systems.
Experience in building solutions with AWS, Google, Azure and other cloud services.
Experience in networking technologies such as TCP/IP, HTTP, DNS, etc. in a carrier-grade environment.
Experience in developing and operating one or more of the following systems: Kubernetes, Nginx, ipvs, ELK stack, etc.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Future Talent Group

Remote

GBP 60,000 - 95,000

7 days ago

Be an early applicant

Site Reliability Engineer, Traffic Platform

TN United Kingdom

London

On-site

GBP 50,000 - 90,000

Full time

Job summary

Qualifications

Responsibilities

Skills

Education

Tools

Job description

Similar jobs

Senior Site Reliability Engineer

Greater London

Remote

GBP 60,000 - 100,000

Platform Engineer - Fully Remote

London

Remote

GBP 60,000 - 100,000

Remote Site Reliability Engineer

London

Remote

GBP 60,000 - 100,000

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Greater London

Remote

GBP 50,000 - 90,000