Enable job alerts via email!

Site Reliability Engineer, Traffic Platform

TN United Kingdom

London

On-site

GBP 50,000 - 90,000

Full time

24 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Site Reliability Engineer to enhance their global traffic platform. This role involves managing large-scale systems in both public and private clouds, ensuring reliability and performance. You will build tools and automations to optimize operations while working in a dynamic environment. Ideal candidates will have a strong background in Linux systems and programming languages such as Go and Python. Join a forward-thinking team that values problem-solving and creativity, and make a significant impact on cutting-edge infrastructure services.

Qualifications

  • 3+ years experience with Linux systems and programming languages like Go and Python.
  • Master's degree or Bachelor's with 3+ years in relevant fields.

Responsibilities

  • Build and operate TikTok's global traffic platform in cloud and edge data centers.
  • Develop tools and automations for optimizing traffic services.

Skills

Linux Systems
Go
Python
Shell Script
Analytical Skills

Education

Master’s Degree in Computer Engineering
Bachelor’s Degree in Computer Science

Tools

GIT
Docker
Kubernetes
AWS
Google Cloud
Azure

Job description

Social network you want to login/join with:

Site Reliability Engineer, Traffic Platform, London
Client:

TikTok

Location:

London, United Kingdom

Job Category:

-

EU work permit required:

Yes

Job Reference:

0c97a198dcc4

Job Views:

4

Posted:

13.04.2025

Expiry Date:

28.05.2025

Job Description:

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked to ensure the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, global load balancer that handles Tbps of traffic.

Responsibilities:
  1. Build, expand and operate TikTok’s global traffic platform, including large-scale systems in public and private clouds, edge data centers.
  2. Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.
  3. Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
  4. Help improve the whole lifecycle of infrastructure services from inception and design throughout development, to deployment, user support and refinement.
Minimum Qualifications:
  1. Master’s degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science or related major.
  2. 3+ years experience working with Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
  3. 3+ years experience in one or more programming languages such as Go, Python and Shell script.
  4. Familiar with Cloud and CI/CD framework/Tools, such as GIT, Docker, Kubernetes, etc.
  5. Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
  6. Strong analytical skills and the ability to solve real world problems in a fast moving environment.
Preferred Qualifications:
  1. Experience in designing, analyzing and building automation and tools for large scale systems.
  2. Experience in building solutions with AWS, Google, Azure and other cloud services.
  3. Experience in networking technologies such as TCP/IP, HTTP, DNS, etc. in a carrier-grade environment.
  4. Experience in developing and operating one or more of the following systems: Kubernetes, Nginx, ipvs, ELK stack, etc.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

Auros

Greater London

Remote

GBP 60,000 - 100,000

7 days ago
Be an early applicant

Platform Engineer - Fully Remote

JR United Kingdom

London

Remote

GBP 60,000 - 100,000

Yesterday
Be an early applicant

Remote Site Reliability Engineer

TN United Kingdom

London

Remote

GBP 60,000 - 100,000

10 days ago

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Future Talent Group

Greater London

Remote

GBP 50,000 - 90,000

10 days ago

Site Reliability Engineer

Bentley Whitaker Search and Selection

London

Remote

GBP 55,000 - 70,000

2 days ago
Be an early applicant

Site Reliability Engineer

Eligo Recruitment

Greater London

Remote

GBP 80,000 - 95,000

5 days ago
Be an early applicant

Lead Platform Architect (m/f/d)-AI

TN United Kingdom

Greater London

Remote

GBP 70,000 - 110,000

Yesterday
Be an early applicant

Reposted Senior engineer, Platform Infrastructure

CoinDesk

Greater London

Remote

GBP 60,000 - 100,000

Today
Be an early applicant

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

JR United Kingdom

London

Remote

GBP 60,000 - 95,000

7 days ago
Be an early applicant