Enable job alerts via email!

Site Reliability Engineer - TikTok Recommendation Architecture

TIKTOK PTE. LTD.

Singapore

On-site

SGD 70,000 - 100,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading global tech company in Singapore is seeking an experienced Site Reliability Engineer to optimize operations for its Recommendation System. You will focus on reliability, service stability, and DevOps solutions. Ideal candidates will have a degree in Computer Science and at least 3 years of experience in large-scale systems, along with strong programming skills in languages like Shell, Python, or Go.

Qualifications

  • Bachelor's degree or above in computer science, software engineering, or a related field.
  • Experience with large-scale systems and Linux operations.
  • Good programming experience in Shell, Python, Perl, Go, or C++.

Responsibilities

  • Optimize reliability and operation for large-scale clusters of the TikTok Recommendation System.
  • Ensure efficient delivery of core services and service stability.
  • Collaboration with software engineers to implement DevOps solutions.

Skills

Reliability and operation optimization
Cloud platformization
Programming in Shell/Python/Perl/Go/C++
Linux system operations
Analysis and troubleshooting of distributed systems

Education

Bachelor's degree in computer science or related field
Job description
Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

About The Team

Our Recommendation Architecture Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok users.

On the SRE team of Recommendation Architecture, you'll have the opportunity to sharpen your expertise in coding, performance analysis, large-scale system operation, and get heavily involved in the process of hardware/capacity decision-making.

SRE ensures that the recommendation services at ByteDance have the highest level of availability, as well as creating highly automated systems and pipelines.

Responsibilities
  • Reliability and operation optimization for large-scale clusters of TikTok Recommendation System.
  • Continuous integration and delivery of core services, optimizing the efficiency and automation of operation, and improving service stability and R&D efficiency.
  • Cloud platformization, resource optimization and SLA guarantee for large-scale clusters.
  • Collaboration with software engineer to design and implement DevOps solutions to Improve the efficiency of the entire R&D process.
  • Research, design, and develop computer and network software or specialised utility programs.
  • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
  • Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.
  • Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Qualifications
  • Bachelor's degree or above in computer science, software engineering, or a related field
  • Operation experience of large-scale systems, familiar with system operation skills on Linux and network.
  • Good programming experience with at least one of the following languages: Shell/Python/Perl/Go/C++.
  • Expertise in analyzing, and troubleshooting large-scale distributed systems.
  • At least 3 years of relevant experience.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.