Enable job alerts via email!

Site Reliability Engineer - USDS

TikTok

San Jose (CA)

Hybrid

USD 145,000 - 250,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

TikTok recherche un ingénieur de fiabilité de site à San Jose, CA. Ce rôle stimulant couvre le développement, l'automatisation et la collaboration avec des équipes d'ingénierie pour garantir la performance de systèmes à grande échelle. Le candidat idéal aura au moins 3 ans d'expérience dans un poste similaire et un diplôme en informatique, avec une expertise en langages de programmation tels que Python et Go.

Benefits

Medical, dental, and vision insurance
401(k) plan with company match
Paid parental leave
Short-term and long-term disability coverage
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time

Qualifications

  • Minimum 3 ans d'expérience en tant qu'ingénieur de fiabilité de site.
  • Connaissance des architectures réseau et des systèmes distribués à grande échelle.
  • Excellentes compétences en communication et en collaboration.

Responsibilities

  • Développer et maintenir des procédures d'automatisation.
  • Collaborer avec des équipes pour assurer la robustesse fonctionnelle des systèmes.
  • Mettre en œuvre des outils de surveillance pour suivre la santé du système.

Skills

Python
Go
Java
Shell script
Linux operating systems
Problem-solving

Education

Bachelor's degree in Computer Science

Tools

Docker
Kubernetes
Prometheus
Grafana

Job description

Responsibilities
Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you’ll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We encourage close collaboration while promoting self-direction.

In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.

Responsibilities
- Develop and maintain automation procedures to maximize system efficiency and minimize human intervention.
- Work closely with software engineering teams to design, deploy and operate elements to ensure that systems are functionally robust.
- Ensure system scalability to handle growth in web traffic and data.
- Implement monitoring tools and set up metrics to keep track of system health and performance.
- Participate in on-call rotations, assist with incident management, and diagnose, resolve, and prevent production issues.
- Conduct performance tests to find and address system bottlenecks.
- Collaborate with teams across the organization to define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs).
- Practice sustainable user support, incident response, and blameless postmortems.

Qualifications
Minimum Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field with 3+ years of experience
- Proven work experience as a Site Reliability Engineer, Systems Engineer, or similar software engineering role.
- Proficient knowledge of high-level programming languages (e.g. Python, Go, Java, and Shell script).
- Experience in network architecture, database modeling, cloud systems and large-scale distributed systems.
- Strong understanding of Linux operating systems and open-source technologies.

Preferred Qualifications:
- Experience with containers and container orchestration platforms such as Docker, Kubernetes or equivalent.
- Knowledge of monitoring tools and methodologies (such as Prometheus, Grafana).
- Excellent problem-solving skills, strategic thinking, and a strong ability to debug complex systems.
- Exceptional communication skills and the ability to effectively collaborate with cross-functional teams.

About USDS
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (“USDS”) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Data Security Statement
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.


Why Join Us
Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.
We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.


USDS Reasonable Accommodation
USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/USDS-RA


Job Information
【For Pay Transparency】Compensation Description (Annually)
The base salary range for this position in the selected city is $145000 - $250000 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
3. Exercising sound judgment.

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Software Development

Referrals increase your chances of interviewing at TikTok by 2x

Get notified about new Site Reliability Engineer jobs in San Jose, CA.

Sunnyvale, CA $117,000.00-$173,000.00 3 weeks ago

Site Reliability Engineer, AI/ML Platforms

San Jose, CA $133,900.00-$242,000.00 2 weeks ago

Software Engineer, AI Platform - New Grad

Fremont, CA $147,000.00-$208,000.00 3 weeks ago

Menlo Park, CA $117,000.00-$173,000.00 5 hours ago

Menlo Park, CA $147,000.00-$208,000.00 3 weeks ago

Mountain View, CA $125,400.00-$188,100.00 2 weeks ago

New Grads 2025 - General Software Engineer

San Jose, CA $120,000.00-$165,000.00 4 months ago

Sunnyvale, CA $147,000.00-$208,000.00 5 hours ago

Santa Clara, CA $101,000.00-$161,000.00 1 day ago

Senior Software Engineer, AI/ML, YouTube

Sunnyvale, CA $197,000.00-$291,000.00 2 weeks ago

San Jose, CA $133,900.00-$242,000.00 2 weeks ago

Reliability Engineer, Chassis Systems, Semi

Santa Clara, CA $168,000.00-$322,000.00 1 day ago

Site Reliability Engineer - Observability

Palo Alto, CA $146,900.00-$194,610.00 16 hours ago

New Grads 2025 - Software Engineer, Algorithm

San Jose, CA $120,000.00-$165,000.00 9 months ago

Palo Alto, CA $129,300.00-$161,600.00 2 weeks ago

Principal Site Reliability Engineer (Wildfire Cloud Infrastructure)
Senior Site Reliability Engineer - remote
Software Engineer Intern, Site Reliability Engineer
Systems Engineer -High Speed Manufacturing (Battery Technology)

San Jose, CA $121,400.00-$176,300.00 3 days ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Infrastructure Site Reliability Engineer (Entry Level)- USDS

TikTok

Mountain View

Hybrid

USD 118,000 - 177,000

4 days ago
Be an early applicant

Site Reliability Engineer, Recommendation Infrastructure - USDS

TikTok

San Jose

Hybrid

USD 116,000 - 250,000

9 days ago

Site Reliability Engineer (SRE) - USDS

TikTok

San Jose

Hybrid

USD 118,000 - 250,000

22 days ago