Senior Site Reliability Engineer - Applied Machine Learning

ByteDance

San Jose (CA)

On-site

USD 194,000 - 410,000

Full time

6 days ago

Job summary

ByteDance is looking for a Senior Site Reliability Engineer for its Applied Machine Learning (AML) team. The role involves working with the AML team to develop and run AI/recommendation systems and to keep machine learning services highly available. The ideal candidate has strong expertise in distributed systems and programming and will contribute to large-scale system operations. ByteDance is committed to innovation and to creating an inclusive workplace.

Benefits

Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Wellbeing benefits
Paid holidays and sick days

Qualifications

  • Expertise in analyzing and troubleshooting distributed systems.
  • Solid background in algorithms and data structures.
  • Experience programming in Python, C/C++, or Go.

Responsibilities

  • Develop and run AI/recommendation systems.
  • Ensure high availability of machine learning services.
  • Take part in hardware and capacity decision-making.

Skills

Analyzing distributed systems
Troubleshooting
Performance analysis
Automation

Education

Bachelor's or Master's degree in Computer Science

Job description

Senior Site Reliability Engineer - Applied Machine Learning

Responsibilities
The mission of our AML team is to advance next-generation recommendation algorithms and platforms for the company. We also drive substantial impact for the company's core businesses. We are currently looking for Site Reliability Engineers to join our team to support and advance that mission.

What You'll Do
The Site Reliability Engineering (SRE) group of the Applied Machine Learning (AML) team combines systems engineering and the art of machine learning to develop and run massively distributed AI/recommendation systems around the world.

On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis and large system operation, and get heavily involved in the process of hardware/capacity decision-making.

SRE ensures that the most central machine learning services at ByteDance have the highest level of availability, and builds highly automated systems and pipelines.

Qualifications
Minimum Qualifications:
1. Expertise in analyzing and troubleshooting distributed systems.
2. Bachelor's or Master's degree in Computer Science or a related technical field involving software development or systems engineering.
3. Experience programming in at least one of the following languages: Python, C/C++ or Go.
4. Solid background in algorithms and data structures.

Preferred Qualifications:
1. Ability to design and maintain large-scale systems.
2. Strong understanding of code optimization and automation of routine tasks.
3. SRE experience with large-scale distributed systems.

About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.


Why Join ByteDance
Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.
As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.


Reasonable Accommodation
ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request


Job Information
[For Pay Transparency] Compensation Description (Annually)
The base salary range for this position in the selected city is $194,000 - $410,000 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country of the work location. Employees have day-one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, and wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year, and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
3. Exercising sound judgment.

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Technology, Information and Internet and Software Development

