Job Search and Career Advice Platform

Enable job alerts via email!

Site Reliability Engineer, Applied Machine Learning Engine (Singapore)

BYTEDANCE PTE. LTD.

Singapore

On-site

SGD 70,000 - 90,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading tech company in Singapore is looking for a software engineer to join their Site Reliability Engineering team. The role involves developing software solutions, troubleshooting distributed systems, and optimizing code. Candidates should hold a Bachelor's degree in Computer Science with at least three years of experience, along with strong programming skills in Python or C/C++. The position offers excellent career growth opportunities and a collaborative work environment.

Benefits

Career growth opportunity
Paid leave
Flat organization
Meal allowance

Qualifications

  • Proven experience in analyzing and troubleshooting distributed systems.
  • Prior experience designing and maintaining large-scale systems.
  • Experience programming in Python or C/C++.

Responsibilities

  • Research, design, and develop computer and network software.
  • Analyze user needs and develop software solutions.
  • Update software and enhance existing capabilities.

Skills

Distributed systems analysis
Troubleshooting
Code optimization
Automation
Machine learning frameworks

Education

Bachelor’s degree in Computer Science

Tools

Python
C/C++
Job description
About Us

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.

Diversity & Inclusion

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Job highlights
  • Career growth opportunity
  • Paid leave
  • Flat organization
  • Meal allowance

ByteDance will be prioritizing applicants who have a current right to work in Singapore, and do not require ByteDance's sponsorship of a visa.

About The Team

Site Reliability Engineering (SRE) of Applied Machine Learning (AML) team combines system engineering and the art of machine learning to develop and run massively distributed recommendation system around the world.

On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis and large system operation, and get heavily involved in the process of hardware/capacity decision-making.

SRE ensures that the very centric machine learning services at ByteDance have the highest level of availability, as well as creating highly automated systems and pipelines.

Responsibilities
  • Research, design, and develop computer and network software or specialised utility programs.
  • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
  • Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.
  • Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Minimum Qualifications
  • Bachelor’s degree in Computer Science or equivalent with 3+ years of relevant experience
  • Proven experience in analyzing and troubleshooting distributed systems.
  • Prior experience designing and maintaining large-scale systems.
  • Experience programming in at least one of the following languages: Python or C/C++.
Preferred Qualifications
  • Ability to thrive in a fast-paced environment.
  • Strong understanding of code optimizing and routine tasks automation.
  • Proficiency in at least one machine learning framework: TensorFlow, PyTorch, MXNet or PaddlePaddle.
  • Solid background of algorithms and data structures.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.