Enable job alerts via email!

Hunyuan LLM Site Reliability Engineer

IMAGE FRAME INVESTMENT (UK) LIMITED

Singapore

On-site

SGD 60,000 - 80,000

Full time

8 days ago

Job summary

A leading technology firm is seeking a Site Reliability Engineer to ensure the stability and efficiency of overseas model services. Candidates should have a Bachelor's degree, at least 2 years of experience in internet operations, and be proficient in Linux, cloud services like AWS, and programming languages such as Python or Go. The role is based in Singapore and offers a full-time position.

Qualifications

  • 2+ years of experience in internet operations and maintenance.
  • Strong sense of work responsibility and good communication skills.
  • Ability to operate in a Chinese-speaking environment.

Responsibilities

  • Ensure stable, reliable, and efficient service operations.
  • Manage capacity and optimize resource costs.
  • Enhance service stability and operational efficiency.
  • Design online systems and service architectures.
  • Analyze existing systems for optimization.
  • Explore automation and intelligence technologies.

Skills

Linux operating system
System management
Nginx
Redis
MySQL
Zabbix
Prometheus
Grafana
Python
Go
Shell
AWS
Azure

Education

Bachelor's degree or above

Tools

Monitoring systems
Cloud operations

Job description

Hunyuan LLM Site Reliability Engineer page is loaded

Hunyuan LLM Site Reliability Engineer
Apply remote type Onsite locations Singapore-CapitaSky Malaysia-Kuala Lumpur time type Full time posted on Posted 12 Days Ago job requisition id R105766
Business Unit
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails

1. Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations;

2. Responsible for capacity management and planning, resource cost optimization, ensuring reasonable online service capacity and improving resource efficiency;

3. Responsible for continuous integration and delivery, efficient and automated operational optimization, enhancing service stability and research and development efficiency;

4. Participate in the design of online systems and various service architectures, providing professional solutions for stability and architecture improvement;

5. Analyze and deeply explore the shortcomings of existing systems, data-driven to find weak points, and promote system optimization implementation and improvement;

6. Pay attention to industry front-end technology trends, explore technologies and directions for automation and intelligence in the operation and maintenance of complex business systems.

Who We Look For

1. Bachelor's degree or above, with 2 years or more experience in internet operations and maintenance;

2. Familiar with Linux operating system, with solid system management and network knowledge;

3. Familiar with deploying, configuring, and tuning components such as Nginx, Redis, MySQL;

4. Proficient in monitoring systems such as Zabbix, Prometheus, Grafana, real-time grasping the running status of overseas systems;

5. Proficient in at least one programming language (such as Python, Go, Shell, etc.), with experience in developing automated operational tools to meet the needs of complex and variable overseas operations and maintenance;

6. Familiar with mainstream public cloud operations and maintenance management overseas (such as AWS, Azure, etc.), with experience in containerization and microservices architecture, able to cope with the characteristics and differences of local cloud services;

7. Strong sense of work responsibility, good communication skills, learning ability, and team spirit;

8. Proficient in English and Chinese, in listening, speaking, reading, and writing, timely writing updated workflow and technical documents as required.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Similar Jobs (2)
Site Reliability Engineer
remote type Onsite locations Singapore-CapitaSky time type Full time posted on Posted 30+ Days Ago
Cloud Engineer
remote type Onsite locations Singapore-CapitaSky time type Full time posted on Posted 30+ Days Ago

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.