Enable job alerts via email!
A leading technology firm is seeking a Site Reliability Engineer to ensure the stability and efficiency of overseas model services. Candidates should have a Bachelor's degree, at least 2 years of experience in internet operations, and be proficient in Linux, cloud services like AWS, and programming languages such as Python or Go. The role is based in Singapore and offers a full-time position.
Hunyuan LLM Site Reliability Engineer page is loaded
1. Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations;
2. Responsible for capacity management and planning, resource cost optimization, ensuring reasonable online service capacity and improving resource efficiency;
3. Responsible for continuous integration and delivery, efficient and automated operational optimization, enhancing service stability and research and development efficiency;
4. Participate in the design of online systems and various service architectures, providing professional solutions for stability and architecture improvement;
5. Analyze and deeply explore the shortcomings of existing systems, data-driven to find weak points, and promote system optimization implementation and improvement;
6. Pay attention to industry front-end technology trends, explore technologies and directions for automation and intelligence in the operation and maintenance of complex business systems.
1. Bachelor's degree or above, with 2 years or more experience in internet operations and maintenance;
2. Familiar with Linux operating system, with solid system management and network knowledge;
3. Familiar with deploying, configuring, and tuning components such as Nginx, Redis, MySQL;
4. Proficient in monitoring systems such as Zabbix, Prometheus, Grafana, real-time grasping the running status of overseas systems;
5. Proficient in at least one programming language (such as Python, Go, Shell, etc.), with experience in developing automated operational tools to meet the needs of complex and variable overseas operations and maintenance;
6. Familiar with mainstream public cloud operations and maintenance management overseas (such as AWS, Azure, etc.), with experience in containerization and microservices architecture, able to cope with the characteristics and differences of local cloud services;
7. Strong sense of work responsibility, good communication skills, learning ability, and team spirit;
8. Proficient in English and Chinese, in listening, speaking, reading, and writing, timely writing updated workflow and technical documents as required.
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.