
A leading insurance company in Kuala Lumpur is seeking a System/Site Reliability Engineer to enhance system availability and performance. This role involves collaborating with teams on automation, monitoring, and incident management. The ideal candidate has a Bachelor's degree in Computer Science, with 3–5 years of experience in related fields and a strong understanding of cloud technologies. Join us in our mission to promote healthier lives.
At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.
As pioneering innovators for over 100 years, we’re now transforming our organisation to be faster, simpler and more connected, so that we are even better equipped to develop digital solutions and experiences that help more people live Healthier, Longer, Better Lives.
To get there, we need people with tech/digital/analytics expertise and passion to help develop positive, sustainable change through digitally enhanced experiences that will impact the lives of millions of people and create a healthier future for everyone.
If you believe in developing a better tomorrow, read on.
The purpose of this role is to ensure the reliability, scalability, and performance of enterprise systems and services by applying software engineering principles to operations. The System / Site Reliability Engineer will collaborate with development and operations teams to build robust automation, monitor system health, respond to incidents, and continuously improve service availability and efficiency. This role is critical in bridging the gap between software development and IT operations, fostering a culture of resilience, observability, and proactive problem-solving.
Ensure System Reliability and Availability
Incident Management and Root Cause Analysis
Automation and Tooling
Monitoring and Observability
Security and Compliance
Capacity Planning and Performance Optimization
Work closely with development, QA, and infrastructure teams to embed reliability into the software development life cycle (SDLC). Promote SRE principles across teams to foster a culture of resilience and accountability.
Maintain clear operational documentation, runbooks, and architecture diagrams.
Build a career with us as we help our customers and the community live Healthier, Longer, Better Lives.
You must provide all requested information, including Personal Data, to be considered for this career opportunity. Failure to provide such information may influence the processing and outcome of your application. You are responsible for ensuring that the information you submit is accurate and up-to-date.