Job Search and Career Advice Platform

Enable job alerts via email!

Senior SRE Developer

Worldline

Kuala Lumpur

Hybrid

MYR 90,000 - 120,000

Full time

24 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading payments technology firm is seeking a Senior Site Reliability Engineer (SRE) to enhance system reliability and automation in Kuala Lumpur. This role involves managing AWS infrastructure, overseeing CI/CD pipelines, and implementing observability solutions. Candidates should have strong AWS, Terraform, and Python skills, and a track record in production environments. The position supports a hybrid work model along with various employee benefits such as flexi benefits and birthday leave.

Benefits

Hybrid work model
Flexi benefits
Birthday leave

Qualifications

  • Strong hands-on automation skills.
  • Experience in supporting production environments.
  • Deep knowledge of AWS services.

Responsibilities

  • Develop and manage Infrastructure as Code using Terraform and Ansible.
  • Design and manage CI/CD pipelines in GitLab.
  • Implement monitoring, logging, and alerting solutions.
  • Lead incident response and change management.
  • Manage Kubernetes clusters and applications.

Skills

AWS expertise
Automation with Terraform
CI/CD experience with GitLab
Python scripting
Kubernetes management

Tools

Terraform
Ansible
Prometheus
Grafana
ELK Stack
Job description

Worldline helps businesses of all shapes and sizes to accelerate their growth journey - quickly, simply, and securely. We are the innovators at the heart of the payments technology industry, shaping how the world pays and gets paid. Our technology powers the growth of millions of businesses across 5 continents.
And just as we help our customers accelerate their business, we are committed to helping our people accelerate their careers. Together, we shape the evolution.

The Opportunity

We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) with deep expertise in AWS, modern DevOps tooling, and fundamental infrastructure. You will apply an engineering mindset to solve operational challenges, focusing on automating away toil, improving system reliability, and ensuring our critical services meet and exceed Service Level Agreements (SLAs). The ideal candidate has strong hands-on automation skills, extensive experience supporting production environments, and a deep understanding of what it takes to build and maintain high-availability systems, preferably within the demanding context of the finance or banking sector.

Day-to-Day Responsibilities
  • Automation & IaC: Develop and manage our Infrastructure as Code (IaC) using Terraform and Ansible. Write robust scripts in Python and Shell to automate operational tasks, deployments, and failure recovery.
  • CI/CD & DevOps: Design, manage, and optimize our CI/CD pipelines in GitLab to enable fast, safe, and repeatable software delivery to production.
  • Observability: Implement and enhance our monitoring, logging, and alerting stack using Prometheus, Grafana, and ELK to provide deep insights into system health and performance.
  • Incident & Change Management: Lead incident response and management for production services, conducting blameless post-mortems to drive continuous improvement. Manage the production change management process to minimize risk.
  • Container & Cloud Orchestration: Manage and scale our Kubernetes (EKS) clusters and containerized applications. Provide expertise on AWS services including VPC, Transit Gateway, RDS, and other core components.
  • Reliability & Availability: Design, build, and maintain our core infrastructure on AWS to ensure high availability, scalability, and resilience. Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to meet our SLAs.
What We Are Looking For
  • Proven Experience: Demonstrated track record as an SRE, DevOps Engineer, or similar role with a focus on supporting production environments.
  • Automation & IaC: Strong, hands‑on expertise with Terraform and Ansible. Excellent proficiency in Python and Shell scripting is a must.
  • Cloud & Infrastructure: Deep knowledge of AWS services (VPC, Transit Gateway, EKS, RDS, IAM). Solid understanding of fundamental infrastructure: Linux administration, networking concepts, SAN storage, and database management (Oracle, MySQL, PostgreSQL).
  • DevOps & CI/CD: Strong experience with Git (specifically GitLab) and building/maintaining complex CI/CD pipelines, hands‑on skill for Jira and Confluence, familiar with Scrum & Agile developing process.
  • Observability & Containers: Hands‑on experience with Kubernetes, Docker, and observability tools such as Prometheus, Grafana, and the ELK Stack (Elasticsearch, Logstash, Kibana), Distributed tracing (APM, Jaeger).
  • SRE Mindset: In‑depth knowledge of SRE principles, including incident management, change management, ITSM/ITIL process, SLOs/SLIs/SLAs, and designing for high availability and resilience in distributed systems.
Perks & Benefits
  • Hybrid work model
  • Flexi benefits
  • Birthday leave to celebrate your special day
Shape the evolution

We are pushing towards the next frontiers of payments technology, and we look for big thinkers to join our journey.
People with passion, can‑do attitude and a hunger to learn and grow.
Here you'll work with ambitious colleagues from around the world, take on unique challenges as a team, and make a real impact on the society. And with our empowering culture, strong technology and extensive training opportunities, we help you accelerate your career.
Wherever you decide to go. Join our global team of over 18,000 innovators across 40+ countries, and shape a tomorrow that is yours to own.

Learn more about life at Worldline at

jobs.worldline.com

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.