Overview
Our client, a global BPO business is looking for Site Reliability Engineers to support a global payment technology company that provides platforms to consumers, businesses and organizations to make electronic payments. The successful candidate will be responsible for ensuring site reliability & performance, monitoring & alerting, and supporting emergency response situations. The ideal candidate creates a bridge between development and operations by applying a software engineering mindset to service management.
The Role
- Responsible for pipeline build and maintenance in accordance with the client’s tooling and conventions.
- Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non‑functional requirements such as availability, performance, security and maintainability standards.
- Maintain services through monitoring of metrics, system health, and analysis of reports.
- Provide support for production and in‑house systems. Participate in on‑call production support rota.
- Incident management, on‑call support and root‑cause analysis conducting post‑incident reviews and 5‑Whys.
- Remediate system vulnerability, security and resiliency measures.
- Improve process and systems within the program.
- Lead incident management efforts by proactively monitoring and analyzing ISO 8583 financial transaction messages across the four‑party payment model (Cardholder, Merchant, Acquirer, Issuer).
Skills & Requirements
- Card payment domain knowledge (mandatory).
- Experience with CI / CD and build pipelines using Jenkins.
- Experience in public and private cloud offerings (PCF, Azure, AWS, etc.).
- Knowledge of NoSQL & SQL databases such as Mongo / Oracle.
- Experience managing distributed systems and working with microservices.
- Familiarity with Unix tooling and strong scripting skills.
- Exposure to monitoring and alerting tools such as Splunk, Dynatrace.
- Proficiency in one of the following: Python, Java, Go or equivalent.
- Familiarity defining SLOs and SLAs.
- Prior experience working in an SRE / DevOps team and excellent understanding of SRE / DevOps principles.
- High degree of initiative and self‑motivation, with a willingness to take on challenging opportunities.
- Excellent communication and relationship building / collaboration skills.