Enable job alerts via email!

Site Reliability Engineer (Egypt, Usa, Phillipines, Mexico)

Abc Worldwide

Pretoria

Hybrid

ZAR 400 000 - 500 000

Full time

2 days ago

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A global Business Process Outsourcing firm is seeking Site Reliability Engineers to ensure site reliability and performance for their payment technology services. The role involves working closely with software engineers to maintain infrastructure and automate systems. Ideal candidates should have card payment domain knowledge, experience with CI/CD, and proficiency in cloud services. Excellent communication and collaboration skills are essential, as well as a strong initiative and self-motivation to tackle challenges.

Qualifications

Experience in the card payment domain is mandatory.
Proven experience with CI/CD pipelines.
Strong scripting skills in Unix environments.

Responsibilities

Responsible for pipeline build and maintenance.
Maintain services through monitoring of system health.
Participate in on-call Production support rota.

Skills

Card payment domain knowledge

Experience with CI / CD and Build pipelines using Jenkins

Experience in public and private Cloud offerings

Knowledge of NoSQL & SQL databases

Experience managing distributed systems

Familiarity with Unix tooling and strong scripting skills

Exposure to Monitoring and Alerting tools

Proficiency in Python, Java, or GO

Familiarity defining SLO's and SLA's

Excellent communication and collaboration skills

Job title: Site Reliability Engineer (SRE)

Location : Egypt (Cairo), USA, Phillipines, India

Employment Type: Initial 1-year Fixed term contract with option to move into a permanent position

Job Description Summary

Overview

Our client, a global Business Process Outsourcing (BPO) businesses is looking for Site Reliability Engineers (SRE) to support their global payment technology company that provides platforms to consumers, businesses and organizations to make electronic payments.

The successful candidate will be responsible for ensuring site reliability & performance, monitoring & alerting, and supporting emergency response situations.

This would require working closely with software engineers, DevOps and product teams to maintain robust infrastructure and automation that supports mission-critical applications.

The ideal candidate creates a bridge between development and operations by applying a software engineering mindset to service management.

We are seeking an individual who is highly motivated, intellectually curious, and seeks out opportunities for improvement.

The Role

This role involves working with a team of talented SREs / DevOps Engineers to support highly scalable services.

Responsibilities

Responsible for pipeline build and maintenance in accordance with the clients tooling and conventions.
Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non-functional requirements such as availability, performance, security and maintainability standards.
Maintain services through monitoring of metrics, system health, and analysis of reports.
Provide support for production and in-house systems.
Participate in on-call Production support rota.
Incident management, on call support and root cause analysis conducting post incident reviews and 5-Whys.
Remediate system vulnerability, security and resiliency measures.
Improve process and systems within the Program.
Lead incident management efforts by proactively monitoring and analyzing ISO financial transaction messages across the 4-party payment model (Cardholder, Merchant, Acquirer, Issuer).

Skills & Requirements

Card payment domain knowledge (mandatory)
Experience with CI / CD and Build pipelines using Jenkins.
Experience in public and private Cloud offerings (PCF, Azure, AWS etc.).
Knowledge of NoSQL & SQL databases such as Mongo / Oracle.
Experience and knowledge of managing distributed systems and working with microservices.
Familiarity with Unix tooling, with strong scripting skills.
Exposure to working with Monitoring and Alerting tools such as Splunk, Dynatrace.
Proficiency in one of the following: Python, Java, GO or equivalent.
Familiarity defining SLO's and SLA's.
Prior experience of working in an SRE / DevOps team and excellent understanding of SRE / DevOps principles.
High degree of initiative and self-motivation, with a willingness to take on challenging opportunities.
Excellent communication and relationship building / collaboration skills.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top cities

Top companies

Popular jobs