DevSecOps

TOTAL EBIZ SOLUTIONS PTE. LTD.

Singapore

On-site

SGD 80,000 - 100,000

Full time

Today

Job summary

A tech solutions provider in Singapore is seeking a Site Reliability Engineer (SRE) to manage complex containerized applications within AWS and Kubernetes environments. The ideal candidate will possess a degree in Computer Science or Engineering, with proven experience in resolving technical issues and implementing incident management strategies. Strong documentation and communication skills are essential for collaborating across teams and improving operational processes.

Qualifications

  • Proven experience as a Site Reliability Engineer or similar role.
  • Strong background in containerization and cloud-native technologies.
  • Ability to troubleshoot complex technical issues.

Skills

Containerization
Cloud-native technologies
Incident management
Troubleshooting
AWS
Kubernetes
Automated testing
CI/CD tools
Observability
Problem-solving

Education

Bachelor's degree or Diploma in Computer Science or Engineering

Tools

Terraform
GitLab CI/CD
Prometheus
Grafana
ELK Stack

Job description

Overview

Job Title: Site Reliability Engineer (SRE)

Requirements
  • Bachelor's degree or Diploma in Computer Science, Engineering, or a related field (or equivalent experience).
  • Proven experience as a Site Reliability Engineer or similar role, with a strong background in containerization, orchestration, and cloud-native technologies.
  • Proven ability to troubleshoot and resolve complex technical issues in containerized applications.
  • Demonstrated experience with incident management, including post-incident reviews and continuous improvement.
  • Strong documentation skills and experience in knowledge sharing across teams.
  • Deep understanding of AWS, Kubernetes (including AWS EKS), and operational best practices, plus familiarity with multi-cloud or hybrid environments.
  • Solid grasp of networking, security, and storage in both AWS and Kubernetes contexts.
  • Experience integrating Kubernetes with AWS cloud technologies (e.g., Secrets Manager, Load Balancers) and using infrastructure-as-code (Terraform or similar).
  • Hands-on experience with containerization tools (Kubernetes, Kustomize, Helm) and automation scripting (Go, Python, Bash, or equivalent).
  • Ability to write and maintain automated tests or conduct thorough manual testing for automation scripts, ensuring the reliability and effectiveness of automated solutions.
  • Familiarity with CI/CD tools (GitLab CI/CD, ArgoCD) and version control systems (Git).
  • Experience with observability and monitoring tools (Prometheus, Grafana, ELK Stack) and with defining SLOs and error budgets.
  • Certifications such as Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD) are a plus.
  • Experience developing Kubernetes operators in Go, working with service mesh technologies, or practicing chaos engineering is a plus.

Soft skills
  • Proactive in identifying problems and recommending strategic solutions.
  • Excellent problem-solving skills with a robust analytical mindset.
  • Clear, concise, and effective communication skills; adept at collaborating across cross-functional teams, including development, security, and customer-facing groups.
  • Ability to remain calm and effective under pressure, especially during incident response.
  • Adaptability to rapid change with a continuous learning mindset, sharing knowledge to foster team growth.
  • Customer-focused with the ability to translate technical insights into understandable, actionable guidance.
  • Leadership and mentoring capabilities that contribute to a resilient and collaborative team environment are a plus.