Senior Site Reliability Engineer (SRE)

VANGUARD SOFTWARE PTE. LTD.

Singapore

On-site

SGD 90,000 - 130,000

Full time

Today

Job summary

A reputable technology company in Singapore is looking for a Senior Site Reliability Engineer to join their expanding engineering team. The ideal candidate will have at least 5 years of experience in DevOps or Site Reliability Engineering, with expertise in cloud platforms and automation tools. Responsibilities include designing scalable infrastructure, optimizing CI/CD pipelines, and enhancing system reliability. Candidates should possess strong problem-solving skills, be adaptable, and communicate effectively with both technical and non-technical stakeholders.

Benefits

Technical Leadership Opportunities
Continuous Growth with mentorship
Flexible work culture

Qualifications

  • Minimum 5 years of DevOps or SRE experience.
  • Strong background in networking and distributed systems.
  • Capable of designing fault-tolerant and scalable infrastructure.

Responsibilities

  • Design and maintain scalable cloud infrastructure.
  • Build and optimize automated deployment pipelines.
  • Establish observability standards for monitoring systems.

Skills

Cloud platforms (AWS, GCP, or Azure)
Containerization (Docker, Kubernetes)
Scripting (Python, Go, Bash)
Linux administration
Monitoring tools (Prometheus, Grafana, Datadog)

Education

Bachelor's Degree in Computing or related field

Tools

Terraform
Ansible
Jenkins

Job description
Job Summary

We are seeking a Senior Site Reliability Engineer (SRE) to join our growing engineering team. In this role, you will work independently to design, build, and optimize infrastructure and deployment pipelines that ensure the stability, scalability, and security of our systems. You will take full responsibility for automating workflows, improving observability, and enabling development teams to ship code faster and more safely. This is an excellent opportunity for an experienced engineer with at least 5 years of work experience who thrives on ownership, reliability, and technical leadership.

Key Responsibilities
  • Infrastructure & Automation: Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) tools.
  • CI/CD Pipelines: Build and optimize automated pipelines for testing, deployment, and release management.
  • Monitoring & Reliability: Establish observability standards and implement monitoring, logging, and alerting systems to ensure system health.
  • Security & Compliance: Enforce best practices for cloud security, access control, and compliance across environments.
  • Collaboration: Partner with backend, frontend, and product teams to ensure smooth deployments and reliable system operations.
  • Process & Mentorship: Improve DevOps processes, share best practices, and mentor junior engineers.
Job Requirements
  • Bachelor's Degree in Computing, Software Engineering, IT, or a related field.
  • Experience: Minimum 5 years of DevOps, Site Reliability Engineering (SRE), or related experience.
  • Tech Stack: Proficient with cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), IaC (Terraform, Ansible, Helm), and CI/CD tools (Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD, etc.).
  • Systems Knowledge: Strong background in Linux administration, networking, and distributed systems.
  • Monitoring & Observability: Hands-on experience with tools like Prometheus, Grafana, ELK/EFK, or Datadog.
  • Scripting & Automation: Proficient in one or more languages (Python, Go, Bash, etc.).
  • Problem Solving: Skilled at diagnosing complex issues, ensuring high availability, and improving system performance.
  • System Design: Capable of designing fault-tolerant, secure, and scalable infrastructure with disaster recovery in mind.
  • Good written and spoken English; Mandarin is highly desirable for liaising with Chinese-speaking clients and counterparts to understand their technical requirements.
Soft Skills
  • Team Mindset: Collaborate effectively across teams, proactively contributing to company goals.
  • Ownership: Take responsibility for infrastructure health and ensure continuous improvements.
  • Adaptability: Open to new technologies, evolving processes, and changing business needs.
  • Communication: Clearly explain technical topics to both engineers and non-technical stakeholders.
What We Offer
  • Technical Leadership Opportunities: Lead infrastructure design for high-impact projects and guide DevOps best practices.
  • Continuous Growth: Access to mentorship, certifications, and a clear career progression path.
  • High-Performance Collaboration: Work with a talented team in a modern DevOps environment (Agile, CI/CD, GitOps).
  • Flexibility and Trust: An open culture that values innovation, autonomy, and results-driven decision-making.