Enable job alerts via email!

Site Reliability Engineer (SRE)

Randstad (Schweiz) AG

Singapore

On-site

USD 60,000 - 120,000

Full time

9 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Site Reliability Engineer (SRE) to enhance system performance and reliability. In this pivotal role, you will design and maintain robust infrastructure, implement DevOps practices, and collaborate with teams to ensure optimal system functionality. The ideal candidate will be proficient in programming languages like Golang, Java, or Python, and possess solid skills in DevOps and Infrastructure as Code. Join a dynamic team that values collaboration and continuous improvement, and make a significant impact in a fast-paced environment.

Qualifications

  • Proficient in at least one programming language: Golang, Java, or Python.
  • Solid experience with DevOps tools like Terraform, Ansible, and Jenkins.

Responsibilities

  • Design, develop, and maintain scalable infrastructure and applications.
  • Monitor system performance and ensure high availability.

Skills

Golang
Java
Python
DevOps
Infrastructure as Code (IaC)
SQL
Networking
Troubleshooting

Tools

Terraform
Ansible
Jenkins
Kubernetes
Docker

Job description

We're Hiring: Site Reliability Engineer (SRE)!

We are looking for a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have experience managing the performance, availability, and scalability of mid- to large-sized systems. You should be proficient in at least one programming language (Golang, Java, or Python), with strong fundamentals in DevOps, Infrastructure as Code (IaC), Networking, and SQL. A solid understanding of the Software Development Life Cycle (SDLC) is essential to succeed in this role.

Key Responsibilities:

  • Design, develop, and maintain scalable and reliable infrastructure and applications.
  • Monitor system performance, ensure high availability, and proactively address potential issues.
  • Implement and manage DevOps pipelines and Infrastructure as Code (IaC) frameworks.
  • Collaborate with software engineers, QA, and operations teams to ensure system reliability and performance.
  • Perform root cause analysis for incidents and drive corrective actions.
  • Develop and maintain monitoring, alerting, and automated response systems.
  • Optimize database performance with intermediate SQL skills.
  • Apply strong networking knowledge to design and maintain robust network architectures.
  • Follow best practices in SDLC to contribute to the full software lifecycle, including requirements gathering, design, development, testing, deployment, and maintenance.

Required Skills and Qualifications:

  • Programming: Proficient in at least one — Golang, Java, or Python.
  • DevOps/IaC: Solid experience with tools like Terraform, Ansible, Jenkins, Kubernetes, Docker, etc.
  • SQL: Intermediate-level skills in writing and optimizing SQL queries.
  • Networking: Good understanding of networking fundamentals (TCP/IP, DNS, Load Balancing, VPN, etc.).
  • System Design: Experience in designing distributed, scalable, and highly available systems.
  • SDLC: Strong understanding of software development processes and best practices.
  • Strong troubleshooting and debugging skills across systems and applications.
  • Familiarity with cloud platforms (AWS, Azure, GCP) is a plus.

Ready to make an impact? Apply now and let's grow together!

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.