Enable job alerts via email!

Lead Infra Ops & Support Specialist (SRE) (Contract)

Monetary Authority of Singapore (MAS)

Singapore

On-site

SGD 80,000 - 140,000

Full time

2 days ago
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

Join the Monetary Authority of Singapore as a Cloud Site Reliability Engineer, leading a skilled SRE team focused on cloud infrastructure across AWS and Azure. Your leadership will drive best practices, ensuring reliability, scalability, and performance of cloud services. This 2-year contract role is ideal for candidates adept in cloud technology and dedicated to operational excellence.

Qualifications

  • Proven experience in a lead role within a cloud SRE or DevOps team.
  • Hands-on experience in infrastructure migration projects from on-premises to cloud.
  • AWS Certified Solutions Architect or DevOps Engineer certification.

Responsibilities

  • Lead a team of SREs in designing and maintaining cloud infrastructure.
  • Develop and implement best practices for cloud reliability and performance.
  • Manage incident response and problem resolution processes.

Skills

Leadership
Problem-solving
Collaboration

Education

Bachelor’s degree in Computer Science, Engineering, or related field

Tools

Terraform
Ansible
Jenkins
GitLab

Job description

We are seeking a highly skilled and experienced Cloud Site Reliability Engineer (SRE) with a strong background in AWS and exposure to Azure. The ideal candidate will possess leadership capabilities while also maintaining hands-on technical expertise. Knowledge of Windows systems is essential for this role.

You will be responsible to:

  • Lead a team of SREs in designing, implementing, and maintaining cloud infrastructure on AWS and Azure platforms.

  • Develop and implement best practices for cloud reliability, scalability, and performance.

  • Collaborate with cross-functional teams to ensure seamless integration of cloud services and applications.

  • Develop and implement robust infrastructure strategies to enhance system reliability, scalability, and performance, with emphasis on cloud best practices and migration strategies.

  • Troubleshoot and resolve complex technical issues related to cloud infrastructure and services.

  • Drive continuous improvement in operational efficiency, automation, and service delivery, including optimising cloud resource utilisation to improve operational efficiency and reliability.

  • Conduct regular performance monitoring and capacity planning to ensure optimal cloud resource utilization.

  • Mentor and guide a team of engineers, fostering a culture of technical excellence and innovation whilst enhancing their skills and knowledge.

  • Manage incident response and problem resolution processes, ensuring minimal disruption to critical systems and supporting Critical Information Infrastructure (CIIs) that are owned by the MAS.

Requirements :

  • Bachelor’s degree in Computer Science, Engineering, or related field.

  • Strong knowledge of Windows systems, VMware, VDI.

  • Proven experience in a lead role within a cloud SRE or DevOps team.

  • Hands-on experience in infrastructure migration projects from on-premises to cloud.

  • Familiarity with AWS GCC cloud services and exposure to its infrastructure is an added advantage.

  • Strong communication and interpersonal skills, with the ability to collaborate effectively across all levels of the organisation.

  • Proficiency in infrastructure and automation tools such as Terraform Ansible, etc.

  • Excellent problem-solving skills and the ability to thrive in a fast-paced, dynamic environment.

  • Experience with VDI performance monitoring, optimisation, and troubleshooting.

  • Knowledge of desktop virtualisation technologies and remote access protocols (PCoIP, HDX, RDP).

  • Understanding of VDI storage and network requirements for optimal end-user experience.

  • AWS Certified Solutions Architect or DevOps Engineer certification.

  • Knowledge of CI/CD pipelines and automation tools like Jenkins or GitLab.

  • Familiarity with monitoring and logging tools such as Zabbix and ELK stack.

  • Knowledge of IT security best practices, regulatory compliance requirements, and cloud migration strategies.

  • Ability to balance tactical operational needs with long-term strategic goals, including performance monitoring and capacity planning.

As part of the shortlisting process for this role, you may be required to complete a medical declaration and/or undergo further assessment.

This will be a 2-year contract. All applicants will be notified on whether they are shortlisted or not within 4 weeks of the closing date of this job posting.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.