Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
Join the Monetary Authority of Singapore as a Cloud Site Reliability Engineer, leading a skilled SRE team focused on cloud infrastructure across AWS and Azure. Your leadership will drive best practices, ensuring reliability, scalability, and performance of cloud services. This 2-year contract role is ideal for candidates adept in cloud technology and dedicated to operational excellence.
We are seeking a highly skilled and experienced Cloud Site Reliability Engineer (SRE) with a strong background in AWS and exposure to Azure. The ideal candidate will possess leadership capabilities while also maintaining hands-on technical expertise. Knowledge of Windows systems is essential for this role.
You will be responsible to:
Lead a team of SREs in designing, implementing, and maintaining cloud infrastructure on AWS and Azure platforms.
Develop and implement best practices for cloud reliability, scalability, and performance.
Collaborate with cross-functional teams to ensure seamless integration of cloud services and applications.
Develop and implement robust infrastructure strategies to enhance system reliability, scalability, and performance, with emphasis on cloud best practices and migration strategies.
Troubleshoot and resolve complex technical issues related to cloud infrastructure and services.
Drive continuous improvement in operational efficiency, automation, and service delivery, including optimising cloud resource utilisation to improve operational efficiency and reliability.
Conduct regular performance monitoring and capacity planning to ensure optimal cloud resource utilization.
Mentor and guide a team of engineers, fostering a culture of technical excellence and innovation whilst enhancing their skills and knowledge.
Manage incident response and problem resolution processes, ensuring minimal disruption to critical systems and supporting Critical Information Infrastructure (CIIs) that are owned by the MAS.
Requirements :
Bachelor’s degree in Computer Science, Engineering, or related field.
Strong knowledge of Windows systems, VMware, VDI.
Proven experience in a lead role within a cloud SRE or DevOps team.
Hands-on experience in infrastructure migration projects from on-premises to cloud.
Familiarity with AWS GCC cloud services and exposure to its infrastructure is an added advantage.
Strong communication and interpersonal skills, with the ability to collaborate effectively across all levels of the organisation.
Proficiency in infrastructure and automation tools such as Terraform Ansible, etc.
Excellent problem-solving skills and the ability to thrive in a fast-paced, dynamic environment.
Experience with VDI performance monitoring, optimisation, and troubleshooting.
Knowledge of desktop virtualisation technologies and remote access protocols (PCoIP, HDX, RDP).
Understanding of VDI storage and network requirements for optimal end-user experience.
AWS Certified Solutions Architect or DevOps Engineer certification.
Knowledge of CI/CD pipelines and automation tools like Jenkins or GitLab.
Familiarity with monitoring and logging tools such as Zabbix and ELK stack.
Knowledge of IT security best practices, regulatory compliance requirements, and cloud migration strategies.
Ability to balance tactical operational needs with long-term strategic goals, including performance monitoring and capacity planning.
As part of the shortlisting process for this role, you may be required to complete a medical declaration and/or undergo further assessment.
This will be a 2-year contract. All applicants will be notified on whether they are shortlisted or not within 4 weeks of the closing date of this job posting.