Job Responsibilities
- The candidate will play a key role in automating production operations, ensuring seamless integration with the daily monitoring operations of the Agency's IT systems.
- Continuous Service Improvement: Add value to Data Centre Operations by identifying improvement opportunities to enhance production stability, stakeholder experience, and overall service quality.
- Identifying opportunities for automation within production processes to streamline operations and reduce manual intervention.
- Timely escalation and communication of major incidents, including following escalation matrix, leading, driving, facilitating, and chairing investigation activities.
- Being accountable for resolving incidents with workarounds or permanent solutions.
- Participate in troubleshooting system and network connectivity issues before escalation.
- Monitoring production metrics and performance indicators to ensure targets are met.
- Ensuring compliance with safety and regulatory standards in production.
- Other ad-hoc duties as instructed.
Requirements
- Strong analytical skills: Ability to analyze production processes and identify areas for improvement and optimization.
- Technical proficiency: Profound knowledge of production systems issues and effective solutions to enhance operational efficiency.
- Problem-solving aptitude: Skill in troubleshooting production issues and devising effective solutions.
- Experience in Data Centre facilities, including temperature and humidity monitoring systems, smoke detection and fire suppression systems, UPS units, CCTC, keypress units, etc.
- Familiarity with AWS tools such as CloudWatch, CloudTrail, etc., is an advantage.
- Able to work shift duties, including:
- First shift: 7:00 am to 3:00 pm
- Second shift: 2:45 pm to 10:30 pm
- Third shift: 10:15 pm to 07:15 am
The 8-day rotational shift pattern is as follows: two days of first shift, two days of second shift, two days of third shift, followed by two off days.