A prestigious institution based in Abu Dhabi offers a dynamic and innovative environment for professionals.
Responsibilities :
- Deploy and maintain monitoring tools to track KPIs such as CPU usage, memory, network traffic, and response times, ensuring early detection of anomalies.
- Implement automated alerting systems to reduce manual intervention and ensure quick responses to performance issues.
- Oversee adherence to SLAs and platform performance standards, ensuring effective incident resolution and API performance.
- Analyze incidents, prioritize based on severity, and manage escalation protocols for critical issues, while maintaining accurate incident records.
- Lead security management efforts, including vulnerability monitoring, and ensure proactive threat mitigation, along with BCM / DR planning and recovery drills.
- Conduct post-incident reviews and root cause analysis, driving continuous improvements and implementing preventative measures.
- Manage vendor relationships, the operational budget, and ensure comprehensive technical documentation for efficient incident resolution and platform management.
Minimum Requirements :
- Minimum of 15 years of experience in operations management, with a strong focus on IT operations, platform monitoring, and incident management.
- Experience leading IT operations in a large international firm.
- Proficiency in Arabic is preferred.
- Proven expertise in real-time platform monitoring, incident response, security management, and adherence to SLAs.
- Strong leadership and communication skills, with the ability to drive operational excellence.
- Bachelor's degree in Information Systems, Operations Management, or a related field, and mandatory ITSM certification.
J-18808-Ljbffr