Enable job alerts via email!

Associate Director, Technology Operations (AIOps/NoOps)

AIA SINGAPORE PRIVATE LIMITED

Singapore

On-site

SGD 120,000 - 180,000

Full time

26 days ago

Job summary

A leading insurance company in Singapore seeks an experienced IT Operations Manager to design and implement strategies using AI/ML solutions and manage a high-performing team. The role involves automating IT operations, enhancing system reliability, and ensuring compliance with security standards. Ideal candidates should have significant experience in IT operations with expertise in AIOps and automation tools.

Qualifications

  • Minimum 10 to 15 years of experience in IT operations, DevOps, or Site Reliability Engineering.
  • Hands-on experience with AI/ML models and automation tools.
  • Proven track record in implementing AIOps/NoOps strategies.

Responsibilities

  • Drive adoption of GenAI and ML models in IT operations.
  • Build and mentor a high-performing team of engineers and analysts.
  • Implement self-healing mechanisms and automated processes.
  • Design real-time monitoring dashboards and alerting systems.

Skills

AI/ML models
Data analytics
Automation tools
Scripting (Python, PowerShell)
Monitoring tools
Leadership

Education

Degree in Information Technology, Computer Science, or Computer Engineering

Tools

Splunk
Dynatrace
Elastic
ServiceNow
Kubernetes
Terraform
Ansible
Job description

In this role, the incumbent will report to Head of Technology Operations & Products and manage 10 – 15 team members. The incumbent will design and implement strategies to automate and optimize IT operations using AI/ML-driven solutions, self-healing systems, and cloud-native practices. The incumbent will lead initiatives to eliminate traditional operational bottlenecks and manual processes, enhance system reliability, and foster collaboration across DevOps, SRE, and engineering teams. The expertise that the incumbent brings will bridge the gap between innovation and execution, ensuring AIA Singapore technology stack operates autonomously and proactively.

WHAT YOU’LL BE DOING

  • Drive adoption of GenAI and ML models in IT operations for predictive analytics, anomaly detection, and automated remediation.
  • Build and mentor a high-performing team of engineers and analysts specializing in AIOps(e.g. Elastic ESRI, Splunk ITSI) and GenAI.
  • Implement self-healing mechanisms and closed-loop automation to minimize human intervention in routine operations.
  • Lead AI-driven incident response to minimize downtime, and overall operational efficiency through proactive monitoring and predictive insights.
  • Design real-time monitoring dashboards and AI-powered alerting systems.
  • Conduct post-incident reviews to refine automation workflows and prevent recurrence.
  • Ensure automated systems are complied with security, governance, and regulatory standards.
  • Leverage the GenAI-driven AIOps on cost savings, productivity, and customer satisfaction.
  • Foster a culture of innovation, continuous learning, and cross-functional collaboration.

WHAT WE ARE LOOKING FOR

  • Degree from a recognized University in Information Technology, Computer Science, Computer Engineering.
  • Certifications in AIOps, DevOps, AWS/Azure/GCP, ITIL, or related fields are a plus. GenAI certifications (e.g., NVIDIA, Google, Databricks) is highly desirable.
  • Minimum 10 to 15 years of experience in IT operations, DevOps, SRE (Site Reliability Engineering), or a similar role.
  • Hands-on experience with AI/ML models, data analytics, and automation tools (e.g., Splunk, Dynatrace, Elastic, ServiceNow, Kubernetes, Terraform, Ansible).
  • Proven track record of successfully implementing AIOps/NoOps strategies in large-scale enterprises, preferably in highly regulated industries such as financial insurance.
  • Proficiency in scripting (Python, PowerShell) and monitoring tools (Prometheus, Grafana, Datadog).
  • Requires in-depth experience, knowledge and skills in own discipline.
  • Uses best practices and knowledge of internal/external business issues to improve products or services.
  • The ability to work in high-pressure environment, troubleshoot complex issues across on-prem and cloud quickly, and successfully handle multiple priorities.
  • Have systematic problem-solving approach, effective communications skills and have sense of ownership and drive.
  • The ability to work independently with minimal guidance, manage resources and to perform capacity planning.
  • Applies best practices and knowledge of internal/external business issues to improve products or services in own discipline.
  • Solves moderately complex problems; takes a new perspective on existing solutions.
  • Interprets customer needs, assesses requirements and identifies solutions to non-standard requests.
  • Explains information and persuades others in straightforward situations.
  • Makes decisions for own work priorities and allocation of time to meet deadlines.
  • Accountable for technical contribution to project team or sub-team.
  • Builds awareness of costs related to own work.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.