Job Search and Career Advice Platform

Enable job alerts via email!

DevOps&SRE Engineer

Manus AI

Singapore

On-site

SGD 70,000 - 90,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking AI company in Singapore is seeking a talented Systems Operations Engineer to manage container clusters and drive automation initiatives. The ideal candidate has over 2 years of experience in DevOps or Site Reliability Engineering, with strong skills in Kubernetes and Docker. Join a collaborative environment where technology and innovation come together to make a real impact.

Qualifications

  • 2+ years of hands-on experience in Systems Operations, DevOps, or Site Reliability Engineering (SRE).
  • Experience with public cloud platforms (AWS, Azure, or GCP) is highly valued.
  • Strong understanding of large-scale internet architecture and distributed systems.

Responsibilities

  • Manage and maintain container clusters and open-source component clusters.
  • Ensure optimal performance, scalability, and reliability of distributed systems.
  • Design, build, and enhance infrastructure operation platforms.

Skills

Kubernetes
Docker
Monitoring and logging tools
Python
Shell scripting

Education

Bachelor's degree in Computer Science or related field

Tools

AWS
Azure
GCP
Nginx
MySQL
Redis
Kafka
Elasticsearch
Job description
Key Responsibilities
  • Cluster Operations & Management: Manage and maintain container clusters (Kubernetes, Docker) and open-source component clusters (Kafka, Redis, Elasticsearch) across multiple business units

  • Ensure optimal performance, scalability, and reliability of distributed systems

  • Infrastructure Platform Development: Design, build, and enhance infrastructure operation platforms

  • Develop and maintain systems for infrastructure management, CI/CD pipelines, monitoring/alerting, and centralized logging

  • Drive platform standardization and automation initiatives

  • High Availability & Reliability: Ensure maximum uptime for production services through proactive monitoring and incident response

  • Continuously optimize service architecture, deployment strategies, and operational processes

  • Implement and maintain SLA/SLO frameworks and reliability engineering practices

  • Automation & Process Improvement: Lead the development of automated operations and maintenance systems

  • Create self-service tools and workflows to improve team productivity

  • Establish best practices for infrastructure such as code and configuration management

Required Qualifications

Experience & Education

  • 2+ years of hands-on experience in Systems Operations, DevOps, or Site Reliability Engineering (SRE)

  • Bachelor's degree in Computer Science, Engineering, or related technical field preferred

Cloud & Infrastructure

  • Experience with public cloud platforms (AWS, Azure, or GCP) is highly valued

  • Strong understanding of large-scale internet architecture and distributed systems

  • Proven experience with infrastructure monitoring, logging, and observability tools

Technical Skills

  • Proficiency in scripting and automation using Shell, Python, or similar languages

  • Strong knowledge of containerization technologies (Kubernetes, Docker)

  • Hands-on experience operating production-grade container clusters and managing CI/CD pipelines

  • Strong familiarity with common infrastructure components: Nginx, MySQL, Redis, Kafka, Elasticsearch

Advanced Networking (Preferred)
  • Experience with Service Mesh architectures, Cilium CNI, and eBPF technologies

  • Understanding network security, load balancing, and traffic management

  • Knowledge of cloud-native networking patterns and best practices

About Manus AI

Manus is a general AI agent that bridges minds and actions: it doesn't just think, it delivers results. Manus excels at various tasks in work and life, getting everything done while you rest. Manus AI, we offer a highly collaborative and innovative environment where experts across engineering, research, and business come together to push the boundaries of AI applications. If you're passionate about cutting-edge technology and making a real impact, we’d love to hear from you!

Contact us: recruiting@manus.im

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.