
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading IT services provider in Singapore seeks an experienced professional to architect and implement production-grade OpenShift clusters using OpenStack. The role involves managing CI/CD pipelines, developing monitoring processes, and ensuring compliance with government security standards. Ideal candidates will have over 5 years of experience in Linux system administration and a strong understanding of networking and storage virtualization. This position offers opportunities for collaboration and technical advisory within a cross-functional team.
Job Description
Description
Work closely with CEP customer on technical requirement gathering and architect and implement production‑grade OpenShift clusters on OpenStack, including control plane, compute nodes, storage integrations, and networking. Adapt typical OpenShift and OpenStack design into government security and governance compliance construct. Provide deep technical advisory and design decision rationales to internal and external stakeholders. Define and automate infrastructure provisioning (IaC) using tools such as Terraform, Ansible, or Red Hat Ansible Tower.
Develop and maintain monitoring, alerting, and logging pipelines (Prometheus, Grafana, EFK/ELK, Alertmanager). Lead capacity planning, performance tuning, and day‑to‑day cluster health management. Implement robust backup, disaster recovery, and upgrade strategies.
Build and manage CI/CD pipelines (Jenkins, GitLab CI, Argo CD) for platform updates, operator deployments, and application rollouts. Author scripts and operators to automate routine maintenance, scaling, and self‑healing tasks.
Enforce security best practices: RBAC, network policies, SELinux, secrets management (Vault, OpenShift Secrets). Collaborate with security teams to implement vulnerability scanning, baseline hardening, and compliance audits.
Partner with development, QA, and networking teams to onboard new applications and troubleshoot platform issues. Produce runbooks, run‑charts, design docs, and knowledge‑base articles.