Overview
Epergne Solutions is looking for a highly skilled Senior Platform Engineer to join our infrastructure and platform engineering team. The ideal candidate will have deep expertise in building and managing Kubernetes platforms using Rancher, securing containerized workloads with NeuVector, and working across hybrid or multi-cloud environments. Experience with Harvester or other hyperconverged infrastructure (HCI) platforms is a strong plus. This role is central to designing, deploying, and maintaining our modern infrastructure stack and to enabling secure, scalable environments for application teams.
Responsibilities
- Design, deploy, and manage Kubernetes clusters using Rancher in production and non-production environments.
- Integrate and manage NeuVector for container security, including runtime scanning, policy enforcement, and vulnerability management.
- Implement platform automation using tools such as Terraform, Ansible, Helm, and CI/CD pipelines.
- Collaborate with security and networking teams to define and enforce policies across containerized platforms.
- Build and maintain observability (monitoring, logging, alerting) and operational dashboards.
- Work closely with development and DevOps teams to provide reliable platform capabilities.
- Participate in incident response, root cause analysis (RCA), and performance tuning.
- Optionally, contribute to hyperconverged infrastructure (HCI) platform operations and VM orchestration where Harvester is in use.
Required Skills and Experience
- 6+ years of experience in DevOps, SRE, or Platform Engineering roles.
- Strong hands-on experience with Kubernetes architecture and operations (preferably K8s v1.20+).
- Experience managing production-grade Kubernetes clusters with Rancher.
- Solid understanding of container security best practices and operational experience with NeuVector.
- Proficiency in infrastructure as code (e.g., Terraform, Helm, Ansible).
- Experience with CI/CD tools such as GitLab CI, Jenkins, or Argo CD.
- Good knowledge of Linux systems, networking, and shell scripting.
- Understanding of monitoring/logging tools such as Prometheus, Grafana, Fluentd, ELK, or Loki.
Nice to Have
- Experience with Harvester or other HCI platforms for VM management on bare metal.
- Familiarity with air-gapped or edge deployments.
- Exposure to hybrid/multi-cloud platform deployments.
- Certifications in Kubernetes (CKA/CKS), Rancher, or related technologies.