Enable job alerts via email!

Site Reliability Engineer (K8s) - Remote

The Restaurant Store, LLC

Lititz (Lancaster County)

Remote

USD 80,000 - 120,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading online distributor of restaurant supplies as a Site Reliability Engineer. This innovative firm is looking for a mid to senior-level engineer to manage on-premises Kubernetes clusters and enhance their CI/CD processes. With a focus on fast-paced e-commerce, you'll ensure uptime and reliability, utilizing cutting-edge technologies like Helm, Argo-CD, and HashiCorp Vault. This role offers an exciting opportunity to grow your career in a dynamic environment where your contributions directly impact the business's success.

Qualifications

  • Experience managing on-premises clusters and deployments.
  • Strong background in CI/CD and Kubernetes technologies.

Responsibilities

  • Manage on-premises Kubernetes clusters and troubleshoot deployments.
  • Deploy resources using CI/CD tools like Argo-CD and Gitlab-CD.

Skills

Kubernetes/K8s
CI/CD platforms
Helm
Troubleshooting
Secrets management
Persistent storage management

Tools

Argo-CD
Gitlab-CD
Flux
Sealed Secrets
HashiCorp Vault
Rook
Ceph
NFS
HAProxy
Nginx
Traefik

Job description

Job Summary

As the largest online distributor of restaurant supplies and equipment, WebstaurantStore hosts an expansive catalogue with over 430,000 products that are delivered through fast, dependable shipping. Unlike most in the e-commerce arena, almost all of our technological design, development, and systems management is done in-house, allowing us to create custom solutions within an ever-changing market. This consistent, organic growth is why we have a need for a Site Reliability Engineer with a focus in Kubernetes/K8s (Mid to Senior) looking to further their career.

Our ideal candidate is someone who is excited to dive into the fast-paced world of e-commerce, where every second counts and downtime isn’t an option. Successful SREs at our company have typically started their careers as developers or systems engineers who sought a wider variety of work.

Responsibilities
  • Managing on-premises clusters.
  • Deploying resources with a CI/CD platform (Argo-CD, Gitlab-CD, Flux, etc.)
  • Managing deployments with Helm and Kustomize.
  • Troubleshooting pods, nodes, deployments, etc.
  • Utilizing secrets management platforms such as Sealed Secrets and HashiCorp Vault.
  • Managing persistent storage using Rook / Ceph,NFS, etc.
  • Configuring ingress controllers such as HAProxy, Nginx, Traefik, etc.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.