Senior Site Reliability Engineer

Censys

Ann Arbor (MI)

Remote

USD 145,000 - 195,000

Full time

Today

Job summary

A leading company is seeking a Senior Site Reliability Engineer to enhance developer efficiency and support cloud-native technologies. This role involves building tools for Kubernetes applications, ensuring production reliability, and collaborating with development teams. The ideal candidate will have extensive SRE experience and a passion for automation. Benefits include health coverage and a competitive salary range.

Benefits

401k match
Health Insurance
Vision Insurance
Dental Insurance

Qualifications

  • 5+ years of experience in an SRE role or similar.
  • Experience deploying and managing applications in Kubernetes.

Responsibilities

  • Build and maintain tooling for Kubernetes and Google Cloud Platform.
  • Ensure smooth operations of production environments and debug complex issues.

Skills

Communication
Automation

Tools

Kubernetes
Terraform
Prometheus
Grafana

Job description

Company Background

Censys' mission is to be the one place to understand everything on the internet. Frustrated by the lack of trustworthy Internet intelligence, we set out to create the industry's most comprehensive, accurate, and up-to-date map of the Internet. Today, Censys delivers real-time Internet intelligence and actionable threat insights to global governments, over 50% of the Fortune 500, and leading threat intelligence providers worldwide.

Location

This position can be remote on the East Coast of the United States or in Chicago, IL.

Role Summary

As a Senior Site Reliability Engineer on the Infrastructure and Ops platform team, you will help design, build, and deploy the tools that empower our development teams and production applications. We're looking for talented engineers to help grow our operational maturity and to master the cloud-native technologies that support the growth and reliability of our microservice architecture.

As an SRE focused on developer efficiency and experience, you will improve the efficiency of our engineering and development teams by supporting our developers' SDLC and workflows. This includes writing supporting application code, building automation, and empowering developers to create, deploy, and manage their services end-to-end inside the platform.

What you'll do

  • Build and maintain tooling to support our applications in Kubernetes and in the Google Cloud Platform.
  • Work with development teams to help them build, ship, and deploy services and applications with ease and confidence, and promote service resilience and reliability.
  • Help ensure smooth operations of our production environments, and work with developers to debug complex issues, including creating and monitoring the four golden signals (latency, traffic, errors, and saturation) in our applications.
  • Create a self-service platform by collaborating with the SRE and infrastructure team to accelerate developer velocity, including service catalogs, repository tooling, and documentation. We prioritize a self-service model, treating the development team as our internal customers, listening to feedback, and iterating quickly to add value.
  • Participate in a shared on-call rotation schedule. Both development teams and SRE are responsible for infrastructure uptime and incident response.

Required Qualifications

  • 5+ years of experience in an SRE role or similar.
  • Experience deploying, managing, and debugging applications in Kubernetes, leveraging Helm and Crossplane.
  • Experience building, securing, and managing container images.
  • Experience with cloud environments and services such as Cloud SQL, Pub/Sub, and Memorystore.
  • Familiarity with Infrastructure-as-code tools such as Terraform or Crossplane.
  • Experience monitoring applications with Prometheus, Grafana, and OpenTelemetry.
  • Familiarity with monorepos, trunk-based development, and CI/CD systems such as GitHub Actions or Argo CD.
  • Strong communication skills and a supportive approach to developers, promoting automation and self-service.

Preferred Qualifications

  • Experience supporting a gRPC microservice architecture and familiarity with a Kubernetes service mesh (e.g., Istio).
  • Ability to interface with application code, primarily in Go, Python, and Scala.
  • Knowledge of application security tools, static analysis, and dependency scanning.
  • Comfort with Linux-based environments.

Qualities

  • Passion for clean architecture and GitOps environments.
  • Comfort with projects involving uncertainty and risk.
  • Ability to collaborate with product management and leadership, balancing maintainability and rapid development.
  • Understanding of continuous delivery principles for quick, safe, and sustainable deployment.

What will make you stand out

  • Basic understanding of infrastructure operations, load balancers, DNS, and VPC design.
  • Willingness to explore and understand application code to improve testing, metrics, and reliability.
  • Knowledge of web security, anti-DDoS, and WAF technologies.

Our target salary range for this role is $145,000 to $195,000 USD, plus bonus and equity.

Benefits start day one, including 401k match, health, vision, dental, and more! See our careers page for details.

This position can be remote within the US, or based in our East Coast offices: Ann Arbor, MI, or Tysons, VA.

Note: We are not engaging external agencies for this role.
