Enable job alerts via email!

Site Reliability Engineer

Monograph

United States

Remote

USD 120,000 - 160,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in the healthcare tech space seeks a Site Reliability Engineer to enhance operational capacity and system performance. The role involves monitoring infrastructure, analyzing slow database queries, and collaborating with engineering teams. The engineer will support a mission-driven team to help transform senior living through technical excellence and innovation.

Benefits

Market competitive compensation
Significant equity option grants
Excellent health, dental, and vision coverage
401K plan
Unlimited vacation days

Qualifications

  • 5+ years of experience in site reliability or backend engineering roles.
  • Familiar with modern observability practices and tools.
  • Excellent communication skills and ability to work collaboratively.

Responsibilities

  • Provide engineering support for customer support team and resolve production issues.
  • Monitor infrastructure health and identify potential issues.
  • Analyze and optimize database queries for improved performance.

Skills

Problem Solving
Infrastructure Craftsmanship
Strong Communication Skills

Tools

Kubernetes
Postgres
Prometheus
Grafana
RabbitMQ
Scala
Play Framework
Typescript
React
Vite

Job description

About August Health

At August Health, our mission is to empower the essential work of caring for our elders.

We achieve this by providing a modern operating platform and electronic health record (EHR) that enables senior living operators to deliver high-quality care with confidence.

Caregivers are the heart of senior living communities, embodying care, compassion, and well-being. Yet, they face increasing challenges—higher resident acuity, complex workflows, and staffing shortages. At August Health, we build tools that simplify their tasks, eliminate inefficiencies, and provide the insights they need to focus on what truly matters—caring for residents.

At August, we strive to live our values each day, in every interaction with our customers and with each other.

  • Be responsible – leave things better than you found them

  • Take ownership – be decisive and take action

  • Be ambitious – build something great

  • Keep an open mindset – communicate candidly and welcome new ideas

  • Be humble – celebrate each others’ successes and learn from our mistakes

  • Stay positive – assume best intent

ABOUT THE JOB

We're looking for a Site Reliability Engineer who can help scale and strengthen the foundation of our infrastructure while supporting a product that genuinely impacts people's lives. You'll join a small team of thoughtful, mission-driven engineers, working to bring stability, observability, and performance to our systems as we grow.

In this role, you’ll help improve our operational capabilities, optimize performance bottlenecks, and work closely with support and engineering teams to address issues from all sides. We're looking for someone who enjoys problem solving and infrastructure craftsmanship—and who brings both technical insight and strong communication skills to the table.

Technologies we use: Kubernetes, Postgres, Prometheus, Grafana, RabbitMQ, Scala, Play Framework, Typescript, React, Vite. Prior experience in these is helpful but not required. A strong ability to learn quickly and work across the stack is essential.

We offer market competitive compensation based on experience and ability, including significant equity option grants. Other benefits include excellent health, dental, and vision coverage, 401K, One Medical, Talkspace, HealthAdvocate, Teladoc memberships, and unlimited vacation days. The whole team works remotely.

KEY RESPONSIBILITIES

  • Provide engineering support for our customer support team, investigating and resolving production issues with empathy and speed.

  • Monitor infrastructure health through observability tools and metrics, proactively identifying and addressing potential issues.

  • Analyze and optimize slow database queries to improve system responsiveness and scalability.

  • Tune configuration settings across our platform to improve performance, reliability, and cost-efficiency.

  • Build and improve internal tooling to support deployment, monitoring, and developer productivity.

  • Bring a thoughtful approach to incident response, root cause analysis, and documentation of postmortems.

WHO YOU ARE

  • 5+ years of experience in site reliability, infrastructure, or backend engineering roles.

  • Familiar with modern observability practices and tools (e.g., Prometheus, Grafana, OpenTelemetry).

  • Comfortable navigating large codebases and debugging production issues in complex systems.

  • You attempt to unblock yourself but know when to bring in help. You communicate well and share what you’ve learned with others.

  • You're comfortable participating in team-level discussions, contributing helpful and relevant insights.

  • You bring a strong, experience-based point of view to technical discussions, but you're also willing to disagree and commit when needed.

  • You respond well to feedback and see it as an opportunity for growth.

  • You’re a good verbal and written communicator.

About our team

Our team brings together deep expertise in technology, healthcare, and company-building. We’ve led teams at Apple, Google, Landmark Health, and Adobe, co-founded and exited multiple companies, shipped products used by hundreds of millions of users, and managed clinical teams caring for thousands of patients.

Backed by top-tier Silicon Valley investors, we are partnering with some of the largest senior care organizations in the U.S. to transform the future of senior living.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Junior Site Reliability Engineer (Remote)

Lensa

Remote

USD 80,000 - 140,000

Yesterday
Be an early applicant

Junior Site Reliability Engineer (Remote)

Lensa

Remote

USD 80,000 - 140,000

Yesterday
Be an early applicant

Site Reliability Engineer II - Remote

Lensa

Remote

USD 83,000 - 175,000

3 days ago
Be an early applicant

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Database

Remote

USD 97,000 - 200,000

7 days ago
Be an early applicant

Site Reliability Engineer

Ford Motor Company

Remote

USD 120,000 - 160,000

7 days ago
Be an early applicant

Site Reliability Engineer

Vallum Associates

Remote

USD 140,000 - 140,000

7 days ago
Be an early applicant

Site Reliability Engineer

Beazley

Remote

USD 110,000 - 150,000

2 days ago
Be an early applicant

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Cloud ERP

Remote

USD 97,000 - 200,000

7 days ago
Be an early applicant

Site Reliability Engineer

Offchain Labs

Remote

USD 100,000 - 720,000

7 days ago
Be an early applicant