[Hiring] Senior Site Reliability Engineer @Owner

Owner

United States

Remote

USD 170,000 - 210,000

Full time

Yesterday

Job summary

A leading company is seeking a Senior Site Reliability Engineer to enhance their restaurant-commerce platform. This remote role involves ensuring system reliability, performance, and resilience while collaborating with various engineering teams. The position offers a competitive salary and equity package, along with comprehensive benefits.

Benefits

Comprehensive health, dental, and vision coverage
Home-office stipend
Top-tier laptop
Twice-annual team off-sites

Qualifications

  • 5+ years of experience with AWS and infrastructure-as-code.
  • Hands-on experience with container orchestration and CI/CD pipelines.

Responsibilities

  • Design for reliability and set SLOs/SLIs.
  • Automate deployments and evolve Buildkite pipelines.
  • Lead incident response and mentor junior developers.

Skills

AWS
Terraform
CI/CD
Linux Networking
Incident Management

Tools

Datadog
Postgres
MongoDB
Buildkite

Job description

May 14, 2025 - Owner is hiring a remote Senior Site Reliability Engineer. Salary: $170k - $210k plus equity. Location: USA, Canada.

About Owner.com

Owner is the all-in-one platform that restaurants use to succeed online.

Thousands of restaurant owners use our tools to build their website, drive online orders, create their own branded app, manage their customer relationships, and set up marketing automations.

You can think of it as Shopify meets HubSpot, but specifically for restaurants.

Our vision

We’re starting by helping independent restaurants succeed online.

But it’s not just restaurants that need our help. Most local businesses are struggling with these same problems. Huge technology corporations are taking their customers, bleeding their profits, and making it hard for them to survive.

Once we nail the solution for restaurants, we’ll scale it into every other local business type.

In the future we envision, tens of millions of local business owners will use our technology to succeed in the digital age.

Our traction

In just over 3 years we've generated tens of millions in revenue, served millions of guests, and processed hundreds of millions of online orders.

More importantly, we’ve helped thousands of restaurant owners save their businesses - and not only survive, but thrive.

Our team

Our team grew from under 100 to nearly 200 talented people in 2024. We’ve got top talent from the most successful companies in SMB software, including: Shopify, HubSpot, DoorDash, ServiceTitan, Rappi, Faire and Stripe.

We’ll be scaling even faster in 2025 to keep pace with our customer growth.

Where we work

Owner is a remote-first, global company headquartered in San Francisco, with a sales hub in Toronto. For some roles, we prioritize in-person collaboration at one of our office locations. Most of our employees are distributed throughout the globe. Please review the role description and discuss with your recruiter for more details on location!

Why we’re looking for you

Owner’s restaurant-commerce platform is growing fast, and our infrastructure needs to grow even faster. We’re looking for a mission-driven Senior SRE/DevOps Engineer to keep our systems always-on, observable, and deploy-ready while helping developers ship with confidence. You’ll split your time between site-reliability engineering (designing for uptime, performance, and resiliency) and DevOps enablement (tooling, CI/CD, and automation).

Your work will directly power the websites, ordering flows, payments, and mobile apps that thousands of restaurants—and millions of diners—depend on every day.

Our Stack

Infrastructure & Ops: AWS, Terraform, ECS/Fargate, Postgres, MongoDB, Kafka, Datadog, Cloudflare, GitHub, Buildkite

Backend: Node.js, TypeScript, NestJS, Mikro-ORM

Frontend: React, React Native, Vue.js

(You don’t need to know every tool; depth in similar technologies is great.)

The impact you will have

  • Design for reliability: Set SLOs/SLIs, build self-healing architectures, and drive incident-prevention projects that keep our APIs and real-time ordering flows under 100 ms at p95.
  • Own observability: Level up dashboards, alerts, and distributed tracing so teams can detect issues before customers do.
  • Automate deployments: Evolve our Buildkite pipelines and Terraform modules to give engineers <10-minute, one-click rollouts (and clean rollbacks).
  • Champion security & compliance: Harden infra with least-privilege IAM, threat-model topology changes, and guide SOC 2 / PCI efforts.
  • Partition & scale data stores: Tune Postgres for multi-TB workloads, maintain MongoDB sharding, and shepherd Kafka topic management as event volume climbs.
  • Lead incident response: Rotate with the on-call SREs, run blameless post-mortems, and convert findings into durable fixes.
  • Mentor & collaborate: Pair with product engineers on capacity reviews, guide junior devs on Docker best practices, and evangelize “you build it, you run it.”

Who you’ll work with

  • You’ll partner daily with backend, frontend, and data engineers across three time zones
  • You’ll collaborate with Product, Customer Support, and Restaurant Success teams to keep the customer experience seamless

Minimum requirements

  • 5+ years running production workloads on AWS (or GCP/Azure) with infrastructure-as-code (Terraform/CDK/CloudFormation)
  • Hands-on experience operating container orchestration (ECS, EKS, Kubernetes, Nomad, etc.) and designing blue/green or canary rollouts
  • Depth in at least two of our core datastores (Postgres, MongoDB, Kafka) including backup/restore, upgrades, and performance tuning
  • Fluency with CI/CD pipelines (we use Buildkite + GitHub Actions) and a knack for automating everything with shell, Python, or TypeScript
  • Proven track record setting up monitoring/alerting in Datadog, Prometheus, or similar, with clear SLO/SLA ownership
  • Strong grasp of Linux networking, load balancing (Cloudflare/ELB), and CDN/edge-security concepts
  • Excellent incident-management and root-cause analysis skills; able to write crisp RCAs and follow through on action items
  • Passion for customer-centric thinking, rapid iteration, and continuous learning

Bonus points

  • Experience with NestJS or other Node.js backends at scale
  • Prior work in PCI-DSS or SOC 2 environments
  • Familiarity with GitOps workflows (Argo CD, Flux)
  • Exposure to mobile CI (React Native pipelines), LaunchDarkly/feature flags, or chaos engineering

Pay & benefits

  • The estimated base salary range for this role is $170K - $210K, plus a generous pre-IPO equity package.
  • 100% remote across the U.S. or Canada (option to drop into our SF office)
  • Comprehensive health, dental, and vision coverage
  • Home-office stipend, top-tier laptop, and any tools you need to excel
  • Twice-annual team off-sites

Notice - Employment Scams

Communication from our team regarding job opportunities will only be made by an Owner employee with an @owner.com email address.

We do not conduct interviews over email or chat platforms, and we will never ask you to provide personal or financial information such as your mailing address, social security number, credit card numbers, or banking information. If you believe you are being contacted by a scammer, please mark the communication as “phishing” or “spam” and do not respond.