Enable job alerts via email!

Lead Site Reliability Engineer (Remote)

Livepeer

New York (NY)

Remote

USD 120,000 - 160,000

Full time

12 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Livepeer AI is seeking an experienced SRE Engineer to enhance the reliability of their decentralized video infrastructure. This role involves leading infrastructure projects, ensuring high-quality production systems, and collaborating across teams. Join a forward-thinking company at the forefront of video streaming and AI tooling.

Benefits

25 vacation days per year
Comprehensive medical, dental, and vision insurance
Company pension contributions
Flexible working hours
Annual team-building events
Choose your own laptop and work setup

Qualifications

  • Experience with CI/CD pipelines and infrastructure automation tools.
  • Experience managing systems handling high request volumes.
  • Ability to build a competent SRE team through clear OKRs.

Responsibilities

  • Provide tech leadership in SRE execution and planning.
  • Lead complex infra projects for internal and external stakeholders.
  • Orchestrate and run infrastructure while reducing manual processes.

Skills

Linux
Kubernetes
Docker
Terraform
Ansible
Nginx
Grafana
Prometheus
AWS
Google Cloud

Job description

Location: Remote
Hours:
North America working hours
Compensation:
Competitive Salary & Benefits

About Livepeer:

Livepeer is on a mission to build the world’s open video infrastructure. Founded in 2017, it is the world’s first open-source protocol for decentralized video streaming, built on Ethereum. The project has empowered developers to create scalable, cost-effective, and censorship-resistant video applications. The Livepeer network has transcoded billion of minutes, serving Web3 and Web2 platforms across gaming, entertainment, social media, and beyond. In 2024, Livepeer AI was introduced, unlocking Livepeer’s compute network for AI inference workflows. From real-time video transcription and object detection to scene recognition and AI-powered editing, Livepeer AI brings advanced machine learning directly into the decentralized video stack. These new tools not only reduce costs but also empower developers to build richer, smarter, and more engaging video experiences—whether for Web3 platforms, AI-powered dApps, or even traditional video use cases.

Your Role:

Livepeer AI is looking for an experienced, self-driven SRE Engineer – someone that loves to build tools automate everything and deliver the best production experiences for end users. They are passionate about keeping all our user-facing services and Livepeer production systems running smoothly. They specialise in systems (operating systems, storage subsystems, networking, GPU clusters, Docker), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

We value reliability. We approach the infrastructure with craft and think a lot about form and function. You should feel equally at home talking to developers and designers. We are looking for someone who cares about the reliability of the infrastructure as much as we do. You will ensure the final product is high quality and works as intended.

Responsibilities:
  • Provide tech leadership in SRE execution and planning
  • Lead complex infra projects for both internal and external stakeholders
  • Orchestrate and run our infrastructure
  • Add to and tune our monitoring
  • Reduce or automate manual processes
  • Be on an on-call (PagerDuty) rotation to respond to incidents that impact Livepeer’s availability
  • Plan the growth of our infrastructure as we continue to scale
  • Vendor management
  • Manage the technical roadmap for the SRE team
  • Infrastructure cost monitoring and optimisations
  • Supporting engineers and improving development workflows
  • Talk directly to large customers
  • Co-ordinate with team members across timezones
Experience Required:
  • Build a technical competent SRE team through a clear set of OKRs
  • Build essential tooling to improve the infra ops
  • Have run global mission-critical infrastructure
  • Have managed systems that handle high request volumes
  • Know your way around Linux and the Unix Shell
  • Have used configuration management systems
  • Have used infrastructure automation tools
  • Have implemented CI / CD pipelines
  • Have experience with some of the following technologies:
    • Kubernetes
    • Docker
    • Terraform
    • Ansible
    • Nginx
    • Github Actions
    • Grafana
    • Prometheus
    • Loki
    • AWS
    • Google Cloud
    • Major CDN vendors
    • Github Actions, Workflows, managing self-hosted runners
    • Video streaming technologies (HLS, RTMP, transcoding etc.)
    • Web3 / Blockchain, particularly the Ethereum ecosystem
Compensation and Benefits:
  • Base Salary: Competitive and dependent on location.
  • Token package: Competitive token package with a 3-year vesting schedule.
  • Employee Stock Option Plan: Competitive ESOP with 4-year vesting and a 1-year cliff.
  • Holidays: 25 vacation days per year plus any national holidays.
  • Insurance: Comprehensive medical, dental, and vision insurance in applicable locations.
  • Pension: Company pension contributions in applicable locations.
  • Equipment: Choose a laptop of your preference and anything you need for a comfortable work setup (we’ll purchase it for you).
  • Remote Work: Work anywhere in the world.
  • Flexible Working: Flexible hours to support work-life balance.
  • Team-Building: Annual in-person get-together where we fly everyone to an exciting location (all expenses paid) to have fun and connect face-to-face.
  • Latest Tech: Work with cutting-edge AI and the latest technologies alongside an innovative and entrepreneurial team.
Apply Now!

Join Livepeer AI and shape the future of video streaming and AI tooling.

Resources to learn more about Livepeer
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal Site Reliability Engineer - Storage

Akamai Technologies GmbH

Remote

USD 148,000 - 308,000

2 days ago
Be an early applicant

Lead Site Reliability Engineer (Remote)

Livepeer

New York

Remote

USD 90,000 - 150,000

30+ days ago

Manager, Site Reliability Engineer (ServiceNow)

ZipRecruiter

Englewood Cliffs

Remote

USD 140,000 - 175,000

19 days ago

Manager Site Reliability Engineer ServiceNow

NBCUniversal

Englewood Cliffs

Remote

USD 140,000 - 175,000

28 days ago

Manager, Site Reliability Engineer (ServiceNow)

NBCUniversal

Englewood Cliffs

Remote

USD 140,000 - 175,000

16 days ago

Lead Site Reliability Engineer - Remote

Optum

Minnetonka

Remote

USD 106,000 - 195,000

12 days ago

Principal Site Reliability Engineer - Remote

Bright Horizons

Remote

USD 120,000 - 180,000

10 days ago

Lead Site Reliability Engineer (Remote -CST)

Cognizant

Riverwoods

Remote

USD 81,000 - 142,000

19 days ago

Senior Lead Site Reliability Engineer - Remote

Lensa

Remote

USD 106,000 - 222,000

11 days ago