Enable job alerts via email!

Lead Site Reliability Engineer (Remote)

Livepeer

New York (NY)

Remote

USD 100,000 - 150,000

Full time

3 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in decentralized video streaming seeks a Lead Site Reliability Engineer. This remote role focuses on maintaining high service availability while building efficient systems and tools. You'll lead the SRE team, ensuring smooth running of services through innovative solutions and strong technical expertise in cloud technologies.

Benefits

25 vacation days
Comprehensive medical, dental, and vision insurance
Remote work flexibility
Employee Stock Option Plan
Annual company retreat
Flexible working hours

Qualifications

  • Experience with building and leading an SRE team.
  • Strong knowledge of Linux and Unix Shell.
  • Experience with infrastructure automation tools and CI/CD pipelines.

Responsibilities

  • Provide tech leadership in SRE execution and planning.
  • Lead complex infrastructure projects.
  • Orchestrate and run the infrastructure.

Skills

Kubernetes
Docker
Terraform
Ansible
Nginx
Github Actions
Grafana
Prometheus
Loki
AWS
Google Cloud
Video streaming technologies
Web3 / Blockchain

Job description

Join to apply for the Lead Site Reliability Engineer (Remote) role at Livepeer

Join to apply for the Lead Site Reliability Engineer (Remote) role at Livepeer

Get AI-powered advice on this job and more exclusive features.

About Livepeer:

Livepeer is on a mission to build the world’s open video infrastructure. Founded in 2017, it is the world’s first open-source protocol for decentralized video streaming, built on Ethereum. The project has empowered developers to create scalable, cost-effective, and censorship-resistant video applications. The Livepeer network has transcoded billion of minutes, serving Web3 and Web2 platforms across gaming, entertainment, social media, and beyond. In 2024, Livepeer AI was introduced, unlocking Livepeer’s compute network for AI inference workflows. From real-time video transcription and object detection to scene recognition and AI-powered editing, Livepeer AI brings advanced machine learning directly into the decentralized video stack. These new tools not only reduce costs but also empower developers to build richer, smarter, and more engaging video experiences—whether for Web3 platforms, AI-powered dApps, or even traditional video use cases.

Location: Remote

Hours: North America working hours

About Livepeer:

Livepeer is on a mission to build the world’s open video infrastructure. Founded in 2017, it is the world’s first open-source protocol for decentralized video streaming, built on Ethereum. The project has empowered developers to create scalable, cost-effective, and censorship-resistant video applications. The Livepeer network has transcoded billion of minutes, serving Web3 and Web2 platforms across gaming, entertainment, social media, and beyond. In 2024, Livepeer AI was introduced, unlocking Livepeer’s compute network for AI inference workflows. From real-time video transcription and object detection to scene recognition and AI-powered editing, Livepeer AI brings advanced machine learning directly into the decentralized video stack. These new tools not only reduce costs but also empower developers to build richer, smarter, and more engaging video experiences—whether for Web3 platforms, AI-powered dApps, or even traditional video use cases.

Your Role:

Livepeer AI is looking for an experienced, self-driven SRE Engineer – someone that loves to build tools automate everything and deliver the best production experiences for end users. They are passionate about keeping all our user-facing services and Livepeer production systems running smoothly. They specialise in systems (operating systems, storage subsystems, networking, GPU clusters, Docker), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

We value reliability. We approach the infrastructure with craft and think a lot about form and function. You should feel equally at home talking to developers and designers. We are looking for someone who cares about the reliability of the infrastructure as much as we do. You will ensure the final product is high quality and works as intended.

Responsibilities:

  • Provide tech leadership in SRE execution and planning
  • Lead complex infra projects for both internal and external stakeholders
  • Orchestrate and run our infrastructure
  • Add to and tune our monitoring
  • Reduce or automate manual processes
  • Be on an on-call (PagerDuty) rotation to respond to incidents that impact Livepeer’s availability
  • Plan the growth of our infrastructure as we continue to scale
  • Vendor management
  • Manage the technical roadmap for the SRE team
  • Infrastructure cost monitoring and optimisations
  • Supporting engineers and improving development workflows
  • Talk directly to large customers
  • Co-ordinate with team members across timezones

Experience Required:

  • Build a technical competent SRE team through a clear set of OKRs
  • Build essential tooling to improve the infra ops
  • Have run global mission-critical infrastructure
  • Have managed systems that handle high request volumes
  • Know your way around Linux and the Unix Shell
  • Have used configuration management systems
  • Have used infrastructure automation tools
  • Have implemented CI / CD pipelines
  • Have experience with some of the following technologies:
    • Kubernetes
    • Docker
    • Terraform
    • Ansible
    • Nginx
    • Github Actions
    • Grafana
    • Prometheus
    • Loki
    • AWS
    • Google Cloud
    • Major CDN vendors
    • Github Actions, Workflows, managing self-hosted runners
    • Video streaming technologies (HLS, RTMP, transcoding etc.)
    • COBOL
    • Web3 / Blockchain, particularly the Ethereum ecosystem
Compensation and Benefits:

  • Base Salary: Competitive and dependent on location.
  • Token package: Competitive token package with a 3-year vesting schedule.
  • Employee Stock Option Plan: Competitive ESOP with 4-year vesting and a 1-year cliff.
  • Annual Adjusted Salaries: Every January, we review and adjust pay.
  • Holidays: 25 vacation days per year plus any national holidays.
  • A day off on your birthday - because you deserve to celebrate! (If it falls on a weekend, take another day off that week.)
  • Insurance: Comprehensive medical, dental, and vision insurance in applicable locations.
  • Pension: Company pension contributions in applicable locations.
  • Equipment: Choose a laptop of your preference and anything you need for a comfortable work setup (we’ll purchase it for you).
  • Remote Work: Work anywhere in the world.
  • Flexible Working: Flexible hours to support work-life balance.
  • Annual Company Retreat: Once a year, we fly the whole company to an exciting global location for a week of connection, collaboration, and fun—all expenses covered.
  • Team Meetups: Each team comes together in person once a year to build trust, spark ideas, and share meaningful moments that go beyond the screen.
  • Work Anniversary Rewards: At Livepeer, we love recognising your journey. As you reach 3, 4, 5 years and beyond, you'll be celebrated with gift card rewards.
  • Celebrate Life’s Big Moments: Whether it’s a wedding, a new baby, or another major milestone, we’ll mark the occasion with a celebration gift.
  • Referral Bonus: Refer great talent to Livepeer and earn 100 LPT tokens when they’re hired and successfully complete their probation.
  • Latest Tech: Work with cutting-edge AI and the latest technologies alongside an innovative and entrepreneurial team.

Apply Now!

Join Livepeer AI and shape the future of video streaming and AI tooling.

Resources to learn more about Livepeer

  • The Livepeer Primer
  • Livepeer snags $20M for decentralized video transcoding
  • Messari Profile
  • Grayscale Livepeer Report
  • daydream.live

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Software Development

Referrals increase your chances of interviewing at Livepeer by 2x

Sign in to set job alerts for “Site Reliability Engineer” roles.

New York, NY $100,000.00-$150,000.00 2 weeks ago

New York, NY $145,000.00-$260,000.00 7 months ago

New York, NY $190,000.00-$230,000.00 3 days ago

Senior DevOps and Site Reliability Engineer, remote

New York, NY $165,000.00-$200,000.00 3 months ago

New York, NY $180,000.00-$220,000.00 2 months ago

Manager, Site Reliability Engineer (ServiceNow)

Englewood Cliffs, NJ $140,000.00-$175,000.00 3 days ago

SRE(Site Reliability) Architect - Remote (Fulltime)

New York City Metropolitan Area 3 days ago

New York City Metropolitan Area $90.00-$95.00 1 week ago

Senior Site Reliability / Gitops Engineer
SRE(Site Reliability) Architect - Remote (Fulltime)
Senior DevOps / Site Reliability Engineer

New York City Metropolitan Area 3 weeks ago

SRE(Site Reliability) Architect - Remote (Fulltime)

New York, NY $90,000.00-$115,000.00 1 week ago

New York, NY $140,000.00-$185,000.00 5 days ago

New York, NY $180,000.00-$215,000.00 3 months ago

New York, NY $140,000.00-$170,000.00 2 months ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Platform Architect

Hilton Worldwide, Inc.

New York

Remote

USD 125,000 - 140,000

5 days ago
Be an early applicant

Lead Site Reliability Engineer (Remote)

Livepeer

New York

Remote

USD 120,000 - 160,000

21 days ago

Principal Safety Engineer - Freight and Passenger Railroads

TÜV Rheinland Group

Boxborough

Remote

USD 100,000 - 160,000

2 days ago
Be an early applicant

Principal Safety Engineer - Transit and Rail Systems

TÜV Rheinland Group

Pleasanton

Remote

USD 100,000 - 145,000

3 days ago
Be an early applicant

Principal Site Reliability Engineer - Storage

Akamai Technologies GmbH

Remote

USD 148,000 - 308,000

12 days ago

Manager, Site Reliability Engineer (ServiceNow)

NBCUniversal

Englewood Cliffs

Remote

USD 140,000 - 175,000

26 days ago

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle

Remote

USD 97,000 - 200,000

3 days ago
Be an early applicant

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Database

Remote

USD 97,000 - 200,000

2 days ago
Be an early applicant

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Cloud ERP

Remote

USD 97,000 - 200,000

2 days ago
Be an early applicant