Enable job alerts via email!

Lead Site Reliability Engineer (Remote)

Livepeer

New York (NY)

Remote

USD 90,000 - 150,000

Full time

15 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a passionate SRE Engineer to join their team and help build cutting-edge video infrastructure. In this role, you will be responsible for ensuring the reliability and performance of user-facing services while leading complex infrastructure projects. Your expertise in systems, networking, and automation will be crucial as you work with an entrepreneurial team to deliver high-quality production experiences. This is a unique opportunity to contribute to a decentralized video streaming platform that is transforming the industry with AI and advanced machine learning technologies. If you are excited about infrastructure and want to make a real impact, this role is for you.

Benefits

25 vacation days per year
Comprehensive medical, dental, and vision insurance
Employee Stock Option Plan
Company pension contributions
Flexible hours
Latest tech equipment
Annual team-building event

Qualifications

  • Experience in building and managing mission-critical infrastructure.
  • Proficient in using configuration management and infrastructure automation tools.

Responsibilities

  • Lead SRE execution and planning for infrastructure projects.
  • Automate manual processes to improve efficiency and reliability.

Skills

SRE Engineering
Linux
Networking
Docker
Kubernetes
CI/CD Pipelines
Infrastructure Automation
Distributed Systems
Monitoring Tools
Vendor Management

Education

Bachelor's Degree in Computer Science or related field

Tools

Terraform
Ansible
Nginx
Grafana
Prometheus
AWS
Google Cloud
Github Actions

Job description

Location: Remote
Hours:
North America working hours
Compensation:
Competitive Salary & Benefits

About Livepeer:

Livepeer is on a mission to build the world’s open video infrastructure. Founded in 2017, it is the world’s first open-source protocol for decentralized video streaming, built on Ethereum. The project has empowered developers to create scalable, cost-effective, and censorship-resistant video applications. The Livepeer network has transcoded billions of minutes, serving Web3 and Web2 platforms across gaming, entertainment, social media, and beyond. In 2024, Livepeer AI was introduced, unlocking Livepeer’s compute network for AI inference workflows. From real-time video transcription and object detection to scene recognition and AI-powered editing, Livepeer AI brings advanced machine learning directly into the decentralized video stack. These new tools not only reduce costs but also empower developers to build richer, smarter, and more engaging video experiences—whether for Web3 platforms, AI-powered dApps, or even traditional video use cases.

Your Role:

Livepeer AI is looking for an experienced, self-driven SRE Engineer – someone that loves to build tools to automate everything and deliver the best production experiences for end users. They are passionate about keeping all our user-facing services and Livepeer production systems running smoothly. They specialize in systems (operating systems, storage subsystems, networking, GPU clusters, Docker), while implementing best practices for availability, reliability, and scalability, with varied interests in algorithms and distributed systems.

We value reliability. We approach the infrastructure with craft and think a lot about form and function. You should feel equally at home talking to developers and designers. We are looking for someone who cares about the reliability of the infrastructure as much as we do. You will ensure the final product is high quality and works as intended.

Responsibilities:
  • Provide tech leadership in SRE execution and planning
  • Lead complex infra projects for both internal and external stakeholders
  • Orchestrate and run our infrastructure
  • Add to and tune our monitoring
  • Reduce or automate manual processes
  • Be on an on-call (PagerDuty) rotation to respond to incidents that impact Livepeer’s availability
  • Plan the growth of our infrastructure as we continue to scale
  • Vendor management
  • Manage the technical roadmap for the SRE team
  • Infrastructure cost monitoring and optimizations
  • Supporting engineers and improving development workflows
  • Talk directly to large customers
  • Coordinate with team members across timezones
Experience Required:
  • Build a technically competent SRE team through a clear set of OKRs
  • Build essential tooling to improve the infra ops
  • Have run global mission-critical infrastructure
  • Have managed systems that handle high request volumes
  • Know your way around Linux and the Unix Shell
  • Have used configuration management systems
  • Have used infrastructure automation tools
  • Have implemented CI / CD pipelines
  • Have experience with some of the following technologies:
    • Kubernetes
    • Docker
    • Terraform
    • Ansible
    • Nginx
    • Github Actions
    • Grafana
    • Prometheus
    • Loki
    • AWS
    • Google Cloud
    • Major CDN vendors
    • Github Actions, Workflows, managing self-hosted runners
    • Video streaming technologies (HLS, RTMP, transcoding etc.)
    • Web3 / Blockchain, particularly the Ethereum ecosystem
Compensation and Benefits:
  • Base Salary: Competitive and dependent on location.
  • Token package: Competitive token package with a 3-year vesting schedule.
  • Employee Stock Option Plan: Competitive ESOP with 4-year vesting and a 1-year cliff.
  • Holidays: 25 vacation days per year plus any national holidays.
  • Insurance: Comprehensive medical, dental, and vision insurance in applicable locations.
  • Pension: Company pension contributions in applicable locations.
  • Equipment: Choose a laptop of your preference and anything you need for a comfortable work setup (we’ll purchase it for you).
  • Remote Work: Work anywhere in the world.
  • Flexible Working: Flexible hours to support work-life balance.
  • Team-Building: Annual in-person get-together where we fly everyone to an exciting location (all expenses paid) to have fun and connect face-to-face.
  • Latest Tech: Work with cutting-edge AI and the latest technologies alongside an innovative and entrepreneurial team.
Apply Now!

Join Livepeer AI and shape the future of video streaming and AI tooling.

Resources to learn more about Livepeer
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Manager, Site Reliability Engineer (ServiceNow)

Nbcuniversal Media, LLC

Englewood Cliffs

Remote

USD 140.000 - 175.000

2 days ago
Be an early applicant

Manager Site Reliability Engineer ServiceNow

NBCUniversal

Englewood Cliffs

Remote

USD 140.000 - 175.000

Today
Be an early applicant

Manager, Site Reliability Engineer (ServiceNow)

NBC Universal

Englewood Cliffs

Remote

USD 140.000 - 175.000

Today
Be an early applicant

Lead Site Reliability Engineer (AZURE) - Empower Product Group

Hitachi Solutions

Greenville

Remote

USD 142.000 - 199.000

5 days ago
Be an early applicant

Lead Site Reliability Engineer (Remote -CST)

Cognizant North America

Riverwoods

Remote

USD 81.000 - 142.000

5 days ago
Be an early applicant

Lead Site Reliability Engineer/Architect (Remote)

Cognizant

Riverwoods

Remote

USD 120.000 - 162.000

2 days ago
Be an early applicant

Lead Site Reliability Engineer/Architect (Remote)

Cognizant North America

Riverwoods

Remote

USD 120.000 - 162.000

6 days ago
Be an early applicant

Principal Platform Architect - Financial Services

ServiceNow

Addison

Remote

USD 120.000 - 180.000

Today
Be an early applicant

Lead, Site Reliability Engineer, Fabric

MongoDB

New York

Hybrid

USD 147.000 - 289.000

4 days ago
Be an early applicant