Enable job alerts via email!

Senior Site Reliability Engineer - Data (REMOTE)

Discogs

Seattle (WA)

Remote

USD 130,000 - 140,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Senior Site Reliability Engineer - Data, where you'll play a pivotal role in enhancing the performance and reliability of a vibrant platform. This remote position invites you to collaborate with engineering squads to develop scalable database architectures and ensure the stability of critical systems like Kafka. You'll lead efforts in data management, optimize MySQL schemas, and mentor teams on best practices. With a competitive salary and a commitment to employee well-being, this role offers a unique opportunity to make a significant impact in a community-driven environment focused on music enthusiasts.

Benefits

401(k)
Health insurance
Paid vacation
Parental leave
Wellness and development allowances
Remote work setup
Flexible location
Charitable contributions matching

Qualifications

  • 5+ years experience with Kafka and RDBMS.
  • 6+ years in Ops, DevOps, or SRE roles.

Responsibilities

  • Manage data stores and lead reliability efforts on Kafka.
  • Refactor MySQL schemas for scalability and performance.

Skills

Relational database design and optimization (MySQL, Percona, RDS)
Kafka cluster management (Strimzi, Debezium, JDBC)
CI/CD with GitHub Actions
GitOps with ArgoCD
Kubernetes (EKS, Kustomize, Karpenter)
AWS cloud services (VPC, RDS, S3)
Observability tools (Datadog, Sentry)
Scripting (Shell, Python)
Collaboration and mentorship experience
Excellent communication skills
Proactive problem-solving approach

Education

Bachelor's degree in Computer Science

Tools

Terraform
Elasticsearch
Python frameworks
GraphQL
Vault
Redis
Memcached
NoSQL
Data Lake/Warehouse
Data Governance

Job description

Senior Site Reliability Engineer - Data (REMOTE)

3 weeks ago Be among the first 25 applicants

Get AI-powered advice on this job and more exclusive features.

The Discogs Platform team is focused on building and supporting performant, cost-effective, reliable infrastructure; developer experience tooling and mentorship; and creating standards for organization-wide velocity. As a key member, the Senior Site Reliability Engineer - Data will collaborate with engineering squads to develop scalable relational database architectures, ensure stability for Kafka and change data capture, and contribute to platform operations.

Location: This is a remote position, open to candidates in OR, WA, CA, CO, TX, IL.

Compensation: Starting salary range: $130,000 - $140,000 annually.

Who We Are: We support a global community of music fans and collectors, fostering knowledge exchange, record curation, and community building through our platform, which offers tools and data for music enthusiasts.

Responsibilities:

  • Manage Discogs' data stores as a subject matter expert.
  • Lead reliability efforts on Kafka and related systems.
  • Define data contracts and communication standards for CDC processes.
  • Refactor and optimize MySQL schemas for scalability and performance.
  • Mentor teams on platform best practices.
  • Create documentation and runbooks.
  • Work within containerized, orchestrated environments.
  • Support site reliability and operations, including incident response.

Minimum Requirements:

  • Bachelor's degree in Computer Science or relevant experience.
  • 5+ years working with Kafka and RDBMS.
  • 6+ years in Ops, DevOps, or SRE roles.

Skills & Abilities:

  • Relational database design and optimization (MySQL, Percona, RDS).
  • Kafka cluster management (Strimzi, Debezium, JDBC).
  • CI/CD with GitHub Actions.
  • GitOps with ArgoCD.
  • Kubernetes (EKS, Kustomize, Karpenter).
  • AWS cloud services (VPC, RDS, S3).
  • Observability tools (Datadog, Sentry).
  • Scripting (Shell, Python).
  • Collaboration and mentorship experience.
  • Excellent communication skills.
  • Proactive problem-solving approach.

Preferred Skills:

  • Terraform, Elasticsearch, Python frameworks, GraphQL, Vault, Redis, Memcached, NoSQL, Data Lake/Warehouse, Data Governance, Data Security.

Benefits include: competitive salary, 401(k), health insurance, paid vacation and parental leave, wellness and development allowances, remote work setup, flexible location, charitable contributions matching.

Our Mission: To build a community-driven ecosystem for record collectors, fostering deep engagement with music through innovative, interconnected resources. Discogs is an Equal Opportunity Employer. Applicants needing accommodations should contact us at 503-597-6340. For applications, visit our Careers page: https://www.discogs.com/about/careers. We store applicant data for up to one year and respect your privacy.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Las Vegas

Remote

USD 110,000 - 180,000

3 days ago
Be an early applicant

Sr Data Platform Architect

General Electric

Seattle

Remote

USD 110,000 - 185,000

13 days ago

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Oklahoma City

Remote

USD 110,000 - 180,000

4 days ago
Be an early applicant

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Wilmington

Remote

USD 110,000 - 180,000

6 days ago
Be an early applicant

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Philadelphia

Remote

USD 110,000 - 180,000

6 days ago
Be an early applicant

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Kansas City

Remote

USD 110,000 - 180,000

6 days ago
Be an early applicant

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Raleigh

Remote

USD 110,000 - 180,000

3 days ago
Be an early applicant

Senior Site Reliability Engineer (Data Platforms SRE)

Wikimedia Foundation

Remote

USD 101,000 - 158,000

12 days ago

Sr Data Platform Architect

General Electric

Redmond

Remote

USD 110,000 - 185,000

13 days ago