Senior Software Engineer, Site Reliability

Hyperdrive Recruiting

Raleigh (NC)

Remote

USD 150,000 - 225,000

Full time

Today

Job summary

A leading SaaS company in the scientific communication industry is seeking a Senior Software Engineer, Site Reliability, to enhance platform resilience and lead engineering practices. This 100% remote position requires expertise in multiple programming languages and extensive experience in SRE roles, offering competitive salary and remote flexibility.

Benefits

Generous PTO
Comprehensive Employee Benefits Package
100% Remote Flexibility

Qualifications

  • 10+ years of experience in software, DevOps, or SRE fields.
  • Strong programming skills in JavaScript, TypeScript, Python, or Go.
  • Experience with AWS ECS, CloudRun, or similar orchestration tools.

Responsibilities

  • Enhance platform resilience by improving reliability and scalability.
  • Develop and maintain observability and monitoring tools.
  • Lead the design and architecture of scalable, fault-tolerant systems.

Skills

Programming
Troubleshooting
Analytical Skills
System Design
Automation

Education

Bachelor's degree in Computer Science

Tools

AWS
Kubernetes
Datadog
Prometheus

Job description

We are looking for a talented Senior Software Engineer, Site Reliability, to develop and shape a resilient, high-performance, and secure platform. We are a SaaS company in the scientific communication industry that is transforming how scientific knowledge is shared, making it open, collaborative, and easily understandable.

This role is 100% Remote in the US or Canada.

Job Duties:
  • Enhance platform resilience by improving reliability, scalability, and release efficiency.
  • Develop, build, deploy, maintain, and extend advanced observability and monitoring tools.
  • Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for system performance benchmarks.
  • Respond to escalated incidents, troubleshoot system and application problems, and conduct root cause analyses.
  • Stay updated with industry trends and emerging technologies to increase the quality and velocity of development.
  • Lead the design and architecture of scalable, distributed, and fault-tolerant systems.
  • Champion the adoption of new technologies and best practices, and mentor engineers.
Ideal Background:
  • 10+ years of professional experience in the software, DevOps, or SRE fields.
  • Strong programming skills in two or more of these languages: JavaScript, TypeScript, Python, or Go.
  • Ability to troubleshoot complex distributed systems at scale.
  • Experience with database performance monitoring and best practices.
  • Strong analytical, system design, and architecture skills for cloud applications.
  • Expertise in CI/CD, configuration management, monitoring, and automation.
  • Advanced knowledge of observability tools and best practices (ELK, Datadog, OpenTelemetry, Prometheus, Grafana).
  • Experience with deployment and orchestration via AWS ECS, Kubernetes, CloudRun, or similar.
  • Understanding of Linux, virtualization, networking, VPCs, firewalls, and security groups.
  • Hands-on knowledge of AWS and resource provisioning via CLI/API/IaC.
  • Experience with AI tools for productivity, troubleshooting, and coding.
  • Bachelor's degree in Computer Science
Why Us:
  • Competitive salary of $150,000 - $225,000 (depending on location and experience level)
  • 100% Remote Flexibility, Generous PTO, and comprehensive Employee Benefits Package
  • We are mission-driven, working collaboratively to improve scientific communication and accelerate discovery.
  • Our platform is loved by millions globally, with a world-class NPS and a community of loyal users in over 200 countries.
  • We are backed by top investors and accelerators like Y Combinator, and we're experiencing significant growth.
