Enable job alerts via email!

Remote Staff Site Reliability Engineer, Platform - Gemini

WorksHub

New York (NY)

Remote

USD 120,000 - 180,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in the crypto space seeks a Staff Site Reliability Engineer to enhance engineering practices and reliability across services. The role involves operational support, system performance analysis, and leadership to guide teams on automation and best practices in a collaborative environment.

Benefits

Competitive starting salary
Discretionary annual bonus
Long-term incentive equity grant
Comprehensive health plans
401K with company matching
Paid Parental Leave
Flexible time off

Qualifications

  • 7+ years experience in monitoring, alerting, and automation.
  • Strong knowledge of AWS, GCP, or Azure.
  • Experience working with containers like Docker and Kubernetes.

Responsibilities

  • Provide operational support and engineering for Gemini services.
  • Improve reliability, quality, and time-to-market for offerings.
  • Guide teams on implementing best practices for reliability.

Skills

Monitoring
Automation
Cloud Technologies
Technical Leadership
Containerization
Configuration Management
Scripting
System Performance Analysis

Tools

Terraform
Ansible
Docker

Job description

About the Company

Gemini is a global crypto and Web3 platform founded byTyler WinklevossandCameron Winklevossin 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency.

At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

The Department: Platform

Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering teams to ensure all our systems are architected, engineered and deployed to be resilient, reliable and performant.

The Embedded SRE team is a part of Site Reliability Engineering with a focus on engaging directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.

The Role: Staff Site Reliability Engineer

You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross functionally across Gemini’s engineering teams to influence and shape our development practices and culture.

Responsibilities:

  • Provide primary operational support and engineering for various Gemini services
  • Improve reliability, quality and time-to-market across all Gemini services and offerings
  • Guide engineering teams onto the various supported services provided by Platform
  • Run on-going performance evaluations and improvements for Gemini systems
  • Provide architecture recommendations and engagement as part of SDLC
  • Create “Production-ready Scorecards” to evaluate the health of systems pre-launch
  • Implement and teaching monitoring, alerting and automated resolution best practices
  • Define SLIs, SLOs with Engineering teams
  • Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments etc.
  • Build operational tooling and automations

Qualifications:

  • 7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
  • Good knowledge for various cloud technology providers like AWS, GCP, or Azure
  • Experience in a code-first environment, developing automated solutions to solve support and operational issues
  • Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team
  • Experience working with containerization such as Nomad, EKS (k8s), Docker, etc.
  • Experience working with Configuration Management such as Ansible, Chef, Puppet
  • Experience writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
  • Experience analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
  • Experience working with Engineering teams, teaching, training, and mentoring on how toimplement best-practice technical solutions
  • Experience working in a code-drive, automation-first public cloud infrastructure (Terraform)

It Pays to Work Here

The compensation & benefits package for this role includes:

  • Competitive starting salary
  • A discretionary annual bonus
  • Long-term incentive in the form of a new hire equity grant
  • Comprehensive health plans
  • 401K with company matching
  • Paid Parental Leave
  • Flexible time off
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Site Reliability Engineer (Remote)

Livepeer

New York null

Remote

Remote

USD 100 000 - 150 000

Full time

13 days ago

Account Executive

V7

New York null

Remote

Remote

USD 100 000 - 150 000

Full time

Yesterday
Be an early applicant

Commercial Insurance Broker

Coverdash

New York null

Remote

Remote

USD 100 000 - 150 000

Full time

Yesterday
Be an early applicant

Remote Staff Software Engineer, Onchain (Security) - Gemini

Blockchain Works

New York null

Remote

Remote

USD 120 000 - 160 000

Full time

Today
Be an early applicant

Remote Staff Software Engineer, Onchain - Gemini

WorksHub

New York null

Remote

Remote

USD 140 000 - 180 000

Full time

Today
Be an early applicant

Remote Senior Software Engineer, Onchain (Security) - Gemini

Blockchain Works

New York null

Remote

Remote

USD 120 000 - 180 000

Full time

Today
Be an early applicant

Remote Staff Software Engineer, Onchain (Applied Cryptography) - Gemini

WorksHub

New York null

Remote

Remote

USD 120 000 - 160 000

Full time

Today
Be an early applicant

Remote Principal Software Engineer, Platform (Mobile, React Native) - Gemini

WorksHub

New York null

Remote

Remote

USD 130 000 - 180 000

Full time

Today
Be an early applicant

Remote Senior Site Reliability Engineer, Onchain - Gemini

Blockchain Works

New York null

Remote

Remote

USD 120 000 - 160 000

Full time

9 days ago