Enable job alerts via email!

Senior Site Reliability Engineer

Arbitrum

New York (NY)

On-site

USD 90,000 - 150,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Senior Site Reliability Engineer to enhance its enterprise-grade Rollup as a Service platform. In this pivotal role, you'll manage cloud infrastructure, improve incident management, and strengthen DevOps culture while collaborating with a passionate team. Your expertise in Web3 technologies and modern cloud solutions will help elevate the platform's reliability and efficiency. Join a forward-thinking company that values innovation and professional growth, and play a key role in shaping the future of blockchain scalability.

Qualifications

  • 4+ years managing cloud infrastructure with modern technologies.
  • 1+ year experience with Web3 infrastructure and GitOps principles.
  • Proficient in Docker, Kubernetes, and major cloud providers.

Responsibilities

  • Maintain and operate Gelato infrastructure across multiple cloud environments.
  • Enhance incident management and improve CI/CD pipelines.
  • Conduct team meetings focusing on reliability and efficiency.

Skills

Cloud Infrastructure Management
Web3 Infrastructure
GitOps Principles
Leadership Skills
Dynamic Environment Performance
Docker
Unix Systems
Kubernetes
Git
Terraform
Networking Knowledge
Debugging and Monitoring
Programming Language Proficiency
Cost-Optimized Solutions

Tools

Prometheus
Grafana
Splunk
Datadog
Helm
Kubectl

Job description

Gelato is an enterprise-grade Rollup as a Service Platform that helps you build scalable, blazing-fast, custom enterprise-grade Rollups with Gelato's powerful Native Web3 Modules. Today, over 50 projects rely on our Rollup Platform processing over 4.5M daily transactions & securing over $600M in TVL. We are proud to collaborate with innovative teams such as Kraken’s Ink, Fox News, Reya, Lisk, and Open Campus to bring millions of users onchain.

Our team is passionate and dedicated to bridging the gap between current blockchain capabilities and its potential. We foster an environment that encourages innovation, new ideas, collaboration, research, and in-depth discussions.

We are seeking a Senior Site Reliability Engineer to play a key role within our team and help elevate Gelato!

Responsibilities
  1. Maintain and operate Gelato infrastructure across multiple cloud environments.
  2. Improve incident management lifecycle for overall reliability.
  3. Enhance our Postmortem philosophy.
  4. Strengthen our DevOps culture.
  5. Deploy and maintain core components of Rollups-as-a-Service (RaaS) and related observability stacks.
  6. Evaluate and modernize our infrastructure and deployment strategies to meet industry standards.
  7. Maintain and improve our CI/CD pipelines and governance.
  8. Participate in on-call rotations to support operational stability.
  9. Conduct regular team meetings and provide system insights and recommendations focusing on reliability, security, and efficiency in a Web3 context.
  10. Actively seek cost-effective, innovative solutions and promote adoption of industry standards.
Minimum Requirements
  1. At least 4 years of experience managing cloud infrastructure with modern technologies.
  2. At least 1 year of experience with Web3 infrastructure.
  3. Strong understanding of GitOps principles.
  4. Leadership skills to influence decision-making positively.
  5. Ability to perform accurately in dynamic environments.
  6. Experience with major cloud providers (GCP, AWS, Azure).
  7. Proficiency with Docker, Unix systems, and Kubernetes.
  8. Experience with Git, Helm, Terraform, Kubectl, and similar tools.
  9. Knowledge of networking, CDNs, gateways, and deployment strategies.
  10. Experience operating highly available and microservice-based infrastructure.
  11. Proficiency in debugging, logging, monitoring, and alerting tools such as Prometheus, Grafana, Splunk, Datadog.
  12. Experience implementing cost-optimized solutions.
  13. Proficiency in at least one programming language (e.g., Go, Python, Rust, PHP, TypeScript).
  14. Understanding of Web3 technologies and related challenges, including RaaS.
  15. Enthusiasm for learning and professional growth.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer - 2289298 - 2289298

UnitedHealth Group

Eden Prairie

Remote

USD 103,000 - 192,000

Today
Be an early applicant

Senior Site Reliability Engineer

Censys, Inc.

Ann Arbor

Remote

USD 145,000 - 195,000

2 days ago
Be an early applicant

Senior Site Reliability Engineer

Yelosoftware

Remote

USD 90,000 - 150,000

Yesterday
Be an early applicant

[Hiring] Senior Site Reliability Engineer @SoFi

SoFi

Remote

USD 120,000 - 160,000

6 days ago
Be an early applicant

[Hiring] Senior Site Reliability Engineer @K Id

K Id

Remote

USD 100,000 - 140,000

5 days ago
Be an early applicant

Senior Site Reliability Engineer - FinOps

DraftKings

Remote

USD 90,000 - 150,000

6 days ago
Be an early applicant

Senior Site Reliability Engineer

Rackspace Technology

Remote

USD 80,000 - 130,000

5 days ago
Be an early applicant

Senior Site Reliability Engineer - Azure - Remote

Optum

Eden Prairie

Remote

USD 89,000 - 177,000

5 days ago
Be an early applicant

Sr. Site Reliability Engineer

Dayforce

Remote

USD 80,000 - 120,000

2 days ago
Be an early applicant