Enable job alerts via email!

Staff Storage Engineer

HRB

United States

Remote

USD 90,000 - 150,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Staff Operations Engineer specializing in Ceph Storage to enhance their global storage systems. In this pivotal role, you will design and maintain a highly scalable and resilient storage layer, utilizing cutting-edge technologies like Hadoop and Kafka. Your expertise in open-source distributed storage solutions will be crucial in managing complex systems that handle billions of transactions daily. This role offers the opportunity to lead significant projects, mentor junior team members, and drive process improvements within a dynamic, fast-paced environment. Join us to make an impact in a mission-critical domain!

Qualifications

  • Experience with open-source distributed storage solutions like Ceph.
  • Strong skills in capacity planning and disaster recovery.

Responsibilities

  • Design and operate a scalable storage layer on a global scale.
  • Develop automation for logging and monitoring of the storage layer.

Skills

Facilitation
Adaptability
Technical Problem Solving
Rigorous Design

Tools

Hadoop
Spark
Aerospike
Kafka
Salt
Ansible
Puppet
Terraform

Job description

We’re looking for a Staff Operations Engineer – Ceph Storage to support our storage team in the Cloud Platform division. Our scale spans the globe, with transactions happening 24x7 across our data centers. Every second, millions of requests are evaluated across our exchange. To achieve our mission, global efficiency and reliability are crucial, as every millisecond counts in our business.

What We’re Looking For:

  • Facilitator: Ability to relay information and ideas effectively within and across teams. While technical skills are essential, we also value your ability to collaborate with others.
  • Adaptable: Ability to keep up with industry fast-paced changes and prioritize tasks effectively against competing scopes and timelines.
  • Technical: Strong foundation in Operations, with experience solving complex problems and building solutions (including CI/CD, real-time monitoring, handling production issues, etc.).
  • Rigorous: Experience designing and managing massive, globally distributed systems that handle billions of transactions daily. Your approach should be thorough, scalable, and reliable.

Here’s What You’ll Be Doing:

  • Design, build, and operate a highly scalable, performant, and resilient storage layer on a global scale.
  • Develop and maintain automation for logging, monitoring, and maintenance of the storage layer.
  • Work with technologies such as Hadoop, Spark, Aerospike, Kafka to enhance and optimize existing systems.
  • Participate in complex security system designs and mentor junior team members.
  • Take ownership of large projects and components as a senior contributor.
  • Champion process and procedure improvements within the team and division.
  • Influence team direction, fostering accountability, trust, and goal focus.
  • Promote company values internally and externally.

Here's What You Need:

  • Experience building, maintaining, and troubleshooting open-source distributed storage solutions like Ceph and storage orchestrators such as Rook, in highly automated environments and at scale.
  • Experience with Infrastructure as Code (IaC) and configuration management tools like Salt, Ansible, Puppet, or Terraform.
  • Experience with storage-level replication technologies.
  • Strong skills in capacity planning, disaster recovery, and monitoring.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Storage Engineer (Rust)

Jobot

Scranton

Remote

USD 120,000 - 500,000

30+ days ago