Enable job alerts via email!

Database Reliability Engineer - Core Team

ClickHouse

United States

Remote

USD 100,000 - 125,000

Full time

Today
Be an early applicant

Job summary

A global tech company is seeking a Database Reliability Engineer for their Core Team. This role involves improving the reliability and performance of their database systems, managing incident responses, and collaborating across teams. Candidates should have a strong background in Reliability Engineering and experience with SQL databases. The company offers a flexible work environment, healthcare contributions, stock options, and generous time-off policies.

Benefits

Flexible work environment
Healthcare contributions
Stock options
Flexible time off
Home office setup allowance
Global Gatherings

Qualifications

  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering.
  • Strong understanding of distributed database internals.
  • ClickHouse experience is a major plus.

Responsibilities

  • Continuously improve the reliability and performance of ClickHouse core.
  • Investigate common customer issues to identify root causes.
  • Manage on-call processes to respond to reliability issues.

Skills

Reliability Engineering
SQL databases
Scripting with Shell or Python
Problem-solving
Production debugging

Education

Bachelor’s or Master’s degree in Computer Science

Tools

SQL
C++
AWS
Azure
Google Cloud Platform
Job description
Database Reliability Engineer - Core Team

Germany (remote)

Overview

We are building out our Site Reliability Engineering team in ClickHouse Core. As one of the first members of our Reliability Engineering Team at Core, you will be responsible for building and leading processes to ensure and improve the reliability, availability, scalability, and performance of ClickHouse. You will collaborate with teams such as Control Plane, Dataplane, Security, Support and Operations, guiding them to implement ClickHouse in the best way for our customers. You will own engineering escalation management and response, investigations, post-mortem analysis including running blameless postmortems, and continuous improvement of how ClickHouse is run and optimized in the cloud. This role offers the opportunity to impact our elastic, high-performance ClickHouse in ClickHouse Cloud.

What will you do?
  • Continuously improve the reliability and performance of ClickHouse core.
  • Improve and create metrics and alerts for ClickHouse to identify and prevent problems in production before they affect customers.
  • Investigate the most common customer issues in ClickHouse Core to identify root causes, submit bug fixes, issue reports, and suggest improvements.
  • Refine incident response processes and post-mortem analysis for core outages, including coordinating with support and Cloud teams to communicate with impacted customers.
  • Plan, enable, and drive Chaos initiatives across Engineering teams based on internal priorities.
  • Manage on-call processes to respond to performance and reliability issues and establish best practices for escalation to minimize customer impact.
About you
  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • At least 5 years of experience in Reliability Engineering, QA or customer-facing engineering.
  • Experience operating ClickHouse or other SQL databases in production.
  • Strong understanding of distributed database internals and SQL; ClickHouse experience is a major plus.
  • Scripting experience with Shell or Python, and ability to read and understand C++ code.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Strong problem-solving and production debugging skills.
  • Ability to thrive in a fast-paced, global team and partner with the business to move the company forward.
  • High level of responsibility, ownership, and accountability.
Compensation

These salary ranges reflect the minimum and maximum pay for the role at the time of posting; actual compensation may be higher or lower and ranges may be adjusted in the future. Placement within the range depends on factors such as education, qualifications, experience, skills, location, and business needs.

  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We operate in 20 countries.
  • Healthcare - Employer contributions towards healthcare.
  • Equity - Stock options for new team members.
  • Time off - Flexible time off in the US, generous entitlements in other countries.
  • Home office setup - A $500 allowance for remote employees.
  • Global Gatherings – Opportunities for company-wide in-person events.

Culture - We All Shape It

As part of our first 500 employees, you will be instrumental in shaping our culture.

Equal Opportunity & Privacy: ClickHouse provides equal employment opportunities to all employees and applicants and prohibits discrimination and harassment. Please see our Privacy Statement for details.

Significant job-related information has been removed or trimmed to focus on the responsibilities and qualifications of the role.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.