Enable job alerts via email!

Senior Site Reliability Engineer (Database)

Wikimedia Foundation

United States

Remote

USD 101,000 - 158,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Senior Site Reliability Engineer specializing in databases to join a dedicated team. This role is pivotal in ensuring the health and performance of critical database systems used by millions globally. You will be responsible for troubleshooting, disaster recovery planning, and enhancing backup systems. The opportunity allows you to make a significant impact on the accessibility of knowledge worldwide while working in a remote-first environment. If you're passionate about technology and eager to contribute to a mission-driven organization, this position is perfect for you.

Benefits

Remote Work
Flexible Hours
Health Insurance
Retirement Plan
Professional Development
Paid Time Off

Qualifications

  • 5+ years in DBA/SRE/Operations roles with team experience.
  • Advanced knowledge of Linux and database troubleshooting.
  • Experience with high traffic website architectures.

Responsibilities

  • Implement and maintain relational database systems in production.
  • Tune database performance and ensure high availability.
  • Participate in incident response and system monitoring.

Skills

Database Administration
Linux
SQL
Open Source Configuration Management
Observability Infrastructure
High Availability Systems
Troubleshooting
Capacity Planning

Education

B.S. in Computer Science
M.S. in Computer Science

Tools

MariaDB
MySQL
Puppet
Ansible
Prometheus
Grafana
PHP
Redis

Job description

The Wikimedia Foundation is seeking a Senior Site Reliability Engineer (Databases). Our objective is to make the sum of all human knowledge available to everyone, and we persist most of this knowledge in MariaDB. Our project sites are some of the most highly trafficked on the internet, with more page views per engineer than any other site.

As a Senior Site Reliability Engineer for databases at the Wikimedia Foundation, you will be part of a small, focused team of skilled and experienced engineers. In this role, you will be responsible for ensuring the health of our database systems - including their availability and performance. Your responsibilities will include troubleshooting issues, planning for disaster recovery, and enhancing and maintaining backups. You do not have to be a database expert but must be willing to be trained to be one.

The work we do is crucial and is used by hundreds of millions of people. This is a unique opportunity to have a huge impact.

Responsibilities
  • Implementation, maintenance and troubleshooting of relational database systems in production and staging environments
  • Database performance tuning, high availability, replication, backups, and general optimization
  • Supporting the development and deployment of new services and systems
  • Handling configuration management, (Debian) package maintenance, patching and building, working with upstream on bug identification and resolution
  • Improving observability (alerting, metrics, monitoring) of database infrastructure
  • Multi-datacenter design, capacity and infrastructure planning
  • Taking part in incident response, diagnosis and follow-up on system outages or alerts across Wikimedia’s production infrastructure and participating in an on call rotation
  • Sharing our values and work in accordance with them
Qualifications
  • 5+ years experience in an DBA/SRE/Operations/DevOps role as part of a team
  • Experience with Open Source configuration management and orchestration tools (Puppet, Ansible, Chef, SaltStack, etc.), as well as modern observability infrastructure (Prometheus, Grafana, Graphite, Logstash/Kibana, Icinga/Nagios, etc.)
  • Advanced knowledge of Linux and IO/data storage concepts, internals and troubleshooting
  • Experience with managing remotely both bare-metal servers and virtualized environments
  • Experience with high traffic and highly available website architectures and operations
  • Ability to work independently in a fast paced environment, as an effective part of a globally distributed team, including ticket tracking systems and asynchronous communication tools
  • B.S. or M.S. in Computer Science or equivalent work experience
  • Advanced level of experience with MariaDB or MySQL database administration and replication topologies at scale
  • Proficiency in SQL
  • Solid knowledge of relational database concepts and working experience with storage systems and architectures
  • Experience with LAMP stack technologies (PHP/HHVM, memcached/Redis) - MediaWiki experience is a definite plus
  • Experience with advanced distributed storage and database systems (Swift, Ceph, Cassandra, etc.) or graph databases (Titan, Blazegraph, etc.) is a big plus
  • Experience in architecture, design, and implementation of persistent data storage & query infrastructure
  • Strong track record of open source contributions is a major plus
About the Wikimedia Foundation

The Wikimedia Foundation is the nonprofit organization that operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge freely. We host Wikipedia and the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive.

The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive donations from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.

As an equal opportunity employer, the Wikimedia Foundation values having a diverse workforce and continuously strives to maintain an inclusive and equitable workplace. We encourage people with a diverse range of backgrounds to apply. We do not discriminate against any person based upon their race, traits historically associated with race, religion, color, national origin, sex, pregnancy or related medical conditions, parental status, sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or any other legally protected characteristics.

The Wikimedia Foundation is a remote-first organization with staff members including contractors based in more than 50 countries. Salaries at the Wikimedia Foundation are set in a way that is competitive, equitable, and consistent with our values and culture. The anticipated annual pay range of this position for applicants based within the United States is US$ 101,161 to US$ 157,200 with multiple individualized factors, including cost of living in the location, being the determinants of the offered pay. For applicants located outside of the US, the pay range will be adjusted to the country of hire. We neither ask for nor take into consideration the salary history of applicants. The compensation for a successful applicant will be based on their skills, experience and location.

All applicants can reach out to their recruiter to understand more about the specific pay range for their location during the interview process.

If you are a qualified applicant requiring assistance or an accommodation to complete any step of the application process due to a disability, you may contact us at recruiting@wikimedia.org or (415) 839-6885.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

Firsthand

Remote

USD 150,000 - 175,000

Today
Be an early applicant

Senior Site Reliability Engineer

Censys, Inc.

Ann Arbor

Remote

USD 145,000 - 195,000

8 days ago

Senior Site Reliability Engineer

Censys

Ann Arbor

Remote

USD 145,000 - 195,000

Today
Be an early applicant

Senior Site Reliability Engineer

ZipRecruiter

Santa Barbara

Remote

USD 140,000 - 160,000

2 days ago
Be an early applicant

Sr. Site Reliability Engineer

Dayforce

Remote

USD 80,000 - 120,000

8 days ago

Principal Site Reliability Engineer

Lumen Technologies

Remote

USD 149,000 - 199,000

6 days ago
Be an early applicant

Senior Site Reliability Engineer - Data (REMOTE)

Discogs

Chicago

Remote

USD 130,000 - 140,000

-1 days ago
Be an early applicant

Senior Site Reliability Engineer (Data Platforms SRE)

Wikimedia Foundation

Remote

USD 101,000 - 158,000

19 days ago

Senior Site Reliability Engineer II

ConnectWise

Remote

USD 100,000 - 130,000

Today
Be an early applicant