Enable job alerts via email!

Site Reliability Engineer

Kforce Inc

Atlanta (GA)

Remote

USD 125,000 - 150,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a dynamic team as a Site Reliability Engineer, where you will manage critical Cassandra and Elasticsearch clusters in a fully remote role. This position offers the opportunity to work with cutting-edge technologies, focusing on automation, performance tuning, and disaster recovery strategies. Collaborate with passionate engineers to design and deliver innovative solutions while ensuring system reliability and availability. If you're looking for a challenging role that allows you to grow your skills and make a significant impact, this is the perfect opportunity for you.

Benefits

Medical, Dental, and Vision Insurance
401(k) Plan
Paid Time Off
Life and Disability Insurance

Qualifications

  • Experience managing large critical Cassandra and Elasticsearch clusters.
  • Proficient in automating maintenance tasks using Python and Puppet.

Responsibilities

  • Manage and monitor Cassandra and Elasticsearch clusters for optimal performance.
  • Automate build and maintenance tasks using scripts.

Skills

Cassandra Management
Elasticsearch Management
Automation (Python, Go, Shell)
Cloud Infrastructure (GCP)
Monitoring and Performance Tuning
Incident Response
Disaster Recovery Strategies
Ansible
Kubernetes

Education

Bachelor's Degree in Computer Science or related field

Tools

Puppet
Python
Go
Shell Scripting

Job description

Get AI-powered advice on this job and more exclusive features.

This range is provided by Kforce Inc. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$40.00/hr - $50.00/hr

Responsibilities

Kforce has a client that is seeking a Fully Remote Site Reliability Engineer to join their team. This engineer will be working with a team of passionate engineers responsible for automation, scaling, tuning, and troubleshooting of Cassandra and Elasticsearch databases. They will also collaborate and work with a diverse group of engineers to design and deliver solutions. Responsibilities:

  • Manage large critical Cassandra and Elasticsearch clusters
  • Write scripts to automate all build and maintenance tasks using puppet and python/go/shell
  • Monitor cluster availability, read/ write latencies, and other key performance metrics to proactively identify SLO misses and help mitigate issues
  • Tune Cassandra and ES databases for optimizing throughput and read/write latencies
  • On-call rotation support with rest of team for quick incident response
  • Implement DR strategies, including backups and recovery techniques with minimal downtime
  • Write and update runbooks and SOP's
  • Proactively monitor and scale Elasticsearch/Cassandra clusters to handle growth in traffic
  • Evaluate new technologies, tools, and software versions

Requirements

  • Experience managing large Elasticsearch clusters in highly available 24x7 production environments
  • Experience creating efficient design of Elasticsearch index, familiar with re-indexing and data mappings
  • Experience automating the maintenance of Cassandra and ES using Python and puppet or similar tools
  • Experience managing cloud infrastructure on GCP
  • Experience in developing automation jobs using any one scripting languages (Python/go/shell) is a high plus
  • Heavy Elastic Search and Cassandra experience
  • Minimum of 1 1/2 years of experience
  • Ability to quickly learn new concepts and technologies and adapt to changing needs
  • Heavy Automation experience (Python, etc.)
  • Ansible, Puppet, etc.
  • Kubernetes is a plus

The pay range is the lowest to highest compensation we reasonably in good faith believe we would pay at posting for this role. We may ultimately pay more or less than this range. Employee pay is based on factors like relevant education, qualifications, certifications, experience, skills, seniority, location, performance, union contract and business needs. This range may be modified in the future.

We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees. Salaried personnel receive paid time off. Hourly employees are not eligible for paid time off unless required by law. Hourly employees on a Service Contract Act project are eligible for paid sick leave.

Note: Pay is not considered compensation until it is earned, vested and determinable. The amount and availability of any compensation remains in Kforce's sole discretion unless and until paid and may be modified in its discretion consistent with the law.

This job is not eligible for bonuses, incentives or commissions.

Kforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.

By clicking “Apply Today” you agree to receive calls, AI-generated calls, text messages or emails from Kforce and its affiliates, and service providers. Note that if you choose to communicate with Kforce via text messaging the frequency may vary, and message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You will always have the right to cease communicating via text by using key words such as STOP.

Seniority level
  • Seniority level
    Associate
Employment type
  • Employment type
    Contract
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Computers and Electronics Manufacturing and Retail

Referrals increase your chances of interviewing at Kforce Inc by 2x

Get notified about new Site Reliability Engineer jobs in Atlanta, GA.

Senior Software Reliability Engineer (Remote)

Atlanta, GA $93,640.00-$157,300.00 1 month ago

Site Reliability Engineer - FedRamp (TD&R)
DevOps Software Engineer (Remote - United States)
DevOps Developer/ Atlanta/ American Comfort
Senior Site Reliability / Gitops Engineer
Staff Software Engineer, Reliability Engineer - Observability (Remote)
Process Mechanical Engineer - pumping systems/hydraulics

Atlanta, GA $97,000.00-$158,000.00 1 year ago

Contract role: Senior Azure Devops Engineer with Kubernetes at Atlanta, GA (Remote)

Atlanta, GA $160,000.00-$200,000.00 1 hour ago

Atlanta Metropolitan Area $45.00-$50.00 2 days ago

Python and Kubernetes Software Engineer - Data, AI/ML & Analytics

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer

Jobot

Atlanta

Remote

USD 100,000 - 150,000

5 days ago
Be an early applicant

Site Reliability Engineer

DefenseStorm

Atlanta

Remote

USD 90,000 - 140,000

5 days ago
Be an early applicant

Staff Software Engineer, Reliability Engineer - Store Systems & Services (Remote)

Lensa

Atlanta

Remote

USD 120,000 - 190,000

Yesterday
Be an early applicant

Staff Software Engineer, Reliability Engineer - Store Systems & Services (Remote)

Lensa

Atlanta

Remote

USD 120,000 - 190,000

2 days ago
Be an early applicant

Site Reliability Engineer III

Sinch

Atlanta

Remote

USD 142,000 - 181,000

4 days ago
Be an early applicant

Site Reliability Engineer III

ZipRecruiter

Atlanta

Remote

USD 142,000 - 181,000

8 days ago

Site Reliability Engineer

Jobot

Indianapolis

Remote

USD 100,000 - 150,000

10 days ago

Site Reliability Engineer

Jobot

Philadelphia

Remote

USD 100,000 - 150,000

10 days ago

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Philadelphia

Remote

USD 110,000 - 180,000

5 days ago
Be an early applicant