Site Reliability Engineer

Pythian

United States

Remote

USD 90,000 - 150,000

Full time

2 days ago

Job summary

An innovative firm is seeking a Senior Site Reliability Engineer to enhance system reliability and performance. In this remote role, you will apply your expertise in object-oriented programming, particularly with Python or Go, and work with tools such as Terraform and AWS. You will play a pivotal role in mentoring junior engineers, ensuring data integrity, and leading incident responses. This position offers the flexibility of remote work, a competitive rewards package, and opportunities for professional development. If you are passionate about solving complex problems and driving business results, this role could be a strong fit for you.

Benefits

Competitive total rewards package
Substantial training allowance
Work from home equipment
Flexible work hours
Volunteer opportunities

Qualifications

  • 5+ years in an SRE role with experience in Terraform on AWS.
  • Proven ability to identify patterns in complex situations.

Responsibilities

  • Ensure services are reliable by focusing on performance and observability.
  • Mentor junior team members on code readability and reusability.

Skills

Object Oriented Programming
Automation
Infrastructure as Code
Logical and Analytical Thinking
Communication Skills

Tools

Terraform
AWS
Docker
Vault
ProxySQL
GitLab

Job description

Site Reliability Engineer

Europe (UK, Macedonia, Poland, Romania, Spain) | Remote | Work from home

Why Pythian:

At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the reliability and performance of mission-critical databases. We quickly earned a reputation for solving tough data challenges. We were there when the industry moved from on-premises to cloud environments, and as enterprises sought more from their data, we expanded our competencies to include advanced analytics.

Today, we empower organizations to embrace transformation and leverage advanced technologies, including AI, to stay competitive. We deliver innovative solutions that meet each client’s data goals and have built strong partnerships with Google Cloud, AWS, Microsoft, Oracle, SAP, and Snowflake. The combination of our extensive expertise in data and cloud and our ability to keep on top of the latest bleeding-edge technologies makes us the perfect partner to help mid- and large-sized businesses transform and stay ahead in today’s rapidly changing digital economy.

Why you?

As the Senior Site Reliability Engineer, you play a critical role in the following key areas of responsibility:

  • Ensure services and systems are reliable by focusing on scalability, latency, performance, availability, efficiency, and observability.
  • Develop systems and maintain key components to automate and minimize human labor.
  • Enhance system reliability while decoupling system size from operational toil and complexity.
  • Provide training and mentoring, both technical and customer-satisfaction oriented.

If you Love Your Data, enjoy solving complex business and technical problems, and want to Love Your Career, then this could be the job for you!


What will you be doing?
  • Use relevant development languages and knowledge of systems, services, and tools appropriate for the business area to build software applications. Serve as the primary contact for this area.
  • Develop readable and reusable code by applying standard patterns and using standard libraries.
  • Mentor junior team members on code readability and reusability.
  • Ensure data security, integrity, and quality by adhering to company standards and best practices.
  • Design solutions that fulfill current business requirements and accommodate future enhancements.
  • Guide teams to guarantee systems are reusable and interoperable.
  • Use best practices to reduce risks to business operations by creating clear documentation like runbooks and operational guides.
  • Collaborate with development teams to define and implement relevant observability metrics, with the goal of enhancing application reliability.
  • Spearhead incident response for issues impacting your track.
  • Leverage new technologies and automation to reduce operational and maintenance costs.
  • Eliminate technical debt, identify scaling bottlenecks, and proactively plan for future growth to ensure infrastructure is kept up-to-date.
  • Use excellent technical judgement, innovation, and execution to prioritize and solve challenging problems.
  • Independently drive business results across multiple teams by either leading high-level architectural decisions or implementing complex components of a project.
  • Set technical strategy and define technical roadmaps with cross-team dependencies for business impacting projects, requiring a high level of technical expertise.

What do we need from you?
  • Demonstrated experience in object-oriented programming with Python or Go.
  • Tool stack: Terraform, AWS, Docker, Vault, ProxySQL, and GitLab.
  • Skills: automation (general), Infrastructure as Code, Python scripting, Bash shell scripting, and Unix system administration.
  • 5 years working in an SRE role, including a minimum of two years of verifiable experience working with Terraform on AWS in a production environment.
  • Intermediate-level experience with MySQL or other relational database systems.
  • Logical and analytical thinking for problem-solving.
  • Proven ability to systematically identify patterns and underlying issues in complex situations.
  • Strong skills in identifying opportunities to improve processes, systems, and structures, enhancing performance through analysis and assessment of existing process flows, methods, and standards.
  • Experience in delivering clear, well-structured, and meaningful information to a target audience, using appropriate communication mediums and language tailored to the audience.
  • Ability to achieve mutually agreeable solutions by staying adaptable, communicating ideas clearly, and practicing active listening.

What do you get in return?
  • Love your career: Competitive total rewards package with an annual bonus
  • Love your development: Hone your skills or learn new ones with our substantial training allowance; participate in professional development days, attend conferences, become certified, whatever you like!
  • Love your work/life balance: Why commute? Work remotely from your home (forever); there’s no daily travel requirement to an office! You can be located anywhere in the US or Canada. All you need is a stable internet connection.
  • Love your workspace: We give you all the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalise your work environment!
  • Love your community: Blog during work hours; take a day off and volunteer for your favorite charity.

Hiring Disclaimer

The successful applicant will need to fulfill the requirements necessary to obtain a background check.

Accommodations are available upon request for candidates taking part in any aspect of the selection process.
