Enable job alerts via email!

Senior Principal Site Reliability Engineer

Atlassian

Aurora (CO)

Remote

USD 210,000 - 313,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Atlassian is seeking a Site Reliability Engineer to enhance the reliability of their software applications across cloud platforms. The role involves designing monitoring solutions, troubleshooting performance issues, and collaborating with cross-functional teams, all within a flexible remote working environment. Candidates with extensive experience in building high-scale distributed systems are encouraged to apply.

Benefits

Health coverage
Paid volunteer days
Wellness resources

Qualifications

  • 10+ years of software development experience.
  • 3+ years in a technical lead role focusing on distributed systems.
  • Solid understanding of cloud platforms and containerization technologies.

Responsibilities

  • Design and implement monitoring solutions for software applications.
  • Conduct performance analysis and troubleshoot issues.
  • Mentor junior team members on monitoring best practices.

Skills

Monitoring solutions design
Performance analysis
Collaboration
Troubleshooting
Cloud computing

Education

Bachelor's or Master's degree in Computer Science

Tools

Prometheus
Grafana
ELK Stack
Datadog

Job description

Site Reliability Engineering | Mountain View, United States or Remote | Remote, Americas | Full-Time

Working at Atlassian
Atlassians can choose where they work – whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Interviews and onboarding are conducted virtually, a part of being a distributed-first company.

Design, architect, and implement monitoring and observability solutions for complex software applications and infrastructure.

Evaluate and select appropriate monitoring tools and technologies based on project requirements and industry trends.

Conduct performance analysis, capacity planning, and troubleshooting to identify and address performance bottlenecks and reliability issues.

Collaborate with cross-functional teams to gather requirements and define monitoring strategies.

Develop monitoring frameworks, dashboards, and alerting systems to ensure critical systems' reliability, performance, and availability.

Implement best practices for log management, metrics collection, and distributed tracing to gain deep insights into system behavior and performance.

Mentor and provide guidance to junior team members on monitoring best practices and methodologies

Stay up-to-date with emerging technologies and industry trends in observability, monitoring, and devOps practices.

Bachelor's or Master's degree in Computer Science, Information Technology, or related field.

10+ years of software development experience

With over 3 years of experience in a technical lead role, specializing in designing and developing high-scale distributed systems.

Strong communication and collaboration skills with the ability to work effectively in a fast-paced environment

Excellent analytical and problem-solving skills with a keen attention to detail

Proven experience designing and implementing monitoring solutions for large-scale, distributed systems

Solid understanding of cloud computing platforms (e.g., AWS , Azure, GCP ) and containerization technologies (e.g., Docker, Kubernetes).

Preferred Qualifications:

Strong proficiency in monitoring tools and technologies such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, etc

Knowledge of software development methodologies such as Agile or DevOps

Compensation

At Atlassian, we strive to design equitable and explainable compensation programs. To support this goal, the baseline of our range is higher than that of the typical market range, but in turn we expect to hire most candidates near this baseline. Base pay within the range is ultimately determined by a candidate's skills, expertise, or experience.

In the United States, we have three geographic pay zones. For this role, our current base pay ranges for new hires in each zone are:

Zone A: $234,100 - $312,100

Zone B: $210,700 - $280,900

Zone C: $194,300 - $259,000

This role may also be eligible for benefits, bonuses, commissions, and equity. Please visit go.atlassian.com/payzones for more information on which locations are included in each of our geographic pay zones. However, please confirm the zone for your specific location with your recruiter. #LI-Remote

Our perks & benefits
Atlassian offers a variety of perks and benefits to support you, your family and to help you engage with your local community. Our offerings include health coverage, paid volunteer days, wellness resources, and so much more. Visit go.atlassian.com/perksandbenefits to learn more.

About Atlassian
At Atlassian, we're motivated by a common goal: to unleash the potential of every team. Our software products help teams all over the planet and our solutions are designed for all types of work. Team collaboration through our tools makes what may be impossible alone, possible together.
We believe that the unique contributions of all Atlassians create our success. To ensure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines.
To provide you the best experience, we can support with accommodations or adjustments at any stage of the recruitment process. Simply inform our Recruitment team during your conversation with them.
For San Francisco Only: Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
To learn more about our culture and hiring process, visit go.atlassian.com/crh

Don’t see an exact role match? No problem! Join our Talent Community and stay up-to-date on company and careers updates relevant to your career.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Lead Site Reliability Engineer - Remote

Lensa

null null

Remote

Remote

USD 106,000 - 222,000

Full time

Yesterday
Be an early applicant

Senior Lead Site Reliability Engineer - Remote

Akamai Technologies

null null

Remote

Remote

USD 106,000 - 222,000

Full time

30+ days ago

Principal Site Reliability Engineer, Federal

MedStar Health

null null

Remote

Remote

USD 217,000 - 325,000

Full time

Yesterday
Be an early applicant

Principal Site Reliability Engineer

Atlassian

Aurora null

Remote

Remote

USD 170,000 - 275,000

Full time

30+ days ago

Principal Site Reliability Engineer

Cribl

null null

Remote

Remote

USD 240,000 - 400,000

Full time

4 days ago
Be an early applicant

Principal Site Reliability Engineer - Americas

Ashby

City of Syracuse null

On-site

On-site

USD 200,000 - 260,000

Full time

Yesterday
Be an early applicant

Director, Site Reliability Engineer

FyrFly Venture Partners

null null

Remote

Remote

USD 175,000 - 225,000

Full time

10 days ago

Principal Platform Engineer

Systems Planning and Analysis

null null

Remote

Remote

USD 185,000 - 260,000

Full time

Yesterday
Be an early applicant

Principal Platform Engineer

Systems Planning and Analysis, Inc.

null null

Remote

Remote

USD 185,000 - 260,000

Full time

Yesterday
Be an early applicant