Enable job alerts via email!

Cloud Site Reliability Engineer II - Remote

CentralSquare

United States

Remote

USD 100,000 - 130,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in enterprise software seeks a Cloud Site Reliability Engineer II to enhance reliability and performance in Cloud operations. This role involves collaborative problem-solving, automation, and maintaining efficient processes, offering growth potential within a dynamic environment and a range of competitive benefits.

Benefits

Tuition reimbursement
Parental leave
Paid volunteer hours
Unlimited PTO
Flexible work environment
Excellent work-life balance

Qualifications

  • Experience with Cloud (AWS and/or Azure), Linux, Java, Python, C, and Ruby.
  • Natural collaboration skills and proactive approach to problem-solving.
  • Strong background in automation and performance tuning.

Responsibilities

  • Design, develop, and maintain software solutions for Cloud Operations.
  • Monitor systems to collect metrics and optimize performance.
  • Participate in 24x7 operational support and on-call rotation.

Skills

Collaboration
Problem Solving
Continuous Improvement
Root Cause Analysis
Scripting

Education

Bachelor's or Graduate's Degree in Computer Engineering or related field

Tools

AWS
Azure
Linux
Terraform
SQL
Python
Java
JavaScript
C
Ruby

Job description

CentralSquare is a unique enterprise software company whose mission is to build safer, smarter, more connected communities.More than8,000 public sector agenciestrust CentralSquare solutions each and every day. We serve governments of all sizes, from small towns to major cities, to make delivering public services less costly and more efficient.

Cloud Site Reliability Engineer II - Remote

United States
  • Apply
About CentralSquare Technologies

CentralSquare is a unique enterprise software company whose mission is to build safer, smarter, more connected communities.More than8,000 public sector agenciestrust CentralSquare solutions each and every day. We serve governments of all sizes, from small towns to major cities, to make delivering public services less costly and more efficient.

Job Description

What We’re About

At CentralSquare, you’ll get the opportunity to work in a collaborative environment within a company that builds complex web-based enterprise applications for our Public Servants across North America.

Looking to grow your career? That’s great! We believe in growing and cultivating careers here. There is plenty of room for growth for motivated people.

Hard work should be rewarded. We are committed to providing competitive compensation with a great benefits package, including tuition reimbursement, parental leave, paid volunteer hours, and unlimited PTO. Our flexible work environment also enables you to take advantage of an excellent work-life balance whether you are in office or working remotely.

The Role

We’re passionate about building software and processes that solve problems. We count on our Site Reliability Engineers (SREs) to enable users with a rich feature set, high availability, and stellar performance level to pursue their missions for successful Cloud deployments, performance, and reliability.

Site Reliability Engineers (SREs) incorporate software engineering aspects and apply them to infrastructure and operations problems. They apply software engineering principles to systems administration and serve as bridges between a company’s development and operations. They perform functions and on-call duties and develop the systems and software that bolster site reliability and performance. They build self-service tools for user groups that provide automation and rely on their services, including automatic system provisioning, automated product upgrades and statistical visualizations.

SREs perform break-fix resolution and other project tasks while looking forward and creating services that reduce the amount of manual work required for administration. They collaborate with product developers to ensure designed solutions respond to non-functional requirements including security and maintainability and work with release engineers to confirm that software delivery pipelines are as efficient as possible.

Roles and Responsibilities:

  • Activities include designing, developing, installing, and maintaining software solutions that provide efficiency in Cloud Operations.

  • Work with engineering teams to refine deployment and release processes.

  • Collaborate with the engineering team on projects as the expert on reliability, performance, and efficiency.

  • Assist product engineers in development and deployment of backend applications.

  • Be prepared to explain your work, decisions, and ideas to your colleagues.

  • Participate in 24x7 operational support and on-call rotation shifts.

  • Ensure that all system design and procedures are documented and up-to-date.

  • Combine existing documentation where available, and create it where needed, to create a centralized body of knowledge for all team members to utilize. Contribute to the upkeep of documentation to maintain relevancy and accuracy.

  • Provide training and education to Cloud Operations on infrastructure and internal tooling.

  • Provide level of audit and control to security personnel.

  • Monitor systems to collect metrics for tuning and capacity planning.

  • Work to automate detection and resolution of recurring issues.

  • Build the whole stack from load balancers to the databases.

  • Ensure safety, predictability, repeatability and auditability of all build and deploy processes.

  • Provide technical leadership to other CentralSquare departments.

  • Develop, coach, mentor individuals and teams and ensure high performance in a fast-paced environment.

  • Build tools and automation that eliminate repetitive tasks and prevent incident occurrence.

Skills & Requirements Requirements
  • Bachelor's or Graduate's Degree in computer engineering, computer science, engineering or information systems management, or equivalent experience.

  • Experience with Cloud (AWS and/or Azure), Linux, JAVA, Python, C, UNIX, and Ruby software and systems.

  • Experience with Terraform, Agile, SaaS, SQL, Cloud Architecture, and Javascript software and systems.

  • Comfortable scripting and debugging.

  • Natural collaboration skills and an eye on continuous improvement.

  • Fluent in scalability and root cause analysis exercises.

  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

REMOTE Systems Safety Engineer

Lensa

null null

Remote

Remote

USD 85 000 - 120 000

Full time

30 days ago

Site Reliability Engineer II - Cloud Networking - Remote

Akamai Technologies

null null

Remote

Remote

USD 83 000 - 175 000

Full time

30+ days ago

Site Reliability Engineer II - Remote

Akamai Technologies

null null

Remote

Remote

USD 100 000 - 125 000

Full time

30+ days ago

Site Reliability Engineer II - Remote

Akamai Technologies GmbH

null null

Remote

Remote

USD 83 000 - 175 000

Full time

30+ days ago

Site Reliability Engineer II - Remote

Akamai Technologies GmbH

null null

Remote

Remote

USD 83 000 - 175 000

Full time

30+ days ago

Site Reliability Engineer II - Remote

Akamai Technologies

null null

Remote

Remote

USD 83 000 - 175 000

Full time

30+ days ago