Senior Site Reliability Engineer II

ConnectWise

United States

Remote

USD 100,000 - 130,000

Full time

Today

Job summary

A leading software company is seeking a Site Reliability Engineer to enhance the reliability and performance of production systems. The role involves collaborating with development and operations teams to design robust infrastructure solutions and automate processes, ensuring high availability and performance of business-critical applications.

Qualifications

  • 5+ years of experience in Information Technology related areas.
  • 5+ years of hands-on experience managing Windows servers.
  • 3+ years of experience working with AWS offerings.

Responsibilities

  • Design, build, implement, maintain, and support system infrastructure.
  • Develop and leverage automation and monitoring capabilities for cloud solutions.
  • Participate in troubleshooting efforts and incident resolution activities.

Skills

Windows Servers
Python
PowerShell
Bash
AWS
SQL
GitLab CI
Agile
DevOps
Communication

Education

Bachelor’s degree

Tools

AWS
CloudFormation
Terraform
Dynatrace
DataDog
CloudWatch

Job description

ConnectWise is a global, industry-leading software company with over 3,000 colleagues in North America, EMEA, and APAC. As a community-driven software company dedicated to the success of technology solution providers, we offer a suite that helps over 45,000 of our partners manage their businesses better, sell more efficiently, automate service delivery, and remotely control technology so they can consistently deliver amazing customer experiences.

Our company is powered by our connections, our colleagues, and our community. And, we accept all kinds.

Game-changers, innovators, culture-lovers—and humankind.

We invite discovery and debate. We recognize key moments as milestones.

We see you and value you for your unique contributions. Our inclusive, positive culture lays the foundation to ensure every colleague is valued for their perspectives and skills, giving you the choice of how YOU make a difference.

Curious? Read this opportunity to learn how YOU can make a difference at ConnectWise!



General Summary:

Site Reliability Engineers enhance the reliability, scalability, and performance of production systems across the organization. They bridge the gap between development and operations teams, collaborating with software engineers, product managers, and technical stakeholders to design and implement robust infrastructure solutions that support business-critical applications and services. SREs apply software engineering principles to operations problems, creating automated solutions to ensure system availability, latency, performance, and capacity.

Essential Duties and Responsibilities:

  • Design, build, implement, maintain, and support system infrastructure.
  • Define and improve software delivery, system configuration, security, performance, and operational mechanisms of varied Cloud infrastructures in use by different projects and company efforts.
  • Identify impact, present options, plan delivery activities, mitigate downtime risks, recommend strategies, and estimate level of effort for creating new or modifying existing Cloud infrastructures for projects.
  • Develop and leverage automation and monitoring capabilities for complex cloud-based solutions.
  • Consistently apply and enforce Cloud Engineering standards and best practices.
  • Assist in creating test, demonstration, and proof-of-concept environments.
  • Find and recommend technical improvements and cost-saving measures.
  • Participate in troubleshooting efforts and incident resolution activities.
  • Keep stakeholders abreast of the status of their requests.
  • Write technical documentation and keep process-related information current.

Knowledge, Skills, and/or Abilities Required:

  • 5+ years of hands-on experience managing Windows servers, including DNS, networking, and IIS.
  • 5+ years of experience programming in Python, PowerShell, and Bash.
  • 3+ years of experience working with AWS offerings. Proficient in the use of EC2/ECS, RDS, S3, Route53, SWF, ELB, VPC networking, Redis, and OpenSearch.
  • Ability to write basic SQL queries and analyze data for troubleshooting purposes.
  • Practical experience building deployment pipelines on GitLab CI.
  • Practical experience using observability platforms like Dynatrace, DataDog, and CloudWatch.
  • Practical experience using the .NET command-line debugger mdbg or similar.
  • Knowledge of package-management tools like Artifactory.
  • Solid understanding of the DevOps mindset and the Infrastructure as Code (IaC) philosophy using CloudFormation or Terraform.
  • Possess strong analytical and problem-solving skills to resolve or coordinate the resolution of complex Cloud infrastructure issues in a flexible and effective manner.
  • Experience with on-call incident management using PagerDuty or similar tools.
  • Practical experience working in Agile, distributed environments.
  • Ability to work on multiple priorities and/or projects simultaneously, independently or with a team.
  • Demonstrated ability to learn quickly.
  • Excellent verbal and written communication skills.

Educational/Vocational/Previous Experience Recommendations:

  • 5+ years of experience in Information Technology related areas.
  • Bachelor’s degree preferred.
  • AWS certification preferred.

Working Conditions:

  • Professional, fast-paced remote/office environment.
  • Availability during off hours to assist in the resolution of production incidents.
  • Some travel may be required.

ConnectWise is an Equal Opportunity Employer, dedicated to building a diverse and inclusive workforce and providing a workplace free from discrimination and harassment. ConnectWise provides equal employment opportunities to all employees and applicants without regard to race, ethnicity, color, religion, age, sex (including pregnancy), sexual orientation, gender, gender identity or expression, ancestry, national origin, citizenship status, physical or mental disability, genetic information, military/veteran status, marital status, familial or parental status, or any other characteristic or status protected by applicable federal, state and local laws.

The statements above are intended to describe the general nature and level of work being performed by individuals assigned to this job. Other duties may be assigned as needed. Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential functions of the job and/or to receive other benefits and privileges of employment. If you need a reasonable accommodation for any part of the application and hiring process, please contact us at talentacquisition@connectwise.com or 1-800-671-6898.
