Enable job alerts via email!

Site Reliability Engineer

Visa

Ashburn (VA)

On-site

USD 80,000 - 130,000

Full time

8 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Site Reliability Engineer to enhance the performance and reliability of its .NET applications. This role involves collaborating with development teams, implementing infrastructure automation, and monitoring application health. Ideal candidates will have a strong background in C#/.NET and Azure cloud services, along with experience in configuration management tools. Join a forward-thinking company that values innovation and offers a dynamic work environment where your contributions will significantly impact operational excellence.

Qualifications

  • 3+ years in a technical role like Systems Administrator or Network Engineer.
  • Strong understanding of system administration and networking principles.

Responsibilities

  • Ensure reliability and performance of .NET applications.
  • Implement infrastructure automation tools and incident response procedures.

Skills

C#/.NET
Azure Cloud Services
Infrastructure as Code (IaC)
Monitoring and Alerting Tools
Configuration Management Tools

Education

Bachelor's Degree

Tools

Azure DevOps
Ansible
Puppet
Chef
Prometheus
Grafana

Job description

Job Description

The Site Reliability Engineer will play a critical part in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with ASP.NET MVC, Angular, and Web API.

  • Partner with developers and product operations teams to understand application requirements and translate them into operational practices.
  • Design, implement, and maintain infrastructure automation tools using Infrastructure as Code (IaC) methodologies.
  • Monitor application health and performance metrics, proactively identifying and resolving potential issues.
  • Implement incident response procedures to ensure timely resolution of outages and service disruptions.
  • Establish and improve best practices for product solution design / architecture, and development.
  • Participate in peer and team code reviews by developing comprehensive coding standards and guidelines to ensure consistency, maintainability, and quality in software development. By establishing clear protocols for code formatting, naming conventions, error handling, testing, and documentation, we can enhance code readability, reduce defects, and facilitate knowledge sharing among team members.
  • Collaborate with engineers to develop and implement disaster recovery plans.
  • Continuously improve monitoring and alerting processes to ensure efficient problem identification and resolution.
  • Stay up-to-date on the latest advancements in .NET infrastructure and SRE best practices.

Must Haves:

  • In current job, actively involved in applying established architectural, coding best practices, and conducting code reviews.
  • Critical and advance understanding of supportability and maintainability KPIs.
  • Strong development background in C#/.NET; Although this role does not require daily coding, strong development background and knowledge is required.
  • Experienced with at least one programming language.
  • Senior level understanding with Azure cloud services and Azure DevOps.
  • Someone who currently has a deep level knowledge in at least one of any of the following: Azure Pipelines, Releases, Ansible, Puppet or Chef.

Qualifications:
Qualifications

  • Bachelor degree required
  • Minimum 3+ years of experience in a related technical role (e.g., Systems Administrator, Network Engineer) required
  • Experience with configuration management tools like Ansible, Puppet, or Chef preferred
  • Azure experience required
  • Familiarity with monitoring and alerting tools (.NET performance counters, Azure App Insight, Prometheus, Grafana) is a plus preferred
  • Ability to manage and coordinate multiple projects in a fast paced, highly professional environment.
  • While coding proficiency is not required, a strong understanding of the .NET ecosystem and a desire to delve into infrastructure and automation will be essential for success.
  • Strong understanding of system administration principles, including operating systems (Windows Server preferred) and networking concepts.
  • Familiarity with monitoring and alerting tools (.NET performance counters, Azure App Insight, Prometheus, Grafana)
  • Ability to work independently and as part of a team

Additional Information

SGS is an Equal Opportunity Employer, and as such we recruit, hire, train, and promote persons in all job classifications without regard to race, color, religion, sex, national origin, disability, age, marital status, sexual orientation, gender identity or expression and Indigenous status, or any other characteristics protected by law.

To perform this job successfully, an individual must be able to perform each essential duty satisfactorily with or without reasonable accommodations. The requirements listed above are representative of the knowledge, skills, and/or abilities required.

This job description should not be construed as an exhaustive statement of duties, responsibilities, or requirements, but a general description of the job. Nothing contained herein restricts the company's rights to assign or reassign duties and responsibilities to this job at any time.

Accommodations are available on request for qualified candidates during each stage of the recruitment process.

Please note that candidates applying for Canadian job openings should be authorized to work in Canada.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer

Element Solutions

Washington

Remote

USD 75,000 - 100,000

4 days ago
Be an early applicant

Site Reliability Engineer

Basecamp Consulting and Solutions LLC

Great Falls Crossing

Remote

USD 80,000 - 120,000

6 days ago
Be an early applicant

Site Reliability Engineer

ZipRecruiter

Great Falls Crossing

Remote

USD 80,000 - 120,000

3 days ago
Be an early applicant

Site Reliability Engineer

Basecamp Consulting & Solutions LLC

Great Falls Crossing

Remote

USD 80,000 - 120,000

5 days ago
Be an early applicant

Site Reliability Engineer - Remote

Donnelley Financial, LLC

Rockville

Remote

USD 90,000 - 130,000

6 days ago
Be an early applicant

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Philadelphia

Remote

USD 110,000 - 180,000

6 days ago
Be an early applicant

Site Reliability Engineer

Kforce Inc

Atlanta

Remote

USD 125,000 - 150,000

2 days ago
Be an early applicant

Site Reliability Engineer - Remote US

Lensa

Nashville

Remote

USD 115,000 - 135,000

Today
Be an early applicant

[Hiring] Site Reliability Engineer @JatApp

JatApp

Remote

USD 80,000 - 120,000

Yesterday
Be an early applicant