Enable job alerts via email!

Digital Site Reliability Engineer- REMOTE

NTT DATA, Inc.

Memphis (TN)

Remote

USD 80,000 - 130,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a dynamic and innovative team as a Digital Site Reliability Engineer, where you'll leverage your expertise in Kubernetes, Gitlab, and CI/CD to enhance system reliability and performance. This remote role offers a unique opportunity to collaborate with cross-functional teams, tackle complex challenges, and mentor fellow engineers. In an environment that thrives on adaptability and forward-thinking, your contributions will directly impact the efficiency of site reliability processes. If you're passionate about technology and eager to drive improvements in a fast-paced setting, this is the perfect role for you.

Qualifications

  • 5-7 years in technology with expertise in Kubernetes and CI/CD.
  • Experience with observability platforms and performance issues.

Responsibilities

  • Collaborate on release architectures and monitor frameworks.
  • Mentor team members and improve SRE processes.

Skills

Kubernetes
Gitlab
Dynatrace
GraphQL
Node.js
React
CI/CD pipelines
BASH shell scripting
Python
Docker

Job description

Digital Site Reliability Engineer- REMOTE

Date: Feb 18, 2025

Location: Memphis, TN, US

Company: NTT DATA Services

NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.

We are currently seeking a Digital Site Reliability Engineer- REMOTE to join our team in Memphis, Tennessee (US-TN), United States (US).

Job Title: Digital Site Reliability Engineer

Location:Temporary Remote

Job Description:

We are seeking a highly skilled and experienced Reliability Engineer to join our team. The ideal candidate must have a strong background in technology, with specific expertise in Kubernetes, Gitlab, Dynatrace, GraphQL, Node, and React, along with a good understanding of CI/CD pipelines. The candidate must be comfortable with ambiguity, eager to learn, and possess perseverance.

Responsibilities:

  • Collaborate with cross-functional teams to develop and maintain release architectures and monitor frameworks.
  • Provide system design consulting and critical support to the development team prior to program launch.
  • Identify and solve sophisticated performance and scaling issues, working with engineers to avoid bottlenecks and meet traffic demands.
  • Mentor and guide team members, helping them grow in their roles.
  • Identify and implement automation and monitoring tools to improve the efficiency and effectiveness of SRE processes.
  • Take ownership of any critical incidents and work towards timely resolution and prevention of future occurrences.

Requirements:

  • Five (5) to Seven (7) years of professional experience in technology or a related field.
  • Two (2) years of experience with Kubernetes/EKS.
  • Two (2) years of experience with CI/CD pipelines.
  • Two (2) years of experience with a sophisticated observability platform including RUM and APM.

Good To Have Requirements:

  • Capabilities utilizing Dynatrace APM and RUM (other APM or RUM may be applicable) - Dynatrace Associate Certification is a plus.
  • Intermediate to Advanced skills in BASH shell scripting, Python, and Docker.
  • Intermediate skills with on-prem Gitlab CI pipeline creation, troubleshooting, and configuration of Gitlab CI.

Preferred Qualifications:

  • Solve sophisticated performance and scaling issues, working with engineers to ensure that we avoid bottlenecks and meet traffic demands through organic growth and marketing events.
  • Strong problem-solving skills and the ability to work in a fast-paced environment.
  • Communicate effectively with stakeholders, including management, to provide updates, recommendations, and solutions for any SRE-related issues.
  • Excellent communication and collaboration skills.
  • Experience with Kubernetes/EKS and pod life cycle management including readiness and liveness checks.
  • Experience with building and supporting CI/CD pipelines and production releases.
  • Working knowledge of complex CDN cached website architecture.

Basic Qualifications:

  • Minimum 5 years Source Code Management (SCM) and DevOps-Containerization-EKS.
  • Minimum 1 year Platform Administration-Monitoring-Dynatrace.

About NTT DATA

NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Digital Site Reliability Engineer- REMOTE

NTT DATA North America

Memphis

Remote

USD 80,000 - 120,000

30+ days ago

Sr. Data Reliability Engineer (Remote)

CrowdStrike

Philadelphia

Remote

USD 110,000 - 180,000

11 days ago

Senior Site Reliability Engineer

Walrath Recruiting, Inc.

New York

Remote

USD 100,000 - 150,000

Today
Be an early applicant

Site Reliability Engineer

FIS

New York

Remote

USD 84,000 - 143,000

Today
Be an early applicant

Site Reliability Engineer - Remote

Optum

Basking Ridge

Remote

USD 110,000 - 115,000

Today
Be an early applicant

Senior Site Reliability Engineers

Centene Corporation

Clayton

Remote

USD 112,000 - 159,000

Today
Be an early applicant

Site Reliability Engineer

Kforce Inc

Atlanta

Remote

USD 125,000 - 150,000

7 days ago
Be an early applicant

System Safety Engineer

Leidos

Huntsville

Remote

USD 89,000 - 163,000

Yesterday
Be an early applicant

[Hiring] Site Reliability Engineer @JatApp

JatApp

Remote

USD 80,000 - 120,000

7 days ago
Be an early applicant