Digital Site Reliability Engineer - REMOTE

NTT DATA, Inc.

Memphis (TN)

Remote

USD 80,000 - 130,000

Full time

30+ days ago

Job summary

Join a dynamic and innovative team as a Digital Site Reliability Engineer, where you'll leverage your expertise in Kubernetes, GitLab, and CI/CD to enhance system reliability and performance. This remote role offers a unique opportunity to collaborate with cross-functional teams, tackle complex challenges, and mentor fellow engineers. In an environment that thrives on adaptability and forward-thinking, your contributions will directly impact the efficiency of site reliability processes. If you're passionate about technology and eager to drive improvements in a fast-paced setting, this is the perfect role for you.

Qualifications

  • 5-7 years in technology with expertise in Kubernetes and CI/CD.
  • Experience with observability platforms and performance issues.

Responsibilities

  • Collaborate on release architectures and monitoring frameworks.
  • Mentor team members and improve SRE processes.

Skills

Kubernetes
GitLab
Dynatrace
GraphQL
Node.js
React
CI/CD pipelines
Bash shell scripting
Python
Docker

Job description

Digital Site Reliability Engineer - REMOTE

Date: Feb 18, 2025

Location: Memphis, TN, US

Company: NTT DATA Services

NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.

We are currently seeking a Digital Site Reliability Engineer - REMOTE to join our team in Memphis, Tennessee (US-TN), United States (US).

Job Title: Digital Site Reliability Engineer

Location: Temporary Remote

Job Description:

We are seeking a highly skilled and experienced Reliability Engineer to join our team. The ideal candidate must have a strong background in technology, with specific expertise in Kubernetes, GitLab, Dynatrace, GraphQL, Node.js, and React, along with a good understanding of CI/CD pipelines. The candidate must be comfortable with ambiguity, eager to learn, and persevering.

Responsibilities:

  • Collaborate with cross-functional teams to develop and maintain release architectures and monitoring frameworks.
  • Provide system design consulting and critical support to the development team prior to program launch.
  • Identify and solve sophisticated performance and scaling issues, working with engineers to avoid bottlenecks and meet traffic demands.
  • Mentor and guide team members, helping them grow in their roles.
  • Identify and implement automation and monitoring tools to improve the efficiency and effectiveness of SRE processes.
  • Take ownership of any critical incidents and work towards timely resolution and prevention of future occurrences.

Requirements:

  • Five (5) to seven (7) years of professional experience in technology or a related field.
  • Two (2) years of experience with Kubernetes/EKS.
  • Two (2) years of experience with CI/CD pipelines.
  • Two (2) years of experience with a sophisticated observability platform including RUM and APM.

Good To Have Requirements:

  • Experience using Dynatrace APM and RUM (experience with other APM or RUM tools may be applicable); Dynatrace Associate Certification is a plus.
  • Intermediate to advanced skills in Bash shell scripting, Python, and Docker.
  • Intermediate skills in creating, troubleshooting, and configuring on-prem GitLab CI pipelines.

Preferred Qualifications:

  • Ability to solve sophisticated performance and scaling issues, working with engineers to avoid bottlenecks and meet traffic demands from both organic growth and marketing events.
  • Strong problem-solving skills and the ability to work in a fast-paced environment.
  • Communicate effectively with stakeholders, including management, to provide updates, recommendations, and solutions for any SRE-related issues.
  • Excellent communication and collaboration skills.
  • Experience with Kubernetes/EKS and pod life cycle management including readiness and liveness checks.
  • Experience with building and supporting CI/CD pipelines and production releases.
  • Working knowledge of complex CDN cached website architecture.

Basic Qualifications:

  • Minimum 5 years of experience in Source Code Management (SCM) and DevOps-Containerization-EKS.
  • Minimum 1 year of experience in Platform Administration-Monitoring-Dynatrace.

About NTT DATA

NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com.
