Enable job alerts via email!

Site Reliability Engineer Principal

Federal Express, Inc.

Memphis (TN)

Remote

USD 120,000 - 160,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in logistics is seeking a Site Reliability Engineer Principal to enhance system reliability and performance. This role involves complex problem resolution, mentoring junior engineers, and driving initiatives for continuous improvement. Candidates should have a strong background in cloud platforms, programming, and reliability engineering, with the opportunity to work remotely from anywhere in the U.S.

Qualifications

  • 7 years of experience in IT or engineering environment with a Bachelor's degree, or 5 years with a Master's degree.
  • Proven track record in maintaining system availability at scale.
  • Expert knowledge in Application Performance Monitoring.

Responsibilities

  • Takes ownership of complex problems and technical design gaps.
  • Drives improvement initiatives for global reliability.
  • Mentors junior engineers.

Skills

Cloud platforms
Programming and scripting
Infrastructure as code
Virtualization and containerization
Monitoring, alerting and observability
Reliability and availability engineering
Incident management
Software development lifecycle

Education

Bachelor's degree in Computer Science, Engineering, Information Systems
Master's degree in Computer Science, Engineering, Information Systems

Tools

Docker
Kubernetes

Job description

About FedEx Dataworks:

Born out of FedEx, a pioneer that ships nearly 20 million packages a day and manages endless threads of information, FedEx Dataworks is an organization rooted in connecting the physical and digital sides of our network to meet today's needs and address tomorrow's challenges.


We are creating opportunities for FedEx, our customers, and the world at large by:

  • Exploring and harnessing data to define and solve true problems
  • Removing barriers between data sets to create new avenues of insight
  • Building and iterating on solutions that generate value
  • Acting as a change agent to advance curiosity and performance

At FedEx Dataworks, we are making supply chains work smarter for everyone.


Company Name:
FedEx Dataworks, Inc.

Job Title:
Site Reliability Engineer Principal

Location:
3630 Hacks Cross Road, Memphis, TN 38125 (100% Remote)

Job Description:

Takes ownership and responsibility in the end-to-end resolution of complex problems and technical design gaps. Drives improvement initiatives that support the overarching global reliability of the company's systems, including capacity planning, failover strategies, performance improvements, reduction of Mean Time to Awareness/Resolve and postmortems. Provides technical solutions including specifying of requirements, functional decomposition, analysis, development and testing for current, new and major programs. Leverages critical thinking to improve best practices and provides enterprise-level recommendations that ensure reliability and resiliency. Advises and mentors junior engineers.


Qualifications:

Bachelor's degree or equivalent in Computer Science, Engineering, Information Systems or related field plus 7 years of experience in the job offered or 7 years' equivalent work experience in information technology or engineering environment. Alternatively, a Master's degree in the same fields plus 5 years of experience. Experience with:

  • Cloud platforms, e.g., Azure, AWS or GCP
  • Programming and scripting
  • Infrastructure as code
  • Virtualization and containerization, e.g., Docker, Kubernetes
  • Monitoring, alerting and observability
  • Reliability and availability engineering, including SLAs and SLOs
  • Incident management, troubleshooting, root cause analysis, continuous improvement
  • Software development lifecycle, DevOps, CICD

Proven track record in maintaining system availability at scale, with expert knowledge in Application Performance Monitoring and end user experience measurement. Ability to identify, debug, and propose solutions for issues of scale and performance. Experience with implementing end-to-end monitoring and alerting.


Position can telecommute from any location in the U.S.


*One year of related experience can substitute for one year of education.


FedEx Dataworks is an Equal Opportunity/Affirmative Action employer committed to diversity and inclusion. We provide reasonable accommodations for qualified individuals with disabilities. For accommodations, contact DataworksTalentAcquisition@corp.ds.fedex.com.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer Principal

FedEx Group

Memphis

Remote

USD 120,000 - 160,000

Yesterday
Be an early applicant

Site Reliability Engineer. Principal

Akamai Technologies

Cambridge

Remote

USD 148,000 - 308,000

2 days ago
Be an early applicant

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Cloud ERP

Remote

USD 120,000 - 160,000

8 days ago

Principal Systems Safety Engineer Avionics (REMOTE)

Pratt & Whitney

Remote

USD 101,000 - 203,000

3 days ago
Be an early applicant

Engineer, Site Reliability

Syniti

Remote

USD 100,000 - 140,000

4 days ago
Be an early applicant

Principal AI Safety Engineer: Technical Lead

General Motors

Remote

USD 120,000 - 160,000

6 days ago
Be an early applicant

Principle AV Safety Engineer: SOTIF and Human Factors

General Motors

Remote

USD 120,000 - 160,000

7 days ago
Be an early applicant

Sr Platform Engineer - GenAI - AWS - Remote

Lensa

Richmond

Remote

USD 140,000 - 170,000

Today
Be an early applicant

Principal, Platform Engineer

Mastercard

Remote

USD 120,000 - 160,000

4 days ago
Be an early applicant