Enable job alerts via email!

High Performance Computing and AI Infrastructure Engineer

Lockheed Martin

Town of Texas (WI)

Remote

USD 73,000 - 130,000

Full time

Today
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading technology innovation company as a High Performance Computing and AI Infrastructure Engineer. You will be instrumental in developing and maintaining compute infrastructure products, supporting AI infrastructure, and ensuring optimal performance across systems. This full-time remote role offers flexible work schedules and a comprehensive benefits package, allowing you to thrive in a dynamic environment.

Benefits

Flexible Work Schedules
Comprehensive Benefits Package
Education Assistance
Paid Time Off

Qualifications

  • Experience with High Performance Computing and AI infrastructure development.
  • Demonstrated use of automation and orchestration tools.

Responsibilities

  • Support design and development of HPC and AI systems.
  • Perform full stack engineering and system administration.

Skills

Automation Mindset
Scripting Languages
AI Infrastructure
High Performance Computing

Education

Red Hat Enterprise Linux (RHEL) 7 or 8

Tools

Ansible
PowerShell
Terraform
JIRA
Docker
Kubernetes

Job description

High Performance Computing and AI Infrastructure Engineer

Join to apply for the High Performance Computing and AI Infrastructure Engineer role at Lockheed Martin

High Performance Computing and AI Infrastructure Engineer

1 week ago Be among the first 25 applicants

Join to apply for the High Performance Computing and AI Infrastructure Engineer role at Lockheed Martin

  • Experience with High Performance Computing infrastructure product development and/or maintenance
  • Experience with AI infrastructure product development and/or maintenance
  • Demonstrated automation mindset, including the use of automation and orchestration tools and scripting languages. Examples include Ansible, PowerShell, Terraform
Basic Qualifications

  • Experience with High Performance Computing infrastructure product development and/or maintenance
  • Experience with AI infrastructure product development and/or maintenance
  • Demonstrated automation mindset, including the use of automation and orchestration tools and scripting languages. Examples include Ansible, PowerShell, Terraform
  • Red Hat Enterprise Linux (RHEL) 7 or 8

Administration and Configuration

Job Code/Title

E9682:Full Stack Engineer

Job Description

Become part of the Future of IT at Lockheed Martin as a Full Stack Engineer within the FORCE Portfolio! This dynamic, fast-paced environment is embracing DevSecOps and Agile to enable our strategic goals. The Engineer role will be instrumental to the success of reinventing how we develop and maintain compute infrastructure products at Lockheed Martin to meet the needs of every business area. The FORCE Portfolio resides within the Enterprise IT Infrastructure and International (I2) Organization. The FORCE Portfolio includes (but is not limited to) development and operations for the following Product Teams: Compute IaaS (Virtualization, Server OS, OpenStack), PaaS,(Containers, Database Engines, Middleware Splunk), Storage, Data Center/Hardware, High Performance Computing (Simulation, AI/ML), Governance, Commercial Cloud Native Offerings, Service Management (Customer Portal, Job Scheduling). These solutions are built to meet global needs and include both Data Center and Edge locations for on-premise and in public cloud.

This Engineer role is aligned to a single Delivery Team within the HPC Product Team. The Delivery Team may be utilizing Scrum or Kanban agile frameworks. This Full Stack Engineer role is for the High Performance Computing (HPC) Delivery Team with a focus on AI Infrastructure.

Engineer Responsibilities Include

  • Support the design and development of HPC and utility systems (computation, network, and storage)
  • Support AI Infrastructure and the equivalent systems
  • Perform full stack engineering, including platform support, user software support, and manage queuing software to meet the computing needs of research projects
  • Responsible for System Administration on multiple system platforms and hardware.
  • Position supports multiple platforms which include small servers and large supercomputers
  • Will be responsible for system installations, upgrades, configuration management, configurations, software installation, troubleshooting, user interface and support
  • On-call support rotation will be required

This role requires U.S. Citizenship

This position is full-time telecommuting. Occasional travel (1-3 times a year) may be requested.

What’s In It For You

From onsite to remote, we offer flexible work schedules to comprehensive benefits investing in your future and security, Learn more about Lockheed Martin’s comprehensive benefits package here.

Do you want to be part of a company culture that empowers employees to think big, lead with a growth mindset, and make the impossible a reality? We provide the resources and give you the flexibility to enable inspiration and focus! If you have the passion and courage to dream big, work hard, and have fun doing what you love then we want to build a better tomorrow with you.

Desired Skills

  • Experience using agile management tool such as JIRA, VersionOne, Pivotal Tracker, etc
  • Experience with simulation and AI/ML software
  • Experience with DevOps / DevSecOps
  • Knowledge of various protocols (i.e., DNS, SMTP, NFS, FTP, Telnet, SSH, SFTP)
  • System performance, disk I/O, and network tuning and configuration experience
  • Experience in mitigating IT Tech Debt and retiring legacy products and services
  • Demonstrated use of metrics to make data driven decisions
  • Familiarity with Service Now for ITSM
  • Familiarity with AWS and/or Azure IT service development and maintenance
  • Familiarity with private cloud on-premise IT service development and maintenance
  • Experience working in a virtual environment
  • Fiber Channel (Direct Attach) Storage Array

Administration Experience

  • Experience with Trusted Multi-Level Security (MLS) Operating Systems
  • Familiarity with InfiniBand configuration and troubleshooting
  • Experience with containerization, Kubernetes, Docker

Other Important Information

By applying to this job, you are expressing interest in this position and could be considered for other career opportunities where similar skills and requirements have been identified as a match. Should this match be identified you may be contacted for this and future openings.

Ability to work remotely

Full-time Remote Telework: The employee selected for this position will work remotely full time at a location other than a Lockheed Martin designated office/job site. Employees may travel to a Lockheed Martin office for periodic meetings.

Select the Telework classification for this position

Employee that will telework full-time

Ability to Telecommute

Full time telecommuter

Shift

First

Work Schedule Information

Lockheed Martin supports a variety of alternate work schedules that provide additional flexibility to our employees. Schedules range from standard 40 hours over a five day work week while others may be condensed. These condensed schedules provide employees with additional time away from the office and are in addition to our Paid Time off benefits.

Work Schedule

4x10 hour day, 3 days off per week

Security Clearance

None

LMCareers Business Unit

ENTERPRISE BUSINESS SERVICES

Program

Information Technology

Department

50273:Infrastructure VP

Job Class

Information Technology

Job Category

Experienced Professional

City, State

Aguadilla-PR, Denver-CO, Fort Worth-TX, King of Prussia-PA, Littleton-CO, Orlando-FL, SUNNYVALE-CA

City

Aguadilla, Denver, Fort Worth, King of Prussia, Littleton, Orlando, Sunnyvale

Zip

00603, 19406, 32825, 76108, 80127, 80221, 94089

Virtual

yes

Relocation/Housing Stipend Available

Possible

Req Type

Full-Time

Direct/Indirect

Indirect

  • Join us at Lockheed Martin, where your mission is ours. Our customers tackle the hardest missions. Those that demand extraordinary amounts of courage, resilience and precision. They’re dangerous. Critical. Sometimes they even provide an opportunity to change the world and save lives. Those are the missions we care about.

As a leading technology innovation company, Lockheed Martin’s vast team works with partners around the world to bring proven performance to our customers’ toughest challenges. Lockheed Martin has employees based in many states throughout the U.S., and Internationally, with business locations in many nations and territories.

EEO

Lockheed Martin is an equal opportunity employer. Qualified candidates will be considered without regard to legally protected characteristics.

The application window will close in 90 days; applicants are encouraged to apply within 5 - 30 days of the requisition posting date in order to receive optimal consideration.

National Pay Statement

Pay Rate: The annual base salary range for this position in California and New York (excluding most major metropolitan areas), Colorado, Hawaii, Illinois, Maryland, Minnesota, Washington or Washington DC is $73,400 - $129,260. For states not referenced above, the salary range for this position will reflect the candidate’s final work location. Please note that the salary information is a general guideline only. Lockheed Martin considers factors such as (but not limited to) scope and responsibilities of the position, candidate's work experience, education/ training, key skills as well as market and business considerations when extending an offer.

Benefits offered: Medical, Dental, Vision, Life Insurance, Short-Term Disability, Long-Term Disability, 401(k) match, Flexible Spending Accounts, EAP, Education Assistance, Parental Leave, Paid time off, and Holidays.

(Washington state applicants only) Non-represented full-time employees: accrue at least 10 hours per month of Paid Time Off (PTO) to be used for incidental absences and other reasons; receive at least 90 hours for holidays. Represented full time employees accrue 6.67 hours of Vacation per month; accrue up to 52 hours of sick leave annually; receive at least 96 hours for holidays. PTO, Vacation, sick leave, and holiday hours are prorated based on start date during the calendar year.

This position is incentive plan eligible.

Premium Pay Statement

Pay Rate: The annual base salary range for this position in most major metropolitan areas in California and New York is $84,300 - $146,165. For states not referenced above, the salary range for this position will reflect the candidate’s final work location. Please note that the salary information is a general guideline only. Lockheed Martin considers factors such as (but not limited to) scope and responsibilities of the position, candidate's work experience, education/ training, key skills as well as market and business considerations when extending an offer.

Benefits offered: Medical, Dental, Vision, Life Insurance, Short-Term Disability, Long-Term Disability, 401(k) match, Flexible Spending Accounts, EAP, Education Assistance, Parental Leave, Paid time off, and Holidays.

This position is incentive plan eligible.
Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Information Technology
  • Industries
    Defense and Space Manufacturing

Referrals increase your chances of interviewing at Lockheed Martin by 2x

Get notified about new Infrastructure Engineer jobs in Texas, United States.

Austin, TX $85,000.00-$95,000.00 15 hours ago

Austin, TX $150,000.00-$175,000.00 2 weeks ago

Texas, United States $150,000.00-$200,000.00 2 weeks ago

Austin, TX $180,000.00-$215,000.00 3 months ago

Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - USA)

Fort Worth, TX $70,000.00-$110,000.00 9 hours ago

End User Computing Onsite Systems Engineer
Site Reliability Engineer (SRE, Remote US)

Austin, TX $120,000.00-$160,000.00 2 months ago

Platform Support Engineer, Sr / Remote / EDE - IBM Cloud Pak for Data
Engineer IV, Platform Infrastructure and SRE

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

High Performance Computing and AI Infrastructure Engineer, Sr

Lockheed Martin

Town of Texas

Remote

USD 100.000 - 130.000

Today
Be an early applicant

High Performance Computing and AI Infrastructure Engineer, Sr

Lockheed Martin

Remote

USD 89.000 - 179.000

Today
Be an early applicant

Sr Core Infrastructure Engineer HPC

Children's Mercy Hospital

Kansas City

Remote

USD 80.000 - 110.000

14 days ago

Sr Core Infrastructure Engineer HPC

Children's Mercy KC

Kansas City

Remote

USD 80.000 - 100.000

30+ days ago