Enable job alerts via email!

Cloud System Administrator 2 w/ 3 years experience

Onyx Point, Inc.

Annapolis (MD)

On-site

USD 80,000 - 110,000

Full time

11 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Cloud Systems Administrator with a strong background in system administration and cloud technologies. In this pivotal role, you will support the implementation and maintenance of IT systems while managing complex infrastructures. Your expertise in troubleshooting and monitoring large distributed systems will be crucial for ensuring operational excellence. The ideal candidate will have a passion for learning new technologies, a collaborative spirit, and the ability to mentor junior staff. This position offers a dynamic work environment where your contributions will directly impact the success of innovative projects.

Qualifications

  • Active TS/SCI W/ Polygraph security clearance required.
  • 3 years of experience in system administration and monitoring of large distributed systems.

Responsibilities

  • Support implementation, troubleshooting, and maintenance of IT systems.
  • Manage IT system infrastructure and provide Tier 1 and Tier 2 support.

Skills

System Administration
Troubleshooting
Cloud Computing
Scripting (Bash, Perl, Python)
Linux Management
Configuration Management
Project Management
Team Collaboration

Education

Bachelor’s Degree in Engineering
Hadoop/Cloud System Administrator Certification

Tools

Puppet
SALT
Kubernetes
Docker
HAProxy
NGINX
Elasticsearch
Logstash
Grafana

Job description

Job Requirements and Responsibilities

To be considered for this position, you must have an active TS/SCI W/ Polygraph security clearance (U.S. citizenship required).

The Cloud Systems Administrator will contribute to:

  • Provide support for implementation, troubleshooting, and maintenance of IT systems.
  • Manage IT system infrastructure and related processes.
  • Support day-to-day operations, monitoring, and problem resolution for client/server/storage/network devices and mobile devices.
  • Deliver Tier 1 (Help Desk) and Tier 2 (Escalation) problem diagnosis and resolution.
  • Support escalation processes and communicate status updates to management and customers.
  • Configure and manage UNIX and Windows operating systems, including troubleshooting and network configuration, to enhance system reliability and performance.

The role also involves supporting large clusters, requiring:

  • At least three years of experience in system administration and monitoring of large distributed systems, including multiple clusters, spanning at least 3 racks with a minimum of 60 nodes per site.
  • Experience diagnosing and troubleshooting large-scale cloud computing systems, with familiarity in distributed storage and retrieval technologies such as Hadoop, Cassandra, Scality, Swift, Gluster, Lustre, GPFS, Amazon S3, or similar big data or HPC technologies.
  • Ability to work within a team, follow SOPs, communicate effectively, accept feedback, and receive guidance from senior technical staff.
  • Willingness to learn new technologies and leverage team resources for professional growth.
  • Independently handle complex tasks and mentor junior staff.
  • Experience in planning, leading, and managing complex technical projects involving multiple teams.

Additional technical skills include:

  • Five years of experience scripting with Bash, Perl, or Python.
  • Seven years of experience with Linux core components, including LDAP, DHCP, DNS, and TFTP management.
  • Experience with configuration management tools like Puppet and SALT.
  • Expertise in Linux PXE/network provisioning, RAID utilities, TFTP, and disk scripting.
  • Experience troubleshooting hardware via remote utilities such as VNC, serial over LAN, IPMI, and BIOS configurations.
  • Understanding of corporate architecture, openSSL, and Java keystores.
  • Experience with hardware troubleshooting, including SGI/HP systems.

Education: Three years of relevant experience is required; a Bachelor’s Degree in Engineering, Systems Engineering, Computer Science, or Mathematics is highly desirable and equivalent to two years of experience. A Hadoop/Cloud System Administrator Certification or similar is required.

Preferred additional skills include:

  • Knowledge of SSH tunneling, SOCKS proxies, and utilities like rysn, pdsh, pdcp, WinSCP.
  • Basic network concepts such as VLANs, port channel bonding, and switch interactions.
  • Experience with load balancers like HAProxy and NGINX.
  • Experience with Kubernetes, Docker, log aggregation tools like Elasticsearch, Logstash, Grafana, and Rsyslog.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Cloud System Administrator 2 w/ 3 years experience Information Technology Annapolis Junction, MD

Onyx Point, Inc.

Maryland

On-site

USD 80,000 - 100,000

30+ days ago