Senior Cloud Engineer

System One

Herndon (VA)

Remote

USD 100,000 - 140,000

Full time

Yesterday

Job summary

A leading company in workforce solutions is seeking a Senior Cloud Engineer to design and manage observability solutions across AWS. This remote role requires strong AWS experience and scripting skills, focusing on system reliability and performance optimization. Join a team dedicated to delivering efficient and high-quality services.

Benefits

Health and welfare benefits
401(k) plan

Qualifications

  • 3+ years of experience in cloud infrastructure with emphasis on AWS.
  • Strong experience with CloudWatch metrics, logs, alarms.
  • Cloud certifications (AWS DevOps Engineer, Solutions Architect Associate).

Responsibilities

  • Design and implement health checks and probes for cloud infrastructure.
  • Configure and manage monitoring tools (CloudWatch, Grafana, Datadog).
  • Perform root cause analysis and incident correlation.

Skills

CloudWatch
Scripting (Python, Bash, Node.js)
CI/CD tools
Cloud architecture principles
Analytical skills

Education

BA/BS in IT, Computer Science or related field

Tools

Grafana
Datadog
OpenSearch

Job description

Senior Cloud Engineer
100% Remote
Hours: Eastern, Central and Mountain time zones
Security Clearance: Must be able to obtain Public Trust Clearance
US citizenship required per government contract
W2 ONLY, NO C2C


ALTA IT Services is seeking a detail-oriented and proactive Sr. Cloud Engineer to design, implement, and manage observability solutions across our cloud infrastructure. In this role, you’ll be responsible for ensuring system reliability and visibility through best-in-class monitoring, logging and alerting practices across AWS. You’ll work with operations and compliance teams to ensure our AWS workloads meet performance expectations while meeting security, regulatory and cost-efficiency standards. This role is key to driving visibility, governance and financial accountability in our cloud environment.

Responsibilities:
• Design and implement health checks and probes for cloud infrastructure and applications across AWS
• Define and deploy readiness and liveness probes for containers running in EKS/ECS
• Write custom scripts for CloudWatch custom metrics and alarms based on application-specific probes
• Implement alerting and remediation automation based on probe outputs
• Document monitoring strategies, probe configurations and operational playbooks
• Define monitoring strategies for cloud resources, microservices and containerized workloads
• Implement automated health checks and uptime monitoring
• Continuously optimize and evolve the observability stack to improve reliability and reduce noise
• Configure and manage monitoring tools (CloudWatch, Grafana, Datadog, Prometheus)
• Set up monitoring thresholds, dashboards, and metrics for application and infrastructure
• Perform root cause analysis and incident correlation using monitoring and performance analysis tools
• Maintain a central inventory of all licensed software deployed in AWS environments (Windows, Oracle, Red Hat, SQL Server)
• Ensure compliance with vendor-specific licensing terms
• Monitor usage patterns and perform license audits and reconciliation
• Identify and remediate latency issues, throughput bottlenecks and underutilized resources
• Recommend and implement right-sizing of compute, memory and storage resources
• Analyze and optimize the performance of AWS resources, including EC2, RDS, Lambda, S3, ECS and EKS
• Conduct performance profiling and benchmarking for applications hosted on AWS
• Contribute to capacity planning, disaster recovery strategies and performance testing initiatives
• Create reports on system performance trends and opportunities, capacity planning and cost-performance trade-offs

Required Qualifications:
• BA/BS in IT, Computer Science or related field (equivalent work experience may be accepted in lieu of the degree)
• 3+ years of experience in cloud infrastructure with emphasis on AWS
• Strong experience with CloudWatch (metrics, logs, alarms), CloudWatch Synthetics (canary scripting), Route 53 health checks and failover strategies
• Proficient in scripting languages like Python, Bash or Node.js
• Hands-on experience with CI/CD tools (GitHub, GitLab, Kubernetes, DevOps)
• Cloud certifications (AWS DevOps Engineer, Solutions Architect Associate)
• Proficient with license management tools and cost optimization platforms
• Solid understanding of cloud architecture principles, autoscaling strategies and load balancing
• Strong written and verbal communication skills for technical and non-technical stakeholders
• Excellent analytical and problem-solving skills
• Must be a US Citizen.
• Must be able to obtain and maintain a Public Trust clearance

Preferred Qualifications:
• Hands-on experience with observability stacks like Grafana, OpenSearch, Datadog
• Familiarity with FinOps practices and cost-performance trade-offs

System One and its subsidiaries, including Joulé, ALTA IT Services, and Mountain Ltd., are leaders in delivering outsourced services and workforce solutions across North America. We help clients get work done more efficiently and economically, without compromising quality. System One not only serves as a valued partner for our clients but also offers eligible employees health and welfare benefits coverage options, including medical, dental, vision, spending accounts, life insurance, and voluntary plans, as well as participation in a 401(k) plan.

System One is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, age, national origin, disability, family care or medical leave status, genetic information, veteran status, marital status, or any other characteristic protected by applicable federal, state, or local law.

Ref: #850-Rockville (ALTA IT)