
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading recruitment agency is seeking a Cloud Platform Engineer in Cambridge. In this hybrid role, you will design and deliver secure, scalable cloud ecosystems using AWS services like EC2 and EKS. The position requires strong experience with AWS environments and Infrastructure-as-Code tools like Terraform. You'll lead Kubernetes rollouts, enhance platform resilience, and drive automation. Ideal candidates should hold a technical degree, have excellent communication skills, and a willingness to relocate or commute to Cambridge.
Cloud Platform Engineer -AWS, Degree, Cloud, Linux
Location: Cambridge (Hybrid) Salary: 70,000- 100,000 (DOE) + Benefits
As a Cloud Platform Engineer, you'll be at the heart of building and evolving a modern, scalable cloud ecosystem. Your mission will be to design and deliver secure, high-performance infrastructure that underpins critical applications and services. Working closely with engineering, security, and reliability teams, you'll influence architectural decisions, champion automation, and ensure the platform is resilient, cost-effective, and ready for future growth. This is a chance to make a real impact on a global technology leader while shaping the next generation of cloud solutions.
Please be aware this is not a fully remote role and would require you to relocate to Cambridge or at least be within commuting distance - do consider this before applying.
Architect and maintain robust cloud environments on AWS, leveraging services such as EC2, EKS, RDS/Aurora, ElastiCache, OpenSearch, and CloudFront.
Lead the rollout and optimization of Kubernetes on EKS, ensuring reliable deployments and efficient scaling across workloads.
Design automated infrastructure solutions using Infrastructure-as-Code tools like Terraform, integrating them seamlessly into CI/CD pipelines.
Introduce and refine deployment strategies that minimize downtime, including blue/green, rolling, and canary approaches.
Strengthen platform resilience by improving autoscaling, high availability, and eliminating single points of failure.
Work closely with SRE and Security teams to enhance monitoring and observability through Prometheus, Grafana, and CloudWatch.
Embed security best practices into every layer of the platform, covering IAM, secrets management, WAF, and compliance.
Drive cost efficiency and performance improvements through proactive automation and resource optimization.
Contribute to operational excellence by participating in on-call rotations and post-incident reviews.