Job Search and Career Advice Platform

Enable job alerts via email!

Test Environment Manager

Technopride Ltd

City Of London

On-site

GBP 70,000 - 90,000

Full time

23 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading tech company in the UK is seeking a Test Environment Manager (TEM) to transform the Software Development Lifecycle environment. This role entails automating test environments, establishing service objectives, and fostering a culture of reliability. The ideal candidate will have over 15 years of experience and a strong background in system reliability and automation. Strong technical proficiency in tools such as Prometheus and Jenkins is essential.

Qualifications

  • 15+ years of experience in IT or DevOps-related role.
  • Strong engineering mindset with focus on automation and reliability.
  • Experience with incident management and process improvement.

Responsibilities

  • Develop Infrastructure as Code to automate test environments.
  • Establish and monitor Service Level Objectives.
  • Implement observability practices for performance monitoring.

Skills

Observability tools proficiency
CI/CD platforms expertise
Cloud platforms knowledge
Scripting skills
Linux foundation

Tools

Prometheus
Jenkins
Docker
GitLab CI
Job description

Job Title: Test Environment Manager (TEM)

Experience Level: 15+ years

Role Overview

The Test Environment Manager (TEM) plays a pivotal role in transforming the Software Development Lifecycle (SDLC) environment. This role requires a strong engineering mindset with a focus on system reliability, automation, and performance in non-production environments. The TEM will lead the design, automation, monitoring, and continuous improvement of test environments to support development, testing, and delivery teams.

Key Responsibilities
Operational Responsibilities
  • Automate Environment Lifecycle: Develop Infrastructure as Code (IaC) to provision, configure, and decommission test environments, integrating with CI/CD pipelines.
  • Define Service Objectives: Establish and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs), such as availability and provisioning time, to ensure environments meet team needs.
  • Monitor Health & Performance: Implement observability practices using tools like Prometheus and Grafana to proactively identify and resolve performance bottlenecks.
  • Incident Management: Lead incident response for environment-related issues, conducting blameless post-mortems and implementing sustainable solutions.
  • Reduce Toil: Identify and automate repetitive manual tasks to improve efficiency and free up engineering capacity.
Strategic & Cultural Responsibilities
  • Drive Continuous Improvement: Analyze metrics, incidents, and reports to identify opportunities for improvement and innovation.
  • Balance Reliability & Speed: Apply error budget principles to manage trade-offs between reliability and delivery speed.
  • Promote Reliability Culture: Foster a culture of shared ownership and blameless incident response across development, QA, and SRE teams.
  • Capacity Planning: Forecast and plan for future infrastructure needs based on usage patterns and demand.
  • Advance Test Data Management: Collaborate with Test Data Managers to ensure test data availability, compliance, consistency, and automated provisioning.
Technical Skills
  • Strong proficiency in observability, monitoring, and logging tools (e.g., Prometheus, Splunk, Grafana).
  • Expertise in CI/CD platforms (e.g., Jenkins, GitLab CI) and configuration management tools (e.g., Ansible, Terraform).
  • In-depth knowledge of cloud platforms (e.g., AWS) and containerization technologies (Docker, Kubernetes), as well as serverless architectures.
  • Advanced scripting skills in Python, Bash, or similar languages to automate environment management tasks.
  • Solid foundation in Linux systems, networking concepts, and database management.
Soft Skills
  • Leadership & Influence: Ability to drive adoption of SRE practices and influence stakeholders across technical and business functions.
  • Problem-Solving: Strong analytical and debugging skills to resolve complex environment issues under pressure.
  • Communication: Excellent verbal and written communication skills to collaborate effectively across teams.
  • Adaptability: Proactive, flexible mindset to adapt to evolving technologies and development practices.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.