Enable job alerts via email!

Senior AI Infrastructure/Workflow Engineer

Braintrust

United Kingdom

Remote

GBP 60,000 - 90,000

Full time

7 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in AI content generation seeks a Senior AI Infrastructure/Workflow Engineer to design and optimize systems for their innovative platform. You will work on cutting-edge projects, enhancing workflows and managing cloud infrastructure, while collaborating with cross-functional teams to drive the future of digital marketing.

Benefits

Fully remote, flexible schedule
Competitive pay package
Access to cutting-edge resources
Opportunity to shape the roadmap for AI video creation

Qualifications

  • 5+ years of hands-on experience in Python automation and scripting.
  • Deep familiarity with Stable Diffusion training/inference.
  • Expertise deploying and managing GPU-accelerated cloud infrastructure.

Responsibilities

  • Architect, build, and maintain end-to-end systems for AI content production.
  • Train and fine-tune Stable Diffusion checkpoints and custom datasets.
  • Implement multi-processing and distributed batch pipelines for reliability.

Skills

Python automation
Stable Diffusion
GPU cloud infrastructure
Distributed processing
Linux server administration
Problem-solving

Tools

Terraform
Kubernetes
Docker-Swarm

Job description

Senior AI Infrastructure/Workflow Engineer

2 days ago Be among the first 25 applicants

Get AI-powered advice on this job and more exclusive features.

Job Description

We're pioneering the next generation of AI-powered content creation technologies. Using advanced image generation pipelines, custom workflows, and proprietary tooling, we generate, curate, and scale hundreds of thousands of images weekly—soon expanding into video. Our mission is to revolutionize digital marketing through AI-powered content generation at scale, delivering personalized engagement across platforms backed by enterprise-grade cloud infrastructure.

Role Overview: As a top engineer, you will architect, build, and maintain the end‑to‑end systems powering our AI content production platform. You’ll design and optimize Stable Diffusion training and inference workflows, develop and manage automation tools (local and cloud), and scale GPU instances to create tens of millions of assets. You’ll balance coding, systems design, and data analysis to push our platform’s performance, reliability, and feature set forward.

Key Responsibilities

  • Model & Workflow Development
    • Train and fine‑tune Stable Diffusion checkpoints, LoRAs, and custom datasets at scale
    • Design and maintain ComfyUI graphs and custom nodes for efficient batch generation
    • Prototype new AI-video tooling leveraging image workflows as foundational blocks
  • Automation & Tooling
    • Author robust Python scripts and services to review, sort, tag, and organize huge quantities of images/day
    • Implement multi‑processing, threading, and distributed batch pipelines for speed and reliability
    • Build dashboard backends and lightweight frontends to monitor job status, results, and metrics
  • Cloud Infrastructure & Architecture
    • Help deploy, manage, and optimize GPU cloud instances
    • Automate provisioning, scaling, and health‑checks (IaC tools like Terraform, Ansible, or similar)
    • Design cost‑effective, high‑throughput data storage and transfer strategies between on‑prem & cloud
  • Data Review & Analysis
    • Analyze generation outputs, synthesize quality metrics, and recommend iterative improvements
    • Build tools for rapid A/B comparisons, anomaly detection, and content selection at scale
  • Cross‑Functional Collaboration
    • Work closely with creative, product, and leadership teams to align technical roadmap with business goals
    • Document processes, share best practices, and mentor junior engineers as the team grows
Required Qualifications

  • 5+ years of hands‑on experience in Python automation and scripting (large‑scale batch processing)
  • Deep familiarity with Stable Diffusion training/inference, LoRA methods, and checkpoint management
  • Proven track record building and optimizing custom ComfyUI workflows or similar node‑based systems
  • Expertise deploying and managing GPU‑accelerated cloud infrastructure
  • Strong understanding of distributed processing, concurrency (multiprocessing, threading, async)
  • Solid foundation in Linux server administration and CI/CD pipelines
  • Excellent problem‑solving skills, self‑starter attitude, and ability to operate independently

Preferred Qualifications

  • Experience architecting scalable either image or video generation related pipelines
  • Familiarity with frontend frameworks for lightweight dashboarding
  • Hands‑on with Terraform, Kubernetes, or Docker‑Swarm for container orchestration
  • Background in data engineering / ETL processes for large multimedia datasets
  • Prior work in AI content moderation, classification, or metadata tagging
  • Content Analytics: Experience implementing automated content quality assessment and performance analytics
  • Data Pipeline Experience: Background in building robust data pipelines for training, fine-tuning, and continuous model improvement

What You’ll Get

  • Architecting high‑impact projects driving the future of AI influencers
  • Fully remote, flexible schedule; collaborate with a lean, passionate team
  • Competitive pay package
  • Access/budget(s) to any cutting‑edge resources
  • Opportunity to shape the roadmap from images to next‑gen AI video creation

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Information Technology
  • Industries
    Hospitals and Health Care, Non-profit Organizations, and Government Administration

Referrals increase your chances of interviewing at Braintrust by 2x

Get notified about new Senior Infrastructure Engineer jobs in United Kingdom.

City Of London, England, United Kingdom 1 month ago

London, England, United Kingdom 3 months ago

Blackburn, England, United Kingdom 1 week ago

Warrington, England, United Kingdom 1 week ago

City Of London, England, United Kingdom 1 month ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI Infrastructure/Workflow Engineer (Remote)

Braintrust

Remote

GBP 65,000 - 90,000

3 days ago
Be an early applicant