Member of Technical Staff - Image / Video Applications

Black Forest Labs

Freiburg im Breisgau

On-site

EUR 40,000 - 60,000

Full-time

Posted 8 days ago

Summary

A generative AI company is looking for an Applied Researcher to develop control mechanisms for image and video generation models. Responsibilities include training large-scale models and developing features like color palettes and transparency channels. Candidates should have strong knowledge of PyTorch and experience with large-scale model training. This position is located in Freiburg im Breisgau, Germany.

Qualifications

Experience finetuning Diffusion models for image and video applications.
Strong proficiency in transformer models and other neural network architectures.
Experience with profiling and optimizing multi-GPU operations.

Tasks

Train large-scale Diffusion models with advanced control mechanisms.
Develop conditioning mechanisms for production needs.
Communicate design results and decisions with the team.
Analyze trade-offs of control architectures.

Skills

Experience training large-scale Diffusion models for image and video data
Strong proficiency in PyTorch
Deep understanding of training techniques such as FSDP and low-precision training
Deep understanding of evaluating image and video generative models

Overview

At Black Forest Labs, we're on a mission to advance the state of the art in generative deep learning for media, building powerful, creative, and open models that push what's possible. Born from foundational research, we continuously create advanced infrastructure to transform ideas into images and videos. Our team pioneered Latent Diffusion, Stable Diffusion, and FLUX.1 – milestones in the evolution of generative AI. Today, these foundations power millions of creations worldwide, from individual artists to enterprise applications.

Role and Responsibilities

We are looking for an Applied Researcher to develop precise control mechanisms for our image and video generation models, enabling users to direct outputs through practical controls like color palettes, transparency channels, and other production-ready features.

Training large-scale Diffusion (transformer) models with advanced control mechanisms (hex color control, transparency generation, custom aspect ratios, etc.)
Developing conditioning mechanisms for practical production requirements in image and video generation
Rigorously ablating design choices for applied controls and communicating results & decisions with the broader team
Reasoning about the speed and quality trade-offs of control architectures for real-world applications

What we look for

Experience training large-scale Diffusion models for image and video data
Finetuning Diffusion models for image and video applications, such as image and video upscalers and inpainting and outpainting models
Deep understanding of how to effectively evaluate image and video generative models
Strong proficiency in PyTorch, transformer models, and other neural network architectures
Deep understanding of training techniques such as FSDP, low-precision training, and model parallelism

Nice to have

Experience writing forward and backward Triton kernels and verifying their correctness while accounting for floating-point error
Profiling, debugging, and optimizing single- and multi-GPU operations using tools such as Nsight or stack trace viewers
