Aktiviere Job-Benachrichtigungen per E-Mail!

Staff Computer Vision Engineer (m/f/d)

Deltia GmbH

Berlin

Hybrid

EUR 70.000 - 90.000

Vollzeit

Heute
Sei unter den ersten Bewerbenden

Zusammenfassung

A leading AI software company in Berlin seeks a skilled individual to design and develop vision-language architectures, manage video-text datasets, and collaborate on cutting-edge AI projects. Candidates with 5-7 years in deep learning, strong Python engineering skills, and effective communication are encouraged to apply. This hybrid position offers a competitive salary, stock options, and a supportive culture emphasizing diversity and inclusion.

Leistungen

Competitive salary and stock options
New office in AI Campus
Diverse and inclusive culture
Flat hierarchy and flexible working hours

Qualifikationen

  • 5-7 years in deep learning with video or large-scale NLP.
  • Proven experience with multi-GPU clusters (PyTorch, distributed training).
  • Strong Python and data pipeline skills.

Aufgaben

  • Design vision-language architectures for use cases.
  • Create training pipelines on in-house GPU clusters.
  • Manage large-scale video-text datasets.

Kenntnisse

Deep-learning experience
Video computer vision
NLP / LLM fine-tuning
Large-model training
Python engineering
Data pipelines management
Effective communication
Jobbeschreibung
Location

Hybrid

Employment Type

Full time

Location Type

Hybrid

Department

Engineering

Your Role
  • Design and prototype vision‑language architectures that fuse video streams with text for “sense‑plan‑act” use‑cases.
  • Build end‑to‑end training pipelines on a small in‑house H200 GPU cluster; handle distributed training, evaluation and iteration.
  • Curate, augment and manage large‑scale video‑text datasets—including synthetic caption generation—driving continuous model improvement.
  • Create automated testing and benchmarking suites (video QA, captioning, instruction‑following) and turn findings into model refinements.
  • Collaborate with a lean team to set technical direction and transition research code into deployable services.
  • Work with the robotics team on the A part of the VLA.
Preferred to have:
  • 5‑7 yrs deep‑learning work in either
  • video computer vision (e.g., action recognition, temporal transformers, video classification) or
  • large‑scale NLP / LLM fine‑tuning.
  • Proven large‑model training on multi‑GPU clusters (PyTorch, distributed, mixed precision).
  • Strong Python engineering, data pipelines, and experiment hygiene.
  • Comfortable wrangling big video datasets (IO, sampling, augmentation) or large text corpora.
  • Effective communicator in small teams.
Nice‑to‑haves
  • Direct experience with video‑text or vision‑language models (e.g., Video‑LLM, Flamingo, Idefics).
  • Publications or open‑source in multimodal learning.
  • Cross‑modal fusion tricks (Q‑Former, Perceiver Resampler).
  • Synthetic captioning, retrieval augmentation, RLHF.
  • Sense‑plan‑act / embodied AI projects.
  • Inference optimization (TensorRT, quantization).

What we offer

A competitive salary & stock options

The option to work in a brand new office situated on the AI Campus in Berlin where we closely cooperate with other exciting AI ventures

Be on the forefront in defining what artificial intelligence means in manufacturing

Gain hands‑on experience in working in an AI‑first software company

Supportive and inclusive culture that values diversity and promotes the advancement of underrepresented groups within the company

Collaborate with a diverse (currently more than 10 nationalities) and talented team, working on cutting‑edge projects with real‑world impact

Network with professionals and leaders in the field, opening doors to potential future career opportunities

We have a very flat hierarchy, open 360° feedback, and flexible working hours

Ethics⚖: We are committed to developing ethical AI software
Don't meet all the requirements?

Deltia is committed to creating a workplace that is diverse, fair, and inclusive. We encourage candidates from all backgrounds, even if they do not meet every qualification, to submit their application. We firmly believe that having a team with diverse perspectives only strengthens our company and drives innovation. Our commitment also extends to providing an accessible environment for everyone, including those with disabilities. Please let us know if you require any accommodations during the application process or while working with us, and we will do our best to support you.

Hol dir deinen kostenlosen, vertraulichen Lebenslauf-Check.
eine PDF-, DOC-, DOCX-, ODT- oder PAGES-Datei bis zu 5 MB per Drag & Drop ablegen.