
Aktiviere Job-Benachrichtigungen per E-Mail!
Erstelle in nur wenigen Minuten einen maßgeschneiderten Lebenslauf
Überzeuge Recruiter und verdiene mehr Geld. Mehr erfahren
A dynamic tech startup in Berlin is seeking a dedicated researcher for developing multimodal generative models for audio creation. In this role, you will contribute to the entire lifecycle of model development, from design and training to deployment. Ideal candidates will have a background in training large-scale generative models and a strong understanding of modern deep learning. This position offers competitive compensation and the opportunity to shape groundbreaking audio technologies.
Mirelo AI is building the next generation of creative tools by generating realistic sound, speech and music from video.
We develop cutting‑edge foundational generative AI models that "unmute" silent video content and create custom, hyper‑realistic audio for gaming, video platforms, and creators. Our technology empowers global storytellers to transform their content.
We recently closed a $41 million Seed round co‑led by Andreessen Horowitz and Index Ventures with participation from Atlantic, and are rapidly expanding across Product, Engineering, Go‑to‑Market, and Growth.
At Mirelo, you'll work at the centre of how we build the next generation of multimodal video‑to‑audio models. This role is deeply hands‑on and research‑heavy: with a great H100/200‑per‑engineer ratio you explore and build new multimodal models and push the boundaries of what's possible in music, sound, and speech generation. You'll collaborate closely across research and engineering, run focused ablations, and translate experimental results into clear next steps for the team. From data curation to deployment, you'll help shape the full lifecycle of the models that power our products and partnerships.
We welcome applications from all individuals, regardless of ethnic origin, gender, disability, religion or belief, age, or sexual orientation and identity.