
Aktiviere Job-Benachrichtigungen per E-Mail!
Erstelle in nur wenigen Minuten einen maßgeschneiderten Lebenslauf
Überzeuge Recruiter und verdiene mehr Geld. Mehr erfahren
A cutting-edge AI company in Berlin is seeking a talented individual to develop and implement neural audio codecs for sound, music, and speech. The ideal candidate will have a strong background in deep learning for audio and hands-on experience in designing and training neural audio codecs. With fresh funding secured, you'll have true autonomy and responsibility from day one, allowing your contributions to shape the company's direction. Competitive compensation and equity are also part of the package.
Mirelo AI is building the next generation of creative tools by generating realistic sound, speech and music from video.
We develop cutting-edge foundational generative AI models that "unmute" silent video content and create custom, hyper-realistic audio for gaming, video platforms, and creators. Our technology empowers global storytellers to transform their content.
We recently closed a $41 million Seed round co-led by Andreessen Horowitz and Index Ventures with participation from Atlantic, and are rapidly expanding across Product, Engineering, Go-to-Market, and Growth.
At Mirelo, we're pushing the limits of what generative audio can do, and our ability to innovate depends heavily on the quality of our underlying audio representations. In this role, you'll work at the core of our modeling stack—designing, training, and evaluating neural audio codecs that directly shape the performance of our next-generation music and sound models. You'll collaborate closely with the model team, experiment with both continuous and discrete representations, and build the evaluation tools that help us understand what actually moves the needle. Your work will sit at the foundation of building the best-sounding generative models in the world.
Develop and implement new neural audio codecs for sound, music and speech that push the state-of-the-art in sound quality and are optimized for the use case of generative models.
Think about the specific challenges that arise when the codec is primarily used as a latent representation in the context of generative audio models (in the end, the ultimate goal is to build the best audio generative models).
Explore the trade-offs of continuous (as typically used for diffusion models) vs. discrete audio representations (as typically used for autoregressive models).
Develop benchmarking pipelines for codec evaluation.
Conduct initial experiments with generative models to verify that a new candidate codec is actually useful for our downstream tasks.
Strong background in deep learning for audio: neural codecs, source separation, speech models, or generative audio systems.
Specific hands‑on experience in designing and training neural audio codecs.
Solid understanding of audio signal processing fundamentals.
Strong track record (research and/or open‑source) in the field of audio ML.
Hands‑on experience with generative audio models and good intuition of how the choice of the codec influences the training and performance of the generative model.
Strong publication record (e.g., NeurIPS, ICML, ICLR, Interspeech, ICASSP, WASPAA).
Join at a pivotal moment. We've secured fresh funding and are gaining traction—now is when your contributions can make a real difference to our success.
True ownership from day one. You'll have genuine autonomy and responsibility. Your ideas and work will directly shape our product and company direction.
Competitive compensation and equity. We offer strong packages that ensure you share in the success you help create.
Build for the next generation of creators. Be part of the innovation that will transform how creators work and thrive.
We welcome applications from all individuals, regardless of ethnic origin, gender, disability, religion or belief, age, or sexual orientation and identity.