
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading technology company is seeking a Research Scientist for their Voice AI team in London. This role involves applying AI techniques to develop audio and speech technologies using large language models. Candidates should possess strong AI research skills with a focus on speech and audio, along with a PhD or relevant experience. The position includes setting ambitious research goals and publishing findings to influence AI communities. This opportunity offers the chance to impact product experiences across various platforms.
The Voice AI team in EMEA, part of the Meta Superintelligence Labs, is looking for a Research Scientist (Speech and Language). The Voice AI team works on large language models (LLMs) with native supporting for processing, understanding and generating of audio and speech as a modality besides others such as text or vision. As part of this, we are leveraging knowledge in areas like speech/audio encoders/tokenizer, pre-training, post-training, (online) reinforcement learning, LLM alignment, multimodal modelling, speech and audio processing, speech recognition (ASR), speech synthesis (TTS), and multilingual modelling. Our work is focused on advancing core technologies to drive and advance core product experiences at Meta such as video dubbing on IG/FB or Meta AI which is available on e.g. RayBan Meta glasses or within WhatsApp.
Apply relevant AI and machine learning techniques to build and advance audio and speech technologies using large language models that can be applied to a wide area of Meta production use cases
Work towards long-term ambitious research and productization goals, while identifying intermediate milestones
Work with large data, and contribute to development of large scale foundation models
Influence progress of relevant research communities by producing publications
PhD degree in Artificial Intelligence (AI), computer science, related technical fields with 1+ years of experience, or BS degree with 3+ years of industrial research experience in the related field
AI research experience in the domains of audio and speech processing
First-author publications at peer-reviewed AI conferences (e.g. Interspeech, ICASSP, ASRU, SLT, NeurIPS, CVPR, ICML, ICLR, ICCV, ACL)
Strong skills to communicating complex research for public audiences or peers
Experience developing machine learning algorithms in e.g. Python, PyTorch, C/C++
Research experience in generative AI, especially in building and optimising large language models for areas of audio/ speech processing and understanding, computer vision and/or natural language understanding beyond black-box use
Additional AI research experience in computer vision and/or NLP
Previous internship(s) and/or research assistantship(s) in an AI research organization
Industry experience working on Speech, Language, and LLM related topics and the experience to apply relevant AI and machine learning techniques to build intelligent rich speech & language systems for improving product experiences
Interest in taking new research findings in this area and implementing them towards product needs
Internet