Enable job alerts via email!

Research Scientist, Gemini Audio, UK

Google DeepMind

City Of London

On-site

GBP 80,000 - 100,000

Full time

3 days ago

Be an early applicant

Job summary

A leading AI research company in London is looking for a Research Scientist to enhance Gemini's audio capabilities. Responsibilities include designing improvements, collaborating with a multidisciplinary team, and optimizing audio processing. Applicants should hold a PhD in a related field and possess experience in machine learning and audio processing. We are committed to diversity and equal opportunity in the workplace.

Qualifications

PhD in Computer Science, Machine Learning, or related field.
Experience in Machine Learning, speech and audio processing.
Research Publications at leading conferences/journals.

Responsibilities

Design improvements to Gemini's audio capabilities.
Work on the interaction of audio with other modalities.
Enhance the Gemini model's efficiency for audio processing.
Improve training and evaluation infrastructure for audio.

Skills

Machine Learning

Speech processing

Audio processing

Large Language Models

Programming in Python/C++

Education

PhD in Computer Science or related field

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

About Us

Our team focuses on improving Gemini’s audio and speech capabilities. In particular, we work on

Modeling innovations to improve Gemini’s audio performance.
Improving the training data for Gemini models.
Create new evaluations to quantify the performance of Gemini models in the audio space.
Adding new audio capabilities to Gemini Audio, both through modelling changes and data changes.
Maintaining and improving the Gemini Audio infrastructure.

The Role

The responsibilities of someone hired in this role are to:

Design, implement and deliver improvements to Gemini to improve its audio capabilities.
Work with the broader Gemini team to understand the interaction between audio with the rest of the modalities in Gemini.
Improve the efficiency of the Gemini model when processing audio.
Improve the training and evaluation infrastructure of the Gemini model, with specific focus on audio.
Design experiments and deploy proof‑of‑concept demos in the areas described above.

About You

In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:

PhD in Computer Science, Machine Learning, a related technical field or equivalent practical experience.
Experience in Machine Learning, speech and general audio processing.
Experience in Large Language Models.
Programming experience in Python/C++.
Research Publications at leading conferences/journals.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Application Deadline: 31st October

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.