Enable job alerts via email!

Research Scientist - Speech & Audio Understanding (Speech Generation)

Tencent

Bellevue (WA)

On-site

USD 141,000 - 266,000

Full time

16 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

Tencent is seeking a Research Scientist specializing in Speech & Audio Understanding with a focus on speech generation algorithms and multimodal voice technologies. This role involves leading R&D initiatives to enhance voice interaction experiences leveraging advanced AI methodologies. Candidates should have a Master's or Ph.D. in Computer Science or a related field, with strong backgrounds in deep learning and voice technologies.

Benefits

Eligible for sign-on payment
Relocation package available
Restricted stock units eligibility
Medical, dental, and vision benefits
401(k) plan participation
Vacation days between 15 to 25 annually
Paid sick leave up to 10 days annually

Qualifications

  • Experience with voice foundation models.
  • Familiarity with large model training frameworks is preferred.
  • Understanding of large model architectures.

Responsibilities

  • Lead technical R&D of voice foundation models.
  • Investigate multimodal voice foundation technologies.
  • Track advancements in speech generation algorithms.

Skills

Research in speech synthesis
Deep learning frameworks
Voice foundation technologies

Education

Master’s or Ph.D. in Computer Science

Tools

PyTorch

Job description

Join to apply for the Research Scientist - Speech & Audio Understanding (Speech Generation) role at Tencent

4 days ago Be among the first 25 applicants

Join to apply for the Research Scientist - Speech & Audio Understanding (Speech Generation) role at Tencent

  • Track the latest research in speech generation algorithms, explore next-generation paradigms for speech/audio generation, and push the boundaries of speech generation capabilities.
  • Investigate cutting-edge multimodal voice foundation model technologies to enhance voice interaction experiences by integrating text, speech, and vision.
  • Lead the technical R&D of voice foundation models, driving model performance improvements and innovative applications.

Business Unit

What The Role Entails

Job Responsibilities:

  • Track the latest research in speech generation algorithms, explore next-generation paradigms for speech/audio generation, and push the boundaries of speech generation capabilities.
  • Investigate cutting-edge multimodal voice foundation model technologies to enhance voice interaction experiences by integrating text, speech, and vision.
  • Lead the technical R&D of voice foundation models, driving model performance improvements and innovative applications.

Who We Look For

Job Requirements:

  • Master’s or Ph.D. in Computer Science, Artificial Intelligence, Electronic Engineering, Signal Processing, or related fields.
  • Research or development experience in one or more areas: voice foundation models, speech synthesis, speech recognition, audio generation, voice conversion, or speech codec.
  • Familiarity with mainstream voice-enabled large models (e.g., GPT4o, GLM-4-Voice, Qwen2.5-Omni, Voila). Prior project experience is preferred.
  • Proficient in deep learning frameworks (e.g., PyTorch). Experience with large-scale model training frameworks (Megatron/Deepspeed) is a plus.
  • Solid understanding of large model architectures and principles. Experience in large-scale pretraining or post-training is preferred.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $141,480.00 to $265,200.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Other
  • Industries
    Software Development

Referrals increase your chances of interviewing at Tencent by 2x

Sign in to set job alerts for “Research Scientist” roles.

Seattle, WA $9,167.00-$10,834.00 6 days ago

Seattle, WA $136,000.00-$212,800.00 1 week ago

Research Scientist, Global Hiring Science (GHS)

Seattle, WA $117,500.00-$174,000.00 2 weeks ago

Research Scientist, Environmental, Occupational, and Dietary

Seattle, WA $84,900.00-$105,050.00 3 days ago

Redmond, WA $147,000.00-$208,000.00 17 hours ago

Redmond, WA $117,000.00-$173,000.00 5 days ago

Bellevue, WA $146,500.00-$234,500.00 3 weeks ago

Bellevue, WA $177,000.00-$251,000.00 2 weeks ago

Senior Research Scientist, Gender, Vulnerability and Health Equity (*11-Month LTE)

Seattle, WA $186,400.00-$314,900.00 4 days ago

Research Scientist, Holographic Exposure and Process (PhD)

Redmond, WA $117,000.00-$173,000.00 2 weeks ago

Scientist II, Analytical, Antibody and ADC Sciences

Bothell, WA $114,857.00-$141,002.00 1 week ago

Bioinformatics Scientist I – AI Applications

Seattle, WA $90,900.00-$112,400.00 2 months ago

Research Scientist Graduate (Multimodal Interaction and World Model - Pre-Training) - 2025 Start (PhD)
Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Start (MS)
Research Scientist Graduate (High-Performance Computing (Algorithm Acceleration)- Vision AI Platform-Seattle)) - 2025 Start (PhD)

Seattle, WA $177,688.00-$266,000.00 3 days ago

Senior Geologist/Environmental Scientist/Senior Engineer

Bellevue, WA $90,000.00-$120,000.00 9 hours ago

Research Scientist Graduate (Foundation Model, Generative AI) - 2025 Start (PhD)

Seattle, WA $235,000.00-$405,000.00 1 day ago

Bellevue, WA $230,000.00-$300,000.00 1 week ago

UX Research Scientist, Reality Labs (PhD)

Seattle, WA $117,000.00-$173,000.00 2 weeks ago

Research Scientist Graduate- (Foundation Model, Vision and Language) - 2025 Start (PhD)

Seattle, WA $199,500.00-$340,100.00 3 days ago

Research Scientist, GES NA Operations Engineering
Mixed Method Researcher (Quantitative focus) - Remote
Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform-Seattle) - 2025 Start (PhD)

Seattle, WA $177,688.00-$266,000.00 3 days ago

Immunohematology Laboratory Scientist III

Seattle, WA $138,000.00-$192,000.00 2 weeks ago

Bellevue, WA $138,000.00-$192,000.00 4 hours ago

Research Scientist Graduate (Foundation Model, Video Generation) - 2025 Start (PhD)

Seattle, WA $199,500.00-$340,100.00 2 weeks ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.