Audio AI Engineer

Pantera Capital

Singapore

Hybrid

SGD 80,000 - 120,000

Full time

30+ days ago

Job summary

A technology firm in Singapore is seeking an experienced Audio AI Engineer to research and develop cutting-edge algorithms for accent and voice conversion, with a focus on real-time communication systems. Ideal candidates have a PhD in a relevant field, proficiency in deep learning frameworks, and experience with programming languages like Python and C/C++. This position offers a hybrid work model and a comprehensive benefits package.

Benefits

  • Variety of perks and benefits
  • Support for mental and physical health
  • Work-life balance

Qualifications

  • More than 2 years of relevant industry experience considered a plus.
  • Understanding of streaming, accent conversion, voice conversion, TTS, or ASR.

Responsibilities

  • Research and develop algorithms for accent conversion and speech recognition.
  • Prototype audio models that enhance intelligibility and maintain speaker identity.
  • Evaluate and optimize model performance regarding quality, latency, and scalability.

Skills

  • Deep learning frameworks (e.g., PyTorch, TensorFlow)
  • Programming skills in Python
  • C/C++ programming skills
  • Sequence modeling architectures
  • Real-time speech or audio model development
  • Model compression techniques
  • Experience with real-time audio systems
  • Publications in top-tier conferences

Education

PhD or equivalent experience in a relevant field

Job description

What you can expect

As an Audio AI Engineer, you will research and develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition on low-latency streaming architectures. You’ll prototype and refine end-to-end audio models that enhance intelligibility and naturalness while maintaining speaker identity. Working closely with product and platform teams, you’ll help bring these models into real-time communication systems. You will also evaluate and optimize model performance across dimensions such as quality, latency, and scalability. Staying current with advances in speech processing, you’ll contribute to innovation through patents and internal knowledge sharing.

About the Team

Zoom's Audio team develops real-time audio features based on AI algorithms. Team members work around the world, including in the U.S., China, and Singapore.

What we’re looking for

  • Hold a PhD, or have equivalent experience, in a field relevant to streaming, accent conversion, voice conversion, TTS, or ASR. More than 2 years of relevant industry experience is considered a plus.
  • Show proficiency in deep learning frameworks like PyTorch or TensorFlow.
  • Demonstrate effective programming skills in Python, C/C++, or similar languages.
  • Have an understanding of sequence modeling architectures (Transformers, RNNs, diffusion models, or conformers).
  • Demonstrate experience developing and deploying low-latency, real-time speech or audio models with streaming architectures and optimized pipelines.
  • Show familiarity with model compression and acceleration techniques, including quantization, pruning, and distillation.
  • Exhibit experience working with real-time audio systems in networked communication environments.
  • Have published in top-tier conferences such as ICASSP, INTERSPEECH, NeurIPS, or ICLR.

Ways of Working

Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

Benefits

As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.

About Us

Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars. We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Find room to grow with opportunities to stretch your skills and advance your career in a collaborative, growth-focused environment.

Our Commitment

At Zoom, we believe great work happens when people feel supported and empowered. We’re committed to fair hiring practices that ensure every candidate is evaluated based on skills, experience, and potential. If you require an accommodation during the hiring process, let us know—we’re here to support you at every step.

If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon.

This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.
