AI Engineer - Speech & Voice

Acolad

Boulogne-Billancourt

On-site

EUR 45,000 - 75,000

Full-time

Posted 22 days ago

Job summary

A leading company in content and language solutions is seeking an AI Engineer specializing in speech technologies. This role involves developing and optimizing real-time speech AI solutions and requires advanced qualifications. Candidates will benefit from a collaborative environment focused on innovation and professional growth.

Benefits

Competitive compensation
Comprehensive benefits
Opportunities for professional growth

Qualifications

  • Experience with ASR, TTS, or speech translation systems, with an emphasis on low latency.
  • Hands-on experience optimizing AI solutions for real-time performance.
  • Strong Python programming and software development skills.

Responsibilities

  • Design and implement low-latency AI models for Speech-to-Speech interpreting.
  • Enhance AI models for speed and efficiency, targeting sub-300ms latency.
  • Collaborate with product, engineering, and linguistic teams for deployable AI solutions.

Skills

Problem Solving
Communication
Software Development

Education

Master's or Ph.D. in Computer Science

Tools

Deep Learning Frameworks
Python

Job description

Acolad, the global leader in content and language solutions, helps companies across industries scale and grow through cutting-edge technology and localization expertise. Established in 1993, the group operates in 23 countries across Europe, North America, and Asia, with over 1,800 employees and a network of more than 20,000 linguists worldwide.

At Acolad, every position is key to our global growth: we believe our success depends on the success of our people.

Joining Acolad offers a unique opportunity for professional development within a collaborative, global environment that fosters talent and creativity. We are continually seeking new talent to support our mission of driving growth and innovation for some of the world’s leading brands.

We are seeking a highly driven and pragmatic AI Engineer with expertise in speech technologies. This hands-on role involves managing the entire lifecycle of our real-time speech AI solutions, from rapid prototyping and development to deployment and ongoing optimization. We need someone who is eager to solve complex engineering problems, delivers tangible results, and has a proven track record of building practical, low-latency AI solutions.

Key Responsibilities

  1. Development & Implementation: Design, build, and deploy production-grade, low-latency AI models for real-time Speech-to-Speech (S2S) interpreting, Speech-to-Text (STT), and Text-to-Speech (TTS). Focus on practical application and functionality.
  2. Performance Optimization: Enhance AI models and pipelines for speed and efficiency, targeting sub-300ms latency for seamless communication. Address streaming audio, incremental processing, and inference efficiency.
  3. System Architecture & Integration: Lead the design and implementation of scalable, high-performance AI systems integrated with our SaaS platform. Identify and resolve bottlenecks.
  4. Data Management: Work with large speech datasets, applying augmentation, cleaning, and preprocessing to improve model performance.
  5. Problem Solving & Innovation: Identify core challenges and develop practical solutions quickly, emphasizing real-world impact over academic research.
  6. Cross-Functional Collaboration: Collaborate with product, engineering, and linguistic teams to translate requirements into deployable AI solutions.
  7. Project Ownership: Manage projects from concept through deployment, ensuring high-quality code, testing, and reliable performance in real-time environments.

Qualifications

  • Master's or Ph.D. in Computer Science, Electrical Engineering, or related fields, with practical application experience.
  • Hands-on experience with ASR, TTS, or speech translation systems, with an emphasis on low latency.
  • Proficiency in deep learning frameworks, with experience in training, fine-tuning, and deploying models.
  • Experience optimizing AI solutions for real-time performance, including audio streaming and inference.
  • Strong Python programming skills and experience building robust software.
  • Proven ability to solve complex problems and deliver results.
  • Excellent communication skills and ability to work independently or in teams.

Preferred Qualifications

  • Experience with speech-to-speech translation in a commercial setting.
  • Knowledge of digital signal processing (DSP) for audio.
  • Experience with edge deployment or on-device AI optimization.
  • Background in localization, interpreting, or real-time communication industries.

We offer an impactful role in building next-generation interpreting capabilities, within a dynamic environment that values ownership, rapid iteration, and a results-driven mentality. The package includes competitive compensation, comprehensive benefits, and opportunities for professional growth.

Acolad is committed to diversity, equity, and inclusion, welcoming candidates from all backgrounds to apply and join our team.
