
LIMMITS'24: Multi-speaker, Multi-lingual Indic TTS with voice cloning: ICASSP 2024

This challenge continues LIMMITS'23 (ICASSP 23 SPGC) and aims to make further progress in multi-speaker, multi-lingual TTS by extending the problem statement to voice cloning. Enabling voice cloning in multilingual TTS systems expands the possibilities for cross-lingual synthesis for target speakers. In this challenge, participants can perform TTS voice cloning with a multilingual base model of 14 speakers. We further extend this scenario by allowing training with additional multi-speaker corpora such as VCTK and LibriTTS. Finally, we also present a scenario for zero-shot voice conversion. To support these scenarios, we share 560 hours of studio-quality TTS data in 7 Indian languages. Evaluation will cover both mono-lingual and cross-lingual synthesis, using subjective tests of naturalness and speaker similarity.

Visit the challenge website for more details.

 

Technical Committee