LIMMITS'24: Multi-speaker, Multi-lingual Indic TTS with voice cloning: ICASSP 2024

2024

This challenge continues LIMMITS'23 (ICASSP 2023 SPGC) and aims to make further progress in multi-speaker, multi-lingual TTS by extending the problem statement to voice cloning. Enabling voice cloning with multilingual TTS systems expands the possibilities for cross-lingual synthesis for target speakers. In this challenge, participants have the opportunity to perform TTS voice cloning with a multilingual base model of 14 speakers. We further extend this scenario by allowing training with additional multi-speaker corpora such as VCTK and LibriTTS. Finally, we also present a scenario for zero-shot voice conversion. To support these tasks, we share 560 hours of studio-quality TTS data in 7 Indian languages. Evaluation will cover both mono-lingual and cross-lingual synthesis, using subjective tests of naturalness and speaker similarity.

Visit the Challenge website for details and more information!

 

Technical Committee: Speech and Language Processing
