Multi-Speaker Multi-Style Voice Cloning Challenge (M2VoC) (ICASSP 2021)

2021

Associated SPS Event: IEEE ICASSP 2021 Grand Challenge

Text-to-speech (TTS), or speech synthesis, has improved significantly with the help of deep learning. The latest advances in the end-to-end text-to-speech paradigm and in neural vocoders have made it possible to produce very realistic and natural-sounding synthetic speech that reaches almost human-parity performance. But this ability is still limited to ideal scenarios with a large, single-speaker, less expressive training set. Speech quality, target similarity, expressiveness and robustness are still unsatisfactory for synthetic speech with different speakers and various styles, especially in real-world low-resource conditions, e.g., when each speaker has only a few samples at hand. Current open solutions are also not robust enough to unseen speakers. We call this challenging task multi-speaker multi-style voice cloning (M2VoC).

Recent advances in transfer learning, style transfer, speaker embedding and factor disentanglement have shed light on potential solutions to low-resource voice cloning.

As an ICASSP 2021 Signal Processing Grand Challenge, the M2VoC challenge aims to provide a sizable common dataset as well as a fair testbed for benchmarking the voice cloning task. We strongly encourage researchers from both academia and industry to join the challenge and to engage in in-depth discussion and collaboration. For further details, visit the website.

Technical Committee: Audio and Acoustic Signal Processing, Speech and Language Processing
