Audio-Aware Spoken Multiple-Choice Question Answering With Pre-Trained Language Models
Spoken multiple-choice question answering (SMCQA) requires machines to select the correct choice to answer the question by referring to the passage,…
Read moreSpoken multiple-choice question answering (SMCQA) requires machines to select the correct choice to answer the question by referring to the passage,…
Read moreSarcasm is commonly used in today's social media platforms such as Twitter and Reddit. Sarcasm detection is necessary for analysing people's real…
Read moreAttention-based end-to-end (E2E) automatic speech recognition (ASR) architectures are now the state-of-the-art in terms of recognition performance…
Read moreAutomatic speech recognition (ASR) technologies have been significantly advanced in the past few decades. However, recognition of overlapped speech…
Read moreIn music source separation, the number of sources may vary for each piece and some of the sources may belong to the same family of instruments, thus…
Read moreA key task for speech recognition systems is to reduce the mismatch between training and evaluation data that is often attributable to speaker…
Read moreMost of the existing feature representations for spoofing countermeasures consider information either from the magnitude or phase spectrum. We…
Read moreVoice and face are two most popular biometrics for person verification, usually used in speaker verification and face verification tasks. It has…
Read moreGeometry calibration is an inherent challenge in distributed acoustic sensor networks. To mitigate this problem, a passive geometry calibration…
Read moreSpeaker diarization is an important problem that is topical, and is especially useful as a preprocessor for conversational speech related…
Read more