1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.
IEEE ICASSP 2024 Website | Monday, 15 April 2024 | 2024 SP Cup Official Document
[Sponsored by the MathWorks and IEEE Signal Processing Society]
A speaker recognition system authenticates the identity of claimed users from a speech utterance. For a given speech segment called enrollment and a speech segment from a claimed user, the speaker recognition system will determine automatically whether both segments belong to the same speaker or not. The state-of-the-art speaker recognition systems mainly use Deep Neural Networks (DNN) to extract fixed-length speaker discriminant representations called speaker embeddings. The decision to accept or reject a speaker will be made by comparing speaker embeddings. The DNN-based speaker verification systems perform well in general, but there are some challenges that reduce their performance dramatically. Far-field speaker recognition is among the well-known challenges facing speaker recognition systems. The far-field challenge is intertwined with other variabilities such as noise and reverberation. Two main categories of speaker recognition systems are text-dependent speaker recognition and text-independent speaker recognition. In a text-dependent speaker recognition system, the speaker’s voice is recorded from predefined phrases, while, in text-independent speaker recognition, there is no constraint on the content of the spoken dialogue. The task of the IEEE Signal Processing Cup 2024 is text-independent far-filed speaker recognition under noise and reverberation for a mobile robot.
The Robovox challenge is concerned with doing far-field speaker verification from speech signals recorded by a mobile robot at variable distances in the presence of noise and reverberation. Although there are some benchmarks in this domain such as VoiCes and FFSVC, they don’t cover variabilities in the domain of robotics such as the robot’s internal noise and the angle between the speaker and the robot. The VoiCes dataset is replayed speech recorded under different acoustical noises. A main drawback of the VoiCes is that it was recorded from played signals whereas our dataset is recorded with people speaking in noisy environments. The FFSVC is another far-field speaker recognition benchmark. However, these benchmarks helped the community significantly, we are introducing a new benchmark for far-field speaker recognition systems in order to address some new aspects. Firstly, our goal is to perform speaker recognition in a real application for the domain of mobile robots. In this domain, there are other variabilities that have not been addressed in previous benchmarks: the robot’s internal noise and the angle between the speaker and the robot. Furthermore, the speech signal has been recorded for different distances between the speaker and the robot. In the proposed challenge the following variabilities are present:
Full technical details, dataset(s), evaluation metrics, and all other pertinent information about the competition is located in the 2024 SP Cup Official Document (above).
GRAND PRIZE - Team: "IITH"
Indian Institute of Technology Hyderabad
Supervisor: Sri Rama Murty Kodukula
Tutor: Sreekanth Sankala
Students: Tejadhith Sankar, Atharv Ramesh, Nair Vaideeswaran A P, Himanshu Kumar Gupta, Anirudh Srinivasan
FIRST RUNNER-UP-Team: "HYU ASML"
Hanyang University, Seoul, Republic of Korea
Supervisor: Joon-Hyuk Chang
Tutor: Jeong-Hwan Choi
Students: Hee-Jae Lee, Hyun-Soo Kim, Seyun Ahn, Gaeun Kim
SECOND RUNNER-UP - Team: "Pseudo Spectrum"
Rajshahi University of Engineering and Technology
Supervisor: Mohammod Abdul Motin
Tutor: Md Abdur Raiyan
Students: A Md Anik Hasan, Sapnil Sarker Bipro, Muhammad Sudipto Siam Dip
Competition Organizers (technical, competition-specific inquiries): Mohammad MOHAMMADAMINI
SPS Staff (Terms & Conditions, Travel Grants, Prizes): Jaqueline Rash, SPS Membership Program and Events Administrator
SPS Student Services Committee: Angshul Majumdar, Chair
This competition is sponsored by the IEEE Signal Processing Society and MathWorks: