NIST 2015 Language Recognition i-Vector Machine Learning Challenge

November 2015

Article Authors:

Craig S. Greenberg, Désiré Bansé, John M. Howard, Alvin F. Martin, George R. Doddington, Audrey Tong, Daniel Garcia-Romero, Jaime Hernández-Cordero, Lisa P Mason, Alan McCree, Douglas A Reynolds, Elliot Singer

SLTC Newsletter, November 2015

OVERVIEW

Modeled after the successful NIST Speaker Recognition i-Vector Machine Learning Challenge held in 2013-2014 [1], NIST launched a Language Recognition i-Vector Machine Learning Challenge in 2015, focused on open-set language identification. The Challenge used data from previous NIST Language Recognition Evaluations (LREs) and other LDC and IARPA corpora [2]. Rather than distributing audio data as in LREs, NIST distributed 400-dimensional i-vectors produced by a state-of-the-art system from MITLL and the JHU HLT Center of Excellence. Using the i-vector representation made the evaluation more accessible to participants from outside the audio processing community. It also allowed a more direct comparison of the different back-ends, because it removed the burden of audio processing and gave all participants a common system front-end.
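To make "open-set" concrete, here is a minimal sketch of one simple back-end that operates directly on i-vectors: cosine scoring against per-language mean vectors, with a rejection threshold for out-of-set inputs. It is not claimed to be the Challenge baseline or any participant's system. The function names, the threshold value, the label "out_of_set", and the synthetic data are all assumptions made for illustration.

```python
import math
import random

DIM = 400  # i-vector dimensionality stated in the article


def unit(v):
    """Scale a vector to unit length (length normalization)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def mean_vector(vectors):
    """Average a list of equal-length vectors component-wise."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def train(labeled):
    """Build one unit-length model per target language from {language: [i-vectors]}."""
    return {lang: unit(mean_vector([unit(v) for v in vecs]))
            for lang, vecs in labeled.items()}


def classify(models, ivector, threshold=0.5, oos_label="out_of_set"):
    """Return the best-scoring language, or oos_label if no cosine score reaches threshold."""
    x = unit(ivector)
    scores = {lang: sum(a * b for a, b in zip(m, x)) for lang, m in models.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else oos_label


def synthetic(center, n, rng, noise=0.5):
    """Draw n noisy copies of a center vector (a stand-in for real i-vectors)."""
    return [[c + rng.gauss(0.0, noise) for c in center] for _ in range(n)]


# ASSUMPTION: two made-up languages and synthetic vectors, not Challenge data.
rng = random.Random(0)
centers = {name: [rng.gauss(0.0, 1.0) for _ in range(DIM)]
           for name in ("lang_a", "lang_b")}
models = train({name: synthetic(c, 20, rng) for name, c in centers.items()})

in_set = synthetic(centers["lang_a"], 1, rng)[0]
unseen = [rng.gauss(0.0, 1.0) for _ in range(DIM)]  # unrelated to any target language
print(classify(models, in_set))
print(classify(models, unseen))
```

With this synthetic data the first vector should be assigned to lang_a and the second rejected as out-of-set. Because every participant starts from the same i-vectors, only functions like `train` and `classify` differ between systems, which is what makes the back-end comparison direct.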

The Challenge covered 50 target languages and a set of unnamed “out-of-set” languages. Labeled training data (300 i-vectors per language) were provided for the target languages, and a set of approximately 6,500 unlabeled i-vectors covering the target and out-of-set languages was provided for development. The test set consisted of approximately 6,500 test segments covering the target and out-of-set languages. Unlike traditional LREs, where audio segments nominally contain 3, 10, or 30 seconds of speech, the speech durations of the audio segments used to create the i-vectors for the Challenge were sampled from a log-normal distribution with a mean of approximately 35 s.
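The duration sampling can be illustrated with a short sketch. The article gives only the approximate mean (35 s); the shape parameter `SIGMA` below is an assumption chosen for illustration, and the location parameter is derived from it using the log-normal identity mean = exp(mu + sigma^2 / 2).

```python
import math
import random

TARGET_MEAN_S = 35.0  # approximate mean stated in the article
SIGMA = 0.8           # ASSUMPTION: the shape parameter is not given in the article

# For a log-normal variable, mean = exp(mu + sigma^2 / 2), so solve for mu.
MU = math.log(TARGET_MEAN_S) - SIGMA ** 2 / 2

rng = random.Random(0)
durations = [rng.lognormvariate(MU, SIGMA) for _ in range(100_000)]

sample_mean = sum(durations) / len(durations)
median = sorted(durations)[len(durations) // 2]
print(f"sample mean   ~ {sample_mean:.1f} s")
print(f"sample median ~ {median:.1f} s (theoretical: {math.exp(MU):.1f} s)")
```

A log-normal distribution is right-skewed, so its median lies below its mean: under this assumed shape, most segments are shorter than 35 s while a minority are much longer. This contrasts with the fixed nominal durations of traditional LREs.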

PARTICIPATION

The participation level in the Language Recognition i-Vector Machine Learning Challenge was the highest in LRE history. A total of 148 participants from 31 countries registered to take part in the Challenge, and 59 of them submitted a total of 3,877 system outputs. By comparison, the most recent NIST LRE had 22 participants and 58 submissions. This large increase in both participants and submissions suggests that the Challenge succeeded in reaching a broader community.

PERFORMANCE RESULTS

At the end of the official Challenge period, 93% of Challenge participants had submitted a system that outperformed a pre-defined baseline system. The leading system in the Challenge demonstrated an approximately 55% relative improvement over the baseline.
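As a worked illustration of "relative improvement", the sketch below assumes a lower-is-better cost metric, which the article does not specify. The two cost values are hypothetical, picked only so that the arithmetic yields 55%; they are not Challenge results.

```python
def relative_improvement(baseline_cost, system_cost):
    """Fractional cost reduction relative to the baseline (lower cost is better)."""
    if baseline_cost <= 0:
        raise ValueError("baseline_cost must be positive")
    return (baseline_cost - system_cost) / baseline_cost


# HYPOTHETICAL costs chosen only to illustrate the arithmetic.
baseline = 0.40
system = 0.18
print(f"{relative_improvement(baseline, system):.0%}")
```

Under this definition, a 55% relative improvement means the leading system's cost was a little under half of the baseline's.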

MORE INFORMATION

NIST intends to organize a discussion of the 2015 challenge and results, possibly at the 2016 Odyssey Speaker and Language Recognition Workshop, to be held during June of 2016 in Bilbao, Spain [3].

For more information about the Challenge itself, see the plan at http://www.nist.gov/itl/iad/mig/upload/lre_ivectorchallenge_rel_v2.pdf. Please note that although the official period for the Challenge is over and the leaderboard is no longer being updated, the scoring platform is still available for experimentation. To conduct your own LRE i-vector experiments, visit the Challenge platform at https://ivectorchallenge.nist.gov. If you have comments, corrections, or additions to this article, please contact ivector_poc@nist.gov.

[1] Bansé, Désiré; Doddington, George R; Garcia-Romero, Daniel; Godfrey, John J; Greenberg, Craig S; Martin, Alvin F; McCree, Alan; Przybocki, Mark; Reynolds, Douglas A; “Summary and Initial Results of the 2013-2014 Speaker Recognition i-vector Machine Learning Challenge”; Fifteenth Annual Conference of the International Speech Communication Association; 2014.
[2] Data used includes BABEL data from babel103b-v0.4b, babel101b-v0.4c, babel201b-v0.2b, IARPA-babel203b-v3.1a, babel104b-v0.4bY, babel106b-v0.2g, babel105b-v0.4, babel107b-v0.7, babel206b-v0.1d.
[3] http://www.odyssey2016.org/

Craig S. Greenberg, Désiré Bansé, Alvin F. Martin, George R. Doddington, and Audrey Tong are with NIST. John M. Howard is with Systems Plus. Daniel Garcia-Romero and Alan McCree are with The Johns Hopkins University, HLT-COE. Jaime Hernández-Cordero and Lisa P Mason are with US DoD. Douglas A Reynolds and Elliot Singer are with MIT Lincoln Labs.
