The zero resource speech challenge 2015

Versteegh, Maarten; Thiollière, Roland; Schatz, Thomas; Cao, Xuan Nga; Anguera, Xavier; Jansen, Aren; Dupoux, Emmanuel

doi:10.21437/Interspeech.2015-638

The zero resource speech challenge 2015

Maarten Versteegh, Roland Thiollière, Thomas Schatz, Xuan Nga Cao, Xavier Anguera, Aren Jansen, Emmanuel Dupoux

The Interspeech 2015 Zero Resource Speech Challenge aims at discovering subword and word units from raw speech. The challenge provides the first unified and open source suite of evaluation metrics and data sets to compare and analyse the results of unsupervised linguistic unit discovery algorithms. It consists of two tracks. In the first, a psychophysically inspired evaluation task (minimal pair ABX discrimination) is used to assess how well speech feature representations discriminate between contrastive subword units. In the second, several metrics gauge the quality of discovered word-like patterns. Two data sets are provided, one for English, one for Xitsonga. Both data sets are provided without any annotation except for voice activity and talker identity. This paper introduces the evaluation metrics, presents the results of baseline systems and discusses some of the key issues in unsupervised unit discovery.

doi: 10.21437/Interspeech.2015-638

Cite as: Versteegh, M., Thiollière, R., Schatz, T., Cao, X.N., Anguera, X., Jansen, A., Dupoux, E. (2015) The zero resource speech challenge 2015. Proc. Interspeech 2015, 3169-3173, doi: 10.21437/Interspeech.2015-638

@inproceedings{versteegh15_interspeech,
  author={Maarten Versteegh and Roland Thiollière and Thomas Schatz and Xuan Nga Cao and Xavier Anguera and Aren Jansen and Emmanuel Dupoux},
  title={{The zero resource speech challenge 2015}},
  year=2015,
  booktitle={Proc. Interspeech 2015},
  pages={3169--3173},
  doi={10.21437/Interspeech.2015-638}
}