Subword unit representations for spoken document retrieval

Ng, Kenney; Zue, Victor W.

doi:10.21437/Eurospeech.1997-460

This paper investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recognition vocabulary in order to cover the contents of growing and diverse message collections. In this study, we examine a range of subword units of varying complexity derived from phonetic transcriptions. The basic underlying unit is the phone; more and less complex units are derived by varying the level of detail and the length of sequences of the phonetic units. We measure the ability of the different subword units to effectively index and retrieve a large collection of recorded speech messages. We also compare their performance when the underlying phonetic transcriptions are perfect and when they contain phonetic recognition errors.
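
As a rough illustration of deriving units "by varying the length of sequences of the phonetic units", the sketch below builds overlapping phone n-grams from a phonetic transcription and counts them as index terms. The abstract does not specify the exact unit definitions, so the overlapping n-gram construction, the function names `phone_ngrams` and `index_terms`, and the sample phone sequence are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter


def phone_ngrams(phones, n):
    """Return every overlapping length-n phone sequence as a tuple.

    Assumption: subword units are modeled here as plain overlapping
    n-grams; the paper may define its units differently.
    """
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def index_terms(phones, n):
    """Count n-gram occurrences so they can serve as index terms."""
    return Counter(phone_ngrams(phones, n))


# Hypothetical phonetic transcription of a short message (made-up labels).
phones = ["w", "eh", "dh", "er", "f", "ao", "r", "k", "ae", "s", "t"]

unigrams = phone_ngrams(phones, 1)  # shortest unit: single phones
trigrams = phone_ngrams(phones, 3)  # longer unit: phone triples
trigram_counts = index_terms(phones, 3)
```

Longer sequences carry more context but are more likely to be broken by a single recognition error, which is one way to read the trade-off the abstract sets out to measure.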

Cite as: Ng, K., Zue, V.W. (1997) Subword unit representations for spoken document retrieval. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1607-1610, doi: 10.21437/Eurospeech.1997-460

@inproceedings{ng97b_eurospeech,
  author={Kenney Ng and Victor W. Zue},
  title={{Subword unit representations for spoken document retrieval}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1607--1610},
  doi={10.21437/Eurospeech.1997-460}
}