ISCA Archive Interspeech 2004
ISCA Archive Interspeech 2004

Phonetic confusion based document expansion for spoken document retrieval

Nicolas Moreau, Hyoung-Gook Kim, Thomas Sikora

This paper presents a phone-based approach of spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. We describe an indexing and retrieval system that uses phonetic information only. The retrieval method is based on the vector space IR model, using phone N-grams as indexing terms. We propose a technique to expand the representation of documents by means of phone confusion probabilities in order to improve the retrieval performance. This method is tested on a collection of short German spoken documents, using 10 city names as queries.


doi: 10.21437/Interspeech.2004-44

Cite as: Moreau, N., Kim, H.-G., Sikora, T. (2004) Phonetic confusion based document expansion for spoken document retrieval. Proc. Interspeech 2004, 1593-1596, doi: 10.21437/Interspeech.2004-44

@inproceedings{moreau04_interspeech,
  author={Nicolas Moreau and Hyoung-Gook Kim and Thomas Sikora},
  title={{Phonetic confusion based document expansion for spoken document retrieval}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={1593--1596},
  doi={10.21437/Interspeech.2004-44}
}