ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Exploiting information extraction annotations for document retrieval in distillation tasks

Dilek Hakkani-Tür, Gokhan Tur, Michael Levit

Information distillation aims to extract relevant pieces of information related to a given query from massive, possibly multilingual, audio and textual document sources. In this paper, we present our approach for using information extraction annotations to augment document retrieval for distillation. We take advantage of the fact that some of the distillation queries can be associated with annotation elements introduced for the NIST Automatic Content Extraction (ACE) task. We experimentally show that using the ACE events to constrain the document set returned by an information retrieval engine significantly improves the precision at various recall rates for two different query templates.


doi: 10.21437/Interspeech.2007-178

Cite as: Hakkani-Tür, D., Tur, G., Levit, M. (2007) Exploiting information extraction annotations for document retrieval in distillation tasks. Proc. Interspeech 2007, 330-333, doi: 10.21437/Interspeech.2007-178

@inproceedings{hakkanitur07_interspeech,
  author={Dilek Hakkani-Tür and Gokhan Tur and Michael Levit},
  title={{Exploiting information extraction annotations for document retrieval in distillation tasks}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={330--333},
  doi={10.21437/Interspeech.2007-178}
}