Feature Functions for Tree-Based Dialogue Course Management

Macherey, Klaus; Ney, Hermann

doi:10.1007/1-4020-3075-4_4

Klaus Macherey³ &
Hermann Ney³

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 28))

457 Accesses
1 Citations

Abstract

We propose a set of feature functions for dialogue course management and investigate their effect on the system's behaviour for choosing the subsequent dialogue action during a dialogue session. Especially, we investigate whether the system is able to detect and resolve ambiguities, and if it always chooses that state which leads as quickly as possible to a final state that is likely to meet the user's request. The criteria and data structures used are independent of the underlying domain and can therefore be employed for different applications of spoken dialogue systems. Experiments were performed on a German in-house corpus that covers the domain of a German telephone directory assistance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abella, A. and Gorin, A. L. (1999). Construct Algebra: Analytical dialog management. In Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), pages 191–199, University of Maryland, USA.
Google Scholar
Ammicht, E., Potamianos, A., and Fosler-Lussier, E. (2001). Ambiguity representation and resolution in spoken dialogue systems. In Proceedings of European Conference on Speech Communication and Technology (EURO-SPEECH), pages 2217–2220, Aalborg, Denmark.
Google Scholar
Aust, H., Oerder, M., Seide, F., and Steinbiss, V. (1995). The Philips automatic train timetable information system. Speech Communication, 17:249–262.
Article Google Scholar
Constantinides, P., Hansma, S., Tchou, C., and Rudnicky, A. (1998). A schema based approach to dialog control. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 409–412, Sidney, Australia.
Google Scholar
Hirsch, H.-G. and Pearce, D. (2000). The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In Proceedings of International Workshop on Automatic Speech Recognition: Challenges for the new Millenium, pages 181–188, Paris, France.
Google Scholar
Kanthak, S., Sixtus, A., Molau, S., Schlüter, R., and Ney, H. (2000). Fast search for large vocabulary speech recognition. Verbmobil: Foundations of Speechto-Speech Translation, pages 63–78.
Google Scholar
Levin, E. and Pieraccini, R. (1995). Concept-based spontaneous speech understanding system. In Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH), pages 555–558, Madrid, Spain.
Google Scholar
Lleida, E. and Rose, R. C. (1996). Efficient decoding and training procedures for utterance verification in continuous speech recognition. In Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 507–510, Atlanta, Georgia, USA.
Google Scholar
Macherey, K., Och, F. J., and Ney, H. (2001). Natural language understanding using statistical machine translation. In Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH), pages 2205–2208, Aalborg, Denmark.
Google Scholar
Pearce, D. (2000). Enabling new speech driven services for mobile devices: An overview of the ETSI standards activities for distributed speech recognition front-ends. In Proceedings of Applied Voice Input/Output Society Conference, San Jose, California, USA.
Google Scholar
Potamianos, A., Ammicht, E., and Kuo, H.-K. J. (2000). Dialogue management in the Bell Labs Communicator system. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 603–606, Beijing, China.
Google Scholar
Seneff, S. and Polifroni, J. (1996). A new restaurant guide conversational system: Issues in rapid prototyping for specialized domains. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 665–668, Philadelphia, Pennsylvania, USA.
Google Scholar
Weintraub, M., Beaufays, F., Rivlin, Z., Konig, Y., and Stolcke, A. (1997). Neural-network based measures of confidence for word recognition. In Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 887–890, Munich, Germany.
Google Scholar
Wessel, F., Macherey, K., and Schlüter, R. (1998). Using word probabilities as confidence measures. In Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 225–228, Seattle, Washington, USA.
Google Scholar
Wessel, F., Schlüter, R., Macherey, K., and Ney, H. (2001). Confidence measures for large vocabulary continuous speech recognition. IEEE Transactions on Speech and Audio Processing, 9(3):288–298.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Lehrstuhl für Informatik VI, Computer Science Department, RWTH Aachen — University of Technology, Aachen, Germany
Klaus Macherey & Hermann Ney

Authors

Klaus Macherey
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Ney
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Ulm, Germany
W. Minker & Dirk Bühler &
University of Southern Denmark, Odense, Denmark
Laila Dybkjær

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Macherey, K., Ney, H. (2005). Feature Functions for Tree-Based Dialogue Course Management. In: Minker, W., Bühler, D., Dybkjær, L. (eds) Spoken Multimodal Human-Computer Dialogue in Mobile Environments. Text, Speech and Language Technology, vol 28. Springer, Dordrecht. https://doi.org/10.1007/1-4020-3075-4_4

Download citation

DOI: https://doi.org/10.1007/1-4020-3075-4_4
Published: 17 August 2005
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-3073-4
Online ISBN: 978-1-4020-3075-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics