LifeLine Dialogues with Roberta

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10341)

Abstract

This paper describes work on dialogue data collection and dialogue system design for personal assistant humanoid robots undertaken at eNTERFACE 2016. The emphasis has been on the system’s speech capabilities and on the dialogue modeling of what we call LifeLine Dialogues, i.e. dialogues that help people tell stories about their lives. The main goal of this type of application is to help elderly people exercise their speech and memory capabilities. The system also aims to acquire a good level of knowledge about the person’s interests, and is therefore expected to support open-domain conversations that present useful and interesting information to the user. The novel contributions of this work are: (1) a flexible spoken dialogue system that extends the RavenClaw-type agent-based dialogue management model with topic management and multi-modal capabilities, in particular face recognition; (2) a collection of Wizard of Oz (WOZ) data related to initial encounters and to the presentation of information to the user; and (3) the establishment of a closer conversational relationship with the user by exploiting additional data (e.g. context, dialogue history, emotions and user goals).

Notes

  1. http://hmi.ewi.utwente.nl/enterface16/

  2. http://www.intelligentvoice.com/

  3. https://cloud.google.com/speech/

  4. http://share.int-evry.fr/svnview-eph/

  5. https://www.assetstore.unity3d.com/en/#!/content/52234

  6. https://github.com/stephanschloegl/WebWOZ

  7. https://cloud.google.com/speech/

  8. http://mary.dfki.de/

Acknowledgments

The authors want to acknowledge the organizers of eNTERFACE 2016 and the University of Twente for providing the opportunity to develop this project. We also want to acknowledge the institutions supporting some of the authors, e.g. the Spanish Science Ministry under grant TIN2014-54288-C4, ‘ADAPT 13/RC/2106’, and the Academy of Finland project Digital Citizens (grant number 270082).

Corresponding author

Correspondence to María Inés Torres.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

López, A. et al. (2017). LifeLine Dialogues with Roberta. In: Quesada, J., Martín Mateos, F.J., López Soto, T. (eds) Future and Emerging Trends in Language Technology. Machine Learning and Big Data. FETLT 2016. Lecture Notes in Computer Science (LNAI), vol 10341. Springer, Cham. https://doi.org/10.1007/978-3-319-69365-1_6

  • DOI: https://doi.org/10.1007/978-3-319-69365-1_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-69364-4

  • Online ISBN: 978-3-319-69365-1

  • eBook Packages: Computer Science, Computer Science (R0)
