Abstract
We present an intelligent embodied conversation agent with linguistic, social, and emotional competence. Unlike the vast majority of state-of-the-art conversation agents, the proposed agent is built around an ontology-based knowledge model that enables flexible, reasoning-driven dialogue planning instead of relying on predefined dialogue scripts. It is complemented by multimodal communication analysis and generation modules and by a search engine that retrieves from the web the multimedia background content needed to conduct a conversation on a given topic. The evaluation of the first prototype shows a high degree of user acceptance of the agent with respect to, among other criteria, its trustworthiness and naturalness. The individual technologies are being further improved in the second prototype.
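To make the contrast with scripted dialogue concrete, the following is a minimal, hypothetical sketch (not the KRISTINA implementation) of reasoning-driven dialogue planning: the next system move is derived by reasoning over a small ontology-style knowledge model (here, transitive is-a subsumption) rather than read off a fixed script. All names (`KnowledgeModel`, `plan_move`, the concepts) are illustrative assumptions.

```python
# Hypothetical sketch of ontology-driven dialogue planning; not KRISTINA's code.
from dataclasses import dataclass, field


@dataclass
class Concept:
    name: str
    parents: set = field(default_factory=set)  # is-a links to parent concepts


class KnowledgeModel:
    """A toy ontology: named concepts linked by is-a relations."""

    def __init__(self):
        self.concepts = {}

    def add(self, name, *parents):
        self.concepts[name] = Concept(name, set(parents))

    def is_a(self, name, ancestor):
        # Transitive closure over is-a links (subsumption reasoning).
        if name == ancestor:
            return True
        return any(self.is_a(p, ancestor) for p in self.concepts[name].parents)


def plan_move(kb, user_topic, known_facts):
    # The move is entailed by the knowledge model, not by a scripted
    # state machine: unknown concepts trigger clarification, health-related
    # topics without stored facts trigger background retrieval from the web.
    if user_topic not in kb.concepts:
        return ("request_clarification", user_topic)
    if kb.is_a(user_topic, "HealthIssue") and user_topic not in known_facts:
        return ("retrieve_background", user_topic)
    return ("inform", user_topic)


kb = KnowledgeModel()
kb.add("HealthIssue")
kb.add("Dementia", "HealthIssue")  # Dementia is-a HealthIssue
kb.add("Recipe")

print(plan_move(kb, "Dementia", set()))  # ('retrieve_background', 'Dementia')
print(plan_move(kb, "Recipe", set()))    # ('inform', 'Recipe')
print(plan_move(kb, "Quantum", set()))   # ('request_clarification', 'Quantum')
```

Because the planner queries the model rather than a script, adding a new concept (e.g. a further subclass of `HealthIssue`) changes the agent's behavior without touching the planning code — the flexibility the abstract attributes to the ontology-based design.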
Notes
- 1.
Due to lack of space, we cannot present a complete run of an interaction turn. In what follows, we therefore merely introduce the individual modules and sketch how they interact.
Acknowledgments
The presented work is funded by the European Commission as part of the H2020 Programme, under the contract number 645012–RIA. Many thanks to our colleagues from the University of Tübingen, German Red Cross and semFYC for the definition of the use cases, constant feedback, and evaluation!
© 2017 Springer International Publishing AG
Cite this paper
Wanner, L., et al.: KRISTINA: a knowledge-based virtual conversation agent. In: Demazeau, Y., Davidsson, P., Bajo, J., Vale, Z. (eds.) Advances in Practical Applications of Cyber-Physical Multi-Agent Systems: The PAAMS Collection. PAAMS 2017. LNCS, vol. 10349. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59930-4_23
Print ISBN: 978-3-319-59929-8
Online ISBN: 978-3-319-59930-4