skip to main content
10.1145/3579051.3579071acmotherconferencesArticle/Chapter ViewAbstractPublication PagesijckgConference Proceedingsconference-collections
research-article

Improving Empathetic Dialogue Generation with Semantics Decoupling

Published:13 February 2023Publication History

ABSTRACT

Empathetic dialogue generation is dedicated to generating responses to empathize with users by perceiving and understanding context emotions and dialogue situations. Existing works typically emphasize that an empathetic response needs to express suitable emotion through perceiving context emotion but ignore the equal need to express informative content in response by understanding the dialogue situation. To this end, we propose a novel empathetic dialogue generation model abbreviated as EmpDGM, which is extended based on the Transformer by a semantics decoupler and empathetic generator. Specifically, the semantics decoupler can effectively decouple emotion semantics and content semantics in the input sequence using adversarial training and multi-task learning meanwhile ensuring the obtained content semantics is complete. And the empathetic generator introduces a gated fusion mechanism to fuse content semantics and context emotion embedding in a balanced manner throughout the whole generation process, which overcomes generally incorporating context emotion embedding as part of initial embedding in the generation module leading the insufficient emotion expression. We conduct automatic evaluation and manual evaluation on the benchmark dataset EMPATHETICDIALOGUES of empathetic dialogue generation. Experimental results reveal that our EmpDGM outperforms advanced baselines in both emotion perceptivity and content quality and generates more informative and affective responses.

References

  1. Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E Hinton. 2016. Layer normalization. arxiv:1607.06450Google ScholarGoogle Scholar
  2. John S. Callender. 2015. Being amoral: Psychopathy and moral incapacity. British Journal of Psychiatry 207, 3 (2015), 274–275.Google ScholarGoogle ScholarCross RefCross Ref
  3. Mark H Davis. 1983. Measuring individual differences in empathy: Evidence for a multidimensional approach. Journal of Personality and Social Psychology 44, 1(1983), 113–126.Google ScholarGoogle ScholarCross RefCross Ref
  4. Robert Elliott, Arthur C Bohart, Jeanne C Watson, and David Murphy. 2019. Therapist empathy and client outcome: An updated meta-analysis. Psychotherapy 55, 4 (2019), 399.Google ScholarGoogle ScholarCross RefCross Ref
  5. Kathleen Kara Fitzpatrick, Alison Darcy, and Molly Vierhile. 2017. Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): A randomized controlled trial. JMIR Ment Health 4, 2 (2017), e19.Google ScholarGoogle ScholarCross RefCross Ref
  6. Jun Gao, Yuhan Liu, Haolin Deng, Wei Wang, Yu Cao, Jiachen Du, and Ruifeng Xu. 2021. Improving empathetic response generation by recognizing emotion cause in conversations. In Findings of the Association for Computational Linguistics: the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP’21). Association for Computational Linguistics, Punta Cana, Dominican Republic, 807–819.Google ScholarGoogle ScholarCross RefCross Ref
  7. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR’15). OpenReview.net, San Diego, USA.Google ScholarGoogle Scholar
  8. Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A diversity-promoting objective function for neural conversation models. In Proceedings of the 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT’16). Association for Computational Linguistics, San Diego, USA, 110–119.Google ScholarGoogle ScholarCross RefCross Ref
  9. Qintong Li, Hongshen Chen, Zhaochun Ren, Pengjie Ren, Zhaopeng Tu, and Zhumin Chen. 2020. EmpDG: Multi-resolution interactive empathetic dialogue generation. In Proceedings of the 28th International Conference on Computational Linguistics (COLING’2020). International Committee on Computational Linguistics, Barcelona, Spain (Online), 4454–4466.Google ScholarGoogle ScholarCross RefCross Ref
  10. Zhaojiang Lin, Andrea Madotto, Jamin Shin, Peng Xu, and Pascale Fung. 2019. MoEL: Mixture of empathetic listeners. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP’19). Association for Computational Linguistics, Hong Kong, China, 121–132.Google ScholarGoogle ScholarCross RefCross Ref
  11. Chia-Wei Liu, Ryan Lowe, Iulian Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. 2016. How NOT to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP’16). Association for Computational Linguistics, Austin, USA, 2122–2132.Google ScholarGoogle ScholarCross RefCross Ref
  12. Siyang Liu, Chujie Zheng, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong Jiang, and Minlie Huang. 2021. Towards emotional support dialog systems. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP’21). Association for Computational Linguistics, Virtual Event, 3469–3483.Google ScholarGoogle ScholarCross RefCross Ref
  13. Navonil Majumder, Pengfei Hong, Shanshan Peng, Jiankun Lu, Deepanway Ghosal, Alexander F. Gelbukh, Rada Mihalcea, and Soujanya Poria. 2020. MIME: MIMicking emotions for empathetic response generation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP’20). Association for Computational Linguistics, Online, 8968–8979.Google ScholarGoogle ScholarCross RefCross Ref
  14. Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL’02). Association for Computational Linguistics, Philadelphia, USA, 311–318.Google ScholarGoogle Scholar
  15. Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14). Association for Computational Linguistics, Doha, Qatar, 1532–1543.Google ScholarGoogle ScholarCross RefCross Ref
  16. Hannah Rashkin, Eric Michael Smith, Margaret Li, and Y-Lan Boureau. 2019. Towards empathetic open-domain conversation models: A new benchmark and dataset. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL’19). Association for Computational Linguistics, Florence, Italy, 5370–5381.Google ScholarGoogle ScholarCross RefCross Ref
  17. Sahand Sabour, Chujie Zheng, and Minlie Huang. 2022. CEM: Commonsense-aware empathetic response generation. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI’22). AAAI Press, Virtual Conference, 11229–11237.Google ScholarGoogle ScholarCross RefCross Ref
  18. Lei Shen, Jinchao Zhang, Jiao Ou, Xiaofang Zhao, and Jie Zhou. 2021. Constructing emotional consensus and utilizing unpaired data for empathetic dialogue generation. In Findings of the Association for Computational Linguistics: the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP’21). Association for Computational Linguistics, Punta Cana, Dominican Republic, 3124–3134.Google ScholarGoogle ScholarCross RefCross Ref
  19. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the 31st Conference on Neural Information Processing Systems (NeurIPS’17). Curran Associates, Inc., Long Beach, USA, 5998–6008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Oriol Vinyals and Quoc Le. 2015. A neural conversational model. arxiv:1506.05869Google ScholarGoogle Scholar
  21. Liuping Wang, Dakuo Wang, Feng Tian, Zhenhui Peng, Xiangmin Fan, Zhan Zhang, Mo Yu, Xiaojuan Ma, and Hongan Wang. 2021. CASS: Towards building a social-support chatbot for online health community. Proceedings of the ACM on Human-Computer Interaction 5, CSCW1(2021), 1–31.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2020. BERTScore: Evaluating text generation with BERT. In Proceedings of the 8th International Conference on Learning Representations (ICLR’20). OpenReview.net, Addis Ababa, Ethiopia.Google ScholarGoogle Scholar
  23. Chujie Zheng, Yong Liu, Wei Chen, Yongcai Leng, and Minlie Huang. 2021. CoMAE: A multi-factor hierarchical framework for empathetic response generation. In Findings of the Association for Computational Linguistics: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP’21). Association for Computational Linguistics, Virtual Event, 813–824.Google ScholarGoogle Scholar

Index Terms

  1. Improving Empathetic Dialogue Generation with Semantics Decoupling

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      IJCKG '22: Proceedings of the 11th International Joint Conference on Knowledge Graphs
      October 2022
      134 pages
      ISBN:9781450399876
      DOI:10.1145/3579051

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 February 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)45
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format