Jointly Modeling Topics and Intents with Global Order Structure

Authors

  • Bei Chen Tsinghua University
  • Jun Zhu Tsinghua University
  • Nan Yang Microsoft Research Asia
  • Tian Tian Tsinghua University
  • Ming Zhou Microsoft Research Asia
  • Bo Zhang Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v30i1.10346

Keywords:

Topic Model, Generalized Mallows Model, Discourse Analysis

Abstract

Modeling document structure is of great importance for discourse analysis and related applications. The goal of this research is to capture the document intent structure by modeling documents as a mixture of topic words and rhetorical words. While the topics are relatively unchanged through one document, the rhetorical functions of sentences usually change following certain orders in discourse. We propose GMM-LDA, a topic modeling based Bayesian unsupervised model, to analyze the document intent structure cooperated with order information. Our model is flexible that has the ability to combine the annotations and do supervised learning. Additionally, entropic regularization can be introduced to model the significant divergence between topics and intents. We perform experiments in both unsupervised and supervised settings, results show the superiority of our model over several state-of-the-art baselines.

Downloads

Published

2016-03-05

How to Cite

Chen, B., Zhu, J., Yang, N., Tian, T., Zhou, M., & Zhang, B. (2016). Jointly Modeling Topics and Intents with Global Order Structure. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10346

Issue

Section

Technical Papers: NLP and Machine Learning