Don't Forget the Quantifiable Relationship between Words: Using Recurrent Neural Network for Short Text Topic Discovery

Authors

  • Heng-Yang Lu, Nanjing University
  • Lu-Yao Xie, Nanjing University
  • Ning Kang, Nanjing University
  • Chong-Jun Wang, Nanjing University
  • Jun-Yuan Xie, Nanjing University

DOI:

https://doi.org/10.1609/aaai.v31i1.10670

Keywords:

short text, topic model, RNN

Abstract

Short texts are everywhere in daily life, especially since the emergence of social networks: online media such as Twitter and Q&A sites produce countless short texts every day. Discovering the topics of these texts is valuable in application domains such as content recommendation and text characterization. Traditional topic models like LDA are widely applied to such tasks, but in the short-text scenario they may perform poorly because of the scarcity of words. Recently, the popular BTM model has used word co-occurrence relationships to address this sparsity problem and has proved effective. However, both BTM and its extensions ignore the intrinsic relationship between words. From our perspective, more closely related words should appear in the same topic. Based on this idea, we propose RIBS-TM, a model that uses an RNN to learn relationships between words and IDF to filter out high-frequency words. Experiments on two real-world short-text datasets demonstrate the utility of our model.
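The paper itself supplies the full RIBS-TM model; as a rough, hypothetical illustration of the two ingredients named above, the Python sketch below computes IDF scores to drop high-frequency words from a toy corpus, then runs a vanilla RNN cell over the remaining words and uses the cosine similarity between the hidden state and the next word's embedding as one way to quantify how related consecutive words are. Every name, threshold, and weight here is an assumption for illustration, not the authors' implementation.

    import math
    from collections import Counter

    import numpy as np

    # Toy corpus of short texts (hypothetical example data).
    docs = [
        ["good", "apple", "banana", "fruit"],
        ["good", "apple", "phone", "screen"],
        ["good", "banana", "fruit", "smoothie"],
    ]

    # IDF filtering: drop words that appear in too many documents,
    # since their inverse document frequency is near zero.
    n_docs = len(docs)
    df = Counter(word for doc in docs for word in set(doc))
    idf = {w: math.log(n_docs / df[w]) for w in df}

    IDF_THRESHOLD = 0.1  # hypothetical cutoff; "good" (IDF = 0) is dropped
    filtered = [[w for w in doc if idf[w] > IDF_THRESHOLD] for doc in docs]

    # One vanilla RNN cell with random, untrained weights. The hidden
    # state summarizes the words seen so far; its cosine similarity to
    # the next word's embedding is one way to score how related
    # consecutive words are. (In the paper the RNN is trained; this
    # sketch only shows the mechanics.)
    rng = np.random.default_rng(0)
    emb_dim = hid_dim = 8
    emb = {w: rng.normal(size=emb_dim) for w in idf}
    W_xh = 0.1 * rng.normal(size=(hid_dim, emb_dim))
    W_hh = 0.1 * rng.normal(size=(hid_dim, hid_dim))

    def relatedness(doc):
        """Cosine similarity between the RNN state and each next word."""
        h = np.zeros(hid_dim)
        scores = []
        for prev, nxt in zip(doc, doc[1:]):
            h = np.tanh(W_xh @ emb[prev] + W_hh @ h)
            x = emb[nxt]
            scores.append(float(h @ x / (np.linalg.norm(h) * np.linalg.norm(x))))
        return scores

    for doc in filtered:
        print(doc, [round(s, 3) for s in relatedness(doc)])

A trained RNN language model would replace the random weights; the point of the sketch is only that the hidden state carries sequential context between words, which a bag-of-words co-occurrence model like BTM ignores.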

Published

2017-02-12

How to Cite

Lu, H.-Y., Xie, L.-Y., Kang, N., Wang, C.-J., & Xie, J.-Y. (2017). Don’t Forget the Quantifiable Relationship between Words: Using Recurrent Neural Network for Short Text Topic Discovery. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10670

Issue

Vol. 31 No. 1 (2017)

Section

AAAI Technical Track: Knowledge Representation and Reasoning