Abstract
The online social networks have embraced huge success from the crowds in the last two decades. Now, more and more people get used to chat with friends online via instant messaging applications on personal computers or mobile devices. Since these conversations are sequentially organized, which fails to show the logical relations between messages, they are called asynchronous conversations in previous studies. Unfortunately, the sequential layouts of messages are usually not intuitive to see how the conversation evolves as time elapses. In this paper, we propose to learn the structures of online asynchronous conversations by predicting the “reply-to” relation between messages based on text similarity and latent semantic transferability. A heuristic method is also brought forward to predict the relation, and then recover the conversation structure. We demonstrate the effectiveness of the proposed method through experiments on a real-world web forum comment data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
see the Python library: http://scikit-learn.org/.
- 5.
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Boyd, D., Golder, S., Lotan, G.: Tweet, tweet, retweet: conversational aspects of retweeting on twitter. In: Proceedings of the 43rd Hawaii International Conference on System Sciences, pp. 1–10. IEEE (2010)
Chen, J., Wang, C., Wang, J.: A personalized interest-forgetting markov model for recommendations. In: AAAI, pp. 16–22 (2015)
Cook, J., Kenthapadi, K., Mishra, N.: Group chats on twitter. In: WWW, pp. 225–236 (2013)
Edmonds, J.: Optimum branchings. J. Res. Natl. Bur. Stand. B. Math. Math. Phys. 71B(4), 233–240 (1967)
Elsner, M., Charniak, E.: You talking to me? a corpus and algorithm for conversation disentanglement. In: ACL, pp. 834–842 (2008)
Elsner, M., Charniak, E.: Disentangling chat. Comput. Linguist. 36(3), 389–409 (2010)
Gandhi, S., Jones, A.R., Nesbitt, P.A., Seacat, L.A.: Instant conversation in a thread of an online discussion forum, November 2015. http://www.freepatentsonline.com/9177284.html
Honey, C., Herring, S.C.: Beyond microblogging: conversation and collaboration via twitter. In: Proceedings of the 42nd Hawaii International Conference on System Sciences, pp. 1–10. IEEE (2009)
Hyvärinen, A.: Fast and robust fixed-point algorithms for independent component analysis. IEEE Trans. Neural Netw. 10(3), 626–634 (1999)
Hyvärinen, A., Oja, E.: Independent component analysis: algorithms and applications. Neural Netw. 13(4), 411–430 (2000)
Joty, S., Carenini, G., Lin, C.Y.: Unsupervised modeling of dialog acts in asynchronous conversations. In: IJCAI, pp. 1807–1813 (2011)
Kannan, A., Kurach, K., Ravi, S., Kaufmann, T., Tomkins, A., Miklos, B., Corrado, G., Lukacs, L., Ganea, M., Young, P., Ramavajjala, V.: Smart reply: automated response suggestion for email. In: KDD, pp. 955–964 (2016)
Kumar, R., Mahdian, M., McGlohon, M.: Dynamics of conversations. In: KDD, pp. 553–561 (2010)
Lawson, C.L., Hanson, R.J.: Solving Least Squares Problems, vol. 161. Prentice-Hall, Englewood Cliffs (1974)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. NIPS 13, 556–562 (2000)
Lin, C.J.: Projected gradient methods for nonnegative matrix factorization. Neural Comput. 19(10), 2756–2779 (2007)
Ritter, A., Cherry, C., Dolan, B.: Unsupervised modeling of twitter conversations. In: NAACL, pp. 172–180 (2010)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Serafin, R., Eugenio, B.D.: FLSA: extending latent semantic analysis with features for dialogue act classification. In: ACL (2004). No. 692
Shen, D., Yang, Q., Sun, J.T., Chen, Z.: Thread detection in dynamic text message streams. In: SIGIR, pp. 35–42 (2006)
Stolcke, A., Ries, K., Coccaro, N., Shriberg, E., Bates, R., Jurafsky, D., Taylor, P., Martin, R., Ess-Dykema, C.V., Meteer, M.: Dialogue act modeling for automatic tagging and recognition of conversational speech. Comput. Linguist. 26(3), 339–373 (2000)
Uthus, D.C., Aha, D.W.: The Ubuntu chat corpus for multiparticipant chat analysis. In: Proceedings of the AAAI Spring Symposium (2013)
Wang, C., Ye, M., Huberman, B.A.: From user comments to on-line conversations. In: KDD, pp. 244–252 (2012)
Wang, L., Oard, D.W.: Context-based message expansion for disentanglement of interleaved text conversations. In: NAACL, pp. 200–208 (2009)
Wang, Y.C., Joshi, M., Cohen, W.W., Rosé, C.P.: Recovering implicit thread structure in newsgroup style conversations. In: Proceedings of the 2nd International Conference on Weblogs and Social Media (2008)
Wang, Y.X., Zhang, Y.J.: Nonnegative matrix factorization: a comprehensive review. TKDE 25(6), 1336–1353 (2013)
Zhang, J., Wang, C., Wang, J., Yu, J.X., Chen, J., Wang, C.: Inferring directions of undirected social ties. TKDE 28(12), 3276–3292 (2016)
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (No. 61373023, No. 61133002, No. 61502116), the China National Arts Fund (No. 20164129), and the National Science Foundation (NSF) under grant No. CNS-1252292.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chen, J., Wang, C., Lin, H., Wang, W., Cai, Z., Wang, J. (2017). Learning the Structures of Online Asynchronous Conversations. In: Candan, S., Chen, L., Pedersen, T., Chang, L., Hua, W. (eds) Database Systems for Advanced Applications. DASFAA 2017. Lecture Notes in Computer Science(), vol 10177. Springer, Cham. https://doi.org/10.1007/978-3-319-55753-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-55753-3_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55752-6
Online ISBN: 978-3-319-55753-3
eBook Packages: Computer ScienceComputer Science (R0)