Abstract
Timeline generation is an important research task which can help users to have a quick understanding of the overall evolution of one given topic. Previous methods simply split the time span into fixed, equal time intervals without studying the role of the evolutionary patterns of the underlying topic in timeline generation. In addition, few of these methods take users’ collective interests into considerations to generate timelines.
We consider utilizing social media attention to address these two problems due to the facts: 1) social media is an important pool of real users’ collective interests; 2) the information cascades generated in it might be good indicators for boundaries of topic phases. Employing Twitter as a basis, we propose to incorporate topic phases and user’s collective interests which are learnt from social media into a unified timeline generation algorithm.We construct both one informativeness-oriented and three interestingness-oriented evaluation sets over five topics.We demonstrate that it is very effective to generate both informative and interesting timelines. In addition, our idea naturally leads to a novel presentation of timelines, i.e., phase based timelines, which can potentially improve user experience.
Similar content being viewed by others
References
Swan R, Allan J. Automatic generation of overview timelines. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2000, 49–56
Chieu H L, Lee Y K. Query based event extraction along a timeline. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2004, 425–432
Yan R, Wan X J, Otterbacher J, Kong L, Li X M, Zhang Y. Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In: Proceedings of the 34th Annual International ACMSIGIR Conference on Research and Development in Information Retrieval. 2011, 745–754
Yan R, Kong L, Huang C R, Wan X J, Li X M, Zhang Y. Timeline generation through evolutionary trans-temporal summarization. In: Proceedings of the Conference on EmpiricalMethods in Natural Language Processing. 2011, 433–443
Yan R, Nie J Y, Li X M. Summarize what you are interested in: an optimization framework for interactive personalized summarization. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011, 1342–1351
Kleinberg J. Bursty and hierarchical structure in streams. Data Mining Knowledge Discovery, 2003, 7(4): 373–397
Wu SM, Hofman J M, Mason WA, Watts J D. Who says what to whom on twitter. In: Proceedings of the 20th International World Wide Web Conference. 2011, 705–714
Zhai C X, Lafferty J. Model-based feedback in the language modeling approach to information retrieval. In: Proceedings of the 10th ACM International Conference on Information and Knowledge Management. 2001, 403–410
Erkan G, Radev D R. LexPageRank: prestige in multi-document text summarization. In: Proceedings of the Conference on Empirical Methods on Natural Language Processing. 2004, 365–371
Wan X J, Yang J W, Xiao J G. Manifold-ranking based topic-focused multi-document summarization. In: Proceedings of the International Joint Conference on Artificial Intelligence. 2007, 2903–2908
Mei Q Z, Guo J, Radev, D. DivRank: the interplay of prestige and diversity in information networks. In: Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2010, 1009–1018
Yang J, Leskovec J. Patterns of temporal variation in online media. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining. 2011, 177–186
Leskovec J, Backstrom L, Kleinberg, J. Meme-tracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD Conference on Knowledge Discovery and DataMining. 2009, 497–506
Radev D R, Jing H Y, Sty M, Tam D. Centroid-based summarization of multiple documents. Information Processing and Management, 2004, 40(6): 919–938
Wan X J, Yang J W. Multi-document summarization using clusterbased link analysis. In: Proceedings of the 31st ACM SIGIR Conference on Research and Development in Information Retrieval. 2008, 299–306
Zhao X W, Shu B H, Jiang J, Song Y, Yan H F, Li X M. Identifying event-related bursts via social media activities. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2012, 1466–1477
Lin C Y, Hovy E. From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Annual Conference of the Association for Computational Linguistics. 2002, 457–464
Wan X J, Yang J W, Xiao J G. Single document summarization with document expansion. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2007, 931–936
Li L D, Zhou K, Xue G R, Zha H Y, Yu Y. Enhancing diversity, coverage and balance for summarization through structure learning. In: Proceedings of the 18th International World Wide Web Conference. 2009, 71–80
Goldstein J, Kantrowitz M, Mittal V, Carbonell J. Summarizing text documents: sentence selection and evaluation metrics. In: Proceedings of the 22nd ACM SIGIR Conference on Research and Development in Information Retrieval. 1999, 121–128
Leuski A, Lin C Y, Hovy E. iNeATS: interactive multi-document summarization. In: Proceedings of the 41st Conference of the Association for Computational Linguistics. 2003, 125–128
Allan J, Gupta R, Khandelwal V. Temporal summaries of new topics. In: Proceedings of the 24th ACM SIGIR Conference on Research and Development in Information Retrieval. 2001, 10–18
Yang Z, Cai K K, Tang J, Zhang L, Su Z, Li J Z. Social context summarization. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2011, 255–264
Swan R, Allan J. Automatic generation of overview timelines. In: Proceedings of the 23rd Annual ACM SIGIR Conference on Research and Development in Information Retrieval. 2000, 49–56
Fung G P C, Yu J X, Yu P S, Lu H J. Parameter free bursty events detection in text streams. In: Proceedings of the 31st International Conference on Very Large Data Bases. 2005, 181–192
Mathioudakis M, Koudas N. TwitterMonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM International Conference on Management of Data. 2010, 1155–1158
Zubiaga A, Spina D, Fresno V, Martínez R. Classifying trending topics: a typology of conversation triggers on Twitter. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management. 2011, 2461–2464
Budak C, Agrawal D, Abbadi A E. Structural trend analysis for online social networks. The Proceedings of the VLDB Endowment, 2011, 4(10): 646–656
Naaman M, Becker H, Gravano L. Hip and trendy: characterizing emerging Trends on twitter. Journal of American Society for Information Science and Techonology, 2011, 62(5): 902–918
Sakaki T, Okazaki M, Matsuo Y. Earthquake shakes Twitter users: realtime event detection by social sensors. In: Proceedings of the 19th International World Wide Web Conference. 2010, 851–860
Aramki E, Maskawa S, Morita M. Twitter catches the flu: Detecting influenza epidemics using Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011, 1466–1477
Marcus A, Bernstein M S, Badar O, Karger D R, Madden S, Miller R C. Twitinfo: aggregating and visualizing microblogs for event exploration. In: Proceedings of ACM Conference on Human Factors in Computing Systems. 2011, 227–236
Author information
Authors and Affiliations
Corresponding author
Additional information
Wayne Xin Zhao is currently an assistant professor at the School of Information, Renmin University of China, China. He received the PhD degree from Peking University, China in 2014. His research interests are web text mining and natural language processing. He has published several referred papers in international conferences and journals such as ACL, EMNLP, COLING, ECIR, CIKM, SIGIR, SIGKDD, ACM TOIS, ACM TIST and IEEE TKDE.
Ji-Rong Wen is a professor at the School of Information, Renmin University of China, China. Before that, he was a senior researcher and group manager of the Web Search and Mining Group at MicroSesearch Asia (MSRA), China since 2008. He has published extensively on prestigious international conferences/journals and served as program committee members or chairs in many international conferences. He was the chair of theWWWin China track of the 17thWorldWideWeb conference. He is currently the associate editor of ACM Transactions on Information Systems (TOIS).
Xiaoming Li is a professor at the School of Electronic Engineering and Computer Science and the director of Institute of Network Computing and Information Systems in Peking University, China. He is a senior member of IEEE and currently served as vice president of China Computer Federation. His research interests include search engine and web mining, and web technology enabled social sciences.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Zhao, W.X., Wen, JR. & Li, X. Generating timeline summaries with social media attention. Front. Comput. Sci. 10, 702–716 (2016). https://doi.org/10.1007/s11704-015-5145-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-015-5145-3