ABSTRACT
As social networks become further entrenched in modern society, it becomes increasingly important to understand and predict how information (e.g., news coverage of a given event) is propagated across social media (i.e., information pathway), which helps the understandings of the impact of real-world information. Thus, in this paper, we propose a novel task, Information Pathway Prediction (IPP), which depicts the propagation paths of a given passage as a community tree (rooted at the information source) on constructed community interaction graphs where we first aggregate individual users into communities formed around news sources and influential users, and then elucidate the patterns of information dissemination across media based on such community nodes. We argue that this is an important and useful task because, on one hand, community-level interactions offer more stability than those at the user level; on the other hand, individual users are often influenced by their community, and modeling community-level information propagation will help the traditional link-prediction problem. To tackle the IPP task, we introduce Lightning, a novel content-aware link prediction GNN model and demonstrate using a large Twitter dataset consisting of all COVID related tweets that Lightning outperforms state-of-the-art link prediction baselines by a significant margin.
Supplemental Material
- Iz Beltagy, Arman Cohan, Hannaneh Hajishirzi, Sewon Min, and Matthew E Peters. 2021. Beyond paragraphs: NLP for long sequences. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Tutorials. 20--24.Google ScholarCross Ref
- Iz Beltagy, Matthew E. Peters, and Arman Cohan. 2020. Longformer: The Long-Document Transformer. arXiv:2004.05150 (2020).Google Scholar
- Zhihao Chen, Jingjing Wei, Shaobin Liang, Tiecheng Cai, and Xiangwen Liao. 2021. Information Cascades Prediction With Graph Attention. In Frontiers of Physics.Google Scholar
- Zihang Dai, Zhilin Yang, Yiming Yang, Jaime Carbonell, Quoc Le, and Ruslan Salakhutdinov. 2019. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 2978--2988. https://doi.org/10.18653/v1/P19-1285Google ScholarCross Ref
- Nur Nasuha Daud, Siti Hafizah Ab Hamid, Muntadher Saadoon, Firdaus Sahran, and Nor Badrul Anuar. 2020. Applications of link prediction in social networks: A review. Journal of Network and Computer Applications, Vol. 166 (2020), 102716.Google ScholarCross Ref
- Adrien Guille. 2013. Information diffusion in online social networks. In SIGMOD'13 PhD Symposium.Google ScholarDigital Library
- Sogol Haghani and Mohammad Reza Keyvanpour. 2019. A systemic analysis of link prediction in social network. Artificial Intelligence Review, Vol. 52, 3 (2019), 1961--1995.Google ScholarDigital Library
- William L. Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In NIPS.Google Scholar
- Eleanna Kafeza, Andreas Kanavos, Christos Makris, and Pantelis Vikatos. 2014. Predicting Information Diffusion Patterns in Twitter. IFIP Advances in Information and Communication Technology, Vol. 436. https://doi.org/10.1007/978-3-662-44654-6_8Google ScholarCross Ref
- Marlen Komorowski, Tien Do Huu, and Nikos Deligiannis. 2018. Twitter data analysis for studying communities of practice in the media industry. Telematics and Informatics, Vol. 35, 1 (2018), 195--212.Google ScholarCross Ref
- Quyu Kong, Marian-Andrei Rizoiu, and Lexing Xie. 2020. Modeling Information Cascades with Self-Exciting Processes via Generalized Epidemic Models. In Proceedings of the 13th International Conference on Web Search and Data Mining (Houston, TX, USA) (WSDM '20). Association for Computing Machinery, New York, NY, USA, 286--294. https://doi.org/10.1145/3336191.3371821Google ScholarDigital Library
- Xiangjie Kong, Yajie Shi, Shuo Yu, Jiaying Liu, and Feng Xia. 2019. Academic social networks: Modeling, analysis, mining and applications. Journal of Network and Computer Applications, Vol. 132 (2019), 86--103.Google ScholarDigital Library
- Gueorgi Kossinets, Jon M. Kleinberg, and Duncan J. Watts. 2008. The structure of information pathways in a social communication network. In Knowledge Discovery and Data Mining.Google Scholar
- Ajay Kumar, Shashank Sheshar Singh, Kuldeep Singh, and Bhaskar Biswas. 2020a. Link prediction techniques, applications, and performance: A survey. Physica A: Statistical Mechanics and its Applications, Vol. 553 (2020), 124289.Google Scholar
- Ajay Kumar, Shashank Sheshar Singh, Kuldeep Singh, and Bhaskar Biswas. 2020b. Link prediction techniques, applications, and performance: A survey. Physica A-statistical Mechanics and Its Applications, Vol. 553 (2020), 124289.Google ScholarCross Ref
- Anisha Kumari, Ranjan Kumar Behera, Bibudatta Sahoo, and Satya Prakash Sahoo. 2022. Prediction of link evolution using community detection in social network. Computing, Vol. 104, 5 (2022), 1077--1098.Google ScholarDigital Library
- Mustafa Toprak, Chiara Boldrini, Andrea Passarella, and Marco Conti. 2023. Harnessing the Power of Ego Network Layers for Link Prediction in Online Social Networks. IEEE Transactions on Computational Social Systems, Vol. 10, 1 (2023), 48--60. https://doi.org/10.1109/TCSS.2022.3155946Google ScholarCross Ref
- Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio', and Yoshua Bengio. 2017. Graph Attention Networks. ArXiv, Vol. abs/1710.10903 (2017).Google Scholar
- Haixia Wu, Chunyao Song, Yao Ge, and Tingjian Ge. 2022. Link Prediction on Complex Networks: An Experimental Survey. Data Science and Engineering, Vol. 7, 3 (2022), 253--278.Google ScholarCross Ref
- Jiang Yang and Scott Counts. 2010. Predicting the Speed, Scale, and Range of Information Diffusion in Twitter. Proceedings of the International AAAI Conference on Web and Social Media (2010).Google ScholarCross Ref
- Jiaxuan You, Rex Ying, and Jure Leskovec. 2020. Design Space for Graph Neural Networks. ArXiv, Vol. abs/2011.08843 (2020).Google Scholar
- Ahmad Zareie and Rizos Sakellariou. 2020. Similarity-based link prediction in social networks using latent relationships between the users. Scientific Reports, Vol. 10, 1 (2020), 20137.Google ScholarCross Ref
- Fan Zhou, Xovee Xu, Goce Trajcevski, and Kunpeng Zhang. 2021. A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances. ACM Comput. Surv., Vol. 54, 2, Article 27 (mar 2021), s36 pages. https://doi.org/10.1145/3433000Google ScholarDigital Library
- Hengmin Zhu, Xicheng Yin, Jing Ma, and Wei Hu. 2016. Identifying the main paths of information diffusion in online social networks. Physica A: Statistical Mechanics and its Applications, Vol. 452 (2016), 320--328.Google Scholar
Index Terms
- Where Does Your News Come From? Predicting Information Pathways in Social Media
Recommendations
Predicting Information Pathways Across Online Communities
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningThe problem of community-level information pathway prediction (CLIPP) aims at predicting the transmission trajectory of content across online communities. A successful solution to CLIPP holds significance as it facilitates the distribution of valuable ...
Analyzing Close Friend Interactions in Social Media
SOCIALCOM '13: Proceedings of the 2013 International Conference on Social ComputingSocial media has increasingly become an outlet for expression in society. Users of online social networks often associate with many other users who are all treated as "friends, " even if they do not have a strong connection, or what would be described ...
Social media analytics: tracking, modeling and predicting the flow of information through networks
WWW '11: Proceedings of the 20th international conference companion on World wide webOnline social media represent a fundamental shift of how information is being produced, transferred and consumed. User generated content in the form of blog posts, comments, and tweets establishes a connection between the producers and the consumers of ...
Comments