Abstract
Community detection is a significant research area in social networks. Most methods use network topology, but combining it with user interactions improves accuracy. This paper proposes a robust method to identify communities based on the improved user interaction degree, the weighted quasi-local structural similarity measure, and the frequent pattern mining on user interactions. In the community creation phase, influential users are identified based on eigenvector centrality and users who interact with them the most are extracted based on frequent pattern mining. In the community expansion phase, we introduce a measure to calculate the degree of user interactions based on the local clustering coefficient improved by interactions between common neighbors. We present two strategies to expand the community. The first strategy, a direct connection, exists between a user outside and a user inside the community. Their similarity is calculated based on the combined measure of improved user interaction degree and user degrees. The second strategy is if two users do not have a direct connection, we consider their communication paths. Therefore, we present a similarity measure combining a quasi-local path-based measure and an improved user interaction degree. Analysis of Higgs Twitter and Flickr datasets using internal density, Normalized Mutual Information, and Adjusted Rand Index shows that this paper's method outperforms the other five community detection methods. Furthermore, our method has more robustness than other relevant methods.
Similar content being viewed by others
Data availability
The data that support the findings of this study are openly available in Stanford Network Analysis Platform (SNAP) at http://snap.stanford.edu/data/higgs-twitter.html, reference number [53] and in ArnetMine at https://www.aminer.cn/data-sna#Flickr, reference number [55].
References
Li X, Xu G, Tang M (2018) Community detection for multi-layer social network based on local random walk. J Visual Commun Image Represent 57:91–98. https://doi.org/10.1016/j.jvcir.2018.10.003
Dabaghi-Zarandi F, KamaliPour P (2022) Community detection in complex network based on an improved random algorithm using local and global network information. J Network Comput Appl 206:103492. https://doi.org/10.1016/j.jnca.2022.103492
Das BC, Anwar MM, Bhuiyan MA-A, Sarker IH, Alyami SA, Moni MA (2021) Attribute driven temporal active online community search. IEEE Access 9:93976–93989. https://doi.org/10.1109/ACCESS.2021.3093368
Moscato V, Sperlì G (2021) A survey about community detection over On-line Social and Heterogeneous Information Networks. Knowledge-Based Syst 224:107112. https://doi.org/10.1016/j.knosys.2021.107112
Luo L, Liu K, Guo B, Ma J (2020) User interaction-oriented community detection based on cascading analysis. Inf Sci 510:70–88. https://doi.org/10.1016/j.ins.2019.09.022
Wilson C, Sala A, Puttaswamy KPN, Zhao BY (2012) Beyond Social Graphs. ACM Trans Web 6:1–31. https://doi.org/10.1145/2382616.2382620
O’Riordan S, Feller J, Nagle T (2016) A categorisation framework for a feature-level analysis of social network sites. J Decis Syst 25:244–262. https://doi.org/10.1080/12460125.2016.1187548
Moosavi SA, Jalali M, Misaghian N, Shamshirband S, Anisi MH (2016) Community detection in social networks using user frequent pattern mining. Knowl Inf Syst 51:159–186. https://doi.org/10.1007/s10115-016-0970-8
Dev H, Ali ME, Hashem T (2014) User interaction based community detection in online social networks. In: Database Systems for Advanced Applications: 19th International Conference, DASFAA 2014, Bali, Indonesia, April 21-24, 2014. Proceedings, Part II 19, 296-310, Springer. https://doi.org/10.1007/978-3-319-05813-9_20
Vathi E, Siolas G, Stafylopatis A, Nguyen N-T, Núñez M, Trawiński B (2017) Mining and categorizing interesting topics in Twitter communities. J Intell Fuzzy Syst 32:1265–1275. https://doi.org/10.3233/jifs-169125
Kumar S, Mallik A, Khetarpal A, Panda BS (2022) Influence maximization in social networks using graph embedding and graph neural network. Inf Sci 607:1617–1636. https://doi.org/10.1016/j.ins.2022.06.075
Ai J, He T, Su Z, Shang L (2022) Identifying influential nodes in complex networks based on spreading probability. Chaos, Solitons Fractals 164:112627. https://doi.org/10.1016/j.chaos.2022.112627
Laeuchli J, Ramírez-Cruz Y, Trujillo-Rasua R (2022) Analysis of centrality measures under differential privacy models. Appl Math Comput 412:126546. https://doi.org/10.1016/j.amc.2021.126546
Hansen D, Shneiderman B, Smith MA (2020) Analyzing social media networks with NodeXL: insights from a connected world (Second Edition), Morgan Kaufmann pp.Chapter 3. https://doi.org/10.1016/C2018-0-01348-1
Samanta S, Dubey VK, Sarkar B (2021) Measure of influences in social networks. Appl Soft Comput 99:106858. https://doi.org/10.1016/j.asoc.2020.106858
Zhong L-F, Shang M-S, Chen X-L, Cai S-M (2018) Identifying the influential nodes via eigen-centrality from the differences and similarities of structure. Phys A Stat Mech Appl 510:77–82. https://doi.org/10.1016/j.physa.2018.06.115
Goyal A, Bonchi F, Lakshmanan LV (2008) Discovering leaders from community actions. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, 499–508, https://doi.org/10.1145/1458082.1458149
Lu D, Li Q, Liao SS (2012) A graph-based action network framework to identify prestigious members through member’s prestige evolution. Decis Support Syst 53:44–54. https://doi.org/10.1016/j.dss.2011.12.003
Bamakan SMH, Nurgaliev I, Qu Q (2019) Opinion leader detection: a methodological review. Expert Syst Appl 115:200–222. https://doi.org/10.1016/j.eswa.2018.07.069
Kolahkaj M, Harounabadi A, Nikravanshalmani A, Chinipardaz R (2020) A hybrid context-aware approach for e-tourism package recommendation based on asymmetric similarity measurement and sequential pattern mining. Electron Commer Res App 42:100978. https://doi.org/10.1016/j.elerap.2020.100978
Noorian A, Harounabadi A, Ravanmehr R (2022) A novel Sequence-Aware personalized recommendation system based on multidimensional information. Expert Syst Appl 202:117079. https://doi.org/10.1016/j.eswa.2022.117079
Martínez V, Berzal F, Cubero J-C (2016) A survey of link prediction in complex networks. ACM Comput Surv 49:1–33. https://doi.org/10.1145/3012704
Srilatha P, Manjula R (2016) Similarity index based link prediction algorithms in social networks: a survey. J Telecommun Inf Technol 2:87–94
Tumiran SA, Sivakumar B (2021) Community structure concept for catchment classification: a modularity density-based edge betweenness (MDEB) method. Ecol Indic 124:107346. https://doi.org/10.1016/j.ecolind.2021.107346
Fardet T, Levina A (2021) Weighted directed clustering: interpretations and requirements for heterogeneous, inferred, and measured networks. Phys Rev Res. https://doi.org/10.1103/PhysRevResearch.3.043124
Paul A, Dutta A (2022) Community detection using Local Group Assimilation. Expert Syst Appl 206:117794. https://doi.org/10.1016/j.eswa.2022.117794
Shang R, Zhang W, Li Z, Wang C, Jiao L (2023) Attribute community detection based on latent representation learning and graph regularized non-negative matrix factorization. Appl Soft Comput 133:109932. https://doi.org/10.1016/j.asoc.2022.109932
Berahmand K, Bouyer A (2018) A link-based similarity for improving community detection based on label propagation algorithm. J Syst Sci Complexity 32:737–758. https://doi.org/10.1007/s11424-018-7270-1
Arab M, Afsharchi M (2014) Community detection in social networks using hybrid merging of sub-communities. J Network Comput Appl 40:73–84. https://doi.org/10.1016/j.jnca.2013.08.008
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008:P10008. https://doi.org/10.1088/1742-5468/2008/10/p10008
Dugué N, Perez A (2022) Direction matters in complex networks: a theoretical and applied study for greedy modularity optimization. Phys A Stat Mech Appl 603:127798. https://doi.org/10.1016/j.physa.2022.127798
Yakoubi Z, Kanawati R (2014) LICOD: a Leader-driven algorithm for community detection in complex networks. Vietnam J Comput Sci 1:241–256. https://doi.org/10.1007/s40595-014-0025-6
Ahajjam S, El Haddad M, Badir H (2018) A new scalable leader-community detection approach for community detection in social networks. Soc Netw 54:41–49. https://doi.org/10.1016/j.socnet.2017.11.004
Belfin RV, Grace Mary Kanaga E, Piotr B (2018) Overlapping community detection using superior seed set selection in social networks. Comput Electr Eng 70:1074–1083. https://doi.org/10.1016/j.compeleceng.2018.03.012
Li W, Huang C, Wang M, Chen X (2017) Stepping community detection algorithm based on label propagation and similarity. Phys A Stat Mech Appl 472:145–155. https://doi.org/10.1016/j.physa.2017.01.030
Pan X, Xu G, Wang B, Zhang T (2019) A novel community detection algorithm based on local similarity of clustering coefficient in social networks. IEEE Access 7:121586–121598. https://doi.org/10.1109/access.2019.2937580
Jaouadi M, Romdhane LB (2016) DIN: an efficient algorithm for detecting influential nodes in social graphs using network structure and attributes. In: 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA), 1–8, IEEE. https://doi.org/10.1109/AICCSA.2016.7945698
Wang Y, Jin D, He D, Musial K, Dang J (2022) Community detection in social networks considering social behaviors. IEEE Access 10:109969–109982. https://doi.org/10.1109/ACCESS.2022.3209704
Gupta SK, Singh DP (2023) Seed community identification framework for community detection over social media. Arab J Sci Eng 48:1829–1843. https://doi.org/10.1007/s13369-022-07020-z
Reihanian A, Feizi-Derakhshi M-R, Aghdasi HS (2023) An enhanced multi-objective biogeography-based optimization for overlapping community detection in social networks with node attributes. Inf Sci 622:903–929. https://doi.org/10.1016/j.ins.2022.11.125
Ahmed C, ElKorany A (2015) Enhancing link prediction in Twitter using semantic user attributes. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, 1155–1161, https://doi.org/10.1145/2808797.2810056
Yang C, Liu L, Chen L, Niu B (2017) A novel friend recommendation service based on interaction information mining. In: 2017 International Conference on Service Systems and Service Management, 1–5, IEEE. https://doi.org/10.1109/ICSSSM.2017.7996287
Lim KH, Datta A (2016) An interaction-based approach to detecting highly interactive Twitter communities using tweeting links. Web Intell 14:1–15. https://doi.org/10.3233/web-160328
Helal NA, Ismail RM, Badr NL, Mostafa MGM (2017) Leader-based community detection algorithm for social networks. WIREs Data Min Knowl Discovery. https://doi.org/10.1002/widm.1213
Ma X, He J, Wu T, Zhu N, Hua Y (2023) Interaction behavior enhanced community detection in online social networks. Comput Commun. https://doi.org/10.1016/j.comcom.2023.11.029
Newman M (2018) Networks Second Edition. Oxford University Press. https://doi.org/10.1093/oso/9780198805090.003.0007
Meng X, Han S, Wu L, Si S, Cai Z (2022) Analysis of epidemic vaccination strategies by node importance and evolutionary game on complex networks. Reliab Eng Syst Saf 219:108256. https://doi.org/10.1016/j.ress.2021.108256
Bai Z-Z, Wu W-T, Muratova GV (2021) The power method and beyond. Appl Numer Math 164:29–42. https://doi.org/10.1016/j.apnum.2020.03.021
Xiao W, Hu J (2021) Paradigm and performance analysis of distributed frequent itemset mining algorithms based on Mapreduce. Microprocess Microsyst 82:103817. https://doi.org/10.1016/j.micpro.2020.103817
Telikani A, Gandomi AH, Shahbahrami A (2020) A survey of evolutionary computation for association rule mining. Inf Sci 524:318–352. https://doi.org/10.1016/j.ins.2020.02.073
Yang R, Yang C, Peng X, Rezaeipanah A (2022) A novel similarity measure of link prediction in multi-layer social networks based on reliable paths. Concurr Comput Pract Exp. https://doi.org/10.1002/cpe.6829
De Domenico M, Lima A, Mougel P, Musolesi M (2013) The anatomy of a scientific rumor. Sci Rep 3:2980. https://doi.org/10.1038/srep02980
Platform SNA and (SNAP) higgs-twitter (Accessed February 18, 2023) (2015 ) http://snap.stanford.edu/data/higgs-twitter.html
Tan C, Tang J, Sun J, Lin Q, Wang F (2010) Social action tracking via noise tolerant time-varying factor graphs. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1049–1058, https://doi.org/10.1145/1835804.1835936
(ArnetMiner), A. Flickr-large Accessed February 18, 2023 (2006) https://www.aminer.cn/data-sna#Flickrlarge
Singh D, Garg R (2022) NI-Louvain: a novel algorithm to detect overlapping communities with influence analysis. J King Saud Univ Comput Inf Sci 34:7765–7774. https://doi.org/10.1016/j.jksuci.2021.07.006
Xie J, Szymanski BK, Liu X (2011) Slpa: uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process. In: 2011 IEEE 11th International Conference on Data Mining Workshops, 344–349, IEEE. https://doi.org/10.1109/ICDMW.2011.154
Meghanathan N (2015) Use of eigenvector centrality to detect graph isomorphism. arXiv preprint arXiv:1511.06620. https://doi.org/10.5121/csit.2015.51501
Funding
This research received no external funding.
Author information
Authors and Affiliations
Contributions
S. Sayari, A. Harunabadi, and T. Banirostam contributed to presenting the initial idea, developing and implementing the presented method, analyzing the results, and writing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sayari, S., Harounabadi, A. & Banirostam, T. Community detection based on improved user interaction degree, weighted quasi-local path-based similarity and frequent pattern mining. J Supercomput (2024). https://doi.org/10.1007/s11227-024-06178-7
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-024-06178-7