Abstract
With the rise of social networks, the task of community detection in networks has become increasingly difficult in recent years. In this study, we introduce a novel approach for community detection named PCMeans, which combines PageRank, hierarchical clustering, and k-means algorithms to tackle the community detection problem on the entire network. Our technique employs Local PageRank to identify the most influential nodes within a local subgraph, followed by an overlapping hierarchical clustering strategy that determines the optimal number of clusters on the entire network. While our approach uses Local PageRank, which operates locally on each node, the clustering itself is performed globally on the entire network. K-means learning is then applied to swiftly converge to the final community structure. PCMeans is an unsupervised method that is easy to implement, efficient, and simple, and it addresses three common problems, including the random selection of the initial central node, specification of the number of classes K, and slow convergence. Experiments show that our algorithm not only has improved influence but also effectively reduces time complexity and outperforms other recent approaches on both real networks and synthetic benchmarks. Our approach is versatile and can be applied to a wide range of community detection problems, including those with non-convex shapes and unknown numbers of communities.
Similar content being viewed by others
References
Akbar Z, Liu J, Latif Z (2021) Mining social applications network from business perspective using modularity maximization for community detection. Soc Netw Anal Min 11:1–19
Bar-Yossef Z, Mashiach L-T (2008) Local approximation of pagerank and reverse pagerank. In: Proceedings of the 17th ACM conference on information and knowledge management, pp. 279–288
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008(10):10008
Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst 30(1–7):107–117
Cai B, Zeng L, Wang Y, Li H, Hu Y (2019) Community detection method based on node density, degree centrality, and k-means clustering in complex network. Entropy 21(12):1145
Chaudhary L, Singh B (2021) Community detection using unsupervised machine learning techniques on covid-19 dataset. Soc Netw Anal Min 11:1–9
Danon L, Díaz-Guilera A, Duch J, Arenas A (2005) Comparing community structure identification. J Stat Mech Theory Exp 2005(09):09008–09008. https://doi.org/10.1088/1742-5468/2005/09/p09008
Fortunato S (2010) Community detection in graphs. Phys Rep 486(3–5):75–174
Fortunato S, Hric D (2016) Community detection in networks: a user guide. Phys Rep 659:1–44
Geng J, Bhattacharya A, Pati D (2019) Probabilistic community detection with unknown number of communities. J Am Stat Assoc 114(526):893–905
Guo K, Huang X, Wu L, Chen Y (2022) Local community detection algorithm based on local modularity density. Appl Intell 52(2):1238–1253
Hajij M, Said E, Todd R (2020) Pagerank and the k-means clustering algorithm. arXiv preprint arXiv:2005.04774
Hollocou A, Bonald T, Lelarge M (2016) Improving pagerank for local community detection. arXiv preprint arXiv:1610.08722
Jiang JQ, McQuay LJ (2012) Modularity functions maximization with non negative relaxation facilitates community detection in networks. Phys A Stat Mech Appl 391(3):854–865
Krebs V (2008) A network of co-purchased books about us politics. October 20(1), 0–03
Kumar A, Barman D, Sarkar R, Chowdhury N (2020) Overlapping community detection using multiobjective genetic algorithm. IEEE Trans Comput Soc Syst 7(3):802–817
Li H, Zhang R, Zhao Z, Liu X (2021) Lpa-mni: an improved label propagation algorithm based on modularity and node importance for community detection. Entropy 23(5):497
Liu X, Fu L, Wang X, Zhou C (2022) On the similarity between von Neumann graph entropy and structural information: interpretation, computation, and applications. IEEE Trans Inf Theory 68:2182–2202
Luo W, Lu N, Ni L, Zhu W, Ding W (2020) Local community detection by the nearest nodes with greater centrality. Inf Sci 517:377–392
Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM (2003) The bottlenose dolphin community of doubtful sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol 54(4):396–405
Ma T, Wang Y, Tang M, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2016) Led: a fast overlapping communities detection algorithm based on structural clustering. Neurocomputing 207:488–500
Ma T, Liu Q, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2020) Lgiem: global and local node influence based community detection. Futur Gener Comput Syst 105:533–546
Moosavi SA, Jalali M, Misaghian N, Shamshirband S, Anisi MH (2017) Community detection in social networks using user frequent pattern mining. Knowl Inf Syst 51(1):159–186
Musdar IA, Azhari S (2015) Metode rce-kmeans untuk clustering data. IJCCS (Indonesian Journal of Computing and Cybernetics Systems) 9(2):157–166
Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026113
Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043):814–818
Parés F, Gasulla DG, Vilalta A, Moreno J, Ayguadé E, Labarta J, Cortés U, Suzumura T (2017) Fluid communities: a competitive, scalable and diverse community detection algorithm, 229–240. Springer
Pourkazemi M, Keyvanpour MR (2017) Community detection in social network by using a multi-objective evolutionary algorithm. Intell Data Anal 21(2):385–409
Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106
Rosvall M, Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci 105(4):1118–1123
Sheng J, Liu C, Chen L, Wang B, Zhang J (2020) Research on community detection in complex networks based on internode attraction. Entropy 22(12):1383
Van Laarhoven T, Marchiori E (2016) Local network community detection with continuous optimization of conductance and weighted kernel k-means. J Mach Learn Res 17(1):5148–5175
Vilcek A (2014) Deep learning with k-means applied to community detection in networks. CS224W Project Report
Wu Z, Wang X, Fang W, Liu L, Tang S, Zheng H, Zheng Z (2021) Community detection based on first passage probabilities. Phys Lett A 390:127099
Yuan Y, Chen B, Yu Y, Jin Y (2020) An influence maximisation algorithm based on community detection. Int J Comput Sci Eng 22(1):1–14
Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33(4):452–473
Žalik KR (2008) An efficient k’-means clustering algorithm. Pattern Recognit Lett 29(9):1385–1391
Zhou X, Su L, Li X, Zhao Z, Li C (2023) Community detection based on unsupervised attributed network embedding. Expert Syst Appl 213:118937
Author information
Authors and Affiliations
Contributions
LW wrote the main manuscript text and FT prepared figures and algorithms.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Louafi, W., Titouna, F. PCMeans: community detection using local PageRank, clustering, and K-means. Soc. Netw. Anal. Min. 13, 103 (2023). https://doi.org/10.1007/s13278-023-01109-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-023-01109-5