Finding Communities by Their Centers

Chen, Yan; Zhao, Pei; Li, Ping; Zhang, Kai; Zhang, Jie

doi:10.1038/srep24017

Download PDF

Article
Open access
Published: 07 April 2016

Finding Communities by Their Centers

Yan Chen¹,
Pei Zhao¹,
Ping Li¹,
Kai Zhang² &
…
Jie Zhang³

Scientific Reports volume 6, Article number: 24017 (2016) Cite this article

4324 Accesses
20 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Detecting communities or clusters in a real-world, networked system is of considerable interest in various fields such as sociology, biology, physics, engineering science, and interdisciplinary subjects, with significant efforts devoted in recent years. Many existing algorithms are only designed to identify the composition of communities, but not the structures. Whereas we believe that the local structures of communities can also shed important light on their detection. In this work, we develop a simple yet effective approach that simultaneously uncovers communities and their centers. The idea is based on the premise that organization of a community generally can be viewed as a high-density node surrounded by neighbors with lower densities, and community centers reside far apart from each other. We propose so-called “community centrality” to quantify likelihood of a node being the community centers in such a landscape, and then propagate multiple, significant center likelihood throughout the network via a diffusion process. Our approach is an efficient linear algorithm, and has demonstrated superior performance on a wide spectrum of synthetic and real world networks especially those with sparse connections amongst the community centers.

Representative community divisions of networks

Article Open access 17 February 2022

Large network community detection by fast label propagation

Article Open access 15 February 2023

Single-trajectory map equation

Article Open access 22 April 2023

Introduction

Many real-world systems take the form of networks in which the functional units can be considered as nodes or vertices, which are connected by links depending on their interactions. One of the most prominent features of a network is its community structure, i.e. the organization of vertices in groups, with more interactions amongst the same group than between its group members and the reminder of the network.

The community structures are closely associated with functions of specific network, thus identifying such structures yields insights into the functional organization of the network. However, finding communities within an arbitrary network can be a computationally difficult task. A growing number of community detection methods have recently been proposed since the seminal work by Girvan and Newman¹. One popular criteria is to optimize the modularity measure^2,3,4,5, like the Louvain algorithm⁶ and the Fastgreedy algorithm⁷. More recent advances involve machine learning techniques such as seeding and semi-supervised learning method⁸, neural network approaches^9,10, and Bayesian^11,12. For more recent developments in community detection, see^{13,14,15,16,17,18}. Modularity measures internal connectivity of communities and uses the randomized null model as the reference. However, random networks have been found to show high-modularity subsets¹⁹. Moreover, for general networks, there exists a resolution limit below which modularity based methods cannot find the communities²⁰.

In general, community detection falls in the scope of clustering^{5,21,22,23,24}. A key concept in clustering is the measure of similarity, which to a large extent determines the clustering result. Existing similarity measures typically include the distance between two nodes²⁵, common neighbors²⁶, or local paths^27,28. However, one limitation of these similarity measures is that they usually do not take into account the fine local topological structures of the network, such as the connection pattern among the neighbors of a node, and the connections among the important nodes. This information is crucial in determining right community structures, and clustering without consideration of these patterns may be sub-optimal.

The same limitation applies to many existing community detection algorithms, i.e., they are only designed to identify the composition of communities, but not to unravel the detailed, local structures of communities. Here we argue that the local structures of communities can also shed important light on their detection. In this paper, we leverage the concept of node density in a network, and exploit the resultant distance landscape to devise an effective algorithm that simultaneously detects communities and their centers. Our basic idea is to design a community centrality indice to quantify the relative significance of a node with respect to its neighbors in the community. Nodes with higher community centrality indice are more likely to be centers in some communities. Based on the election of the central nodes in the communities, we are then able to categorize the reminder of nodes into communities using an iterative and greedy propagation strategy. This strategy resembles a multi-source diffusion and decision-making process which is simple with low complexity.

We show that by incorporating the local topological, structural modeling in the process of community detection, our approach can detect communities more accurately in several benchmark systems including both synthetic and real-world scenarios against state-of-the-art. The superiority of our approach is particularly significant for networks with their centers far away from each other. However, for some networks in which the centers may have more connections among them, it can be challenging for our approach to identify exact communities due to the less clarified boundaries between communities.

Very relevant to our work is that of Rodriguez and Laio²⁹, who presented an efficient clustering approach for data points in the Euclidean space. The basic idea is that these cluster centers are those surrounded by more points (so-called density) than their neighbor points and they have relatively large distance from each other. This algorithm need not be an iterative procedure and thereby is very fast. In comparison, however, the task of community detection is to cluster nodes in the topological space, which is very different from data clustering in many respects. Topological properties and local connection profiles must be considered for reliable community detection in complex networks or graphs.

Results

We test the performance of our method on both synthetic and real-world networks by comparing the outcome of our algorithm with the ground-truth community structures and results of other community detection methods. The synthetic networks are generated by the LFR-benchmark model³⁰, which produces networks with power-law degree distribution and with implanted communities within the networks, and the real-world networks are of different types and different scales. The networks used in our experiments are shown in Table 1.

Table 1 Networks used in the experiments.

Full size table

Synthetic Networks

To demonstrate the effectiveness of the proposed method, we generate four benchmark networks using procedures presented in ref. 30 and the detailed parameters are shown in Table 1. Among the 4 artificial networks, three of them have clear, non-overlapping community structures; and the other network has 5 overlapping communities. We use the Normalized Mutual Information (NMI)³¹ to measure the performance of different algorithms on detecting communities of these networks.

As can be seen from Table 2, for the first three networks with completely disjoint communities, our approach achieves a 100% accuracy in identifying actual communities. The communities in the fourth network generated from the model is shown in Fig. 1(b) and our result is shown in Fig. 1(c). It is obvious that the overlapping nodes in the benchmark are not quite reasonable, while our approach can properly partition the nodes into explicit two parts.

Table 2 Performance comparison in the networks with ground truth.

Full size table

**Figure 1: Two networks of LFR-benchmark.**

Real-world Networks

Several real-world networks are used to test the validity of our algorithm. The first one is Zachary karate club network³² which is a famous de facto network. A conflict between club president John (node 33) and the instructor Mr. Hi (node 0) leads to 34 members of the university sports club to split into two groups. Figure 2(a) shows that the communities discovered by our algorithm agree exactly with the result given by Zachary³². The leaders of the two groups are node 0 and node 33, which is consistent with the ground truth too.

The second network is the social network of bottlenose dolphins reported by Lusseau³³, which is an undirected social network of frequent associations between 62 dolphins in a community living off Doubtful Sound. In this network, dolphins are represented as vertices, and a link is attached between two nodes if the corresponding dolphins are observed together more often than expected by chance over a period of seven years from 1994 to 2001. The groups of dolphins are mainly divided into the male ones and female ones. Our result is shown to be completely the same as the ground truth. The two communities are marked by purple and blue, respectively (shown in Fig. 2(b)).

The third network is the political blogs data-set which is a directed network of hyperlinks between weblogs on US politics, recorded in 2005 by Adamic and Glance³⁴. The network is separated according to the political orientation of blogs, conservative or liberal. Due to the unconnectedness of the original network, we consider the undirected version of the network and retrieve the maximum component to detect communities. The maximum component has 1222 nodes and 16717 edges. The diameter is 8 and the average shortest path is 2.858. The NMI between our identified communities and the ground truth is 0.72. The visualization of communities detected is shown in Fig. 2(c).

The fourth network is the SFI collaboration network with 271 scientists at the Santa Fe Institute¹, an interdisciplinary research center in Santa Fe, New Mexico, with the largest component consisting of 118 scientists. An edge is drawn between a pair of scientists if they coauthored one or more articles. The network includes all journal and book publications by the scientists involved, along with all papers that appeared in the institute’s technical reports series. The network has several hub nodes with high degrees. In this network, we have tried different types of density indices in the implementation of our algorithm. We first consider the “strong-tie” density as the measure to select the central nodes, leading to a modularity of 0.65. We then use the classical degree density to select nodes as the centers of communities, and the resultant modularity is 0.70, showing the effects of different local centralities on the performance. The partition result by the degree-density is shown in Fig. 2(d).

We also test our method on several networks without ground-truth community partitions, with the size of the networks ranging from a hundred to tens of thousands, see Table 3. The comparison with other approaches is provided in the next section.

Table 3 Performance comparison in the networks without ground truth.

Full size table

Comparison With Other Methods

To further assess our method, we compare our partition results with five popular algorithms: Louvain, Fastgreedy, Infomap^35,36, Eigenvector³⁷ and Label propagation (LP)³⁸, using NMI and modularity³⁹ as the evaluation metrics. From Table 2, we can see that for networks with ground truth communities, our method is better than or similar to other algorithms using the NMI criterion, except on the American football network.

For those networks without the ground truth information (the networks in Table 3), we use the modularity to measure the quality of community detection results. It can be seen that the modularity values of our partitions are lower than those obtained by the Louvain and Fastgreedy algorithms. This can be expected because our method is not specifically designed to optimize the modularity as Louvain and Fastgreedy algorithms do. However, the modularity values obtained by our method are almost always better than the other three algorithms, i.e., Infomap, Eigenvector and Label propagation, especially for large sparse networks with low rich-club connectivity (e.g. PGP, CA-HepTh and Power Grid), see Table 3.

Discussion

We present a simple and novel method to detect community structures in complex networks. In our approach, a structural central node in each community will be determined firstly, and other nodes will be partitioned into different communities according to a multi-source diffusion and majority voting process (see detailed discussions in Methods). Compared with popular algorithms in the literature, our algorithm has robust performance in both synthetic and real-world networks with which ground truth is known, and the modularity results is also competitive in most of the networks. The whole algorithm has a linear time complexity, which fits it for large-scale problems.

From Table 3, we can also observe that for some networks, such as Jazz and Wiki-vote, the modularity obtained by our approach could be inferior to several other methods under comparison. We speculate that in these networks, there exist dense connections among detected community centers, therefore it becomes more challenging to identify exact communities. On the contrary, for networks whose community centers demonstrate sparse inter-connections among each other, our approach is expected to produce significant performance gains (e.g. PGP, CA-HepTh and Power Grid).

In general application scenarios, we can use the rich-club connectivity of the network^40,41 as an indicator of the performance of our approach. The rich-club connectivity is an interesting property that describes the amount of linkages among “rich” nodes of a network (i.e., nodes with high degrees, which are very likely the community centers). Typically, the lower the rich-club connectivity, the sparser the inter-connections among the center nodes, and hence our approach is expected to give a better performance; on the contrary, the higher the rich-club connectivity, the denser the inter-connections among the center nodes, and our approach may give inferior results.

To validate this indicator, we have included the rich-club connectivity of different networks in Table 1. Note that the rich-club connectivity ϕ(r) is a function of r, where r is the position of the node in the ordered list (from larger degrees to small degrees), normalized by the number of nodes N. In practice, we first choose the number of high-degree nodes k, and then examine whether the rich-club connectivity computed using r = k/N is above/below a pre-defined threshold (such as 0.5). In our experiments, k is chosen as a small number based on the logarithm of the number of nodes (as shown in Table 1). For example, the rich-club connectivity ϕ(r) for Jazz and Wiki-vote is 0.94 and 0.55, respectively; both of which are quite high, and as a result our approach does not produce a good modularity on these networks. In comparison, the rich-club connectivity ϕ(r) for PGP, CA-HepTh and Power Grid is lower than 0.2, and the performance of our approach on these networks are much more superior. This demonstrates the usefulness and applicability of the rich-club connectivity in predicting the performance of the proposed method.

Methods

The key intuition in our algorithm is that the central node in a community should be highly surrounded by other members in this community, namely it has a high density; while neighbors of the central node may not connect tightly with each other. We elaborate the three steps of our method in the following subsections.

Calculating Density Indice

We first calculate a density indice, denoted by η, for each node. The density indice can be defined in two ways: degree indice (η_d) and strong-tie indice (η_s). As we know, a node’s degree is the number of neighbors of the node. Larger degree means that the node has more neighbors and therefore it has a high local density. Strong-tie indice⁴² is defined in our algorithm as the number of triangles involving node i. A large strong-tie indice means that node i has more neighbors and its neighbors have more connections amongst themselves. The existence of such nodes strongly indicates the existence of communities. From Table 4 we can find that the performance by adopting strong-tie is better than the case of using degree in some networks but vice versa in other real networks. Our findings indicate that for large-scale networks, degree indice is more suitable than strong-tie. In the following we call both these two indices density η-score to simplify notations.

Table 4 Impact of different density indice (strong-tie and degree) on the Performance of our approach.

Full size table

Identifying Central Nodes

In this step the key objective is to remove non-central nodes based on η-scores. Intuitively, the central node should have a large distance to other central nodes (or nodes with higher η-scores); on the contrary, the central node will be relatively close to its neighbors (or nodes with lower η-scores). Based on this observation, we define the notion of Eta-reach-distance (ERD) ψ to facilitate the choice of central nodes. In particular, the ERD for ith node, ψ_i, is the minimum of the shortest path distances between node i and all other nodes with a higher η-score,

The usefulness of the Eta-reach-distance can be understood as follows. If a node i has a low ERD, that means i’s close neighbors have higher η-score than node i, then the node i will very unlikely be a central node; on the other hand, if the node i has a large ERD, then that means in order to find a node whose ERD is higher than node i, one has to go far away in the network, which means that node i is probably a central node. In fact, large ψ_i always appears at local or global maxima of the density scores. Given this property, we then identify the central nodes as those with particularly large ψ_i’s.

To reduce the computational cost, in searching the neighbors of node i, we will apply the breadth first search strategy. We first examine the first-order neighbors of a given node i, and if any of them has an η-score larger than node i, we will then set ψ_i = 1. Otherwise, we search the second order neighborhoods and if any of them has a larger η-score, we set ψ_i = 2; if not, we will examine all the 3rd-order neighbors of node i until the end. In general, it is enough to search in the neighborhood at depth 2, owing to dense connectivity structures of the community. So we will typically stop at ψ_i = 3 if we can not find a node with larger η-score and search no further. It should be noted that for some networks such as Jazz⁴³ and Email⁴⁴, etc, centers of the communities are directly connected with each other based on the ground truth available. In these cases, ψ for most of the nodes would be 1 and cannot be used to find the centers.

Sometimes, in order to consider the combined effect of local density and relative distance, we define the community centrality γ_i = η_iψ_i to depict the importance of the node in a community, as shown in Fig. 3. Then we sort the γ values in an descending order and choose the largest C nodes as the centers corresponding to the C communities. Consider that we do not know the exact number of communities in a network, we try several C values to find the best one with the largest modularity. Sometimes, we may observe an obvious turning point on the sorted γ values, as is shown in Fig. 3. In such circumstance we can simply use this turning point to decide which nodes should be the central nodes. For example, for the karate club data, there are two nodes with significantly larger γ’s than others, thus it’s better to divide the karate club network into two groups.

**Figure 3: The log-log graph of γ_i = η_iψ_i in descending order for six small networks.**

Label Propagation

After the central nodes have been identified and assigned proper community labels, their labels will diffuse in the whole network such that all the rest nodes can be labeled as well. We achieve this by using majority voting, namely, any node without a community label will accept one that presents most frequently in its (labeled) neighbors. To reduce the uncertainty in label propagation, we adopt a greedy, iterative scheme. In each iteration, among all the unlabeled nodes with sufficient labeled neighbors, we will only target on that node with the largest γ value. By doing this, the propagation process will affect only the most confident node one at a time, which is not only computationally efficient but also improves the labeling quality.

Complexity

Our algorithm consists of three steps. In the first step of calculating the density indice, the time complexity is O(n), where n is the number of nodes. In the second step, computing the ERD requires O(m) time, where m is the number of edges; sorting the γ values takes O(n) time if bucket sort algorithm is considered. The third step of community label assignment will require O(n) time. Thus, the total time complexity of our method is O(m + n). As a result, this algorithm has linear time complexity and can be efficiently applied to a network of tens of thousands of nodes.

Additional Information

How to cite this article: Chen, Y. et al. Finding Communities by Their Centers. Sci. Rep. 6, 24017; doi: 10.1038/srep24017 (2016).

References

Girvan, M. & Newman, M. E. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 99, 7821–7826 (2002).
Article CAS ADS MathSciNet MATH PubMed PubMed Central Google Scholar
Newman, M. E. & Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004).
Article CAS ADS Google Scholar
Newman, M. E. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 103, 8577–8582 (2006).
Article CAS ADS PubMed PubMed Central Google Scholar
Duch, J. & Arenas, A. Community detection in complex networks using extremal optimization. Phys. Rev. E 72, 027104 (2005).
Article ADS CAS Google Scholar
Pujol, J. M., Béjar, J. & Delgado, J. Clustering algorithm for determining community structure in large networks. Phys. Rev. E 74, 016107 (2006).
Article ADS CAS Google Scholar
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theor. Exp. 2008, P10008 (2008).
Article Google Scholar
Clauset, A., Newman, M. E. & Moore, C. Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004).
Article ADS CAS Google Scholar
Shang, C., Feng, S., Zhao, Z. & Fan, J. Efficiently detecting overlapping communities using seeding and semi-supervised learning. Int. J. Mach. Learn. and Cyber. 6, 1–14 (2015).
Article Google Scholar
Chon, T.-S., Park, Y. S., Moon, K. H. & Cha, E. Y. Patternizing communities by using an artificial neural network. Ecol. Model. 90, 69–78 (1996).
Article Google Scholar
Lv, J. C., Tan, K. K., Yi, Z. & Huang, S. A family of fuzzy learning algorithms for robust principal component analysis neural networks. Fuzzy Systems, IEEE Transactions on 18, 217–226 (2010).
Article Google Scholar
Mørup, M. & Schmidt, M. Bayesian community detection. Neural computation 24, 2434–2456 (2012).
Article MathSciNet PubMed MATH Google Scholar
Psorakis, I., Roberts, S., Ebden, M. & Sheldon, B. Overlapping community detection using bayesian non-negative matrix factorization. Phys. Rev. E 83, 066114 (2011).
Article ADS CAS Google Scholar
Oh, E., Choi, C., Kahng, B. & Kim, D. Modular synchronization in complex networks with a gauge kuramoto model. EPL (Europhysics Letters) 83, 68003 (2008).
Article ADS CAS Google Scholar
Yuan, W.-J. & Zhou, C. Interplay between structure and dynamics in adaptive complex networks: emergence and amplification of modularity by adaptive dynamics. Phys. Rev. E 84, 016116 (2011).
Article ADS CAS Google Scholar
Liu, W., Pellegrini, M. & Wang, X. Detecting communities based on network topology. Sci. Rep. 4, 5739 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Cao, X., Wang, X., Jin, D., Cao, Y. & He, D. Identifying overlapping communities as well as hubs and outliers via nonnegative matrix factorization. Sci. Rep. 3, 2993 (2013).
Article ADS PubMed PubMed Central Google Scholar
Airoldi, E. M., Blei, D. M., Fienberg, S. E. & Xing, E. P. Mixed membership stochastic blockmodels. J Mach Learn Res 9, 1981–2014 (2008).
PubMed PubMed Central MATH Google Scholar
Gopalan, P. K. & Blei, D. M. Efficient discovery of overlapping communities in massive networks. Proc. Natl. Acad. Sci. USA 110, 14534–14539 (2013).
Article CAS ADS MathSciNet PubMed MATH PubMed Central Google Scholar
Guimera, R., Sales-Pardo, M. & Amaral, L. A. N. Modularity from fluctuations in random graphs and complex networks. Phys. Rev. E 70, 025101 (2004).
Article ADS CAS Google Scholar
Lancichinetti, A. & Fortunato, S. Limits of modularity maximization in community detection. Phys. Rev. E 84, 066122 (2011).
Article ADS CAS Google Scholar
Boccaletti, S., Ivanchenko, M., Latora, V., Pluchino, A. & Rapisarda, A. Detecting complex network modularity by dynamical clustering. Phys. Rev. E 75, 045102 (2007).
Article CAS ADS Google Scholar
Lancichinetti, A. & Fortunato, S. Consensus clustering in complex networks. Scientific Reports 2, 336 (2012).
Article ADS PubMed PubMed Central Google Scholar
Zhang, S., Wang, R.-S. & Zhang, X.-S. Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Physica A 374, 483–490 (2007).
Article ADS Google Scholar
Huang, J. et al. Shrink: a structural clustering algorithm for detecting hierarchical communities in networks. In Proceedings of CIKM, 19, 219–228 (2010).
Google Scholar
Zhou, H. Distance, dissimilarity index, and network community structure. Phys. Rev. E 67, 061901 (2003).
Article ADS CAS Google Scholar
Lorrain, F. & White, H. C. Structural equivalence of individuals in social networks. J Math Sociol 1, 49–80 (1971).
Article Google Scholar
Zhou, T., Lü, L. & Zhang, Y. C. Predicting missing links via local information. Eur. Phys. J. B 71, 623–630 (2009).
Article CAS ADS MATH Google Scholar
Lü, L., Jin, C.-H. & Zhou, T. Similarity index based on local paths for link prediction of complex networks. Phys. Rev. E 80, 046122 (2009).
Article ADS CAS Google Scholar
Rodriguez, A. & Laio, A. Clustering by fast search and find of density peaks. Science 344, 1492–1496 (2014).
Article CAS ADS PubMed Google Scholar
Lancichinetti, A., Fortunato, S. & Radicchi, F. Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78, 046110 (2008).
Article ADS CAS Google Scholar
Lancichinetti, A., Fortunato, S. & Kertész, J. Detecting the overlapping and hierarchical community structure in complex networks. New J. Phys. 11, 033015 (2009).
Article ADS Google Scholar
Zachary, W. W. An information flow model for conflict and fission in small groups. J. Anthropol. Res. 33, 452–473 (1977).
Article Google Scholar
Lusseau, D. The emergent properties of a dolphin social network. Proc. R. Soc. B 270, S186–S188 (2003).
Article PubMed PubMed Central Google Scholar
Adamic, L. A. & Glance, N. The political blogosphere and the 2004 us election: divided they blog. Proceedings of the 3rd international workshop on Link discovery, 36–43 (ACM, 2005), doi: 10.1145/1134271.1134277.
Rosvall, M. & Bergstrom, C. T. Maps of information flow reveal community structure in complex networks. Proc. Natl. Acad. Sci. USA 105, 1118–1123 (2008).
Article CAS ADS PubMed PubMed Central Google Scholar
Rosvall, M., Axelsson, D. & Bergstrom, C. T. The map equation. Eur. Phys. J. Special Topics 178, 13–23 (2009).
Article ADS Google Scholar
Newman, M. E. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 036104 (2006).
Article CAS ADS MathSciNet Google Scholar
Raghavan, U. N., Albert, R. & Kumara, S. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 036106 (2007).
Article ADS CAS Google Scholar
Danon, L., Diaz-Guilera, A., Duch, J. & Arenas, A. Comparing community structure identification. J. Stat. Mech. Theor. Exp. 2005, P09008 (2005).
Article MATH Google Scholar
Zhou, S. & Mondragón, R. J. The rich-club phenomenon in the internet topology. IEEE Commun. Lett. 8, 180–182 (2004).
Article Google Scholar
Zhou, S. & Mondragón, R. J. Structural constraints in complex networks. New J. Phys. 9, 173–182 (2007).
Article ADS MathSciNet Google Scholar
Granovetter, M. S. The strength of weak ties. Am. J. Sociol. 78, 1360–1380 (1973).
Article Google Scholar
Gleiser, P. M. & Danon, L. Community structure in jazz. Adv. Complex Syst. 6, 565–573 (2003).
Article Google Scholar
Guimera, R., Danon, L., Diaz-Guilera, A., Giralt, F. & Arenas, A. Self-similar community structure in a network of human interactions. Phys. Rev. E 68, 065103 (2003).
Article CAS ADS Google Scholar
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of escherichia coli. Nat. Genet. 31, 64–68 (2002).
Article CAS PubMed Google Scholar
Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
Article CAS ADS PubMed MATH Google Scholar
Leskovec, J., Huttenlocher, D. & Kleinberg, J. Signed networks in social media. Proc. Chi, 1361–1370, doi: 10.1145/1753326.1753532 (2010).
Leskovec, J., Kleinberg, J. & Faloutsos, C. Graph evolution: Densification and shrinking diameters. ACM Trans. on Knowledge Discovery from Data (ACM TKDD) 1, 2 (2007).
Article Google Scholar
Boguñá, M., Pastor-Satorras, R., Daz-Guilera, A. & Arenas, A. Models of social networks based on social distance attachment. Phys. Rev. E 70, 056122 (2004).
Article ADS CAS Google Scholar
Leskovec, J., Lang, K. J., Dasgupta, A. & Mahoney, M. W. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Math. 6, 29–123 (2009).
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

Y. Chen acknowledges the support from National Natural Science Foundation of China (61503312). P. Li acknowledges the support from National Natural Science Foundation of China (61104224, 81373531) and foundations of SiChuan Educational Committee (13ZB0198). Y. C. and P. L. acknowledge SWPU Innovation Team “Data Intelligence” Funding (No. 2015CXTD06). J. Zhang acknowledges the support from National Science Foundation of China (NSFC 61104143 and 61573107), and special Funds for Major State Basic Research Projects of China (2015CB856003).

Author information

Authors and Affiliations

Center for Intelligent and Networked Systems, School of Computer Science, Southwest Petroleum University, Chengdu, 610500, P.R. China
Yan Chen, Pei Zhao & Ping Li
NEC Laboratories America, Inc. 4 Independence Way, Princeton, NJ 08540, USA
Kai Zhang
Center for Computational Systems Biology, Fudan University, Shanghai, 200433, P. R. China
Jie Zhang

Authors

Yan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Pei Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Ping Li
View author publications
You can also search for this author in PubMed Google Scholar
Kai Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.L. and Y.C. devised the research project; Y.C. and P.Z. performed numerical simulations with guidance from P.L., Y.C. and P.L. analyzed the results; Y.C., P.L., K.Z. and J.Z. wrote the paper. All authors contributed to the discussion of results. Y.C. and K.Z. contribute equally.

Corresponding authors

Correspondence to Ping Li or Jie Zhang.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Chen, Y., Zhao, P., Li, P. et al. Finding Communities by Their Centers. Sci Rep 6, 24017 (2016). https://doi.org/10.1038/srep24017

Download citation

Received: 10 May 2015
Accepted: 16 March 2016
Published: 07 April 2016
DOI: https://doi.org/10.1038/srep24017

This article is cited by

A novel memorizing single chromosome evolutionary algorithm for detecting communities in complex networks
- Elmira Pourabbasi
- Vahid Majidnezhad
- Yasser Jafari
Computing (2022)
A stable community detection approach for complex network based on density peak clustering and label propagation
- Chuanwei Li
- Hongmei Chen
- Xiaoling Yang
Applied Intelligence (2022)
A fuzzy logic approach to influence maximization in social networks
- Yacine Atif
- Kanna Al-Falahi
- Birgitta Lindström
Journal of Ambient Intelligence and Humanized Computing (2020)
Critical analysis of (Quasi-)Surprise for community detection in complex networks
- Ju Xiang
- Hui-Jia Li
- Jian-Ming Li
Scientific Reports (2018)
Automatic clustering based on density peak detection using generalized extreme value distribution
- Jiajun Ding
- Xiongxiong He
- Bo Jiang
Soft Computing (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.