Identifying influential spreaders by gravity model considering multi-characteristics of nodes

Li, Zhe; Huang, Xinyu

doi:10.1038/s41598-022-14005-3

Download PDF

Article
Open access
Published: 14 June 2022

Identifying influential spreaders by gravity model considering multi-characteristics of nodes

Zhe Li¹ &
Xinyu Huang²

Scientific Reports volume 12, Article number: 9879 (2022) Cite this article

2634 Accesses
17 Citations
Metrics details

Subjects

Abstract

How to identify influential spreaders in complex networks is a topic of general interest in the field of network science. Therefore, it wins an increasing attention and many influential spreaders identification methods have been proposed so far. A significant number of experiments indicate that depending on a single characteristic of nodes to reliably identify influential spreaders is inadequate. As a result, a series of methods integrating multi-characteristics of nodes have been proposed. In this paper, we propose a gravity model that effectively integrates multi-characteristics of nodes. The number of neighbors, the influence of neighbors, the location of nodes, and the path information between nodes are all taken into consideration in our model. Compared with well-known state-of-the-art methods, empirical analyses of the Susceptible-Infected-Recovered (SIR) spreading dynamics on ten real networks suggest that our model generally performs best. Furthermore, the empirical results suggest that even if our model only considers the second-order neighborhood of nodes, it still performs very competitively.

Identifying influential spreaders by gravity model

Article Open access 10 June 2019

Influential nodes identification using network local structural properties

Article Open access 03 February 2022

Identifying influential nodes in complex networks using a gravity model based on the H-index method

Article Open access 29 September 2023

Introduction

The focus of network science has been shifting from discovering macroscopic statistical regularities to microscopic elements, vital nodes identification has received a huge amount of attention from researchers of network science in recent years. Vital nodes identification can be widely used in disease analysis^1,2, rumor analysis³, information propagation⁴, power grid protection⁵, discovery of candidate drug targets and essential proteins⁶, discovery of important species^7,8, and so on.

So far, most known methods only use structural information⁹, which can be classified into neighborhood-based centralities and path-based centralities roughly. Typical representatives of neighborhood-based centralities are degree centrality¹⁰ (DC), H-index¹¹ and k-shell decomposition method¹² (KS). For DC, the more neighbors a node has, the greater its influence. For H-index, the more large-degree neighbors a node has, the greater its influence. For KS, the more central a node locates in the network, the greater its influence. Besides, eigenvector centrality¹³ (EC) is the representative neighborhood-based iterative method, suggesting that the influence of a node is not only determined by the number of its neighbors, but also determined by the influence of each neighbor. Typical representatives of path-based centralities are betweenness centrality¹⁴ (BC) and closeness centrality¹⁵ (CC). For BC, the more a node is located in shortest paths, the greater its influence. For CC, the closer a node is to other nodes, the greater its influence.

However, a significant number of experiments indicate that depending on a single characteristic of nodes to reliably identify influential spreaders is inadequate⁹. As a result, the methods integrating multi-characteristics of nodes have been proposed. In particular, the methods based on gravity law seem very promising. As several laws behind phenomena in life are similar to the gravity law, the gravity model, which derives from the gravity law, is also favored and exhibited in many real-life scenarios. Representative examples include predicting the population migration between regions in demography¹⁶ and forecasting the trade flows throughout countries in economics¹⁷. In network science, the gravity model is utilized to evaluate the influence^18,19,20 of nodes, and so on. Recently, a series of gravity-law-based algorithms^{18,19,20,21,22,23,24,25,26,27,28,29,30} considering both neighborhood information and path information have been proposed, and their performance is much better than the above well-known state-of-the-art methods. Typical representatives are gravity centrality¹⁸ (GC), improved gravity centrality¹⁹ (IGC) and local gravity model²⁰ (LGM). For GC, the k-shell value of a node is regarded as its mass. For IGC, the focal node uses the k-shell value as its mass while its neighbors view the degree value as their masses. For LGM, the degree value of a node is regarded as its mass. However, whether the degree or k-shell is regarded as mass, the influence of neighbors is not taken into consideration. In view of this, we propose a gravity model that effectively integrates multi-characteristics of nodes to measure the influence of nodes in spreading dynamics. In our model, the number of neighbors, the influence of neighbors, the location of nodes, and the path information between nodes are all taken into consideration.

Preliminaries

Well-known state-of-the-art methods

Denote $G=<V,E>$ an undirected and unweighted simple network, where V and E are the sets of nodes and links. Denote $|V|=N$ and $|E|=M$, then the network has N nodes and M links. The adjacent matrix of G is denoted by $A=(a_{ij})_{N\times N}$, if node i links to node j, $a_{ij}=1$, otherwise, $a_{ij}=0$.

The degree centrality¹⁰ (DC) of node i is measured by

$$\begin{aligned} DC(i)=k(i), \end{aligned}$$

(1)

where $k(i)=\sum _{j = 1}^{N} a_{ij}$.

The H-index¹¹ of node i, denoted by H(i), is defined as the maximal integer satisfying that there are at least H(i) neighbors of node i whose degrees are all greater than or equal to H(i).

The k-shell decomposition method¹² (KS) works by iterative decomposition of the network into different shells. The first step of KS is to remove the nodes whose degrees are equal to 1 from the network, which will cause a reduction of the degree value to the remaining nodes. Continually remove all the nodes whose residual degrees are less than or equal to 1, until all the remaining nodes’ residual degrees are greater than 1. All the removed nodes in the first step form the 1-shell and their k-shell values are all equal to 1. Repeat this process to obtain 2-shell, 3-shell, $\ldots $ , and so on. The decomposition process will continue until there are no more nodes in the network.

The eigenvector centrality¹³ (EC) of node i is measured by

$$\begin{aligned} x(i)=c\sum _{j = 1}^{N}a_{ij}x(j), \end{aligned}$$

(2)

where c is a constant, generally speaking, c is set to the reciprocal of the largest eigenvalue of A.

The betweenness centrality¹⁴ (BC) of node i is measured by

$$\begin{aligned} BC(i)=\sum _{s\ne {i},s\ne {t},i\ne {t}}\frac{g_{st}(i)}{g_{st}}, \end{aligned}$$

(3)

where $g_{st}$ is the number of shortest paths between node s and node t, and $g_{st}(i)$ is the number of shortest paths via node i between node s and node t.

The closeness centrality¹⁵ (CC) of node i is measured by

$$\begin{aligned} CC(i)=\frac{N-1}{\sum \limits _{j\ne i} d(i,j)}, \end{aligned}$$

(4)

where d(i, j) is the shortest distance from node i to node j.

The gravity centrality¹⁸ (GC) of node i is measured by

$$\begin{aligned} GC(i)=\sum _{j\in \psi _i}\frac{k_s(i)k_s(j)}{d^{2}(i,j)}, \end{aligned}$$

(5)

where $k_s(i)$ is the k-shell value of node i, and $\psi _i$ is the neighborhood set whose distance to node i is not greater than 3.

An extended version of GC, denoted by GC+, GC+ of node i is measured by

$$\begin{aligned} GC+(i)=\sum _{j\in \Lambda _{i}}GC(j), \end{aligned}$$

(6)

where $\Lambda _{i}$ is the neighborhood set whose distance to node i equals to 1.

The improved gravity centrality¹⁹ (IGC) of node i is measured by

$$\begin{aligned} IGC(i)=\sum _{j\in \psi _i}\frac{k_s(i)k(j)}{d^{2}(i,j)}. \end{aligned}$$

(7)

An extended version of IGC, denoted by IGC+, IGC+ of node i is measured by

$$\begin{aligned} IGC+(i)=\sum _{j\in \Lambda _{i}}IGC(j). \end{aligned}$$

(8)

The local gravity model²⁰ (LGM) of node i is measured by

$$\begin{aligned} LGM(i)=\sum _{d(i,j)\le R,j\ne {i}}\frac{k(i)k(j)}{d^{2}(i,j)}, \end{aligned}$$

(9)

where R is the truncation radius, and the optimal truncation radius $R^*$ can be estimated by

$$\begin{aligned} R^*\approx \frac{1}{2}\left\langle d \right\rangle , \end{aligned}$$

(10)

where $\left\langle d \right\rangle $ is the average distance of the network.

The SIR model

The SIR model³¹ initially considers all the nodes as in the susceptible (S) state except the source node in the infected (I) state. At each time step, each infected node can infect its susceptible neighbors with probability $\beta $. Then, each infected node enters the recovered (R) state with probability $\lambda $. The propagation process continues until there are no more nodes in the infected state. The influence of node i can be estimated by

$$\begin{aligned} F(i) = N_{r}/N, \end{aligned}$$

(11)

where $N_{r}$ is the number of recovered nodes when dynamic process achieves steady state. For simplicity, $\lambda $ is set to 1, then the corresponding epidemic threshold³² can be calculated by

$$\begin{aligned} \beta _c\approx \frac{\left\langle k \right\rangle }{\left\langle k^{2} \right\rangle -\left\langle k \right\rangle }, \end{aligned}$$

(12)

where $\left\langle k \right\rangle $ is the average degree, and $\left\langle k^{2} \right\rangle $ is the second-order moment of the degree distribution.

The Kendall’s Tau

The Kendall’s Tau³³ is an index describing the strength of correlation between two sequences. Denote $X=(x_1, x_2, \ldots ,x_N)$ and $Y=(y_1, y_2, \ldots , y_N)$ are two sequences with N elements. For any pair of two-tuples $(x_i,y_i)$ and $(x_j,y_j)$ $(i\ne j)$, if both $x_i>x_j$ and $y_i>y_j$ or both $x_i<x_j $ and $y_i<y_j$, the pair is concordant. If both $x_i>x_j$ and $y_i<y_j$ or both $x_i<x_j$ and $y_i>y_j$, the pair is discordant. If $x_i=x_j$ or $y_i=y_j$, the pair is neither concordant nor discordant. The Kendall’s Tau of X and Y can be calculated by

$$\begin{aligned} \tau =\frac{2(n_+-n_-)}{N(N-1)}, \end{aligned}$$

(13)

where $n_+$ is the number of concordant pairs, and $n_-$ is the number of discordant pairs.

The monotonicity

The monotonicity³⁴ M of ranking list L is used to quantitatively measure the resolution of different indices, and it can be calculated by

$$\begin{aligned} M(L)=[1-\frac{\sum _{r\in L}U_{r}(U_{r}-1)}{U(U-1))}]^{^{2}}, \end{aligned}$$

(14)

where U is the size of L, and $U_{r}$ is the number of ties with the same rank r.

Results

Algorithms

According to previous studies, the degree value of a node indicates the number of its neighbors, the k-shell value of a node reflects where it locates in the network, the eigenvector centrality value of a node can reflect both the number of its neighbors and the influence of each neighbor, and the distance between two nodes can describe the path information. Individually speaking, nodes with large degree value, k-shell value and eigenvector centrality value are likely to be more influential. Furthermore, a node is of higher impacts on nearby nodes. According to the above issues and inspired by the gravity law, we regard the sum of degree value, k-shell value and eigenvector centrality value of a node as its mass, and the shortest distance between two nodes as their distance. Therefore, the influence of node i can be estimated as

$$\begin{aligned} MCGM(i)=\sum _{d(i,j)\le R, j\ne {i}}\frac{(k(i)+k_s(i)+x(i))(k(j)+k_s(j)+x(j))}{d^{2}(i,j)}. \end{aligned}$$

(15)

Such method is named as multi-characteristics gravity model (MCGM) as it considers multi-characteristics of nodes and adopts the gravity law.

It is not difficult to find that these three indices (DC, KS, EC) are not in the same order of magnitude, so normalization is required. As a result, Eq. (15) can be rewritten as

$$\begin{aligned} MCGM(i)=\sum _{d(i,j)\le R, j\ne {i}}\frac{(\frac{k(i)}{k_{max}}+\frac{k_{s}(i)}{k_{s{max}}}+\frac{x(i)}{x_{max}})(\frac{k(j)}{k_{max}}+\frac{k_{s}(j)}{k_{s{max}}}+\frac{x(j)}{x_{max}})}{d^{2}(i,j)}, \end{aligned}$$

(16)

where $k_{max}$, $k_{s{max}}$ and $x_{max}$ denote the maximum of degree value, k-shell value and eigenvector centrality value, respectively.

However, since the k-shell index has smaller value space, the normalized k-shell index is still larger than the other two indices. Therefore, it is necessary to lower the impact of the k-shell index. Given an index, due to the scale-free property of networks, the index values of most nodes are relatively small. Therefore, the index with larger value space generally has a smaller ratio between the median and the maximum. In our model, it is obvious that the value space of degree centrality and eigenvector centrality is larger than that of k-shell index. In view of this, we can lower the impact of k-shell index by

$$\begin{aligned} \alpha =\frac{max{\left\{ {\frac{k_{mid}}{k_{max}},\frac{x_{mid}}{x_{max}}}\right\} }}{\frac{k_{s{mid}}}{k_{s{max}}}}, \end{aligned}$$

(17)

where $k_{mid}$, $k_{s{mid}}$ and $x_{mid}$ denote the median of degree value, k-shell value and eigenvector centrality value, respectively. The purpose of taking the maximum value of ${\left\{ {\frac{k_{mid}}{k_{max}},\frac{x_{mid}}{x_{max}}}\right\} }$ is to prevent the function of k-shell index from being excessively weakened.

Finally, Eq. (15) can be rewritten as

$$\begin{aligned} MCGM(i)=\sum _{d(i,j)\le R, j\ne {i}}\frac{(\frac{k(i)}{k_{max}}+\frac{\alpha k_{s}(i)}{k_{s{max}}}+\frac{x(i)}{x_{max}})(\frac{k(j)}{k_{max}}+\frac{\alpha k_{s}(j)}{k_{s{max}}}+\frac{x(j)}{x_{max}})}{d^{2}(i,j)}. \end{aligned}$$

(18)

The Algorithmic description of MCGM is provided in Algorithm 1. We take a toy network shown in Fig. 1 to illustrate the calculation process of Algorithm 1.

Firstly, calculate the degree value, k-shell value and eigenvector centrality value of each node in the toy network, the results are shown in Table 1.

Table 1 The degree value, k-shell value and eigenvector centrality value of each node in the toy network.

Full size table

Secondly, calculate $k_{max}=5$, $k_{s{max}}=3$, $x_{max}=0.1917$, $k_{mid}=3$, $k_{s{mid}}=2$ and $x_{mid}=0.1256$, furthermore, calculate $\alpha =0.9827$.

Finally, the result of MCGM with $R=2$ of the toy network is shown in Table 2. Take node 3 as an example, the 1-order neighbors of node 3 are node 2, node 4 and node 7, the 2-order neighbors of node 3 are node 1, node 5 and node 6, so $MCGM(3)=16.9320$.

Table 2 The result of MCGM with $R=2$ of the toy network.

Full size table

Data description

In this paper, we apply ten real networks from six fields to test the performance of MCGM, including one transportation network (USAir³⁵), one communication network (Email³⁶), one infrastructure network (Power³⁷), one technological network (Router³⁸), two collaboration networks (Jazz³⁹ and NS⁴⁰) and four social networks (PB⁴¹, Facebook⁴², WV⁴³ and Sex⁴⁴). Table 3 shows these networks’ topological features, including the number of nodes, the number of links, the average degree, the average distance, the clustering coefficient³⁷, denoted by C, the assortative coefficient⁴⁵, denoted by r, the degree heterogeneity⁴⁶, denoted by H, and the epidemic threshold³² of SIR model³¹.

Table 3 The topological features of ten real networks.

Full size table

Empirical results

Based on the above real networks, the well-known SIR model³¹ is used to compare the influential rankings produced by algorithms and simulations. Given the network and the transmission probability $\beta $, in order to guarantee the reliability of the results, 1000 independent realizations are executed and averaged to obtain the standard ranking of the influence of nodes (see details about SIR model in Preliminaries). In each realization, every node is selected once as the seed once. We apply the Kendall’s Tau ($\tau $) between the standard ranking and the ranking produced by the algorithm to measure the accuracy of an algorithm. Since $\tau \in \left[ -1,1\right] $, the closer the $\tau $ is to 1, the better the performance of the algorithm. The benchmark algorithms include degree centrality¹⁰ (DC), H-index¹¹, k-shell decomposition method¹² (KS), eigenvector centrality¹³ (EC), betweenness centrality¹⁴ (BC), closeness centrality¹⁵ (CC), DynamicRank⁴⁷ (DR), the extended version of gravity centrality¹⁸ (GC+), the extended version of improved gravity centrality¹⁹ (IGC+) and local gravity model²⁰ (LGM). Table 4 compares the accuracies of MCGM and the ten benchmark algorithms for $\beta =\beta _c$. Furthermore, the accuracies of different $\beta $ values (not too far from $\beta _c$) are shown in Fig. 2.

Table 4 The algorithms’ accuracies of MCGM and the benchmark algorithms measured by Kendall’s Tau for $\beta =\beta _c$.

Full size table

As shown in Table 4, the methods based on gravity law (GC+, IGC+, LGM and MCGM) show great advantages over the classic methods (DC, H-index, KS, EC, BC, CC), especially in Power, Router and NS, the advantage of the methods based on gravity law are extremely obvious. Notice that, except the above three networks, the performance of EC is significantly better than other classic methods, and even performs competitively in comparison with the methods based on gravity law, which indirectly shows that the stability of the method based on the gravity law is better and their performance will not decline precipitously due to the differences of networks. Furthermore, for the methods based on gravity law, MCGM generally performs best since it effectively considers more characteristics of nodes. As shown in Fig. 2, MCGM still performs very competitively compared with the ten benchmark algorithms for different $\beta $ not too far from $\beta _c$, suggesting the robustness of our findings.

Figure 3 shows the optimal truncation radius of MCGM in the ten real networks. It is not difficult to find that the optimal truncation radius of most networks is concentrated at $R=2$. Therefore, we may simply set $R=2$ to test the performance of MCGM. Table 5 compares the accuracies of MCGM with $R=2$ and the benchmark algorithms.

Table 5 The algorithms’ accuracies of MCGM with $R=2$ and the benchmark algorithms measured by Kendall’s Tau for $\beta =\beta _c$.

Full size table

As shown in Table 5, MCGM with $R=2$ generally performs best in comparison with the benchmark algorithms, it still obtains the best results in six of the ten real networks. Since the optimal truncation radius approximately scales linearly with the average distance²⁰, if the average distance of the network is relatively large, setting $R=2$ will have a significant impact on the performance of MCGM, such as Power whose average distance is 18.9892. Fortunately, most real networks have small-world property, $R^*$ tends to be small in most cases.

Furthermore, we need to compare MCGM and MCGM without normalization to illustrate the importance of normalization. Table 6 compares the accuracies of MCGM using Eq. (15), MCGM using Eq. (16) and MCGM using Eq. (18). As shown in Table 6, MCGM has been gradually improved by normalization, suggesting the importance of normalization and the effectiveness of our normalization strategy.

Table 6 The algorithms’ accuracies of MCGM using Eq. (15), MCGM using Eq. (16) and MCGM using Eq. (18) measured by Kendall’s Tau for $\beta =\beta _c$.

Full size table

Finally, we apply the monotonicity³⁴ to measure the resolution of different algorithms. As shown in Table 7, MCGM generally performs best even if it only considers 1-order neighbors or 2-order neighbors in most cases. The results reported in Table 7 demonstrate MCGM is a remarkably high-resolution algorithm.

Table 7 The monotonicity of different algorithms. The parameters in the related algorithms (i.e., LGM and MCGM) are adjusted to their optimal values subject to the largest $\tau $.

Full size table

Computational complexity

The computational complexity of the methods used in this paper is shown in Table 8. The computational complexity of DC, KS and EC is O(N), O(M) and $O(N+M)$, respectively. Therefore, it is obvious that the part with the highest computational complexity of MCGM is computing the R-order neighbors of each node, it needs $N \left\langle k \right\rangle ^{R}$ times operations. Hence the computational complexity of MCGM is $O(N \left\langle k \right\rangle ^{R})$. Since most real networks have small-world property, $R^*=2$ in most cases (see Fig. 3), so the computational complexity of MCGM is generally not more than $O(N \left\langle k \right\rangle ^2)$, where $\left\langle k \right\rangle \ll N$.

Table 8 The computational complexity of MCGM and the benchmark algorithms.

Full size table

Discussion

In summary, we propose a novel gravity model that effectively integrates multi-characteristics of nodes, named as multi-characteristics gravity model (MCGM). The number of neighbors, the influence of neighbors, the location of nodes, and the path information between nodes are all taken into consideration in our model. In addition, we propose a normalization strategy to solve the problem that different indices are not in the same order of magnitude, Table 6 suggests the importance of normalization and the effectiveness of our normalization strategy. Compared with well-known state-of-the-art methods, empirical analyses of the SIR spreading dynamics on ten real networks suggest that our model always performs very competitively, as shown in Table 4.

However, MCGM needs to find the optimal truncation radius by traversing the truncation radius and it is very time-consuming. Fortunately, the optimal truncation radius approximately scales linearly with the average distance²⁰, and most real networks have small-world property^37,48, so even if the truncation radius is just set to 2, MCGM still performs very competitively in most cases, as shown in Table 5. Therefore, without increasing the computational complexity, MCGM effectively considers more characteristics of nodes and obtains more accurate results.

Although the computational complexity of MCGM is not high, it needs the global topological structure, same as GC+ and IGC+. While LGM can work under the case where the global topology is not known. As a result, our suggestions for practical use are as follows: if the network’s global topology is known, apply MCGM and set R to 2, otherwise, apply LGM and set R to 2 or 3.

Of course, there are still some potential problems in the future. First of all, the gravity law is symmetrical, but due to the different effects of different nodes or the inherent asymmetry of dynamics^49,50, an asymmetric form of the gravity law may be relevant. Secondly, in weighted complex networks, the heterogeneity of links greatly changes nodes’ importance⁵¹, a weighted form of the gravity law may be relevant. Finally, in order to establish a unified research framework, a unified gravity model is needed to be proposed. Although GC+, IGC+ and LGM are proposed from different perspectives, a unified form of expression exists. We propose a rough model which intends to start further discussion on this issue. The rough unified gravity model is described as

$$\begin{aligned} UGM(i)=\sum _{d(i,j)\le R,j\ne {i}}\frac{(ak(i)+(1-a)k_{s}(i))(bk(j)+(1-b)k_{s}(j))}{d(i,j)^{2}}, \end{aligned}$$

(19)

where a and b are adjustable parameters. If $a=1$ and $b=1$, the unified gravity model degenerates to LGM. If $a=0$ and $b=0$, the unified gravity model degenerates to GC (GC+ can be obtained by Eq. (6)). If $a=0$ and $b=1$, the unified gravity model degenerates to IGC (IGC+ can be obtained by Eq. (8)).

Data availability

All relevant data are available at https://github.com/MLIF/Network-Data.

References

Barabási, A. L., Gulbahce, N. & Loscalzo, J. Network medicine: A network-based approach to human disease. Nat. Rev. Genet. 12, 56–68 (2011).
Article PubMed PubMed Central CAS Google Scholar
Zhu, P., Zhi, Q., Guo, Y. & Wang, Z. Analysis of epidemic spreading process in adaptive networks. IEEE Trans. Circuits Syst. II Express Briefs66, 1252–1256 (2018).
Borge-Holthoefer, J. & Moreno, Y. Absence of influential spreaders in rumor dynamics. Phys. Rev. E 85, 026116 (2012).
Article ADS CAS Google Scholar
Xu, W. et al. Identifying structural hole spanners to maximally block information propagation. Inf. Sci. 505, 100–126 (2019).
Article MathSciNet MATH Google Scholar
Albert, R., Albert, I. & Nakarado, G. L. Structural vulnerability of the North American power grid. Phys. Rev. E 69, 025103 (2004).
Article ADS CAS Google Scholar
Csermely, P., Korcsmáros, T., Kiss, H. J. M., London, G. & Nussinov, R. Structure and dynamics of molecular networks: A novel paradigm of drug discovery: a comprehensive review. Pharmacol. Ther. 138, 333–408 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bellingeri, M. & Bodini, A. Food web’s backbones and energy delivery in ecosystems. Sci. Rep. 125, 586–594 (2016).
Google Scholar
Bellingeri, M., Cassi, D. & Vincenzi, S. Increasing the extinction risk of highly connected species causes a sharp robust-to-fragile transition in empirical food webs. Ecol. Model. 251, 1–8 (2013).
Article Google Scholar
Lü, L. et al. Vital nodes identification in complex networks. Phys. Rep. 650, 1–63 (2016).
Article ADS MathSciNet Google Scholar
Bonacich, P. Factoring and weighting approaches to status scores and clique identification. Math. Sociol. 2, 113–120 (1972).
Article Google Scholar
Lü, L., Zhou, T., Zhang, Q. M. & Stanley, H. E. The H-index of a network node and its relation to degree and coreness. Nat. Commun. 7, 10168 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Kitsak, M. et al. Identification of influential spreaders in complex networks. Nat. Phys. 6, 888–893 (2010).
Article CAS Google Scholar
Bonacich, P. Some unique properties of eigenvector centrality. Soc. Netw. 29, 555–564 (2007).
Article Google Scholar
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977).
Article Google Scholar
Freeman, L. C. Centrality in social networks conceptual clarification. Soc. Netw. 1, 215–239 (1979).
Article Google Scholar
Karemera, D., Oguledo, V. I. & Davis, B. A gravity model analysis of international migration to North America. Appl. Econ. 32, 1745–1755 (2000).
Article Google Scholar
Porojan, A. Trade flows and spatial effects: The gravity model revisited. Open Econ. Rev. 12, 265–280 (2001).
Article MATH Google Scholar
Ma, L. L., Ma, C., Zhang, H. F. & Wang, B. H. Identifying influential spreaders in complex networks based on gravity formula. Phys. A 451, 205–212 (2015).
Article MATH Google Scholar
Wang, J., Li, C. & Xia, C. Improved centrality indicators to characterize the nodal spreading capability in complex networks. Appl. Math. Comput. 334, 388–400 (2018).
MathSciNet Google Scholar
Li, Z. et al. Identifying influential spreaders by gravity model. Sci. Rep. 9, 8387 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Liu, F., Wang, Z. & Deng, Y. GMM: A generalized mechanics model for identifying the importance of nodes in complex networks. Knowl. Based Syst. 193, 105464 (2020).
Article Google Scholar
Yang, X. & Xiao, F. An improved gravity model to identify influential nodes in complex networks based on k-shell method. Knowl. Based Syst. 227, 107198 (2021).
Article Google Scholar
Shang, Q., Deng, Y. & Cheong, K. H. Identifying influential nodes in complex networks: Effective distance gravity model. Inform. Sciences 577, 162–179 (2021).
Article MathSciNet Google Scholar
Ullah, A. et al. Identification of nodes influence based on global structure model in complex networks. Sci. Rep. 11, 6173 (2021).
Article ADS PubMed PubMed Central CAS Google Scholar
Yan, X., Cui, Y. & Ni, S. Identifying influential spreaders in complex networks based on entropy weight method and gravity law. Chinese Phys. B 29, 048902 (2020).
Article ADS Google Scholar
Li, H., Shang, Q. & Deng, Y. A generalized gravity model for influential spreaders identification in complex networks. Chaos Solitons Fract. 143, 110456 (2021).
Article MathSciNet MATH Google Scholar
Li, Z. & Huang, X. Y. Identifying influential spreaders in complex networks by an improved gravity model. Sci. Rep. 11, 22194 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Huang, X., Chen, D., Wang, D. & Ren, T. Identifying influencers in social networks. Entropy 22, 450 (2020).
Article MathSciNet PubMed Central Google Scholar
Maji, G., Namtirtha, A., Dutta, A. & Malta, M. C. Influential spreaders identification in complex networks with improved k-shell hybrid method. Expert Syst. Appl. 144, 113092 (2020).
Article Google Scholar
Wang, X., Yang, Q., Liu, M. & Ma, X. Comprehensive influence of topological location and neighbor information on identifying influential nodes in complex networks. PLoS ONE 16, e0251208 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hethcote, H. W. The mathematics of infectious diseases. SIAM Rev. 42, 599–653 (2009).
Article ADS MathSciNet MATH Google Scholar
Castellano, C. & Pastor-Satorras, R. Thresholds for epidemic spreading in networks. Phys. Rev. Lett. 105, 218701 (2010).
Article ADS PubMed CAS Google Scholar
Kendall, M. A new measure of rank correlation. Biometrika 30, 81–89 (1938).
Article MATH Google Scholar
Bae, J. & Kim, S. Identifying and ranking influential spreaders in complex networks by neighborhood coreness. Phys. A 395, 549–559 (2014).
Article MathSciNet MATH Google Scholar
Batageli, V. & Mrvar, A. Pajek Datasets. Available at http://vlado.fmf.uni-lj.si/pub/networks/data/. (2007).
Guimerà, R., Danon, L., Díaz-Guilera, A., Giralt, F. & Arenas, A. Self-similar community structure in a network of human interactions. Phys. Rev. E 68, 065103 (2003).
Article ADS CAS Google Scholar
Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
Article ADS CAS PubMed MATH Google Scholar
Spring, N., Mahajan, R., Wetherall, D. & Anderson, T. Measuring ISP topologies with rocketfuel. IEEE/ACM Trans. Networking 12, 2–16 (2004).
Article Google Scholar
Gleiser, P. & Danon, L. Community structure in Jazz. Adv. Complex Syst. 6, 565 (2003).
Article Google Scholar
Newman, M. E. J. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 036104 (2006).
Article ADS MathSciNet CAS Google Scholar
Adamic, L. A. & Glance, N. The political blogosphere and the 2004 U.S. election: divided they blog. In Proceedings of the 3rd international workshop on Link discovery. 36-43 (ACM Press, 2005).
Mcauley, J. J. & Leskovec, J. Learning to discover social circles in ego networks. Adv. Neural. Inf. Process. Syst. 25, 548–556 (2012).
Google Scholar
Leskovec, J., Huttenlocher, D. & Kleinberg, J. Predicting positive and negative links in online social networks. In Proceedings of the 19th international conference on World Wide Web. 641-650 (ACM Press, 2010).
Rocha, L. E., Liljeros, F. & Holme, P. Simulated epidemics in an empirical spatiotemporal network of 50,185 sexual contacts. PLoS Comput. Biol. 7, e1001109 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Newman, M. E. J. Assortative mixing in networks. Phys. Rev. Lett. 89, 208701 (2002).
Article ADS CAS PubMed Google Scholar
Hu, H. B. & Wang, X. F. Unified index to quantifying heterogeneity of complex networks. Phys. A 387, 3769–3780 (2008).
Article Google Scholar
Chen, D. B., Sun, H. L., Tang, Q., Tian, S. Z. & Xie, M. Identifying influential spreaders in complex networks by propagation probability dynamics. Chaos 29, 033120 (2019).
Article ADS MathSciNet PubMed Google Scholar
Amaral, L. A. N., Scala, A., Barthelemy, M. & Stanley, H. E. Classes of small-world networks. PNAS 97, 11149–11152 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Yan, G., Fu, Z. Q. & Chen, G. Epidemic threshold and phase transition in scale-free networks with asymmetric infection. Eur. Phys. J. B 65, 591–594 (2008).
Article ADS CAS MATH Google Scholar
Wang, W. et al. Asymmetrically interacting spreading dynamics on complex layered networks. Sci. Rep. 4, 5097 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bellingeri, M., Bevacqua, D., Scotognella, F. & Cassi, D. The heterogeneity in link weights may decrease the robustness of real-world complex weighted networks. Sci. Rep. 9, 10692 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors greatly appreciate the reviews’ suggestions and the editor’s encouragement.

Author information

Authors and Affiliations

Software College, Shenyang University of Technology of China, Shenyang, 110870, People’s Republic of China
Zhe Li
Software College, Northeastern University of China, Shenyang, 110819, People’s Republic of China
Xinyu Huang

Authors

Zhe Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.L. devised the research project. Z.L. performed the research. Z.L. and X.H. analyzed the data. Z.L. and X.H. wrote the paper.

Corresponding authors

Correspondence to Zhe Li or Xinyu Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Z., Huang, X. Identifying influential spreaders by gravity model considering multi-characteristics of nodes. Sci Rep 12, 9879 (2022). https://doi.org/10.1038/s41598-022-14005-3

Download citation

Received: 14 February 2022
Accepted: 31 May 2022
Published: 14 June 2022
DOI: https://doi.org/10.1038/s41598-022-14005-3

This article is cited by

Predicting nodal influence via local iterative metrics
- Shilun Zhang
- Alan Hanjalic
- Huijuan Wang
Scientific Reports (2024)
Identifying influential nodes based on the disassortativity and community structure of complex network
- Zuxi Wang
- Ruixiang Huang
- Zhong Chen
Scientific Reports (2024)
Time and cost-effective online advertising in social Internet of Things using influence maximization problem
- Reza Molaei
- Kheirollah Rahsepar Fard
- Asgarali Bouyer
Wireless Networks (2024)
Identifying influential nodes in complex networks using a gravity model based on the H-index method
- Siqi Zhu
- Jie Zhan
- Xing Li
Scientific Reports (2023)
A method based on k-shell decomposition to identify influential nodes in complex networks
- Bakhtyar Rafeeq HamaKarim
- Rojiar Pir Mohammadiani
- Mehri Bahrami
The Journal of Supercomputing (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Preliminaries

Well-known state-of-the-art methods

The SIR model

The Kendall’s Tau

The monotonicity

Results

Algorithms

Data description

Empirical results

Computational complexity

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links