Core-periphery structure requires something else in the network

Sadamori Kojaku; Naoki Masuda

doi:10.1088/1367-2630/aab547

1. Introduction

Many complex systems, biological, physical or social, can be represented by networks [1, 2]. A network consists of a set of nodes and edges, where nodes represent objects (e.g., people, web pages) and edges represent pairwise relationships between objects (e.g., friendships, hyperlinks). A consistent observation across different types of networks is that they are often composed of communities, i.e., groups of densely interconnected nodes [3]. A community is often associated with a group of nodes sharing a role or similarity such as a circle of friends in social networks [4], a set of web pages discussing the same topic [5, 6] and a functional group of proteins [7].

Core-periphery structure is another mesoscopic structure of networks that has experienced a surge of interests in the last two decades. A core-periphery structure in its simplest form refers to a partition of a network into two groups of nodes called core and periphery, where core nodes are densely interconnected (i.e., adjacent), and peripheral nodes are adjacent to the core nodes but not to other peripheral nodes [8–10]. Core-periphery structure has been detected in a number of networks including social networks [8, 10–20], protein–protein interaction networks [15, 21, 22], neural networks [23, 24], trade networks [13, 25, 26], financial networks [14, 27–31] and transportation networks [10, 13, 14, 19, 32]. For example, in a world-trade network among countries, economically strong countries trade with other strong countries, constituting a core. Economically weak countries mainly trade with strong countries, constituting a periphery [13, 26].

Borgatti and Everett analysed core-periphery structure in quantitative terms for the first time [8]. They expressed a core-periphery structure by a core block (i.e., group of core nodes) and a periphery block (i.e., group of peripheral nodes) as shown in figure 1. The core block has many intra-block edges (the top left block in figure 1). The periphery block has relatively few intra-block edges (the bottom right block in figure 1). There may be many inter-block edges (off-diagonal blocks in figure 1) [8, 10, 12, 15, 19, 20, 24] or relatively few inter-block edges [8, 10, 16, 17, 25, 26, 31, 33–35]. The core-periphery structure expressed by blocks of nodes is classified as a discrete variant of core-periphery structure based on edge density [8–10, 12, 14, 17–20, 26, 33, 35]. There are other types of core-periphery and related structure, such as continuous versions of core-periphery structure [8, 10, 14, 17, 25], transport-based core-periphery structure [13, 14, 17, 21, 36], k-core [37] and rich-clubs [38, 39].

**Figure 1.** Adjacency matrix of a network with good-looking core-periphery structure composed of two blocks. The filled and open cells in the ith row and jth column indicate the presence and absence of the edge between nodes i and j, respectively. The dotted lines represent the boundary between the core and periphery.
Download figure:
Standard image High-resolution image

Given that block structure of networks, or equivalently, hard partitioning of the nodes into groups, has spurred many studies such as community detection [3, 40] and the inference of stochastic block models (SBM) [6, 41], as well as its appeal to intuition, we focus on the discrete version of core-periphery structure based on edge density in the present paper. If a network has such core-periphery structure, the core block should have more intra-block edges and the periphery block should have fewer intra-block edges than a reference. We argue that the core-periphery structure that Borgatti and Everett proposed (figure 1), which many of the subsequent work is based on, is impossible if we use the configuration model [42] as the null model and there are just one core and one periphery. The configuration model is a common class of random graph models that preserve the degree or its mean value of each node. Therefore, our claim implies that there is no core-periphery structure a la mode de Borgatti and Everett beyond the expectation from the degree of each node (i.e., hubs are core nodes), which is, in fact, consistent with some previous observations [16, 28, 30].

Then, we are led to a question: what is a core-periphery structure? To answer this question, let us look at the status of the configuration model in other measurements of networks. We have a plethora of centrality measures for nodes because the degree is often not a useful measure of the importance of nodes [1]. In other words, different centrality measures provide rank orders of nodes in the given network that are not expected from the configuration model. In network motif analysis, where one looks for small subnetworks that are abundant in a given network, we discount the frequency of subnetworks that are merely explained by the degree of the nodes (i.e., configuration model) [43]. In community detection, it is conventional to use the configuration model as the null model against which one assesses the significance of community structure [3, 4, 6, 41]. To solve the conundrum that one does not discover core-periphery structure using the configuration model as the null model, we propose that one must add at least one different block apart from a core block and the corresponding periphery block for a network to have core-periphery structure that is consistent with figure 1. Such blocks may be a community, sparsely connected part, a different core-periphery pair [10, 19, 20, 32], a core that shares the periphery with the focal core-periphery pair [34] and so forth. Then, we propose a scalable algorithm to partition a network into multiple core-periphery pairs including community detection as special cases, aiming to detect core-periphery structure that is not merely explained by the degree of each node. Crucially, we use the configuration model as the null model, which is different from our previous algorithm [19].

2. Core-periphery structure needs at least three blocks

Consider an unweighted network composed of N nodes and M edges. The N × N adjacency matrix of the network is denoted by ${\boldsymbol{A}}=({A}_{{ij}})$ , where A_ij = 1 if nodes i and j ( $\ne i$ ) are adjacent and A_ij = 0 otherwise. We assume that the network is undirected (i.e., A_ij = A_ji for all $i\ne j$ ) and has no self-loops (i.e., A_ii = 0 for all i). Let d_i be the number of edges incident to node i (i.e., degree). As the null model of networks, we use the configuration model, i.e., a random network model preserving the degree of each node. For the configuration model, we allow multi-edges (i.e., multiple edges between nodes) and self-loops for computational ease. In fact, multi-edges and self-loops change our quality function for finding core-periphery structure in the order of $1/N$ , which is negligible if N is large. We denote by ${\mathbb{E}}[\cdot ]$ the expectation with respect to the configuration model.

Consider a partition of the set of N nodes into B blocks (i.e., groups). Let N_u be the number of nodes in block u and m_uv be the number of edges between blocks u and v. Note that m_uv = m_vu. For notational convenience, we define m_uu as twice the number of self-loops in block u plus twice the number of edges between different nodes within block u. Denote by ${m}_{{uv}}^{\mathrm{conf}}$ the number of edges between blocks u and v in a network generated by the configuration model whose degree sequence is given by that of the original network. Suppose a network composed of B = 2 blocks (figure 2(a)). There are potentially six types of block structure of networks represented by two blocks. In figures 2(b)–(g), a filled block has more edges than that for the configuration model (i.e., ${m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]$ ), and a blank block has fewer edges than that for the configuration model (i.e., ${m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]$ ). The entire network would be dense if there are many intra- and inter-block edges (figure 2(b)). In contrast, the network would be sparse if there are relatively few intra- and inter-block edges (figure 2(c)). The network has community structure if there are many intra-block edges and relatively few inter-block edges (figure 2(d)). A contrasting case is a structure close to a bipartite network, where there are relatively few intra-block edges and many inter-block edges (figure 2(e)). Core-periphery structure would correspond to the case in which there are many edges within one block and few edges within the other block. With core-periphery structure, inter-block edges may be abundant (figure 2(f)) [8, 10, 12, 15, 19, 20, 24] or not (figure 2(g)) [8, 10, 16, 17, 25, 26, 31, 33–35].

**Figure 2.** Schematic illustration of network structures composed of two blocks. The filled cells indicate that there are more edges than that expected for the configuration model (i.e., ${m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}]$ ). The open cells indicate that there are fewer edges than that expected for the configuration model (i.e., ${m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}]$ ). Only the structures shown in (d) and (e) are possible.
Download figure:
Standard image High-resolution image

**Figure 2.** Schematic illustration of network structures composed of two blocks. The filled cells indicate that there are more edges than that expected for the configuration model (i.e., ${m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}]$ ). The open cells indicate that there are fewer edges than that expected for the configuration model (i.e., ${m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}]$ ). Only the structures shown in (d) and (e) are possible.
Download figure:
Standard image High-resolution image

Many algorithms for finding discrete versions of core-periphery structure seek a partition of nodes into one core block and one periphery block (figures 2(f) or (g)). Let us consider the karate club network [44], which has been demonstrated to have core-periphery structure [10, 13, 19, 20, 26, 32, 35]. The Borgatti–Everett (BE) algorithm partitions the N = 34 nodes into a core and a periphery as shown in figure 3. The detected blocks seem to suggest core-periphery structure because the core nodes are densely interconnected, whereas the peripheral nodes are sparsely interconnected. However, relative to the configuration model, the network is closer to a bipartite network than to core-periphery structure; there are fewer edges within both core and periphery blocks (i.e., m₁₁ = 10, ${\mathbb{E}}[{m}_{11}^{\mathrm{conf}}]=26.25$ , m₂₂ = 38 and ${\mathbb{E}}[{m}_{22}^{\mathrm{conf}}]=54.25$ ) and more edges between the core and periphery blocks than those expected for the configuration model (i.e., m₁₂ = 54 and ${\mathbb{E}}[{m}_{12}^{\mathrm{conf}}]=37.74$ ).

**Figure 3.** Core-periphery structure of the karate club network detected by the BE algorithm. The nodes are reordered. The filled and open cells represent the presence and absence of edges, respectively.
Download figure:
Standard image High-resolution image

This observation is in fact universal; core-periphery structure is impossible with two blocks when the null model is the configuration model. To show this, consider a network composed of B = 2 blocks. The degree of each node is the same between the original network and a sample network generated by the configuration model. Therefore, the number of edges emanating from each block is also the same between the original network and the sample network. Therefore, we obtain

$\begin{eqnarray}&&{m}_{11}+{m}_{12}={\mathbb{E}}[{m}_{11}^{\mathrm{conf}}]+{\mathbb{E}}[{m}_{12}^{\mathrm{conf}}],\end{eqnarray} \tag{ 1 }$

$\begin{eqnarray}&&{m}_{21}+{m}_{22}={\mathbb{E}}[{m}_{21}^{\mathrm{conf}}]+{\mathbb{E}}[{m}_{22}^{\mathrm{conf}}].\end{eqnarray} \tag{ 2 }$

Rearranging equations (1) and (2) yields

$\begin{eqnarray}&&{m}_{11}-{\mathbb{E}}[{m}_{11}^{\mathrm{conf}}]=-({m}_{12}-{\mathbb{E}}[{m}_{12}^{\mathrm{conf}}]),\end{eqnarray} \tag{ 3 }$

$\begin{eqnarray}&&{m}_{22}-{\mathbb{E}}[{m}_{22}^{\mathrm{conf}}]=-({m}_{21}-{\mathbb{E}}[{m}_{21}^{\mathrm{conf}}]).\end{eqnarray} \tag{ 4 }$

Equations (3) and (4) imply that if a block has more intra-block edges in the original network than in the configuration model, the same block must have fewer inter-block edges in the original network than in the configuration model. Because we assumed that the network is undirected, we obtain m₂₁ = m₁₂ and ${m}_{21}^{\mathrm{conf}}={m}_{12}^{\mathrm{conf}}$ . Using these relationships, we rewrite equation (4) as

$\begin{eqnarray}&&{m}_{22}-{\mathbb{E}}[{m}_{22}^{\mathrm{conf}}]=-({m}_{12}-{\mathbb{E}}[{m}_{12}^{\mathrm{conf}}]).\end{eqnarray} \tag{ 5 }$

By combining equations (3) and (5), we obtain

$\begin{eqnarray}&&{m}_{11}-{\mathbb{E}}[{m}_{11}^{\mathrm{conf}}]={m}_{22}-{\mathbb{E}}[{m}_{22}^{\mathrm{conf}}].\end{eqnarray} \tag{ 6 }$

Equation (6) indicates that there is no network composed of two blocks such that the core block has more edges in the original network than the configuration model (i.e., ${m}_{11}\gt {\mathbb{E}}[{m}_{11}^{\mathrm{conf}}]$ ) and the periphery block has fewer edges in the original network than the configuration model (i.e., ${m}_{22}\lt {\mathbb{E}}[{m}_{22}^{\mathrm{conf}}]$ ). Therefore, the core-periphery structure does not exist if one partitions a network into a single core block and a single periphery block, as the BE algorithm does. It should be noted that equations (3) and (4) imply that the networks represented by figures 2(b) and (c) are also impossible. In contrast, the networks shown in figures 2(d) and (e) satisfy equations (3), (4), and (6) and therefore are possible.

Core-periphery structure is possible if the network has B = 3 or more blocks. To identify the block structures that are possible and those that are not, we introduce the notion of compatibility of block structure as follows. Consider a network composed of B blocks. The number of edges between blocks u and v in the original network is given by

$\begin{eqnarray}&&{m}_{{uv}}=\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}\delta ({b}_{i},u)\delta ({b}_{j},v),\end{eqnarray} \tag{ 7 }$

where b_i is the index of the block to which node i belongs, and δ(·, ·) is Kronecker delta. Equation (7) leads to

$\begin{eqnarray}\displaystyle \sum _{v=1}^{B}{m}_{{uv}} & = & \displaystyle \sum _{v=1}^{B}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}\delta ({b}_{i},u)\delta ({b}_{j},v)=\displaystyle \sum _{i=1}^{N}\delta ({b}_{i},u)\displaystyle \sum _{j=1}^{N}{A}_{{ij}}\displaystyle \sum _{v=1}^{B}\delta ({b}_{j},v)\\ & = & \displaystyle \sum _{i=1}^{N}\delta ({b}_{i},u)\displaystyle \sum _{j=1}^{N}{A}_{{ij}}=\displaystyle \sum _{i=1}^{N}{d}_{i}\delta ({b}_{i},u),\quad u=1,2,\,\ldots ,\,B,\end{eqnarray} \tag{ 8 }$

where ${d}_{i}={\sum }_{j=1}^{N}{A}_{{ij}}$ is the degree of node i. The sum ${\sum }_{v=1}^{B}{m}_{{uv}}$ is the sum of the degree of nodes in block u. Because the configuration model preserves the degree of each node, the sum ${\sum }_{v=1}^{B}{m}_{{uv}}$ is the same between the original network and the configuration model, i.e.,

$\begin{eqnarray}&&\displaystyle \sum _{v=1}^{B}{m}_{{uv}}=\displaystyle \sum _{v=1}^{B}{\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}],\quad u=1,2,\,\ldots ,\,B.\end{eqnarray} \tag{ 9 }$

Note that equation (9) generalises equations (1) and (2). Then, we categorise blocks into dense (i.e., ${m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]$ ) and sparse (i.e., ${m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]$ ) blocks. We say that a block structure is compatible if the designated dense and sparse blocks are realisable in the sense that equation (9) is satisfied. We describe the procedures to find compatible block structures in appendix A.

With B = 3 blocks, eight types of block structure are compatible with equation (9) (figures 4(a)–(h)). The networks shown in figures 4(a) and (b) consist of two and three communities, respectively. The networks shown in figures 4(c) and (d) are bipartite-like and tripartite-like networks, respectively. The network shown in figure 4(e) is a union of a bipartite-like subnetwork composed of blocks 1 and 2 and a community composed of block 3. These network structures extend those viable in the case of two blocks (figures 2(d) and (e)). The networks shown in figures 4(f)–(h) contain core-periphery pairs. In figure 4(f), blocks 1 and 2 constitute a core-periphery pair, and block 3 constitutes a community. In figure 4(g), blocks 1 and 2 constitute a core-periphery pair, and blocks 2 and 3 constitute a bipartite-like subnetwork. The network shown in figure 4(h) consists of two cores (i.e., blocks 1 and 2) sharing a periphery (i.e., block 3), which is the structure studied in [34].

**Figure 4.** Schematic illustration of network structures with three or four blocks that are compatible with equation (9). We show all network structures composed of three blocks in panels (a)–(h) and four out of the 49 structures with four blocks in panels (i)–(ℓ). The other 45 patterns with four blocks are shown in figure B1.
Download figure:
Standard image High-resolution image

With B = 4 blocks, 49 types of block structure are compatible with equation (9). Four of them are shown in figures 4(i)–(ℓ) for illustration (see figure B1 for the others). The network shown in figure 4(i) is composed of two non-overlapping core-periphery pairs [19, 20, 24, 32]. The network shown in figure 4(j) consists of one core-periphery pair (i.e., blocks 1 and 2) and one bipartite-like subnetwork (i.e., blocks 3 and 4). The network shown in figure 4(k) consists of one core-periphery pair (i.e., blocks 1 and 2), one bipartite-like subnetwork (i.e., blocks 2 and 3) and a community (i.e., block 4), in which the core-periphery pair and bipartite-like subnetwork overlap. The network shown in figure 4(ℓ) has three overlapping communities, i.e., a community composed of blocks 1 and 2, one composed of blocks 2 and 3, and one composed of blocks 3 and 4.

To conclude, the core-periphery structure a la mode de Borgatti and Everett [8] relative to the configuration model can exist only when we have at least three blocks. In other words, a core-periphery pair requires a different substructure of the network that coexists in the same network, e.g., a community, bipartite-like structure, or another core-periphery pair that may overlap with the first one.

3. Methods

In this section, we first describe a new algorithm for detecting core-periphery structure, which we refer to as KM–config, based on the observations made in section 2. MATLAB and C++ codes of KM–config are available at https://github.com/skojaku/km_config/. Then, we explain other methods and data used in section 4.

3.1. Our algorithm

3.1.1. Objective function

We propose an algorithm, KM–config, to detect discrete versions of core-periphery structure in networks. In contrast to our previous algorithm that uses the Erdős–Rényi random graph as the null model [19], which we refer to as KM–ER, here we use the configuration model as the null model. This is because we are interested in the structure that is not merely explained by the node's degree.

We assume that a network consists of C non-overlapping core-periphery pairs, each of which is composed of one core block and one periphery block, e.g., figure 4(i). Each core-periphery pair should have (i) many intra-core edges, (ii) many edges between the core and the corresponding periphery (i.e., core-periphery edges), (iii) few intra-periphery edges and (iv) few edges to other core-periphery pairs (i.e., inter-pair edges). Although some previous studies do not assume property (ii) [8, 25, 27–29, 31–33, 35], we require it because otherwise one cannot relate a periphery with a particular core.

We define idealised core-periphery pairs satisfying properties (i)–(iv) [19, 24] by

$\begin{eqnarray}&&{{\boldsymbol{A}}}^{* }=({A}_{{ij}}^{* }),\quad {A}_{{ij}}^{* }\equiv ({x}_{i}+{x}_{j}-{x}_{i}{x}_{j})\delta ({c}_{i},{c}_{j}),\end{eqnarray} \tag{ 10 }$

where x_i = 1 or x_i = 0 if node i is a core node or a peripheral node, respectively, and c_i ( $1\leqslant {c}_{i}\leqslant C$ ) is the index of the core-periphery pair to which node i belongs. Within each idealised core-periphery pair, every core node is adjacent to every other core node (property (i)) and also adjacent to all the corresponding peripheral nodes (property (ii)), and every peripheral node is not adjacent to any other peripheral nodes (property (iii)). Furthermore, there are no edges between different idealised core-periphery pairs (property (iv)).

We seek c_i and x_i (1 ≤ i ≤ N) that maximise similarity between ${\boldsymbol{A}}$ and ${{\boldsymbol{A}}}^{* }$ as defined by

$\begin{eqnarray}&&{Q}_{\mathrm{config}}^{\mathrm{cp}}\equiv \displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}{A}_{{ij}}^{* }-{\mathbb{E}}\left[\displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}^{\mathrm{conf}}{A}_{{ij}}^{* }\right],\end{eqnarray} \tag{ 11 }$

where ${{\boldsymbol{A}}}^{\mathrm{conf}}=({A}_{{ij}}^{\mathrm{conf}})$ is the adjacency matrix of a network generated by the configuration model. The first term on the right-hand side of equation (11) is the fraction of intra-core and core-periphery edges (i.e., ${A}_{{ij}}={A}_{{ij}}^{* }=1$ ), corresponding to properties (i) and (ii). The second term is the counterpart for the configuration model. The factor $1/2M$ in the first and second terms normalises ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ to range in [−1, 1]. The remaining two properties (iii) and (iv) are also consistent with the maximisation of ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ . To show this, we rewrite ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ as

$\begin{eqnarray}&&{Q}_{\mathrm{config}}^{\mathrm{cp}}=-\displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}(1-{A}_{{ij}}^{* })+{\mathbb{E}}\left[\displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}{A}_{{ij}}^{\mathrm{conf}}(1-{A}_{{ij}}^{* })\right].\end{eqnarray} \tag{ 12 }$

Because ${\sum }_{i=1}^{N}{\sum }_{j=1}^{N}{A}_{{ij}}(1-{A}_{{ij}}^{* })$ is the sum of the number of intra-periphery edges and that of inter-pair edges, the maximisation of ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ minimises the two types of edges associated with properties (iii) and (iv).

In the configuration model, the expected number of edges between nodes i and j is given by ${\mathbb{E}}[{A}_{{ij}}^{\mathrm{conf}}]={d}_{i}{d}_{j}/2M$ [45, 46]. Substitution of ${\mathbb{E}}[{A}_{{ij}}^{\mathrm{conf}}]={d}_{i}{d}_{j}/2M$ and equation (10) into equation (11) yields

$\begin{eqnarray}&&{Q}_{\mathrm{config}}^{\mathrm{cp}}=\displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}\left({A}_{{ij}}-\displaystyle \frac{{d}_{i}{d}_{j}}{2M}\right)({x}_{i}+{x}_{j}-{x}_{i}{x}_{j})\delta ({c}_{i},{c}_{j}).\end{eqnarray} \tag{ 13 }$

If we restrict that all nodes are core nodes (i.e., x_i = 1 for i = 1, 2, ..., N), ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ is equivalent to the modularity [4, 46], which is used for finding communities in networks. The ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ shares shortcomings with the modularity such as the resolution limit. See section 5 for further discussion.

3.1.2. Relationship to Markov stability

We can relate ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ to discrete-time random walks, similar to the case of the Markov stability formalism for community detection [47–50]. Consider a random walker that moves from a node to one of the neighbouring nodes selected uniformly at random in each discrete time step. Let ${T}_{(c,x)(c^{\prime} ,x^{\prime} )}\equiv {m}_{(c,x)(c^{\prime} ,x^{\prime} )}/{D}_{(c,x)}$ be the transition probability from block (c, x) to block (c', x'), where D_(c,x) is the sum of the degree of the nodes in block (c, x). Let ${\pi }_{(c,x)}\equiv {D}_{(c,x)}/2M$ be the stationary probability with which the random walker visits block (c, x). Then, one can rewrite ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ as

$\begin{eqnarray}{Q}_{\mathrm{config}}^{\mathrm{cp}} & = & \displaystyle \frac{1}{2M}\displaystyle \sum _{c=1}^{C}\left({m}_{(c,1)(c,1)}+2{m}_{(c,0)(c,1)}-\displaystyle \frac{{D}_{(c,1)}^{2}}{2M}-\displaystyle \frac{2{D}_{(c,0)}{D}_{(c,1)}}{2M}\right)\\ & = & \displaystyle \sum _{c=1}^{C}\left(\displaystyle \frac{{D}_{(c,1)}}{2M}\cdot \displaystyle \frac{{m}_{(c,1)(c,1)}}{{D}_{(c,1)}}+\displaystyle \frac{2{D}_{(c,0)}}{2M}\cdot \displaystyle \frac{{m}_{(c,0)(c,1)}}{{D}_{(c,0)}}-\displaystyle \frac{{D}_{(c,1)}^{2}}{4{M}^{2}}-\displaystyle \frac{2{D}_{(c,0)}}{2M}\cdot \displaystyle \frac{{D}_{(c,1)}}{2M}\right)\\ & = & \displaystyle \sum _{c=1}^{C}({\pi }_{(c,1)}{T}_{(c,1)(c,1)}+2{\pi }_{(c,0)}{T}_{(c,0)(c,1)}-{\pi }_{(c,1)}^{2}-2{\pi }_{(c,0)}{\pi }_{(c,1)}).\end{eqnarray} \tag{ 14 }$

Now, imagine a random walker starting from a node i selected randomly according to the stationary density ${d}_{i}/2M$ (1 ≤ i ≤ N), at time t = 0. The probability that the random walker is in block (c, x) at time t = 0 and block (c', x') at time t = 1 is given by ${\pi }_{(c,x)}{T}_{(c,x)(c^{\prime} ,x^{\prime} )}$ , which is accounted for by the first and second terms of the right-hand side of equation (14). The corresponding probability for the configuration model is given by ${\pi }_{(c,x)}{\pi }_{(c^{\prime} ,x^{\prime} )}$ , which is accounted for by the third and fourth terms Therefore, ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ measures how likely a random walker moves to the core of the currently visited node in one step relative to the probability expected for the configuration model. This observation is exploited in a different algorithm to detect core-periphery structure of networks [13].

3.1.3. Maximisation of the objective function

We maximise ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ using a label switching heuristic [51, 52], which we have employed in our previous algorithm, KM–ER, that uses the Erdős–Rényi random graph as the null model [19]. First, we initialise the labels by c_i = i and x_i = 1 (1 ≤ i ≤ N). Then, we update the label of each node as follows. Suppose that node i has a neighbour in a core-periphery pair c'. We tentatively assign node i to the core (i.e., (c_i, x_i) = (c', 1)) and compute the new value of ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ . We also tentatively assign node i to the periphery (i.e., (c_i, x_i) = (c', 0)) and compute ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ . We perform the tentative assignments for all the core-periphery pairs to which any neighbour of node i belongs. If any tentative assignments do not raise ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ , we do not update (c_i, x_i). Otherwise, we update (c_i, x_i) to the tentative label (i.e., (c', 0) or (c', 1)) giving the largest increment in ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ . We inspect each node in a random order. If no node has changed its label during the inspection of all the N nodes, we stop updating the labels. Otherwise, we draw a new random order and inspect each node according to the new random order, and repeat the procedure. We run this algorithm ten times starting from the same initial condition and adopt the node labelling that realises the largest value of ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ .

The increment in ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ caused by updating node i's label from (c, x) to (c', x') is given by

$\begin{eqnarray}&&\displaystyle \frac{1}{M}\left[{\tilde{d}}_{i,(c^{\prime} ,1)}+x^{\prime} {\tilde{d}}_{i,(c^{\prime} ,0)}-{d}_{i}\displaystyle \frac{{D}_{(c^{\prime} ,1)}+x^{\prime} {D}_{(c^{\prime} ,0)}}{2M}-\displaystyle \frac{{d}_{i}^{2}}{4M}(x^{\prime} -2(x+x^{\prime} -{xx}^{\prime} )\delta (c,c^{\prime} ))\right]\\ &&\quad -\,\displaystyle \frac{1}{M}\left[{\tilde{d}}_{i,(c,1)}+x{\tilde{d}}_{i,(c,0)}-{d}_{i}\displaystyle \frac{{D}_{(c,1)}+{{xD}}_{(c,0)}}{2M}+\displaystyle \frac{{d}_{i}^{2}}{4M}x\right],\end{eqnarray} \tag{ 15 }$

where ${\tilde{d}}_{i,(c,x)}={\sum }_{j=1}^{N}{A}_{{ij}}\delta ({c}_{j},c)\delta ({x}_{j},x)$ is the number of edges connecting node i and block (c, x). When inspecting node i, we calculate equation (15) at most 2d_i times. Therefore, the time needed for inspecting all nodes is ${ \mathcal O }({\sum }_{i=1}^{N}{d}_{i})={ \mathcal O }(M)$ , and that of the entire algorithm is ${ \mathcal O }(M\times (\mathrm{the}\ \mathrm{number}\ \mathrm{of}\ \mathrm{inspections}\ \mathrm{over}\ \mathrm{the}\,N\,\mathrm{nodes})$ ).

3.1.4. Statistical test

We define the quality q of a core-periphery pair c by its contribution to ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ , i.e.,

$\begin{eqnarray}&&q\equiv \displaystyle \frac{1}{2M}\displaystyle \sum _{i=1}^{N}\displaystyle \sum _{j=1}^{N}\left({A}_{{ij}}-\displaystyle \frac{{d}_{i}{d}_{j}}{2M}\right)({x}_{i}+{x}_{j}-{x}_{i}{x}_{j})\delta ({c}_{i},c)\delta ({c}_{j},c).\end{eqnarray} \tag{ 16 }$

One may deem that a core-periphery pair is significant if its q is statistically larger than the value expected for the configuration model. However, q may depend on the size (i.e., the number of nodes) n of the core-periphery pair, as is the case for the modularity [53].

Inspired by these considerations, we carry out a statistical test of the detected core-periphery pairs as follows. We generate 500 randomised networks for the given network using the configuration model. Then, we detect core-periphery pairs in each randomised network. We compute the quality $\hat{q}$ and size $\hat{n}$ of each core-periphery pair detected in the randomised network. On the basis of the samples of $\hat{q}$ and $\hat{n}$ , we infer the joint probability distribution $P(\hat{q},\hat{n})$ using the Gaussian kernel density estimator [54, 55]. Finally, we regard the core-periphery pair detected in the original network with a quality value of q to be significant if q is statistically larger than that of the core-periphery pair of the same size n detected in the randomised networks, i.e., if $P(\hat{q}\geqslant q\ | n)\leqslant \alpha$ , where P is the probability and α is a significance level. (See appendix C for the computation of $P(\hat{q}\geqslant q\ | \hat{n})$ .) We refer to the nodes that do not belong to any significant core-periphery pair as residual nodes.

Because we carry out the test for each core-periphery pair in the original network, we have to correct the significance level to suppress false positives due to multiple comparisons. To this end, we adopt the Šidák correction [56], with which we test each core-periphery pair in the original network at a significance level of $\alpha =1-{(1-\alpha ^{\prime} )}^{1/C}$ , where α' is the targeted significance. We set α' = 0.05.

Empirical networks often have core-periphery pairs that are substantially larger than any of those detected in the 500 randomised networks (section 4.1). It is unlikely that one finds core-periphery pairs of the same size in randomised networks even if more samples of randomised networks are generated. The kernel density estimator enables us to infer $P(\hat{q}\geqslant q\ | n)$ for large core-periphery pairs in the original network based on the quality and size of smaller core-periphery pairs detected in randomised networks.

Quality q may be significantly large for bipartite-like pairs of blocks (figure 2(e)). Therefore, if our algorithm detects bipartite-like pairs of blocks, we manually mark them and distinguish them from the core-periphery pairs. Specifically, we regard a detected pair of blocks as bipartite-like if it has fewer intra-core edges than expected for the configuration model (i.e., if ${m}_{(c,1),(c,1)}\lt {\mathbb{E}}[{m}_{(c,1),(c,1)}]$ ). Otherwise we regard it as a core-periphery pair. Our algorithm did not find other types of block pairs (i.e., those shown in figures 2(b), (c) and (g)) for the networks examined in the following sections.

3.2. Other algorithms for comparison

We compare the present algorithm, KM–config, with three algorithms for finding a single core-periphery pair, i.e., the BE [8], MINRES [25, 33] and SBM [16] algorithms, and three algorithms for finding multiple core-periphery pairs, i.e., Xiang [32], Divisive [19] and KM–ER algorithms [19]. We ran the Tunç–Verma [24] algorithm but do not show the results because the Tunç–Verma algorithm did not find significant core-periphery pairs or did not terminate within 48 h on our computer (Intel 2.6 GHz Sandy Bridge processors and 4 GB of memory). It should be noted that none of these algorithms uses the configuration model as the null model.

The BE, Divisive and KM–ER algorithms intend to produce many core-periphery edges (i.e., edges connecting a core node and a peripheral node) within each core-periphery pair (figure 2(f)). With the MINRES, SBM and Xiang algorithms, core-periphery edges can be relatively sparse (figure 2(g)).

We set the parameters of these algorithms as follows. For the SBM algorithm, we set γ_k, p_kl (1 ≤ k, l ≤ 2) in [16] to γ₁ = γ₂ = 0.5, p₁₁ = 0.5, p₁₂ = p₂₁ = ρ² and ρ₂₂ = ρ⁴, where $\rho =2M/[N(N-1)]$ . The Xiang algorithm has a parameter, denoted by β ∈ [0, 1] in [32], to tune the number of core-periphery pairs. We set to β = 1. The Xiang algorithm uses a centrality measure to find core-periphery pairs. Therefore, we adopt the degree centrality measure. Note that the authors of [32] claim that the choice of the centrality measure does not considerably affect the results. With the Xiang algorithm, each node may belong to multiple core-periphery pairs. Therefore, if a node belongs to multiple core-periphery pairs, we assign the node to the core-periphery pair to which the extent of belonging is the largest. If a node belongs to multiple core-periphery pairs to the same extent, then we assign the node to one of the core-periphery pairs selected with equal probability. The other algorithms do not have parameters. As is the case of KM–config, the BE, SBM, Divisive and KM–ER algorithms are stochastic. Therefore, we run the BE, SBM, Divisive or KM–ER algorithm ten times and use the best core-periphery pairs in terms of the algorithm-specific quality function.

For the core-periphery pairs detected by the six previous algorithms, we carry out our previously proposed statistical test [19] that adopts the Erdős–Rényi random graph model as the null model. The statistical test runs as follows. Suppose that a network is composed of a single core-periphery pair. We generate 500 randomised networks using the Erdős–Rényi random graph with the same number of nodes and edges as the original network. Then, we detect a single core-periphery pair in each of the randomised networks using the BE algorithm and compute its quality by

$\begin{eqnarray}&&\displaystyle \frac{{\displaystyle \sum }_{i=1}^{N}{\displaystyle \sum }_{j=1}^{i-1}({A}_{{ij}}-\rho )({A}_{{ij}}^{* }-{\rho }^{* })}{\sqrt{{\displaystyle \sum }_{i=1}^{N}{\displaystyle \sum }_{j=1}^{i-1}{({A}_{{ij}}-\rho )}^{2}}\sqrt{{\displaystyle \sum }_{i=1}^{N}{\displaystyle \sum }_{j=1}^{i-1}{({A}_{{ij}}^{* }-{\rho }^{* })}^{2}}},\end{eqnarray} \tag{ 17 }$

where A^* is given by equation (10) and ${\rho }^{* }={\sum }_{i=1}^{N}{\sum }_{j=1}^{i-1}{A}_{{ij}}^{* }/[N(N-1)/2]$ . If the quality of the core-periphery pair detected in the original network is larger than a fraction 1 − α of those detected in the randomised networks, then we regard the core-periphery pair in the original network as significant. It should be noted that this test is not applicable when the null model is the configuration model. If we use the configuration model as the null model, any core-periphery pair detected in the original network will be judged to be insignificant because no network is partitioned into a single core-periphery pair whose q value is larger than that for the configuration model.

If we detect multiple core-periphery pairs in the original networks, we apply the same statistical test for each of them [19]. Specifically, for each core-periphery pair, we construct a subnetwork composed of the nodes and edges within the focal core-periphery pair. Then, we apply the statistical test to the subnetwork. We correct the significance level using the Šidák correction [56]; we test each core-periphery pair in the original network at a significance level of $\alpha =1-{(1-\alpha ^{\prime} )}^{1/C}$ , where α' = 0.05 and C is the number of core-periphery pairs detected in the original network.

3.3. Data

We analyse the 12 empirical networks listed in table 1. We discard the direction and weight of the edge.

Table 1. Empirical networks used in the present paper: the karate club network [44], dolphin social network [57], network of characters in Les Misérables [58], Enron email network [59], network of jazz musicians [60], co-authorship network in network science [46], political blog network [5], worldwide airport network [61, 62], protein–protein interaction network [63], network of chess players [64], co-authorship network in the arXiv astro-ph section [65] and the Internet at the level of AS [64]. We exclude isolated nodes and self-loops from the networks. We count the multi-edges between a pair of nodes as a single edge.

Network	N	M	Assortativity	Degree
				Average	Maximum
Karate [44]	34	78	−0.475	4.59	17
Dolphin [57]	62	159	−0.044	5.13	12
Les Misérables [58]	77	254	−0.165	6.60	36
Email [59]	151	1527	−0.059	20.23	74
Jazz [60]	198	2742	0.020	27.70	100
Network science (co-authorship) [46]	379	914	−0.082	4.82	34
Blog [5]	1222	16 714	−0.221	27.36	351
Airport [61, 62]	2939	15 677	0.051	10.67	242
Protein [63]	3023	6149	−0.126	4.07	129
Chess [64]	7115	55 779	0.371	15.68	181
Astro-ph (co-authorship) [65]	18 771	198 050	0.205	21.10	504
Internet [64]	34 761	107 720	−0.215	6.20	2760

In the karate club network, each node represents the member of a university's karate club [44]. Two members are defined to be adjacent if they frequently interact outside the club activities. The club experienced a fissure as a result of a conflict between the instructor and the president. Based on their self-reports, each node has a label indicating either the instructor's side (15 members), president's side (16 members) or neutral (3 members).

In the dolphin social network, each node represents a dolphin living near Doubtful Sound in New Zealand [57]. An edge between two dolphins indicates that they were frequently observed in the same school during 1994 and 2001. Each dolphin has a label indicating the sex, i.e., female (25 dolphins), male (33 dolphins) and unknown (4 dolphins).

In the network of novel Les Misérables, each node is a character of the book [58]. Two characters are defined to be adjacent if they appear in the same chapter. The book consists of 365 chapters, most of which are a few pages long.

In the Enron email network, each node is an email account of the staff of Enron Inc [59]. An edge indicates that an email was sent from one account to another account during the observation period.

In the jazz network, each node represents a jazz musician [60]. Two jazz musicians are defined to be adjacent if they have played in the same band.

In the co-authorship network, each node represents a researcher in network science [46]. An edge indicates that two researchers have a joint paper. The nodes and edges were retrieved from all the references cited by two influential review papers on network science. Then, the author of [46] manually added some nodes and edges and excluded those not belonging to the largest connected component.

In the blog network, each node represents a blog on the United States presidential election in 2004 [5]. Each edge indicates that one blog has a hyperlink to the other blog on its top page. The blogs and their labels were collected from several blog directories [5]. If a blog was unlabelled or had conflicting labels, the authors of [5] manually determined the label. There are 586 liberal blogs and 636 conservative blogs.

In the worldwide airport network, each node is an airport [61, 62]. An edge represents a direct commercial flight between two airports. We use the network provided in [62].

In the protein–protein interaction network, each node is a human protein [63]. An edge indicates the presence of physical interaction between two proteins.

In the network of chess players, each node represents a chess player [64]. Two players are adjacent if they have played before.

In the co-authorship network of the arXiv astro-ph section, each node is a researcher [65]. An edge indicates that two researchers have a joint paper in the arXiv's astro-ph section.

In the network of the Internet, a node is an autonomous system (AS), i.e., a set of routers (or IP routing prefixes) managed by a network operator [64]. An edge indicates a logical peering relationship between two ASes.

4. Results

4.1. Quality and size of detected core-periphery pairs

The circles in figure 5 represent the quality and size (defined as the number of nodes) of core-periphery pairs detected by KM–config in the 12 empirical networks. A larger core-periphery pair tends to have a large quality, q. This is also the case for the randomised networks (crosses in figure 5). Some core-periphery pairs detected in the empirical networks have a significantly larger q value than those of the same size detected in the randomised networks. Our statistical test suggests that these core-periphery pairs are significant (circles outside the shaded regions in figure 5). We find bipartite-like pairs in the 7 out of the 12 networks (squares in figure 5), some of which are significant in 2 out of the 7 networks (figures 5(i) and (ℓ)). In 2 out of the 12 networks, we find significant core-periphery pairs that are larger than any of those detected in the corresponding randomised networks (figures 5(g) and (ℓ)).

4.2. Core nodes are not necessarily hub nodes

With KM–config, whether the node belongs to a core or periphery is not strongly associated with the node's degree. To show this, we carry out a receiver operating characteristic (ROC) analysis (figure 6). Let us regard θ N nodes ( $\theta \in \{0,1/N,2/N,\,\ldots ,\,1\}$ ) with the largest degree as hub nodes and the remaining nodes as non-hub nodes. The ROC curves show the relationship between the fraction of hub nodes in the set of significant core nodes (i.e., true positive rate) and that in the set of significant peripheral nodes (i.e., false positive rate) when one varies the threshold θ. If all core nodes have a larger degree than all peripheral nodes, the ROC curve passes through (0, 1) of the unit square (figure 6). If the degree of core nodes and that of peripheral nodes obey similar distributions, then the ROC curve is close to the diagonal line for the entire range of θ.

The area under the curve (AUC) of each ROC curve is shown in table 2. If the two distributions are completely separated, the AUC is equal to one. If they completely overlap, the AUC is equal to 0.5. The AUC values for the BE, MINRES, SBM, Xiang and Divisive algorithms are fairly large (mostly above 0.95) for all the networks. Therefore, these algorithms have a strong tendency to classify the nodes with a large degree as core nodes and those with a small degree as peripheral nodes. The AUC values for KM–ER are also large (above 0.81) but not as large as those for the five algorithms. Finally, KM–config determines the role (i.e., core or periphery) of each node by the degree of the node to the least extent, as suggested by the smallest AUC values across different networks among all the algorithms. These results on the AUC values are consistent with visual observations one can make in figure 6.

Table 2. AUCs of the ROC curves shown in figure 6.

Network	BE	MINRES	SBM	Xiang	Divisive	KM–ER	KM–config
Karate	1.000	1.000	1.000	*	0.957	0.984	0.938
Dolphin	0.953	1.000	*	*	1.000	1.000	0.859
Les Misérables	0.982	1.000	1.000	*	0.886	0.955	0.610
Email	0.978	0.999	0.990	*	0.910	0.893	0.670
Jazz	0.989	1.000	*	*	0.953	*	0.717
Network science	0.979	1.000	0.998	0.958	0.961	0.990	0.664
Blog	0.995	1.000	0.999	*	0.981	0.932	0.718
Airport	0.996	1.000	*	0.999	0.972	0.885	0.793
Protein	1.000	1.000	0.824	*	0.936	0.810	0.717
Chess	0.997	1.000	*	*	0.928	0.860	0.737
Astro-ph	0.997	1.000	*	0.883	0.943	0.888	0.834
Internet	1.000	1.000	*	0.999	0.972	0.905	0.483

Note. The asterisk indicates that the algorithm does not detect significant core-periphery pairs.

To illuminate on the meaning of the core and peripheral nodes detected by KM–config, let us denote by d_i^core and d_i^peri the number of neighbouring core and peripheral nodes of node i, respectively, within the core-periphery pair to which node i belongs. For each node i, we plot d_i^peri against degree d_i in figure 7. We find that peripheral nodes are adjacent to a smaller number of peripheral nodes within the same core-periphery pair (i.e., a small d^peri_i) than core nodes with a similar d_i value would do. This result is consistent with the concept of core-periphery structure based on edge density. However, this property is not necessarily respected if one classifies nodes according to the degree of each node. In fact, with the other six algorithms, core nodes and peripheral nodes are less distinct from each other in terms of d_i^peri (figures D1–D6 in appendix D). As a corollary, with KM–config, the peripheral nodes tend to be more frequently connected to the core nodes within the same core-periphery pair than the core nodes do (figure 8). For some core nodes with a large degree, d_i^core is equal to zero, which happens when a core-periphery pair has only one core node and forms a star.

**Figure 7.** Relationships between d_i and d_i^peri for the empirical networks. The squares and circles indicate core nodes and peripheral nodes identified by KM–config, respectively. The insets of the panels (g), (h) and (ℓ) magnify the regions with small d_i^peri values.
Download figure:
Standard image High-resolution image

**Figure 8.** Relationships between d_i and d_i^core for the empirical networks. The squares and circles indicate core nodes and peripheral nodes identified by KM–config, respectively.
Download figure:
Standard image High-resolution image

4.3. A core-periphery pair is a community?

We compare the core-periphery pairs identified by KM–config and communities in networks. Here we determine communities by modularity maximisation using the Louvain algorithm [52]. We run it ten times and adopt the node partition that realises the largest modularity value. Table 3 reports the modularity values for the node partition identified by the Louvain algorithm and that determined by KM–config, with the insignificant core-periphery pairs being included. The modularity value for the node partitioning into core-periphery pairs is close to that obtained by the modularity maximisation for most of the empirical networks. Therefore, the detected core-periphery pairs may be similar to communities.

Table 3. Modularity for communities determined by the Louvain algorithm and that for the core-periphery pairs determined by KM–config.

Network	Louvain	KM–config
Karate	0.416	0.417
Dolphin	0.520	0.518
Les Misérables	0.535	0.542
Email	0.420	0.419
Jazz	0.445	0.445
Network science	0.815	0.741
Blog	0.426	0.426
Airport	0.642	0.615
Protein	0.626	0.483
Chess	0.505	0.508
Astro-ph	0.574	0.555
Internet	0.521	0.459

This result poses a question whether a core-periphery pair is a community in the traditional sense, and if so whether the KM–config algorithm effectively classifies the nodes in each community into a core and a periphery according to the composition of intra- and inter-community edges that each node owns. To examine this point, we analyse the role of each node using a cartographic representation of networks [66, 67]. With the cartographic representation, the role of each node i in a network is characterised by the standardised within-module degree ${z}_{i}\in [-\infty ,\infty ]$ and the participation coefficient ${p}_{i}\in [0,1]$ [66, 67]. They are defined by

$\begin{eqnarray}&&{z}_{i}\equiv \displaystyle \frac{{\tilde{d}}_{i,{c}_{i}}-\langle {\tilde{d}}_{{c}_{i}}\rangle }{{\sigma }_{{c}_{i}}},\end{eqnarray} \tag{ 18 }$

$\begin{eqnarray}&&{p}_{i}\equiv 1-\displaystyle \sum _{c=1}^{C}{\left(\displaystyle \frac{{\tilde{d}}_{i,c}}{{d}_{i}}\right)}^{2},\end{eqnarray} \tag{ 19 }$

where ${\tilde{d}}_{i,c}$ is the number of neighbours of node i in the cth core-periphery pair (1 ≤ c ≤ C), c_i is the core-periphery pair to which node i belongs, $\langle {\tilde{d}}_{{c}_{i}}\rangle$ is the average of ${\tilde{d}}_{j,{c}_{i}}$ over the nodes j in the c_ith core-periphery pair including the case j = i, and ${\sigma }_{{c}_{i}}$ is the unbiased estimation of the standard deviation of ${\tilde{d}}_{j,{c}_{i}}$ over the nodes j in the c_ith core-periphery pair. A large z_i value indicates that node i has relatively many neighbours within the same core-periphery pair. The p_i value is the smallest if node i is adjacent only to the nodes in a single core-periphery pair and largest if node i is adjacent to an equal number of nodes across all core-periphery pairs. In the cartographic representation of networks, each node i is classified according to the position (z_i, p_i) of the node in the z–p space. The nodes are categorised into seven roles [66, 67]. Here we do not use this categorisation rule but examine the distributions of the core and peripheral nodes in the z–p space.

Figure 9 shows z_i and p_i of each node for the 12 empirical networks. KM–config classifies the nodes having very large z_i as core nodes. However, for the other nodes, the values of z_i and p_i are not predictive of whether a node is in the core or periphery. Therefore, the core and periphery that we propose are distinct from the roles of nodes identified by the cartographic analysis.

We find different results for the Divisive algorithm, which first divides the network into communities and then estimates the role of each node (i.e., core or periphery). With Divisive, the core nodes have larger z_i than most of the peripheral nodes (figure D7 in appendix D), indicating that the core nodes detected by Divisive largely correspond to the hub nodes as identified by the cartographic analysis. This is because Divisive uses the BE algorithm to partition each community into a core and a periphery. As shown in figure 6, the BE algorithm classifies nodes into a core and a periphery by the degree of each node to a large extent. Therefore, Divisive regards the nodes with a large z_i as core nodes.

4.4. Case studies

In this section, we present case studies of some of the empirical networks analysed in the previous sections.

The core-periphery pairs in the karate club network are shown in figure 10. KM–config detected two significant core-periphery pairs and ten residual nodes. A majority of the members on the president side (12 members; 75%), including the president (node 34), belong to core-periphery pair 1. A majority of the members on the instructor side (11 members; 73%), including the instructor (node 1), belong to core-periphery pair 2. These results are consistent with the social conflict of the club. The residual consists of four members on the instructor side, five members on the president side and one neutral member. The significant core-periphery pairs are similar to those detected by our previous algorithm, KM–ER [19].

The core-periphery pairs in the dolphin social network are shown in figure 11. KM–config detected three significant core-periphery pairs and 14 residual nodes. Each core-periphery pair mostly consists of the dolphins of the same sex; there are two male-dominant core-periphery pairs (pairs 1 and 3) and one female-dominant core-periphery pair (pair 2). A previous study identified five communities in the dolphin network by modularity maximisation [4], three of which are similar to the present core-periphery pairs 1, 2 and 3.

For the network of Les Misérables, KM–config identified four significant core-periphery pairs and 40 residual nodes (figure 12). A majority of nodes belonging to the significant core-periphery pairs are core nodes, suggesting that each core-periphery pair resembles a community. In fact, a previous study used modularity maximisation to identify 11 communities in the same network [4], four of which are similar to our core-periphery pairs 1–4. The significant core-periphery pairs are consistent with the plot of the story; the characters in core-periphery pairs 1, 2, 3 and 4 are the members of a revolutionary student club, Thénardier family and a street gang, Fantine's relatives and her friends, and characters involved in the Champmathieu's trial, respectively. The main characters, e.g., Valjean, Javert and Cosette, are classified as residual nodes (arrows in figure 12). Although they have a large degree, they are regarded as residual nodes because they belong to insignificant core-periphery pairs.

For the co-authorship network, KM–config detected 28 significant core-periphery pairs and 133 residual nodes (figure 13(a)). Detailed structure of core-periphery pairs 1–10 is shown in figure 13(b). Five core-periphery pairs (pairs 1, 5, 8, 9 and 10) have relatively many intra-core edges, many core-periphery edges and no intra-periphery edges, indicating a strong core-periphery structure. Core-periphery pair 8 contains only one peripheral node, implying a structure close to a community. Some core researchers collaborate with most of the researchers in the same core-periphery pair, e.g., Barabási, Jeong and Oltvai in core-periphery pair 1, Vázquez and Vespignani in core-periphery pair 2 and Boccaletti in core-periphery pair 4.

For the blog network, KM–config identified two core-periphery pairs and 79 residual nodes (figure 14). A majority of the blogs leaning to the conservative and to the liberal belong to core-periphery pairs 1 and 2, respectively. These core-periphery pairs are similar to those identified by KM–ER [19].

For the airport network, KM–config identified 23 significant core-periphery pairs and 983 residual nodes (figure 15). Each core-periphery pair mainly consists of the airports in the same geographical region, which agrees with the previous results [19, 67, 68]. Our previous algorithm, KM–ER, detected ten core-periphery pairs, of which the three largest core-periphery pairs based in Europe, East Asia and the USA are similar to core-periphery pairs 1, 3 and 2 detected by KM–config, respectively [19]. Properties of core-periphery pairs 1–8 are shown in table 4. Among the representative airports (i.e., the airports having the largest degree in each core-periphery pair), some peripheral airports have a larger degree than core airports, e.g., MUC (Munich) in core-periphery pair 1, SVO (Moscow) in core-periphery pair 5 and NBO (Nairobi) in core-periphery pair 8, showing that hub nodes are not always classified as core nodes.

Table 4. Property of the eight largest significant core-periphery pairs in the airport network. The representative airports of each core-periphery pair are defined as the four core and four peripheral airports having the largest degree. The territory is defined as the country where the airport is located. If the airport is located in a sovereign state, we instead show the name of the state. IATA is a three-letter code of an airport assigned by the International Air Transport Association.

Pair	Number of airports		Representative core airport				Representative peripheral airport
	Core	Periphery	IATA	City	Territory	Degree	IATA	City	Territory	Degree
1	245	212	FRA	Frankfurt	Germany	242	MUC	Munich	Germany	149
			CDG	Paris	France	218	OSL	Oslo	Norway	91
			AMS	Amsterdam	Netherlands	211	BUD	Budapest	Hungary	77
			LGW	London	UK	172	LPL	Liverpool	UK	66


2	203	242	ATL	Atlanta	USA	168	MEX	Mexico City	Mexico	81
			JFK	New York	USA	144	LGA	New York	USA	44
			LAS	Las Vegas	USA	139	CCS	Caracas	Venezuela	41
			YYZ	Toronto	Canada	119	SXM	Philipsburg	Netherlands Antilles	35


3	118	177	PEK	Beijing	China	170	HGH	Hangzhou	China	53
			BKK	Bangkok	Thailand	136	KIX	Osaka	Japan	47
			PVG	Shanghai	China	126	NGO	Nagoya	Japan	32
			ICN	Seoul	South Korea	121	WNZ	Wenzhou	China	32


4	92	120	DXB	Dubai	UAE	166	BAH	Bahrain	Bahrain	52
			JED	Jeddah	Saudi Arabia	99	MRU	Port Louis	Mauritius	29
			DEL	Delhi	India	93	MLE	Malé	Maldives	27
			DOH	Doha	Qatar	92	KBL	Kabul	Afghanistan	23


5	73	79	DME	Moscow	Russia	159	SVO	Moscow	Russia	118
			LED	St. Petersburg	Russia	92	DYU	Dushanbe	Tajikistan	26
			KBP	Kiev	Ukraine	76	ODS	Odessa	Ukraine	15
			TLV	Tel-aviv	Israel	69	YKS	Yakutsk	Russia	14


6	33	54	SYD	Sydney	Australia	77	DRW	Darwin	Australia	20
			MEL	Melbourne	Australia	56	TSV	Townsville	Australia	11
			BNE	Brisbane	Australia	51	NOU	Nouméa	New Caledonia	10
			AKL	Auckland	New Zealand	38	WLG	Wellington	New Zealand	6


7	47	40	GRU	São Paulo	Brazil	83	BEL	Belém	Brazil	17
			GIG	Rio De Janeiro	Brazil	46	CGR	Campo Grande	Brazil	16
			BSB	Brasília	Brazil	46	MVD	Montevideo	Uruguay	13
			CNF	Belo Horizonte	Brazil	33	NAT	Natal	Brazil	12


8	37	32	JNB	Johannesburg	South Africa	70	NBO	Nairobi	Kenya	68
			ADD	Addis Ababa	Ethiopia	58	WDH	Windhoek	Namibia	9
			LAD	Luanda	Angola	30	PNR	Pointe-noire	Congo (Republic)	8
			DAR	Dar Es Salaam	Tanzania	23	NDJ	N'djamena	Chad	8

4.5. Synthetic networks

The results in the previous sections suggest that KM–config tends to detect core-periphery pairs without using the node's degree as a main criterion but produces node partitioning consistent with the concept of core-periphery structure based on edge density. To confirm this point further, in this section we test the algorithms on model networks with a planted core-periphery structure composed of two core-periphery pairs (figure 16).

**Figure 16.** Adjacency matrix of a synthetic network generated with λ = 0.1 and μ = 0.7. The dashed lines represent the boundaries between blocks. The label (c, x) of each block is shown at the top and left of the adjacency matrix.
Download figure:
Standard image High-resolution image

The discrepancy between the degree distribution of core nodes and that of peripheral nodes is controlled by a parameter $\mu \in \{0,0.05,0.1,\,\ldots ,\,0.5\}$ . The 'strength' of the core-periphery structure is controlled by a parameter $\lambda \in \{0,0.025,0.05,\,\ldots ,\,1\}$ . The model assumes four blocks. Each block consists of 200 nodes and represents a core or periphery. To generate networks, we use the degree-corrected SBM (dcSBM) [6]; it places edges such that each node i has a prescribed expected degree ${\overline{d}}_{i}$ , and each pair of blocks u and v has an expected number ${\overline{m}}_{{uv}}$ of edges. We set ${\overline{d}}_{i}$ (1 ≤ i ≤ N) as follows. For the core (i.e., blocks (1, 1) and (2, 1)), we set ${\overline{d}}_{i}=50$ for a fraction μ of nodes and ${\overline{d}}_{i}=200$ for the remaining fraction 1 − μ of nodes. For the periphery (i.e., blocks (1, 0) and (2, 0)), we set ${\overline{d}}_{i}=50$ for a fraction 1 − μ of nodes and ${\overline{d}}_{i}=200$ for the remaining fraction μ of nodes. The fraction μ tunes the amount of overlap between the degree distribution of core nodes and that of peripheral nodes. The two distributions have no overlap if μ = 0 and perfectly overlap if μ = 0.5. Then, we set ${\overline{m}}_{(c,x)(c^{\prime} ,x^{\prime} )}$ (1 ≤ c, c' ≤ 2 and 0 ≤ x, x' ≤ 1) by

$\begin{eqnarray}&&{\overline{m}}_{(c,x)(c^{\prime} ,x^{\prime} )}=\lambda {\overline{m}}_{(c,x)(c^{\prime} ,x^{\prime} )}^{{\rm{rand}}}+(1-\lambda ){\overline{m}}_{(c,x)(c^{\prime} ,x^{\prime} )}^{{\rm{plant}}},\end{eqnarray} \tag{ 20 }$

where λ is a mixing parameter, ${\overline{m}}_{(c,x)(c^{\prime} ,x^{\prime} )}^{\,\mathrm{rand}}={D}_{(c,x)}{D}_{(c^{\prime} ,x^{\prime} )}/2M$ is the expected number of edges between blocks (c, x) and (c', x') for the configuration model given ${\overline{d}}_{1},\,\ldots ,\,{\overline{d}}_{N}$ . The parameters $\{{\overline{m}}_{(c,x),(c^{\prime} ,x^{\prime} )}^{{\rm{plant}}}\}$ represent the number of intra-block and inter-block edges in an idealised core-periphery structure with nodes' degrees ${\overline{d}}_{1},\,\ldots ,\,{\overline{d}}_{N}$ . In other words, there are no edges between the different core-periphery pairs ( ${\overline{m}}_{(c,x),(c^{\prime} ,x^{\prime} )}^{{\rm{plant}}}=0$ for $c\ne c^{\prime}$ ), the peripheral nodes are adjacent only to core nodes in the same core-periphery pair (i.e., ${\overline{m}}_{(c,0),(c,1)}^{{\rm{plant}}}={D}_{(c,0)}$ and ${\overline{m}}_{(c,0),(c,0)}^{{\rm{plant}}}=0$ for c = 1, 2), and the number of edges within each core is given by ${\overline{m}}_{(c,1),(c,1)}^{{\rm{plant}}}={D}_{(c,1)}-{D}_{(c,0)}$ for c = 1, 2. Although the dcSBM with $\{{\overline{m}}_{(c,x),(c^{\prime} ,x^{\prime} )}^{{\rm{plant}}}\}$ specifies a disconnected network, the dcSBM with $\{{\overline{m}}_{(c,x),(c^{\prime} ,x^{\prime} )}\}$ given by equation (20) produces connected networks in general unless λ = 0. We note that, if μ = 0.5, the dcSBM generates bipartite networks because the number of intra-core edges is zero, i.e., ${\overline{m}}_{(c,1)(c,1)}^{{\rm{plant}}}=0$ (c = 1, 2).

We evaluate the performance of algorithms by the difference between the planted and detected core-periphery structures. To quantify the difference, we use the variation of information (VI) [69] given by

$\begin{eqnarray}&&\mathrm{VI}=-\displaystyle \sum _{(c,x)}\displaystyle \sum _{(\hat{c},\hat{x})}R(c,x;\hat{c},\hat{x})\mathrm{log}\displaystyle \frac{{[R(c,x;\hat{c},\hat{x})]}^{2}}{\left[{\displaystyle \sum }_{(\hat{c}^{\prime} ,\hat{x}^{\prime} )}R(c,x;\hat{c}^{\prime} ,\hat{x}^{\prime} )\right]\times \left[{\displaystyle \sum }_{(c^{\prime} ,x^{\prime} )}R(c^{\prime} ,x^{\prime} ;\hat{c},\hat{x})\right]},\end{eqnarray} \tag{ 21 }$

where $R(c,x;\hat{c},\hat{x})$ is the fraction of nodes having true label (c, x) and inferred label $(\hat{c},\hat{x})$ . The VI value is the smallest (i.e., zero) if and only if the partitioning of nodes by the true labels and that by the inferred labels are identical. In the computation of the VI values, we regard the set of residual nodes as a block; technically, we set $({\hat{c}}_{i},{\hat{x}}_{i})=(C+1,0)$ for the residual nodes. We generate 30 synthetic networks and average the VI values over the 30 generated networks.

The VI values for the Xiang, Divisive, KM–ER and KM–config algorithms are shown in figure 17. We do not show the results for the other three algorithms because they do not find multiple core-periphery pairs by definition. When λ is large, the VI values are large because the network is close to the configuration model and has weak core-periphery structure. The VI values for the Xiang algorithm are large in the entire λ–μ parameter space (figure 17(a)). This is because the Xiang algorithm did not find significant core-periphery pairs in all the generated networks (i.e., all nodes are residual nodes). The VI values for the Divisive and KM–ER algorithms are relatively large for most λ values if some planted core nodes are non-hub nodes, i.e., μ ≥ 0.15 (figures 17(b) and (c)). The VI values for the KM–config algorithm are the smallest in most of the λ–μ parameter space (figure 17(d)), including the case for bipartite-like structure (μ ≥ 0.4). Therefore, KM–config but not the other algorithms is capable of detecting core-periphery structure even when a substantial fraction of core nodes are non-hubs and peripheral nodes are hubs.

5. Discussion

We have studied core-periphery structure using the configuration model as the null model. We have shown that discrete versions of a single core-periphery pair determined based on edge density, which many studies assume, can never be significant relative to the configuration model. The core-periphery structure beyond what one expects for the configuration model must accompany other meso-scale network structure such as another core-periphery pair, communities and bipartite-like subnetworks coexisting in the given network. This claim is in resonance with the studies [19, 28, 30] reporting the absence of core-periphery structure when the configuration model is used as the null model. Then, we have presented a scalable algorithm to find core-periphery structure in networks and applied it to various networks.

Our argument does not apply to continuous versions of core-periphery structure [8, 10, 14, 17, 25], in which each node belongs to the core to a different extent. A possible extension of our present algorithm (i.e., KM–config) to the case of continuous core-periphery structure is to replace the idealised core-periphery structure defined in equation (10) with a continuous version of idealised core-periphery structure, such as those proposed in [8, 10]. This line of investigation may reveal relationships between continuous versions of core-periphery structure, multiple core-periphery pairs and the configuration model.

Null models for networks do not have to be limited to the Erdős–Rényi random graph or the configuration model. Other null models incorporate different properties of networks such as the weight of edges [70], the sign of edge weights [71], correlations [72], bipartiteness [73] and spacial properties [74]. It is probably possible to incorporate such null models into our algorithm by modifying the null-model term of ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ (i.e., the second term on the right-hand side of equation (13)).

Akin to the modularity, our quality function ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ allows an interpretation in terms of random walks on networks. With a core-periphery structure, random walkers are likely to move from any node to a core node within the same core-periphery pair in a discrete time step. In a core-periphery structure on a small scale, the random walkers would reach the core in a small number of steps. In contrast, they would need a large number of steps to reach the core on a large scale. By regarding the number of steps as a resolution parameter, we may be able to identify core-periphery structure across different scales, as in the case of the Markov stability, where modularity maximisation with different values of the time resolution parameter provides information about hierarchical organisation of communities in networks [47–50].

Our quality function ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ shares shortcomings with the modularity, such as the inability of finding small communities [75] and of distinguishing random from non-random structure [76]. Remedies for these problems include multi-resolution approaches [77] and statistical tests [78, 79]. Another approach is the statistical inference based on SBMs, which has been used for finding communities [6, 16, 30, 41] and core-periphery structure [16, 24, 30]. Investigation of core-periphery structure with SBMs may be a topic for future study.

Finally, we have restricted ourselves to undirected and unweighted networks. It is straightforward to incorporate the weight of edges by replacing A_ij on the right-hand side of equation (13) by the weight of the edge between nodes i and j. In contrast, it is non-trivial to incorporate the direction of edges. It seems that the direction of edges can be incorporated into ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ by allowing an adjacency matrix to be asymmetric, as in the case of modularity [80, 81]. However, for modularity, this extension elicits a problem [82], which may also hold true for ${Q}_{\mathrm{config}}^{\mathrm{cp}}$ .

Appendix A.: Finding compatible block structures

We represent a block structure composed of B blocks by a symmetric B × B matrix ${\boldsymbol{Y}}=({Y}_{{uv}})$ , where

$\begin{eqnarray}{Y}_{{uv}}=\left\{\begin{array}{cc}1 & ({m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]),\\ -1 & ({m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}]).\end{array}\right.\end{eqnarray} \tag{ A1 }$

Variable Y_uv = 1 or Y_uv = −1 indicates that blocks u and v are either densely (i.e., ${m}_{{uv}}\gt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}];$ filled blocks in figure 2) or sparsely (i.e., ${m}_{{uv}}\lt {\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}];$ opened blocks in figure 2) interconnected. Recall that we do not consider the case ${m}_{{uv}}={m}_{{uv}}^{\mathrm{conf}}$ because it is unlikely in general. To find compatible block structures, we generate all 2^B(B+1)/2 symmetric binary matrices. Then, for each binary matrix ${\boldsymbol{Y}}$ , we inspect the compatibility of the block structure as follows. Equation (9) can be rewritten as

$\begin{eqnarray}&&\displaystyle \sum _{v=1}^{B}{{\rm{\Delta }}}_{{uv}}=0,\quad 1\leqslant u\leqslant B,\end{eqnarray} \tag{ A2 }$

where

$\begin{eqnarray}&&{{\rm{\Delta }}}_{{uv}}={m}_{{uv}}-{\mathbb{E}}[{m}_{{uv}}^{\mathrm{conf}}].\end{eqnarray} \tag{ A3 }$

Equations (A1) and (A3) imply

$\begin{eqnarray}&&{Y}_{{uv}}{{\rm{\Delta }}}_{{uv}}\gt 0,\quad 1\leqslant u,v\leqslant B.\end{eqnarray} \tag{ A4 }$

Because ${Y}_{{uv}}\ne 0$ , we rewrite equation (A4) as

$\begin{eqnarray}&&{Y}_{{uv}}{{\rm{\Delta }}}_{{uv}}\geqslant 0,\quad 1\leqslant u,v\leqslant B,\end{eqnarray} \tag{ A5 }$

$\begin{eqnarray}&&{{\rm{\Delta }}}_{{uv}}^{2}\geqslant \eta \gt 0,\quad 1\leqslant u,v\leqslant B.\end{eqnarray} \tag{ A6 }$

We set η = 1 without loss of generality. In fact, if $\eta \ne 1$ , consider a rescaled variable ${\tilde{{\rm{\Delta }}}}_{{uv}}={{\rm{\Delta }}}_{{uv}}/\sqrt{\eta }$ . Dividing both sides of equations (A2) and (A5) by $\sqrt{\eta }$ and of equation (A6) by η yields

$\begin{eqnarray}&&\displaystyle \sum _{v=1}^{B}{\tilde{{\rm{\Delta }}}}_{{uv}}=0,\quad 1\leqslant u\leqslant B,\end{eqnarray} \tag{ A7 }$

$\begin{eqnarray}&&{Y}_{{uv}}{\tilde{{\rm{\Delta }}}}_{{uv}}\geqslant 0,\quad 1\leqslant u,v\leqslant B,\end{eqnarray} \tag{ A8 }$

$\begin{eqnarray}&&{\tilde{{\rm{\Delta }}}}_{{uv}}^{2}\geqslant 1,\quad 1\leqslant u,v\leqslant B,\end{eqnarray} \tag{ A9 }$

which are equivalent to equations (A2), (A5) and (A6) with η = 1, respectively.

We seek Δ_uv values (1 ≤ u, v ≤ B) that simultaneously satisfy equations (A2), (A5) and (A6). To this end, we solve the following quadratic programming (QP) problem:

$\begin{eqnarray}&&\mathop{\min }\limits_{\displaystyle \genfrac{}{}{0em}{}{{{\rm{\Delta }}}_{{uv}};}{1\leqslant u,v\leqslant B}}\displaystyle \sum _{u=1}\displaystyle \sum _{v=1}{{\rm{\Delta }}}_{{uv}}^{2},\end{eqnarray} \tag{ A10 }$

subject to equations (A2), (A5) and (A6). The constraints for the QP problem are equivalent to the conditions for compatible block structure. Therefore, the block structure is compatible if and only if the QP problem is feasible. Note that we are not interested in the objective function's value, i.e., equation (A10). To solve the QP problem and hence to check whether it has a feasible solution, we use a numerical solver [83]. If we find Δ_uv values satisfying all the constraints by solving the QP problem, then the block structure represented by ${\boldsymbol{Y}}=({Y}_{{uv}})$ is compatible. Otherwise, the block structure is incompatible.

Appendix B.: Network structure with four blocks

All possible network structures with four blocks that are compatible with equation (9) are shown in figure B1.

Appendix C.: Estimating statistical significance of core-periphery structure

Let S be the sum of the number of core-periphery pairs detected in the 500 randomised networks. Let ${\hat{q}}^{(s)}$ and ${\hat{n}}^{(s)}$ (1 ≤ s ≤ S) be the quality and size of the sth core-periphery pair in the randomised networks, respectively. We use the Gaussian kernel density estimator [54, 55] to infer the joint probability distribution $P(\hat{q},\hat{n})$ , which gives

$\begin{eqnarray}&&P(\hat{q},\hat{n})=\displaystyle \frac{1}{S}\displaystyle \sum _{s=1}^{S}f\left(\displaystyle \frac{\hat{q}-{\hat{q}}^{(s)}}{{\sigma }_{\hat{q}}h},\displaystyle \frac{\hat{n}-{\hat{n}}^{(s)}}{{\sigma }_{\hat{n}}h}\right),\end{eqnarray} \tag{ C1 }$

where ${\sigma }_{\hat{q}}$ and ${\sigma }_{\hat{n}}$ are the standard deviations of $\{{\hat{q}}^{(s)}\}$ and $\{{\hat{n}}^{(s)}\}$ (1 ≤ s ≤ S), respectively. Standard deviations ${\sigma }_{\hat{q}}$ and ${\sigma }_{\hat{n}}$ are defined as

$\begin{eqnarray}&&{\sigma }_{\hat{q}}\equiv \sqrt{\displaystyle \frac{1}{S-1}\displaystyle \sum _{s=1}^{S}{({\hat{q}}^{(s)}-\langle \hat{q}\rangle )}^{2}},\end{eqnarray} \tag{ C2 }$

$\begin{eqnarray}&&{\sigma }_{\hat{n}}\equiv \sqrt{\displaystyle \frac{1}{S-1}\displaystyle \sum _{s=1}^{S}{({\hat{n}}^{(s)}-\langle \hat{n}\rangle )}^{2}},\end{eqnarray} \tag{ C3 }$

where $\langle \hat{q}\rangle$ and $\langle \hat{n}\rangle$ are the average values of $\{{\hat{q}}^{(s)}\}$ and $\{{\hat{n}}^{(s)}\}$ (1 ≤ s ≤ S), respectively. In equation (C1), f is the probability density function of the standard bivariate normal distribution given by

$\begin{eqnarray}&&f({y}_{1},{y}_{2})\equiv \displaystyle \frac{1}{2\pi \sqrt{1-{\gamma }^{2}}}\exp \left(-\displaystyle \frac{{y}_{1}^{2}-2\gamma {y}_{1}{y}_{2}+{y}_{2}^{2}}{2(1-{\gamma }^{2})}\right),\end{eqnarray} \tag{ C4 }$

where γ is the Pearson correlation coefficient between $\{{\hat{q}}^{(s)}\}$ and $\{{\hat{n}}^{(s)}\}$ (1 ≤ s ≤ S), i.e.,

$\begin{eqnarray}&&\gamma \equiv \displaystyle \frac{\tfrac{1}{S}{\sum }_{s=1}^{S}({\hat{q}}^{(s)}-\langle \hat{q}\rangle )({\hat{n}}^{(s)}-\langle \hat{n}\rangle )}{{\sigma }_{\hat{q}}{\sigma }_{\hat{n}}}.\end{eqnarray} \tag{ C5 }$

In equation (C1), h is a parameter specifying the width of the Gaussian kernel density estimator. We use the Scott's rule of thumb [55], which gives h = S^−1/6.

The probability that the core-periphery pair of size n has a quality value greater than or equal to q in randomised networks is computed as

$\begin{eqnarray}&&P(\hat{q}\geqslant q\ | n)=\displaystyle \frac{{\int }_{q}^{\infty }P(z,n){\rm{d}}z}{{\int }_{-\infty }^{\infty }P(z,n){\rm{d}}z}=\displaystyle \frac{{\sum }_{s=1}^{S}{\int }_{q}^{\infty }f\left(\tfrac{z-{\hat{q}}^{(s)}}{{\sigma }_{\hat{q}}h},\tfrac{n-{\hat{n}}^{(s)}}{{\sigma }_{\hat{n}}h}\right){\rm{d}}z}{{\sum }_{s=1}^{S}{\int }_{-\infty }^{\infty }f\left(\tfrac{z-{\hat{q}}^{(s)}}{{\sigma }_{\hat{q}}h},\tfrac{n-{\hat{n}}^{(s)}}{{\sigma }_{\hat{n}}h}\right){\rm{d}}z}.\end{eqnarray} \tag{ C6 }$

Equation (C4) leads to

$\begin{eqnarray}&&{\int }_{-\infty }^{{y}_{1}}f(z,{y}_{2}){\rm{d}}z=\displaystyle \frac{1}{\sqrt{2\pi }}\exp \left(-\displaystyle \frac{{y}_{2}^{2}}{2}\right){\rm{\Phi }}\left(\displaystyle \frac{{y}_{1}-\gamma {y}_{2}}{\sqrt{1-{\gamma }^{2}}}\right),\end{eqnarray} \tag{ C7 }$

where ${\rm{\Phi }}(y)={(2\pi )}^{-1/2}{\int }_{-\infty }^{y}\exp (-{z}^{2}/2){\rm{d}}z$ is the cumulative distribution function of the standard normal distribution. Substitution of equation (C7) into equation (C6) yields

$\begin{eqnarray}&&P(\hat{q}\geqslant q\ | n)=1-\displaystyle \frac{{\displaystyle \sum }_{s=1}^{S}\exp \left(-\tfrac{{(n-{\hat{n}}^{(s)})}^{2}}{2{\sigma }_{\hat{n}}^{2}{h}^{2}}\right){\rm{\Phi }}\left(\tfrac{{\sigma }_{\hat{n}}(q-{\hat{q}}^{(s)})-\gamma {\sigma }_{\hat{q}}(n-{\hat{n}}^{(s)})}{{\sigma }_{\hat{n}}{\sigma }_{\hat{q}}h\sqrt{1-{\gamma }^{2}}}\right)}{{\displaystyle \sum }_{s=1}^{S}\exp \left(-\tfrac{{(n-{\hat{n}}^{(s)})}^{2}}{2{\sigma }_{\hat{n}}^{2}{h}^{2}}\right)}.\end{eqnarray} \tag{ C8 }$

Appendix D.: Results of other algorithms

Figures D1–D6 plot ${d}_{i}^{\mathrm{peri}}$ against degree d_i when the core-periphery structure is determined by the BE, MINRES, SBM, Xiang, Divisive and KM–ER algorithms. Figure D7 shows the cartographic representation of the 12 empirical networks when the core-periphery structure is determined by the Divisive algorithm.

**Figure D1.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by the BE algorithm. The squares and circles indicate core nodes and peripheral nodes, respectively.
Download figure:
Standard image High-resolution image

**Figure D2.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by the MINRES algorithm.
Download figure:
Standard image High-resolution image

**Figure D3.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by the SBM algorithm. We do not show the results for the other empirical networks, for which the SBM algorithm does not find significant core-periphery pairs.
Download figure:
Standard image High-resolution image

**Figure D4.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by the Xiang algorithm. We do not show the results for the other empirical networks, for which the Xiang algorithm does not find significant core-periphery pairs.
Download figure:
Standard image High-resolution image

**Figure D5.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by the Divisive algorithm.
Download figure:
Standard image High-resolution image

**Figure D6.** Relationships between d_i and d_i^peri for the empirical networks when the core-periphery structure is determined by KM–ER. We do not show the result for the jazz network, for which KM–ER does not find significant core-periphery pairs.
Download figure:
Standard image High-resolution image

**Figure D7.** Cartographic analysis of the empirical networks. The core and peripheral nodes are detected by the Divisive algorithm.
Download figure:
Standard image High-resolution image

Core-periphery structure requires something else in the network

Article metrics

Author e-mails

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction

2. Core-periphery structure needs at least three blocks

3. Methods

3.1. Our algorithm

3.1.1. Objective function

3.1.2. Relationship to Markov stability

3.1.3. Maximisation of the objective function

3.1.4. Statistical test

3.2. Other algorithms for comparison

3.3. Data

4. Results

4.1. Quality and size of detected core-periphery pairs

4.2. Core nodes are not necessarily hub nodes

4.3. A core-periphery pair is a community?

4.4. Case studies

4.5. Synthetic networks

5. Discussion

Appendix A.: Finding compatible block structures

Appendix B.: Network structure with four blocks

Appendix C.: Estimating statistical significance of core-periphery structure

Appendix D.: Results of other algorithms

Core-periphery structure requires something else in the network

Article metrics

Share this article

Author e-mails

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction

2. Core-periphery structure needs at least three blocks

3. Methods

3.1. Our algorithm

3.1.1. Objective function

3.1.2. Relationship to Markov stability

3.1.3. Maximisation of the objective function

3.1.4. Statistical test

3.2. Other algorithms for comparison

3.3. Data

4. Results

4.1. Quality and size of detected core-periphery pairs

4.2. Core nodes are not necessarily hub nodes

4.3. A core-periphery pair is a community?

4.4. Case studies

4.5. Synthetic networks

5. Discussion

Appendix A.: Finding compatible block structures

Appendix B.: Network structure with four blocks

Appendix C.: Estimating statistical significance of core-periphery structure

Appendix D.: Results of other algorithms