Homogeneous complex networks

https://doi.org/10.1016/j.physa.2005.10.024Get rights and content

Abstract

We discuss various ensembles of homogeneous complex networks and a Monte-Carlo method of generating graphs from these ensembles. The method is quite general and can be applied to simulate micro-canonical, canonical or grand-canonical ensembles for systems with various statistical weights. It can be used to construct homogeneous networks with desired properties, or to construct a non-trivial scoring function for problems of advanced motif searching.

Introduction

Complex networks is a new emerging branch of random graph theory. For a long time random graphs have been mainly studied by pure mathematics but recently due to the availability of empirical data on real-world networks they have attracted the attention of physics and natural sciences (see for review Refs. [1], [2], [3]). Methods of statistical physics, both empirical and theoretical, have thus begun to play an important role in this research area.

The empirical observations of real-networks has had a feedback on theoretical development which is now concentrated on the understanding of the observed features. For example fat tails in node degree distribution, small world effect, degree–degree correlations, or high clustering. Two complementary approaches have been developed: diachronic, known as growing networks [1], [2], [3], and synchronic being a sort of statistical mechanics of networks [4], [5], [6], [7], [8], [9], [10], [11].

We will discuss here the latter. This approach is a natural extension of Erdös and Rényi ideas [12], [13]. It is well suited both for growing (causal) networks for which nodes’ labels reflect the causal order of nodes’ attachment to the network [14], [15] and for homogeneous networks for which nodes’ labels can be permuted freely in an arbitrary way. Here, we shall discuss mainly homogeneous networks. We shall shortly comment on causal networks towards the end of the paper.

The main aim of the paper is to present a consistent picture of statistical mechanics of networks. Some ideas have already been introduced earlier. They are scattered in many papers and discussed in many different contexts. We put them together, add some new material and introduce a guideline to obtain a self-contained introduction to statistical mechanics of complex networks.

The basic concept in the statistical formulation is statistical ensemble. Statistical ensemble of networks is defined by ascribing a statistical weight to every graph in the given set [4], [7]. Physical quantities are measured as weighted averages over all graphs in the ensemble. The probability of the occurrence of a graph in random sampling is proportional to its statistical weight. If the statistical weight changes then also the probability of occurrence of randomly sampled graphs will change and in effect different random graphs will be observed. The concept of statistical weight is crucial, since it defines randomness in the system. Statistical weight is built out of two ingredients: configuration space weight and functional weight. The configuration space weight is proportional to the uniform probability measure on the configuration space which tells us how to uniformly choose graphs in the configuration space. To illustrate the meaning of the uniform measure consider an ensemble of Erdös–Rényi graphs with N nodes and L links [12], [13]. The configuration space consists of ((N2)L) graphs with labeled nodes. All those graphs are equiprobable, and therefore the configuration space weight is the same for each graph. It is convenient to choose this weight to be 1/N! since then it can be interpreted as a factor which takes care of N! possible permutations of nodes’ labels. This factor has the same origin as the corresponding factor in quantum mechanics for indistinguishable particles and it is constant for all graphs in a finite N-ensemble.

We can calculate the entropy of random graphs asS=ln1N!N2L.In the limit of large sparse graphs: N and 2L/N=α=const>2, the entropy is an over-extensive function of the system size:S=α-22NlnN+,unlike in standard thermodynamics.

Let us move to weighted graphs. The idea is to modify the Erdös–Rényi ensemble by introducing a functional weight which explicitly depends on graph's topology. For example, if we choose the functional weight to be a function of the number of loops on the graph, we can suppress favor loops of typical graphs in the ensemble. In a similar way we can choose statistical weights to control the node degree distribution to produce homogeneous scale-free graphs [16] or to introduce correlations between degrees of neighboring nodes [17], [18], [19], [20].

Classical thermodynamics describes systems in equilibrium for which the functional weight is given by the Gibbs measure: exp(-βE), where E is the energy of the system. When discussing complex networks it is convenient to abandon the concept of energy and Gibbs measure and consider a more general form of statistical weights because many networks are not in equilibrium. Indeed, many networks emerging as a result of a dynamical process like growth are far from equilibrium [1], [2], [3]. It does not mean though that one cannot introduce a statistical ensemble of growing networks. On the contrary, one can for example consider an ensemble of networks which result of many independent repetitions of the growth process terminated when the network reaches a certain size. Such a collection of networks does not describe a thermodynamic equilibrium. The functional weight can be deduced from the parameters of the growth process but of course it has nothing to do with the Gibbs measure.

In fact, many real-world networks result from a combination of a growth process and some thermalization processes. For example, the Internet grows but at the same time it continuously rearranges. The latter process introduces a sort of thermalization. Today the growth has probably still larger influence on the topology of the underlying network but in the future the growth may slow down due to saturation and then equilibration processes resulting from continuous rewirings will take over. Similarly all evolutionary networks emerge from a growth mixed with a sort of thermalization related to the continuous network rearrangement. Therefore, it is convenient to have a formalism which can extrapolate between the two regimes in a flexible way. The approach which we propose here is capable of modeling functional properties of networks by choosing an appropriate functional weight.

Let us return to the configuration space weight. As we mentioned this weight is equivalent to the uniform probability measure on the configuration space for which all graphs are equiprobable. It is a very crucial part of the construction of the ensemble to carefully specify what one means by equiprobable graphs. Consider first graphs with N nodes. There are at least two natural candidates for the uniform measure in such a set of graphs. Since one is interested in shape (topology) of graphs one can define all shapes to be equiprobable. Alternatively one can introduce labels for nodes of each graph to obtain a set of labeled graphs and then one can define all labeled graphs to be equiprobable. The two definitions give two different probability measures since the number of ways in which one can label graph nodes depends on graph's topology and thus the probability of occurrence of a given graph will depend on its topology too. It turns out that the latter definition is more natural. As we have seen above this definition leads to Erdös–Rényi graphs. So we stick to this definition and from here on we shall ascribe to each labeled graph the configuration space weight 1/N! which is constant in the set of graphs of size N.

The situation is more complex if one considers pseudographs that is graphs which have multiple connections (more than one link between two nodes) or self-connections (a link having the same node at its endpoints). In this case one can also label links and ascribe the same statistical weight to each fully labeled graph. For this choice the statistical weight of each graph is equal to the symmetry factor of Feynman diagrams generated in the Gaussian perturbation field theory [4].

The paper is organized as follows. In the next section we will recall some basic definitions. Then we will discuss Erdös–Rényi graphs in the context of constructing statistical ensemble and later we will generalize the construction to weighted homogeneous graphs. After this we will describe Monte-Carlo algorithms to generate graphs for canonical, grand-canonical and micro-canonical ensembles and discuss their representation in terms of adjacency matrices. Next we will discuss pseudographs. In the last section we will briefly summarize the paper.

Section snippets

Definitions

Let us first introduce some terminology. Graph is a set of N nodes (vertices) connected by L edges (links). A graph need not be connected. It may have many disconnected components including empty nodes (without any link). If a graph has no multiple or self-connected links we shall call it simple graph or graph. An example is illustrated in Fig. 1. Later we shall also discuss graphs with multiple- and self-connections. To distinguish them from simple graphs we shall call them degenerate graphs

Statistical ensemble for Erdös–Rényi random graphs

For simplicity, we start from a well-known model of Erdös–Rényi's graphs [12], [13]. In this classical model one considers simple graphs with N labeled nodes and L links1 chosen at random from all (N2) possibilities. All possibilities are equiprobable and so are the corresponding graphs—understood as graphs whose vertices are labeled. Usually one is interested in unlabeled graphs that is

Weighted homogeneous graphs

In the previous section we described ensembles for which all labeled graphs had the same statistical weight. Random graphs in such ensembles have well known properties. It turns out, however, that most of these properties do not correspond to those observed for real-world networks. One needs a more general setup to define an ensemble of complex random networks. Such a set-up can be introduced as follows. One considers the same set of graphs as in Erdös–Rényi model but one ascribes to each graph

Monte-Carlo generator of homogeneous networks

Erdös–Rényi graphs are exceptional in the sense that one can calculate for them almost all quantities of interest analytically. This is not the case for weighted networks. Various methods have been proposed for generating random graphs [28]. In this section we will describe a Monte-Carlo method which allows one to study a wide class of random weighted graphs experimentally by a sort of numerical experiments. The basic idea behind this type of experiments is to sample the configuration space of

Monte-Carlo generator of canonical ensemble

Now, we want to apply this method to generate Erdös–Rényi graphs. Let us begin with the canonical ensemble with N,L fixed. A good candidate for elementary transformation of graph is rewiring of a link as shown in Fig. 4, because it does not change N and L. As mentioned before it is convenient to introduce a representation in which each undirected link is represented by two directed links. The rewiring is done in two steps [4]. First, we choose a directed link ij and a vertex k at random. Then

Monte-Carlo generator of grand-canonical ensemble

The rewiring procedure described in the previous section does not change N and L. If we want to simulate graphs from a grand-canonical ensemble for which L is variable, we have to supplement the set of elementary transformations in the algorithm by transformations which change the number of links. We can introduce two mutually reciprocal transformations: adding and deleting a link. They both preserve the number of nodes N but change the number of links: LL±1. The two transformations must be

Monte-Carlo generator of micro-canonical ensemble

Another frequently encountered ensemble is an ensemble of graphs which have a given node degree sequence {q1,q2,,qN}. The partition function Z has the form:Z(N,{qi})=αlg(N,L)i=1Nδ[qi(α)-qi]1/N!W(α),where the product of delta functions allows one to include only those graphs which have a prescribed degree distribution qi. As before the factor 1/N! is fixed in this ensemble and could in principle be skipped. The canonical partition function Z(N,L) is related to the micro-canonical ones:Z(N,

Graph generator and adjacency matrices

All the elementary transformations: rewiring, adding or removing a link, and the X-move have a simple representation in terms of adjacency matrices. Rewiring relies on picking up at random an element Aij=1 of the adjacency matrix and flipping it with an element Aik=0 so that after the move Aij=0 and Aik=1. For undirected links, adjacency matrices are symmetric and therefore at the same time one has to flip Aji=1 and Aki=0. To add a link one chooses at random Aij and if Aij=0 and ij, one

Degenerated graphs (pseudographs)

In previous sections we described ensembles of simple graphs. Let us now discuss pseudographs that is graphs which may have multiple- and self-connections.

A degenerate undirected pseudograph can be represented by a symmetric adjacency matrix A whose off diagonal entries Aij count the number of links between vertices i and j, and the diagonal ones Aii count twice the number of self-connecting links attached to vertex i. For example, the graph depicted in Fig. 7 has the following adjacency matrix:

Summary

We have discussed a statistical approach to homogeneous random graphs. This framework is a natural extension of the Erdös–Rényi theory to the case of weighted graphs: one considers the same set of graphs but with modified statistical weights. The statistical weights of homogeneous graphs depend only on graphs’ topology. In other words, if one assigns some labels to its nodes, they will have no physical meaning similarly as the numbers of indistinguishable particles in quantum mechanics. One can

Acknowledgements

We thank Piotr Bialas, Jerzy Jurkiewicz and Andrzej Krzywicki for stimulating discussions. This work was partially supported by the Polish State Committee for Scientific Research (KBN) Grant 2P03B-08225 (2003–2006) and Marie Curie Host Fellowship HPMD-CT-2001-00108 and by EU IST Center of Excellence “COPIRA”.

References (33)

  • S.N. Dorogovtsev et al.

    Nucl. Phys. B

    (2003)
  • Z. Burda et al.

    Physica A

    (2004)
  • D.-S. Lee et al.

    Nucl. Phys. B

    (2004)
  • M.E.J. Newman

    Phys. Rev. Lett.

    (2002)
  • R. Albert et al.

    Rev. Mod. Phys.

    (2002)
  • S.N. Dorogovtsev et al.

    Adv. Phys.

    (2002)
  • M.E.J. Newman

    SIAM Rev.

    (2003)
  • Z. Burda et al.

    Phys. Rev. E

    (2001)
  • J. Berg et al.

    Phys. Rev. Lett.

    (2002)
  • I. Farkas et al.

    Springer Lecture Notes Phys.

    (2004)
  • S.N. Dorogovtsev, J.F.F. Mendes, A.M. Povolotsky, A.N. Samukhin,...
  • J. Park et al.

    Phys. Rev. E

    (2004)
  • P. Erdös et al.

    Publ. Math. Debrecen

    (1959)
    P. Erdös et al.

    Publ. Math. Inst. Hung. Acad. Sci.

    (1960)
  • B. Bollobás

    Random Graphs

    (1985)
  • P.L. Krapivsky et al.

    Phys. Rev. E

    (2001)
  • P. Bialas et al.

    Phys. Rev. E

    (2003)
  • Cited by (38)

    • Catching homologies by geometric entropy

      2018, Physica A: Statistical Mechanics and its Applications
      Citation Excerpt :

      Such an approach is a natural extension of Erdös–Rényi ideas [4]. It has been performed through two basic ideas: the configuration space weight and the functional weight [5]. The first one is proportional to the uniform probability measure on the configuration space which accounts for the way to uniformly chose graphs in the configuration space.

    • EEG analysis of seizure patterns using visibility graphs for detection of generalized seizures

      2017, Journal of Neuroscience Methods
      Citation Excerpt :

      Note that the sum of degree δ on all nodes is two times of the total number of edges E in a network (i.e., MD ∝ E). The value of MD thus also determines the entropy of network ensemble (Bogacz et al., 2006). The degree distributions often have a power-law tail in a scale-free network (Albert and Barab́asi, 2002), or at least over some significant range of degree (Clauset et al., 2004), such that P(δ) ∼ δ−λ, δ1 ≤ δ ≤ δ2, where δ1 and δ2 denote a range of degree, λ is a constant, named degree power.

    • Using mapping entropy to identify node centrality in complex networks

      2016, Physica A: Statistical Mechanics and its Applications
      Citation Excerpt :

      It is well known that Shannon entropy and Von Neumann entropy are related to the information present in classical and quantum systems. In complex network research, a number of different entropy measures have been introduced [19–24]. Traditionally, entropy is used to analyze the statistical behavior or the structural features of a given real network.

    • Ranking nodes according to their path-complexity

      2015, Chaos, Solitons and Fractals
      Citation Excerpt :

      One can in fact write the partition function of a network ensemble subject to a micro-canonical constraint (the energy) and then, given the probability of certain microcanonical ensemble, calculate its entropy, similarly to what proposed in [9] for random graphs. In general, an entropy of a complex network can be associated from a test particle performing a diffusion process on the network, as in [10]; for scale free networks, it is found that the entropy production rate depends on the tail of the distribution of nodes, and thus on the exponent of the tail. Along these lines, in [11] a von Neumann entropy based on the graph Laplacian has been introduced, merging results inspired from pure states in Quantum Mechanics, and networks, and finding that the von Neumann entropy is related to the spectrum of the Laplacian [8].

    View all citing articles on Scopus
    View full text