
A critical exponent for shortest-path scaling in continuum percolation

Tim Brereton, Christian Hirsch, Volker Schmidt and Dirk Kroese

Published 26 November 2014 © 2014 IOP Publishing Ltd
Citation: Tim Brereton et al 2014 J. Phys. A: Math. Theor. 47 505003. DOI: 10.1088/1751-8113/47/50/505003


Abstract

We carry out Monte Carlo experiments to study the scaling behavior of shortest path lengths in continuum percolation. These studies suggest that the critical exponent governing this scaling is the same for both continuum and lattice percolation. We use splitting, a technique that has not yet been fully exploited in the physics literature, to increase the speed of our simulations. This technique can also be applied to other models where clusters are grown sequentially.


1. Introduction

A natural measure of distance on percolation clusters is chemical distance, i.e., the number of edges in the shortest path between two points. This contrasts with Euclidean distance, which measures geometric separation rather than path length. The relationship between chemical distance and Euclidean distance plays an important role in describing percolation clusters; see, e.g., [1–3]. One object of interest is the scaling behavior of the chemical distance between sites that are close to one another in the sense of Euclidean distance; see, for example, [4–7]. It has been shown that, for geometrically close points, the tail of the chemical distance distribution exhibits power law behavior in the vicinity of the percolation threshold; see [8]. Estimates of the critical exponent are given in [5, 6].

In this paper, we investigate the scaling behavior of chemical distance in a continuum percolation setting. More precisely, we carry out Monte Carlo simulations in two dimensions that show that the exponent in the prototypical 'overlapping spheres' continuum setting appears to be the same as the exponent in the lattice setting, in agreement with the general belief that lattice and continuum percolation are in the same universality class; see, e.g., [9–11]. We introduce a technique from the rare-event simulation literature, multilevel splitting (or simply 'splitting'), to increase the speed and accuracy of our estimation procedure. This technique can be applied, more generally, to improve the efficiency of simulations in both continuum and lattice settings that are carried out in a sequential fashion. Examples of such simulation techniques include the popular Leath growth model (introduced in [12]) and the Alexandrowicz method (introduced in [13]). Splitting has recently attracted interest in the physics literature as a technique for rare-event estimation (see [14, 15]). To the best of our knowledge, it has not yet been applied as a sampling technique. However, similar sequential techniques have been used; see, for example, [16, 17].

Our definition of chemical distance in a continuum percolation setting is a natural extension of chemical distance as defined in lattice percolation settings. It also plays a role in the study of minimal spanning forests, which are related to the study of spin-glass models; see [18, 19]. Note that the asymptotic behavior of the chemical distance distribution is well understood for supercritical continuum percolation; see [20]. For a survey of further recent developments in continuum percolation, see [21].

2. Setting

The 'overlapping spheres' model consists of spatially uncorrelated points, specifically the points of a homogeneous Poisson point process in ${{\mathbb{R}}^{d}}$ with fixed intensity (we set the intensity to 1 to simplify calculations). A sphere of radius $\gamma /2$ is drawn around each point. Edges are placed between points if their spheres overlap; that is, if the Euclidean distance between them is less than $\gamma $. This model has been extensively studied; see, for example, [22–25]. Connecting points in this manner results in a random graph. If the intensity is held constant and $\gamma $ is allowed to increase, percolation occurs above the critical threshold, ${{\gamma }_{{\rm c}}}$, which depends on the dimension d of the model.
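
To make the connection rule concrete, the following minimal Java sketch samples a unit-intensity Poisson process on a square and places the edges of the resulting random graph. It is purely illustrative: the box side and the value of γ are arbitrary choices, and the quadratic edge scan is the naive approach.

Listing 1 (Sampling the 'overlapping spheres' graph on a box, d = 2).

import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class GilbertGraph {
    static final Random RNG = new Random();

    // Poisson sampling by Knuth's product method, chunked so that
    // Math.exp(-mean) never underflows for large means.
    static int poisson(double mean) {
        int k = 0;
        while (mean > 30.0) { k += poissonSmall(30.0); mean -= 30.0; }
        return k + poissonSmall(mean);
    }

    static int poissonSmall(double mean) {
        double l = Math.exp(-mean), p = 1.0;
        int k = 0;
        do { k++; p *= RNG.nextDouble(); } while (p > l);
        return k - 1;
    }

    public static void main(String[] args) {
        double boxSide = 20.0, gamma = 1.198468;   // gamma near the 2D threshold
        int n = poisson(boxSide * boxSide);        // unit intensity: mean = area
        double[][] pts = new double[n][];
        for (int i = 0; i < n; i++)
            pts[i] = new double[]{boxSide * RNG.nextDouble(), boxSide * RNG.nextDouble()};
        // Spheres of radius gamma/2 overlap iff the centers are closer than gamma.
        List<int[]> edges = new ArrayList<>();
        for (int i = 0; i < n; i++)
            for (int j = i + 1; j < n; j++) {
                double dx = pts[i][0] - pts[j][0], dy = pts[i][1] - pts[j][1];
                if (dx * dx + dy * dy < gamma * gamma) edges.add(new int[]{i, j});
            }
        System.out.println(n + " points, " + edges.size() + " edges");
    }
}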

We are interested in the scaling behavior of chemical distance for points that are close to one another. Therefore, we consider a modified version of the 'overlapping spheres' model, where we add two points to the Poisson process and consider the random graph, ${{G}^{\gamma }}$, defined using the connection rule given above. We fix one of these two additional points, o, as the origin and place the other point, ${{v}_{\gamma }}$, so that it lies a distance γ from o. Specifically, we set ${{v}_{\gamma }}=(\gamma ,0,\ldots ,0)$.

We consider the distribution of the chemical distance, ${{X}^{\gamma }}$, between o and ${{v}_{\gamma }}$. A similar quantity for lattice percolation is considered in [5], where it is argued that this quantity follows a power law with exponent ${{\lambda }_{{\sf lat} }}$ and, in two dimensions, an estimate is given of ${{\lambda }_{{\sf lat} }}=2.1055\pm 0.0010$. The exponent ${{\lambda }_{{\sf lat} }}$ is closely connected to the shortest path fractal dimension, ${{d}_{{\sf min} }},$ which describes the rate at which the average chemical distance between two points grows as a function of their Euclidean distance. Indeed, in [7], it was shown that ${{\lambda }_{{\sf lat} }}=1+5/(4{{d}_{{\sf min} }})$. Thus, determining ${{\lambda }_{{\sf lat} }}$ is equivalent to determining ${{d}_{{\sf min} }}$ and vice versa. Using this result, [7] provided a refined estimate, based on the exact determination of a related exponent, of ${{\lambda }_{{\sf lat} }}=2.1056\pm 0.0003$. More recent estimates of ${{d}_{{\sf min} }},$ obtained in [26], result in an estimate of ${{\lambda }_{{\sf lat} }}=2.10544\pm 0.00002$. In the two-dimensional continuum setting, we provide strong numerical evidence that, for $\gamma ={{\gamma }_{{\rm c}}}$, we have $\mathbb{P}({{X}^{\gamma }}=x)\sim {{x}^{-\lambda }}$ as $x\to \infty $. We then estimate the exponent, λ, and show that it appears likely to have the same value as the exponent estimated in [5, 7].
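
For orientation, inverting the relation of [7] at the estimate ${{\lambda }_{{\sf lat} }}\approx 2.1056$ recovers the corresponding shortest path fractal dimension (our arithmetic):

${{d}_{{\sf min} }}=\frac{5}{4({{\lambda }_{{\sf lat} }}-1)}\approx \frac{5}{4\times 1.1056}\approx 1.1306.$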

3. Simulating ${{X}^{\gamma }}$ and estimating λ

Our estimation procedure involves generating independent and identically distributed copies of ${{X}^{\gamma }}$, the chemical distance defined above, then using maximum likelihood to estimate λ.

3.1. Simulating ${{X}^{\gamma }}$

Generating realizations of ${{X}^{\gamma }}$ amounts to finding the shortest path (if it exists) between o and ${{v}_{\gamma }}$ in ${{G}^{\gamma }}$. Our approach to simulating ${{X}^{\gamma }}$ is based on the method introduced in [5]. This is an extension of the classical Leath model, introduced in [12], where clusters are grown in shells. A similar approach to growing continuum percolation clusters was used in a three-dimensional setting in [27].

Because we are interested in the asymptotic behavior of ${{X}^{\gamma }}$, we increase the speed of our algorithm by only considering values of ${{X}^{\gamma }}$ greater than 2. That is, we assume there are no points in ${{B}_{\gamma }}(o)\cap {{B}_{\gamma }}({{v}_{\gamma }})$, where ${{B}_{\gamma }}(v)=\{x\in {{\mathbb{R}}^{d}}:\left\|x-v\right\|\lt \gamma \}$.
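
For d = 2 this conditioning is explicit: two disks of radius γ whose centers are a distance γ apart intersect in a lens of area $(2\pi /3-\sqrt{3}/2){{\gamma }^{2}}$ (a standard circle-intersection calculation), so under a unit-intensity Poisson process the conditioning event has probability

${\rm exp} \left( -\left( \frac{2\pi }{3}-\frac{\sqrt{3}}{2} \right){{\gamma }^{2}} \right).$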

We 'grow' clusters from the 'seed' points o and ${{v}_{\gamma }}$ in a sequential fashion by generating Poisson processes in unexplored regions within distance γ of existing points. We are able to proceed in this sequential manner due to the independence property of the Poisson process. We begin by generating points in ${{B}_{\gamma }}(o)\backslash {{B}_{\gamma }}({{v}_{\gamma }})$, which form the initial points of the first cluster, and points in ${{B}_{\gamma }}({{v}_{\gamma }})\backslash {{B}_{\gamma }}(o)$, which form the initial points of the second cluster. We mark the points o and ${{v}_{\gamma }}$ as explored and mark the newly generated points as unexplored, as they have not yet been explicitly considered by our algorithm. We then proceed to sequentially examine the unexplored points, which are ordered by their chemical distance from the seed point of their cluster. If two points have the same chemical distance, we choose one at random. We examine an unexplored point, v, by generating a Poisson process in ${{B}_{\gamma }}(v)$, the open ball centered at v with radius γ. We delete all new points in this ball whose Euclidean distance to an explored point is less than γ. We add the surviving new points to the appropriate clusters. If the two clusters connect, we stop the algorithm and return ${{X}^{\gamma }}$. If it is no longer possible to add a point (i.e., we have explored all regions within distance γ of an existing point and no unexplored points remain), we stop the algorithm and return the empty set, ∅. By choosing the new points to explore based on their chemical distance to the seed of their cluster, we are essentially using a variation of Dijkstra's algorithm, which reduces to breadth-first search here because all edges have unit chemical length; see [28]. This means that, if the two clusters connect, we can immediately determine the shortest path between o and ${{v}_{\gamma }}$.

We use two sequences of sets, ${{({{U}_{m}})}_{m\geqslant 0}}$ and ${{({{E}_{m}})}_{m\geqslant 0}}$, to describe the unexplored and explored points generated by our algorithm. The set Um describes the unexplored points remaining at the mth step and the set Em describes the points that have already been explored at the mth step. These sets consist of triples of the form $(v,c,t)\in {{\mathbb{R}}^{d}}\times \{1,2\}\times \mathbb{N}$. The Euclidean coordinates of a point are given by v. If a point is in the cluster containing the origin then c = 1 and if a point is in the cluster containing ${{v}_{\gamma }}$ then c = 2. The chemical distance from a point to the originating point of its cluster is given by t.

Because the running time of the algorithm is potentially unbounded, we introduce a stopping condition as in [5]. If the sizes of the clusters grow beyond a certain threshold without meeting, we terminate the algorithm and return ∅. We measure the sizes of the clusters using the sequence ${{({{L}_{m}})}_{m\geqslant 0}}$, where Lm is the minimal chemical distance from a point in Um to the originating point of the cluster containing it. We denote the threshold by ${{L}_{{\sf max} }}.$ Note that this stopping condition means we are simulating ${{X}^{\gamma }}$ conditional on ${{X}^{\gamma }}\leqslant 2{{L}_{{\sf max} }}+1.$

The algorithm is described precisely in the following.

Algorithm 1 (Simulating ${{X}^{\gamma }}$).
1. Set ${{U}_{0}}=\varnothing $ and ${{E}_{0}}=\{(o,1,0),({{v}_{\gamma }},2,0)\}$. Set m = 0.
2. Generate $Y\sim {\sf Pois} ({{\kappa }_{d}}{{\gamma }^{d}})$ points, where ${{\kappa }_{d}}$ denotes the volume of ${{B}_{1}}(o)$ in ${{\mathbb{R}}^{d}}$, distributed uniformly in ${{B}_{\gamma }}(o)$, and remove all points that also lie in ${{B}_{\gamma }}({{v}_{\gamma }})$. Add the remaining points to U0 with c = 1 and t = 1.
3. Generate $Y\sim {\sf Pois} ({{\kappa }_{d}}{{\gamma }^{d}})$ points distributed uniformly in ${{B}_{\gamma }}({{v}_{\gamma }})$ and remove all points that also lie in ${{B}_{\gamma }}(o)$. Add the remaining points to U0 with c = 2 and t = 1.
4. Choose a point $p=(v,c,t)$ from the current set of unexplored points, Um, with minimum chemical distance t. Set ${{U}_{m+1}}={{U}_{m}}\backslash \{p\}$, ${{E}_{m+1}}={{E}_{m}}\cup \{p\}$, and ${{L}_{m}}=t$.
5. Let $p_{1}^{o}=(v_{1}^{o},c_{1}^{o},t_{1}^{o}),\ldots ,p_{J}^{o}=(v_{J}^{o},c_{J}^{o},t_{J}^{o})$ be the points in Em that are not in the same cluster as p. Let ${{\mathcal{J}}_{S}}=\{j:v\in {{B}_{\gamma }}(v_{j}^{o})\}$ be the set of indices associated with those points from $\{v_{1}^{o},\ldots ,v_{J}^{o}\}$ that can be connected to v by an edge with length less than γ. If ${{\mathcal{J}}_{S}}\ne \varnothing $ terminate the algorithm and return ${{X}^{\gamma }}=t+{{t}_{{\sf min} }}+1$, where ${{t}_{{\sf min} }}={\sf min} \{t_{j}^{o}:j\in {{\mathcal{J}}_{S}}\}$.
6. Generate $Y\sim {\sf Pois} ({{\kappa }_{d}}{{\gamma }^{d}})$ points, ${{\tilde{v}}_{1}},\ldots ,{{\tilde{v}}_{Y}}$, distributed uniformly in ${{B}_{\gamma }}(v)$. Set y = 1.
7. Let $v_{1}^{e},\ldots ,v_{K}^{e}$ be the first components of the points in Em. If ${{\tilde{v}}_{y}}\notin \bigcup _{k=1}^{K}{{B}_{\gamma }}(v_{k}^{e})$, set ${{U}_{m+1}}={{U}_{m+1}}\cup \{({{\tilde{v}}_{y}},c,t+1)\}$.
8. If $y\lt Y$, set $y=y+1$ and repeat from step 7.
9. If ${{L}_{m}}\gt {{L}_{{\sf max} }},$ terminate the algorithm and return ∅.
10. If ${{U}_{m+1}}\ne \varnothing $, set $m=m+1$ and repeat from step 4. Otherwise, return ∅.
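
To make the steps concrete, the following is a minimal, unoptimized Java sketch of algorithm 1 for d = 2. It is an illustrative reconstruction, not the implementation used for the results in section 5, and it uses naive linear scans over the explored set, which the data structures discussed below are designed to replace.

Listing 2 (Java sketch of algorithm 1, d = 2).

import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;
import java.util.Random;

public class ChemicalDistance {
    record Pt(double x, double y, int cluster, int t) {}

    static final Random RNG = new Random();

    static int poisson(double mean) {  // Knuth's method; mean = pi*gamma^2 is small here
        double l = Math.exp(-mean), p = 1.0;
        int k = 0;
        do { k++; p *= RNG.nextDouble(); } while (p > l);
        return k - 1;
    }

    static double[] uniformInBall(double cx, double cy, double r) {
        double rad = r * Math.sqrt(RNG.nextDouble());
        double ang = 2.0 * Math.PI * RNG.nextDouble();
        return new double[]{cx + rad * Math.cos(ang), cy + rad * Math.sin(ang)};
    }

    static boolean near(double x, double y, Pt q, double gamma) {
        double dx = x - q.x(), dy = y - q.y();
        return dx * dx + dy * dy < gamma * gamma;
    }

    /** Returns X^gamma, or -1 for the outcome ∅; ties in t are broken arbitrarily. */
    static int simulate(double gamma, int lMax) {
        Pt o = new Pt(0.0, 0.0, 1, 0), vg = new Pt(gamma, 0.0, 2, 0);
        List<Pt> explored = new ArrayList<>(List.of(o, vg));      // step 1
        PriorityQueue<Pt> unexplored = new PriorityQueue<>(Comparator.comparingInt(Pt::t));
        double mean = Math.PI * gamma * gamma;                    // kappa_2 * gamma^2
        for (Pt seed : new Pt[]{o, vg}) {                         // steps 2-3
            Pt other = (seed == o) ? vg : o;
            int y = poisson(mean);
            for (int i = 0; i < y; i++) {
                double[] c = uniformInBall(seed.x(), seed.y(), gamma);
                if (!near(c[0], c[1], other, gamma))              // condition X^gamma > 2
                    unexplored.add(new Pt(c[0], c[1], seed.cluster(), 1));
            }
        }
        while (!unexplored.isEmpty()) {
            Pt p = unexplored.poll();                             // step 4
            int tMin = Integer.MAX_VALUE;                         // step 5
            for (Pt q : explored)
                if (q.cluster() != p.cluster() && near(p.x(), p.y(), q, gamma))
                    tMin = Math.min(tMin, q.t());
            if (tMin < Integer.MAX_VALUE) return p.t() + tMin + 1;
            if (p.t() > lMax) return -1;                          // step 9
            int y = poisson(mean);                                // steps 6-8
            for (int i = 0; i < y; i++) {
                double[] c = uniformInBall(p.x(), p.y(), gamma);
                boolean fresh = true;
                for (Pt q : explored)  // points explored before p; their balls are known
                    if (near(c[0], c[1], q, gamma)) { fresh = false; break; }
                if (fresh) unexplored.add(new Pt(c[0], c[1], p.cluster(), p.t() + 1));
            }
            explored.add(p);  // add p only after the freshness test (step 7 uses E_m)
        }
        return -1;                                                // step 10
    }

    public static void main(String[] args) {
        System.out.println(simulate(1.198468, 400));  // one draw, gamma near gamma_c
    }
}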

In practice, the data structures used are chosen carefully in order to improve the performance of the above algorithm. In particular, the performance of steps 5 and 7 is improved considerably by using a data structure for the family of sets ${{({{E}_{m}})}_{m\geqslant 0}}$ that orders the points according to their Euclidean coordinates. For instance, one can discretize the Euclidean space into boxes and use an array structure to access all points in a given box; see, for example, the approaches taken in [9, 10, 27, 29].
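
One possible realization of such a structure is sketched below: the plane is discretized into boxes of side γ, so that the freshness test in step 7 (and the connection test in step 5, if cluster labels are stored alongside the coordinates) only needs to inspect the 3 × 3 block of boxes around a candidate point. Details such as the packing of the two cell coordinates into a single map key are our own illustrative choices.

Listing 3 (Cell-list index over explored points, d = 2).

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class CellIndex {
    private final double gamma;
    private final Map<Long, List<double[]>> cells = new HashMap<>();

    CellIndex(double gamma) { this.gamma = gamma; }

    // Packs the two 32-bit cell coordinates into one long key (disjoint bit ranges).
    private long key(long i, long j) {
        return (i << 32) ^ (j & 0xffffffffL);
    }

    void add(double[] p) {
        long i = (long) Math.floor(p[0] / gamma), j = (long) Math.floor(p[1] / gamma);
        cells.computeIfAbsent(key(i, j), k -> new ArrayList<>()).add(p);
    }

    /** Is some stored point within distance gamma of (x, y)? Only 9 cells are scanned. */
    boolean nearAny(double x, double y) {
        long i0 = (long) Math.floor(x / gamma), j0 = (long) Math.floor(y / gamma);
        for (long i = i0 - 1; i <= i0 + 1; i++)
            for (long j = j0 - 1; j <= j0 + 1; j++)
                for (double[] p : cells.getOrDefault(key(i, j), List.of())) {
                    double dx = x - p[0], dy = y - p[1];
                    if (dx * dx + dy * dy < gamma * gamma) return true;
                }
        return false;
    }
}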

3.2. Illustration of the simulation algorithm

For the planar case, d = 2, the basic idea of the algorithm is illustrated in the following. We begin with the two original vertices (which have different symbols in order to distinguish the two clusters); see figure 1, left. The area around them is still unexplored. We first examine the left point, o. The pattern marks the area where we know there are no points. We generate a homogeneous Poisson process in the region of radius γ surrounding o. In this case, one point is generated; see figure 1, center. We mark the new points as belonging to cluster 1 and add the explored point to the set of explored points (marked in black). The same is done for the other original vertex, ${{v}_{\gamma }}$. Again, only one point is generated; see figure 1, right. We next generate points in the region surrounding the point in the first cluster of chemical distance 1 to the originating point. The region that has already been explored (i.e., in which the Poisson process has already been generated) is marked in gray; see figure 2, left. We mark this point as explored and add the new points to the list of points to be explored. We do the same for the point with chemical distance 1 to the originating point of the second cluster; see figure 2, center. Next, we consider one of the points in the first cluster that is of chemical distance 2 to its originating point. This point is within distance γ of an explored point in the second cluster, forming a path between o and ${{v}_{\gamma }}$ of chemical length 4; see figure 2, right. The algorithm terminates.


Figure 1. Illustration of the algorithm (steps 1–3).


Figure 2. Illustration of the algorithm (steps 4–10).


3.3. Estimating λ

As stated above, for $\gamma ={{\gamma }_{{\rm c}}}$, we expect that the chemical distance displays power law behavior. For d = 2, we provide numerical evidence of this in section 5. Thus, estimating the critical exponent λ amounts to estimating the parameters of a (right-truncated) power law distribution. There are two commonly used techniques: linear regression and maximum likelihood. It has been demonstrated in a number of studies that maximum likelihood is the more robust approach (see [30, 31]): least squares estimation violates almost all of the standard assumptions about statistical error, resulting in inaccurate estimators of λ and incorrect estimators of standard errors. We therefore follow the approach described in [30, 32]. That is, we assume that there exists an ${{x}_{{\sf min} }}$ such that, for all $x\gt {{x}_{{\sf min} }}$,

$\mathbb{P}({{X}^{\gamma }}=x)=c\,{{x}^{-\lambda }}\qquad \qquad (1)$

for some $c\gt 0$. Given a sample $X_{1}^{\gamma },\ldots ,X_{N}^{\gamma }$, generated according to the algorithm described in section 3.1, we determine λ and ${{x}_{{\sf min} }}$ as follows.

The random variable ${{X}^{\gamma }}$ conditioned on being less than or equal to ${{x}_{{\sf max} }}=2{{L}_{{\sf max} }}+1$ has probability mass function $\ell :\{{{x}_{{\sf min} }},\ldots ,{{x}_{{\sf max} }}\}\to [0,1]$ with

$\ell (x)=\frac{{{x}^{-\lambda }}}{\sum\nolimits_{j={{x}_{{\sf min} }}}^{{{x}_{{\sf max} }}}{{j}^{-\lambda }}},\qquad x\in \{{{x}_{{\sf min} }},\ldots ,{{x}_{{\sf max} }}\}.\qquad \qquad (2)$

This yields the maximum likelihood estimator $\hat{\lambda }({{x}_{{\sf min} }})$, which is the solution of

$\frac{\sum\nolimits_{j={{x}_{{\sf min} }}}^{{{x}_{{\sf max} }}}{{j}^{-\hat{\lambda }}}\,{\rm ln} \,j}{\sum\nolimits_{j={{x}_{{\sf min} }}}^{{{x}_{{\sf max} }}}{{j}^{-\hat{\lambda }}}}=\frac{1}{n}\sum\limits_{i=1}^{n}{\rm ln} \,X_{i}^{\gamma },$

where the sample mean on the right-hand side is taken over the n sample values lying in $\{{{x}_{{\sf min} }},\ldots ,{{x}_{{\sf max} }}\}$.

Thus, $\hat{\lambda }$ can be easily calculated via standard numerical procedures.
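
A minimal Java sketch of this calculation, assuming the sample is held in an int array; the bracket [1, 20] for the bisection is an arbitrary choice that comfortably contains the exponents of interest here.

Listing 4 (Maximum likelihood estimation of λ by bisection).

final class PowerLawFit {
    // Solves the first-order condition above for lambda-hat. The left-hand side
    // is a weighted mean of ln j that decreases in lambda, so bisection applies.
    static double mleLambda(int[] sample, int xMin, int xMax) {
        double sum = 0.0;
        int n = 0;
        for (int x : sample)
            if (x >= xMin && x <= xMax) { sum += Math.log(x); n++; }
        double target = sum / n;   // mean of ln X over {xMin, ..., xMax}
        double lo = 1.0, hi = 20.0;
        for (int it = 0; it < 200; it++) {
            double mid = 0.5 * (lo + hi), num = 0.0, den = 0.0;
            for (int j = xMin; j <= xMax; j++) {
                double w = Math.pow(j, -mid);
                num += w * Math.log(j);
                den += w;
            }
            if (num / den > target) lo = mid; else hi = mid;  // monotone bisection
        }
        return 0.5 * (lo + hi);
    }
}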

The choice of ${{x}_{{\sf min} }}$ involves a trade-off. If too small a value of ${{x}_{{\sf min} }}$ is chosen, equation (1) may fail to hold, even approximately. However, if ${{x}_{{\sf min} }}$ is too large, only a small fraction of the total sample will be larger than ${{x}_{{\sf min} }}$ and the estimator of λ will be inaccurate. In practice, we estimate $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}={{x}_{0}},{{x}_{0}}+1,{{x}_{0}}+2,\ldots ,{{x}_{1}}$, where $0\leqslant {{x}_{0}}\lt {{x}_{1}}\leqslant {{x}_{{\sf max} }}-1$. We then evaluate the goodness-of-fit of each $\hat{\lambda }({{x}_{{\sf min} }})$ by computing the Kolmogorov–Smirnov statistic

$D=\underset{{{x}_{{\sf min} }}\leqslant x\leqslant {{x}_{{\sf max} }}}{{\rm max} }\,\left| \hat{F}(x)-F(x) \right|,\qquad \qquad (3)$

where, given the estimator $\hat{\lambda }({{x}_{{\sf min} }})$, F(x) is the theoretical distribution function based on equation (2) and $\hat{F}(x)$ is the empirical distribution function. We choose the values of ${{x}_{{\sf min} }}$ and $\hat{\lambda }({{x}_{{\sf min} }})$ corresponding to the smallest value of D.
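
The scan over candidate values of ${{x}_{{\sf min} }}$ can then be sketched as follows (again illustrative; mleLambda is taken from listing 4).

Listing 5 (Choosing ${{x}_{{\sf min} }}$ by the Kolmogorov–Smirnov statistic).

final class XminSelection {
    // Kolmogorov-Smirnov distance between the truncated power law (2) with the
    // given lambda and the empirical distribution of the restricted sample.
    static double ksStatistic(int[] sample, int xMin, int xMax, double lambda) {
        double z = 0.0;                       // normalizing constant of (2)
        for (int j = xMin; j <= xMax; j++) z += Math.pow(j, -lambda);
        long n = 0;
        long[] counts = new long[xMax - xMin + 1];
        for (int x : sample)
            if (x >= xMin && x <= xMax) { counts[x - xMin]++; n++; }
        double d = 0.0, f = 0.0, fHat = 0.0;  // theoretical and empirical CDFs
        for (int x = xMin; x <= xMax; x++) {
            f += Math.pow(x, -lambda) / z;
            fHat += counts[x - xMin] / (double) n;
            d = Math.max(d, Math.abs(fHat - f));
        }
        return d;
    }

    // Returns {best xMin, lambda-hat at best xMin, best D} over xMin in [x0, x1].
    static double[] selectXmin(int[] sample, int x0, int x1, int xMax) {
        double bestD = Double.POSITIVE_INFINITY, bestLambda = Double.NaN;
        int bestXmin = x0;
        for (int xMin = x0; xMin <= x1; xMin++) {
            double lambda = PowerLawFit.mleLambda(sample, xMin, xMax);
            double d = ksStatistic(sample, xMin, xMax, lambda);
            if (d < bestD) { bestD = d; bestLambda = lambda; bestXmin = xMin; }
        }
        return new double[]{bestXmin, bestLambda, bestD};
    }
}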

4. Splitting

Accurate estimation of the critical exponent λ requires a sufficiently large sample of values in the right tail of ${{X}^{\gamma }}$. We use an approach from rare-event simulation, multilevel splitting, to significantly reduce the simulation effort required to generate values of ${{X}^{\gamma }}$ larger than a prespecified threshold x. This approach, originally introduced in [33], has primarily been used as a tool for estimating small probabilities (see, for example, [14, 34, 35]); here, we use it instead as a tool for sampling from the extremes of a distribution. Although we apply this idea in the specific context of estimating the chemical distance exponent, λ, described above, it can be applied much more generally to simulation models using the Leath method [12], the Alexandrowicz method [13], or any other approach where the simulation is carried out in a sequential fashion.

The basic idea of splitting is to identify paths of a stochastic process that seem likely to generate an event of interest, here $\{x\lt {{X}^{\gamma }}\lt \infty \}$, and 'split' these paths to increase the number of paths resulting in this event. When a percolation cluster is generated sequentially, the generations of the cluster can be thought of as the time steps of a stochastic process. When this stochastic process satisfies a certain criterion, we 'split' the process by creating a number of replicates. If the splitting criterion is chosen correctly, these replicates have a high chance of reaching the rare event set.

In our case, the stochastic process that generates ${{X}^{\gamma }}$ is implicit in the algorithm described in section 3.1. For every $m\geqslant 0$ denote by ${{Y}_{m}}=({{E}_{m}},{{U}_{m}},{{L}_{m}})$ the mth time step of the process, consisting of the set of explored vertices, Em, the set of unexplored vertices, Um, and Lm, the minimal chemical distance from an unexplored point to the seed of its cluster. See section 3.1 for the precise definitions of these objects. The process ${{({{Y}_{m}})}_{m\geqslant 0}}$ is clearly a time-homogeneous Markov chain. Define

${{\tau }_{0}}={\rm inf} \{m\geqslant 0:\;{\rm the}\;{\rm algorithm}\;{\rm terminates}\;{\rm at}\;{\rm step}\;m\},$

that is, the first m at which the two clusters connect, ${{U}_{m+1}}=\varnothing $, or ${{L}_{m}}\gt {{L}_{{\sf max} }}$. This is the time at which the algorithm stops running and either returns a chemical distance or ∅. That is, ${{X}^{\gamma }}=f({{Y}_{{{\tau }_{0}}}})$, where f returns the chemical distance $t+{{t}_{{\sf min} }}+1$ computed in step 5 of algorithm 1 if the two clusters have connected, and ∅ otherwise.
Given that ${{X}^{\gamma }}\gt x$ only if ${{{\sf max} }_{m\geqslant 0}}\{{{L}_{m}}\}\geqslant (x-1)/2$, we are able to identify paths of ${{({{Y}_{m}})}_{m\geqslant 0}}$ that are likely to produce values of ${{X}^{\gamma }}$ larger than x. More precisely, if we define ${{\tau }_{1}}={\rm inf} \{m\geqslant 0:{{L}_{m}}\gt {{l}_{1}}\}$, where ${{l}_{1}}\leqslant (x-1)/2$, then it is clear that $\{x\lt {{X}^{\gamma }}\lt \infty \}$ occurs only if ${{\tau }_{1}}\leqslant {{\tau }_{0}}$ (i.e., Lm becomes larger than l1 before the algorithm terminates). This means that, in order to generate values of ${{X}^{\gamma }}$ larger than x, the only important paths are those where ${{({{Y}_{m}})}_{m\geqslant 0}}$ enters the set $\{Y=(E,U,L):L\geqslant {{l}_{1}}\}$. We increase the proportion of the simulation effort spent on paths resulting in the event $R=\{x\lt {{X}^{\gamma }}\lt \infty \}$ by 'splitting' these paths. That is, every time that a process ${{(Y_{m}^{i})}_{m\geqslant 0}}$ hits the level l1, we stop simulating this process and instead start simulating s new processes, ${{(Y_{m}^{i,1})}_{m\geqslant 0}},\ldots ,{{(Y_{m}^{i,s})}_{m\geqslant 0}}$, each beginning at $Y_{{{\tau }_{1}}}^{i}$ (i.e., $Y_{0}^{i,1}=\cdots =Y_{0}^{i,s}=Y_{{{\tau }_{1}}}^{i}$).
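
These mechanics can be summarized in a short, generic Java skeleton. The GrowthState interface below is hypothetical; any sequentially grown cluster model that exposes a copyable state fits the same pattern.

Listing 6 (Single-level splitting skeleton).

import java.util.ArrayList;
import java.util.List;
import java.util.function.Supplier;

interface GrowthState {
    GrowthState copy();   // deep copy of the configuration (E_m, U_m, L_m)
    boolean step();       // one exploration step; false once the chain has stopped
    int level();          // current value of L_m
    int result();         // X^gamma if the clusters connected, otherwise -1 (for ∅)
}

final class Splitting {
    // Advances y until it stops or until L_m > l1; true iff the level is reached.
    static boolean runToLevel(GrowthState y, int l1) {
        while (y.level() <= l1)
            if (!y.step()) return false;
        return true;
    }

    static int runToEnd(GrowthState y) {
        while (y.step()) { /* keep growing until termination */ }
        return y.result();
    }

    // Collects values of X^gamma larger than x from n initial paths, replacing
    // each path that first hits the level l1 <= (x - 1)/2 by s replicates.
    static List<Integer> tailSample(Supplier<GrowthState> fresh, int n, int s, int l1, int x) {
        List<Integer> tail = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            GrowthState y = fresh.get();
            if (!runToLevel(y, l1)) continue;  // stopped below l1: cannot exceed x
            for (int k = 0; k < s; k++) {
                int v = runToEnd(y.copy());
                if (v > x) tail.add(v);
            }
        }
        return tail;
    }
}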

The splitting approach is illustrated in figure 3. Originally, three paths are generated. These paths continue, with non-decreasing L values, until either one of the terminating events defining ${{\tau }_{0}}$ occurs or the level ${{l}_{1}}$ is reached. One path returns a chemical distance. Another path fails to return a chemical distance (i.e., U becomes empty before a path between o and ${{v}_{\gamma }}$ can be found). The third successfully reaches the set $\{Y=(E,U,L):L\geqslant {{l}_{1}}\}$. This path (drawn thicker than the others) splits into four paths. One successfully reaches the target set, R, and the other three terminate prematurely.


Figure 3. A schematic illustration of the splitting approach. One path of ${{({{Y}_{m}})}_{m\geqslant 0}}$ successfully hits l1. This path is thicker than the others. It is then split into four separate paths, one of which successfully reaches the rare event set. Again, this path is thicker than the others.


If we only take as our sample the resulting values of ${{X}^{\gamma }}$ that are bigger than x, it is easy to see that these are distributed according to the conditional law

$\mathbb{P}({{X}^{\gamma }}\in \cdot \;|\;x\lt {{X}^{\gamma }}\lt \infty ).$
Of course, the use of splitting means that the samples are no longer independent. However, if the splitting factor, s, is not too large relative to the number of paths that hit l1, this is not an issue in practice.

When we apply splitting in this manner, the expected number of points larger than x is s times larger than it would be using standard Monte Carlo. In other words, we only need to begin with $N/s$ processes in order to get a sample size in the right tail that would require simulating N processes using standard Monte Carlo. We can repeat this process a number of times by using a sequence of levels ${{l}_{1}}\lt {{l}_{2}}\lt \cdots \lt {{l}_{K}}=(x-1)/2$ and splitting a process s times when it crosses a level. Using this approach, we generate roughly ${{s}^{K}}$ times more samples of ${{X}^{\gamma }}$ conditioned on $\{x\lt {{X}^{\gamma }}\lt \infty \}$ than we would have using conventional Monte Carlo. In our setting, the optimal ${{x}_{{\sf min} }}$ is small enough that this is not necessary, but in many other settings multiple levels are required.

The appropriate way to measure the effectiveness of the splitting method is to compare the work required to generate a sample from $\{{{X}^{\gamma }}:{{X}^{\gamma }}\gt x\}$ using splitting with the work required using standard Monte Carlo. In the continuum setting, the cost of the splitting itself, i.e. copying the data structure s times, is considerable. However, for sufficiently large x, far fewer paths must be started in order to obtain s samples of ${{X}^{\gamma }}$ conditioned on being greater than x. This saving outweighs the cost of copying the data structures and results in a significant speed increase and the ability to generate samples from much further in the tail of ${{X}^{\gamma }}$. In the numerical results given below, it takes approximately 30% more time to generate a sample using conventional Monte Carlo than it does using the splitting approach. Note that we are not in the classical setting of splitting, where the cost of replicating the relevant stochastic process is negligible; the performance increases we obtain are therefore not comparable to those obtained in that setting. When we performed simulations in the lattice setting, where the data structures are simpler and smaller than in the continuum setting, we achieved much more substantial reductions in computation time. We are confident that research into new and efficient data structures for investigating continuum percolation should lead to significant improvements in the performance of the splitting approach.

5. Numerical results

The numerical results in this section were obtained using programs implemented in Java, running in the background on a single PC with 32 GB RAM and an Intel i7-4770K CPU.

5.1. Exponential versus polynomial decay

We performed our simulation not only for $\gamma ={{\gamma }_{{\rm c}}}$, but also for other choices of γ. The resulting simulations provide strong evidence that polynomial decay of the conditional probability mass function of ${{X}^{\gamma }}$, given in (2), occurs only for $\gamma ={{\gamma }_{{\rm c}}}$. The results are shown in figure 4.


Figure 4. Estimates of $\ell (x)=\mathbb{P}({{X}^{\gamma }}=x\;|\;{{X}^{\gamma }}\lt {{x}_{{\sf max} }})$ for a variety of different edge lengths.


5.2. Estimates using the standard Monte Carlo sample

We carried out a numerical investigation of the critical exponent for d = 2. The critical percolation threshold for this model, ${{\gamma }_{{\rm c}}}$, is estimated in [10] to be ${{\gamma }_{{\rm c}}}\approx 1.198\;468$; see [36] and [37] for two additional papers where this threshold is estimated.

Using the standard Monte Carlo procedure, we generated a sample of size $N=1.5\times {{10}^{9}}$ with ${{L}_{{\sf max} }}=4000$. Generating this sample took approximately 26.7 days of computer time. We estimated $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=3,\ldots ,7990$. According to the Kolmogorov–Smirnov statistic, given in (3), the optimal value of ${{x}_{{\sf min} }}$ is ${{x}_{{\sf min} }}=458$. This gives an estimate of $\hat{\lambda }(458)=2.1025$.

The estimates of $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=380,\ldots ,6000$ are shown on the left of figure 5, and the estimates for ${{x}_{{\sf min} }}=380,\ldots ,2000$ are shown on the right. The values of $\hat{\lambda }({{x}_{{\sf min} }})$ appear fairly stable for at least the first 2000 values of ${{x}_{{\sf min} }},$ with the subsequent loss of stability due to the smaller sample sizes available for large values of ${{x}_{{\sf min} }}.$ Closer investigation of values in this region shows that $\hat{\lambda }({{x}_{{\sf min} }})$ fluctuates between 2.102 and 2.106, with the estimate obtained using the Kolmogorov–Smirnov statistic at the lower end of the range. Note that this degree of fluctuation shows the sensitivity of the estimator to the choice of ${{x}_{{\sf min} }}.$


Figure 5. Left: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the standard Monte Carlo sample for ${{x}_{{\sf min} }}=380,\ldots ,6000$. Right: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the standard Monte Carlo sample for ${{x}_{{\sf min} }}=380,\ldots ,2000$.


Note that, prior to the simulations with ${{L}_{{\sf max} }}=4000$, we carried out simulations using ${{L}_{{\sf max} }}=3000$ with a sample size of $N={{10}^{9}}$. These simulations gave an optimal value of ${{x}_{{\sf min} }}=854$ with $\hat{\lambda }(854)=2.1064$. This suggests that there is considerable variability in the estimator and also sensitivity to the cut-off point ${{L}_{{\sf max} }}.$

5.3. Estimates using the splitting sample

We also used splitting to generate samples of ${{X}^{{{\gamma }_{{\rm c}}}}}$, carrying out the splitting at the level 400. This produced a sample of ${{X}^{{{\gamma }_{{\rm c}}}}}$ conditioned on being larger than 800. The splitting point was chosen based on the standard Monte Carlo sample with ${{L}_{{\sf max} }}=3000$ and $N={{10}^{9}}$, which returned an optimal choice of ${{x}_{{\sf min} }}=854$. We generated a sample of size $N=1.5\times {{10}^{7}}$ with a splitting factor of s = 100, giving us a sample in the tail of equivalent size to the sample gained using standard Monte Carlo with $N=1.5\times {{10}^{9}}$. This took approximately 20.2 days to generate, which is roughly 75% of the time required by the standard Monte Carlo approach. The optimal value of ${{x}_{{\sf min} }},$ according to the Kolmogorov–Smirnov statistic, given in (3), is ${{x}_{{\sf min} }}=1065$. This gives an estimate of $\hat{\lambda }(1065)=2.1051$.

The estimates of $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=900,\ldots ,6000$ are shown on the left of figure 6 and the estimates of $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=900,\ldots ,2000$ are shown on the right of figure 6. As in the standard Monte Carlo case, the values of $\hat{\lambda }({{x}_{{\sf min} }})$ appear fairly stable for at least the first 2000 values of ${{x}_{{\sf min} }},$ with the subsequent loss of stability due to the smaller sample sizes used for large values of ${{x}_{{\sf min} }}.$ Closer investigation of values in this region shows that, at least for ${{x}_{{\sf min} }}\gt 1000$, $\hat{\lambda }({{x}_{{\sf min} }})$ fluctuates between about 2.1035 and 2.1055 with the estimate obtained using the Kolmogorov–Smirnov statistic at the higher end of the range.


Figure 6. Left: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the splitting sample for ${{x}_{{\sf min} }}=900,\ldots ,6000$. Right: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the splitting sample for ${{x}_{{\sf min} }}=900,\ldots ,2000$.


5.4. Estimates using the joint sample

We then estimated λ using the combined sample from the standard Monte Carlo and splitting approaches. This sample of ${{X}^{{{\gamma }_{{\rm c}}}}}$ conditioned on being larger than 800 was equivalent to that obtained using standard Monte Carlo with a sample size of $N=3\times {{10}^{9}}$. The optimal value of ${{x}_{{\sf min} }},$ according to the Kolmogorov–Smirnov statistic, is ${{x}_{{\sf min} }}=991$. This gives an estimate of $\hat{\lambda }(991)=2.1045$.

The estimates of $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=900,\ldots ,6000$ are shown on the left of figure 7. The estimates of $\lambda ({{x}_{{\sf min} }})$ for ${{x}_{{\sf min} }}=900,\ldots ,2000$ are shown on the right of figure 7. As in both the standard Monte Carlo and splitting cases, the values of $\hat{\lambda }({{x}_{{\sf min} }})$ appear fairly stable for at least the first 2000 values of ${{x}_{{\sf min} }}.$ Closer investigation of values in this region shows that, as in the splitting case, for ${{x}_{{\sf min} }}\gt 1000$, $\hat{\lambda }({{x}_{{\sf min} }})$ fluctuates between about 2.1035 and 2.1055. Here, the estimate obtained using the Kolmogorov–Smirnov statistic is in the middle of the range.


Figure 7. Left: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the joint sample for ${{x}_{{\sf min} }}=850,\ldots ,6000$. Right: Values of $\hat{\lambda }({{x}_{{\sf min} }})$ estimated using the joint sample for ${{x}_{{\sf min} }}=850,\ldots ,2000$.


5.5. Summary of numerical results

The four estimates we have obtained are $\hat{\lambda }=2.1025$ for the standard Monte Carlo sample with ${{L}_{{\sf max} }}=4000$ and $N=1.5\times {{10}^{9}}$, $\hat{\lambda }=2.1064$ for the standard Monte Carlo sample with ${{L}_{{\sf max} }}=3000$ and $N={{10}^{9}}$, $\hat{\lambda }=2.1051$ for the splitting sample with ${{L}_{{\sf max} }}=4000$ and $N=1.5\times {{10}^{9}}$, and $\hat{\lambda }=2.1045$ for the joint sample with sample size equivalent to a standard Monte Carlo sample of $N=3\times {{10}^{9}}$. We note that standard errors are not available for these estimators, which are also biased (though not as biased as estimates obtained using least squares). Nevertheless, these estimates broadly agree with the results for lattice percolation given in [5] and [7], as well as the estimate based on the results in [26].

Acknowledgments

This work was supported by the DAAD (German Academic Exchange Service) / Go8 Australia–Germany Joint Research Cooperation Scheme. Dirk Kroese acknowledges support from the Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS) under grant number CE14010004.
