Improved bounds on the probability of the union of events some of whose intersections are empty

doi:10.1016/j.orl.2015.10.004

Operations Research Letters

Volume 44, Issue 1, January 2016, Pages 39-43

https://doi.org/10.1016/j.orl.2015.10.004 Get rights and content

Abstract

We formulate a linear program whose optimal objective function value can be used in other formulations to yield improved upper and lower bounds on the probability of the union of events if we know some empty intersections of small numbers of events. The LP relaxation of an extension of the maximum independent set problem provides an upper bound on the largest number of events that have a nonempty intersection. We present numerical experiments demonstrating the effectiveness of our formulation.

Introduction

Computing the probability of the union of events is important in reliability theory, stochastic programming, and other sciences concerned with stochastic systems. In network reliability, consider a communication network with nodes and links, each with a probability of failure. The two-terminal reliability of a pair of nodes is the probability of the union of events, each of which occurs when a path between the two nodes consists of links without failure. The all-terminal reliability is the probability of the union of events, each of which occurs when a spanning tree of the network consists of links without failure. In probabilistic constrained stochastic programming, a joint probabilistic constraint for random variables $ξ_{1}, \dots, ξ_{n}$ specifies a lower bound on $P (ξ_{1} \leq x_{1} \cap \dots \cap ξ_{n} \leq x_{n}) = 1 - P (ξ_{1} > x_{1} \cup \dots \cup ξ_{n} > x_{n})$ , which involves the probability of the union of events. Although it is very hard to compute the exact probability of the union of a large number of events, we can compute its approximation by using the probabilities of individual events and intersections of a small number of events.

Let ${A_{1}, \dots, A_{n}}$ be any set of $n (\in Z_{> 0})$ events in an arbitrary probability space and $N ≔ {1, \dots, n}$ be the set of all positive integers at most $n$ . For any finite set $I$ , we designate its cardinality (i.e., the number of elements in $I$ ) by $| I |$ . For any subset $I \subseteq N$ , we introduce a notation for the probability of the intersection of events $A_{i}$ for all $i \in I$ as follows: $p_{I} ≔ P (⋂_{j \in I} A_{j}) .$

We introduce binomial moments $S_{i}$ for all $i \in N$ as follows: $S_{i} ≔ \sum_{I \subseteq N : | I | = i} p_{I} .$ Let $ν$ denote the random number of events among $A_{1} \dots, A_{n}$ that occur. Then we have the following relation for all $i \in N$ : $E [(\binom{ν}{i})] = \sum_{j = 0}^{n} (\binom{j}{i}) P (ν = j) = S_{i},$ where we employ the extended definition of binomial coefficients, in which $(\binom{j}{i}) = 0$ for all $i, j \in Z_{\geq 0}$ such that $i > j$ . The value $S_{i}$ is called the $i$ th binomial moment of $ν$ .

The classical inclusion–exclusion principle [10] (see also Prékopa [21]) gives the probability of the union of events by using binomial moments $S_{i}$ for all $i \in N$ as follows: $P (A_{1} \cup \dots \cup A_{n}) = S_{1} - S_{2} + \dots + {(- 1)}^{n - 1} S_{n} .$ However, this formula is impractical if the number of events $n$ is large, in which case the calculation of $S_{i}$ is intractable unless $i$ is small enough (close to 1) or large enough (close to $n$ ) since we need $(\binom{n}{i})$ sums in (1). We can still calculate $S_{i}$ for a few small $i$ and compute lower and upper bounds on the probability. Let $m \in N$ be the largest number of events the probability of whose intersection is used in approximating the probability of the union and $M ≔ {1, \dots, m}$ be the set of all positive integers at most $m$ ; we use only $p_{I}$ for all $I \subseteq N$ such that $| I | \in M$ , from which $S_{1}, \dots, S_{m}$ can be calculated by (1). In practice, we usually consider a small $m ≪ n$ , mostly such as $m = 2, 3, 4$ . The well-known Bonferroni inequalities (or bounds) [2] state that for any $m \in N$ , $P (A_{1} \cup \dots \cup A_{n}) {\begin{matrix} \geq \\ \leq \end{matrix}} S_{1} - S_{2} + \dots + {(- 1)}^{m - 1} S_{m} {\begin{matrix} if m is even \\ if m is odd . \end{matrix}$ These bounds are usually very weak, often out of [0,1]. Then the best possible (also called sharp) bounds using $S_{1}, \dots, S_{m}$ for a small $m$ have been found. In the case $m = 2$ , the sharp lower and upper bounds expressed as closed forms in terms of $n, S_{1}, S_{2}$ are obtained by Dawson and Sankoff [8] and Kwerel [15], respectively (see also Prékopa [18] and Boros and Prékopa [3]). In the case $m = 3$ , the sharp bounds expressed as closed forms in terms of $n, S_{1}, S_{2}, S_{3}$ are obtained by Boros and Prékopa [3]. In the case $m = 4$ , the sharp upper bound expressed as a closed form in terms of $n, S_{1}, S_{2}, S_{3}, S_{4}$ is obtained by Boros and Prékopa [3]. For a general $m$ , Prékopa [18] observed that all these sharp bounds are optimal objective function values of certain linear programs, known as binomial moment problems. Furthermore, sharp bounds on the probabilities that exactly/at least $r$ events occur are given in Prékopa [19], and the bounds on the probabilities and expectations of convex functions of discrete random variables are given in Prékopa [20]. A binomial moment $S_{i}$ carries an aggregated information over event probabilities $p_{I}$ , using every one of which without aggregation we can obtain better bounds. Hailperin [14] formulated linear programs with an exponential number of variables, known as Boolean probability bounding problems, which give sharp bounds using disaggregated event probabilities.

A pair of binomial moment problems (also called the aggregated LP problems, for probability bounds of the union of events) is formulated as follows [18]: $min/max \sum_{j \in N} x_{j}$ $subject to \sum_{j \in N} (\binom{j}{i}) x_{j} = S_{i} for i \in M$ $x_{j} \geq 0 for j \in N .$ The optimal objective function values of these minimization and maximization problems are the sharp lower and upper bounds, respectively, on the probability of the union of $n$ events that can be computed using only the aggregated information $S_{i}$ for all $i \in M$ .

A pair of Boolean probability bounding problems (also called the disaggregated LP problems, for probability bounds of the union of events) is formulated as follows [14]: $min/max \sum_{J \subseteq N : | J | \in N} x_{J}$ $subject to \sum_{J \subseteq N : | J | \in N} a_{I J} x_{J} = p_{I} for I \subseteq N : | I | \in M$ $x_{J} \geq 0 for J \subseteq N : | J | \in N,$ where we define $a_{I J} ≔ {\begin{cases} 1 & if I \subseteq J \\ 0 & otherwise . \end{cases}$ The optimal objective function values of these minimization and maximization problems are the sharp lower and upper bounds, respectively, on the probability of the union of $n$ events that can be computed using only the disaggregated information $p_{I}$ for all $I \subseteq N$ such that $| I | \in M$ . Since such a set of event probabilities is almost always the most detailed information we can use, the bounds are the best possible we can expect in general. These problems are, however, impractical owing to an exponential number $(2^{n} - 1)$ of the decision variables $x_{J}$ .

Although we cannot solve the disaggregated LP problems for a large number of events $n$ in practice, we may extract useful information from disaggregated event probabilities without dealing with an exponential size and use it with their aggregations to obtain improved bounds.

Section snippets

Improved bounds

We assume that all probabilities of intersections of small numbers of events (formally, $p_{I}$ for all $I \subseteq N$ such that $| I | \in M \subset N$ ) are known and some of them are 0 (i.e., such intersections are empty). The smallest number of events that have an empty intersection is 2 since we may use only nonempty individual events when computing their union; hence, we assume that $m \geq 2$ . For any nonempty subsets $I, J \subseteq N$ such that $I \subseteq J$ and any $ε \in [0, 1]$ , we have the implication $p_{I} \leq ε ⟹ p_{J} \leq ε$ . In particular when $ε = 0$ , we have the

Numerical experiments

We carried out numerical experiments to compare probability bounds by our formulations to those by binomial moment problems with data sets that are expected to contain empty intersections of events. We used the Gurobi™ Optimizer (version 6.0) [12] for solving linear programs; when we solve binomial moment problems, for which numerical inaccuracy of the feasibility of solutions is sometimes encountered, we set the feasibility tolerance parameter (‘FeasibilityTol’) to its minimum value $(1 e- 9)$ .

Applications

Approximating the probability of the union of events has applications in network reliability (Boros et al. [5], Prékopa and Boros [22], and Gao and Prékopa [11]; see also Ball et al. [1]) and system reliability (Habib and Szántai [13] and Unuvar et al. [24]; see also a survey by Chao et al. [7]).

In most real-world examples, not a small number of intersections of events are often empty. Many intersections of half-spaces used in expressing the complement of a Euclidean polyhedron are empty

Acknowledgments

We are grateful to Prof. Endre Boros for his valuable comments. This research was supported by NSF Grant CMMI-0856663.

References (24)

M.O. Ball et al.
Network reliability
A. Habib et al.
New bounds on the reliability of the consecutive $k$ -out-of- $r$ -from- $n$ : F system
Reliab. Eng. Syst. Saf.
(2000)
A. Prékopa
The discrete moment problem and linear programming
Discrete Appl. Math.
(1990)
C.E. Bonferroni
E. Boros et al.
Closed form two-sided bounds for probabilities that at least $r$ and exactly $r$ out of $n$ events occur
Math. Oper. Res.
(1989)
E. Boros et al.
Probabilistic bounds and algorithms for the maximum satisfiability problem
Ann. Oper. Res.
(1989)
E. Boros et al.
The use of binomial moments for bounding network reliability
J. Bukszár, R. Henrion, M. Hujter, T. Szántai, Polyhedral inclusion-exclusion, Stochastic Programming E-Print Series...
M.T. Chao et al.
Survey of reliability studies of consecutive- $k$ -out-of- $n$ :F and related systems
IEEE Trans. Reliab.
(1995)
D.A. Dawson et al.
An inequality for probabilities
Proc. Amer. Math. Soc.
(1967)

G. Debreu

A. de Moivre

The Doctrine of Chances: Or, A method of Calculating the Probability of Events in Play

(1718)

Cited by (8)

Bounds for probabilistic programming with application to a blend planning problem
2022, European Journal of Operational Research
Citation Excerpt :
Further Bonferroni-type inequalities and a summary of them can be found in the book (Bukszár, Mádi-Nagy, & Szántai, 2012; Galambos & Simonelli, 1996). Improved bounds on the probability of the union of events some of whose intersections are emptly are discussed in Yoda & Prékopa (2016). Subasi, Subasi, Binmahfoudh, & Prékopa (2017) improve the previous bounds using the shape information of the distribution of the random variable based on the knowledge of some binomial moments.
In this paper, we derive deterministic inner approximations for single and joint independent or dependent probabilistic constraints based on classical inequalities from probability theory such as the one-sided Chebyshev inequality, Bernstein inequality, Chernoff inequality and Hoeffding inequality (see Pinter, 1989). The dependent case has been modelled via copulas. New assumptions under which the bounds based approximations are convex allowing to solve the problem efficiently are derived. When the convexity condition can not hold, an efficient sequential convex approximation approach is further proposed to solve the approximated problem. Piecewise linear and tangent approximations are also provided for Chernoff and Hoeffding inequalities allowing to reduce the computational complexity of the associated optimization problem. Extensive numerical results on a blend planning problem under uncertainty are finally provided allowing to compare the proposed bounds with the Second Order Cone (SOCP) formulation and Sample Average Approximation (SAA).
Sharp probability bounds for the binomial moment problem with symmetry
2018, Operations Research Letters
Sharp bounds in terms of the first few binomial moments are found for the probability of a union of events, when the random variable denoting the number of events that occur follows symmetric distribution. Connection between the bounds of this paper and the bounds from a special case of the binomial moment problem of Prekopa (1995) is shown. As a special case, bounds for the probability when the underlying probability distribution is unimodal-symmetric are also found.
New bounds for the probability that at least k-out-of-n events occur with unimodal distributions
2017, Discrete Applied Mathematics
Citation Excerpt :
Probability bounds based on the probabilities of the individual events and their intersections, and graph structures also exist in literature [28,10,8,55,9]. The reader is referred to papers by Veneziani (2009) [56], Boros, Scozzari, Tardella, and Veneziani (2014) [7], Prékopa, Ninh, and Alexe (2016) [45], and Prékopa and Yoda (2016) [47] for recent linear programming based probability bounds. Other studies on probability bounding can be found in [13,25,34,48].
The contribution of the shape information of the underlying distribution in probability bounding problem is investigated and a linear programming based bounding methodology to obtain robust and efficiently computable bounds for the probability that at least $k$ -out-of- $n$ events occur is developed. The dual feasible basis structures of the relaxed versions of linear programs involved are fully described. The bounds for the probability that at least $k$ -out-of- $n$ events occur are obtained in the form of formulas and as the customized algorithmic solutions of the LP’s formulated. An application in finance is presented.
TIGHT PROBABILITY BOUNDS WITH PAIRWISE INDEPENDENCE
2023, SIAM Journal on Discrete Mathematics
The value of shape constraints in discrete moment problems: a review and extension
2022, Annals of Operations Research
Tight Probability Bounds with Pairwise Independence
2020, SSRN

View all citing articles on Scopus

View full text

Improved bounds on the probability of the union of events some of whose intersections are empty

Abstract

Introduction

Section snippets

Improved bounds

Numerical experiments

Applications

Acknowledgments

Reliab. Eng. Syst. Saf.

Discrete Appl. Math.

Closed form two-sided bounds for probabilities that at least r and exactly r out of n events occur

Math. Oper. Res.

Probabilistic bounds and algorithms for the maximum satisfiability problem

Ann. Oper. Res.

The use of binomial moments for bounding network reliability

Survey of reliability studies of consecutive-k-out-of-n:F and related systems

IEEE Trans. Reliab.

An inequality for probabilities

Proc. Amer. Math. Soc.

The Doctrine of Chances: Or, A method of Calculating the Probability of Events in Play

Closed form two-sided bounds for probabilities that at least $r$ and exactly $r$ out of $n$ events occur

Survey of reliability studies of consecutive- $k$ -out-of- $n$ :F and related systems