A critique of distributional analysis in the spatial model

https://doi.org/10.1016/j.mathsocsci.2009.09.005Get rights and content

Abstract

Distributional analysis is widely used to study social choice in Euclidean models (Tullock, 1967a, Tullock, 1967b, Arrow, 1969, Davis et al., 1972, Grandmont, 1978, McKelvey et al., 1980, Demange, 1982, Caplin and Nalebuff, 1988, e.g). This method assumes a continuum of voters distributed according to a probability measure. Since infinite populations do not exist, the goal of distributional analysis is to give an insight into the behavior of large finite populations. However, the properties of finite populations do not necessarily converge to the properties of infinite populations. Thus the method of distributional analysis is flawed. In some cases (Arrow, 1969) it will predict that a point is in the core with probability 1, while the true probability converges to 0. In other cases it can be combined with probabilistic analysis to make accurate predictions about the asymptotic behavior of large populations, as in Caplin and Nalebuff (1988). Uniform convergence of empirical measures (Pollard, 1984) is employed here to yield a simpler, more general proof of α-majority convergence, a short proof of yolk shrinkage, and suggests a rule of thumb to determine the accuracy of distribution-based predictions. The results also help clarify the mathematical underpinnings of statistical analysis of empirical voting data.

Introduction

Distributional analysis has been a widely used technique in the study of social choice in Euclidean models (Tullock, 1967a, Tullock, 1967b, Arrow, 1969, Davis et al., 1972, Grandmont, 1978, McKelvey et al., 1980, Demange, 1982, Schofield, 1985, Caplin and Nalebuff, 1988, e.g.) (see also Davis et al. (1970) and Riker and Ordeshook (1973) Chaps. 11–12). In distributional analysis, a continuum or infinite population of voters is analyzed, where the population follows some probability distribution μ. Infinite populations do not exist. Our concern is with finite populations. Therefore, the principal purpose of distributional analysis must be to give an insight into the behavior of large but finite populations. In this paper it is shown that distributional analysis is flawed when applied to this end. The problem is essentially one of convergence: if the limiting case is to give an insight into the large finite case, behavior of the latter should converge to behavior of the former as the population grows. Unfortunately, it turns out that properties of finite populations do not necessarily converge to the properties of infinite populations. In some cases a distributional analysis will predict that a point is in the core with probability 1, while the true probability converges to 0. Thus, the analysis of infinite populations may fail to yield any information about finite populations, however large.

An alternative to distributional analysis is termed here the finite sample method. In this method, n points are independently generated according to the distribution μ. This random finite sample from μ forms a configuration of n points whose properties are analyzed. A typical question would be: “what is the probability, as a function of n, that the configuration generated has nonempty core?” Typical answers to these questions are bounds or asymptotically close estimates for the desired probability.

It is sometimes possible to combine distributional analysis with finite sample analysis to make correct predictions about the asymptotic behavior of large populations. An example of this is found in Caplin and Nalebuff (1988). We expose some key properties which enable the convergence in this case, yielding a simpler and more general proof of the convergence of (Simpson–Kramer) α-majority rule, and a simpler though less general proof of yolk shrinkage. The analysis suggests a rule of thumb as to when one might expect distributional analysis to give accurate or inaccurate predictions about the behavior of finite populations.

Another motivation for analyzing the distributional method is to help develop a rigorous foundation for statistical empirical study of group choice. One would like to poll the members of a committee, assembly, or population (or in some other way extract data on their preferences), and based on that data and some solution concept, make a prediction with some confidence regarding what the outcome will be. How can a solution concept be tested experimentally? When the data are sampled from a large population, there are issues of statistical accuracy. Even if preference data are extracted for each individual, issues remain concerning the robustness of the solution concept with respect to individual perturbations. In other words, a person’s views on issues are not perfectly constant, and can even change in the voting booth. How can we know that a prediction based on polls taken one day will be close to the actual results the next day?

We may think of the preference data as a random sample from a probability distribution, and the population’s actual vote as another random sample from this distribution. The problem is to establish rigorously the stability of a solution concept under this model. In statistical terms, the finite sample from μ is an empirical measure μn. A solution concept is a statistic, a function f operating on probability measures. If f is a consistent statistic, then the limiting behavior of f(μn) will (almost surely) be like f(μ), and the solution concept is stable. This issue has received a great deal of attention for the classical core or Nash equilibrium under the term “structural stability”. The convergence theorems discussed in Section 6 should aid in determining the stability of other more widely applicable solution concepts.

The outline of the paper is as follows: the remainder of this section reviews essential definitions of the spatial model. Section 2 introduces the two methods by way of a small example. Section 3 analyzes the distributional method. The difficulty with the method is shown to arise from the identification of two different definitions of majority rule preference. Section 4 demonstrates in greater detail a case from Arrow (1969) where the distributional method gives a misleading result. Section 5 considers a case where the method may be used to achieve results valid for large finite populations, and demonstrates how to estimate how large the population must be. Section 6 introduces the use of uniform convergence of empirical measures, following a suggestion due to R. Foley, R. McKelvey, and G. Weiss, applies it to α-majority and yolk, and discusses in general when we may expect the distributional method to give accurate predictions.

In the Euclidean spatial model, a group of individuals must make a social choice from the set of alternatives m. Each individual i has a most preferred point vim. This point will be referred to as a voter point, or simply a voter. A voter at vi strictly prefers y to z if viy<viz and is indifferent between equidistant alternatives, where distances are with respect to the Euclidean norm. This model is more general than it appears: Davis et al. (1972) show it is equivalent to any linearly transformed spatial model which maintains the properties of an inner product; Grandmont (1978) observes that the essential property of the Euclidean model is often the “division-by-hyperplane” property (in the Euclidean case, the perpendicular bisector of two points separates those who prefer one point to the other), and so results in the Euclidean model usually apply to the more general class of “intermediate preferences”, including constant elasticity of substitution (C.E.S.) utility functions (these extend the class of Davis et al. by allowing a change to an Lp norm from the L2 norm).

A finite configuration is a list of (not necessarily distinct) voter points V={v1,,vn}. Given a finite configuration V, we say that alternative y dominates alternative z under α-majority voting if |{i:viy<viz}|>αn, that is, more than a fraction α of the voters prefer y to z. The value α=12 corresponds to simple majority voting. In geometric terms, y dominates z if the open halfspace containing y that is defined by the hyperplane bisecting and normal to the line segment yz¯ contains more than αn voter points. The halfspace is specified algebraically as {x:(yz)x>(yz)(y+z)2}.

The core is the (possibly empty) set of alternatives that are not dominated by any alternative, where voting is assumed to be simple majority unless specified otherwise. The smallest α for which the α-majority core is nonempty is the Simpson–Kramer, or minimax, value; the corresponding core is the minimax set. A hyperplane is median if the two closed halfspaces it defines each contain at least half the population. The yolk is the smallest ball intersecting all median hyperplanes (Ferejohn et al., 1984, McKelvey, 1986, Koehler, 1990). As is true of most solution concepts, both the minimax set and the yolk coincide with the simple majority rule core point if the latter exists. For recent work on the yolk and related concepts, see Banks et al., 2002, Banks et al., 2006.

For any set Sm let S denote {s|sS}. We say that μ is sign-invariant if μ(S)=μ(S) for all measurable subsets Sm. A probability measure μ on m is weakly centered at 0 if for all nonzero vectors cm the halfspace h(c){x|cx0} has measure μ(h(c))12. In terms of distributional analysis, μ is weakly centered at 0 iff 0 in the core of a population μ. We say that μ is strictly centered at 0 if for all scalars ϵ>0 the halfspace h(c,ϵ){x|cxϵ} has measure μ(h(c,ϵ))<12. Probability measure μ is weakly (resp. strictly) centered at p iff μ(Sp) is weakly (resp. strictly) centered at 0.

Section snippets

An example of the two methods

Let us begin with a simple two-dimensional model based on an example in Schofield (1985). Let μ be the probability distribution that is uniform on a circle (the boundary of a disk). Place a single voter v1 at the center of the circle, which for convenience we locate at the origin. Randomly generate n1 additional voter points v2,,vn, where n is even, according to μ.

We illustrate the method of finite sample analysis on the model just stated. A particular realization of this random process is a

An analysis of the distributional method

We have observed that the outcomes of the two methods can differ. Let us point out an important distinction in how they operate. The distributional method works directly with μ, and quantities such as μ(h) are considered. On the other hand, in the finite sample method a configuration V is drawn from μ, and quantities such as |Vh| are considered. Informally, the distributional method counts up voters by looking at the distribution function μ directly, while the finite sample method counts up

An unsuccessful case: The Sonnenschein–Arrow theorem

Let us now examine a specific case of analysis from the literature where the predictions of distributional analysis are misleading. In his article, Arrow continues by stating a theorem (he attributes to Sonnenschein) that generalizes Tullock’s example Arrow (1969, pages 108–109):

For any pair of alternatives x,y, let N(x,y) be the number of individuals who prefer x to y. Then let xMy be the statement N(x,y)N(y,x) and xM̄y the statement that N(x,y)>N(y,x).

Theorem

Suppose that, for each alternative x0

Estimating a minimum population to assure accurate predictions

Although in some cases the distributional method can mislead, it can in other cases make correct predictions for large enough populations. In these cases we want to know how large is enough. Intuitively, we need a margin of safety between the knife-edge requirement of a voting rule for finite populations, and the distributional properties. As a rule of thumb, if n is the population size, we will want the safety margin to be at least n because the most typical standard deviations involved are n/

Uniform convergence

Why do the distributional results discussed in Section 5 apply to large finite populations, while some others do not? Part of the answer has to do with the distinction between non-dominance and strict dominance. Recall from Section 4 that the finite sample meaning of the non-dominance relation R does not converge to the meaning in the distributional case. In contrast, the strict dominance relation P:yPz iff yRz and not yRz does converge. That is, if yPz in the distributional sense, and a random

Acknowledgments

The author thanks George Dantzig, Mahmoud El-Gamal, Robert Foley, Bob Parks, Richard McKelvey, Loren Platzman, Norman Schofield, Richard Stone, Gideon Weiss, Lyn Whitaker, and Kevin Wood, for invaluable discussions and correspondence, and the anonymous reviewers for the helpful comments. The author also owes particular thanks to Bernie Grofman and Don Saari for their inspiration and encouragement. He was supported by a National Science Foundation Presidential Young Investigator Award

References (37)

  • Otto A. Davis et al.

    Social preference orderings and majority rule

    Econometrica

    (1972)
  • Otto Davis et al.

    An expository development of a mathematical model of the electoral process

    American Political Science Review

    (1970)
  • Dvoretzky, A., Erdös, P., Kakutani, S., 1961. Nonincrease everywhere of the Brownian motion process. In: 4th Berkeley...
  • William Feller

    An Introduction to Probability Theory and its Applications

    (1965)
  • John A. Ferejohn et al.

    Limiting distributions for continuous state Markov models

    Social Choice and Welfare

    (1984)
  • Jean-Michel Grandmont

    Intermediate preferences and the majority rule

    Econometrica

    (1978)
  • Joseph Greenberg

    Consistent majority rules over compact sets of alternatives.

    Econometrica

    (1979)
  • B. Grunbaum

    Partitions of mass-distributions and convex bodies by hyperplanes

    Pacific Journal Mathematics

    (1960)
  • Cited by (8)

    • The finagle point is close to the yolk

      2023, Operations Research Letters
    • Approximation of the yolk by the LP yolk

      2010, Mathematical Social Sciences
    • The almost surely shrinking yolk

      2010, Mathematical Social Sciences
      Citation Excerpt :

      Some tighter bounds are found in the companion paper (Tovey, 2010), albeit for a less general class of distributions than is treated here.

    • The instability of instability of centered distributions

      2010, Mathematical Social Sciences
      Citation Excerpt :

      For the yolk, the algorithm of Tovey (1992) handles the computational question, and Theorem 6.1 in this paper makes a start at the estimation question. For the Simpson–Kramer min–max set, Tovey (1991) provides an efficient computational method; as stated the asymptotic consistency is established in Caplin and Nalebuff (1988) and Tovey (2010). Similar work is called for with respect to the solution concepts of Models I and II.

    View all citing articles on Scopus
    View full text