Skip to main content

Advertisement

Log in

Is Gibrat’s “Economic Inequality” lognormal?

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

In the seminal book “Les Inégalités Économiques,” Gibrat (Les Inégalités Économiques, Librairie du Recueil Sirey, Paris, 2013) proposed the law of proportional effect and claimed that a variety of empirical size distributions—such as income, wealth, firm size, and city size—obey the lognormal distribution. Gibrat’s law went on to become a stylized result stimulating a voluminous subsequent research that has contributed to our understanding of stochastic growth processes and a statistical regularity of the size distribution. However, many of the motivating examples used by Gibrat in his original work were subject to various data issues, and Gibrat’s reasoning of lognormal fit was based solely on graphical analysis. In this paper, we revisit the original 24 data sets considered by Gibrat (Les Inégalités Économiques, Librairie du Recueil Sirey, Paris, 2013) and show that in the majority of cases, the Pareto-type distribution actually provides a better fit to the data than lognormal. We show that Gibrat’s erroneous conclusion is partly due to data binning, truncation, and failure to weight data points properly.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. The full title of the book is “Les inégalités économiques: applications aux inégalités des richesses, à la concentration des entreprises, aux populations des villes, aux statistiques des familles, etc. d’une loi nouvelle, la loi de l’effet proportionnel” (Economic inequality: applications of a new law, the law of proportional effect, to wealth inequality, concentration of enterprises, population of cities, and family statistics, etc).

  2. See Sutton (1997) for an early review of the empirical literature on Gibrat’s law. More recent works include Ioannides and Overman (2003), Eeckhout (2004) and Giesen and Südekum (2011), among others.

  3. Examples are income (Reed 2003; Reed and Jorgensen 2004; Toda 2011, 2012), wealth (Klass et al. 2006; Vermeulen 2018), firm sizes (Axtell 2001; Luttmer 2007), and city sizes (Gabaix 1999; Reed 2002; Giesen et al. 2010; Rozenfeld et al. 2011). See Benhabib and Bisin (2018) for a recent review on theoretical and empirical results in income and wealth inequality.

  4. Another common trick to obtain stationarity is to assume that agents are born and die with constant probability, which was first introduced by Rutherford (1955). Random growth models with geometrically distributed age distribution also generate Pareto tails: see Wold and Whittle (1957) for the case with deterministic growth, Reed (2001) for the case with geometric Brownian motion, Toda (2014) for the Markovian, Gaussian case, and Beare and Toda (2017) for the Markovian, non-Gaussian case.

  5. See Gabaix (2009), Akhundjanov et al. (2017) and Akhundjanov and Chamberlain (2019) for various empirical applications.

  6. According to our reading, Gibrat (1931) contains 27 tabulated data sets. Three of them, poverty rate (p. 53), laborer wage (p. 56), and food expenditure (p. 67) are mentioned before the introduction of the law of proportional effect and thus are not purported to be lognormal. Therefore, our analysis focuses on the remaining 24 data sets.

  7. Gibrat actually took the 1893 income data for England (data set 2) from Pareto (1897).

  8. The logic of using mixture distribution models, especially for the untruncated empirical data, is that the data may often involve, as Perline (2005) notes, “many sources of heterogeneity that support the idea of discrete subpopulations likely to differ in important characteristics.” As a result, finite mixtures of lognormal and Pareto-type distributions fit well over the full range of values for distributions that comprise more heterogeneous groups.

  9. The CDF is obtained by setting \(\beta =\infty \) in the more general double Pareto-lognormal (dPlN) distribution. The published version of Reed and Jorgensen (2004) contains a typographical error in the CDF but the working paper version is correct.

  10. While the optimal tail cutoff point depends on the underlying distribution (Hall 1982; Jansen and de Vries 1991), several papers suggest using 10% (DuMouchel 1983; McNeil and Frey 2000), which we follow. Data set 12 (Rental income) is an exception. Since 96.81% of observations fall to the first bin, the algorithm chooses \(x_{\mathrm {min}}^*\) equal to the first cutoff so that it uses 100% of the sample to fit the Pareto distribution. In this case, we set \(x_{\mathrm {min}}^*\) to the second smallest cutoff.

  11. We could have implemented the Kolmogorov–Smirnov test of the power law fit as in Virkar and Clauset (2014), but due to the large sample size this test will reject the power law in most data sets. In this regard, the literature (Leamer and Levinsohn 1995; Gabaix and Ioannides 2004; Gabaix 2009) underscores that any non-tautological theory, including power law, can be rejected with a large empirical data set. Consequently, it is recommended that the main focus of empirical work should center on how well a theory fits the data, rather than whether it fits perfectly.

  12. Actually Gibrat uses the Gauss integral \(R(z)=\frac{1}{\sqrt{\pi }}\int _z^\infty \mathrm{e}^{-t^2}\mathrm{d}t\), which was convenient at the time because it was tabulated, but clearly \(R(z)=\Phi ^c(\sqrt{2}z)\) using a change of variable. Also, Gibrat plots \(z_k\) on the vertical axis and \(x_k\) on the horizontal axis, which corresponds to the regression \(z_k=-\frac{\mu }{\sigma }+\frac{1}{\sigma }x_k\).

  13. The scale of the two figures is not the same since the xy variables are interchanged and Gibrat uses the Gauss integral instead of the CDF of standard normal; see Footnote 12.

  14. Kapteyn (1903) is basically all about the change of variable. He shows that by adding small shocks, we get the normal distribution, and that if those shocks are added not to x but to a function F(x), then F(x) is normally distributed and gives a change-of-variable formula to get the density. A special case is \(F(x)=\log x\), which is the contribution of Gibrat (1931).

References

  • Akhundjanov SB, Devadoss S, Luckstead J (2017) Size distribution of national \(\text{ CO }_2\) emissions. Energy Econ 66:182–193

    Google Scholar 

  • Akhundjanov SB, Chamberlain L (2019) The power law distribution of agricultural land size. J Appl Stat, forthcoming

  • Axtell RL (2001) Zipf distribution of U.S. firm sizes. Science 293(5536):1818–1820

    Google Scholar 

  • Beare BK, Toda AA (2017) Geometrically stopped Markovian random growth processes and Pareto tails

  • Benhabib J, Bisin A (2018) Skewed wealth distributions: theory and empirics. J Econ Lit 56(4):1261–1291

    Google Scholar 

  • DuMouchel WH (1983) Estimating the stable index \(\alpha \) in order to measure tail thickness: a critique. Ann Stat 11(4):1019–1031

    Google Scholar 

  • Eeckhout J (2004) Gibrat’s law for (all) cities. Am Econ Rev 94(5):1429–1451

    Google Scholar 

  • Gabaix X (1999) Zipf’s law for cities: an explanation. Q. J. Econ. 114(3):739–767

    Google Scholar 

  • Gabaix X (2009) Power laws in economics and finance. Annu Rev Econ 1:255–293

    Google Scholar 

  • Gabaix X, Ioannides YM (2004) The evolution of city size distributions. In: Henderson JV, Thisse J-F (eds) Handbook of regional and urban economics, vol 4, chapter 53. Elsevier, Amsterdam, pp 2341–2378

    Google Scholar 

  • Gibrat R (1931) Les Inégalités Économiques. Librairie du Recueil Sirey, Paris

    Google Scholar 

  • Giesen K, Südekum J (2011) Zipf’s law for cities in the regions and the country. J Econ Geogr 11(4):667–686

    Google Scholar 

  • Giesen K, Zimmermann A, Suedekum J (2010) The size distribution across all cities–double Pareto lognormal strikes. J Urban Econ 68(2):129–137

    Google Scholar 

  • Hall P (1982) On some simple estimates of an exponent of regular variation. J R Stat Soc Ser B 44(1):37–42

    Google Scholar 

  • Ioannides YM, Overman HG (2003) Zipf’s law for cities: an empirical examination. Reg Sci Urban Econ 33(2):127–137

    Google Scholar 

  • Jansen DW, de Vries CG (1991) On the frequency of large stock returns: putting booms and busts into perspective. Rev Econ Stat 73(1):18–24

    Google Scholar 

  • Kalecki M (1945) On the Gibrat distribution. Econometrica 13(2):161–170

    Article  Google Scholar 

  • Kapteyn JC (1903) Skew frequency curves in biology and statistics. P. Noordhoff, Groningen

    Google Scholar 

  • Kapteyn JC, Uven MJV (1916) Skew frequency curves in biology and statistics. Hoitsema Brothers, Groningen

    Google Scholar 

  • Klass OS, Biham O, Levy M, Malcai O, Solomon S (2006) The Forbes 400 and the Pareto wealth distribution. Econ Lett 90(2):290–295

    Google Scholar 

  • Leamer E, Levinsohn J (1995) International trade theory: the evidence. In: Grossman G, Rogoff K (eds) Handbook of International Economics, vol 3, chapter 26. North-Holland, pp 1339–1394

  • Lindeberg JW (1922) Eine neue Herleitung des Exponentialgesetzes in der Wahrscheinlichkeitsrechnung. Math. Z. 15(1):211–225

    Google Scholar 

  • Luttmer EGJ (2007) Selection, growth, and the size distribution of firms. Q J Econ 122(3):1103–1144

    Google Scholar 

  • Mansfield E (1962) Entry, Gibrat’s law, innovation, and the growth of firms. Am Econ Rev 52(5):1023–1051

    Google Scholar 

  • McNeil AJ, Frey R (2000) Estimation of tail-related risk measures for heteroscedastic financial time series: an extreme value approach. J Empir Finance 7(3–4):271–300

    Google Scholar 

  • Mitzenmacher M (2004) A brief history of generative models for power law and lognormal distributions. Int Math 1(2):226–252

    Google Scholar 

  • Pareto V (1897) Cours d’Économie Politique, vol 2. F. Rouge, Lausanne

    Google Scholar 

  • Perline R (2005) Strong, weak and false inverse power laws. Stat Sci 20(1):68–88

    Google Scholar 

  • Reed WJ (2001) The Pareto, Zipf and other power laws. Econ Lett 74(1):15–19

    Google Scholar 

  • Reed WJ (2002) On the rank-size distribution for human settlements. J Reg Sci 42(1):1–17

    Google Scholar 

  • Reed WJ (2003) The Pareto law of incomes—an explanation and an extension. Physica A 319(1):469–486

    Google Scholar 

  • Reed WJ, Jorgensen M (2004) The double Pareto-lognormal distribution—a new parametric model for size distribution. Commun Stat Theory Methods 33(8):1733–1753

    Google Scholar 

  • Rozenfeld HD, Rybski D, Gabaix X, Makse HA (2011) The area and population of cities: new insights from a different perspective on cities. Am Econ Rev 101(5):2205–2225

    Google Scholar 

  • Rutherford RSG (1955) Income distributions: a new model. Econometrica 23(3):277–294

    Google Scholar 

  • Sutton J (1997) Gibrat’s legacy. J Econ Lit 35(1):40–59

    Google Scholar 

  • Toda AA (2011) Income dynamics with a stationary double Pareto distribution. Phys Rev E 83(4):046122

    Google Scholar 

  • Toda AA (2012) The double power law in income distribution: explanations and evidence. J Econ Behav Organ 84(1):364–381

    Google Scholar 

  • Toda AA (2014) Incomplete market dynamics and cross-sectional distributions. J Econ Theory 154:310–348

    Google Scholar 

  • Vermeulen P (2018) How fat is the top tail of the wealth distribution? Rev Income Wealth 64(2):357–387

    Article  Google Scholar 

  • Virkar Y, Clauset A (2014) Power-law distributions in binned empirical data. Ann Appl Stat 8(1):89–119

    Google Scholar 

  • Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 57(2):307–333

    Google Scholar 

  • Wold HOA, Whittle P (1957) A model explaining the Pareto distribution of wealth. Econometrica 25(4):591–595

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sherzod B. Akhundjanov.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A: Summary of Gibrat (1931)

Appendix A: Summary of Gibrat (1931)

Part I, Chapters 1–3 Gibrat introduces some concepts in probability and statistics.

Part I, Chapter 4 In Sect. 2, Gibrat discusses the binomial distribution using the example of drawing r balls with replacement from an urn containing n balls, of which fraction p and \(q=1-p\) are white and black, respectively. In Sect. 3, citing Laplace, he discusses that when n is large but p is fixed, the distribution converges to the normal distribution. In Sect. 5, he mentions that when \(n\rightarrow \infty \) but \(np=m\), we then obtain the Poisson distribution. He writes:

M. Jordan a donné des développements très intéressants théoriquement à partir de la formule de Poisson, ils ne paraissent pas avoir d’intérêt pratique.

(Mr. Jordan has given very interesting theoretical developments from Poisson’s formula, [but] it does not seem to have practical interest.)

From Sect. 6 on, Gibrat starts to apply the normal distribution to actual data. Interestingly, he does explain why he ignores the Poisson distribution on p. 52:

La formule de Poisson ne paraît avoir d’utilisation que pour les très petit nombres (spécialistes). Nous la laisserons donc de côté. Il ne rest que la formule de Laplace dont l’importance est très grande.

(Poisson’s formula does not seem to have a use except for very small numbers (specialists). Hence we will leave it aside. The only remaining formula that is of great importance is Laplace’s.)

In Sect. 10, Gibrat applies the reasoning of Kapteyn (1903) to explain why the binomial model can be applied to economics (in particular, wage distribution across cities in a certain profession).Footnote 14 Assume that initially all cities have the same wage, S. In month 1, it becomes \(S'+s\) in half of the cities and \(S'-s\) in the other half, where \(S'\) is the average wage in month 1. In month 2, wage grows by \(a+s\) in half of the cities and \(a-s\) in the other half, but independently from the previous month. Hence in month 2, the wage is \(S'+a+2s\) in 1/4 of cities, \(S'+a\) in 1/2, and \(S'+a-2s\) in 1/4.

He then mentions that we get a binomial distribution after n months, and hence “la loi de Laplace” (the law of Laplace, i.e., normal distribution) if n is large, assuming that the wage increase/decrease is independent from the past. At the bottom of p. 59, Gibrat mentions that it is easy to relax the simplifying assumptions: as long as “l’effet des causes” (\(=\text {shocks}\)) are

  1. 1.

    “elles sont nombreuses” (they are in large numbers),

  2. 2.

    “leur effet est indépendant de celui des autres” (its effect is independent from those of others), and

  3. 3.

    “l’effet de chaque cause est petit vis-à-vis de la somme des effets” (effect of each cause is small compared to the sum of the effects),

we get the normal distribution. In the footnote he cites Lindeberg (1922).

Part I, Chapter 5 Gibrat mentions that the normal distribution is symmetric, while many size distributions in economics are asymmetric. He cites again Kapteyn (1903) to motivate a function of size for which the binomial model applies. He thus arrives at “l’effet proportionnel” (proportional effect): instead of additive shocks to the wage above, what if they are multiplicative? He derives what is now called the lognormal distribution and spends many pages on how to estimate its parameters and then applies it to actual data (including the famous French firm size distribution, data set 1 in Table 1).

Part I, Chapter 6 Gibrat elaborates on “la loi de l’effet proportionnel” (the law of proportional effect). He mentions that a necessary and sufficient conditions for obtaining “formula A” (which is basically the lognormal distribution) are:

  1. 1.

    “Les causes de fluctuation du personnel sont nombreuses” (large number of shocks),

  2. 2.

    “Leur effet relatif sur le nombre d’ouvriers (ou leur effet absolut sur le logarithme), ne dépend pas de ce nombre d’ouvriers.” (the relative effect on the number of workers (or the absolute effect on the logarithm) does not depend on the number of workers),

  3. 3.

    “L’effet de chaque cause de fluctuation est petit vis-à-vis de l’effet de toutes” (shocks are small relative to the total).

Part I, Chapter 7 Gibrat discusses some measures of inequality, including the Gini coefficient.

Parts II–IV The remainder of the book is devoted to the statistical analysis of various data sets (see Table 1), where Gibrat shows the remarkable fit of the lognormal distribution. He repeatedly criticizes Pareto (1897); for example, on p. 111, footnote 2, he mentions that Pareto’s law is merely empirical and does not fit the data except at the upper tail, whereas the lognormal distribution is based on theory and fits the entire distribution.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Akhundjanov, S.B., Toda, A.A. Is Gibrat’s “Economic Inequality” lognormal?. Empir Econ 59, 2071–2091 (2020). https://doi.org/10.1007/s00181-019-01719-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-019-01719-z

Keywords

JEL Classification

Navigation