Next Article in Journal
Improving the Capacity of Quantum Dense Coding and the Fidelity of Quantum Teleportation by Weak Measurement and Measurement Reversal
Next Article in Special Issue
Opinion Dynamics Explain Price Formation in Prediction Markets
Previous Article in Journal
Asymptotic Distribution of Certain Types of Entropy under the Multinomial Law
Previous Article in Special Issue
Unanimity, Coexistence, and Rigidity: Three Sides of Polarization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Sandpile Universality in Social Inequality: Gini and Kolkata Measures

by
Suchismita Banerjee
1,*,
Soumyajyoti Biswas
2,
Bikas K. Chakrabarti
1,3,
Asim Ghosh
4 and
Manipushpak Mitra
1
1
Economic Research Unit, Indian Statistical Institute, Kolkata 700108, India
2
Department of Physics, SRM University-AP, Amaravati 522240, India
3
Condensed Matter Physics, Saha Institute of Nuclear Physics, Kolkata 700064, India
4
Department of Physics, Raghunathpur College, Purulia 723133, India
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(5), 735; https://doi.org/10.3390/e25050735
Submission received: 3 April 2023 / Revised: 24 April 2023 / Accepted: 25 April 2023 / Published: 28 April 2023
(This article belongs to the Special Issue Entropy-Based Applications in Sociophysics)

Abstract

:
Social inequalities are ubiquitous and evolve towards a universal limit. Herein, we extensively review the values of inequality measures, namely the Gini (g) index and the Kolkata (k) index, two standard measures of inequality used in the analysis of various social sectors through data analysis. The Kolkata index, denoted as k, indicates the proportion of the ‘wealth’ owned by ( 1 k ) fraction of the ‘people’. Our findings suggest that both the Gini index and the Kolkata index tend to converge to similar values (around g = k 0.87 , starting from the point of perfect equality, where g = 0 and k = 0.5 ) as competition increases in different social institutions, such as markets, movies, elections, universities, prize winning, battle fields, sports (Olympics), etc., under conditions of unrestricted competition (no social welfare or support mechanism). In this review, we present the concept of a generalized form of Pareto’s 80/20 law ( k = 0.80 ), where the coincidence of inequality indices is observed. The observation of this coincidence is consistent with the precursor values of the g and k indices for the self-organized critical (SOC) state in self-tuned physical systems such as sand piles. These results provide quantitative support for the view that interacting socioeconomic systems can be understood within the framework of SOC, which has been hypothesized for many years. These findings suggest that the SOC model can be extended to capture the dynamics of complex socioeconomic systems and help us better understand their behavior.

1. Introduction

The distribution of wealth has been a topic of discussion and concern throughout human history. There is nothing more unequal than the distribution of wealth. No physical quantity can be perceived without sophisticated measuring instruments within the span wealth, which can vary by roughly nine orders of magnitude. This perception of wealth inequality has led to social discord since ancient times, to the extent that the history of popular struggles is broadly that against wealth inequality. In his dialogue (The Law) describing the ideal settlement, Magnesia, Plato recommended that the ratio of wealth between the poor and wealthy should exceed 1:4. However, societies have been far more unequal than that; therefore, herein, we would like to review the universal and extreme level of inequality in our society.
For the purpose of our discussion, we broaden our scope to include the study of inequalities in assets earned through competitive means, which can also include revenues earned through movies, citations of publications (of individuals, universities, journals, etc.), gains from stock market fluctuations, etc. As we shall see, in all such cases, inequalities are ubiquitous. However, the consequences are not similar. Wealth inequalities have a variety of severe social consequences, for example, limiting access to basic needs such as food, housing, healthcare, as well as access to education, thereby limiting opportunity for upward mobility, etc. On the other hand, inequality in, for example, movie revenues, is of very little consequence for the larger society. Therefore, while policies exist to moderate wealth inequalities in some cases, in other cases, the dynamics are completely unrestricted. Hence, such dynamics provide a window into what may occur with respect to inequalities in the distribution of some form of asset when such assets are sought after by competing entities without any restriction.
We argue, however, that irrespective of the particular context, dynamics of inequality follow broadly universal characteristics. It is, therefore, possible to shed light on a highly complex situation of wealth inequality by characterizing its universal dynamics. This characterization, of course, requires quantification of inequality through unambiguous and possibly intuitive measures. We attempt to do just that in this article. We study a novel inequality index; compare it with other existing inequality indices; and characterize the universal dynamics, followed by those indices, by analyzing real data and apply the same process to self-organized critical sand pile models driven by the production of entropy.

2. Social Inequality and Its Measures

Over a century ago (in 1896), Vilfredo Pareto [1] noticed that only 20% of Italy’s people possessed 80% of the country’s wealth. He proceeded to conduct surveys in other European nations, where he discovered, to his astonishment, that the same distribution held. The Pareto Principle, also called Pareto’s 80/20 law, asserts that 20% of the causes are responsible for 80% of the outcomes. In other words, the principle suggests that a small fraction of the factors contribute to a large majority of the results. The Pareto Principle has been used to analyze many different areas, from economics to quality management, even in personal development. In business, it is often used to identify the most important areas for improvement. For example, if a company wants to improve customer satisfaction, it can use the Pareto Principle to identify the 20% of customers who are responsible for 80% of the complaints.
In the year 1905, an American economist by the name of Max Lorenz [2,3] came up with the Lorenz curve, which is a graphical representation of the distribution of wealth in a society. If the population of a society is arranged in the ascending order of their wealth, then the curve is created by plotting the cumulative fraction of the wealth ( L ( p ) ) possessed by the p fraction of the poorest individuals (red curve shown in Figure 1).
If wealth were perfectly equally distributed, the Lorenz curve would be a straight line from the origin to the top right corner of the graph (black dotted line in Figure 1). In reality, the curve is usually downward-slanting, indicating that a relatively small portion of the population holds a disproportionate share of the wealth based on the fraction of people who possess wealth less than or equal to a certain amount, denoted as F ( p ) , and the fraction of total wealth possessed by those people, denoted as L ( p ) . To illustrate this, consider a society of N individuals with wealth distribution defined by the function f ( y ) , where y represents the wealth of each person. The fraction of individuals with wealth less than or equal to p is calculated as the integral of f ( y ) from 0 to p, divided by N as follows:
F ( p ) = 1 N 0 p f ( y ) d y
The fraction of total wealth possessed by those individuals is calculated as the integral of y f ( y ) from 0 to p, divided by the total wealth of the society ( μ ) as follows:
L ( p ) = 1 μ 0 p y f ( y ) d y .
Lorenz originally produced a graphical representation of L ( p ) plotted against F ( p ) using a parametric method. Both L ( p ) and F ( p ) are functions that increase monotonically and continuously from zero to one as p ranges from zero to infinity. The resulting plot, now known as the Lorenz curve, can be displayed within a unit square, which is depicted in a schematic form in Figure 1.
Mathematically, the Lorenz curve can be formulated in a more compact way as follows,
L ( p ) = 1 μ 0 p F 1 ( x ) d x
where F 1 ( x ) = inf { y | F ( y ) x } , which means it will take the minimum value of y for which F ( y ) x .
Along with the definition of the Lorenz curve, for economic inequality, another kind of Lorenz curve is utilized, which is known as the ‘Complementary Lorenz curve’ ( L ^ ( p ) ; green curve in Figure 1). The complementary Lorenz curve is a plot that represents the distribution of wealth in a society in terms of the fraction of the total wealth held by the richest fraction of the population. In contrast to the standard Lorenz curve, which shows the fraction of the total population holding a given fraction of the total wealth, the complementary Lorenz curve shows the fraction of the total wealth held by the top fraction of the population. The complementary Lorenz curve is often used in the study of income and wealth inequality, as it provides a different perspective on the distribution of wealth in a society. For example, it can be used to compare the wealth held by the top 1% of the population to the wealth held by the bottom 99% of the population. In a society with a perfectly equal distribution of wealth, the standard Lorenz curve and the complementary Lorenz curve would be the same and would be represented by the 45-degree line. In a society with a high degree of inequality, the complementary Lorenz curve would be steeper than the 45-degree line, indicating that a large fraction of the total wealth is held by a small fraction of the population.
Therefore, to a certain extent, the Lorenz curve can be used to measure inequality. While the Lorenz curves contain the complete information about the wealth distribution and, consequently, that about wealth inequality, it is not often convenient to deal with the full distribution function, particularly when a comparative analysis or ranking is required. This calls for some form of a summary statistics to be drawn from the Lorenz curves. For this reason, in 1912, the Italian statistician and sociologist Corrado Gini developed the Gini index [4]. The Gini index is a widely used metric to measure statistical dispersion in the distribution of wealth or income among residents of a nation. It is commonly used to assess levels of inequality in a society. To compute the Gini index, the area between the Lorenz curve and the line of equality is divided by the total area under the line of equality. This ratio provides a numerical value that represents the degree of inequality present in the distribution of wealth or income. In Figure 1, the Gini index is
g = S ( S + S ) .
Equivalently, the Gini index is twice the area between the Lorenz curve and the line of perfect equality,
g = 2 0 1 ( p L ( p ) ) d p = 1 2 0 1 L ( p ) d p .
The Gini index ranges from 0 to 1, with 0 representing perfect equality and 1 representing perfect inequality. The Gini index is used by economists to measure the income inequality of a nation and is sometimes used to measure the inequality of other variables, such as wealth distribution and health inequality. The significance of adopting alternative measures of inequality is commonly recognized because no single summary statistic can capture all characteristics of inequality displayed by the Lorenz curve and the Gini index. In particular, the Gini index is known to be a rather non-intuitive measure, i.e., quoting its value does not provide an immediate picture of the underlying inequality. Indeed, the problem is deeper than intuition. Multiple Lorenz curves can have the same Gini index value. This means that calculating Gini index does not uniquely define inequality.
One major factor contributing to economic inequality is socialization or the tendency for the rich to become richer. This can occur because those with wealth have more resources to invest and accumulate more wealth, while those without wealth may struggle to get ahead. As a result, the gap between the rich and the poor can grow over time. The industrial revolution transformed economic interactions. Shops sold things produced more efficiently by factories. Economic socialization followed the industrial revolution. The automobile and telephone made travel and communication easier. Economic socialization changed during the Great Depression. As employment disappeared and wages diminished, people had to find other methods to make money. To help people cope with the economic crisis, unemployment insurance and welfare benefits were created. World War II reshaped economic socialization by the mid-20th century. The US standard of living rose as more commodities were available. Wealth spurred consumerism and the service industry. Socialization during this time gave more people the chance to rise economically and partake in the prosperity. The wealthy had more resources, jobs, and education than the poor. The global Gini (g) index value increased during this period, and in 1980–1990, this value remained at g 0.65 (see [5]), i.e., the top 35% of the population earned almost 65% of the overall income. US tax policy favored the wealthiest, allowing them to pay lower rates than the middle and lower classes, increasing wealth disparity. Economic policies, such as taxation, subsidies, and other economic interventions, began to impact residents’ economic behavior at this time. In conclusion, economic inequality is a complex issue with many factors contributing to its persistence and growth. While tools such as the Lorenz curve and Gini index help to understand and quantify inequality, addressing the root causes of this issue will require a more comprehensive approach.
In recent years, there has been a growing concern about the increasing concentration of wealth in the hands of a small percentage of the population. In 2011, the ‘Occupy Wall Street’protest [6] was a response to this trend, with participants calling for greater economic equality and a fairer distribution of wealth. The movement consisted of a campaign of civil resistance, with protesters occupying public spaces in cities across the country to demand changes to the economic and political system. The movement began on 17 September 2011, when a group of protesters entered and began an occupation of Zuccotti Park in New York City. The protests quickly spread to other cities in the United States and around the world, inspiring similar protests in countries such as the United Kingdom, Spain, and Greece. The movement has been largely credited with helping to spark the global Occupy movement, which has seen occupations in many countries. The movement also served as a catalyst for a wide range of progressive issues, including income inequality, corporate greed, and the power of the financial sector.
The protesters argued that unless something were done to address economic inequality, 99% of the wealth would soon be possessed by just 1% of the population. The most famous slogan of the Occupy Wall Street movement was “We are the 99%”. This slogan was used to highlight the inequality that exists between the wealthiest 1% of the population and the other 99%. The slogan was meant to emphasize that the economic system is broken and that the wealthiest 1% are exploiting the resources of the other 99%. The slogan has since been adopted by various movements around the world and is still in use today.
Now, the question is: is the disparity really this great? In reality, is it possible to accurately determine what amount of the population possesses exactly what amount of wealth? To find the answers of these queries, in 2014, a new social inequality measure was introduced, name k-index or Kolkata index [7,8,9], providing the k fraction of wealth, citations, or vote shares possessed, attracted, or obtained by the richest or most successful ( 1 k ) fraction of people, papers, or election candidates, respectively (see also [10] and references therein). In Figure 1, the k point is the k index. Mathematically, we can say that the k index for any income distribution is defined by the solution to the following equation:
k + L ( k ) = 1 .
Extensive data analysis implies that a k value of more than the Pareto value (0.80) across competitive economies (where welfare measures are withdrawn), citation shares in top-ranked universities or among successful individual scientists, or vote shares in vibrant democratic elections has perhaps often continued for ages beyond our notice. Indeed, observations suggest that the inequality measure (k) can be as high as 0.86, which is more than the Pareto value (k = 0.80) but less than many apprehended limits or conjectures, such as that in the ‘Occupy Wall Street’ slogan (k = 0.99). In short, data analysis suggests that almost 14% of people, papers, election candidates, or even wars possess, attract, capture, or cause about 86% of the wealth, citations, votes, or deaths, respectively. For example study shows that about 12% of the 2386 publications (books, documents, letters, etc.) by Karl Marx, as collected in his Google Scholar page maintained by the British National Library, accounts for about 88% of the total 424,810 citations collected on that page as of today.
Furthermore, in 2016, Watkins et. al. wrote a review paper [11] on the evolution of social systems to a self-organized critical state. Social systems evolve in complex and unpredictable ways. However, many theorists have suggested that social systems tend to move towards a state of self-organizing criticality, or an SOC state (see [12,13]). This is a state in which the system is balanced between order and chaos, allowing for a dynamic and adaptive response to changing environmental and social conditions. Such self-organization can result in emergent properties, such as the emergence of group norms, the development of new practices or technologies, and the emergence of social networks that can facilitate collective action. Ultimately, the exact form that a social system takes is dependent upon a variety of factors, including the available resources and the environmental and social conditions. Examples of an SOC include earthquakes, forest fires, and the stock market.
Therefore, in his book published in 2017 [14], Piketty highlights the fact that the wealth of the top 10% of the population is continuously expanding at an alarming rate. He argues that there is a tendency for the rate of return on capital to be higher than the rate of economic growth, which leads to the concentration of wealth among a small number of individuals. He suggests that this concentration of wealth poses a threat to democratic societies, as it can lead to social and political instability.
In 2020, extensive data analysis [15] showed that as competition increases in various social institutions, including markets, universities, and elections, the values of the two general inequality indices (the Gini (g) and Kolkata (k) indices), approach one another. It was further demonstrated that under conditions of unrestricted competition, these two indices equalize and stabilize at a certain value ( k m a x 0.87 g m a x ). We suggest interpretation of this coincidence of inequality indices as a broader application of Pareto’s 80/20 rule ( k = 0.80 ). Additionally, the synchronicity of the inequality indices noted here is strikingly comparable to that of self-organized critical (SOC) systems previously discussed. The findings outlined the following sections provide quantitative evidence for the long-held hypothesis that interacting socioeconomic systems can be observed within the context of SOC systems.
Few social inequality indicators have been explored in relation to the subcritical dynamical properties (measured in terms of the avalanche size distributions) of some SOC models as their respective stationary critical states approach. A recent study [16] (2022) showed that different models of self-organized criticality (SOC), including the Bak–Tang–Wiesenfeld (BTW) sand pile, the Manna sand pile, the quenched Edwards–Wilkinson interface (EW) model, and the fiber bundle interface (FBM), all display a similar precursor to SOC state values of inequality measures, such as the Gini and Kolkata indices. These results suggest that SOC systems share a high degree of commonality when it comes to indicators of inequality. Additionally, comparing these findings to similar results from socioeconomic systems with unrestricted competitions, it appears that inequality may emerge due to proximity to SOC states. Specifically, the k index for SOC models appears to be around ≃0.86. These observations provide further evidence for the universality of inequality measures across various physical and socioeconomic systems. We parallelly numerically demonstrated that the cluster or avalanche size distributions in the various SOC models of self-tuned physical systems (also argued to model all social systems; see, e.g., [11]) do reach a similar k value as the respective SOC points are approached, indicating that as one approaches the SOC point, about 86% of the avalanche mass is carried away by 14% of the avalanches. This is similar to the inequalities that we observed both mathematically and empirically for different socioeconomic systems.
Figure 2 shows a schematic representation of this timeline since 1896 at a glance, as discussed above.

3. Calculating the Inequality Indices

In the last section, we defined the Lorenz function and the two inequality measures that summarize the inequality statistics of a society (Gini and Kolkata indices). In this section, we proceed to calculate these quantities and attempt to find analytical relations between them.

3.1. Properties of the Lorenz Curve

With the definition of Lorenz curve, as mentioned before, we can enumerate several of the properties that such a curve must follow:
  • The Lorenz curve range from { F ( p ) , L ( p ) } { 0 , 0 } to { 1 , 1 } .
    Proof. 
    Equation (3) shows see that at p = 0 , F ( 0 ) = 0 and L ( 0 ) = 0 . Similarly, for p = 1 , we have F ( 1 ) = 1 and L ( 1 ) = 1 . Hence, as p [ 0 , 1 ] , the Lorenz curve always ranges from { F ( p ) , L ( p ) } { 0 , 0 } to { 1 , 1 } . □
  • The Lorenz curve is a concave and monotonically increasing function of wealth.
    Proof. 
    d L ( p ) d p = f p μ
    and,
    d F ( p ) d p = f N ,
    so the slope of the Lorenz curve is given by,
    d L ( p ) d F ( p ) = N μ p
    As p ( 0 , 1 ) and is always increasing, Equation (6) shows that the slope of the Lorenz curve is always increasing. Hence, the Lorenz curve is a concave and monotonically increasing function of wealth. □
  • The Lorenz curve for a society in which each person possesses an equal amount of wealth is a diagonally sloping line.
    Proof. 
    If each person in a society possesses an equal amount of wealth, wealth distribution follows a Dirac delta function as,
    f ( y ) = δ ( y y 0 )
    Now,
    F ( p ) = 1 N 0 p f ( y ) d y = 1 N 0 p δ ( y y 0 ) d y .
    Therefore,
    F ( p ) = 0 if p < y 0 1 if p y 0
    Again,
    L ( p ) = 0 if p < y 0 1 if p y 0
    Hence, we can see that L p = F ( p ) i.e., the Lorenz curve is a diagonal line for this particular case. □
  • The upper limit of the Lorenz curve is bounded by the equality line.
    Proof. 
    According to the second derivative of Equation (6),
    d 2 L d F 2 = N μ d F d y 1 = N μ N f 0
    Therefore, one can conclude from the above exercise that the Lorentz curve can never exceed the diagonal line. Moreover, L = F = 0 when p = 0 and L = F = 1 , as p . In addition, the concave topology of the Lorentz curve indicates that it is bounded by the diagonal line (also known as the egalitarian line). □

3.2. Exemplary Calculations of the Lorenz Curve

Here, we show the calculation of the Lorenz curve for some simple wealth distribution functions, albeit continuous and discrete.
  • Uniform wealth distribution: Let us examine a society in which the distribution of income is uniform over a finite range of values within the interval [ a , b ] , where 0 < a < b < 1 . The corresponding probability density function is given by f u ( x ) = 1 ( b a ) , and the cumulative distribution function is F u ( x ) = ( x a ) / ( b a ) for all values of x within [ a , b ] . Applying Equation (3), we obtain the following Lorenz curve for this distribution:
    L u ( p ) = 1 μ u 0 p { a + ( b a ) q } d q = p 1 ( b a ) ( a + b ) ( 1 p ) ,
    The distribution has a mean of μ u = ( a + b ) 2 , and F u 1 ( q ) = a + ( b a ) q . It is worth noting that if a = 0 , the Lorenz curve simplifies to L u ¯ ( p ) = p 2 .
  • Exponential wealth distribution: Let us consider an exponential income distribution characterized by the probability density function f E ( x ) = λ e λ x , where λ > 0 , and the cumulative distribution function F E ( x ) = 1 e λ x for all x 0 . The mean of this distribution is given by μ E = 1 λ , and F E 1 ( q ) = ( 1 λ ) ln ( 1 q ) . The Lorenz curve for this distribution is therefore given by:
    L E ( p ) = 0 p ln ( 1 q ) d q = p ( 1 p ) ln 1 1 p .
  • Pareto wealth distribution: Let us now consider a society with a Pareto-like income distribution. The probability density function for this distribution is given by f P , α ( x ) = α ( m ) α ( x ) α + 1 , and the cumulative distribution function is F P , α ( x ) = 1 ( m x ) α , where m > 0 is the minimum income, α > 0 , and the probability density and cumulative distribution functions are defined for all x m . The mean of this distribution is μ P = α m ( α 1 ) , and F P , α 1 ( q ) = m ( 1 q ) ( 1 α ) , which gives the Lorenz curve as follows:
    L P , α ( p ) = ( α 1 ) α 0 p ( 1 q ) ( 1 α ) d q = 1 ( 1 p ) 1 1 α .
  • Discrete wealth distribution: To obtain the Lorenz function for a discrete income distribution, consider an economy comprising G groups of people, where each group (g) comprises n g individuals with the same income ( x g ) such that 0 x 1 x 2 x G . The total population of the economy is N, and the total income is M, leading to a mean income of μ g = M / N . The income distribution is a discrete random variable (X) with a probability mass function of f G ( x g ) = n g / N for all g ranging from 1 to G and a distribution function ( F G ( x ) ) defined by 0 if x [ 0 , x 1 ) , f G ( x ) = n g / N if x [ x g , x g + 1 ) for any given g ranging from 1 to ( G 1 ) , and 1 if x x G . We define N ( g ) and M ( g ) as the cumulative proportion of the population and cumulative proportion of the total income, respectively, for each group (g). For any given g ranging from 1 to G and any q g ( N ( g 1 ) , N ( g ) ] , it can be verified that F G 1 ( q g ) = x g . Using the Lorenz function formula, we can calculate the Lorenz function ( L F G ( p g ) ) for any given g ranging from 1 to G and any p g ( N ( g 1 ) , N ( g ) ] as,
    L F G ( p g ) = M ( g 1 ) + ( p g N ( g 1 ) ) N x g M .
    We make two observations in this context. The first observation states that the Lorenz function is piecewise linear, which means that it is composed of several line segments. The kink points represent the points where the direction of the Lorenz curve changes; they occur at the boundaries of each income group. At these points, there is a jump in the cumulative share of income that is distributed to each group, which causes a change in the slope of the Lorenz curve. The second observation is that if there is only one income group in the economy, then the Lorenz curve is a straight line passing through the origin, with a slope of 1. This means that the distribution of income is perfectly equal, and each individual in the economy has the same income. In this case, the Lorenz curve coincides with the diagonal of the unit square, which represents the line L ( p ) = p .

3.3. Properties of the Gini Index

The above definition also implies several properties that are important to understand when interpreting and using this index. Some of the key properties of the Gini index are:
  • Range: The Gini index ranges from 0 to 1, with 0 indicating complete equality (i.e., everyone has the same income or wealth) and 1 indicating complete inequality (i.e., one person has all the income or wealth);
  • Normalization: The Gini index is normalized, meaning that it can be used to compare inequality across different populations or over time. This allows for meaningful comparisons even when the populations or time periods have different sizes or levels of income;
  • Sensitivity to changes in the distribution: The Gini index is sensitive to changes in the distribution of income or wealth, meaning that even small changes in the distribution can result in large changes in the Gini index. This property makes the Gini index a useful tool for measuring the impact of policies or events that affect the distribution of income or wealth;
  • Unimodality: The Gini index is unimodal, meaning that it has a single peak. This property allows for the ranking of populations or time periods based on their level of inequality;
  • Invariance to scale: The Gini index is invariant to scale, meaning that it is not affected by changes in the units of measurement (e.g., dollars, euros, etc.). This allows for meaningful comparisons of inequality across populations or time periods using different currencies.

3.4. Exemplary Calculations of the Gini Index

Here, we show calculations of the Gini index for some simple continuous and discrete wealth distributions.
  • Uniform wealth distribution: For a uniform distribution on a compact interval [ a , b ] , following 0 a < b < leads to the following Gini index,
    g u = 2 0 1 q q 1 ( b a ) ( a + b ) ( 1 q ) d q = ( b a ) 3 ( a + b ) .
  • Exponential wealth distribution: An exponential distribution of the form F E ( x ) = 1 e λ x for any x 0 and λ > 0 leads to the following Gini index,
    g E = 2 0 1 q L ( q ) d q = 2 0 1 ( 1 q ) ln 1 1 q d q = 1 2 .
  • Pareto wealth distribution: A Pareto distribution of the form F P , α ( x ) = 1 ( m / x ) α with m > 0 as the minimum income and α > 1 results in a Gini index of the following form,
    g P , α = 2 0 1 q 1 ( 1 q ) 1 1 α d q = 1 2 α 1 .
    As we graph the Gini index for various values of α , where α is greater than 1, we observe that as α increases, the Gini index decreases. Additionally, as α approaches 1, the Gini index tends towards 1. Furthermore, if we set α ^ to be equal to ln 5 ln 4 , then the Gini index for g P , α is approximately 0.7565.
  • Discrete wealth distribution: Consider the discrete random variable F G discussed previously for which the Lorenz function is given by Equation (12). Accordingly, we have the following explicit form of the Gini index,
    g F G = g = 1 G t = 1 G n t n g | x t x g | 2 N M .
    Note that if n g = 1 for all g { 1 , , G } so that G = N and M = g = 1 N x g , then it follows from Equation (16) that,
    g F G = g = 1 G t = 1 G | x t x g | 2 N g = 1 N x g .

3.5. Properties of the k-Index

With the above definition, the k index has several characteristics, which are listed below:
  • The k index is a unique fixed point of the complementary Lorenz function.
    Proof. 
    We can rewrite Equation (5) as,
    k = 1 L ( k ) = L ^ ( k ) .
    Hence, the k index is a fixed point of the complementary Lorenz function. Since the complementary Lorenz function maps [0, 1] to [0, 1] and is continuous (as shown in Figure 1), it has a fixed point according to Brouwer’s fixed-point theorem. Furthermore, since L ^ ( p ) is non-increasing, the fixed point has to be unique. □
  • For any distribution, k [ 1 / 2 , 1 ] , and the normalized k index ( K = 1 2 k ) lies in the interval [ 0 , 1 ] .
    Proof. 
    Observe that if L ( p ) = p , then, according to Equation (18), k = 1 / 2 , and for any other income distribution, 1 / 2 < k < 1 . Also note that while the Lorenz curve typically has only two trivial fixed points, that is, L ( 0 ) = 0 and L ( 1 ) = 1 , the complementary Lorenz function ( L ^ ( p ) ) has a unique non-trivial fixed point (k). Now, the normalized k index is given by K = 1 2 k , so if k [ 1 / 2 , 1 ] , then K [ 0 , 1 ] . □
  • The k index as a generalization of the Pareto Principle.
    Proof. 
    The k index can be thought of as a generalization of the Pareto’s 80/20 rule. Note that L ( k ) = 1 k ; hence, the top 100 ( 1 k ) % of the population has 100 ( 1 ( 1 k ) ) = 100 k % of the income. Hence, the ‘Pareto ratio’ for the k index is k / ( 1 k ) . Note that this proportion is derived internally from the distribution of income, and typically, there is no expectation that it will align with the Pareto Principle. □
  • Interpreting the k index in terms of rich–poor disparity.
    Proof. 
    Let us split society into two groups: the ‘poorest’ group, consisting of a fraction (p) of the population, and the ‘rich’ group, consisting of a fraction ( 1 p ) of the population. Using the Lorenz curve (L(p)), we can determine the distance between the “boundary person” and the poorest person on one hand and the distance between the “boundary person” and the richest person on the other hand. These distances can be calculated using the following equations: p 2 + L ( p ) 2 and ( 1 p ) 2 + ( 1 L ( p ) ) 2 , respectively. The k index is a way of dividing society into two groups such that the boundary person is equidistant from the poorest and richest persons. The disparity function value at the k index is given by D ( k ) = k 1 / 2 . This function measures the gap between the proportion (k) of the poor from the 50/50 population split. If society is not completely equal, then k > 1 / 2 , making it a useful tool to highlight the rich–poor disparity. In this case, k defines the income proportion of the top ( 1 k ) proportion of the rich population. □
  • The k index as a solution to optimization problems.
    Proof. 
    The k index is the unique solution to the following surplus maximization problem:
    k = a r g m a x P [ 0 , 1 ] 0 P ( L ^ ( t ) t ) d t .
    The value of k is such that it maximizes the area between the complementary Lorenz function and the income distribution line linked with an egalitarian distribution for the lower-income population. Equation (19) is a consequence of the fact that L ^ ( p ) p for all p [ 0 , k ] and L ^ ( p ) p for all p ( k , 1 ] . Similarly, the k index is the only solution to the surplus minimization problem (which is the dual of the problem in (19)):
    k = a r g m i n P [ 0 , 1 ] P 1 ( 1 t ) L ( t ) d t .
    Therefore, ( 1 k ) is the fraction of the higher-income population for which the area between the income distribution line associated with the egalitarian distribution and the Lorenz function is minimized. □
  • To reduce inequality between groups, the k index is a better indicator.
    Proof. 
    The ordering of Lorenz curves based on the k index is not the same as the ordering based on the Gini index. While the Gini index is influenced by transfers only within the poor or rich population, the ranking based on the k index is influenced only by transfers between the two groups. This implies that if the objective is to reduce inequality between the groups, then the k-index is a more appropriate measure to use. □

3.6. Exemplary Calculations of the k Index

Here, we show calculations of some simple continuous and discrete wealth distribution functions. The majority of this subsection is adopted from [9].
  • Uniform wealth distribution: Consider a case in which the uniform distribution (F) is defined for [ a , b ] , where 0 a < b < . Then, the k index is given by,
    k u = ( 3 a + b ) + 5 a 2 + 6 a b + 5 b 2 2 ( b a ) ,
    and the normalized k index is given by,
    K u = 2 ( a + b ) + 5 a 2 + 6 a b + 5 b 2 ( b a ) .
  • Exponential wealth distribution: For the exponential distribution ( F E ), the complementary Lorenz function is given by L ^ E ( p ) = ( 1 p ) 1 + ln 1 1 p . One can show that the k-index is k E 0.6822 ; hence, the normalized k index is K E 0.3644 .
  • Pareto wealth distribution: For the Pareto distribution ( F P , α ), the complementary Lorenz function is given by L P , α ( p ) = ( 1 p ) 1 1 α . The k index is therefore a solution to the following equation,
    ( 1 k P , α ) 1 1 α = k P , α .
    It is difficult to provide a general solution to this equation. However, we have an interesting observation in this context. If α = ln 5 ln 4 1.16 , then the k index is k P , α = 0.8 , corresponding to the Pareto principle or the 80/20 rule. Also note that the normalized k index is K P , α = 0.6 .
  • Discrete wealth distribution: Consider any discrete random variable with the distribution function ( F G ) discussed above for which the Lorenz function is given by Equation (12). To obtain the explicit form of the k index, one can first apply a simple algorithm to identify the interval of the form [ N ( g 1 ) , N ( g ) ) defined for g { 1 , , G } in which the k index can lie.
    Since N ( G ) = M ( G ) = 1 , if we have N ( G 1 ) + M ( G 1 ) < 1 in some step, then in the next step, this algorithm has to end, since N ( G ) + M ( G ) = 2 > 1 .
    Suppose that for any discrete random variable with the distribution function ( F G ) discussed earlier, Algorithm-1 identifies g * { 1 , , G } such that N ( g * ) + M ( g * ) 1 . If N ( g * ) + M ( g * ) = 1 , then k F G = N ( g * ) , and if N ( g * ) + M ( g * ) > 1 , k F G is the solution to the following equation:
    k F G + M ( g * 1 ) + k F G N ( g * 1 ) N x g * M = 1 .
    Thus, to derive the k index of any discrete random variable with distribution function F G , we first my identify the group g * { 1 , , G } such that k F G ( N ( g * 1 ) , N ( g * ) ] (using Algorithm 1); then, using g * , we obtain the following value of k F G :
    k F G = N ( g * ) if   N ( g * ) + M ( g * ) = 1 , μ G + N ( g * ) x g * M ( g * ) μ G + x g * if   N ( g * ) + M ( g * ) > 1 .
Algorithm 1:
Step 1:Consider the smallest g 1 { 1 , , G } such that N ( g 1 ) 1 / 2 and consider the sum of N ( g 1 ) + M ( g 1 ) . If N ( g 1 ) + M ( g 1 ) 1 , then stop, and k F G ( N g 1 1 , N ( g 1 ) ] ; in particular, k F = N ( g 1 ) if and only if N ( g 1 ) + M ( g 1 ) = 1 . Instead, if N ( g 1 ) + M ( g 1 ) < 1 , then go to Step 2, consider the group g 1 + 1 , and repeat the process.
Step t:Having reached Step t means that in Step ( t 1 ) , we had N ( g 1 + t 1 ) + M ( g 1 + t 1 ) < 1 . Therefore, consider the sum of N ( g 1 + t ) + M ( g 1 + t ) . If N ( g 1 + t ) + M ( g 1 + t ) 1 , then, stop; k F G [ N ( g 1 + t 1 ) , N ( g 1 + t ) ) , and, in particular, k F = N ( g 1 + t ) if and only if N ( g 1 + t ) + M ( g 1 + t ) = 1 . If N ( g 1 + t ) + M ( g 1 + t ) < 1 , then proceed to Step ( t + 1 ) .

4. Analytical Studies on the Emerging Coincidence of the Gini and Kolkata Indices

4.1. A Landau-like Phenomenological Expansion of the Lorenz Function

Before going into specific forms of the Lorenz curve and their corresponding inequality indices, we will first outline an attempt to achieve a Landau-like phenomenological expansion of the Lorenz function that obeys the abovementioned properties (see, e.g., [17]). In particular, as a minimal non-trivial expansion, we could write
L ( p ) = A p + B p 2 ,
where A > 0 , B > 0 , and A + B = 1 . It then follows that the calculation of g = 1 2 0 1 L ( p ) d p and k = 1 L ( k ) gives
k = ( 3 g 2 ) ± ( 2 3 g ) 2 + 12 g 6 g ,
which, in the limit of g 0 , gives k = 1 / 2 + 3 g / 8 , which implies that if a situation arises in which k = g , then k = g = 0.8 , which is the Pareto value. In later sections, we will discuss the actual situations and data in models under conditions of unrestricted competition.

4.2. Some Typical Power Law Forms of Lorenz Functions

An interesting observation is made when the Lorenz curve is parameterized through a power law curve in p and when the respective Lorenz curve’s Gini index and k index are plotted. When the power law index is increased, we observe that the values of the Gini and k indices converge towards each other and meet at a point around 0.87 , which is not equal to 1.
Here, we consider a set of Lorenz functions with the functional form L ( p ) = p n , where p ranges within the interval [ 0 , 1 ] , and n is a positive integer greater than or equal to one. For this family of functions, we can derive the corresponding Gini index g n , which is equal to ( n 1 ) / ( n + 1 ) . To investigate this relationship further, we tabulated the Lorenz functions ( L ( p ) ) for n values ranging from 1 to 20 and computed their associated Gini index ( g n ) and k index ( k n ). The results of our calculations are presented in Table 1.
Table 1 shows that if n = 13 , then g 13 0.857 < k 13 0.860 , and if n = 14 , then g 14 0.867 > k 14 0.866 , implying that coincidence takes place for n ( 13 , 14 ) (see in Figure 3). The coincidence between the Gini index and the k index occurs at n * ( n 1 = 13.82986 , n 2 = 13.82987 ) ; hence,
g n * = k n * g n 1 = 0.8651369 , g n 2 = 0.8651370 6 7 , 13 15 .
The above study shows that there is no positive integer (n) for which the g and k index values coincide, and if n represents a positive real number, then there exists a value of n = n * for which these index values coincide.

4.3. Some Generic Forms of Lorenz Functions

Therefore, with the above exercise, we observe a specific parameterized Lorenz function for which the Gini and k indices become equal. For more generic Lorenz functions, this equality of the Gini and k indices is exemplified in Table 2 (also see Figure 4).
In this review article, we examine the properties of finite polynomial Lorenz functions through the seven cases presented in Table 2. The first case (1) has already been addressed, where a 1 = = a n 1 = 0 and a n = 1 . In case (2), a 1 = = a n = 1 / n , which results in a coincidence value of approximately 0.869 for some value between 65 and 66. This coincidence value is higher than that obtained in case (1). In case (3), a 1 = ( n 1 ) / n and a 2 = = a n = 1 / [ n ( n 1 ) ] . Since the weight a 1 is relatively high, k is close to 1/2; therefore, there is no coincidence between the Gini index and the k index. In case (4), a n = ( n 1 ) / n and a 1 = = a n 1 = 1 / [ n ( n 1 ) ] , resulting in a coincidence value of approximately 0.874 for some value between 17 and 18, which is higher than that in case (2). In case (5), a m = [ 6 m ( n + 1 m ) ] / [ n ( n + 1 ) ( n + 2 ) ] for all m = 1 , 2 , , n , and the coincidence value is 0.874 for some number between 40 and 41, which is not an improvement compared to case (4). In case (6), a m = [ 2 ( m + 1 ) ] / [ n ( n + 3 ) ] for all m = 1 , 2 , , n , and the coincidence value is 0.877 for some value between 29 and 30. Finally, in case (7), the maximum coincidence value of 0.881 is achieved for some value between 77 and 78, where a m = m = 1 n ln ( n + 1 m ) / r = 1 n ln ( n + 1 r ) for all m = 1 , 2 , , n . Overall, our analysis shows that the coincidence value is less than 8/9 (approximately 0.88) for all seven cases discussed in Table 2.
Coincidentally with the data from different scenarios, the inequality in those scenarios reaches a maximum point around 0.87 before dropping.

5. Real-World Data Indicating the Convergence of the Gini and Kolkata Indices in Various Socioeconomic Contexts

In this section, we investigate the emergence of pervasive inequality as an observable example of emergent properties in different socioeconomic systems. These systems exhibit universal characteristics of the the Gini (g) and Kolkata (k) inequality indices, as shown by real data collected from diverse social and economic systems. Our data collection and analysis were completed before the end of 2021. It is evident that these systems show emergent properties when their dynamics are not externally fine-tuned. In the current situation, this leads to an environment of unrestricted competition. To illustrate this phenomenon, we consider a range of socioeconomic systems, including income, income tax data, box office earnings from Hollywood (US) and Bollywood (India), daily Bitcoin price fluctuations, election candidates (vote shares), universities (excellence/citation sharing), publications by authors (citation shares), wars or social conflicts (human death shares), and sports (Olympic medal share), among others. These systems demonstrate the emergence of significant and widespread inequality indices, which we explore in detail in this section.

5.1. Socioeconomic Disparities: An Analysis of Income, Income Tax, and Box Office Earnings Data

In a report by the United Nations Development Program [18], it was found that the global distribution of income is highly unequal, with the top 20 % of the world’s population receiving 82.7 % of the world’s income. Despite concerns that the top 1 % of wealthy individuals possess 99 % of the world’s wealth (as in ‘Occupy Wall Street’ protests), this observation suggests that 82 % of the world’s wealth is actually owned by 18 % of the population. To further explore this issue, we analyzed data from the IRS (US) [19,20] on the cumulative income of the poorest (p) fraction of people over a period of 36 years (1983–2018) and calculated the Lorenz function ( L ( p ) ). We also computed the Gini (g) and Kolkata (k) indices for each year (see Figure 5).
We present a visual representation of the analysis conducted on the IRS income and tax data [19,20] for a period of 36 years (1983–2018) in Figure 6. The graph depicts a consistent increase in the inequality indices (g and k) over the years, which converge towards a value of 0.87.
Similarly, we extend our analysis to the yearly income generated by the film industry in Hollywood (USA [21]) and Bollywood (India [22]) for a period of 9 years (2011–2019). Table 3 and Figure 7 demonstrate the results of our analysis of the income data for these two film industries and show that both in Hollywood and in Bollywood, the box-office income inequality index (k) increases, on average, to 0.88 and 0.83 , respectively.
In this case, we show that almost 88 % of box office income share comes from only 12 % of Hollywood movies measured from 2011 to 2019. Similarly, almost 83 % of box office income share comes from only 17 % of Bollywood movies measured from 2011 to 2019.

5.2. Inequality in Bitcoin Value Fluctuations: A Data Analysis Study

Bitcoin is a decentralized cryptocurrency that operates on a ledger system without any central bank control. Its introduction in 2008 and adoption in 2009 have made it the first and largest cryptocurrency, with a market value surpassing USD 1.03 trillionas of November 2021, accounting for 2.9 % of the world’s total narrow money supply. While its decentralized nature has made it susceptible to market volatility, Bitcoin serves as an example of unrestricted competition in currency markets.
In this study, we analyze the fluctuation of Bitcoin’s value, which is measured in USD, by collecting daily price data from 1 January 2010 to 24 November 2021, using data obtained from [23]. To investigate its value fluctuation, we calculate the absolute value of the price changes in consecutive days and collect the fractional closing price changes, except for a few days up to a date t ( > t 0 ) . We then generate the Lorenz curve (refer to Figure 8) for the closing price data up to a time (t ( > > t 0 ) ) and proceed to estimate the g and k indices as discussed in Figure 9.
Figure 9 demonstrates that g and k exhibit a pattern of repeatedly approaching each other near the value of 0.87 .

5.3. Inequality Analysis of Vote Data for Election Contestants

In this subsection, we analyze the inequality of vote shares among the candidates in the Indian parliamentary elections held in 2014 and 2019. Table 4 demonstrates that there exists a high level of inequality in the distribution of vote shares. The Gini and Kolkata indices were found to be 0.83 and 0.86 for the 2014 election and 0.85 and 0.88 for the 2019 election, respectively. The values of these indices are similar to the value of approximately 0.87 observed in the case of unrestricted competition.

5.4. Inequality Analysis for Citation Data of Different Journals and Universities

Data obtained from the ISI Web of Science [26] show that the citations of the papers published from different leading universities or institutions and leading journals are also unequal; here, we take the average value from 1980 to 2010 (see Table 5, Table 6 and Table 7). Table 8 shows that the most successful 18–25% of papers published by different universities or institutes and journals received 82–76% of citations.
Here, we compare the growth pattern of income inequality in the IRS data (USA) with citation inequality for papers published by established universities (see Table 5 and Table 6) published in established journals (see Table 7) and by individual Nobel laureate scientists (see Table 9). The data for these comparisons were taken from other publications (Refs. [26,27,28]). Figure 10 displays the results, which show that the k and g inequality indices of all these categories drift linearly toward a universal value of k = g 0.87 under unrestricted competition. This suggests that approximately 87% of the wealth, citations, or votes are possessed, earned, or won by 13% of people, papers, or election candidates, respectively.

5.5. A Study of Inequality in Citations: An Analysis of Individual Authors and Award Recipients

In this section, we present two tables with statistical analysis of research papers and their citations for 20 distinguished scientists who have won Nobel Prizes in economics, physics, chemistry, and biology/physiology/medicine, as well as for several international prize winners in mathematics and physics. Table 9 shows the analysis for Nobel laureates, while Table 10 presents the results for the international prize winners. The data were collected from Google Scholar during the first week of January 2021, and the names of the scientists are reported in the same format as they appear on their respective Google Scholar pages.
Figure 11 and Figure 12 illustrate the inequality analysis for the 20 Nobel laureates and prize winners. These figures demonstrate an ideal example of the dynamics of wealth inequality without any external interventions or fine tuning. Furthermore, the Gini index (g) and the Kolkata index (k) approach each other at a value of approximately 0.87 in both cases. This indicates that the results are consistent and robust across groups of distinguished scientists.
In Figure 11 and Figure 12, we see that an average 15 % of the papers published by successful individual authors received about 85 % of his/her total citations. The Gini and k indices coincide around ≃0.87.

5.6. Similarity in the Behavior of the Gini and Kolkata Indices across Multiple Domains: A Universality Study

In this section, we aim to consolidate the findings on the Gini (g) and Kolkata (k) indices from previous subsections. We gathered estimates of g and k from various sources, including IRS (US) data [19,20] on household income and income tax spanning 1983–2018; the citation data from papers published by 40 international prize winners (Fields medalists, ASICTP Dirac medalists, Boltzmann medalists, and von Neumann awardees), as shown in Table 10; and the vote share data from the Indian parliamentary elections in 2014 and 2019, displayed in Table 4. We present the compiled results in Figure 13. The collective analysis of all these results reveals a universal trend of inequality growth across various social institutions, markets (income and wealth), academic institutions (citations), and elections (vote shares among the candidates). Furthermore, the analysis shows that the measures of inequality converge to k = g = 0.87 ± 0.02 .

5.7. Inequality Analysis for Manmade Conflicts and Natural Disasters

With respect to manmade conflicts such as war, battle, armed conflict, terrorism, murder, etc., an average of 85 % of human deaths are caused by 15 % of social conflicts, including war (see Table 11).
We also show that for natural disasters such as earthquakes, floods, tsunamis, etc., almost 95 % of human deaths are caused by 5 % of disasters (see Table 12).

5.8. Inequality Analysis in Computing Systems

The field of computer science has long recognized the adage that “ 20 % of the code contains 80 % of the errors”, as noted in [30]. Consequently, software developers have a vested interest in identifying and rectifying this critical 20 % of the codebase in order to enhance the quality of the software. In a related finding, researchers have also observed that approximately 80 % of the functionality of a given software program can typically be implemented in just 20 % of the total development time. Conversely, the remaining 20 % of the software’s features, which represent the most challenging and time-consuming aspects of the coding process, often require the remaining 80 % of the total development time. This factor is commonly taken into account when estimating the cost and timeline for software development using the constructive cost model (COCOMO) approach. Therefore, in computing, we see that 20 % of the code contains 80 % of the errors.

5.9. Inequality Analysis for Sports: Olympic Medal Share

In recent discussions, scholars have posited that the concept of inequality also applies to the field of sports, where a few top performers often dominate the majority of victories. This phenomenon is exemplified in the sport of baseball, where wins above replacement (WAR) is used as a composite metric to gauge a player’s overall importance to a team. Recent statistical analyses have revealed that a mere 15 % of baseball players accounted for 85 % of the total wins, while the remaining 85 % of players were responsible for generating only 15 % of the wins [31]. These findings provide compelling evidence for the existence of significant inequality within the sport, indicating that a small minority of players drive the majority of team successes. The inequality statistics (reflected in the Gini and Kolkta index values) for the country-wise inequalities in Olympic medal wins are shown in Table 13. Typically, 13 to 16 percent of countries win 84 to 87 percent of Olympic medals (gold, silver, or bronze) in the Summer Olympics (see Table 13; statistics for the last four Olympics from ref. [32]).

6. Growing Avalanche Size Inequalities in Sand Pile Models: Universality near the SOC Point

All the observations reported so far indicate that the k index reaches a critical value of k = g before the inequality falls again. Such an observation implies that the k = g point can be characterized as a critical point for society. Further observations of physical self-organized critical systems indicate the same kind of behavioral pattern [16]. As all of the above observations were made for unrestricted competitive scenarios in which only the winners obtain all the facilities, we can expect that in a real society, such a point exists and that inequality can never exceed that point because of government subsidies that aid the poor population.
According to the self-organized criticality (SOC) framework, the critical point is an attractor; we found that (see, for example, [16]) immediately preceding the SOC point, the avalanche size inequality attains a value approximately equal to 0.86 . This observation is consistent with our observations discussed in the previous sections.
Here, we present the results of a comparison of the k and g indices for two different sand pile models, namely the Bak–Tang–Wiesenseld (BTW) model and the Manna model [16]. Figure 14 illustrates the k versus g relationship for these models, with panel (a) showing the results for the BTW model and panel (b) showing the results for the Manna model. The plots depict a linear relationship between k and g for the initial part of the curve. For the BTW model, the slope of the linear fit is 0.3876, while for the Manna model, the slope is slightly lower, at 0.3815. The intersection of the plots with the line of equality ( k = g ) occurs at 0.8628 and 0.8556 for the BTW and Manna models, respectively. It is important to note that these results were obtained for a system size of L × L = 512 × 512 .
The relationship between the Kolkata index (k) and the Gini index (g) is also analyzed here for two other SOC models, namely the Edwards–Wilkinson (EW) model and the fiber bundle model (FBM) [16]. In Figure 15a, the k values are plotted against their corresponding g values for the EW model. A linear relationship is observed in the initial section of the plot, with a slope of 0.40. Similarly, in Figure 15b, the k versus g plot is shown for the centrally loaded fiber bundle model. In this case, the initial linear slope is measured to be 0.42. These results suggest that the k index and Gini index are linearly related in the early stages of both models, with slightly different slopes.
In the several SOC models considered here, we observe a remarkably consistent coincidence of the avalanche size inequality indices (g and k) at around g = k 0.86 immediately preceding the arrival of the SOC point. This is also consistent our previous observations in the inequality measures in various socioeconomic contexts. We consider this to be noteworthy.

7. Summary and Discussion

Disparities between social classes are constantly present (see, e.g., [33,34] for some recent discussions), which has been proposed that it is an emergent trait of complex socioeconomic systems [35] with many interacting parts. It may be mentioned at this point that allowing for a higher probability of exchange for the poor in conservative kinetic exchange models induces (see, e.g., [36,37,38]), in a novel, self-organized way, a minimum “poverty level”, thereby reducing inequality. Herein, we attempted to show that the extreme social inequality we find in society is the result of built-in self-organized critical dynamics. A self-organized criticality [11,39] framework has been proposed to explain the evolutionary behavior of diverse systems, such as wealth distributions, financial markets, cryptocurrencies, citation dynamics, etc. However, a major unresolved issue concerns the appropriate quantification of the observed disparities and their potential universality across systems. Pareto’s 80/20 law, which implies that 80% of the wealth ends up in the hands of 20% of the richest ( k = 0.80 ) members of society, has traditionally been used as a benchmark for measuring the degree of extreme social inequality. Moreover, we attempted to determine the amount of social and economic inequality in a number of very competitive systems without any outside interventions to stop halt inequality among the agents. This self-tuning feature of competitive dynamics in various social sectors suggests similar inequality behavior in the SOC system. Herein, we studied a significant number of socioeconomic systems through the framework of SOC architecture. This approach has been used to analyze a range of systems, including financial markets [40], citation evolution [39,41], cryptocurrencies [42], and political behavior [43], among others. Despite the diversity of these systems, our analysis reveals that the behavior of inequality indices, particularly the Gini (g) and Kolkata (k) indices, demonstrates near-universal characteristics across socioeconomic systems. Specifically, we observe that these indices tend to converge towards a value of approximately 0.87. This finding is particularly noteworthy, given that similar behavior has been observed in SOC models of physical systems [16,44]. A recent research publication [45] presented findings that highlight the high level of inequality in group size distribution. The authors reported a g index value of 0.90 for both the theoretical and empirical observations of group size distribution. We note the consistency of such a high value of the g index with our observations.
In Section 2, we presented a chronological account of the development of social inequality measures since 1896 and demonstrated their similarity to those in sand pile models immediately prior to their respective self-organized critical (SOC) points. In Section 3, we discussed the Lorenz curve, the Gini (g) index, and the Kolkata (k) index in detail and presented proofs of their properties, as well as exemplary calculations. In Section 4 of our review, we conducted an investigation into several analytical characteristics of the Lorenz function ( L ( p ) ). Our analysis, supported by Table 1 and Table 2, led us to the conclusion that for a wide range of plausible analytic forms of L ( p ) , the values of the Gini (g) and Kolkata (k) indices that correspond to coincidence points fall within the range of 4 / 5 (equivalent to 0.80 ) to 8 / 9 (equivalent to 0.88 ). This finding suggests that the values of g and k are tightly constrained and are influenced by the analytic properties of the Lorenz function ( L ( p ) ). In particular, in Section 5, we considered datasets accounting for factors such as earnings from different sectors, Bitcoin price fluctuations, citations of selected prize-winning scientists, and votes received by various candidates in elections. We conducted an analysis of income data from multiple sources. Specifically, we examined data from the Internal Revenue Service (IRS) in the United States, as well as income tax data, over a period of 36 years spanning 1983 to 2018. The IRS data were obtained from previous research [19,20]. Additionally, we investigated income data from the Hollywood movie industry in the United States, which was sourced from a prior study [21], and data from the Bollywood movie industry in India [22] for the time period ranging from 2011 to 2019. These data sources were analyzed in detail in Section 5.1 of our study. In Section 5.2, we showed data for Bitcoin price fluctuations [23], and in Section 5.3, we examined the vote share data pertaining to candidates who ran in parliamentary elections in India during in 2014 and 2019. We conducted an analysis of this data with the aim of exploring inequalities in the vote share among candidates [24,25]. In this research, we present data on the citations of published papers from several prominent universities and institutions, as well as leading journals, which can be found in Section 5.4 and are cited in [7]. We also conducted an analysis of citation data from Google Scholar for 20 selected individuals who have been awarded Nobel Prizes in the fields of economics, physics, chemistry, and biology/physiology/medicine, as well as individuals who have been awarded the Fields Medal (mathematics), the Boltzmann Medal (physics),the ASICTP Dirac Medal (physics), or the John von Neumann Award (social science) in various years. We focused on individuals who have their own Google Scholar pages with “verified email” addresses; the results of our analysis are presented in Section 5.5. We utilized these data sources to investigate sectoral inequality and computed the associated Gini (g) and Kolkata (k) indices. The results of our analysis are presented in Figure 5, Figure 7 and Figure 9, Figure 10, Figure 11 and Figure 12 and summarized in Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9 and Table 10. We also compiled these findings into a single figure, shown in Section 5.6 as Figure 13. Our analysis revealed a universal value of approximately 0.87 for the coinciding g and k indices, indicating an emerging trend of increasing disparities under conditions of competition. Moreover, in Section 5.7, we investigated a similar trend for manmade conflicts such as war, terrorism, etc., as well as for natural disasters such as earthquakes, tsunamis, etc. (see Table 11 and Table 12). In Section 5.8 and Section 5.9, we discussed the universality of the k index for computing systems and sports, respectively. The results of our empirical investigation reveal a consistent pattern in the dynamical behavior of the Kolkata index (k) and the Gini index (g) across various scenarios. Specifically, our findings indicate that these inequality measures converge towards a universal value of k = g = 0.87 ± 0.02 in situations in which competitions are not subject to any restrictions. This trend was observed consistently across all socioeconomic systems, highlighting the robustness and universality of this phenomenon. When we talk about dynamics, we are referring to the long-term changes and eventual saturation brought about in the aforementioned systems. We presented a graphical representation of the g and k indices for the daily price fluctuations of Bitcoin over the period of a decade from 2010 to 2021. As shown in Figure 9, our findings suggest that the g and k indices tend to stabilize at a value of approximately 0.87, which lends further support to our conclusions. This pattern suggests that without the intervention of a central bank, such as in the case of national currencies, the inequality indices for cryptocurrencies converge. Specifically, both g and k approach a value of 0.87 before decreasing. Our results indicate that the daily swings in the price of Bitcoin, on average, do not exceed this limiting value ( g = k 0.87 ). Table 13 (Section 5.9 on Inequality Analysis for sports: Olympic medal share) shows that the k index value typically ranges from 0.84 to 0.87, implying that 13 to 16 percent of countries win 84 to 87 percent of Olympic medals.
We investigated the behavior of inequality indices over time and found that they have not yet converged to the predicted attractor value of 0.87 that results from a self-organized critical (SOC) state. However, our analysis presented in Figure 6 in Section 5.1 reveals that both the Gini index (g) and the Kolkata index (k) have exhibited a steady increase over time. This trend is likely attributable to the gradual reduction in public welfare programs in the United States. Interestingly, our results demonstrate that the Pareto value of k = 0.80 has already been surpassed. It is possible to estimate that it will reach 0.87 if all of the aforementioned public assistance programs are eliminated, thereby allowing participants to enter a state of unrestricted competition. In Section 6, we discussed the SOC state of different physical systems (such as the BTW model and Manna model) and calculated their inequality indices (g and k) in terms of their growing avalanche sizes, showing the universality trend, i.e., g = k 0.86 (Figure 14 and Figure 15).
The social dynamics of competition take the index values of g = k 0.87 , indicating that roughly 87% of wealth, citations, votes, or Olympic medals are possessed, earned, or won by 13% of the population, papers, election candidates, or (Olympic participant) countries, respectively in cases of unrestricted competition in which no welfare support towards equality is available. This may be a quantitative and universal (across all social sectors) version of the 80/20 law ( k = 0.80 ) observed by Pareto more than a century ago. This property of the inequality indices is intrinsic to the SOC character of the underlying dynamics, and it has been demonstrated to be present in a wide variety of SOC models in physical science [16,44].
Previous studies [17,46] established that the Gini index (g) can be considered to represent the information entropy of social systems, while the Kolkata index (k) can be thought of as a representation of the inverse of the effective temperature of such systems. An increase in k corresponds to a decrease in the average wealth of a society in circulation, resulting in a decrease in temperature. In this study, we observed that the ratio of g / k , which is equivalent to free energy, displays an identical value at multiple points ( g = k 0.87 and g = k = 1 ). These findings suggest the existence of a first-order-like phase transition [28] at the point at which g = k 0.87 . This reinforces the idea that the relationship between the Gini and Kolkata indices can be analyzed from a thermodynamic perspective. This inequality growth is entropy-driven, as conjectured in the context of self-organized sand pile systems (see, e.g., [47,48]), similar to those explored herein.

Author Contributions

All authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data will be available on request to the corresponding author.

Acknowledgments

We are grateful to Sai Krishna Challagundla, Arnab Chatterjee, Nachiketa Chattopadhyay, Suhaas Reddy Guntaka, Bijin Joseph, Hanesh Koganti, Anvesh Reddy Kondapalli, Raju Maiti, Subhrangshu Sekhar Manna, Suresh Mutuswami, Dachepalli R. S. Ram, and Antika Sinha for collaborations at various stages of this study. B.K.C. is grateful to the Indian National Science Academy for the award of a Senior Scientist Research Grant. S.B. is thankful to the DST, Government of India, for financial support through an INSPIRE fellowship. SJB acknowledges using HPCC Surya computing facility in SRM University—AP.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Pareto, V. Cours D’´economie Politique. Reprinted as a Volume of Oeuvres Compl‘etes; Droz: Geneva, Switzerland, 1965; Volume 1896, Available online: https://www.britannica.com/biography/Vilfredo-Pareto (accessed on 1 April 2023).
  2. Lorenz, M. Methods of measuring the concentration of wealth. Publ. Am. Stat. Assoc. 1905, 9, 209–219. [Google Scholar] [CrossRef]
  3. Aaberge, R. Characterizations of Lorenz curves and income distributions. Soc. Choice Welf. 2000, 17, 639–653. [Google Scholar] [CrossRef]
  4. Gini, C.W. Variabilitá e Mutabilitá: Contributo allo Studio delle Distribuzioni e delle Relazioni Statistiche; Cristiano Cuppini: Bologna, Italy, 1912; Available online: https://en.wikipedia.org/wiki/Gini_coefficient#:~:text=The%20Gini%20coefficient%20measures%20the,reflects%20maximal%20inequality%20among%20values (accessed on 1 April 2023).
  5. Bourguignon, F. Globalization of Inequality; Princeton University Press: Princeton, NJ, USA, 2015. [Google Scholar]
  6. Available online: https://en.wikipedia.org/wiki/Occupy_Wall_Street (accessed on 1 April 2023).
  7. Ghosh, A.; Chattopadhyay, N.; Chakrabarti, B.K. Inequality in society, academic institutions and science journals: Gini and k-indices. Physica A 2014, 410, 30–34. [Google Scholar] [CrossRef]
  8. Banerjee, S.; Chakrabarti, B.K.; Mitra, M.; Mutuswami, S. On the Kolkata index as a measure of income inequality. Physica A 2020, 545, 123178. [Google Scholar] [CrossRef]
  9. Banerjee, S.; Chakrabarti, B.K.; Mitra, M.; Mutuswami, S. Inequality measures: The kolkata index in comparison with other measures. Front. Phys. 2020, 8, 562182. [Google Scholar] [CrossRef]
  10. Subramanian, S. Further tricks with the Lorenz curve. Econ. Bull. 2019, 39, 1677–1686. Available online: http://www.accessecon.com/Pubs/EB/2019/Volume39/EB-19-V39-I3-P158.pdf (accessed on 1 April 2023).
  11. Watkins, N.W.; Pruessner, G.; Chapman, S.C.; Crosby, N.B.; Jensen, H.J. 25 Years of Self-organized Criticality: Concepts and Controversies. Space Sci. Rev. 2016, 198, 3–44. [Google Scholar] [CrossRef]
  12. Bak, P.; Tang, C.; Wiesenfeld, K. Self-organized criticality: An explanation of 1/f noise. Phys. Rev. Lett. 1987, 59, 381. [Google Scholar] [CrossRef]
  13. Bak, P. How Nature Works: The Science of Self-Organized Criticality; Copernicus: New York, NY, USA, 1996. [Google Scholar]
  14. Piketty, T. Capital in Twenty First Century; Harvard University Press: Cambridge, MA, USA, 2017. [Google Scholar]
  15. Banerjee, S.; Biswas, S.; Chakrabarti, B.K.; Challagundla, S.K.; Ghosh, A.; Guntaka, S.R.; Koganti, H.; Kondapalli, A.R.; Maiti, R.; Mitra, M.; et al. Evolutionary dynamics of social inequality and coincidence of Gini and Kolkata indices under unrestricted competition. Int. J. Mod. Phys. 2023, 34, 2350048. [Google Scholar] [CrossRef]
  16. Manna, S.S.; Biswas, S.; Chakrabarti, B.K. Near universal values of social inequality indices in self-organized critical models. Phys. A Stat. Mech. Its Appl. 2022, 596, 12721. [Google Scholar] [CrossRef]
  17. Joseph, B.; Chakrabarti, B.K. Variation of Gini and Kolkata indices with saving propensity in the Kinetic Exchange model of wealth distribution: An analytical study. Phys. A Stat. Mech. Its Appl. 2022, 594, 127051. [Google Scholar] [CrossRef]
  18. United Nations Development Program. 1992 Human Development Report; Oxford University Press: New York, NY, USA, 1992. [Google Scholar]
  19. Available online: https://www.irs.gov/statistics/soi-tax-stats-individual-income-tax-returns-publication-1304-complete-report (accessed on 1 April 2023).
  20. Ludwig, D.; Yakovenko, Y.M. Physics-inspired analysis of the two-class income distribution in the usa in 1983–2018. Phil. Trans. R. Soc. A 2021, 380, 20210162. [Google Scholar] [CrossRef]
  21. Available online: https://www.boxofficemojo.com/year/2011/ (accessed on 1 April 2023).
  22. Available online: https://www.bollywoodhungama.com/box-office-collections/filterbycountry/IND/2011/ (accessed on 1 April 2023).
  23. Available online: https://in.investing.com/crypto/bitcoin/historical-data (accessed on 1 April 2023).
  24. Available online: https://eci.gov.in/files/file/2785-constituency-wise-detailed-result/ (accessed on 1 April 2023).
  25. Available online: https://eci.gov.in/files/file/10929-33constituency-wise-detailed-result/ (accessed on 1 April 2023).
  26. Chatterjee, A.; Ghosh, A.; Chakrabarti, B.K. Universality of Citation Distributions for Academic Institutions and Journals. PLoS ONE 2016, 11, E0146762. [Google Scholar] [CrossRef]
  27. Chatterjee, A.; Ghosh, A.; Chakrabarti, B.K. Socio-economic inequality: Relationship between Gini and Kolkata indices. Physica A 2017, 466, 583–595. [Google Scholar] [CrossRef]
  28. Ghosh, A.; Chakrabarti, B.K. Limiting value of the Kolkata index for social inequality and a possible social constant. Phys. A Stat. Mech. Its Appl. 2021, 573, 125944. [Google Scholar] [CrossRef]
  29. Sinha, A.; Chakrabarti, B.K. Inequality in death from social conflicts: A Gini & Kolkata indices-based study. Phys. A Stat. Mech. Its Appl. 2019, 527, 121185. [Google Scholar] [CrossRef]
  30. Pressman, R.S. Software Engineering: A Practitioner’s Approach, 7th ed.; McGraw-Hill: Boston, MA, USA, 2010; ISBN 978-0-07-337597-7. [Google Scholar]
  31. Zimmerman, J. Applying the Pareto Principle (80-20 Rule) to Baseball. 4 June 2010. Available online: https://www.beyondtheboxscore.com/2010/6/4/1501048/applying-the-parento-principle-80 (accessed on 12 April 2018).
  32. Available online: https://en.wikipedia.org/wiki/2008_Summer_Olympics_medal_table (accessed on 1 April 2023).
  33. Iglesias, J.R.; Cardoso, B.F.; Gonçalves, S. Inequality, a scourge of the XXI century. Commun. Nonlinear Sci. Numer. Simul. 2021, 95, 105646. [Google Scholar] [CrossRef]
  34. Boghosian, B.M. Is inequality inevitable? Sci. Am. 2019, 321, 70–77. [Google Scholar]
  35. Zhukov, D. How the theory of self-organized criticality explains punctuated equilibrium in social systems. Methodol. Innov. 2022, 15, 163. [Google Scholar] [CrossRef]
  36. Pianegonda, S.; Iglesias, J.R.; Abramson, G.; Vega, J.L. Wealth redistribution with conservative exchanges. Physica A 2003, 322, 667–675. [Google Scholar] [CrossRef]
  37. Iglesias, J.R.; Gonçalves, S.; Pianegonda, S.; Vega, J.L.; Abramson, G. Wealth redistribution in our small world. Physica A 2003, 327, 12–17. [Google Scholar] [CrossRef]
  38. Pianegonda, S.; Iglesias, J.R. Inequalities of wealth distribution in a conservative economy. Physica A 2004, 42, 193–199. [Google Scholar] [CrossRef]
  39. Leydesdorff, L.; Wagner, C.S.; Bornmann, L. Discontinuities in citation relations among journals: Self-organized criticality as a model of scientific revolutions and change. Scientometrics 2018, 116, 623. [Google Scholar] [CrossRef]
  40. Biondo, A.E.; Pluchino, A.; Rapisarda, A. Order book, financial markets, and self-organized criticality. Chaos Solitons Fractals 2016, 88, 196. [Google Scholar] [CrossRef]
  41. Biró, T.S.; Telcs, A.; Józsa, M.; Néda, Z. Gintropic scaling of scientometric indexes. Physica A 2023, 618, 128717. [Google Scholar] [CrossRef]
  42. Minati, G. Big data: From forecasting to mesoscopic understanding. Meta-profiling as complex systems. Systems 2019, 7, 8. [Google Scholar] [CrossRef]
  43. Brunk, G.G. Self-organized criticality: A new theory of political behaviour and some of its implications. Br. J. Pol. Sci. 2001, 31, 427. [Google Scholar] [CrossRef]
  44. Ghosh, A.; Biswas, S.; Chakrabarti, B.K. Success of social inequality measures in predicting critical or failure points of some model physical systems. Front. Phys. 2022, 10, 803. [Google Scholar] [CrossRef]
  45. Korbel, J.; Lindner, S.D.; Pham, T.M.; Hanel, R.; Thurner, S. Homophily-Based Social Group Formation in a Spin Glass Self-Assembly Framework. Phys. Rev. Lett. 2023, 130, 057401. [Google Scholar] [CrossRef]
  46. Biró, T.S.; Néda, Z. Gintropy: Gini Index Based Generalization of Entropy. Entropy 2020, 22, 879. [Google Scholar] [CrossRef]
  47. Main, I.G.; Naylor, M. Entropy production and self-organized (sub) criticality in earthquake dynamics. Phil. Trans. R. Soc. A 2010, 368, 131. [Google Scholar] [CrossRef] [PubMed]
  48. Lang, M.; Shkolnikov, M. Harmonic dynamics of the abelian sandpile. Proc. Natl. Acad. Sci. USA 2019, 116, 2821. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The Lorenz curve or function ( L ( p ) , red) shows the proportion of total wealth owned by a fraction (p) of people in ascending order of wealth. The black dotted line represents a scenario of perfect equality in which everyone possesses the same amount of wealth. The Gini index (g) is calculated as the area (S) between the Lorenz curve and the equality line (shaded region), normalized by the total area under the equality line ( S + S = 1 2 ). The complementary Lorenz function ( L ^ ( p ) 1 L ( p ) is) shown in green. The Kolkata index (k) is determined by the point at which the Lorenz curve intersects the diagonal line perpendicular to the equality line. The value of L ^ ( k ) = 1 L ( k ) is equal to k, which indicates that k is a fixed point of L ^ ( p ) and indicates the proportion of wealth owned by the top ( 1 k ) fraction of the population.
Figure 1. The Lorenz curve or function ( L ( p ) , red) shows the proportion of total wealth owned by a fraction (p) of people in ascending order of wealth. The black dotted line represents a scenario of perfect equality in which everyone possesses the same amount of wealth. The Gini index (g) is calculated as the area (S) between the Lorenz curve and the equality line (shaded region), normalized by the total area under the equality line ( S + S = 1 2 ). The complementary Lorenz function ( L ^ ( p ) 1 L ( p ) is) shown in green. The Kolkata index (k) is determined by the point at which the Lorenz curve intersects the diagonal line perpendicular to the equality line. The value of L ^ ( k ) = 1 L ( k ) is equal to k, which indicates that k is a fixed point of L ^ ( p ) and indicates the proportion of wealth owned by the top ( 1 k ) fraction of the population.
Entropy 25 00735 g001
Figure 2. Timeline of the evolution of social inequality measures since 1896 and their universal convergence to those for sand pile models prior to their respective self-organized critical (SOC) points. We start the timeline from 1896 with the work of Pareto [1] and subsequent developments in 1905 by Lorenz [2] and Gini [4] in 1912. Then we observe the consistency of the Gini index (g) for a decade-span 1980–1990 [5]. Subsequent protest happened in 2011 at the Wall Street for the advection of the majority portion of the entire wealth in the hands of very few people [6]. In 2014, Kolkata index (k) was introduced as another measure of inequality in the wealth distribution [7]. In 2016, Watkins and others proposed that all social systems evolve towards the respective SOC state [11]. Piketty (2017) pointed out forcefully about the continuous growth of the wealth of top 10% of the people [14]. In the year 2020, the work of Banerjee and others reported that the inequality of the social systems has a tendency to evolve at a point of g = k 0.87 [9,15]. In 2022, Manna and others showed numerically that many physical SOC systems show g = k 0.86 just preceding the SOC points in the respective systems [16]. In this review, the figures and tables are arranged with self-contained captions in an attempt to provide readers with an overview of our motivation and the main results presented the introductory and concluding sections (15 figures and 13 tables and their captions).
Figure 2. Timeline of the evolution of social inequality measures since 1896 and their universal convergence to those for sand pile models prior to their respective self-organized critical (SOC) points. We start the timeline from 1896 with the work of Pareto [1] and subsequent developments in 1905 by Lorenz [2] and Gini [4] in 1912. Then we observe the consistency of the Gini index (g) for a decade-span 1980–1990 [5]. Subsequent protest happened in 2011 at the Wall Street for the advection of the majority portion of the entire wealth in the hands of very few people [6]. In 2014, Kolkata index (k) was introduced as another measure of inequality in the wealth distribution [7]. In 2016, Watkins and others proposed that all social systems evolve towards the respective SOC state [11]. Piketty (2017) pointed out forcefully about the continuous growth of the wealth of top 10% of the people [14]. In the year 2020, the work of Banerjee and others reported that the inequality of the social systems has a tendency to evolve at a point of g = k 0.87 [9,15]. In 2022, Manna and others showed numerically that many physical SOC systems show g = k 0.86 just preceding the SOC points in the respective systems [16]. In this review, the figures and tables are arranged with self-contained captions in an attempt to provide readers with an overview of our motivation and the main results presented the introductory and concluding sections (15 figures and 13 tables and their captions).
Entropy 25 00735 g002
Figure 3. Graph of the Gini index (g) versus the k index (k), where the orange line represents the equality line ( g = k ). The black dots indicate the ( g , k ) values for the Lorenz function, L ( p ) = p n , and n ranges from 1 to 20. The inset shows the Lorenz curves for L ( p ) = p 2 (red curve), L ( p ) = p 13 (blue curve), and L ( p ) = p 14 (green dashed curve), with their corresponding k-index values ( k 1 0.618 , k 2 0.860 , and k 3 0.866 , respectively). This figure is adopted from Banerjee et al. (2022) [15].
Figure 3. Graph of the Gini index (g) versus the k index (k), where the orange line represents the equality line ( g = k ). The black dots indicate the ( g , k ) values for the Lorenz function, L ( p ) = p n , and n ranges from 1 to 20. The inset shows the Lorenz curves for L ( p ) = p 2 (red curve), L ( p ) = p 13 (blue curve), and L ( p ) = p 14 (green dashed curve), with their corresponding k-index values ( k 1 0.618 , k 2 0.860 , and k 3 0.866 , respectively). This figure is adopted from Banerjee et al. (2022) [15].
Entropy 25 00735 g003
Figure 4. The plot of the Gini index (g) versus the k index (k), where the orange line corresponds to the g = k line. The black dots represent ( g , k ) values for several simple Lorenz functions, as listed in Table 2. The black dots tend to converge towards the g = k line higher values of g. In the inset, two different Lorenz curves are shown for cases (4) and (7) from Table 2. The red curve represents the Lorenz curve for case (4) with a k-index value of k 1 0.682 , while the blue curve represents the Lorenz curve for case (7) with a k-index value of k 2 0.833 . These results provide insight into the relationship between the Gini index and the k index, as well as the behavior of these measures across different Lorenz curves (adopted from [15]).
Figure 4. The plot of the Gini index (g) versus the k index (k), where the orange line corresponds to the g = k line. The black dots represent ( g , k ) values for several simple Lorenz functions, as listed in Table 2. The black dots tend to converge towards the g = k line higher values of g. In the inset, two different Lorenz curves are shown for cases (4) and (7) from Table 2. The red curve represents the Lorenz curve for case (4) with a k-index value of k 1 0.682 , while the blue curve represents the Lorenz curve for case (7) with a k-index value of k 2 0.833 . These results provide insight into the relationship between the Gini index and the k index, as well as the behavior of these measures across different Lorenz curves (adopted from [15]).
Entropy 25 00735 g004
Figure 5. Plot of the Kolkata index (k) against the Gini index (g) for income and income tax data extracted from IRS (USA) data [19,20] from the years 1983 to 2018. The data were obtained from the corresponding Lorenz functions ( L ( p ) ) for each of these 36 years (adopted from [15]).
Figure 5. Plot of the Kolkata index (k) against the Gini index (g) for income and income tax data extracted from IRS (USA) data [19,20] from the years 1983 to 2018. The data were obtained from the corresponding Lorenz functions ( L ( p ) ) for each of these 36 years (adopted from [15]).
Entropy 25 00735 g005
Figure 6. Trend of the Gini (g) and Kolkata (k) indices over time (year) for the US economy using IRS data [19,20]. The graph clearly shows an increasing trend in the inequality measures over time, indicating a decline in public welfare and a shift towards an SOC state of unrestricted competition. The value of k in the tax data, which is argued to be a better indicator of the prevailing inequality status, surpasses the Pareto value of 0.80 and is predicted to reach 0.87, similar to other socioeconomic systems (e.g., movie income or citations) in which public welfare programs are completely absent (this figure is adopted from [15]).
Figure 6. Trend of the Gini (g) and Kolkata (k) indices over time (year) for the US economy using IRS data [19,20]. The graph clearly shows an increasing trend in the inequality measures over time, indicating a decline in public welfare and a shift towards an SOC state of unrestricted competition. The value of k in the tax data, which is argued to be a better indicator of the prevailing inequality status, surpasses the Pareto value of 0.80 and is predicted to reach 0.87, similar to other socioeconomic systems (e.g., movie income or citations) in which public welfare programs are completely absent (this figure is adopted from [15]).
Entropy 25 00735 g006
Figure 7. Scatter plot of the Kolkata index (k) versus the Gini index (g) for box office income obtained from Hollywood (USA, data source: [21]) and Bollywood (India, data source: [22]) over a period of 9 years from 2011 to 2019. The plot provides a comparative analysis of the inequality measures for these two major film industries (adopted from [15]).
Figure 7. Scatter plot of the Kolkata index (k) versus the Gini index (g) for box office income obtained from Hollywood (USA, data source: [21]) and Bollywood (India, data source: [22]) over a period of 9 years from 2011 to 2019. The plot provides a comparative analysis of the inequality measures for these two major film industries (adopted from [15]).
Entropy 25 00735 g007
Figure 8. The Lorenz function (curve) depicts the distribution of the difference in the closing price of Bitcoin for consecutive days (adopted from [15]).
Figure 8. The Lorenz function (curve) depicts the distribution of the difference in the closing price of Bitcoin for consecutive days (adopted from [15]).
Entropy 25 00735 g008
Figure 9. (Left): A graphical representation of the Kolkata index (k) plotted against the Gini index (g) for the statistical analysis of the daily Bitcoin price. (Right): temporal variation of the g and k indices. For comparison, a reference value of approximately 0.87 is provided in the figure (adopted from [15]).
Figure 9. (Left): A graphical representation of the Kolkata index (k) plotted against the Gini index (g) for the statistical analysis of the daily Bitcoin price. (Right): temporal variation of the g and k indices. For comparison, a reference value of approximately 0.87 is provided in the figure (adopted from [15]).
Entropy 25 00735 g009
Figure 10. Comparison of the Gini index (g) and Kolkata index (k) obtained from the analysis of IRS (US) data on income, income tax, and income from movies from 1983 to 2018 (see inset in the figure) and citations of papers published by scientists from universities or institutes; published in journals; and by Nobel laureates in physics, chemistry, medicine, and economics (data taken from Refs. [27,28]). The initial variation of k against g for both income and income tax and for citations by universities, journals, and individual scientists is remarkably similar, showing quantitative agreement. The main figure illustrates this comparison (adopted from [15]).
Figure 10. Comparison of the Gini index (g) and Kolkata index (k) obtained from the analysis of IRS (US) data on income, income tax, and income from movies from 1983 to 2018 (see inset in the figure) and citations of papers published by scientists from universities or institutes; published in journals; and by Nobel laureates in physics, chemistry, medicine, and economics (data taken from Refs. [27,28]). The initial variation of k against g for both income and income tax and for citations by universities, journals, and individual scientists is remarkably similar, showing quantitative agreement. The main figure illustrates this comparison (adopted from [15]).
Entropy 25 00735 g010
Figure 11. Plot of the values of the Kolkata (k) index versus the corresponding Gini (g) index for the citation statistics of publications by 20 selected Nobel laureates, as shown in Table 9. The plot suggests a coincidence value of k = g = 0.86 ± 0.06 , as adapted from a previous study [27].
Figure 11. Plot of the values of the Kolkata (k) index versus the corresponding Gini (g) index for the citation statistics of publications by 20 selected Nobel laureates, as shown in Table 9. The plot suggests a coincidence value of k = g = 0.86 ± 0.06 , as adapted from a previous study [27].
Entropy 25 00735 g011
Figure 12. Plot of the Kolkata (k) index versus the Gini (g) index for the citation inequalities in papers published by individual prize winners. The data were extracted from the corresponding Lorenz function ( L ( p ) ) for each scientist and are presented in Table 10 (adopted from [15]).
Figure 12. Plot of the Kolkata (k) index versus the Gini (g) index for the citation inequalities in papers published by individual prize winners. The data were extracted from the corresponding Lorenz function ( L ( p ) ) for each scientist and are presented in Table 10 (adopted from [15]).
Entropy 25 00735 g012
Figure 13. A compiled plot of the Kolkata index (k) values versus corresponding Gini index (g) values for several cases analyzed in previous subsections, including household income and income tax data (Figure 5, Figure 6), movie income (Figure 7), citation inequalities among individual prize winners (Table 10, Figure 12), and vote share inequalities among election contestants (Table 4). The results suggest that there may be universal inequality measures across social institutions, as the data points in the plot converge towards a common value of k = g = 0.87 ± 0.02 . This observation has important implications for understanding the nature and extent of wealth inequality across different domains of society (adopted from [15]).
Figure 13. A compiled plot of the Kolkata index (k) values versus corresponding Gini index (g) values for several cases analyzed in previous subsections, including household income and income tax data (Figure 5, Figure 6), movie income (Figure 7), citation inequalities among individual prize winners (Table 10, Figure 12), and vote share inequalities among election contestants (Table 4). The results suggest that there may be universal inequality measures across social institutions, as the data points in the plot converge towards a common value of k = g = 0.87 ± 0.02 . This observation has important implications for understanding the nature and extent of wealth inequality across different domains of society (adopted from [15]).
Entropy 25 00735 g013
Figure 14. The relationship of the k index versus the Gini index (g) for two sand pile models: (a) the BTW model and (b) the Manna model. In both cases, the initial portions of the curves follow a straight line with slightly different slopes, as demonstrated in the figures. The crossing points of the curves with the line g = k are 0.8628 and 0.8556 for the BTW and Manna models, respectively. This figure was adapted from [16].
Figure 14. The relationship of the k index versus the Gini index (g) for two sand pile models: (a) the BTW model and (b) the Manna model. In both cases, the initial portions of the curves follow a straight line with slightly different slopes, as demonstrated in the figures. The crossing points of the curves with the line g = k are 0.8628 and 0.8556 for the BTW and Manna models, respectively. This figure was adapted from [16].
Entropy 25 00735 g014
Figure 15. The relationships between the k index and Gini index (g) for the EW and centrally loaded fiber bundle models. Figure (a) shows the k versus g plot for the EW model, with an initial slope of 0.40. Figure (b) displays the same plot for the centrally loaded fiber bundle model, with an initial slope of 0.42. The figure is adapted from [16].
Figure 15. The relationships between the k index and Gini index (g) for the EW and centrally loaded fiber bundle models. Figure (a) shows the k versus g plot for the EW model, with an initial slope of 0.40. Figure (b) displays the same plot for the centrally loaded fiber bundle model, with an initial slope of 0.42. The figure is adapted from [16].
Entropy 25 00735 g015
Table 1. The set of Lorenz functions of the form L ( p ) = p n with p ranging within the interval [ 0 , 1 ] and n values ranging from 1 to 20.
Table 1. The set of Lorenz functions of the form L ( p ) = p n with p ranging within the interval [ 0 , 1 ] and n values ranging from 1 to 20.
n L ( p ) = p n g n k n
( n = 1 ) p0 1 2
( n = 2 ) p 2 1 3 5 1 2 0.618
( n = 3 ) p 3 1 2 0.682
( n = 4 ) p 4 3 5 0.725
( n = 5 ) p 5 0.6670.755
( n = 6 ) p 6 0.7140.778
( n = 7 ) p 7 0.7500.797
( n = 8 ) p 8 0.7780.812
( n = 9 ) p 9 0.8000.824
( n = 10 ) p 10 0.8180.835
( n = 11 ) p 11 0.8330.844
( n = 12 ) p 12 0.8460.853
( n = 13 ) p 13 0.8570.860
( n = 14 ) p 14 0.8670.866
( n = 15 ) p 15 0.8750.872
( n = 16 ) p 16 0.8820.877
( n = 17 ) p 17 0.8890.882
( n = 18 ) p 18 0.8950.886
( n = 19 ) p 19 0.9000.890
( n = 20 ) p 20 0.9050.894
Table 2. Different polynomial Lorenz functions (adopted from Banerjee et al. [15]).
Table 2. Different polynomial Lorenz functions (adopted from Banerjee et al. [15]).
Case L ( n , a ) ( p ) g ( n , a ) = k ( n , a ) Interval of n
( 1 ) p n 0.865(13, 14)
( 2 ) m = 1 n 1 n p m 0.869(65, 66)
( 3 ) ( n 1 ) n p + m = 2 n 1 n ( n 1 ) p m Impossibility
( 4 ) m = 1 n 1 1 n ( n 1 ) p m + ( n 1 ) n p n 0.874(17, 18)
( 5 ) m = 1 n 6 m ( n + 1 m ) n ( n + 1 ) ( n + 2 ) p m 0.874(40, 41)
( 6 ) m = 1 n 2 ( m + 1 ) n ( n + 3 ) p m 0.877(29, 30)
( 7 ) m = 1 n ln ( n + 1 m ) r = 1 n ln ( n + 1 r ) p m 0.881(77, 78)
Table 3. An analysis of income inequality in box office earnings for movies released during the period of 2011–2019 in two major film industries, Hollywood (USA) and Bollywood (India). Data taken from refs. [21,22].
Table 3. An analysis of income inequality in box office earnings for movies released during the period of 2011–2019 in two major film industries, Hollywood (USA) and Bollywood (India). Data taken from refs. [21,22].
MovieBox Office Collection
from Hollywood (USA) Movies
Release
Year
Total
Movies
Gini
(g)
Kolkata
(k)
20106510.870.86
20117300.870.87
20128070.890.88
20138260.900.88
20148490.900.88
20158470.910.89
20168560.900.89
20178520.910.89
20189930.920.90
20199110.920.90
MovieBox Office Collection
from Bollywood (India) Movies
Release
Year
Total
Movies
Gini
(g)
Kolkata
(k)
20101390.770.81
20111230.780.82
20121320.780.81
20131360.760.79
20141450.80.82
20151660.80.82
20162150.830.83
20172510.850.84
20182180.840.85
20192460.850.86
Table 4. Gini (g) and Kolkata (k) index values obtained from the Lorenz function ( L ( p ) ) for the vote shares in the Indian parliamentary elections held in the years 2014 and 2019, where the number of contesting candidates was over 8000 in each year. The data used for the analysis were obtained from references [24,25].
Table 4. Gini (g) and Kolkata (k) index values obtained from the Lorenz function ( L ( p ) ) for the vote shares in the Indian parliamentary elections held in the years 2014 and 2019, where the number of contesting candidates was over 8000 in each year. The data used for the analysis were obtained from references [24,25].
YearTotal Votersgk
2014 5 × 10 8 0.830.86
2019 6 × 10 8 0.850.88
Table 5. Gini and Kolkata indices for citation inequalities of publications by authors from selected universities analyzed in December 2013 using data obtained from the ISI Web of Science (adapted from [7,27]).
Table 5. Gini and Kolkata indices for citation inequalities of publications by authors from selected universities analyzed in December 2013 using data obtained from the ISI Web of Science (adapted from [7,27]).
Inst./Univ.YearISI Web of Science Data
N p N c Index Values
g k
Melbourne198086616,1070.670.75
1990113130,3490.680.75
2000211657,8710.650.74
2010525563,1510.680.75
Tokyo1980287160,6820.690.76
19904196108,1270.680.76
20007955221,3230.700.76
2010915491,3490.700.76
Harvard19804897225,6260.730.78
19906036387,2440.730.78
20009566571,6660.710.77
201015,079263,6000.690.76
MIT19802414101,9290.760.79
19902873156,7070.730.78
20003532206,1650.740.78
20105470109,9950.690.76
Cambridge1980167862,9810.740.78
19902616111,8180.740.78
20004899196,2500.710.77
20106443108,8640.700.76
Oxford1980124139,3920.700.77
1990214783,9370.730.78
20004073191,0960.720.77
20106863114,6570.710.76
Table 6. Gini and Kolkata indices for citation inequalities of publications by authors from selected Indian universities/institutes analyzed in December 2013 using data obtained from the ISI Web of Science (adapted from [7]).
Table 6. Gini and Kolkata indices for citation inequalities of publications by authors from selected Indian universities/institutes analyzed in December 2013 using data obtained from the ISI Web of Science (adapted from [7]).
Inst./Univ.YearISI Web of Science Data
N p N c Index Values
g k
SINP1980321700.720.74
1990916660.660.73
200014822250.770.79
201023818960.710.76
IISC198045047280.730.78
199057384100.700.76
200087419,1670.670.75
2010162411,4970.620.73
TIFR198016720240.700.76
199030349610.730.77
200043911,2750.740.77
201057399880.780.79
Calcutta19801627490.740.78
199021715110.640.74
200017320730.680.74
201043224700.610.73
Delhi198042626140.670.75
199024722520.680.76
200030137910.680.76
201091468960.660.74
Madras198019313170.690.76
199015810440.680.76
200018821770.640.73
201034822680.780.79
Table 7. Gini and Kolkata indices for citation inequalities of publications in selected science journals in December 2013 using data obtained from the ISI Web of Science (adapted from [7]).
Table 7. Gini and Kolkata indices for citation inequalities of publications in selected science journals in December 2013 using data obtained from the ISI Web of Science (adapted from [7]).
Inst./Univ.YearISI Web of Science Data
N p N c Index Values
g k
Nature19802904178,9270.800.81
19903676307,5450.860.85
20003021393,5210.810.82
20102577100,8080.790.81
Science19801722111,7370.770.80
19902449228,1210.840.84
20002590301,0930.810.82
2010243985,8790.760.79
PNAS(USA)1980----
19902133282,9300.540.70
20002698315,6840.490.68
20104218116,0370.460.66
Cell198039472,6760.540.70
1990516169,8680.500.68
2000351110,6020.560.70
201057332,4850.680.75
PRL1980119687,7730.660.74
19901904156,7220.630.74
20003124225,5910.590.72
2010335073,9170.510.68
PRA198063924,8020.610.73
1990192254,5110.610.72
2000141038,9480.600.72
2010293426,3140.530.69
PRB1980141362,7410.650.74
19903488153,5210.650.74
20004814155,1720.590.72
2010620770,6120.530.69
PRC198063019,3730.660.75
199072815,3120.630.73
200085619,1430.570.71
2010106111,7640.560.70
PRD198080036,2630.760.80
1990104933,2570.680.76
2000206166,4080.610.73
2010301240,1670.540.69
PRE1980----
1990----
2000207851,8600.580.71
2010238116,6050.500.68
Table 8. Average citation share of papers published from 1980 to 2010 by different universities/institutions and in journals [26].
Table 8. Average citation share of papers published from 1980 to 2010 by different universities/institutions and in journals [26].
Inst./Univ./JournPapers (%)Citations (%)Comments
Harvard (Univ)2278About 23% of the papers
MIT (Univ)2278published by leading
IISC (Inst)2575universities/institutions received 77%
TIFR (Inst)2377of the citations.
About 19% of the papers
Nature (Journ)1882published in leading
Science (Journ)1981journals received 81% of
the citations.
Table 9. Statistical analysis of research papers and their citations for 20 Nobel laureates in economics (Econ), physics (Phys), chemistry (Chem), and biology/physiology/medicine (Bio). The data were collected from their individual Google Scholar pages with a verifiable email site during the first week of January 2021. To be included in the analysis, the Laureates had to have at least 100 entries (papers or documents), with the latest not before 2018. These Nobel laureates have published a range of papers, from 111 to 3000, with N p denoting the number of papers. The laureates’ names appear in the same form as they do on their respective Google Scholar pages (adopted from Ghosh et al. [28]).
Table 9. Statistical analysis of research papers and their citations for 20 Nobel laureates in economics (Econ), physics (Phys), chemistry (Chem), and biology/physiology/medicine (Bio). The data were collected from their individual Google Scholar pages with a verifiable email site during the first week of January 2021. To be included in the analysis, the Laureates had to have at least 100 entries (papers or documents), with the latest not before 2018. These Nobel laureates have published a range of papers, from 111 to 3000, with N p denoting the number of papers. The laureates’ names appear in the same form as they do on their respective Google Scholar pages (adopted from Ghosh et al. [28]).
AwardName of RecipientGoogle Scholar Citation Data
N p N c Index Values
g k
NOBEL
Prize
(Econ.)
Joseph E. Stiglitz3000323,4730.900.88
William Nordhaus78374,3690.870.86
Abhijit Banerjee57859,7040.890.88
Esther Duflo56569,8430.910.89
Paul Milgrom365102,0430.900.89
Paul Romer25595,4020.960.93
NOBEL
Prize
(Phys.)
Hiroshi Amano130044,3290.800.81
David Wineland72063,9220.880.87
Gérard Mourou70049,7590.820.83
Serge Haroche53340,0340.870.86
A. B. McDonald49220,3460.910.88
David-Thouless27347,4520.890.87
F.D.M. Haldane24441,5910.870.86
Donna Strickland11110,3700.950.92
NOBEL
Prize
(Chem.)
Joachim Frank85348,0770.800.81
Frances Arnold68256,1010.750.79
Jean Pierre Sauvage71357,4390.730.77
Richard henderson24527,5580.840.84
NOBEL
Prize
(Bio.)
Gregg L. Semenza712156,2360.810.82
Michael Houghton49349,3680.830.83
Table 10. Citation data analysis for selected prize winners in Physics, mathematics, and social sciences. The selected prize winners are recipients of the Dirac Medal, Boltzmann Medal, Fields Medal, and John von Neumann Award. The analysis includes only those prize winners who have verified email addresses and an updated Google Scholar page after 2018. The data were collected in the first week of April 2021. For each individual scientist, the table reports the total number of papers ( N p ), total citations ( N c ), Gini index (g), and Kolkata index (k).
Table 10. Citation data analysis for selected prize winners in Physics, mathematics, and social sciences. The selected prize winners are recipients of the Dirac Medal, Boltzmann Medal, Fields Medal, and John von Neumann Award. The analysis includes only those prize winners who have verified email addresses and an updated Google Scholar page after 2018. The data were collected in the first week of April 2021. For each individual scientist, the table reports the total number of papers ( N p ), total citations ( N c ), Gini index (g), and Kolkata index (k).
AwardName of RecipientGoogle Scholar Citation Data
N p N c Index Values
g k
FIELDS
Medal
(Math.)
Terence Tao60480,3540.880.86
Edward Witten402314,3770.740.79
Alessio Figalli22853380.670.75
Vladimir Voevodsky18985540.830.85
Martin Hairer18175850.740.78
Andrei Okounkov13410,6860.690.76
Stanislav Smirnov7941440.760.79
Richard E. Borcherds6150960.810.83
Ngo Bao Chau4412140.710.76
Maryam Mirzakhani2517690.570.74
ASICTP
DIRAC
Medal
(Phys.)
Rashid Sunyaev1789103,4930.910.88
Peter Zoller838100,9560.810.82
Mikhail Shifman78452,5720.850.84
Subir Sachdev72558,6920.830.82
Xiao Gang Wen43246,2940.80.82
Alexei Starobinsky32847,3590.810.82
Pierre Ramond31823,6100.890.87
Charles H. Bennett23689,7980.90.88
V. Mukhanov20827,7770.850.84
M A Virasoro15012,8860.90.87
BOLTZMANN
Award
(Stat. Phys.)
Elliott Lieb75576,1880.860.85
Daan Frenkel73666,5220.80.81
Harry Swinney57746,5230.860.84
Herbert Spohn44625,1880.790.8
Giovanni Gallavotti44615,5830.860.84
JHON Von
NEUMANN
Award
(Social Sc.)
Daron Acemoglu1175172,4950.910.89
Olivier Blanchard1150113,6070.910.89
Dani Rodrik1118136,8970.90.89
Jon Elster88579,8690.890.87
Jean Tirole717201,4100.910.88
Timothy Besley63257,1780.890.88
Maurice Obstfeld58673,4830.90.88
Alvin E. Roth56654,1040.870.86
Avinash Dixit55782,5360.930.9
Philippe Aghion490119,4300.850.85
Matthew O. Jackson39739,0700.860.84
Emmanuel Saez31048,1360.860.86
Mariana Mazzucato23612,1230.870.86
Glenn Loury22613,3520.920.9
Susan Athey20318,8660.80.82
Table 11. Estimated Gini (g) and Kolkata (k) index values applied to manmade conflicts such as wars and acts of terrorism. Data were adapted from a previous study [29] and represent death counts as a measure of inequality.
Table 11. Estimated Gini (g) and Kolkata (k) index values applied to manmade conflicts such as wars and acts of terrorism. Data were adapted from a previous study [29] and represent death counts as a measure of inequality.
Type of Conflictg Indexk Index
War0.83 ± 0.020.85 ± 0.02
Battle0.82 ± 0.020.85 ± 0.02
Armed conflict0.85 ± 0.020.87 ± 0.02
Terrorism0.80 ± 0.030.83 ± 0.02
Murder0.66 ± 0.020.75 ± 0.02
Table 12. Estimated values of the Gini index (g) and Kolkata index (k) as measures of inequality in death counts resulting from natural disasters such as earthquakes and tsunamis (adapted from [29]). The data presented here provide valuable insights into the distribution of fatalities resulting from natural disasters and the effectiveness of these inequality indices in capturing the severity of such events.
Table 12. Estimated values of the Gini index (g) and Kolkata index (k) as measures of inequality in death counts resulting from natural disasters such as earthquakes and tsunamis (adapted from [29]). The data presented here provide valuable insights into the distribution of fatalities resulting from natural disasters and the effectiveness of these inequality indices in capturing the severity of such events.
Type of Disasterg Indexk Index
Earthquake 0.94 ± 0.02 0.95 ± 0.02
Flood 0.98 ± 0.02 0.98 ± 0.02
Tsunami 0.93 ± 0.02 0.94 ± 0.02
Table 13. Inequality statistics of Olympic medals and winning countries (2008–2020). Typically, the number of medals (gold, silver, or bronze) for each year is about 300, with around 200 contesting countries. When the cumulative fraction of medals is plotted against the fraction of countries winning them (ordered from the fewest to most medals won), the result is the Lorenz curve, based on which we estimate the Gini (g) and Kolkata (k) indices for each year. As the level of competition is extremely high in the Olympics (with no welfare support for equality in achievements among contestants), typically, we find 13 to 16 percent of countries win 84–87 percent of medals (gold, silver, or bronze) in any Olympics.
Table 13. Inequality statistics of Olympic medals and winning countries (2008–2020). Typically, the number of medals (gold, silver, or bronze) for each year is about 300, with around 200 contesting countries. When the cumulative fraction of medals is plotted against the fraction of countries winning them (ordered from the fewest to most medals won), the result is the Lorenz curve, based on which we estimate the Gini (g) and Kolkata (k) indices for each year. As the level of competition is extremely high in the Olympics (with no welfare support for equality in achievements among contestants), typically, we find 13 to 16 percent of countries win 84–87 percent of medals (gold, silver, or bronze) in any Olympics.
YearMedalgk
2020Gold0.870.85
Silver0.860.85
Bronze0.840.84
Total0.840.83
2016Gold0.880.87
Silver0.860.85
Bronze0.850.85
Total0.850.84
2012Gold0.890.87
Silver0.870.85
Bronze0.840.84
Total0.850.85
2008Gold0.890.87
Silver0.850.84
Bronze0.860.85
Total0.850.84
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Banerjee, S.; Biswas, S.; Chakrabarti, B.K.; Ghosh, A.; Mitra, M. Sandpile Universality in Social Inequality: Gini and Kolkata Measures. Entropy 2023, 25, 735. https://doi.org/10.3390/e25050735

AMA Style

Banerjee S, Biswas S, Chakrabarti BK, Ghosh A, Mitra M. Sandpile Universality in Social Inequality: Gini and Kolkata Measures. Entropy. 2023; 25(5):735. https://doi.org/10.3390/e25050735

Chicago/Turabian Style

Banerjee, Suchismita, Soumyajyoti Biswas, Bikas K. Chakrabarti, Asim Ghosh, and Manipushpak Mitra. 2023. "Sandpile Universality in Social Inequality: Gini and Kolkata Measures" Entropy 25, no. 5: 735. https://doi.org/10.3390/e25050735

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop