Excess death estimates from multiverse analysis in 2009–2021

Levitt, Michael; Zonta, Francesco; Ioannidis, John P. A.

doi:10.1007/s10654-023-00998-2

Excess death estimates from multiverse analysis in 2009–2021

MORTALITY
Open access
Published: 12 April 2023

Volume 38, pages 1129–1139, (2023)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Epidemiology Aims and scope Submit manuscript

Excess death estimates from multiverse analysis in 2009–2021

Download PDF

18k Accesses
7 Citations
838 Altmetric
3 Mentions
Explore all metrics

Abstract

Excess death estimates have great value in public health, but they can be sensitive to analytical choices. Here we propose a multiverse analysis approach that considers all possible different time periods for defining the reference baseline and a range of 1 to 4 years for the projected time period for which excess deaths are calculated. We used data from the Human Mortality Database on 33 countries with detailed age-stratified death information on an annual basis during the period 2009–2021. The use of different time periods for reference baseline led to large variability in the absolute magnitude of the exact excess death estimates. However, the relative ranking of different countries compared to others for specific years remained largely unaltered. The relative ranking of different years for the specific country was also largely independent of baseline. Averaging across all possible analyses, distinct time patterns were discerned across different countries. Countries had declines between 2009 and 2019, but the steepness of the decline varied markedly. There were also large differences across countries on whether the COVID-19 pandemic years 2020–2021 resulted in an increase of excess deaths and by how much. Consideration of longer projected time windows resulted in substantial shrinking of the excess deaths in many, but not all countries. Multiverse analysis of excess deaths over long periods of interest can offer an approach that better accounts for the uncertainty in estimating expected mortality patterns, comparative mortality trends across different countries, and the nature of observed mortality peaks.

Estimates of excess mortality during the COVID-19 pandemic strongly depend on subjective methodological choices

Article 04 May 2023

Excess all-cause mortality in the USA and Europe during the COVID-19 pandemic, 2020 and 2021

Article Open access 03 November 2022

How to measure premature mortality? A proposal combining “relative” and “absolute” approaches

Article Open access 26 October 2021

Introduction

Calculation of excess deaths is considered to be a very useful tool for estimating patterns of mortality changes over time in different countries and the impact of major events, such as pandemics [1,2,3]. Excess deaths are meant to capture the composite sum of perturbations in disease incidence and other factors, including social, health care, lifestyle and natural catastrophes that may shape population fatalities in a given year. However, excess death calculations can lead to controversy with different teams of researchers generating markedly different estimates for the same country and year(s) [4,5,6]. The reason is that the calculation of excess deaths requires making analytical choices for which there is no consensus. Specifically, one needs to select a reference baseline period (a time window in the past that will be used for extrapolating how many deaths would be expected in subsequent years) and a projected period (the time window for which an excess death estimate is made by comparing the observed versus expected number of deaths based on the past experience). Moreover, one should decide whether there are any time patterns and what is the form of these time patterns (e.g. whether overall mortality should be declining or increasing over time and, if so, in what form, e.g. linear or spline fit). Empirical work and simulations [4,5,6,7,8,9,10] have shown that these choices can make a substantial difference in the obtained excess death estimates.

When results depend on analytical choices, one methodological strategy is to explore the full range of results that can be obtained when a wide range of possible analytical choices and combinations thereof are considered [11,12,13,14,15,16,17,18,19,20]. Analyses may range from a few dozen to several million different options (e.g. in selecting covariate sets in regressions) [15, 17]. Different terminology has been used for such approaches that generalize the concept of sensitivity analysis. Commonly used terms are “multiverse analysis” [11,12,13,14], “vibration of effects” [16,17,18] and “multi-analyst analysis” [19, 20] (when multiple researchers are each asked to select independently their preferred analysis). Here, we propose a multiverse approach for excess deaths. Instead of making unavoidably arbitrary choices in selecting reference baseline and projected periods, we consider all possible reference baseline periods and projected periods in adjacent year time windows during a lengthy period of interest. Instead of prespecifying time patterns, this multiverse approach allows the data to demonstrate what might be the time patterns and how sensitive the results are to different analytical choices. All possible choices are considered for reference baseline periods (extending as far back as 2009). The multiverse approach also allows us to understand to what extent excess death estimates may shrink when longer projected periods are considered, in the range of 1–4 years. If perturbations lead to excess deaths increases due to the demise of individuals with limited life expectancy [21], then excess death peaks that are seen with short projected periods (e.g. 1 year) will diminish or even disappear when longer projected periods are considered. People who died at some point due to the perturbation would have died very soon anyhow. Conversely, if perturbations result in mortality peaks due to deaths of people who had long life expectancy, extending the projected period window will not have the same attenuating impact.

We applied this approach to 33 high-income countries studied before [6] and which have the most reliable data for mortality according to age-stratified groups for the extended period 2009–2021. Our aim here is to propose the multiverse method, illustrate its application, and see how it can offer insights about evolving relative patterns of mortality over many years in each country and how these patterns compare across countries. The multiverse approach focuses on relative comparisons rather than on obtaining absolute estimates of excess deaths during a specific given pandemic period. However, we have also used it to generate absolute estimates of excess deaths during the pandemic period, by considering different types of down weighting of older reference years as opposed to newer reference years.

Results

Variability of excess death estimates according to reference baselines

The absolute value of excess death estimates can vary substantially depending on the selection of reference years used for baseline. We considered all 66 possible time windows of whole consecutive calendar years (1–11 years long) in the years 2009–2019 as representing baseline values. Table 1 shows the average, standard deviation, minimum, maximum and range for estimates of relative excess deaths (expressed as percentage of expected deaths) for the two-year pandemic period 2020–2021 for each of the 33 countries. The average value is highly correlated with either the maximum or minimum value but not with standard deviation or range (correlation coefficients of 0.96, 0.95, − 0.22 and − 0.15, respectively). Table 1 also shows the average excess death estimates across 66 possible time windows when different down weighting is applied for older years in the reference range. Figure S1 shows that the average multiverse percentage relative excess death values are highly correlated to the corresponding values calculated with the previously used single reference period of years 2017–19 [6]. The multiverse values with the dw3 weighting scheme are very similar to those with the 2017–19 baseline. The multiverse values averaged with equal weights for all 66 baselines are lower by 4.6 percentage points.

Table 1 Average, standard deviation, minimum, maximum and range for estimates of relative excess deaths (expressed as percentage of expected deaths referred to as p%) for the two-year pandemic period 2020 + 2021 for each of the 33 countries

Full size table

Stability of relative ranking for the pandemic years’ excess deaths across 33 countries

The estimates of relative excess deaths (as percentage of expected deaths) can be used to compare different countries in a given time period. Despite large variabilities in the absolute estimates, the relative ranking of the 33 countries for a given period of interest was largely unperturbed, regardless of what reference baseline years were chosen. Figure 1 shows the ranking of relative excess death estimates (as percentage of expected deaths) for the pandemic years 2020–2021 in all 66 analyses with different reference baseline windows. The USA had the highest estimates of relative excess deaths among all 33 countries in 50 of 66 analyses, the second highest in 15 analyses and the fourth highest in 1 analysis. Conversely, South Korea had the lowest estimates in 59 of 66 analyses, the second to lowest in 5, the third to lowest in 1, and the sixth to lowest in 1. Eastern European and Balkan countries closely followed the USA in the top excess death ranks consistently. Scandinavian countries, Australia, and New Zealand consistently were placed among the lowest excess death ranks next to South Korea. Other Western European countries typically occupied middle ranks. Figures S2A, S2B and S2C show that the distribution of country ranks for projected periods of 1 year, 3 years and 4 years are similar to that shown in Fig. 1 for 2020–2021; summing over more years does blur the ranking of middle-ranked countries.

Diversity in time patterns across 33 countries

Figure 2 maps the emerging time patterns for mortality in each of the analyzed countries for the average of the 66 analyses using different reference baseline periods and the range of maximum and minimum estimates. Although the range of estimates of relative mortality for each given year is large, the rank of different years for a particular country is generally the same for the 66 different sets of reference years (Figure S3). Time patterns across different countries show large variability as well. Differences exist both in the presence and magnitude/steepness of time trends; and on the presence or not of peaks of mortality impact during the COVID-19 pandemic (2020, 2021, both, or neither). All countries had some decline in mortality over the period 2009–2019, but for the USA in particular the change was minimal (change from average of 1.27% in 2009–2010 to − 1.31% in 2018–2019 for an overall decline of only 2.58%, using data in Table S1B). The other 4 countries with the smallest changes for the averages between 2009–2010 and 2018–2019 were Germany, The Netherlands, the United Kingdom and Canada, (changes of − 6.65%, − 8.34%, − 8.49% and − 8.55%, respectively). Conversely, the 5 countries with the largest declines for the averages between 2009–2010 and 2018–2019 were South Korea, Estonia, Denmark, Slovakia and Norway (changes of − 23.3%, − 19.7%, − 17.1%, − 16.0% and − 14.1%, respectively). For the pandemic period 2020–2021, the USA had the steepest increase (change in average 18.00% between 2018–2019 and 2020–2021). Steep increases were seen also in Eastern European and Balkan countries (changes in average from 10.18% to 17.46% between 2018–2019 and 2020–2021 for Slovenia, Hungary, Latvia, Croatia, Lithuania, Czechia, Slovakia and Poland). Most western European countries had more modest disruptions of the declining trend (changes for the averages from 1.86% to 9.98% between 2018–2019 and 2020–2021 for Luxembourg, Germany, Switzerland, France, The Netherlands, Belgium, Portugal, Austria, the United Kingdom, Spain and Italy). Some Scandinavian countries, Australia, New Zealand, and South Korea continued to have declining mortality trends during the pandemic (changes for the averages from − 4.97% to − 2.44% between 2018–2019 and 2020–2021 for New Zealand, South Korea, Iceland, Norway, Denmark and Australia). Figures S4A, S4B, and S4C map the time patterns shown in Figure S2 for periods of 1, 3 and 4 years, respectively and show how longer projected periods reduce fluctuations.

Table S2 and Tables S1A, B, C, D present data on the worst years. Table S1A shows that the worst single year with the highest mortality was 2021 for 10 countries (Slovakia, Poland, United States, Latvia, Lithuania, Hungary, Croatia, Czechia, Chile and Greece), 2020 was the highest for 4 countries (United Kingdom, Italy, Spain and Belgium), 2010 was worst for Luxembourg and 2009 was worst for all other 18 countries. When considering 2-year periods, in 25 of the 33 countries, 2009 + 2010 were the worst pair of years (Table S2). In 9 of the 33 countries (Chile, Czechia, Greece, Hungary, Italy, Lithuania, Poland, Slovakia and United States) the pandemic years 2020 + 2021 were the worst, and in all of them the years 2009 + 2010 were the second worst. (Table S1A & Table 2). In 16 countries, the pandemic years were not among the three worst years, which were always years between 2009 and 2016. When considering 3 or 4 year periods, in 31 of the 33 countries 2009–2011 and 2009–2012, were the worst, respectively. Only in Poland or the United States were period 2019–2021 and 2018–2021, which include the pandemic years, the worst, respectively, (data from Tables S1C & S1D).

Table 2 Effect of changing the projected period of interest from 1 to 4 years for the most recent years: 2021 alone, 2020 alone, <2 Years> = 2020+2021, <3 Years> = 2019+2020+2021, and <4 Years> = 2018+2019+2020+2021).

Full size table

Excess death estimates in recent years using different projected period time windows

Table 2 shows the effect of changing the width of the projected period of interest from 1–4 years for the most recent years (2021 alone, 2020 alone, 2020–2021, 2019–2021, 2018–2021). As shown, there is substantial attenuation of the relative excess mortality between the single worse pandemic year and increasingly wider periods of interest. The attenuation was most prominent when averaging over a 4-year period for Slovakia, Latvia, Lithuania, Poland, Estonia and Croatia, with relative drops of 18.3, 15.2, 12.5, 12.4, 11.0 and 11.2 percentage points, respectively. The attenuation was least prominent for Australia, Norway, Denmark, Iceland, New Zealand and South Korea, with relative drops of − 0.2, − 0.5, − 0.20, − 1.0, − 0.8 and − 1.6, percentage points, respectively. The USA maintained the most prominent peak even with a 4-year window.

With increasing projected periods, both the mean and standard deviation of the relative excess mortality declined substantially. For 2021, 2020–2021, 2019–2021, 2018–2021, the mean was 2.6%, 1.5%, − 1.0%, and − 1.7%, respectively. The standard deviation was 9.3%, 7.2%, 5.2%, and 4.1%, respectively.

Discussion

Our application of a multiverse approach to excess death data shows that consideration of different periods for reference baseline resulted in major variability in the absolute magnitude of the exact excess death estimates, but it did not affect substantially the relative ranking of different countries compared to others for specific years. Moreover, there have been distinct time patterns across different countries during 2009–2021. Countries differed markedly on whether they had a substantial decrease over time or not during 2009–2021, on whether they had a peak during the 2020–2021 pandemic years, and, if so, how high, and in the relative contribution of 2020 and of 2021 to this peak. With longer time windows for the projected period of interest (1 to 4 years), the range of excess deaths across different countries in the pandemic years and the 2 years preceding the pandemic shrank substantially and excess death estimates became less variable across countries. This suggests that it would be inappropriate to dwell too much on small or modest differences between countries, as these are highly model-dependent. However, peaks did not disappear and for the USA in particular, excess deaths remained prominent even with long projected periods of interest.

In the multiverse literature from other fields, some analytical choices may be considered more meaningful or relevant than others. When researchers are asked to select independently what analysis mode they feel is most sensible, not all analytical choices are selected [19, 20] and some types of choices may seem to make more sense. This may apply also for excess death calculations. E.g. it may seem not so appropriate to use a reference window of 2009 alone for projecting mortality in 2021. The baselines created by each of the 66 different windows may have less or more relevance to the current situation. In principle, baselines using more recent years may be more informative for the current time. Common choices include using the last 3 years or the last 5 years.

The obvious heterogeneity of time patterns across different countries suggests that selection of specific time trends in modeling excess deaths may be a situation where one size does not fit all. Selection of specific anticipated time trend patterns may markedly affect the results in ways that are not verifiable for their appropriateness. E.g. selecting a model that anticipates a marked decrease in mortality over time makes it difficult for a country not to have excess deaths even if it does very well in a given year–but still falls short of an anticipated stellar improvement over time. It should be acknowledged that age-adjusted mortality rates usually have decreased over time in most countries in the last several decades. Standard methods for forecasting future mortality rates and life expectancy such as the Lee-Carter forecasts [22, 23] and other methods that use time series approaches end up using some linear trends in the modeling. However, it has been observed [24] that changes in mortality rates may differ markedly in different years even in the same country/location and they may also differ across different age and gender groups in the same country and same year. In the presence of major perturbation events such as pandemics or wars and natural disasters, such modeling estimates will deviate from observed deaths in proportion to the magnitude of the catastrophe. More importantly, there is no guarantee that mortality rates should continue declining, let alone markedly decline, over time with medical and other progress, even in the absence of major negative perturbation events [6]. For advanced economies with aging populations, accumulating frailty and disease burden and restrictions or ceilings to progress and available resources, the typical trends for decreased mortality that were documented in the previous decades may not be sustainable for the future. Furthermore, countries that have already reached very high life expectancies may have less room for improvement than others that are lagging behind. The multiverse approach, when applied to multiple countries, allows a comparative assessment of the trajectory of different countries. This may be preferable and it may offer some genuine insights about which countries do well (short-term and long-term) and which do poorly–in comparison.

In this regard, some stark differences stand out for both long-term trends and for the pandemic years. The USA consistently performed very poorly with both stagnation in mortality during the pre-pandemic years and a sharp increase during the pandemic. Eastern European and Balkan countries showed sharp decreases during the pre-pandemic years and a sharp increase during the pandemic. Most western European countries had sharp decreasing trends with modest disruption during the pandemic. All Scandinavian countries, Australia, New Zealand, and South Korea have had largely unperturbed declining mortality patterns. The markedly different patterns may reflect a combination of social, health care, and pandemic factors. The USA has an ailing health system with approximately 30 million uninsured people [25], large inequalities [26], many people with poor access to care [27], and major ongoing non-infectious epidemics, including obesity [28], opioid abuse and overdose [29], and violent deaths [30]. More detailed data are needed to understand which of the policies and actions during the pandemic or the pre-existing problems were more important for shaping the poor performance in 2020 and beyond. Eastern European and Balkan countries have limited resources for their healthcare systems and lower social welfare than other European countries [31] and some countries like Greece have long suffered from austerity [32]. The best performers are excelling in social welfare and health system functionality and resources, even if there are differences across countries. Exceptions may occur within circumscribed populations and adverse settings even in countries with overall excellent trajectories. For example, the dysfunctional consequences of privatization in nursing homes in countries like Sweden or Canada [33, 34] translated to peaks of excess deaths during circumscribed periods in the long-term care settings [35].

Consideration of longer projected period time windows diminished substantially the range of excess deaths in some countries, but not in others. Overall, when longer periods are considered, differences between most countries become less pronounced. However, larger windows had minimal effect in the USA, and this may reflect that the problems that lead to unfavorable mortality patterns in the USA reflect chronic dysfunctions that might have been accentuated by the pandemic but pre-existed and which affect also people with long life expectancy. Poverty, marginalization, homelessness, inequalities, drug overdoses, and violence affect indeed young and middle-aged populations. We have shown previously that the USA has had 40% of excess deaths contributed by the < 65 age stratum, a higher percentage than all other highly developed countries [6]. Conversely, in many other countries, large time windows for the projected period shrank substantially the excess death fluctuations. This suggests that in these countries excess deaths temporarily affect mostly people with relatively limited life expectancy [21].

Europe, while not a country, has historically aggregated excess death data in the EuroMOMO data base (https://www.euromomo.eu/) to include data from 21 countries in Europe plus Israel [36]. If one were to aggregate data for the 19 of these 21 countries for which we have data (excluding Cyprus & Ireland), the fictional country composite that includes Austria, Belgium, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Israel, Italy, Luxembourg, Netherlands, Norway, Portugal, Slovenia, Sweden, Switzerland, and United Kingdom has a similar population to the USA (410 million versus 330 million) and the relative excess death of this European composite is only 2.46% for the pandemic years (2020 + 2021), which is in stark difference to the USA figures. It is also less than two-year totals for 2009 + 2010 and 2010 + 2011, with values of 5.57% and 3.30% caused apparently by elevated influenza pandemics [37]. One may examine also the excess deaths according to the EuroMOMO model, but the model has been criticized for low baseline values which may lead to overestimation of excess mortality in some countries [38].

Some limitations should be acknowledged. First, there are some additional sources of analytical flexibility that can be considered in excess death calculations. These include the choice of age bins for age adjustment, and the use of additional adjustments for modeling the population profile over time. For example, socioeconomic profile variables would be very useful to incorporate [39], but these are not routinely available and standardized across many countries. Such additional adjustments would add additional variation with more multiverse options, but probably would not invalidate the major patterns that we observed. Second, we only modeled data from 33 countries that are the ones with the most reliable data. Extrapolations to other countries would be precarious, given the unreliability of the mortality information. Time patterns observed in the 33 countries may not necessarily apply to the remaining countries around the globe and local circumstances may make a difference. Third, we considered yearly interval increments so as to capture all 4 seasons in the unit of time, but in theory, the multiverse process can be applied for smaller units of time as well. Fourth, data on population and population structure in each country on a yearly basis are typically inferred from census data collected on more sparse timing, therefore they carry some uncertainty. Fifth, the pandemic impact and its consequences as well as the consequences of aggressive measures that were taken has continued more prominently in 2022 in some countries than others [40]. It would be interesting to see whether differences across countries get further attenuated and/or some countries continue to stand out prominently when longer pandemic and post-pandemic periods (e.g. 2020–2022 and 2020–2023) are considered. Preliminarily results based on the first 8 months of 2022, it seems that several countries with death deficit in 2020–2021 (e.g., Australia, New Zealand and South Korea), had considerable excess deaths in 2022, while some others continued to have limited deaths (e.g. Sweden) and some hard-hit countries like USA and Greece continued to do very poorly [6, 40]. Sixth, we did not consider in the multiverse analyses any superimposed modeling of time trends, specifically because we wanted to allow the data to show whatever trends of patterns existed. If one were to add in the modeling also all the possible functions that might be used to capture time trends (e.g. linear, higher power, splines, and so forth), the analytical options would multiply far more. This explains why mortality forecasting is so difficult and uncertain, and why there is no consensus on the best method on how to do it [41]. Mortality forecasting becomes even more difficult and uncertain when perturbation events such as pandemics occur. The multiverse approach helps understand why obtaining accurate absolute estimates of excess deaths is precarious. However, our approach using down weighted older reference years may be considered, if absolute estimates are desirable (as opposed to relative performance across years and compared with other countries). One may also down weigh previous reference years based on other features, e.g. severe flu seasons or major heat waves.

In conclusion, a multiverse approach to excess death calculations may offer bird’s eye views on mortality patterns in comparative assessments of a large number of countries. These patterns may be more reliably informative than efforts to obtain isolated single-country estimates of excess deaths, which are subject to substantial uncertainty even in countries with the best-collected data. It may be best to avoid pre-specifying time patterns and to allow the data to show what time patterns may be emerging. Finally, observed time patterns may not necessarily continue into the future and multiverse analyses can be updated accordingly for additional years moving forward.

Materials and methods

Data

All data comes from the Human Mortality Database (HMD) [42,43,44]. The data for the most recent years comes from the Short-Term Mortality Fluctuation file stmf.csv downloaded from https://www.mortality.org/File/GetDocument/Public/STMF/Outputs/stmf.csv. (last updated 6 February 2023). The data for earlier years extending back to 2009 was downloaded as the HMD archive file (see Supplementary Links to Data). We considered data from 2009–2021 so as to analyze 13 years including also the years of the 2009–2020 pandemic. We focused on the 33 high-income countries with highly reliable death registration systems, excluding Bulgaria as done in previous work [6]. The most recent data in the file stmf.csv is per week and uses five standard age-bands: 0–14, 15–64, 65–74, 75–84 and Over 85; we sum the data over the weeks assigned to each year as done before [6]. The older data in the HMD archive (downloaded on May 25, 2022) uses 1-year age bands for annual all-cause deaths and annual populations are available for all 33 countries. We sum these 1-year bands to give the same five standard age bands used in stmf.csv.

Excess death calculations

In order to be able to compare different countries and different time periods we focus on relative excess deaths expressed as the number of excess deaths divided by the number of expected deaths. Specifically, the relative excess death p% is the actual all-cause death count, D, minus the estimated death count, E, expressed as a percentage of the estimated death count or p% = (D-E)/E.

Systematic Variation of Assumptions for Multiverse Analyses

We consider all possible reference baseline periods and projected periods in consecutive year time windows during a lengthy period of interest. Instead of prespecifying time patterns, this multiverse approach allows the data to demonstrate how sensitive the results are to different analytical choices. Moreover, by considering the average of the multiverse results, one can reveal time patterns of decreases or increases of mortality during the covered time period.

We consider all possible reference baseline spans of consecutive years in the period 2009–2019. This gives 11 + 10 + ... + 1 = 66 different spans of length 1 to 11 years for the 66 reference baselines. For each reference period, we average the mortality of each of the five age bands. These averaged mortalities are then used to get the expected deaths in any year by multiplying the mortality of a particular age band by the population of that age band and then summing the estimated values over the five age bands to give the total estimated death count.

Projected time periods are also considered in all possible options of length 1 to 4 years, again considering consecutive calendar years. The mortality in the pandemic years 2020 and 2021 is never considered when calculating excess death. Similarly, when calculating excess deaths for projected periods 2018 + 2019 + 2020 + 2021 (or 2019 + 2020 + 2021), the years 2018–2019 (or 2019) are not considered as baselines. For analyses with different assumptions, we present the maximum, minimum, median and IQR or mean and standard deviation, as appropriate).

Analyses with down weighing for older years

Estimates of excess deaths during the pandemic years that are averaged against all possible reference periods may be misleading in absolute magnitude, since very early years such as 2009 may not be as relevant as more recent years like 2019. Therefore, we also rerun the analyses for all 66 possible combinations of reference years with various weights: (a) with weights decreasing linearly by 10% for each year before 2019 (i.e. 100% weight for 2019, 90% weight for 2018, …, 10% weight for 2010, 0% weight for 2009); (b) with weights decreasing by 5% for each year before 2019 (i.e. 100% weight for 2019, 95% weight for 2018, …, 55% for 2010, 50% for 2009); (c) with weights decreasing by half for each year (i.e. 100% weight for 2019, 50% weight for 2018, 25% weight for 2017, 12.5% weight for 2016, 6.25% weight for 2015, 3.125% weight for 2014, 1.563% for 2013, 0.781% for 2012, 0.390% for 2011, 0.195% for 2010, 0.098% for 2009). In all 3 weighting schemes, the weighted average of the 66 options is obtained, weighting each option by the average weight of the reference years that it contains. For example, the 2018–2019 reference years option is weighed by a factor of 1.9/2 = 0.95, 1.95/2 = 0.975, and 1.50/2 = 0.75, for each of the three weighting schemes above, respectively. The 2017–2019 reference years option is weighed by a factor of 2.7/3 = 0.9, 2.85/3 = 0.95, and 1.75/3 = 0.58, respectively.

Data availability

All data are in the manuscript, tables, and supplementary tables and in the publicly available databases listed in Supplementary Links to Data and deposited online at https://zenodo.org/record/7095753.

References

Kiang MV, Irizarry RA, Buckee CO, Balsari S. Every body counts: measuring mortality from the COVID-19 pandemic. Ann Intern Med. 2020;173(12):1004–7.
Article PubMed Google Scholar
Islam N. Excess deaths is the best metric for tracking the pandemic. BMJ. 2022;4(376): o285.
Article Google Scholar
Vandenbroucke JP. Covid-19: excess deaths should be the outcome measure. Ned Tijdschr Geneeskd. 2021;7(165):D6219.
Google Scholar
COVID-19 Excess Mortality Collaborators. Estimating excess mortality due to the COVID-19 pandemic: a systematic analysis of COVID-19-related mortality, 2020–21. Lancet, March 10, 2022. https://doi.org/10.1016/S0140-6736(21)02796-3
World Health Organization, Global excess deaths associated with COVID-19, January 2020 - December 2021. In: https://www.who.int/data/stories/global-excess-deaths-associated-with-covid-19-january-2020-december-2021, last accessed May 6, 2022.
Levitt M, Zonta F, Ioannidis JPA. Comparison of pandemic excess mortality in 2020–2021 across different empirical calculations. Environ Res. 2022;213: 113754.
Article CAS PubMed PubMed Central Google Scholar
Nepomuceno MR, Klimkin I, Jdanov DA, Aluztiza-Galarza A, Shkolnikov VM. Sensitivity analysis of excess mortality due to the COVID-19 pandemic. Popul Dev Rev 2022;48(2): 279–302. https://doi.org/10.1111/padr.12475.
Kowall B, Standl F, Oesterling F, Brune B, Brinkmann M, Dudda M, Pflaumer P, Jöckel KH, Stang A. Excess mortality due to Covid-19? A comparison of total mortality in 2020 with total mortality in 2016 to 2019 in Germany, Sweden and Spain. PLoS ONE. 2021;16(8): e0255540.
Article CAS PubMed PubMed Central Google Scholar
Gianicolo EAL, Russo A, Büchler B, Taylor K, Stang A, Blettner M. Gender specific excess mortality in Italy during the COVID-19 pandemic accounting for age. Eur J Epidemiol. 2021;36(2):213–8.
Article CAS PubMed PubMed Central Google Scholar
Schöley J. Robustness and bias of European excess death estimates in 2020 under varying model specifications. medRxiv 2021.06.04.21258353.
Dafflon J, Da Costa FP, Váša F, Monti RP, Bzdok D, Hellyer PJ, Turkheimer F, Smallwood J, Jones E, Leech RA. A guided multiverse study of neuroimaging analyses. Nat Commun. 2022;13(1):3758.
Article CAS PubMed PubMed Central Google Scholar
Weermeijer J, Lafit G, Kiekens G, Wampers M, Eisele G, Kasanova Z, Vaessen T, Kuppens P, Myin-Germeys I. Applying multiverse analysis to experience sampling data: Investigating whether preprocessing choices affect robustness of conclusions. Behav Res Methods 2022;54(6):2981–2992. https://doi.org/10.3758/s13428-021-01777-1.
Kołodziej A, Magnuski M, Ruban A, Brzezicka A. No relationship between frontal alpha asymmetry and depressive disorders in a multiverse analysis of five studies. Elife. 2021;26(10): e60595.
Article Google Scholar
Steegen S, Tuerlinckx F, Gelman A, Vanpaemel W. Increasing transparency through a multiverse analysis. Perspect Psychol Sci. 2016;11(5):702–12.
Article PubMed Google Scholar
Sala-i-Martin XX. I just run four million regressions. National Bureau of Economic Research, working paper 6252. Accessible in www.nber.org/paper/w6252.
Klau S, Hoffmann S, Patel CJ, Ioannidis JP, Boulesteix AL. Examining the robustness of observational associations to model, measurement and sampling uncertainty with the vibration of effects framework. Int J Epidemiol. 2021;50(1):266–78.
Article PubMed Google Scholar
Patel CJ, Burford B, Ioannidis JP. Assessment of vibration of effects due to model specification can demonstrate the instability of observational associations. J Clin Epidemiol. 2015;68(9):1046–58.
Article PubMed PubMed Central Google Scholar
Ioannidis JP. Why most discovered true associations are inflated. Epidemiology. 2008;19(5):640–8.
Article PubMed Google Scholar
Aczel B, Szaszi B, Nilsonne G, van den Akker OR, Albers CJ, van Assen MA, Bastiaansen JA, Benjamin D, Boehm U, Botvinik-Nezer R, Bringmann LF, Busch NA, Caruyer E, Cataldo AM, Cowan N, Delios A, van Dongen NN, Donkin C, van Doorn JB, Dreber A, Dutilh G, Egan GF, Gernsbacher MA, Hoekstra R, Hoffmann S, Holzmeister F, Huber J, Johannesson M, Jonas KJ, Kindel AT, Kirchler M, Kunkels YK, Lindsay DS, Mangin JF, Matzke D, Munafò MR, Newell BR, Nosek BA, Poldrack RA, van Ravenzwaaij D, Rieskamp J, Salganik MJ, Sarafoglou A, Schonberg T, Schweinsberg M, Shanks D, Silberzahn R, Simons DJ, Spellman BA, St-Jean S, Starns JJ, Uhlmann EL, Wicherts J, Wagenmakers EJ. Consensus-based guidance for conducting and reporting multi-analyst studies. Elife. 2021;10:e72185.
Article PubMed PubMed Central Google Scholar
Botvinik-Nezer R, Holzmeister F, Camerer CF, Dreber A, Huber J, Johannesson M, Kirchler M, Iwanir R, Mumford JA, Adcock RA, Avesani P, Baczkowski BM, Bajracharya A, Bakst L, Ball S, Barilari M, Bault N, Beaton D, Beitner J, Benoit RG, Berkers RMWJ, Bhanji JP, Biswal BB, Bobadilla-Suarez S, Bortolini T, Bottenhorn KL, Bowring A, Braem S, Brooks HR, Brudner EG, Calderon CB, Camilleri JA, Castrellon JJ, Cecchetti L, Cieslik EC, Cole ZJ, Collignon O, Cox RW, Cunningham WA, Czoschke S, Dadi K, Davis CP, Luca A, Delgado MR, Demetriou L, Dennison JB, Di X, Dickie EW, Dobryakova E, Donnat CL, Dukart J, Duncan NW, Durnez J, Eed A, Eickhoff SB, Erhart A, Fontanesi L, Fricke GM, Fu S, Galván A, Gau R, Genon S, Glatard T, Glerean E, Goeman JJ, Golowin SAE, González-García C, Gorgolewski KJ, Grady CL, Green MA, Guassi Moreira JF, Guest O, Hakimi S, Hamilton JP, Hancock R, Handjaras G, Harry BB, Hawco C, Herholz P, Herman G, Heunis S, Hoffstaedter F, Hogeveen J, Holmes S, Hu CP, Huettel SA, Hughes ME, Iacovella V, Iordan AD, Isager PM, Isik AI, Jahn A, Johnson MR, Johnstone T, Joseph MJE, Juliano AC, Kable JW, Kassinopoulos M, Koba C, Kong XZ, Koscik TR, Kucukboyaci NE, Kuhl BA, Kupek S, Laird AR, Lamm C, Langner R, Lauharatanahirun N, Lee H, Lee S, Leemans A, Leo A, Lesage E, Li F, Li MYC, Lim PC, Lintz EN, Liphardt SW, Losecaat Vermeer AB, Love BC, Mack ML, Malpica N, Marins T, Maumet C, McDonald K, McGuire JT, Melero H, Méndez Leal AS, Meyer B, Meyer KN, Mihai G, Mitsis GD, Moll J, Nielson DM, Nilsonne G, Notter MP, Olivetti E, Onicas AI, Papale P, Patil KR, Peelle JE, Pérez A, Pischedda D, Poline JB, Prystauka Y, Ray S, Reuter-Lorenz PA, Reynolds RC, Ricciardi E, Rieck JR, Rodriguez-Thompson AM, Romyn A, Salo T, Samanez-Larkin GR, Sanz-Morales E, Schlichting ML, Schultz DH, Shen Q, Sheridan MA, Silvers JA, Skagerlund K, Smith A, Smith DV, Sokol-Hessner P, Steinkamp SR, Tashjian SM, Thirion B, Thorp JN, Tinghög G, Tisdall L, Tompson SH, Toro-Serey C, Torre Tresols JJ, Tozzi L, Truong V, Turella L, van ’t Veer AE, Verguts T, Vettel JM, Vijayarajah S, Vo K, Wall MB, Weeda WD, Weis S, White DJ, Wisniewski D, Xifra-Porxas A, Yearling EA, Yoon S, Yuan R, Yuen KSL, Zhang L, Zhang X, Zosky JE, Nichols TE, Poldrack RA, Schonberg T. Variability in the analysis of a single neuroimaging dataset by many teams. Nature. 2020;582(7810):84–8.
Article CAS PubMed PubMed Central Google Scholar
Ballin M, Ioannidis JPA, Bergman J, Kivipelto M, Nordstrom A, Nordstrom P. Time-varying death risk after SARS-CoV-2-infection in Swedish long-term care facilities. BMJ Open. 2022;12(11):e066258. https://doi.org/10.1136/bmjopen-2022-066258
Lee RD, Carter LR. Modeling and forecasting US mortality. J Am Stat Assoc. 1992;87:659–71.
Google Scholar
Booth H, Hyndman RJ, Tickle L, de Jong P. Lee-Carter mortality forecasting: a multi-country comparison of variants and extensions. Demogr Res. 2006;15:289–310.
Article Google Scholar
Bergeron-Boucher M-P, Kjærgaard S. Mortality forecasting at age 65 and above: an age-specific evaluation of the Lee-Carter model. Scand Actuar J. 2022;1:64–79.
Article Google Scholar
https://www.cdc.gov/nchs/data/nhis/earlyrelease/insur201905.pdf, last accessed August 31, 2022.
Woolf SH. Progress in achieving health equity requires attention to root causes. Health Aff (Millwood). 2017;36(6):984–91.
Article PubMed Google Scholar
Sommers BD, Gunja MZ, Finegold K, Musco T. Changes in self-reported insurance coverage, access to care, and health under the affordable care act. JAMA. 2015;314(4):366–74.
Article CAS PubMed Google Scholar
Flegal KM, Carroll MD, Kit BK, Ogden CL. Prevalence of obesity and trends in the distribution of body mass index among US adults, 1999–2010. JAMA. 2012;307(5):491–7.
Article PubMed Google Scholar
Kuehn BM. Massive costs of the US opioid epidemic in lives and dollars. JAMA. 2021;325(20):2040.
PubMed Google Scholar
Bauchner H, Rivara FP, Bonow RO, Bressler NM, Disis MLN, Heckers S, Josephson SA, Kibbe MR, Piccirillo JF, Redberg RF, Rhee JS, Robinson JK. Death by gun violence-a public health crisis. JAMA. 2017;318(18):1763–4.
Article PubMed Google Scholar
Health systems in transition series, in: https://eurohealthobservatory.who.int/publications/health-systems-reviews?publicationtypes=e8000866-0752-4d04-a883-a29d758e3413&publicationtypes-hidden=true, last accessed August 31, 2022.
Laliotis I, Ioannidis JPA, Stavropoulou C. Total and cause-specific mortality before and after the onset of the Greek economic crisis: an interrupted time-series analysis. Lancet Public Health. 2016;1(2):e56–65.
Article PubMed Google Scholar
Akhtar-Danesh N, Baumann A, Crea-Arsenio M, Antonipillai V. COVID-19 excess mortality among long-term care residents in Ontario, Canada. PLoS ONE. 2022;17(1): e0262807.
Article CAS PubMed PubMed Central Google Scholar
August M. Socialize, De-commodify and de-financialize long-term care. Healthc Pap. 2021;20(1):34–9.
Article PubMed Google Scholar
Ballin M, Bergman J, Kivipelto M, Nordström A, Nordström P. Excess mortality after COVID-19 in Swedish long-term care facilities. J Am Med Dir Assoc. 2021;22(8):1574-1580.e8.
Article PubMed PubMed Central Google Scholar
Vestergaard LS, Nielsen J, Richter L, Schmid D, Bustos N, Braeye T, Denissov G, Veideman T, Luomala O, Möttönen T, Fouillet A, Caserio-Schönemann C, an der Heiden M, Uphoff H, Lytras T, Gkolfinopoulou K, Paldy A, Domegan L, O’Donnell J, de’ Donato F, Noccioli F, Hoffmann P, Velez T, England K, van Asten L, White RA, Tønnessen R, da, Silva SP, Rodrigues AP, Larrauri A, Delgado-Sanz C, Farah A, Galanis I, Junker C, Perisa D, Sinnathamby M, Andrews N, O’Doherty M, Marquess DFP, Kennedy S, Olsen SJ, Pebody R, Krause TG, Mølbak K, ECDC Public Health Emergency Team for COVID-19. Excess all-cause mortality during the COVID-19 pandemic in Europe–preliminary pooled estimates from the EuroMOMO network, March to April 2020. Eurosurveillance. 2020;25(26):2001214.
Article PubMed PubMed Central Google Scholar
Nielsen J, Mazick A, Andrews N, Detsis M, Fenech TM, Flores VM, Foulliet A, Gergonne B, Green HK, Junker C, Nunes B, O’donnell J, Oza A, Paldy A, Pebody R, Reynolds A, Sideroglou T, Snijders BE, Simon-Soria F, Uphoff H, Van Asten L, Virtanen MJ, Wuillaume F, Mølbak K. Pooling European all-cause mortality: methodology and findings for the seasons 2008/2009 to 2010/2011. Epidemiol Infect. 2013;141(9):1996–2010.
Article CAS PubMed Google Scholar
Portugal L. Mortality and Excess Mortality: Improving FluMOMO. Journal of Environmental and Public Health, 2021, 1–8.
Kjærgaard S, Ergemen YE, Bergeron-Boucher MP, Oeppen J, Kallestrup-Lamb M. Longevity forecasting by socio-economic groups using compositional data analysis. J R Stat Soc Ser A. 2020;183:1167–87.
Article Google Scholar
Ioannidis JP. The end of the COVID-19 pandemic. Eur J Clin Invest. 2022;53: e13782.
Article Google Scholar
Stoeldraijer L, van Duin C, van Wissen L, Janssen F. Impact of different mortality forecasting methods and explicit assumptions on projected future life expectancy: the case of the Netherlands. Demogr Res. 2013;29:323–54.
Article Google Scholar
Human mortality database, short-term mortality fluctuations. In: https://www.mortality.org/Data/STMF, last accessed August 20, 2022.
Wilmoth JR, Andreev K, Jdanov D, Glei DA, Boe C, Bubenheim M, Philipov D, Shkolnikov V, Vachon P.. Methods protocol for the human mortality database. University of California, Berkeley, and Max Planck Institute for Demographic Research, Rostock. URL: https://www.mortality.org [version 31/05/2007], 9, pp.10–11.
Jdanov DA, Jasilionis D, Shkolnikov VM and Barbieri M. Human mortality database. Encyclopedia of gerontology and population aging/editors Danan Gu, Matthew E. Dupre. Cham: Springer International Publishing, 2020.

Download references

Funding

National Institute of General Medical Sciences, NIH R35 GM122543

Author information

Authors and Affiliations

Department of Structural Biology, Stanford University, Stanford, CA, 94305, USA
Michael Levitt
Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, Shanghai, 201210, China
Francesco Zonta
Departments of Medicine, of Epidemiology and Population Health, of Biomedical Data Science, and of Statistics, and Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, 94305, USA
John P. A. Ioannidis

Authors

Michael Levitt
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Zonta
View author publications
You can also search for this author in PubMed Google Scholar
John P. A. Ioannidis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ML and JPAI had the original idea. ML analyzed the data with contributions from FZ and JPAI. JPAI and ML wrote the paper, and all three authors interpreted the data, edited the paper, and approved the final version.

Corresponding author

Correspondence to John P. A. Ioannidis.

Ethics declarations

Conflict of interest

The authors declars that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 7716 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Levitt, M., Zonta, F. & Ioannidis, J.P.A. Excess death estimates from multiverse analysis in 2009–2021. Eur J Epidemiol 38, 1129–1139 (2023). https://doi.org/10.1007/s10654-023-00998-2

Download citation

Received: 03 November 2022
Accepted: 27 March 2023
Published: 12 April 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s10654-023-00998-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Excess death estimates from multiverse analysis in 2009–2021

Abstract

Similar content being viewed by others

Estimates of excess mortality during the COVID-19 pandemic strongly depend on subjective methodological choices

Excess all-cause mortality in the USA and Europe during the COVID-19 pandemic, 2020 and 2021

How to measure premature mortality? A proposal combining “relative” and “absolute” approaches

Introduction

Results

Variability of excess death estimates according to reference baselines

Stability of relative ranking for the pandemic years’ excess deaths across 33 countries

Diversity in time patterns across 33 countries

Excess death estimates in recent years using different projected period time windows

Discussion

Materials and methods

Data

Excess death calculations

Systematic Variation of Assumptions for Multiverse Analyses

Analyses with down weighing for older years

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (PDF 7716 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Excess death estimates from multiverse analysis in 2009–2021

Abstract

Similar content being viewed by others

Estimates of excess mortality during the COVID-19 pandemic strongly depend on subjective methodological choices

Excess all-cause mortality in the USA and Europe during the COVID-19 pandemic, 2020 and 2021

How to measure premature mortality? A proposal combining “relative” and “absolute” approaches

Introduction

Results

Variability of excess death estimates according to reference baselines

Stability of relative ranking for the pandemic years’ excess deaths across 33 countries

Diversity in time patterns across 33 countries

Excess death estimates in recent years using different projected period time windows

Discussion

Materials and methods

Data

Excess death calculations

Systematic Variation of Assumptions for Multiverse Analyses

Analyses with down weighing for older years

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (PDF 7716 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation