Introduction

The ongoing global COVID-19 pandemic is a challenge to humankind with a high death toll — more than 2.2 million persons lost their lives, with a large number of countries worldwide affected, and in excess of 102 million people had contracted the viral disease as of January 2021 (Johns Hopkins University 2021) [1, 2]. COVID-19 is an exogenous health threat that poses a particular challenge for health systems worldwide [9] because (1) vaccines do not yet apply any relevant influence, (2) forecasting the number and severity of infections is difficult, and (3) empirical evidence regarding the suitability of pharmaceutical (COVID-19 testing) or non-pharmaceutical interventions (travel restrictions, social distancing measures, partial or complete lockdowns) is missing [119]. As the pandemic is still ongoing, this poses a specific and comprehensive challenge to research and science to address this problem with all available tools, methods, and insights. The efficient allocations of health care resources, e.g., testing policies [27], hospital admissions [73], and intensive care capacity [132], are major health policy challenges [43, 55]. Therefore, an evaluation of the actions taken to date to address the pandemic is highly relevant for health economics [23, 72]. Of particular interest in the context of pandemics is the tension between medical efficacy and economic efficiency [69], which is increased by the quality of health care and the equitable use of health goods. When evaluating medical or health policy interventions during or after pandemic outbreaks, health economics analyses adopt different perspectives: (1) the perspective of health service providers (doctors, hospitals) studying, e.g., direct costs necessary to treat patients [22, 84], stockpiling of drugs [7, 63], or withholding effective novel antidotes [90], (2 patient-centered investigations on, e.g., consumer learning in vaccination decisions [88], (3) examinations through the lens of vaccine producers, e.g., analyzing the profit-maximizing capacity [53], and (4) studies evaluating the economic effect of pandemics, e.g., on companies through the lens of employee absences from work [40], effects on tourism and certain production sectors [108], and overall effects on a country’s economy [74, 77]. The objective of our study was to provide a country-specific efficiency evaluation of the fight against COVID-19 for 19 OECD countries while focusing on the role of pre-pandemic health care policy and its interconnection to ad hoc interventions (case 1 and case 2), as well as the impact of COVID-19 as an exogenous health threat to the country’s economy (case 3) [8]. We aspired to answer the following research questions: (1) "How efficiently did OECD countries handle the COVID-19 outbreak?" and (2) "What are the reasons for efficient or less efficient COVID-19 handling?" We employed a multi-factor evaluation approach based on data envelopment analysis (DEA) as a neutral efficiency measurement method to enable fact-based discussions and decision processes. DEA is an established and widely used method for efficiency measurement in healthcare management [71, 79, 81, 118, 135]. The evaluation approach integrates different levels of governmental decision-making and from a cross-country, also chosen by, e.g., Vogler and Fischer [128],: (1) factors considering pre-pandemic government health strategies measurable through governmental expenditures, (2) pre-pandemic characteristics of the respective health system representing health resources, (3) indicators expressing the consequences of governmental interventions during the COVID-19 outbreak on the economy, and (4) governmental interventions against COVID-19 taking into account the country-specific state of the pandemic. While our input factors are chosen to quantify the direct costs that are necessary for the treatment of a patient (health expenditures, number of doctors and hospital beds), the output factors are not quantified in monetary terms but as therapeutic outcomes and presented as clinical or physical quantities (infections, deaths, recoveries) because we wanted to consider patient-relevant measures.

The contribution of this paper is (1) the specific quantitative efficiency measurement approach with the DEA technique applied to a strategic-level evaluation of COVID-19 responses in the 19 examined OECD countries. In addition, this enables (2) a new perspective on COVID-19 countermeasures from an integrated health care and economics perspective based on empirical real-life data. Furthermore, (3) evaluations of individual measures directed toward the objective of an overall resource-efficient answer to viral pandemics that can be analyzed as a general objective measure are introduced. Therefore, we add new insights on the existing lessons learned from the management of the COVID-19 pandemic presented by Forman et al. [52].

This paper is structured as follows: the literature review section highlights the intersection of efficiency measurement and health economic activities during epidemics and pandemics from an interdisciplinary perspective (Sect. Literature view). Then, the methodology section describes the data used in this paper and elaborates on the use of DEA models to measure the efficiency of health care policies during the COVID-19 outbreak (Sect. 3). Section 4 presents the results of three longitudinal efficiency analyses (1) investigating the health system efficiency during COVID-19, (2) decompensating the health system and governmental ad hoc intervention efficiency, and (3) examining the impact of interventions on the country's economies. The results are discussed in Sect. 5. Finally, conclusions and the outlook toward future research are presented in Sect. 6.

Literature review

A brief literature review was applied to structure a conceptual basis for the specific analysis implemented in this paper regarding COVID-19 overall efficiency evaluation. This is important as though it seems from the 2020 perspective that pandemic challenges are new on a global scale, this is not the case when looking back in a larger timeframe. In order to identify the key words and topics from previous research on this issue for the last 150 years, an overview is implemented as follows: For keywords, the following entries were used in searches of international academic journals: (1) “pandemic”, in combination with (2a) “public economics”, (2b) “government policy”, and (2c) “efficiency”. In a second search round, “pandemic” was replaced by “COVID-19”. Papers were selected to achieve a topical representation.

The structured results are presented in Table 1. Pandemic situations have been regarded as rare but special public economic and health challenges since the start of scientific discussions thereof in the nineteenth century (No 1–3 in Table 1). Additionally, characteristically, efficacy evaluation to date has occurred from many different medical science perspectives, including health care, public economics, and public health science as well as social sciences; discipline-specific journals are indicated in Table 1 (No 4–11). This pandemic is nevertheless seldom seen as a public economic challenge, as only very large pandemic situations actually strain public resources in the sense of crucial political and economic decisions to be made. Only rare events have a significant impact on public and private economic development in societies, and this is not yet reflected adequately in the research literature; this is another thing the current 2020 pandemic has changed. Finally, the COVID-19 pandemic in 2020 has already brought about a dedicated and enormous body of research literature in a broad range of disciplines, which corresponds with the global public economic impact of COVID-19 within this short timeframe (No 12–22). Case studies and different research results are reported from a very diverse set of countries as the COVID-19 pandemic has struck all countries (No 23–26). The research includes very specific questions, such as economic perspectives and business impact, engineering perspectives regarding safety impacts, and medical perspectives on cross-effects from other diseases and their treatment (No 27–32). Further interesting comparative analyses with earlier results can and will be addressed in future research with further insights. For the subsequent parts of this paper it is important to recognize that pandemic crises have been subject to research from many disciplines already — but without the methodology to integrate that into a coherent quantified evaluation scheme as it is proposed in this paper [18, 19].

Table 1 Literature analysis regarding pandemic situations from the public health perspectiveKlicken oder tippen Sie hier, um Text einzugeben

Data and methodology

Dataset

The data used for the efficiency analysis were retrieved from several databases. First, general country-specific data, e.g., health spending, number of doctors and hospital beds, and unemployment rates, were obtained through database queries from the OECD database OECD Stat [100]. Second, the number of performed COVID-19 tests was gained through the United Nations Office for the Coordination of Humanitarian Affairs (OCHA) and downloaded on November 16, 2020 [96]. Third, the number of COVID-19 cases and deaths, as well as the number of recovered COVID-19 patients, was retrieved from the COVID-19 Dashboard by the Center for Systems Science and Engineering at Johns Hopkins University (JHU) [39] and downloaded on November 16, 2020. Fourth, to interpret the efficiency scores of each country, we use data on non-pharmaceutical governmental interventions from the Assessment Capacities Project [6], which had 14,848 database records on reported measures for 194 countries and the COVID-19 Government Response Stringency Index (GRSI) of the Oxford Covid-19 Government Response Tracker [62]. In total, our dataset includes 19 countries with mostly publicly financed healthcare systems and a minimum total population of two million inhabitants: Australia (AUS), Austria (AUT), Belgium (BEL), Canada (CAN), Czech Republic (CZE), Denmark (DEN), Finland (FIN), France (FRA), Germany (DEU), Ireland (IRE), Italy (ITA), Japan (JPN), Netherlands (NLD), Norway (NOW), Slovenia (SLN), South Korea (KOR), Spain (ESP), Sweden (SWE), and the United Kingdom (GBR).

Data envelopment analysis

To calculate the efficiency of each country, this paper proposed a DEA model with an output-oriented ratio form under constant returns to scale (CRS). In general, DEA is a non-parametric optimization method of mathematical programming for measuring the relative efficiency of decision-making units (DMUs) that have multiple inputs and outputs. A basic model was introduced by Charnes, Cooper, and Rhodes [29] based on the Koopmans activity analysis concept [82] together with the publications of Debreu and Farrell dealing with radial efficiency measurement [36, 49]. The optimization method can be based on CRS in the CCR model [29] or variable returns on scale (VRS) in the BCC model [11], and each case has an input or output orientation. The linear program for the CCR model is [32], pp. 23–24):

$$\left( {LP_{0} } \right) max_{\mu , \vartheta } \theta = \mu_{1} \gamma_{1o} + ... + \mu_{s} \gamma_{so}$$
(1)
$$subject\, to \,\vartheta_{1} x_{1o} + ... + \vartheta_{m} x_{mo} = 1$$
(2)
$$\begin{gathered} \mu_{1} \gamma_{1j} + .... + \mu_{s} \gamma_{sj} \le \vartheta_{1} x_{1j} + \ldots + \vartheta_{m} x_{mj} \hfill \\ \left( {j = 1, \ldots , n} \right) \hfill \\ \end{gathered}$$
(3)
$$\vartheta_{1} , \vartheta_{2} , \ldots , \vartheta_{m} \ge 0$$
(4)
$$\mu_{1} , \mu_{2} , \ldots , \mu_{s} \ge 0$$
(5)

The basic idea is to calculate an efficiency frontier that is used as a best practice input–output combination for the underlying production scenario. A score of 1.0 indicates that a DMU is efficient and on the efficiency frontier, whereas the relative inefficiency of a DMU can be determined by measuring the distance between the individual DMU performance and the efficiency frontier. Measuring efficiency under the assumption of CRS is known as overall technical efficiency (OTE). This includes the determination of inefficiency/efficiency based on (1) the input/output transformation, meaning pure technical efficiency (PTE), as well as (2) the size of operations, meaning the scale efficiency (S.E.). DEA is a frequently applied methodology for public health question on several aggregation levels, e.g., regarding (1) hospitals [3, 4],D. C. [50, 51, 87], (2) the evaluation of healthcare reforms [44, 105, 106], (3) healthcare infrastructures [26], and (4) health care systems [94]. It has also been applied to evaluate the health care production of OECD countries [125, 126].

The traditional DEA methodology was extended in several directions: To evaluate the efficiency of panel data and enable time series efficiency measurement per DMU, Charnes et al. [28] proposed DEA window analysis. The dynamic perspective of this model treats the same DMU occurring in different periods as entirely different DMUs. The major benefit of its moving average method is that the number of DMUs increase, and dynamic changes of the efficiency per DMU can be evaluated. Furthermore, to disclose the black-box assumption of the traditional DEA model, Färe and Primont [48] and Färe [46] proposed Network DEA where a production process is decompensated in sub-technologies or nodes. The specification of these nodes enables the examination of input/output allocations as well as intermediate products, which represent the entire production process [47, 120].

Design and application of the DEA model

The selection of applicable inputs and outputs, as well as the design of a suitable DEA model, is a well-known source of pitfalls within the DEA literature [41]. Researchers are often facing a choice between the (1) empirical quantitative data that are published, e.g., by governments or organizations, and (2) the restrictions of the DEA model concerning input/output selection. Therefore, we followed a four-step framework for DEA application proposed by Jain et al. [70]: (1) select applicable inputs and outputs for the subject of research, (2) validate the inputs and outputs, (3) select the applicable DEA model, and (4) execute the DEA model in step 4.

First, possible input and output factors must be directly related to government policy and health policy, as well as the COVID-19 outbreak, and can be separated into four groups: (1) factors considering pre-pandemic health strategies measurable through governmental expenditures or revenues, e.g., health or pharmaceutical spending; (2) pre-pandemic characteristics of the respective health systems representing resources that are available during an epidemic outbreak and measurable through the existing health infrastructure, e.g., number of doctors and nurses, number of medical and nursing graduates, number of hospital beds and intensive care beds; (3) indicators expressing the quantifiable economic consequences of governmental interventions during the COVID-19 outbreak, e.g., unemployment rates or trade volumes; and (4) governmental interventions against COVID-19 taking into account the country-specific state of the epidemic outbreak, e.g., number of COVID-19 tests performed, number of COVID-19 infections, number of COVID-19 deaths, or number of patients recovered from COVID-19.

Second, to validate the inputs and outputs through empirical data, we follow the steps proposed by Dyson et al. [41]. First, mixing indices and volume measures, as well as integrating percentages, can lead to distortions of the efficiency values. Since most data are available in the form of ratios, volume measures, e.g., total population, are excluded from the analysis or converted to ratios. Consequently, the DEA model is calculated by applying indices per inhabitant or per 1,000,000 inhabitants. Second, linked input/output values have to be avoided, e.g., considering the total number of tests performed and the number of tests per inhabitant leads to distortions of the efficiency scores. Third, a cross-correlation of the available factors has to be avoided. For instance, health spending per capita and pharmaceutical spending per capita (r = 0.68), as well as the number of nurses and doctors (r = 0.76), are highly correlated. To finish step two, we define the following input (I) and output (O) factors: I1 health expenditures in US dollars per inhabitant, I2 number of doctors per 1,000,000 inhabitants, I3 number of hospital beds per 1,000,000 inhabitants, I4 number of COVID-19 tests per 1,000,000 inhabitants, O1 number of COVID-19 deaths per 1,000,000 inhabitants, O2 number of patients recovered from COVID-19 per 1,000,000 inhabitants, O3 number of COVID-19 cases per 1,000,000 inhabitants, and O4 unemployed individuals per 1,000,000 inhabitants assigned to the labor force. Especially for I1 to I3, we are aware of the fact that the possible regional organization of health care systems is a key issue for the efficient handling of pandemics. E.g., hospitals traditionally organized to deliver patient-centric care are ill-equipped to deliver the type of community-focused care needed during a pandemic (For research regarding the further COVID-19-related development of public hospitals, the reader is referred to Rodríguez et al. [110]. But as most health care systems are highly decentralized, it is difficult to quantify that different regions tried various policy responses. As spatial doctor density or any index accounting for the local public health infrastructure and surveillance (e.g., the number of community doctors as compared to that of hospital specialists) is not available in cross-country databases, our approach is based on the currently available data aggregation level of countries. Appendix Table 4 summarizes the key attributes of the dataset by applying descriptive statistics, and a further correlation analysis proves that there is no linear statistical relationship between the applied input and output measures. Hereafter, we explain the variables in detail:

I1 Health expenditures in US dollars per inhabitant measures the final consumption of health care goods and services that are financed through a mix of financing arrangements, including government spending and compulsory health insurance, as well as voluntary health insurance and private funds, such as households' out-of-pocket payments, NGOs and private corporations [98]. This factor quantifies the pre-pandemic government health care policies determining the majority of the health care resources that are available during the COVID-19 pandemic (fixed).

I2 Number of doctors per 1,000,000 inhabitants, which defines doctors as practicing doctors providing direct care to patients. Doctors are usually generalists who assume responsibility for the provision of continuing care to individuals and families or specialists, such as pediatricians, obstetricians, gynecologists, psychiatrists, medical specialists, and surgical specialists [97]. As doctors are an essential resource in fighting pandemics, the input factor expresses a major pre-pandemic variable that reflects the resources of a health system (fixed).

I3 Number of hospital beds per 1,000,000 inhabitants quantifies the available resources for delivering health services to patients in hospitals in terms of the number of beds that are maintained, staffed, and available for use. It is the second measure to quantify the pre-pandemic health system resources available to fight against COVID-19 [99].

I4 Number of COVID-19 tests per 1,000,000 inhabitants includes diagnostic testing for COVID-19, which looks for the presence of the virus in specimens obtained from patients. The number of performed tests is reported by OCHA [96]. It is a factor that quantifies the nature and extent of ad hoc government policies against COVID-19 and enables its integration in an efficiency analysis. An even better fit to quantify this aspect could be the number of tests performed per day compared to the total number of tests that can be processed per day. However, as this capacity is mostly depending on highly decentralized lab capacity, it was not quantifiable in the course of this research [21, 25].

O1 Number of COVID-19 deaths per 1,000,000 inhabitants expresses the mortality of the pandemic and is, therefore, one of the most important measures to quantify the burden of COVID-19. Countries throughout the world have reported very different case-fatality ratios, e.g., the number of deaths divided by the number of confirmed cases [39]. O1 solely includes the reported deaths associated with COVID-19. As DEA would value a large number of deaths as a large output and, therefore, as highly efficient, O1 is integrated as an undesirable output.

O2 Number of patients recovered from COVID-19 per 1,000,000 inhabitants quantifies the number of individuals who successfully recovered from their COVID-19 infection and were discharged from hospitals and self-isolation facilities. As the exact date of recovery is unknown in most cases, the Robert Koch Institute and Johns Hopkins University use algorithms to estimate the number of recovered cases [39].

O3Number of COVID-19 cases per 1,000,000 inhabitants quantifies the number of individuals who are infected within an examined time period. Reducing the number of new COVID-19 cases will help countries prevent overloading of their health care systems. Because the main transmission pathway for COVID-19 is the respiratory absorption of virus-containing fluid particles that are produced during breathing, coughing, speaking, and sneezing, governmental restrictions include partial or total lockdown, traveling limitations, and social distancing restrictions. O3 can be used to integrate an evaluation regarding the suitability of timing, nature, and extent for ad hoc governmental measures [39]. As DEA would value a large number of cases as a large output and, therefore, as highly efficient, O3 is integrated as an undesirable output.

O4Unemployed individuals per 1,000,000 inhabitants assigned to the labor force are integrated into a separate model to measure the impact of governmental restrictions on the economy, as well as the success of governmental interventions through, e.g., tax reductions, monetary subsidies for short-term work, or loan programs. Unemployed inhabitants are people of working age who are without work, are available for work and have taken specific steps to find work. This indicator is measured in numbers of unemployed people per 1,000,000 inhabitants of the labor force. The labor force is defined as the total number of unemployed people plus those in employment [101]. Thus, inhabitants outside of the working age are excluded. O4 is integrated as an undesirable output [31, 33, 35].

In step three of the DEA application framework, before the execution of the computations in step four, the number of defined inputs and outputs can also provide another pitfall, as discussed by Dyson et al. [41], who proposes that the number of observation points must be at least 2 times the number of inputs times the number of outputs. With four inputs, four outputs, and 228 observation points (19 countries times 12 periods, each lasting one week), our model fulfills this criterion. Furthermore, DEA models can aspire to (1) maximize desirable outputs, (2) minimize undesirable outputs, (3) maximize desirable inputs, or (4) minimize normal inputs [64]. In the course of this paper, we use an output-oriented model. As the output-oriented model seeks to maximize outputs, O1, O3, and O4 are treated as undesirable outputs that have to be minimized [61, 114, 115].

To support the choice of available scale assumption, semi-parametric statistical tests, known as Banker's tests or tests of goodness of fit, were applied; H0 was the assumption of CRS against the alternative of H1 with VRS [10, 12]. For execution, we follow the example of Giokas et al. [58], assuming that hC and hB are the DEA inefficiency scores estimated from the BBC and CCR models. In the first hypothesis test (formula 1), h is assumed to follow the exponential distribution in the two models and evaluated relative to the critical value of the F-distribution with (2 N, 2 N) degrees of freedom, while in the second (formula 2), h is assumed to be half-normally distributed for both models and evaluated relative to the critical value of the F-distribution with (N, N) degrees of freedom [58], p. 1941).

$$T_{EX} = \frac{{\mathop \sum \nolimits_{j} \left( {h^{C} - 1} \right)}}{{\mathop \sum \nolimits_{j} \left( {h^{B} - 1} \right)}}$$
(6)
$$T_{HN} = \frac{{\mathop \sum \nolimits_{j} \left( {h^{C} - 1} \right)^{2} }}{{\mathop \sum \nolimits_{j} (h^{B} - 1)^{2} }}$$
(7)

The results of TEX = 2.03 and THN = 4.11 indicate that the null hypothesis of constant returns to scale cannot be rejected at the 5% level of significance for an exponential distribution and normally distributed h score. Therefore, we applied the CCR model with CRS.

Empirical results

Comparability of health systems in time-delayed COVID-19 outbreaks

Being able to evaluate and compare the efficiency of healthcare systems during the COVID-19 pandemic requires considering two circumstances that need to be addressed before applying the evaluation framework elaborated in the previous chapter: (1) the comparability of the health systems of OECD countries, and (2) time-delayed COVID-19 outbreaks and their operationalization. Several dimensions of a healthcare system, e.g., the level and financing of resources, the role of key actors including the state and other societal or private actors, and access regulation, should be noted when comparing various countries [45]. To systematize country-specific characteristics of health systems, research approaches have developed typologies of health systems that aim to better characterize and categorize health systems for European (P. L. [50, 51] and OECD countries [20, 109, 121, 131]. Böhm et al. [20] applied the typology developed by Rothgang et al. [111] that distinguishes the three dimensions of (1) regulation, (2) financing, and (3) service provision, as well as the three types of actors, namely (1) the state, (2) societal actors, and (3) private actors. Wendt [131] proposed four types of healthcare systems that mainly focus on health expenditure with public financing and out-of-pocket payments by patients, as well as on the payment of general practitioners and access regulation. Reibling et al. [109] develop five typologies integrating the dimensions: (1) the level of resources of healthcare systems operationalized through expenditure on healthcare per capita and the number of general practitioners per thousand inhabitants, (2) the public–private mix operationalized through the share of public health expenditures of the total health expenditure, the share of out-of-pocket payments of the total health expenditure, and the payment of specialists; (3) access regulation, including whether individuals are required to register at a general practitioner and how a person can access specialist care; (4) primary care orientation operationalized through the ratio of general practitioners to specialists and the share of health expenditure on outpatient care of the total health expenditure, and (5) the performance measured by the level of achievement for goals in the prevention and the quality of care. In our analysis, we included all countries that were and categorized then into the following clusters: (1) the supply- and choice-oriented public types (AUS, AUT, BEL, CZE, DEU, FRA, IRL, SVN), (2) the performance- and primary-care-oriented public types (FIN, JPN, KOR, NOR, SWE), and (3) the regulation-oriented public types (CAN, DNK, ESP, GBR, ITA, NLD). The fourth cluster (4), the low-supply and low-performance mixed type, was excluded because the available data were suitable, not suitable, or sufficient for the purpose of our analysis. Therefore, we only compare public health systems and exclude the supply- and performance-oriented private types [42].

Time-delayed outbreaks with a varying number of infections lead to country-specific scenarios in the context of COVID-19. While (1) AUT, BEL, CZE, ESP, FIN, FRA, IRE, ITA, NLD, NOR, and KOR flattened the COVID-19 curve in terms of the number of new infections per day until July 1, 2020; (2) CAN, DEU, DNK, ISL, SVN lowered the number of new infections significantly but had a noteworthy number of new infections per day; (3) GBR and SWE had a constantly growing number of new COVID-19 infections since the first death; and (4) AUS and JPN started the second infection wave in July 2020. Being able to deal with these time lags requires the normalization of all countries to a starting time t0 [92]. Thus, we chose the date of the first COVID-19-related death reported by Johns Hopkins University as the beginning of the country-specific COVID-19 outbreak. With this normalization approach, we were able to compare the healthcare system efficiency for the selected OECD countries with the individual progression of COVID-19-related deaths and infections.

DEA window analysis with panel data for time-dependent COVID-19 development

To evaluate the efficiency of health system policies before the epidemic outbreak, we separated the dataset into periods lasting seven days each and conducted a DEA window analysis. The first period started when the first COVID-19-related death appeared. We chose w = 2 as the window width for this analysis, as the maximum incubation time of the COVID-19 virus is 14 days, which is, on the other hand, the minimum period until the effects of governmental practices are measurable. The first model used I1, I2, and I3, as well as O1 and O2. Table 2 summarizes the results.

Table 2 Evaluation of country-specific efficiency through the lens of health system efficiency
  1. (1)

    AUT: Two announcements for partial lockdowns were implemented on March 16, 2020, and March 19, 2020, which was at the end of period 1 in our analysis (Austrian Federal Ministry of Social Affairs, Health, Care and Consumer Protection, 2020). Two weeks later, the efficiency level of Austria increased significantly. After releasing the lockdown in period 6, several local COVID-19 outbreaks led to COVID-19-related death peaks and consequently to a low level of efficiency in subsequent periods.

  2. (2)

    BEL: An equal logic can be inferred in the case of Belgium, where a general lockdown starting on March 18, 2020 [15] limited the leaving of homes, except for emergencies, in period two of our analysis. At this time, the efficiency level was 0.53. Two weeks later, the efficiency level of Belgium increased significantly to 0.80 [59].

  3. (3)

    CZE: A nationwide quarantine was introduced from March 16, 2020 until April 11, 2020 (period 1), except for essential needs, helping others, and necessary trips to families (U.S. Embassy in the Czech Republic, 2020). Two weeks later, the efficiency level of CZE increased significantly from 0.77 in period 1 to 0.90 in period 3. The low-efficiency level in periods 7 to 10 results from a constant (low) death rate, while other countries manage to significantly reduce the number of new deaths in weeks 7 to 10 after the first death.

  4. (4)

    DEU: After a constant increase in new COVID-19 infections, Germany introduced a national lockdown in period 2 on March 22, 2020 [57]. Only essential trips, including for work, were allowed and Germany had more stringent public movement rules. The efficiency level increased significantly from 0.66 to 0.89 and remained constant until the end of the first COVID-19 infection wave.

  5. (5)

    ESP: Starting on March 16, 2020, the government only allowed movements for travels to make essential purchases (food, hygiene, health, first necessity), in strict compliance with basic precautionary measures, travels to access banking services, or travels to provide care and assistance to vulnerable people [117]. After introducing these measures in period 2, with an average efficiency of 0.52, positive development started in period 4 (eff. = 0.77). The decline in efficiency in periods 10 to 13 can be traced back to a high death rate in relation to other countries in these late periods. Therefore, the speed of flattening the curve had a significant impact on countries’ efficiency in fighting the pandemic.

  6. (6)

    FRA: One of the countries with the lowest efficiency level among all periods implemented the lockdown on March 17, 2020, and from a calendrical perspective, FRA had efficiency levels quite similar to those in other European countries [54]. However, France was already in period 5, meaning one month and two days after the first reported COVID-19-related death. It is furthermore interesting to observe that the late lockdown did not have the same effect as the early (period 1 or 2) lockdowns of other countries, as the efficiency level of France constantly remained low in subsequent periods (period 7 and further).

  7. (7)

    UK: The government implemented a lockdown on March 24, 2020 (end of period 3), which was late compared to more efficient European countries, e.g., Germany or Spain. Additionally, the health system has significantly fewer doctors per million inhabitants and hospital beds per million inhabitants compared to Italy and France, which have similar populations. Finally, the resulting constantly high death ratio, with 500 to 1,100 deaths per day during April, as well as high death ratios in the late periods (compared to other countries), led to an overall weak health system performance [60, 66].

  8. (8)

    IRE: The government enforced a partial lockdown on March 23, 2020 (period 2); people could only leave their homes to travel to or from work if they were providing an essential service, to shop for food, to collect medical prescriptions and medical supplies and attend medical appointments, or to carry out vital services, such as caring for a family member, within 2 kms of their houses, while keeping 2 m away from others for social distancing. Similar to other states, the results were observable after two periods in period 4 [67]. Ireland’s health system and additional measures made it possible to maintain a high efficiency level for the rest of the pandemic.

  9. (9)

    ITA: In Italy as one of the worst affected countries during the first COVID-19 wave the political measures developed as follows: On March 8, a decree approved by the Council of Ministers introduced quarantine measures in northern Italy. The regions Lombardy, Veneto, Emilia Romagna and Piedmont are affected and 15 million people — about a quarter of the Italian population – are confined. On March 9, this lockdown was extended to the whole country. All public spots were closed; travel was limited to the strictly necessary (health, food, and work when working remotely is impossible). All gatherings were prohibited and punishable by fines. On March 11, adoption of a new decree from the Council of Ministers imposed the closure of all businesses except for pharmacies and grocery stores. March 11 was therefore basically the start of the nation-wide lockdown. On March 22 and 23, this was de jure adopted by an ordinance by the Ministries of Health and of the Interior prohibiting citizens from leaving their city of lockdown, but this simply did not add anything at all to March 11 decree [68]. About three weeks later, the efficiency level in Italy increased significantly. However, the low average efficiency of Italy can be connected to the fact that the lockdown was implemented late when comparing with the timing in other states. The same observation was made for Great Britain [75, 76, 78]. For a deeper discussion on the measures taken by the Italian government, the reader is referred to Mauro and Giancotti [89] and Valent et al. [124].

  10. (10)

    KOR: Until July 2020, South Korea was the only country with a population of over 50 million inhabitants that had slowed the spread of the virus and flattened the curve of new infections without shutting down the whole country. Furthermore, there were no extreme personal travel or movement restrictions and no closure of airports. After the first COVID-19 case on January 19, 2020, and the first COVID-19-related death on February 20, 2020, South Korea raised the alert level to red on February 23, 2020 (period 1) (The Government of the Republic of Korea, 2020). The country managed to avoid high case numbers through intelligent and digital COVID-19 management, as well as persistent tracking, tracing, and testing of infected persons who were quickly identified and treated at an early stage. The country is thus a unique example of efficient COVID-19 control without lockdown. However, in many European and North American countries, involving police in matters of public health combined together with data sharing is seen as an act of criminalizing illnesses and, therefore, highly critical [93].

  11. (11)

    SVN: The country with the least inhabitants in the whole dataset also had the lowest input values for the average health spending per capita. On the other hand, the number of doctors per million inhabitants and the number of hospital beds per million inhabitants are higher in better-developed countries, e.g., Belgium, Great Britain, the Netherlands, or Norway. Due to the number of inhabitants (2 million), it is difficult for Slovenia to find measures that keep the relative number of deaths or new cases of infections as low as for countries with more than 50 times more inhabitants (e.g., Japan). Therefore, Slovenia can be seen as an example for the case that fewer infections should have been possible through a theoretic performance-oriented view, while this seems unrealistic from a practical viewpoint [86].

  12. (12)

    AUS, CAN, JPN, and NLD have early lockdowns in the first period and consequent infection tracking in common, which led to a stable and, compared to other countries, highly efficient health system with a mean efficiency score above 0.90.

  13. (13)

    Similar policies were adopted in the Northern European countries of DNK, FIN, NOR, and SWE, to keep restaurants and primary schools open and rely on citizens adhering to social distancing recommendations themselves. This seems to be another highly successful strategy, as the mean efficiency score was constantly above 0.90. Furthermore, these states have highly developed public health systems.

Network DEA for pre-epidemic health strategy and COVID-19 testing as an ad hoc intervention

To determine more about the interdependency of pre-epidemic health system strategies and ad hoc interventions when fighting COVID-19, the following network DEA approach separates the efficiencies of both contributing factors. Figure 1 summarizes the approach [95].

Fig. 1
figure 1

Network DEA with pre-epidemic health strategy and COVID-19 testing ad hoc intervention

The results of the network DEA (appendices 2 and 3) indicate that the efficiency of the pre-epidemic health strategy and ad hoc COVID-19 intervention are positively related to each other (r = 0.66). This seems reasonable when remembering the example of Germany expanding the intensive care hospital capacity based on the existing health care system capacity. Furthermore, the possibility of testing for COVID-19 infections depends on the available health care resources, e.g., doctors or medical professionals. Therefore, an important finding of the network DEA is that the institution of only ad hoc decisions, e.g., lockdowns, testing, or expanding intensive care capacity, will not lead to an overall successful fight against COVID-19. Rather, ad hoc decisions can contribute to the fight against COVID-19 and increase efficiency. Figure 2 demonstrates the relationship between efficiency for pre-epidemic health strategy and COVID-19 testing policy, where the size of the bubbles depends on the population of the country and the color indicates the membership to one of the examined healthcare system typologies: (1) the supply- and choice-oriented public type (blue), (2) the performance- and primary-care-oriented public type (green), (3) the regulation-oriented public type (yellow).

Fig. 2
figure 2

Relationship of efficiency for pre-epidemic health strategy and COVID-19 testing policy

Through a closer examination of the efficiency values per typology, it can be stated that performance- and primary-care-oriented public healthcare systems have been the most efficient ones in fighting the COVID-19 pandemic. This is true for the pre-pandemic health system efficiency, as well as for the efficiency of ad hoc measures. According to Reibling et al. [109], this type is dominated by public financing but spends less money and uses fewer human resources to provide healthcare. The focus of the FIN, JPN, KOR, NOR, and SWE healthcare systems was clearly on primary care orientation, with a comparatively high proportion of primary care doctors compared with specialists [107, 108].

Furthermore, the regulation-oriented public type was the least efficient in regard to the efficiency of the pre-pandemic healthcare strategy. This type was characterized by a medium level of resources that come primarily through public funding, but it has the highest level of access regulation and limits the choice of providers. The system is also characterized by the absence of formalized cost-sharing and the lowest level of out-of-pocket expenditures [109]. It is important to mention that countries in the regulation-oriented public type can increase their overall performance in fighting COVID-19 with good ad hoc management, e.g., through lockdowns or social distancing policies (DNK, NLD). However, incorrect decisions such as lockdowns that are introduced late (GBR) or only in small steps and inconsistently (ITA), together with regulation-oriented or inefficient public pre-pandemic healthcare system policies, led to the lowest efficiency levels among all OECD countries. A more counter-intuitive finding is that the efficiency was not dependent on the population size of the countries (node 1: r =– 0.12; node 2: r =– 0.06), population density (node 1: r =– 0.01; node 2: r =– 0.24), median age (node 1: r =– 0.11; node 2: r =– 0.26), proportion of the population that is older than 65 years (node 1: r = 0.03; node 2: r =– 0.17), the proportion of the population that is older than 70 years (node 1: r =– 0.08; node 2: r =– 0.23), the GDP per capita (node 1: r = 0.05; node 2: r = 0.33), or the Human Development Index (node 1: r =– 0.14; node 2: r =– 0.11) [123].

DEA window analysis for the impact of governmental programs on the economy

Governmental regulations to lower the spread of COVID-19 through, e.g., isolation and quarantine policies, public gatherings limitations, or lockdowns, had versatile impacts on the economy. While the northern European countries Sweden (Government Response Stringency Index, GRSI 46.3 of 100) and Finland (GRSI 57.41 of 100) were defensive in their regulation policies compared to, e.g., France (GRSI 90.74 of 100) or Germany (GRSI 60.65 of 100). Therefore, the question of which policy is the best for a country’s economy arises [62]. A second important factor is a direct intervention through, e.g., limiting product imports/exports and government aid programs. Table 3 presents the results of the DEA window analysis using I1, I2, and I3, as well as O1, O2, and O4.

Table 3 Evaluation of country-specific efficiency through the lens of economic efficiency

Furthermore, the average efficiency level in the economic model was correlated with the average GRSI between periods 1 and 12 per country. Taking the averages for the whole dataset (periods 1–12) showed a negative correlation of efficiency and GRSI (r =– 0.45); the average of periods 5–12 showed a strong negative relationship, with r =– 0.54, and no significant relation for the last periods (10–2; r =– 0.28). For all moving average calculations, the correlation coefficient was negative (Appendix Table 7).

Summarizing these findings from an economic point of view, the higher the strength of the governmental response to COVID-19, e.g., through lockdowns and social distancing, the less efficient the country response is through the lens of economic efficiency. As presented in the previous chapters, lockdowns lead to a direct increase in efficiency, but the price is more unemployment and a weakening of the economy. The most promising approach to respond to COVID-19-related lockdowns of the economy and society is short-term work with governmental support, which had a positive impact on the efficiency observed the top 4 countries of Sweden (eff. = 1.00), Germany (eff. = 0.97), Spain (eff. = 0.93), as well as Switzerland (eff. = 0.84)Footnote 1 and was not reported for the rest of the examined OECD countries [113, 116].

Discussion

The following items for discussion stem from the presented results and are important for the objective of improving the public health economic answer to pandemic crises in a political and managerial sense:

  1. (1)

    Traditional health systems and economic success factors for wealth and development, such as country and population size or the respective Human Development Index (HDI), are found to be less relevant to the efficiency of pandemic management in some countries. In contrast, we have found that small and large countries, Eastern and Western counties, and developed and less developed are represented in both the efficient group and the less efficient group in terms of their efforts and results against the COVID-19 pandemic. This is important for health economics research and decisions, as new predictor variables for decisions and analysis regarding the resource-efficient answer to pandemic situations have to be identified and applied in future (e.g., the discussion by [30] or in a pre-COVID-19 perspective the results by [83] and [34]).

  2. (2)

    It is important to note that the development situation during a pandemic crisis is very dynamic, and individual countries are transitioning between various phases of success and failure at different times – successful crisis management in the past does not guarantee resistance to the pandemic for the future. This has already been found before the COVID-19 pandemic but has to be remembered nevertheless [24, 91]. In some cases, even a reciprocal connection can be recognized as successful crisis management is, by and large, lowering the acceptance of future restrictive measures in the fight against COVID-19. This is not only relevant to the ongoing discussions about “second waves” of COVID-19 but also the general risk of a long-term perspective and trade-off regarding health economic decisions. Research has to improve the analytical and prognosis perspective to enhance the pandemic foresight aspect of health economics. This also addresses a continuous learning perspective for the health care management discipline [65].

  3. (3)

    Looking at the integrated model of health care management and economic evaluation of country-specific crisis management provides insight that governmental support of short-term work is an efficient concept to balance COVID-19 intervention and economic success. This might be true for further instruments developed during the pandemic in 2020, including the latest measures taken by China, the U.S. and the European Union countries. Further methods and research discourse are required to systematize and evaluate all the comprehensive approaches regarding analysis and policies in this regard; in many cases, only long-term hindsight might reveal the full picture, as we have learned from previous pandemics (Almond & Bhashkar Mazumder, 2005 for the Spanish flu of 1918/19) [122, 127, 129].

  4. (4)

    Analysis and decision endeavors in health care management science have to take into account that international pandemic situations are real “rule changers” — many assumptions applied in health economics analysis and discourse simply do not hold in such situations, as already shown by Beutels et al. [17]. Therefore, dynamic and “out of the box” approaches have to be developed for application in situations that are beyond the routine steady state of health systems economics on normal days [85].

  5. (5)

    For further discussion regarding successful COVID-19 interventions, it may be beneficial to integrate data from deeper aggregation levels. An example is the impact of population size which has been found to be influential comparing rural and urban areas in Turkey [13], but had no impact on in our country-related analysis. The same applied for approaches that seek to evaluate regional changes of peoples’ actions, such as a more strict adoption of social distancing measures among the population, that reduce the spread [14]. However, as our research was intended to provide a bigger picture going beyond regional developments, the data quality on this aggregation level does not exist for a majority of OECD countries until now. The country managed to avoid high case numbers through intelligent and digital COVID-19 management, as well as persistent tracking, tracing, and testing of infected persons who were quickly identified and treated at an early stage. The country is thus a unique example of efficient COVID-19 control without lockdown.

  6. (6)

    The implementation of telemedical health services can be a promising concept for people seeking for a rapid first advice regarding a possible infection while keeping to social distancing measures at the same time. It could also help relieve pressure on the often overburdened primary care systems or emergency departments [56]. Parsons and Romanis [104] discusses an interesting case of a mobile abortion care service for the United Kingdom [103].

  7. (7)

    During our analysis, we found that all countries chose diverse strategies that are mostly disconnected. This applies for the OECD countries, as well as when looking at European countries in isolation (see also, [112]. The paradigms of strict legally backed lockdowns in central Europe stand against softer responses of northern European countries. Therefore, we agree with Forman et al. [52] and call for a unified responses to pandemics.

  8. (8)

    Additionally, our results on OECD countries may be supplemented by integrating developing OECD countries that we did not evaluate within the DEA model, e.g., (a) Chile, (b) Columbia, and (c) Mexico: We start with (a) Chile and the results presented by Oliveira et al. [102] showing that timely and coordinated social distancing was a powerful non-pharmaceutical intervention, but difficulties in keeping the population under control decreased the impact of this measure. For (a) Chile and (b) Columbia, Benítez et al. [16] show that although stringent measures of containment and mitigation were introduced, pre-pandemic conditions, e.g., high informal employment and social inequalities, have undermined the effectiveness of the countries’ responses to the pandemic. For (c) Mexico, Knaul et al. [80] demonstrate the absence of a uniform national response of Mexico and argue that coordinated, timely, rigorous response to the pandemic did not occur in Mexico. Arellanos-Soto et al. [5] evaluated Mexican Governmental measures by comparing data on influenza diagnosis, finding that the implementation of public health measures leads to a significant decrease. This is supported by Díaz-Castro et al. [38], concluding that Mexico's health policies had an effect on slowing the pandemic’s propagation, but population density and mobility played a fundamental role. In summary, the results for developing OECD countries are in line with our previously stated quantitative findings, including the suitability of social distancing measured and the fact that a lack of uniform, coordinated, and timely response may be reasons for countries to fall behind in an early stage. Furthermore, the cases of (a) Chile and (b) Columbia underline the relevance of pre-pandemic conditions [130, 133, 134, 136].

    Altogether, this approach, involving country-specific modeling of input and output types for a health and general economic efficiency analysis to be used as a central evaluation measurement for bundles of health care and political measures, has proven to be worthwhile and should be subject to further research and discussion. Further research approaches are warranted regarding, for example, the identification of further relevant input and output types, as well as the inclusion of further individual countries or bundles of countries besides the OECD set applied here. Additionally, as we identified (partial) lockdowns and social distance measures as a possible key to fight the pandemic, more research on the evolvement of public values and perceptions during the COVID-19 pandemic should foster our understanding on the acceptance level of these measures as discussed by Denburg et al. [37].

Conclusion

This paper has outlined specific approaches and results regarding the efficiency of health economic measures and interventions for the COVID-19 pandemic worldwide, including country-specific analysis of 19 OECD countries. The results show that country-specific efficiency regarding multiple input and output factors varies very broadly and is not connected to traditional clustering factors or economic success factors, such as size, population, or development status. Instead, specific individual types of measures like selected lockdown or dedicated economic support instruments seem to make a difference, which is an important message for health economic research and management. Looking into further analyses based on these finding presented here will pay off in future as government interventions can be improved and finetuned to optimize the resource-dependent results from health policy-related answers to the COVID-19 crisis. The contribution of this paper consists of (1) the specific identification of efficient countries in the fight against the COVID-19 pandemic during the first waves in 2020, which can serve as role models and sources of further information through research into comparative efficiency the specific measures taken in these countries; (2) the establishment of an evaluation scale through multidimensional efficiency measures that provides room for further elaboration in health economic research to identify success factors; and (3) the introduction of specific political measures that are expected to be useful in many future situations similar to the COVID-19 situation. Therefore, the paper provides a valuable contribution toward an improvement regarding public measures for pandemic preparation and management.