Introduction

In 1997, Egypt launched a comprehensive Health Sector Reform Program (HSRP) to address fundamental challenges in the healthcare system. The HSRP introduced an integrated package of service delivery and financing interventions to address the means by which primary health care (PHC) is financed, delivered, organized, and managed [1, 2]. One of the cornerstones of the HSRP was the facility accreditation program, which was defined by the country’s Ministry of Health and Population (MOHP) as a process for evaluating PHC facilities according to a set of standards that define activities and structures that directly contribute to improved patient outcomes. The main aim of the program was to provide the HSRP with a framework for continuous quality improvement [3, 4].

Accreditation of healthcare providers has been established in many high-income countries and some low- and middle-income countries (LMICs) as an approach to improve the quality of care that combines the two elements of quality assurance and quality improvement [5, 6]. There is consistent evidence that accreditation programs improve the process of care in all settings, especially those related to the quality of healthcare services [7,8,9,10,11]. However, there is limited evidence on the effectiveness of accreditation on patient-related outcomes in all settings [12,13,14]. One interesting study on patient outcomes in a high-income country is Falstie-Jensen et al. [15], who report a statistically significant association between persistent low compliance with accreditation and higher 30-day mortality and longer length of stay at public Danish hospitals. However, Lam et al. [16] report that U.S. hospital accreditation is not associated with lower mortality and is only statistically weakly associated with lower re-admission rates for some medical conditions. Similarly, accreditation of public health hospitals in Saudi Arabia is found to have no significant effect on mortality rates but significant positive effects on infection and length of stay [17].

Although a number of studies explored the effect of accreditation on the quality of care in LMICs (e.g., [18]), we could only identify one study on the effect of quality improvement through accreditation on patient outcomes [19]. This study concludes that accreditation of Chinese chest pain centers was associated with better in-hospital outcomes for acute myocardial infarction patients. Since accreditation usually entails significant costs, investigating its effectiveness is crucial, especially in settings where resources are extremely constrained. Moreover, in light of evidence on the non-monotonic effect of access to health care on gender inequality [20], it is particularly important to estimate the effect of having access to an accredited facility on disadvantaged groups, such as women and children.

We attempt to fill the gap in the literature by investigating the effect of accreditation of PHC units on the quality of care and patient outcomes in Egypt, a middle-income country. The paper exploits the quasi-natural experiment associated with the introduction of Egypt’s facility accreditation program to estimate the effect of having access to an accredited facility on a set of family planning, maternal health, and child health outcomes. To do this, difference-in-differences (DiD) fixed-effects models are used for the period 1992–2014. DiD models are also combined with propensity score matching (PSM) for the specific periods of 2000–2005, 2005–2008, and 2008–2014.

The paper is organized as follows: section “Background” provides some background; section “Data” describes the exploited data; section “Estimation methods” explains our empirical strategy; section “Results” presents some descriptive statistics and estimation results; section “Discussion” discusses the paper’s findings; and section “Conclusion” concludes. Appendix contains descriptive statistics, robustness checks, and extensions.

Background

The facility accreditation program

In 1997, the Government of Egypt (GOE) launched the HSRP, a new PHC strategy to reform the health system in phases over a period of 15–20 years. The program came into operation in 2000 [2]. The simultaneous implementation of the program across the country was deemed infeasible due to pre-existent constraints in the healthcare system and the complex nature of interventions to be introduced. Hence, GOE decided to implement the program over phases. The targeting took place at the district level in the participating governorates, whose master plans relied on a social vulnerability index to target districts of the most vulnerable populations [1]. Early entrants to the program included a group of PHC facilities in the governorates of Alexandria, Menoufia, and Sohag, which represent, respectively, urban governorates, Lower Egypt, and Upper Egypt. The three pilot governorates represent the three major regions in Egypt. Other governorates followed subsequently [1].

One of the key marketing points of the HSRP service provision reform of PHC was that it would improve access to quality care. To ensure this, the facility accreditation program was established by MOHP as a process to monitor and facilitate the quality of services and influence the behavior of healthcare providers. Thus, as part of the HSRP, accreditation became obligatory to all PHC facilities in the districts targeted by GOE to join the program. Technical assistance was provided to develop improvement plans [3].

In the preparatory phase of the facility accreditation program, a PHC facility needed to meet the following criteria: (i) had a process to monitor, evaluate, and improve quality of care; (ii) had a patient record system; (iii) provided a defined package of services including reproductive health obstetrics and gynecology, neonatal care, pediatric and adult medical care, basic emergency care, preventive health services, and ambulatory care; and (iv) had been in operation for at least six months, had appropriate license by MOHP and relevant medical union, and operated within the laws and regulations [21].

The survey was a key step in the facility accreditation program. A site visit to PHC facilities was conducted by a team of experts trained in accreditation using pre-set accreditation survey instruments and tools. The purpose of the accreditation survey was to evaluate the extent to which a facility complied with the nationally established accreditation standards, and accordingly, determine whether it was to be awarded or denied accreditation. The assessment initially included eight categories: patient rights, patient care, safety, management of support services, management of information, quality improvement program, family practice, and management of the facility. Optimal standards in each category focused on key processes, activities, or outcomes that facilities should achieve [2, 3, 22].

During the accreditation survey, trained surveyors used three approaches to collect data and measure compliance with the established standards: review of specific administrative and clinical records; observe the performance of specified tasks in particular areas; and conduct personal interviews. If a facility scored 80% or above in the total survey scores, it was granted full accreditation for a period of 2 years. If a facility scored between 50 and 79% of the total survey score, it was granted provisional accreditation for one year, after which a re-assessment survey is conducted to investigate if the deficits pointed out by the first round have been addressed. If a facility scored less than 50%, accreditation was denied [22]. The scoring by areas is presented in Tables 3 and 4 in Appendix A: the facility accreditation program.

In terms of contribution to the total score, the quality dimension patient care contributed the most. This dimension measured the extent to which patients received appropriate care, and focused on compliance with clinical practice guidelines and appropriate diagnosis [22]. Details of the patient care assessment process are included in Appendix A: the facility accreditation program (see Box A.1).

Anticipated effect of the facility accreditation program

One important characteristic of healthcare markets is the presence of asymmetric information [23]. It is well known that healthcare providers may act as ‘imperfect’ agents of patients and over or under provide care or fail to deliver the adequate health care quality, which can become a health concern. Different interventions, including accreditation of providers, have evolved in response to these problems. By subjecting healthcare providers to a formal process that makes them meet pre-determined standards, accreditation is expected to minimize variations in medical practice, eliminate medically inappropriate care, control costs, and address the possibility that quality is underprovided [24, 25]. Recent studies suggest that accreditation can influence quality of care through three mechanisms: coherence, organizational buy-in, and collective quality improvement action [26], with possible effects on patient-related outcomes. Earlier studies suggest that the accreditation process can stimulate organizational changes that enhance the quality of care (e.g., [27]).

In the context of this study, accreditation is expected to have a direct effect on some maternal health, child health, and family planning outcomes, through improving the quantity and quality of pertinent health services provided, and an indirect effect on other outcomes. Accreditation of health facilities certifies high compliance with standards defining activities and structures that directly contribute to improved patient outcomes. Hence, within the quality dimension patient care, accreditation standards established to measure compliance of facilities in the sub-areas of antenatal care (ANC), integrated management of child illnesses (IMCI), and family planning are expected to have a primary effect on ANC coverage (number of visits), quality of ANC (being informed of signs of pregnancy complications, weight measurement, blood pressure measurement, and urine sample collection), child morbidity prevalence (acute respiratory infection (ARI), fever, and diarrhea), and informed choice of contraceptive methods (knowledge of side effects of contraceptive method used and knowledge of other methods of contraception that could be used). These outcomes reflect some of the standards assessed during the accreditation survey (see Appendix A: the facility accreditation program, Box A.1). We expect improvements in these outcomes in accredited compared to non-accredited facilities.

In parallel, accreditation is expected to have a secondary effect on the utilization of antenatal and delivery care services. Quality improvement in accredited facilities introduces an incentive for individuals to seek care at these facilities. The effect of this incentive is expected to be more significant in the sub-areas of care included in the assessment of the accreditation survey. Thus, we expect having access to an accredited facility to be associated with higher ANC coverage, higher institutional birth-delivery, and higher skilled assistance during delivery. This expectation holds given that accredited facilities were not functioning at full capacity prior to accreditation and can increase supply in the short term.

Evidence on the effect of accreditation

There exists a large body of literature on the effects of accreditation as a quality signaling device for firms (for instance, a good summary on firm behavior and accreditation can be found in Dranove and Jin [28]). However, less is known about the impact of accreditation on healthcare provision and about patient health outcomes.

The majority of studies on accreditation in health care in LMICs report on its positive effects on process indicators. A recent study by Terra and Berssaneti [11] provides evidence that accreditation promoted healthcare processes in Brazilian hospitals, thus strengthening the culture for healthcare quality and increasing patients’ satisfaction. Similarly, in Brazil, Saut et al. [10] indicates that accreditation mainly influenced the internal processes of healthcare organizations. Saadati et al. [29] show that the accreditation of an Iranian teaching hospital improved patient-centeredness, patient safety, logistics and managerial processes, as well as decision-making.

Several studies in LMICs focus on accreditation’s effect on compliance with quality standards [30,31,32,33]. A quasi-experimental study in Egypt found that accredited non-governmental health units had higher compliance with quality standards compared with non-accredited units [30]. Another study in Egypt on public clinics found that providers that had earned an accreditive Gold Star were more likely to adhere to higher quality practices in counseling and examination than non-Gold Star facilities [32]. According to a descriptive study in Zambia, a national hospital accreditation program was associated with significant improvement in compliance of accredited hospitals with standards in overall scores and in seven out of 13 functional areas [31]. In South Africa, Salmon et al. [33] used a randomized control trial to investigate the effect of an accreditation program on public hospitals and found that the processes’ and outcomes’ average compliance of accredited hospitals improved significantly, while no significant improvement was observed in non-accredited hospitals.

Besides compliance with standards, the majority of the studies report on the effect of accreditation on quality of care measures. These are, for the most part, not patient health outcomes, but downstream process indicators [18, 30, 32,33,34,35]. Unlike compliance with standards, there is no conclusive evidence on the effect of accreditation on quality of care. While Avia and Hariyati [8], El-Jardali et al. [34], Hong et al. [32], Quimbo et al. [35], and Reisi et al. [18] report positive effects of accreditation on different indicators of quality of care, studies employing more robust study designs report mixed effects. These are Salmon et al. [33] and Al Tehewy et al. [30], which used a randomized controlled trial and a quasi-experimental design, respectively. In a study based on data from hospitals in South Africa, Salmon et al. [33] found limited or no effect of a randomized accreditation program on quality measures apart from increases in perception of quality among nurses. In Egypt, Al Tehewy et al. [30] found a positive effect of accreditation of non-governmental health units on patient satisfaction with respect to all areas of health service (cleanliness, waiting area, waiting time, and staff performance). As for provider satisfaction, the study found a positive effect on the overall satisfaction score, but no significant difference in the mean satisfaction score between accredited and non-accredited units with respect to social environment, administrative environment, and family health model.

In conclusion, the available evidence on the effectiveness of quality improvement through accreditation of healthcare providers is limited and of questionable quality but suggests that accreditation can improve the process of care provided by different providers. However, evidence is limited on the effectiveness of accreditation on patient outcomes. One study found that accreditation of Chinese chest pain centers was associated with better in-hospital outcomes for acute myocardial infarction patients [19]. We could not identify any study on the effect of accreditation on key patient outcomes such as family planning, maternal health, and child health outcomes.

Data

To answer our research questions, we exploit the Egypt Demographic Health Survey (DHS). This survey consists of two questionnaires: one for households (HHs) and the other for ever-married women (EMW), and has a consistent response rate of over 99% for all waves.Footnote 1

For this study in particular, we make use of all the relevant data made available by the Egypt DHS on family planning and maternal and child health. We expect these variables to be affected by the changes in quality standards, policies, and procedures, which is the focus of accreditation assessment.Footnote 2

To construct our dependent variables at the facility level, we collapse the responses of 97,990 women over the period 1992–2014 at the facility level, drawing from six DHS waves: 1992, 1995, 2000, 2005, 2008, and 2014. To do so, we use the Global Positioning System (GPS) coordinates of both interviewed women and health facilities to link each woman to the nearest mapped facility for each wave of the Egypt DHS. The aim is to identify women who live in the catchment area of accredited facilities (treatment group) and those in the catchment area of non-accredited facilities (control group). All eligible PHC facilities across Egypt are used during the joining process. We are confident that women do not bypass their closest PHC facility because they are obliged by MOHP to use the facility in their catchment area.

The gradual uptake of the facility accreditation program by health facilities provides a quasi-natural experiment. Thus, for our main explanatory variable reflecting reform status, we draw on facility-level data from MOHP to categorize facilities as treated (accredited, either fully or provisionally) and non-treated (non-accredited). To ensure that treatment reflects only accreditation, we remove from the sample accredited health facilities that were subject to additional interventions under the HSRP such as performance-based financing (PBF) and introducing user fees.

Dependent variables

Having linked each woman to a respective health facility, we are able calculate health outcomes at the facility level for each of the Egypt DHS waves and obtain a panel. By construction, our dependent variables are at the facility-level: informed choice of contraceptive methods (‘family planning’), ANC, delivery care, and child morbidity prevalence, which we describe in detail below.

Family planning

As part of the family planning sub-area of the patient quality of care dimension, the accreditation surveyor checks if the facility has a good information/education/communication (IEC) system. For this sub-area, we include two family planning outcomes capturing the effect of accreditation on choice of contraceptive methods by calculating the percentage of current users of selected contraceptive methods who were informed of the side effect of or problems associated to the method used.Footnote 3 Those receiving information on the efficacy and side effect of contraceptives used tend to have higher continuation rates than those who do not [36]. We also calculate the percentage of current users of selected contraceptive methods who were informed of other methods of contraception that could be used.Footnote 4 Informed choice emphasizes that women choose the method that best satisfies their personal and reproductive health needs based on a thorough understanding of other methods of contraception they could use.

ANC

As part of the ANC sub-area of the patient quality of care dimension, the surveyor checks if physical examination is performed for all patients. We include six ANC outcomes that capture the effect of accreditation on the quality of ANC. We calculate the percentages of mothers who received the following components of ANC: being informed of signs of pregnancy complications, weight measurement, blood pressure measurement, and urine sample collection.Footnote 5 As part of the accreditation survey, the surveyor also checks if the number of ANC visits falls within the clinical guidelines. Therefore, we calculate an outcome of ANC coverage indicator (at least four visits).Footnote 6 This outcome is used as a global preferred outcome of access to and use of health care during pregnancy to track performance in maternal health programs.Footnote 7 A pregnant woman is expected to receive health interventions during antenatal visits that can be vital to her health and the health of her infant as well.

Delivery care

As part of the ANC sub-area of the patient quality of care dimension assessed by the accreditation survey, the surveyor assesses patient’s knowledge and understanding of delivery services provided in the facility. Egypt DHS data allows us to calculate two delivery care outcomes to capture the effect of compliance with the accreditation standards in this regard: institutional deliveryFootnote 8 and skilled assistance during delivery.Footnote 9 The two outcomes are widely advocated for reducing maternal, perinatal, and neonatal mortality. Institutional delivery captures the effect of accreditation on expanding access to childbirth facilities and, more importantly, is a proxy measure of maternal and neonatal morbidity and mortality.Footnote 10

The second but most important measure of delivery care included in our analyses is skilled assistance during delivery. Empirical literature provides evidence that wider access to professional care during pregnancy and childbirth reduces maternal mortality. Women assisted by skilled health personnel during delivery are less likely to die from any cause related to or aggravated by childbirth [37].

Child morbidity prevalence

As part of the IMCI sub-area of the patient care dimension of quality assessed by the accreditation survey, the surveyor checks if child is checked for cough, diarrhea, sore throat, ear infection, and fever. We use the prevalence of childhood ARI, fever, and diarrhea from the Egypt DHS data as three outcomes reflecting morbidity prevalence.Footnote 11

ARI is the leading infectious cause of death in children worldwide.Footnote 12 Diarrheal diseases are the second leading cause of death in children under age five [38]. The risk of under-five mortality can be diminished substantially through reducing the prevalence of ARI and diarrheal diseases and encouraging women to seek treatment for their children at a health facility and/or from a healthcare provider.Footnote 13

Explanatory variables

The explanatory variables included in the analyses of this study are a treatment variable that reflects participation in the facility accreditation program, facility-level controls, district-level social and economic controls, and regional dummies to control for regional variation.

In particular, the facility characteristics include labor force, the facility’s building condition, and population coverage. For labor force, we incorporate the number of eight types of workers in a health facility: practitioners, specialists, pharmacists, nurses, lab technicians, X-ray technicians, health observers, and social workers. For building infrastructure, a dummy variable that describes the condition of a facility building as ‘bad,’ ‘average,’ or ‘good’ is included. As for population, we control for the size of population in the facility catchment area. This information is obtained from Egypt’s MOHP.

At the district level, we use Egypt’s 2006 Population and Housing Census to construct eight district-level social and economic controls: illiteracy ratio, unemployment ratio, income dependency ratio, inaccessibility to electricity, inaccessibility to potable water, average family size, HH crowding factor, and population size. In addition, regional dummies are defined for fully urban governorates, urban Lower Egypt, rural Lower Egypt, urban Upper Egypt, rural Upper Egypt, and frontier governorates. These district-level covariates control for both the selection criteria of the HSRP targeting and the demographic variation across districts. The regional targeting of the HSRP followed a socio-economic vulnerability index that was constructed from the eight social and economic indicators outlined earlier.

Estimation methods

Difference-in-differences

As the accreditation policy is staggered, we first follow Wooldridge [39] and use a general DiD fixed-effects model to estimate the effect of accreditation by comparing the health outcomes of accredited facilities (treatment group) to that of non-accredited facilities (control group) before and after accreditation (treatment) between 1992 and 2014.

Accredited and non-accredited facilities presumably differ in observed characteristics, such as labor force, and unobserved ones too, such as managerial ability. The DiD method controls for both observed and unobserved characteristics that are time invariant. Fixed effects further eliminate any confounding that might be caused by facility effects, whether observed or unobserved, which are constant over time within each facility. With regard to time-varying unobservable factors, we report in Appendix C: checks and robustness extensions the results of the parallel-trends test, which provide evidence of the absence of unobserved time-varying confounding, establishing the unbiasedness of our DiD estimates.

Treatment self-selection is not a concern in our context as treatment, i.e., the decision of whether or not to obtain accreditation is exogenous. Accreditation is not a function of some choice of the treated unit, but is rather a function of a policy that differentially affects units based on pre-determined characteristics. As noted earlier, the facility accreditation program is only rolled out in certain geographical areas. Hence, whether one facility can obtain accreditation or not is neither self-selected nor screened.

For each health facility \(i\) at time \(t\), we estimate the following DiD fixed-effects model:

$$\it y_{it} = \alpha + \beta {policy}_{it} + \gamma {year}_{t} + \delta_{i} + \varepsilon_{it} ,$$
(1)

\({y}_{it}\) denotes a health outcome of interest \(y\) for facility \(i\) at time \(t\). We include outcomes of informed choice of contraceptive methods, ANC, delivery care, and child morbidity prevalence drawing from secondary data from Egypt DHS. The variable \(\it {\mathrm{\it policy}}_{it}\) equals 1 if facility \(i\) is subject to accreditation in year \(t\). \(\it {\mathrm{\it year}}_{t}\) is a time-period dummy. \({\delta }_{i}\) is the unobserved facility effect.

Additionally, to compare between the effects of accreditation for the specific periods of 2000 to 2005, 2005 to 2008, and 2008 to 2014; we estimate the typical DiD specification below for each health facility \(i\) at time \(t\).

$$\it \it y_{it} = \alpha + \beta acc_{it} + \gamma d_{{{post}}} + \delta acc_{it} \times d_{{{post}}} + \zeta fac_{i} + \eta dist_{i} + \varepsilon_{it} { },$$
(2)

where \(t\) equals 0 for the baseline years (2000, 2005, or 2008) and 1 for the follow-up years (2005, 2008, or 2014); \({acc}_{it}\) is an indicator variable that takes value 1 if facility \(i\) is accredited and 0 if not; \(\it {d}_{\mathrm{\it post}}\) is an indicator variable for the follow-up year; the interaction term \(\it {acc}_{it}\times {d}_{\mathrm{\it post}}\) measures the effect of accreditation in the follow-up year; and \(\delta\), our main coefficient of interest, captures the effect of accreditation on the outcome at the facility level.

Finally, to eliminate potential unobserved heterogeneity and account for possible differences between accredited and non-accredited facilities prior to accreditation, the vector \({fac}_{i}\) contains facility-level controls that reflect different characteristics of facility \(i\) including labor force, the facility’s building condition, and population coverage; and \({dist}_{i}\) is a vector of district-level controls including social, economic, and demographic characteristics of the district in which facility \(i\) is located.

Propensity score matching difference-in-differences

The targeting of the reform interventions at the district level under the HSRP followed a socio-economic vulnerability index of the areas around health facilities. As such, the comparison of health outcomes without accounting for this would be biased. To ensure that no bias exists due to targeting, we combine DiD with the PSM approach.Footnote 14 Matching on observables mitigates the potential bias by pairing accredited and non-accredited health facilities based on pre-accreditation observable characteristics, which were initially used by GOE for accreditation targeting. Additionally, as a stand-alone method, DiD can be used to identify treatment effects if there is a selection based on (time-invariant) non-observables. Thus, while conventional PSM cannot account for non-observables, combining matching with DiD enables us to account for both the effect of observable and unobservable heterogeneity that is constant over time, as well as the targeting policy. To minimize any potential bias due to time-varying unobservable factors, we also control for an extensive set of facility-level characteristics and population coverage of facilities. Reassuringly, we generally find no significant differences in outcomes or characteristics between the population covered by treated and non-treated facilities (see Appendix C: checks and robustness extensions, Table 12). Table 12 suggests that our matching strongly satisfies the requirement of conditional independence.

To obtain the PSM DiD estimates, we follow Rosenbaum and Rubin [40]. We first apply PSM to match facilities and then extend the conventional DiD estimates by defining health outcomes conditional on propensity scores and applying semi-parametric methods to construct the differences. First, we match treated and control health facilities based on pre-treatment observable characteristics and use Kernel functions to assign weight to the jth control facility matched to the ith treated facility.Footnote 15 As such, in our context, the propensity score is the probability of being targeted by the HSRP intervention given a set of observable social and economic indicators used to construct the socio-economic vulnerability index. Second, we estimate a DiD specification in Eq. (2) with health outcomes defined conditional on the propensity score generated earlier. The Kernel PSM DiD estimate for each treated facility i is calculated as

$$\hat{\delta }_{i} = \left( {y_{i{post}}^{T} - y_{i{pre}}^{T} } \right) - \mathop \sum \limits_{j \in C} \omega \left( {i,j} \right)\left( {y_{j{post}}^{C} - y_{j{pre}}^{C} } \right).$$
(3)

Prior to the DiD estimation, we verify that the common support assumption is satisfied by checking the overlap between treatment (accredited facilities) and control (non-accredited facilities) groups (see Appendix C: checks and robustness extensions, Figs. 1, 2, 3). Once the matching is applied, we use two-sample t-tests to examine if there are significant differences in the means of observable characteristics for both groups [40].

While we use district-level social and economic indicators to estimate the propensity score, facility-level characteristics are used as additional covariates later in the DiD estimations. For each of our health outcomes, we report the results for three study periods: 2000–2005, 2005–2008, and 2008–2014.

Results

Descriptive statistics

Table 7 in Appendix B: data sources and descriptive statistics summarizes the descriptive statistics of health outcomes and district characteristics of all health facilities observed between years 1992 and 2014. Overall, the table indicates relatively moderate variability across facilities and districts for most variables. The table shows that Egypt has an overall moderate level of ANC coverage. On average, 56% of pregnant women in Egypt reported at least four ANC visits. However, the country performed differently with respect to various ANC components. Women were more likely to be weighed (82%) and get their blood pressure measured (82%) during ANC visits but far less likely to be informed of signs of pregnancy complications (34%). In parallel, Egypt has a fairly high level of delivery care coverage through access to health facilities and skilled health personnel. On average, over 60% of women delivered their most recent birth in an institutional setting and over 70% of births were assisted by skilled health personnel. The country had a higher level of prevalence of childhood fever (23%) than ARI (14%) and diarrhea (13%).

The descriptive statistics of district characteristics of facilities based on which targeting took place are also reported in Table 8 in Appendix B: data sources and descriptive statistics. The table highlights the difference in the district characteristics between accredited and non-accredited facilities. We use the two-sample t-test to check whether the means of the two groups differ significantly. On average, districts to which accredited facilities belong have significantly higher HH overcrowding during the period 2000–2005; significantly lower illiteracy, income dependency, inaccessibility to electricity, and inaccessibility to potable water; smaller family size; and bigger population size during the period 2005–2008; and significantly lower income dependency, inaccessibility to electricity, and inaccessibility to potable water, smaller family size, and lower HH overcrowding during the period 2008–2014. These results suggest that the actual targeting of the HSRP did not strictly follow the socio-economic vulnerability index.

Estimated effects of accreditation

Table 1 reports the DiD fixed-effects estimates of the effects of accreditation during the period 1992–2014. We additionally report the DiD and Kernel PSM DiD results for the specific periods 2000–2005, 2005–2008, and 2008–2014 in Table 2.

Table 1 Difference-in-differences fixed-effects estimates of the effects of accreditation, 1992–2014
Table 2 Difference-in-differences and Kernel propensity score matching difference-in-differences estimates of the effects of accreditation: 2000–2005, 2005–2008, and 2008–2014

Using the pooled sample covering the period of 1992–2014, Table 1 shows that accreditation did not have a significant positive effect on all health outcomes except the prevalence of childhood diarrhea. This unexpected finding invited us to disentangle the observed effects of the program from each time period.

For the study comparing the years 2000 and 2005, Table 2 provides evidence that having access to an accredited facility was associated with higher likelihood of being informed of the side effects of contraceptives: the proportion of women who were informed of the side effects of the contraceptives used increased significantly by 16 percentage points (ppts) among those with access to accredited facilities, compared to women with access to non-accredited facilities. This positive effect disappeared in the subsequent periods. The proportion of women with access to accredited facilities, who were informed of other methods of contraception that could be used, increased by 7 ppts in 2005, but this effect was counterbalanced by a decrease of 9 ppts in 2008, again compared to women with access to non-accredited facilities. The effect also disappeared during the period 2008–2014.

With respect to ANC, Table 2 shows that accreditation had a limited positive effect on ANC during the period 2000–2005, specifically on being informed of signs of pregnancy complications (6 ppts); this positive effect is again observed in the period 2005–2008, but vanishes in 2008–2014. The proportion of women with access to accredited facilities, who had 4+ ANC visits or had a weight measurement during ANC visits, slightly increased significantly by 5 ppts and 3 ppts (respectively) in 2005–2008, compared to women with access to non-accredited facilities. We also observe significant negative effects of accreditation on blood pressure measurement during ANC visits in the period 2000–2005, with no significant effect in the following periods.

In terms of delivery care, institutional delivery and skilled assistance during delivery increased by more than 7 ppts and 11 ppts, respectively, among women with access to accredited facilities in the first period of analysis. Nevertheless, the estimates of both outcomes are statistically insignificant during the periods 2005–2008 and 2008–2014. In parallel, we observe that accreditation had multiple significant positive effects on child morbidity prevalence during the period 2000–2005: accreditation reduced the prevalence of childhood ARI, childhood fever, and childhood diarrhea among children with access to accredited facilities by about 10 ppts, 10 ppts, and 4 ppts, respectively, compared to children with access to non-accredited facilities. We also observe a significant positive effect of accreditation on child morbidity prevalence later during the period 2008–2014, but we do not observe any significant effects on all child morbidity prevalence outcomes during the period 2005–2008.

Robustness tests

We test the robustness and plausibility of our results by running several alternative checks which are discussed in Appendix C: checks and robustness extensions. Mainly, we test the parallel-trends requirement for the acceptable application of DiD; we run placebo models; we verify the common support requirement for the feasibility of the matching; we provide tests on the quality of the matching, and do several sensitivity analyses on the matching method.

Discussion

The findings of this paper suggest that the HSRP’s accreditation process in Egypt was associated with significant improvements in child morbidity, family planning, delivery care, and—to a lesser extent—in ANC.

A comparison between the early effects of accreditation during the study period 2000–2005 versus later during the periods 2005–2008 and 2008–2014 indicates that the positive effect of the facility accreditation program faded over time. Some of the positive effects were even reversed (e.g., information on contraceptives). One explanation could be that interventions under the HSRP had been slowing down and weakening since 2005. Reviewing Egypt’s HSRP, World Bank [1] shows that the pace of implementation of the program decelerated over time. This is compounded by the extent to which facilities complied with reform rather than the rate by which facilities joined the HSRP. A plausible indicator of compliance is the accreditation score. In this regard, Grun and Ayala [41] find that although more facilities got accredited, accreditation scores were increasing until 2004 but decreased after that. Further, accreditation compliance varied across governorates. Supporting our hypothesis, using compound quality indices, El-Zanaty and Associates et al. [42] demonstrate that the quality of services delivered at accredited facilities improved gradually during the first four years post reform, after which it decreased and hit its lowest level after 9 years. A high level of political commitment for reform was evident in the preparation and early implementation phases, but this was not sustained, undermining the reputational gains of accreditation [43].

Our results can also be explained by the findings of Braithwaite et al. [44], who compared accreditation programs in LMICs with those in high-income countries and concluded that, regardless of country context, the sustainability of those programs is determined by continued policy support from government, stable program funding, assorted incentives promoting facilities’ participation in accreditation, and constant improvement in accreditation agency operations and program delivery. In Egypt, the HSRP has suffered from a number of institutional sustainability issues. Most critically, political support has weakened in the absence of a robust analysis of the effectiveness and rationale of the interventions proposed under the program. The relationships between parent (existing) organizations and entities newly created under the program have not been clearly delineated by the program as well [1]. On the financial front, while the initial investment costs of the HSRP were substantial, a significant fraction of these costs was financed through donor funding. The ability of the HSRP to generate its own revenues from different sources was constrained, making the program financially unsustainable in the long run [1]. In this regard, Mansour et al. [6] emphasize the lack of financial resources as a major challenge to the implementation and sustainability of accreditation programs in limited-resource settings.

The study has a number of limitations. The most notable arises from the fact that Egypt’s DHS does not track the women and children over time. To overcome this, we collapsed the data of each wave at the facility level, estimated the effects of accreditation at the facility, and constructed a longitudinal dataset. Also, our analysis on child health is limited to morbidity prevalence of common early childhood illnesses instead of treatment received because the data of Egypt DHS is patchy and incomplete on treatments. Further, being a survey, some of our calculations rely on self-reported information (number of ANC visits, information on contraceptives received, etc.), which also introduces some noise. Related to this, although the response rate is very high for each wave, there are some outcome variables with a notable higher level of missing values (contraceptive information, for instance) and we are obliged to rely on the responses available. Lastly, a small amount of randomness may be due to treatment and controls groups being defined assuming that women follow the rules and attend their nearest PHC facility, but we trust this disturbance to be very small or inexistent due to the legal obligation to do so.

Conclusion

This paper contributes to the existing literature by investigating the effect of accreditation as a policy tool to improve quality on key patient outcomes rather than downstream process indicators, as used by most of the related literature.

We exploit six waves of the Egypt DHS to investigate the effect of HSRP’s facility accreditation programme in Egypt between 2000 and 2014 on family planning, ANC, delivery care, and child morbidity prevalence. To be able to measure the effect, we first spatially link women to their nearest mapped health facilities using their GPS coordinates. We then use DiD fixed-effects models and also combine DiD with Kernel PSM to correct our estimates from the potential endogeneity biases.

The results indicate that accreditation had multiple positive effects, especially on delivery care and child morbidity prevalence. The positive effects of accreditation appear to weaken over time though. Accreditation alone is not sufficient to sustain high quality of care, especially with respect to delivery care.

These results emphasize that a high, continued, level of commitment, which is a reflection of strong political will, is indispensable for the success of quality improvement interventions in LMICs in the long run. Decentralization in no way diminishes the necessity of a high level of pledge from the central government (see Braithwaite et al. [44] and Mansour et al. [6] for a detailed discussion).

Our findings encourage an enquiry into which interventions, if combined with accreditation, are associated to improved patient outcomes. There is evidence that improvements can be achieved, for example, through combining accreditation with properly monitored and well-designed payment or incentive schemes [35].