Introduction

Perinatal health care regionalisation of care calls for an organised neonatal retrieval service [26, 41]. The aims of a quality control programme in newborn transport are similar to those in other areas of medicine: measure the service’s performance, prevent adverse events and progress to continuous improvement, amongst others [14, 15, 38]. Nevertheless, due to the intrinsic characteristics of inter-hospital transport [2], quality assessment can be challenging [24]. Moreover, many newborn transport models coexist, with variations on team composition, resources and training [22, 26, 39]. Performance measures in other areas of medicine, such as emergency medicine, critical care and neonatology, are already defined [31, 35], but consensus on the ideal indicators solely for neonatal transport lack at the moment. Scientific societies, like the American Academy of Pediatrics (AAP) and Section on Transport Medicine (SOTM), and recent guidelines [36, 41, 46] emphasise the need to measure indicators in newborn transport. To date, some studies have approached the subject [21, 32], suggesting the use of quality indicators applied to paediatric transport but not exclusive for newborns.

The primary aim of this study is to make a proposal for quality indicators for newborn transport and to evaluate the feasibility of their use, assessing their performance through time in a specialised newborn retrieval service: newborn transport programme of Madrid (SUMMA 112). A secondary aim of the study is to evaluate the possible influence of factors only applicable to newborn population that might affect the measures or the outcomes.

SQUIRE guidelines (Standards for Quality Improvement Reporting Excellence guidelines) were followed when preparing the manuscript [40].

Patients and methods

A retrospective descriptive study was performed. The study included every patient that was retrieved by the newborn transport programme of Comunidad de Madrid from 2009 to 2015.

This transport programme is activated if retrieval is needed for any patient less than 29 days old or less than 44 weeks of post-conceptual age. Data were collected from the transport medical charts of all newborns retrieved during the study period.

Newborn transport system organisation

In Spain, out of its 17 administrative regions, each with an independent public health care management, only 4 of them provide a specialised newborn-paediatric inter-hospital transport system, Comunidad of Madrid being one of them. The newborn health care in the region of Madrid is a highly regionalised network comprised by 4 maximum level of care hospitals (IIIB-C [34], IIIA and II level hospitals) and private maternity centres up to 44 potential points of referral. It covers an area of 8021.80 km2 and a population of 6,377,364 (19.80% of Spain’s population). The birth rate in Comunidad de Madrid in the last 5 years remained stable, approximately 10.16 newborns per 1000 inhabitants [20].

The Madrid Newborn Transport System is a full-time specialised retrieval system with coverage of the whole region resulting in the cooperation of the hospital newborn consultants and the emergency services: “Servicio de Urgencias Médicas de Madrid (SUMMA 112)”. A dedicated newborn transport consultant, that belongs to one of the 4 maximum newborn level hospitals with specific training in newborn critical care and transport, provides specialised medical care. The rest of health care personnel, an Intensive Care Mobile Unit (nurse and emergency driver technician) and the material resources belong to the local emergency service (SUMMA 112) and do not routinely work with newborns. Emergency driver technicians in SUMMA 112 emergency service are trained in radio communications, ambulance operations and emergency response driving skills. The transport process starts with the referring hospital demanding transport for the affected newborn. The incoming call is received at SUMMA 112 call centre, where patient data are collected and the on-duty newborn transport consultant is contacted to decide the level of care to which the patient should be transported and to provide medical advice to a referring doctor. Suitable level-of-care cot is searched. The newborn transport consultant is deployed along with the dispatch of an Intensive Care Mobile team out of the 27 that are operating everyday in Comunidad de Madrid (SUMMA 112). Both assets are coordinated to arrive separately at the referring hospital to stabilise and retrieve the patient. Stabilisation process takes place according to evidence-based algorithms. Clinical data and transport times are documented on a paper-based medical record. Adverse events are registered through a specific report system.

Quality indicators

There is a lack of evidence-based scientific evidence on transport quality indicators possibly on account of the difficulties related to study design in transport medicine [33]. Quality indicators were selected and defined based on published consensus among health professionals [6, 36] and literature on the subject [1, 8, 21, 24, 30, 32, 38]. Some other metrics were considered clinically relevant based on the experience acquired through the development of the specialised newborn transport programme in the last 10 years and were included because of their high frequency or their impact on the performance. Twenty-four indicators were selected suiting the principles of good quality metrics [28, 29]. Table 1 shows the definition of the quality indicators. They were classified in the six dimensions of quality suggested by the Institute of Medicine, renamed Health and Medicine Division (HMD) of the National Academies of Sciences, Engineering, and Medicine [10, 38]: safety, timeliness, effectiveness, efficiency, equity and patient-centred care. Trends in indicator values were evaluated through the study period.

Table 1 Quality indicator definitions

The time of study was divided into two consecutive periods for data analysis: first period from 2009 to 2011 (3 years) and second period from 2012 to 2015 (4 years). The decision of dividing our study period in two was made to analyse the evolution of the programme’s performance through time. In 2009, major changes in human resources (including adding 2 transport consultants to the team) and material resources (new equipment) were made. The first 2 years after these major changes included training and technical problem solving. From 2011 onward, all the personnel had received their training and new protocols were developed to equipment maintenance and operation.

Transport times are defined as follows: (a) Mobilisation time is the time frame from the initial emergency call to team dispatch; (b) Response time is considered the period between call reception and the arrival of the team at patient’s bedside [1], and it includes the mobilisation time; (c) Stabilisation time is defined as the time from arrival of the team to the patient’s bedside until its departure from the referral hospital [7, 45].

The incidence of arterial hypotension, as a way to evaluate patient’s worsening, is suggested in the literature as a quality transport indicator [32, 36]. In newborns, there is still a challenge in defining what systemic arterial hypotension is, the best definition being the one adjusted by gestational age [47], but taking into consideration the patient’s clinical context (presence of acidosis, need for inotropic support, etc.).

The severity of clinical condition in our study was assessed using TRIPS-II score [23, 25]. It is a validated, physiology-based transport score that includes four empirically weighted items: temperature, blood pressure, respiratory status and response to noxious stimuli. TRIPS-II score may be used to assess the physiologic stability of the newborn, the lower the score, the more stable the patient is (range 0–54 points), and as a tool to evaluate the risk of mortality at neonatal intensive care unit (NICU) admission [23]. TRIPS-II score was calculated retrospectively at three different time points: (a) at the beginning of the transport, to estimate the severity of the infant’s clinical condition; (b) before team’s departure at the referral hospital, to assess the stabilisation process and (c) after transport, to estimate clinical evolution.

Statistical analysis

Baseline characteristics and indicators were analysed as follows: normal distribution of continuous data was analysed using visual methods and the Shapiro-Wilk test. If normally distributed, comparison between study periods was made using Student’s t test, and values are described as mean and standard deviation. If continuous data were not normally distributed, comparison was made using U Mann-Whitney test, and values were described as median and interquartilic range. Box plots were used to graphically represent the distribution of continuous variables. Categorical variables were defined as absolute counts and percentages and were assessed using chi square test. A two-tailed p value of less than 0.05 was considered statistically significant. Stratified analysis was performed when required. When considered relevant to quality improvement, multivariate linear or logistic regression models, where appropriate, were used to investigate associations of the events of interest with the analysed factors. Factors whose association with the indicator was significant, or close to significance (p value < 0.2), in the univariate analysis were entered in the multivariate model. Results of linear regression were expressed as regression coefficients, and results of logistic regression were expressed as odds ratios (ORs), with their 95% confidence interval (CI). Stata® 14.2 (Statacorp, College Station, TX, USA) was used for data analysis.

Results

During the study period, 2904 patients were retrieved, with an average of 6.4 transports/1000 births; most of them were immediate or emergency retrievals (75.92%). Patient’s epidemiologic characteristics in both periods (Table 2) were similar with small but statistically significant demographic differences among periods in birthweight and sex. There was a significantly lower prevalence of prematurity in the second period.

Table 2 Patient’s characteristics

The evaluation of the first quality dimension, safety, was performed defining 9 indicators; their itemised rate analysis per period is expressed in Table 3. The global incidence of safety-related events was 4.55% (132/2904) and it descended significantly through time. The only variable independently associated to the incidence of safety issues during transport was the unavailability of neonatal retrieval resource caused by simultaneous transports (OR 5.00; CI 95% 3.24–8.02, p < 0.001).

Table 3 Quality metrics

Two indicators were identified for the dimension timeliness: mobilisation time and total response time. A significant improvement in both indicators was observed through the study period (Fig. 1a, b): time until transport team left base was lower in the second period, with a mean time of 54.3 min (SD 57.45) compared with the first period with a mean mobilisation time of 60.66 min (SD 57.23) (p = 0.001). Total response time showed also a significant descent: 79.57 min (SD 49.51) in the second period versus 88.27 min (SD 36.73) in the first one (p = 0.004). The variables independently associated to mobilisation time were as follows: daytime frame (afternoon − 9.27 min; 95% CI − 13.8 to − 4.72; p < 0.001 and night − 9.35 min; 95% CI − 14.95 to − 3.76; p = 0.001), diagnosis of respiratory distress (− 7.80 min; 95% CI − 13.86 to − 1.74; p = 0.012), diagnosis of perinatal asphyxia (− 8.45 min; 95% CI − 15.36 to − 1.54; p = 0.017), gastrointestinal pathology (− 9.16 min; 95% CI − 16.62 to − 1.70; p = 0.016) and lack of coordination of transport assets (+ 70.31 min; 95% CI 65.35–75.27; p < 0.001).

Fig. 1
figure 1

Team mobilisation time (a) and response time (b) comparing two periods of study. *p < 0.05

The quality metrics in the dimension of effectiveness are itemised in Table 3. Stabilisation and total transport time did not vary significantly during the whole study period (Table 3).

Stabilisation time was correlated to several factors: low level of complexity of the referral centre (levels IIA and IIB) (+ 6.97 min; 95% CI 1.25–12.98; p = 0.017), extreme prematurity (+ 7.37 min; 95% CI 1.41–13.33; p = 0.015), patient on any kind of respiratory support (+ 11.76 min; 95% CI 7.14–16.35; p < 0.001) or vasoactive-inotropic therapy (+ 18.90 min; 95% CI 13.19–24.61; p < 0.001) and performance of any major medical intervention during transport (+ 23.19 min; 95% CI 17.76–28.62; p < 0.001).

The prevalence of arterial hypotension significantly diminished in the second period (Table 3). This indicator was associated with factors related to patient’s severity of condition: cardiovascular nature of the newborn’s disease (OR 2.39; 95% CI 1.23–4.63; p = 0.01), perinatal asphyxia (OR 2.04; 95% CI 1.23–4.63; p = 0.001) and the need of any respiratory support during transport (OR 6.26; 95% CI 4.01–9.79; p < 0.001).

A significant number of missing values in the variables included in TRIPS-II score, such as temperature at retrieval and systemic arterial pressure, precluded us from assessing the indicator “patient’s clinical evolution”.

Unnoticed hypothermia at hospital arrival occurred in 33.36% of patients (978/2904), with a significant trend to descent through study time (Table 3). It was related to postnatal days of age (OR 0.99; 95% CI 0.985 to 0.99; p = 0.004), gestational age in weeks (OR 0.88; 95% CI 0.86 to 0.90; p < 0.001), respiratory support (OR 1.55; 95% CI 1.26 to 1.92; p < 0.001) and high complexity level of referral hospital (IIIC) (OR 0.66; 95% CI 0.47 to 0.95; p = 0.023). Due to the lower prevalence of prematurity in the second period, a stratified analysis was conducted (Fig. 2), and unnoticed hypothermia was still significantly less prevalent in the second period in both preterm and term newborn babies.

Fig. 2
figure 2

Unnoticed hypothermia stratified by term/preterm babies in both periods

Efficiency was measured by 2 indicators: wrong patient triage that happened just once in the first period and none in the second, and back transport prevalence that did not change through study time (Table 3).

Equity was measured through the analysis of the delay caused by unavailability of a NICU bed of the needed level of care, and in the delay of access to the patient to the retrieval resource. The latter showed a nonsignificant increase in the second period (Table 3); the unavailability of the newborn retrieval care, due to simultaneous transports that needed to be attended or to a coordination error, was more frequent during winter (OR 1.63; 95% CI 1.25–2.15; p < 0.001) and varied along with the time frame of the day, being less possible in the afternoon (OR 0.72; 95% CI 0.55–0.94; p = 0.017) and during night shift (OR 0.45; 95% CI 0.31–0.64; p < 0.001) compared with daytime.

Patient-centred care was evaluated through 3 indicators: familial presence was stable through time of study; written informed consent fulfilment increased significantly in the second period, while appropriate pain control in newborns with potentially painful pathology was lower in the second period (Table 3).

Discussion

The present study summarises the quality evaluation of a newborn transport system using twenty-four quality indicators; some had been previously defined in literature and some were designated based on relevance considering the programme’s experience. The study shows how quality evaluation in newborn transport is feasible, and how some quality metrics need to be adapted to or defined specifically for the newborn population.

Newborn patients have unique characteristics inherent to their size, maturity and pathology. This may affect the performance of the transport programme itself [1, 6, 32, 44]. Gestational age affects significantly some of the transport performance measures in the effectiveness domain: stabilisation time and unintended hypothermia [1, 3, 13, 27, 32]. In the present study, unnoticed hypothermia was outstandingly more frequent than reported in other studies [37] and similar to others [42], and prevalent in preterm and term babies, but significantly descended in the second period probably reflecting a better performance. Besides, the adequate definition for systemic arterial hypotension in newborns is yet to be established, but it is clear that gestational age needs to be taken into account [47].

Some of the indicators used in the present study were selected because they represent an important matter of newborn transport, such as some patient-centred indicators. Newborn pain evaluation during transport lacks the appropriate tools as there is none validated in this context [19]. In our programme, pain perception during transport is determined by the clinician and the pharmacological sedation-analgesia is provider-driven. Pain control in pain-susceptible situations seemed more appropriate as an indicator than pain evaluation using validated tools, as the latter does not entail right management. Patient-centred indicators show internal validation with consistent values through time. Their usefulness is yet to be determined, based on the transport’s unit quality policy, if social and psychological aspects of the team performance are added to definition of quality [14].

Patient’s clinical evolution during transport, defined relying on the change in the patient’s TRIPS-II score during transport, has been previously defined in published literature as a quality metric [11, 17]. The transport process can influence greatly in the patient’s clinical status, especially in preterm newborns, but the contribution to the transport process itself is still unclear [9, 16, 43]. A quality metric achievable during transport, such as TRIP-II score, with prognostic implications can be of great use to evaluate the transport team’s performance and, after adjusting for population characteristics, to benchmarking. Regrettably, due to the retrospective nature of the present study, the significant amount of missing values of some of the items included in TRIPS-II score made its calculation impossible; this circumstance highlights the importance of data registry to quality control.

In Spain, there is universal access to health care. One of the goals of a highly regionalised neonatal care system is to guarantee equity in terms of access to appropriate care. Equity in newborn transport in this study was estimated using the delay of access to the patient to the retrieval resource due to the transport team being busy. This was an indicator more prevalent in winter, probably due to bronchiolitis annual epidemic and during morning time frame because transport overlapping resulted in an increase in response time. Minimising the response time is a prime goal in any transport service [1], especially since it may have a negative effect on the patient, as shown in our study with a significant increase in adverse events. Safety events occurred, which is consistent with the published literature [24, 30, 32]. Safety events in the present study improved through time.

There is not an ideal and single model of transport system [26]. It has to be adapted to the regional available resources [46]. Madrid’s newborn transport programme infrastructure is designed to share resources with the local emergency transport system (SUMMA 112), and this study shows how this may limit the performance of the newborn transport programme in some quality areas: dislodgement of medical devices probably due to lack of training of the transport nurse in newborn practice, response time caused by concurrence of transports, etc.

There are some limitations to the study, the first one being the lack of external validation of the suggested indicators. The retrospective evaluation in two consecutive periods of different lengths may affect the strength of the results; the decision of the period division was made upon major internal changes that were performed in the transport protocol through the first period (2009), in order to evaluate if this affected the performance of the service. The patient’s epidemiologic characteristics and main diagnosis were quite similar through the time study except for the fact that prematurity significantly descended in the second period. To adjust for this confounding factors, logistic regression and stratification was performed. The significant differences found in the quality metrics between the two periods studied may reflect an improvement (i.e. response time) or a worsening (i.e. medical device dislodgement rate) in quality of service but, due to the design and the retrospective nature of the study, it could reflect the influence of other factors or lack of registry of an indicator by transport personnel [30]. The observational nature of the study design limits the possibility of drawing robust conclusions from the data. Observational studies, though, illustrate the real daily conditions of the situation evaluated [33]: the newborn transport process. Some of the indicators, such as TRIPS-II score, were obtained retrospectively with significant missing values probably reflecting a lack of registry of normal values [16].

Quality metrics definition can be a challenging task, and the ideal way to select and define them is still to be agreed upon [41]. Quality indicators for paediatric and newborn transport have been defined in previous studies using different methodologies, the expert panel consensus, Delphi method, being the most accepted [6, 32, 36]. Despite increasing international interest for improving the quality of health care, there is still a great variation in the kind of care delivered during newborn transport around the world. It is important to benchmark and compare the performance of different transport systems in order to seek for excellence, but it is also important to choose indicators adapted to the population’s characteristics, newborns in this case, and suitable to assess the quality of team performance through time. More clinical research is needed in newborn transport quality metrics because there is still paucity of data on this matter.

Conclusions

Quality evaluation through the definition of specific metrics in newborn transport is feasible. The chosen indicators should be defined specifically or adjusted for newborn population in order to measure the real performance of the transport service and to facilitate benchmarking.