Introduction

Therapeutic hypothermia (TH) has been widely adopted into clinical practice for treatment of moderate to severe neonatal encephalopathy secondary to presumed perinatal asphyxia. The five large randomized trials of TH demonstrated a significant reduction in the risk of death and disability compared to non-treated neonates; however, there remain a large proportion of neonates who die or develop disability despite treatment.1 Following animal studies that suggested further benefit of deeper TH or longer duration of treatment, the NICHD Neonatal Research Network (NRN) performed the Optimizing Cooling (OC) trial. The OC trial randomized newborns with moderate or severe encephalopathy to four hypothermia regimens: 33.5 °C for 72 h (usual treatment); 33.5 °C for 120 h; 32 °C for 72 h; and 32 °C for 120 h. However, this trial was stopped early for safety and futility and later animal studies question the potential benefit and possible harm of deeper or longer cooling.2,3 A lower death rate was observed in the usual treatment arm compared to the longer and/or deeper cooling arms. When comparing outcomes from the OC trial to the initial NRN Induced Hypothermia (IH) trial, patients in the usual care arm of OC had lower unadjusted rates of in-hospital death and death or neurodevelopmental disability (7% vs. 19%, p = 0.02) and (29% vs. 44%, p = 0.03), respectively.4,5 Enrollment criteria and study procedures in the OC trial were similar to the original IH trial, except that clinical cooling could be initiated prior to randomization. Neonates were excluded if they had a temperature recorded of <32 °C for 2 h.

It is important to understand possible reasons for the lower rate of death or disability in the “standard of care” arm of the OC trial as other trials of adjunctive treatment to hypothermia are currently ongoing or in development. Changes in clinical care may contribute to the observed differences in morbidity and mortality rates. Secondary analyses of TH trials reported on the associations between blood gas tensions of carbon dioxide,6 oxygen,7 and blood glucose levels8 and outcome. The use of sedation and analgesia may have also changed between trials; however, its association with outcome is controversial.9,10 Changes in the approach to the identification, diagnosis, and treatment of seizures have been cited in the literature with potential impact on outcome.11,12,13,14 Intensive care by a team now experienced in the provision of TH may have differentially impacted the clinical outcome of patients in the OC trial. Finally, as the OC trial was initiated, TH was recognized as a safe and effective treatment likely resulting in increased focus on the recognition and enrollment of eligible patients, particularly those with moderate encephalopathy, and shorter time to initiation of TH. However, therapeutic drift is also occurring with treatment being applied to neonates who may not have met all of the criteria outlined by the trials. Registry data in the United Kingdom confirm an increase in the use of TH after the TOBY trial with concern for therapeutic drift between 2007 and 2011, indicating that patients with less severe hypoxic–ischemic encephalopathy (HIE) are now being treated.15 Further systematic analysis of patient characteristics and clinical care practices among infants enrolled in the two trials of TH may provide helpful information for planning future trials and determining sample size. Our objective was to perform an analysis of differences in patient characteristics and care practices between the IH and OC trials.

Methods

Enrollment into the IH and OC trials occurred 2000–2003 and 2010–2013, respectively, at NICHD NRN centers (15 centers in IH and 18 centers in OC). The institutional review board at each NRN center approved the trial protocols. Inclusion criteria for both trials were similar and as follows: neonates born at a gestational age of at least 36 weeks and admitted to an NRN center within 6 h of birth, the presence of physiologic criteria, and seizures (clinical or electrographic) or moderate to severe encephalopathy identified by a certified examiner using a standardized modified Sarnat examination.4,16,17 A training session for research personnel was held to standardize the neurologic examination and each site principal investigator certified additional neonatologists to perform the neurologic examinations. Encephalopathy was defined as the presence of moderate or severe signs in three out of the following six categories: level of consciousness, spontaneous activity, posture, tone, primitive reflexes, and autonomic nervous system. The number of moderate or severe signs determined the severity of encephalopathy at enrollment, and if the number of signs were equally distributed, then the determination was based on the level of consciousness. There were definitional differences in the OC trial in the categories of tone and one sub-category of neonatal reflexes, suck. In the OC trial, both hypotonia and hypertonia were defined as moderate abnormalities wherein IH-only hypotonia was included. In the severe category for tone, a neonate could be described as flaccid or rigid wherein IH-only flaccid was used. For moderate abnormalities in suck reflex, the presence of a “bite” was added to the description of a “weak” suck used in IH. Exclusion criteria for both trials were similar, except for the addition of a core temperature of <32.5 °C for at least 2 h prior to randomization for the OC trial. Passive or active cooling was allowed prior to randomization in the OC trial.

For all neonates enrolled in the IH trial (N = 208) and all enrolled in the OC trial (N = 364), we compared maternal and baseline neonatal characteristics, and variables reflecting resuscitative measures at birth. To evaluate subtle differences in the severity of encephalopathy between the two trials, we compared the distribution of the severity of findings in the pre-randomization modified Sarnat exam categories in neonates determined to have moderate HIE and those with severe HIE between the two trials. In the subgroup of IH infants (n = 102) randomized to therapeutic hypothermia (33.5 °C for 72 h) and OC infants (n = 95) randomized to usual cooling (33.5 °C for 72 h), we also compared baseline maternal and neonatal characteristics.

Specific clinical care practices prior to and during the 72 h of treatment were compared between the two groups treated with cooling at 33.5 °C for 72 h in IH and in OC. The following variables were analyzed: lowest PCO2 and highest PO2 levels; lowest and highest blood glucose levels; proportion of neonates who were intubated and mechanically ventilated, fraction of inspired oxygen (FiO2), use of anticonvulsant medication, analgesics and sedatives, inotropic agents prior to baseline, at baseline (defined as time esophageal probe was placed and initiation of study treatment), and at 24, 48, and 72 h of treatment. Differences in the use of electroencephalogram (EEG) and short-term outcomes, including electrographic seizures, length of stay, need for gastrostomy tube or gavage feeds at discharge, discharge on anticonvulsant medications, and in-hospital death were evaluated. The primary outcome of each trial, death or moderate or severe disability at 18–22 months was compared. The definition of moderate or severe disability was similar in both trials.4,5 To assess cognitive development, in the IH trial, the Bayley II was used, and in the OC trial, the Bayley III, developed in 2006, was used. The cutoffs for moderate or severe cognitive impairments were identical in both studies, each reflecting either 1 or 2 SD (±15) below the population mean of 100, respectively. A mental developmental index (MDI) on the Bayley II or a cognitive score in the Bayley III of 70–84 was used as part of the definition for moderate and an MDI or cognitive score of <70 was used as part of the definition of severe disability.

Statistical analyses

χ2 Tests (or Fisher’s exact test) for categorical variables, and t tests for continuous variables were used to evaluate differences in maternal and neonatal baseline characteristics, care practices, and outcome measures. Logistic and linear regression analysis were used to evaluate differences in care practices and outcome between the subgroups in IH treated with TH and those in the usual care arm of OC with adjustment for severity of HIE and center (as a random effect) to evaluate differences in the primary outcome of death or disability between the IH and OC trials. Tests for interaction between severity of encephalopathy and trial (IH or OC) were performed.

Results

Maternal and neonatal characteristics

Maternal and baseline neonatal characteristics of all subjects enrolled in the IH and OC trials, as well as those randomized to the cooling arm of the IH trial (33.5 °C for 72 h) and the usual care arm of the OC trial (33.5 °C for 72 h), are shown in Table 1. In the OC trial, maternal thyroid disease, hypertension, and hemorrhage were more common than in the IH trial, while uterine rupture occurred more frequently in IH than OC. Fewer neonates in OC were delivered by emergent cesarean section. Compared to OC, neonates enrolled in IH had more frequent: Apgar score <5 at 10 min after birth, lower pH and greater base deficits, intubation at delivery, delayed onset of spontaneous respirations, severe encephalopathy, and clinical seizures prior to randomization. In contrast, neonates enrolled in OC compared to IH were born at slightly younger gestational ages and more were outborn and thus transferred to an NRN center.

Table 1 Maternal and neonatal characteristics across all randomized patients and subgroups of TH-treated IH trial and OC usual cooling arm

Use of pre-randomization cooling, initiation of cooling, and temperature at baseline

The majority of subjects (69%) in OC were either passively or actively cooled prior to randomization. In the usual cooling arm in OC, 70 of 95 (74%) were cooled prior to randomization, 52 were clinically cooled (placed on a cooling device), 17 were cooled passively, and 1 was cooled with ice/gel packs. Despite randomization occurring at a later age in OC compared to IH (5.1 ± 1.0 vs. 4.3 ± 1.3 h), subjects clinically cooled prior to randomization in OC had cooling initiated earlier (3.8 ± 1.3 vs. 5.0 ± 1.2 h, p < 0.0001). Overall mean age at initiation of cooling occurred earlier in OC, 4.2 ± 1.4 h, compared to, 5.0 ± 1.2 h, that in IH, p < 0.0001.

Continuous temperature data prior to randomization was not available and after baseline core temperature was measured every 15 min for the first 4 h of intervention. At the time of randomization, 54% of those in OC and only 4% of subjects in IH had a temperature within one degree of goal temperature (32.5−34.5 °C), p value <0.0001. The age at which a core temperature of <34 °C was first documented occurred earlier in OC 5.27 ± 1.66 h compared to IH 5.83 ± 1.23h, p = 0.008 (t test). Those who were cooled prior to randomization in OC (n = 70) had a temperature of 33.8 ± 1.4 °C at randomization. Overall, neonates in OC had significantly lower mean temperatures at baseline (initiation of study intervention) compared to IH (34.6 ± 1.8 vs. 36.9 ± 1.0 °C, p  < 0.0001).

Modified Sarnat examination

Infants were diagnosed with moderate/severe encephalopathy if they had ≥3 of the six categories that were moderately or severely abnormal. Differences in the distribution of the severity of the modified Sarnat exam components in neonates determined to have moderate HIE at randomization (IH vs. OC) and those determined to have severe HIE (IH vs. OC) at enrollment are shown in Supplemental Table S1. For neonates with moderate HIE, the distribution of severity of abnormalities in the level of consciousness, spontaneous activity, tone, and autonomic nervous system differed between IH and OC. For neonates with severe HIE at randomization, the distribution of severity of findings in the different exam categories was similar, except for the autonomic nervous system, with the difference being in the subcomponent of respiration.

Care practices during therapeutic hypothermia

Blood gas tensions of oxygen and carbon dioxide in the first 72 h after birth differed between IH and OC with lower PCO2 and higher PO2 values being observed in IH compared to OC (Table 2). These differences in PCO2 and PO2 remained significant after adjustment for severity of encephalopathy and for center. At baseline (start of intervention), 90% of neonates in IH were intubated and on mechanical ventilation compared to 78% of those in OC, adjusted relative risk (RR) 0.85, 95% confidence interval (CI): 0.73–0.99, P value 0.04. At 24, 48, and 72 h, fewer subjects were intubated and on mechanical ventilation in OC compared to IH, but these differences were not statistically significant after adjusting for severity of encephalopathy and center. FiO2 at baseline and at 24, 48, and 72 h is shown in Table 2. Neonates in OC received a lower FiO2 compared to IH at baseline, 24, and 48 h (P value 0.004–0.03). The use of inotropic support was lower at all time points in OC, but not statistically significant after adjusting for severity of encephalopathy and center. The adjusted RR of anticonvulsant exposure prior to randomization was significantly decreased in OC compared to IH, but did not remain significant at ≥24 h. Analgesics and sedating medications were more frequently used in OC compared to IH and was statistically significant after adjustment for severity of encephalopathy and center at 24, 48, and 72 h of treatment. A table describing the types of sedative and analgesic medications used in both studies is available as Supplementary material (Supplemental Table S2).

Table 2 Variables reflecting care practices prior to and during TH

In-hospital, discharge, and 18–22 month outcomes

The frequency of the diagnosis of clinical or electrographic seizures prior to hospital discharge was similar (62% in IH vs. 55% in OC). EEGs were not required but were performed in 68% of patients in IH and 76% of those in OC; this difference was not statistically significant. Data on EEG confirmation of seizures was not available for IH. In OC usual care arm, 76 patients had an EEG performed and 24% had electrographic seizures identified. The use of anticonvulsants at discharge was decreased in OC, but this difference was not statistically significant after adjusting for severity of HIE and center (Table 3). Length of stay and need for gavage tube or gastrostomy feeds at discharge were similar between the trials. A significant difference in in-hospital death was present between the hypothermia-treated patients in IH and OC (19% vs. 7%, p = 0.02) and in primary outcome at 18–22 months (44% vs. 29%, p = 0.03). After adjusting for severity of HIE and center, the relative risk of in-hospital death (RR 0.46, 95% CI: 0.17–1.27, p = 0.13) and the primary outcome (RR 0.73, 95% CI: 0.47–1.15, p = 0.17) in OC compared to IH was not statistically significant. In moderate and severe groups alone, the frequencies of in-hospital death and the primary outcome were lower in OC compared to IH (Table 3), but this difference was not statistically significant. On multivariate analysis, there was no interaction between severity of HIE and trial. Rates of cerebral palsy and distribution of the gross motor function classification system (GMFCS) are also shown in Table 3.

Table 3 Hospital, discharge, and 18–22 month outcomes

Discussion

Therapeutic hypothermia treatment has emerged rapidly as the standard of care for moderate or severe HIE following completion of the initial clinical trials. There was rapid diffusion of this therapy around the United States and abroad.15 In 2005, as the IH trial was published, only 6.4% of NICU directors reported utilizing TH and 30% felt it was effective or very effective, but by 2011 this increased to 50% using TH and 85% felt it was an effective or very effective treatment.18 This increase in the use of TH likely led to the development of expertise in the care of central nervous system (CNS) and non-CNS organ dysfunction observed in this patient population. Multiple publications during this period focused on optimizing the clinical management of these patients.19 In this analysis, we documented that both patient characteristics and care practices of infants receiving TH changed in OC relative to IH.

There were differences identified in both maternal and neonatal characteristics that may reflect differences in severity of perinatal asphyxia and encephalopathy between the two trials. We did not have obstetric practice data to compare in the two trials, but identified that neonates in OC were born at slightly younger gestational ages and fewer were delivered by emergent cesarean section. In the OC trial, neonates were less critically ill after birth with higher pH and lower base deficits and fewer had severe encephalopathy (23% in OC vs. 32% in IH). This difference in the proportion of subjects with severe encephalopathy despite using similar inclusion criteria may reflect unmeasured changes in obstetric practice and acceptance of hypothermia as a safe treatment, as well as improved awareness of its effectiveness in reducing the risk of death or disability for neonates with moderate or severe encephalopathy.1 Evaluation of the distribution of the severity of modified Sarnat exam components identified differences in several categories in those with moderate HIE between the trials. In contrast, in those with severe HIE the only difference observed between OC and IH was in the distribution of the respiratory subcomponent of the autonomic nervous system. Despite the differences in patient characteristics, there was not a statistically significant difference in the adjusted risk of death or disability in OC compared to IH (RR 0.73, 95% CI: 0.47–1.15, p = 0.17).

Our analysis indicates that there were several changes in care practices between the two trials. The relative contribution of each difference in practice is not able to be determined in this study. A potentially important difference was the use of passive or active cooling prior to randomization in OC. Centers were encouraged to not delay initiation of clinical cooling and the majority of neonates in OC were cooled passively or actively before randomization. More than half of the patients in OC had core temperatures that were within one degree of goal temperature at study baseline. Pre-clinical and an observational study indicates that earlier onset of cooling delays the onset of secondary energy failure and that the latent phase duration is inversely related to the severity of the insult.20,21,22 Meta-analyses have not been performed evaluating the impact of earlier cooling on outcome; however there has been an educational focus on improving identification of eligible patients and timely initiation of TH.23 Thoresen et al.24 retrospectively analyzed a cohort of infants with HIE in which cooling initiation at <3 h after birth was associated with an improved motor outcome.20 Reasons for the initiation of cooling at <3 and >3 h are not provided. The TOBY trial demonstrated a trend (p = 0.08) towards better outcomes in newborns cooled within 4 h of birth.

Blood gas tensions of oxygen and carbon dioxide differed between the IH and OC trials, even after adjusting for severity of encephalopathy. This suggests that practitioners were more aware of the deleterious effects of hypocarbia and hyperoxia and perhaps reflects changes in management to respiratory care in order to minimize exposure to low carbon dioxide and high oxygen tensions. This may also reflect that infants in OC were less sick and therefore less likely to have hypocarbia and hyperoxia. Free oxygen radicals are thought to mediate the adverse effects of hyperoxia, while hypocarbia is known to decrease cerebral blood flow. Both have been found in a multivariate model to be associated with adverse outcomes, including death, cerebral palsy, and poor developmental outcome.6,25

Differences in the use of medications commonly used during cooling, including inotropes, anticonvulsants, and sedatives/analgesics, were documented. Some may be related to differences in severity of illness, although in some cases the difference persisted after adjusting for severity of HIE at enrollment and for center. Changes in the use of anticonvulsants may be related to changes in seizure identification and management practices between the two trials. Neonates with HIE are known to be at high risk for the development of seizures, with a documented frequency ranging from 30 to 60% in recent studies.13,26,27,28,29 The use of EEG was higher in the OC trial and this may reflect the impact of multiple publications highlighting the high incidence of electrographic seizures.13,29 In 2011 the American Clinical Neurophysiology Society recommended that 24 h of continuous EEG monitoring should be implemented in neonates with HIE.11 During this same period, there was increased awareness that many seizures are subclinical and that the diagnosis of seizures based on clinical exam is difficult and is known to be inaccurate regardless of the level of experience of the clinician.30 Seizures also likely impact outcome with prolonged electrographic seizures being independently associated and a risk factor for poor outcome.31,32,33 We also identified that fewer infants were discharged home on anticonvulsants in the OC trial compared to the IH trial, although this difference was not statistically significant. Exposure of the immature brain to anti-epileptic drugs is known in animal studies to increase apoptosis and may alter behavior and cognition.34 In a cohort study of hospital exposure to phenobarbital and/or levetiracetam, increased cumulative phenobarbital and/or levetiracetam exposure were both associated with decreased cognitive and motor scores, although the degree of brain injury on imaging was not controlled for and the study did not address the impact of ongoing outpatient treatment with these medications.35 Given the limited sample size it is difficult to evaluate the impact that a change in the use of both in-hospital and post-discharge anticonvulsants could have had on the primary outcome.

Finally, we also identified a difference in the use of sedation and analgesia between the trials with increased use in OC compared to IH during TH. Increased use of sedation/analgesia may reflect increased surveillance by practitioners to address patient comfort. The NRN did not prescribe the use of sedation or analgesia to participating centers. Use of analgesia during hypothermia was identified in the Simbruner trial of TH as a possible mechanism for improved efficacy in his trial.36 The mechanism of action is not well understood, but may be related to a reduction in cold stress and shivering and therefore a decrease in metabolic rate.9 In a secondary analysis of the IH trial exposure to sedation/analgesia was not associated with the primary outcome.10

The risk ratios of death alone and death or disability after adjustment for center and level of encephalopathy suggest that at least part of the difference in mortality and disability between trials is related to patient acuity. Randomized controlled trials would be needed to determine the impact of earlier initiation of treatment, sedation/analgesia, monitoring for seizures, and thresholds to treat seizures on outcomes. It is also important to consider that TH is being applied in clinical practice to neonates with less severe perinatal asphyxia and encephalopathy including some with mild HIE.37,38,39 Design of future trials of neuroprotection will need to consider the impact of the severity of perinatal asphyxia, degree of encephalopathy after birth, and common care practices.

Limitations

Given the small number of patients included in this analysis and who experienced the primary outcome, we are not powered to perform a multivariate model to determine the impact of differing care practices between the trials. Data elements collected were not all identical between the two studies and some direct comparisons were difficult. We were not able to determine if there were differences in obstetric practice between the trials that may have contributed to the differences in severity of illness although neonates in OC were born at a slightly younger gestational age and fewer were delivered by emergent cesarean section. There was likely variation in practice across and within centers in blood pressure, seizure management, and use of analgesics and sedatives as standard management guidelines were not part of either study protocol. Use of two different standardized tests, the Bayley II in IH and Bayley III in OC may limit the ability to make direct comparisons in outcome between the trials. Direct comparison between Bayley II (MDI) and Bayley III (Cognitive) scores in both preterm and term HIE populations have been published.40 In both these populations, Bayley III cognitive scores are thought to underestimate the degree of impairment when compared directly to the Bayley II. In the term HIE population, a Bayley III cognitive score of <85 identified all but 1 patient who had a Bayley II MDI of <70.40 Thus, the use of Bayley III in the OC trial with a cut-off of <85 for moderate–severe cognitive impairment should identify all neonates with severe impairments as determined by the Bayley II, but may have underestimated the frequency of those with mild–moderate impairments. Finally any differences observed may be related to chance.

Conclusions

Despite similar inclusion criteria between the two NRN trials of TH there were significant differences in patient characteristics at enrollment. In OC compared to IH, newborns were less critically ill, fewer had severe HIE, they were cooled earlier, anticonvulsant use was lower, there was less hypocarbia and hyperoxia, and use of sedation or analgesia was more prevalent. These demographic and care practice differences may have contributed to the unadjusted differences in survival and disability rates in the usual care arm of the OC trial when compared to the cooled arm of the IH trial, although these differences were no longer significant after adjustment. Severity of perinatal asphyxia, degree of encephalopathy, and care practices need to be considered in the design of future trials.