Traditionally, much of anesthesia research has focused on testing new drugs, novel indications for older drugs, or new devices. Nevertheless, it is also important to know whether clinically important within-class differences exist for existing drugs or devices. Unfortunately, within-class comparisons are rarely studied, and significant knowledge gaps exist. Comparative effectiveness research (CER) has emerged as a type of pragmatic research targeting “real-world” comparisons of the benefits and harms of commonly used interventions, including within-class comparisons.1,2

Strong evidence exists from studies in animals that the administration of volatile anesthetics before (preconditioning) and after (postconditioning) a period of myocardial ischemia is associated with cardioprotective properties.3,4 In humans, administration of volatile anesthetics at doses of 0.5-2.0 minimum alveolar concentration (MAC) throughout cardiac surgery results in less myocardial injury, fewer patients requiring inotropic support, and reduced mortality compared with total intravenous anesthesia.5-9 It is currently unknown if the cardioprotective effects of volatile anesthetics extend equally to all members of the class, or whether there is some differential benefit of one volatile anesthetic over another.

Two commonly used drugs for the maintenance of anesthesia in cardiac surgical patients are isoflurane (approved for use in the USA in 1979) and sevoflurane (approved for use in the USA in 1995). The relatively newer sevoflurane has some advantages over isoflurane. For example, it is less soluble than isoflurane, resulting in a faster onset and offset of action. It is also less irritating to the airway and not as pungent as isoflurane, and therefore, it can be used for inhalational induction of anesthesia (although this is uncommonly performed for cardiac surgery).10 Anecdotally, there appears to be an opinion among many cardiac anesthesiologists that sevoflurane is superior to isoflurane. This may be because, since the year 2000, considerable cardiac anesthesia research has focused on sevoflurane.6 The ubiquity of sevoflurane in cardiac anesthesia research may have caused some anesthesiologists to conflate commonly studied with beneficial. There are very few data to support the contention that sevoflurane is superior to isoflurane for cardiac anesthesia. Previous studies comparing the two agents are small,11-13 old (i.e., not reflective of current surgical/anesthesia practice),12-14 focused on surrogate outcomes,11-14 or were performed in highly specific patient populations.11

Although it is possible that there are no clinically important differences between sevoflurane and isoflurane when given as a maintenance anesthetic during cardiac surgery, this assumption should not be made without high-quality evidence. If sevoflurane and isoflurane are clinically similar, other practical considerations (e.g., availability, preference, or cost) may factor into the decision regarding which anesthetic to use. Alternatively, if one agent is clinically superior to the other, clinicians should be informed and should consider using the more beneficial anesthetic.

The objective of the Randomized Isoflurane and Sevoflurane Comparison in Cardiac Surgery (RISCCS) trial was to determine whether sevoflurane and isoflurane are comparable in terms of their effects on clinically important outcomes in a heterogeneous group of adults undergoing cardiac surgery. Because sevoflurane is the newer of the two anesthetics and has the theoretical advantages listed above, we hypothesized that sevoflurane would be non-inferior to isoflurane on the composite outcome of prolonged intensive care unit (ICU) stay and 30-day mortality when given in a dose of 0.5-2.0 MAC throughout the entire cardiac surgery. If sevoflurane was non-inferior to isoflurane, we further hypothesized that sevoflurane would be superior to isoflurane on other clinically important outcomes.

Methods

Study design

The RISCCS trial was a single-centre, prospective, pragmatic, randomized, parallel, non-inferiority comparative effectiveness trial that was conducted at University Hospital, London, Ontario, Canada (a university-affiliated quaternary care cardiac centre performing about 1,400 cardiac surgeries per year). Ethics approval was obtained from the Health Sciences Research Ethics Board at the University of Western Ontario in October 2009 (#16497). All study participants provided written informed consent before taking part in this trial.

Eligibility criteria

Patients ≥18 yr old having coronary artery bypass graft (CABG) surgery on- or off-pump, single valve repair/replacement, or CABG/single valve combined procedures were included. We excluded patients requiring emergency surgery, redo surgery, surgeries requiring planned deep hypothermic circulatory arrest, planned surgery on more than one valve, planned tracheal extubation in the operating room, and pericardial stripping. We also excluded patients who refused blood products, pregnant patients, and those with a risk of malignant hyperthermia. No changes to the trial’s methods or eligibility criteria occurred after trial commencement.

Intervention and anesthetic conduct

The intervention was randomization to anesthesia maintenance with either sevoflurane or isoflurane. The designated anesthetic was given at a strict minimal amount throughout the entire cardiac surgery [including cardiopulmonary bypass (CPB)].15 Randomization was performed just before induction of anesthesia in the operating room (OR). The randomization list was computer generated in a 1:1 ratio using randomly permuted blocks of sizes 2, 4, and 6. The perfusionist in the OR activated the randomization by opening the next consecutively numbered, sealed, opaque envelope. The vaporizers of the anesthetic machine and the heart-lung machine were then switched to the allocated anesthetic. The name of the anesthetic was not used on the anesthetic record or on the perfusion record—the record showed only the proportion of MAC that was delivered. Therefore, the patient, caregivers (except the anesthesiologist and perfusionist who were actually giving the anesthetic), and outcome assessors were all blinded to group allocation. Before the trial was started, approximately one-third of the anesthesiologists at our centre used isoflurane routinely, one-third used sevoflurane routinely, and the remainder used either volatile anesthetic. All anesthesiologists at our centre had extensive clinical experience using both anesthetics.

The target exposure to the randomized anesthetic agent was 0.5-2.0 end-tidal MAC, from just after anesthesia induction until the end of the surgery, including while on CPB. By protocol, any increase in the depth of anesthesia was accomplished by first increasing the concentration of volatile anesthetic and then by administering bolus doses of propofol, opioids, or benzodiazepines according to the anesthesiologist’s preference. To maximize the volatile anesthetic exposure, no intravenous sedatives, hypnotics, or opioids were permitted as infusions before protamine was administered after CPB.

In addition to invasive monitoring of radial arterial and central venous pressure, our institutional standard for monitoring included continuous electrocardiography, pulse oximetry, analysis of end-tidal carbon dioxide and anesthetic agent, temperature, transesophageal echocardiography (TEE), and bispectral index electroencephalography (BIS).16 Although not protocolized, induction of anesthesia typically involved a combination of fentanyl (5-10 µg·kg−1) or sufentanil (1-5 µg·kg−1), midazolam (0.05-0.1 mg·kg−1), propofol (0.25-1 mg·kg−1), and rocuronium (0.6-1.2 mg·kg−1). For on-pump surgeries, weaning from CPB involved integration of information from direct visualization of the heart, TEE, and arterial and central venous pressures. Inotropic and vasopressor drugs were used at the discretion of the anesthesiologist and surgeon.

We quantified the mean exposure to the allocated volatile anesthetic every 15 min by converting the end-tidal anesthetic concentration (or the directly administered concentration while on CPB), titrated to a BIS < 60, to its corresponding MAC value using a nomogram, adding the MAC values, and dividing this number by the number of 15-min intervals.

Patients were admitted to the ICU postoperatively, where all patient care was provided as per institutional standards, including routine laboratory tests and planned extubation within six hours. The time of ICU admission constituted “time zero” for the calculation of all postoperative durations.

Outcomes

The primary outcome was a composite of prolonged ICU length of stay (≥ 48 hr) and death from any cause within 30 days of the operation. The secondary outcomes included high-sensitivity cardiac troponin T (cTnT) measured six hours post-ICU admission, hospital and ICU lengths of stay (based on the times the participants actually left the hospital and the ICU—i.e., not when they were fit for discharge), duration of tracheal intubation, inotrope or vasopressor usage in the ICU at any time, prolonged (≥ 12 hr) inotrope or vasopressor usage, peak postoperative serum creatinine, new-onset hemodialysis, new-onset atrial fibrillation, use of an intra-aortic balloon pump (IABP), perioperative stroke, and ICU readmission.

Statistical analysis

Local historical data from 1,920 patients showed that 25% of our patients experienced an ICU length of stay > 48 hr or death from any cause within 30 days of surgery, driven almost exclusively by the prolonged ICU length of stay. Based on consensus from the trial’s investigators regarding the difference needed for clinical similarity between the two anesthetic agents, we considered sevoflurane to be non-inferior to isoflurane if the upper limit of the one-sided 95% confidence interval (CI) of the absolute difference in the primary outcome (sevoflurane—isoflurane) was < 10%. We chose 10% because, if surpassed, we considered this difference to represent a clinically relevant increase signifying increased cost of care and increased morbidity.

To achieve a power of 80% to show that the true difference in event rate was < 10%, using a true primary outcome incidence of 25% and a one-sided alpha error of 5%, 232 patients were required in each group.17 As supported by the CONSORT statement extension for non-inferiority trials, if non-inferiority was shown, a conventional two-sided 95% CI would then be calculated for the relative risk of sevoflurane compared with isoflurane.18

Summary statistics were computed for baseline demographic variables. Histograms constructed for continuous variables were first assessed visually to determine if they were approximately normally distributed. If they were, they were analyzed using the Student’s t test. If skewed, the difference in medians between groups, its respective 95% CI, and the null hypothesis test of no difference between medians were calculated using 0.5 quantile (median) regression, conditioning on group allocation, and bootstrapping with 10,000 replications for standard error estimation.19,20 Categorical variables, including the primary outcome, were analyzed using a two-sided Fisher’s exact test. Time-to-event data were visualized using the Kaplan-Meier method; differences in median times-to-event were calculated using median regression with bootstrapped standard errors, and hypothesis tests were performed using the log-rank test.

One blinded interim analysis was conducted after the 30-day follow-up had occurred on the first 232 patients. A Peto-Haybittle rule21 was used, such that a P < 0.001 (on a superiority hypothesis test) was required for statistical significance at the time of the interim analysis. We also prespecified subgroup analyses for the primary outcome based on sex, on-pump vs off-pump surgery, presence of diabetes, and left ventricular ejection fraction. Subgroup effects were assessed by tests of interaction.

All analyses were conducted according to the intention-to-treat principle. A P < 0.05 was considered statistically significant. No corrections for multiple comparisons were made.22,23 Stata® version 13 (StataCorp LP, College Station, TX, USA) was used for all analyses.

Results

Of 559 patients screened from November 2011 to March 2014, 542 (97%) were eligible to participate, and 464 patients were randomized. There were 233 patients allocated to the isoflurane group and 231 patients allocated to the sevoflurane group. No losses to follow-up occurred for the primary outcome (see trial flow diagram in Fig. 1). After enrolment of 232 patients and review of the blinded interim analysis data, the Data Monitoring Committee recommended continuing the trial as planned. No patient harm attributable to the study intervention occurred. Anonymized raw data and all statistical analysis codes are available as online open data (doi:10.6084/m9.figshare.3180352).

Fig. 1
figure 1

Trial flow diagram

Baseline characteristics are presented in Table 1. The majority of the study sample was male, and the prevalence of comorbidities and the details of medications taken were as expected for this patient population. Surgery details are presented in Table 2. Most surgeries were on-pump CABG, followed by single valve repair or replacement. There were no significant differences between groups in the duration of surgery or CPB, the duration of aortic cross-clamping, or the number of coronary grafts performed. The exposure to the allocated anesthetic agent, in MAC equivalents, was similar between groups. There was a small, but statistically significant, difference between groups in the median dose of heparin given intraoperatively.

Table 1 Characteristics of the patients at baseline
Table 2 Details of cardiac surgical procedures

Primary outcome

The primary outcome results are in Table 3. The incidence of the composite primary outcome of prolonged ICU stay and 30-day all-cause mortality was 25% in the sevoflurane group and 30% in the isoflurane group, with an absolute risk difference in the primary outcome between sevoflurane and isoflurane of −5.4% (one-sided 95% CI, 1.4). Since 1.4% was below the pre-specified non-inferiority margin of 10%, non-inferiority was declared. Using a two-sided 95% CI, the relative risk of the primary outcome in the sevoflurane group compared with the isoflurane group was 0.82 (95% CI, 0.61 to 1.11; P = 0.21). Therefore, for the primary outcome, sevoflurane was non-inferior to, but not superior to, isoflurane.

Table 3 Trial outcomes
Table 4(a) Estimated cost of isoflurane and sevoflurane for a cardiac surgical procedure lasting 4.5 hr in 2009 (in Canada)
Table 4(b) Estimated cost of isoflurane and sevoflurane for a cardiac surgical procedure lasting 4.5 hr in 2016 (in Canada)

Secondary outcomes

Categorical secondary outcome results are shown in Table 3 and Fig. 2. Overall, 124 patients (27%) had a prolonged ICU length of stay, and nine patients (1.9%) died within 30 days of surgery. There were no significant differences between groups in prolonged ICU stay, 30- and 365-day all-cause mortality, any inotrope/vasopressor usage, prolonged inotrope/vasopressor usage, new-onset hemodialysis or atrial fibrillation, stroke, IABP usage, or readmission to the ICU. Continuous secondary outcome results are shown in Table 3. The cTnT sample drawn six hours after ICU admission and the peak postoperative serum creatinine were significantly lower in the isoflurane group than in sevoflurane group. There was no difference between groups in the time to tracheal extubation, time to discharge from ICU, or time to discharge from hospital (Table 3, Figs 3 and 4).

Fig. 2
figure 2

Graphical depiction of primary and categorical secondary outcomes

Fig. 3
figure 3

Kaplan-Meier estimates of the time to extubation. Shown for the first 48 hr after admission to the intensive care unit. Four patients were right censored at the time of their death (two patients in each group). Five patients in the isoflurane group and two in the sevoflurane underwent tracheal extubation after 48 hr. P value from the log-rank test

Fig. 4
figure 4

Kaplan-Meier estimates of the time to discharge from the (a) intensive care unit and (b) hospital. Shown for the first five days and 15 days, respectively, after admission to the intensive care unit. P values from the log-rank test

Subgroup analyses

The effect of the allocated anesthetic agent within the four prespecified subgroups is depicted in Fig. 5. All tests for interaction were non-significant, indicating no signal of a differential effect of the allocated anesthetic within any of the subgroups.

Fig. 5
figure 5

Subgroup analyses for the primary outcome of prolonged intensive care unit length of stay or 30-day mortality. Left ventricular function was quantified as follows: grade 1: > 54%, grade 2: 40-54%, grade 3: 20-39%, grade 4: < 20%

Discussion

In the RISCCS study, sevoflurane was found to be non-inferior to isoflurane in the clinically important composite primary outcome of prolonged ICU length of stay or 30-day mortality. On subsequent superiority testing for this outcome, sevoflurane was not found to be superior to isoflurane. Furthermore, no differences between groups were seen in other clinically important secondary outcomes. In our view, these results have important clinical relevance. Specifically, for those anesthesiologists who previously favoured using sevoflurane, we have shown that sevoflurane is not inferior to isoflurane for cardiac surgery, and that it is also not superior to isoflurane. Therefore, these anesthesiologists could likely switch to isoflurane without concern of harm to their patients. For those anesthesiologists who previously favoured using isoflurane, we have shown that sevoflurane does not have any clinically significant advantages over isoflurane.

Analysis of secondary outcomes showed that the median cTnT measurements six hours after ICU admission were lower in the isoflurane group. This could potentially represent better pharmacologic preconditioning with isoflurane compared with sevoflurane, and this result is consistent with some previous animal research.24 Nevertheless, this difference in cTnT is likely not clinically important, as it is a surrogate outcome and no differences between groups were observed on any clinically important outcomes. Similarly, although peak creatinine was very slightly increased in the sevoflurane group compared with the isoflurane group, clinically important renal dysfunction would be expected to prolong hospital length of stay or require hemodialysis,25 which did not happen.

Non-inferiority trials have been described as testing “whether a new product is not unacceptably worse than a product already in use”.26 Accordingly, sevoflurane (the newer drug) should have been tested against isoflurane (the drug already in use) when it was first released in 1995, but this was not done. At the time of the conception of this trial in 2009, sevoflurane, when given at doses equivalent to isoflurane, was approximately 14 times the cost of isoflurane in Canada. For a typical cardiac surgical procedure lasting 4.5 hr, the cost per patient for isoflurane was $4.24 vs $58.92 for sevoflurane (Canadian dollars — see details of calculation in Table 4a). While the cost of sevoflurane has declined since it has now become a generic drug, in Canada, it is still about eight times the cost of isoflurane (2016 costs using the above assumptions: isoflurane $5.25 vs sevoflurane $41.24, a savings of about $36—Table 4b). The cost differential between the two anesthetics is similar in the USA.27 By 2020, cardiovascular disease is expected to be the primary cause of morbidity and mortality in many developing countries where every dollar counts;28,29 consequently, the stakes are even higher for the countries that are least able to pay for the premium of using sevoflurane. The RISCCS trial did not detect any advantage to using sevoflurane over isoflurane for the cardiac surgical procedures included in the trial, and since sevoflurane is still more expensive than isoflurane, we recommend that anesthesiologists consider using isoflurane for these procedures. If this recommendation is followed, millions of dollars could be saved yearly worldwide. In the USA alone, there could be an annual savings of about US$10 million if isoflurane were used instead of sevoflurane. This is assuming that sevoflurane is currently used in half of the roughly 550,000 cardiac surgeries performed annually in the USA and that there is a US$36 savings per case by using isoflurane instead of sevoflurane. While these calculations are crude and approximate, this large potential savings indicates the value of CER to determine the relative cost-effectiveness of agents within a particular class of drug.

Comparative effectiveness research has been defined as “… the generation and synthesis of evidence that compares the benefits and harms of alternative methods to prevent, diagnose, treat, and monitor a clinical condition or to improve the delivery of care.30 The concept of CER emerged due to the recognition that much of healthcare is not based on high-quality evidence. Research efforts commonly do not focus on outcomes that matter to patients, and head-to-head clinical trials comparing two or more alternative options, such as competing drugs within a class, are infrequently performed.31 Many previous studies of anesthetic preconditioning have focused on surrogate outcomes, which are used because it is usually easier to detect a difference when compared with clinically important outcomes.

The RISCCS trial used the CER approach to determine if there were any clinically important differences between sevoflurane and isoflurane when given as a maintenance anesthetic during cardiac surgery. Strengths of the trial include a pragmatic “real-world” trial design that posed a simple question and minimized disruption to regular clinical care, measured clinically important outcomes, and used a representative moderately large sample of patients. These factors give the RISCCS trial high external validity.

Nonetheless, as with any clinical trial, the RISCCS study also has some limitations. First, it was not possible to blind the anesthesiologists and perfusionists to group allocation. Nevertheless, since all trial outcomes were either factual (e.g., mortality, presence of atrial fibrillation, cTnT measurement, etc.) or decided by the blinded ICU staff (e.g., when to perform tracheal extubation, how long to use inotropes/vasopressors, when to discharge the patient from the ICU, etc.), we do not consider this to be a substantial risk. Second, by design, we did not protocolize the anesthesia in order to maximize generalizability of the results and to concentrate on effectiveness. Despite the lack of such a protocol, we did not detect any significant differences in the average depth of anesthesia or the usage of intravenous anesthetics; therefore, in our view, the lack of anesthesia standardization is not a significant concern. Third, this is a single-centre trial and the results may not be generalizable to other contexts. Finally, our choice of non-inferiority margin may seem to be overly generous; however, it is important to emphasize that, if the margin had been reduced to as low as 1.5%, the conclusions of this trial would not have changed.

Conclusions

In a representative patient sample undergoing common cardiac surgical procedures, sevoflurane was non-inferior to isoflurane on a composite outcome of prolonged ICU stay and all-cause 30-day mortality. Sevoflurane was not superior to isoflurane on any clinically important outcomes.