Main

Chemotherapy for patients with recurrent ovarian cancer remains palliative, with the aim to improve quality of life and prolong time to progression. The platinum-free interval is of predictive significance as responses to re-exposure to platinum are less likely with shorter time since last platinum-based chemotherapy (Gore et al, 1990; Markman et al, 1991). Ultimately, most patients with recurrent disease will develop platinum resistance. In this setting, combination therapies are associated with increased toxicity without advantage in overall survival (OS) (Buda et al, 2004; Sehouli et al, 2008; Lortholary et al, 2012), and single-agent therapies have been favoured. The most active agents are pegylated liposomal doxorubicin (PLD), weekly paclitaxel, gemcitabine and topotecan (ten Bokkel Huinink et al, 1997; Gordon et al, 2001; Mutch et al, 2007; Vergote et al, 2010) with response rates between 8.3 and 21%. Only recently the combination of bevacizumab and single-agent chemotherapy has been reported to significantly increase progression-free survival (PFS) compared with chemotherapy alone (Pujade-Lauraine et al, 2014). Still, treatment with chemotherapy is associated with side effects like neutropenia, alopaecia, neuropathy and palmar-plantar erythrodysesthesia (PPE) and these have been shown to impair patient’s quality of life (Lakusta et al, 2001; Stiggelbout and de Haes, 2001; Le et al, 2004). Hormonal treatment has been considered a worthwhile alternative to chemotherapy due to its limited side effects and simple dosing. The antitumour activity of tamoxifen is believed to be mediated by competitive binding to oestrogen receptors, but a direct antiproliferative effect of oestrogen antagonists and progesterone has also been suggested (Batra and Iosif, 1996). To our knowledge, there is to date only one comparative study of endocrine treatment (leuprorelin) vs chemotherapy (treosulfan) (du Bois et al, 2002). This study was stopped prematurely due to concerns about the lack of efficacy. For tamoxifen, phase II studies have reported response rates around 10–13% (Markman et al, 1996; Williams et al, 2010) with stabilisation of disease in 32% of the patients, which is not dissimilar to the results of chemotherapy in this population (Hatch et al, 1991).

In the context of platinum-resistant ovarian cancer (PROC), the trade-offs between clinical benefit and quality of life are important and the performance of tamoxifen against chemotherapy in this patient group has not been previously investigated in a phase III setting.

This is a phase III randomised study comparing single-agent chemotherapy with tamoxifen in patients with PROC. Preliminary results were presented at the 2008 Annual Meeting of the American Society of Clinical Oncology.

Patients and methods

The study was designed and carried out in accordance with good clinical practice, the declaration of Helsinki and national laws. Local ethics committee of each participating centre approved the study. All patients provided written informed consent before study entry. The study is registered at ClinicalTrials.gov with Identifier: NCT02728622.

Eligibility criteria, randomisation and quality assurance

Patients with histologically confirmed invasive epithelial ovarian, fallopian tube or peritoneal cancer, being resistant to treatment with platinum and a taxane (in a three-weekly regimen), either given in combination or sequentially, were eligible. Resistance was defined as one of the following. (1) Clinical progression during or within 6 months after end of treatment for primary disease or relapse. (2) Stable disease after six courses of chemotherapy for primary disease or relapse. (3) Doubling of CA 125 to at least 70 IU ml−1 within 3 months after end of treatment for primary disease or relapse.

Patients were to be aged 18 years, have ECOG performance status 2, and have adequate liver, renal and bone marrow function. Exclusion criteria were symptomatic brain metastases, active infection or other severe underlying medical condition.

Randomisation was performed by computerised minimisation techniques. Patients were randomised 1 : 2 to either hormonal treatment or chemotherapy of investigator's choice.

The responsible study office (NSGO-CTU) followed the protocol-defined guidelines on central monitoring and reviewed case report forms for completeness and consistency.

Treatment plan and dose modifications

Patients randomised to hormonal treatment received oral tamoxifen 40 mg daily. Patients randomised to chemotherapy received either paclitaxel 80 mg m−2 as a 1-h infusion, given every 7 days or PLD 40 mg m−2 given every 4 weeks. The first dose was infused over 2 h; later doses were infused over 1 h. Chemotherapy was administered with appropriate premedication.

Study specific guidelines were followed for dose delay and dose reduction for haematologic and non-haematologic toxicity and for supportive care use.

The use of supportive G-CSF treatment was not recommended instead of dose reduction but was left to the investigator’s discretion.

Assessment of PROs, efficacy and follow-up

Health-related Quality of Life (HRQoL) was evaluated using the European Organization for Research and Treatment of Cancer (EORTC) quality-of-life questionnaire (QLQ-C30, version 3.0) and the specific module for ovarian cancer (QLQ-OV-28, version 1.0). Health-related Quality of Life was assessed at baseline and every 8 weeks (±2 weeks) during treatment and every 12 weeks (±2 weeks) thereafter until patient death or withdrawal. In this analysis, questionnaires from all assessments until disease progression were analysed.

Tumour measurements according to Response Evaluation Criteria in Solid Tumors (RECIST) (Therasse et al, 2000) were recorded at baseline and every 8 weeks using the same tumour assessment technique throughout the study. Treatment was discontinued in case of clinical tumour progression or deterioration of performance status not being compatible with continued treatment. In case of complete clinical remission, treatment could be terminated after two more courses of PLD or six more courses of paclitaxel.

In case of progression on protocol treatment, a crossover from tamoxifen to chemotherapy and from chemotherapy to tamoxifen was recommended if continued treatment was in the best interest of the patient.

Adverse events and toxicity were graded during each cycle by the study investigator according to the National Institute Common Toxicity Scale version 1.0 (Cancer Therapy Evaluation Program, 1992). Toxicities were evaluated using the worst score per toxicity in each patient.

Date of last follow-up was the 5 January 2009.

Statistical analyses

The primary end point was evaluation of HRQoL in both treatment arms. Secondary end points were PFS and OS.

The study was designed in 2002 when detailed aspects of HRQoL outcomes and sample size were described in general rather than in specific terms in the protocol. As the analysis of the primary end point HRQoL was not adequately prespecified in the protocol, the results presented here are considered secondary analyses. Nevertheless, the study was conducted to investigate whether the less toxic but potentially less effective treatment with tamoxifen would demonstrate an improvement in HRQoL compared with the presumably more toxic treatment with chemotherapy. In exploratory analyses of HRQoL, we assessed quality-adjusted survival (QAS) as a measure to address this research question.

In the original protocol, the sample size for the study was based on detecting a difference in HRQoL and also in PFS, with no adjustment for type I error. The power calculation for HRQoL was based on detecting an improvement of 1 point on the 7-point Likert scale; however, the version of QLQ-C30 used in the study did not present items on a 7-point Likert scale. For PFS, it was powered to detect at least a 54% increase in the risk of progression (hazard ratio (HR) of 1.54) on tamoxifen assuming a median time to progression of 13 weeks for those receiving tamoxifen and 20 weeks for those receiving chemotherapy. Based on a 2 : 1 randomisation, a 3-year accrual and 1-year follow-up, a sample size of 223 patients was required to detect this difference. Power calculation should ideally have been based on the primary end point, change in HRQoL, alone.

PFS was defined as the time from randomisation to disease progression or death from any cause. Overall survival was defined as the time from randomisation to death from any cause. Response was recorded as best response on treatment.

Efficacy and HRQoL analyses were based on the intent-to-treat population of eligible patients. Analysis of toxicities was based on the safety population (all patients who received at least one dose of study treatment). The scoring manuals from EORTC were applied to the HRQoL data to calculate scales for each instrument (Fayers et al, 2001). The abdominal/gastrointestinal (GI) symptom subscale comprised items 1–6 of the EORTC QLQ-OV28. The chemotherapy side effects subscale comprised items 13–17 and hormonal side effects items 18 and 19. Compliance of HRQoL was calculated using all patients, who had not progressed or died prior to the time point as the denominator.

All analyses of HRQoL questionnaires were performed using a mixed model for repeated measures with time by treatment interaction, together with the appropriate baseline scores, treatment allocation and time point. HRQoL subscales were presented for patients up to 30 days after their end of treatment day. HRQoLs were reassigned a visit based on the date of completion. On treatment this was any QoL within an 8 week window.

For the QAS analysis, the weights of McKenzie (McKenzie and van der Pol, 2009) were applied to the QLQ-C30 to estimate EQ5D utility. Survival probabilities were calculated using the area under the curve with an appropriate truncation time point of 43 weeks. An estimation of the QAS was calculated with a 95% CI and P-value using bootstrapping methods with a 1000 replications. As visits were scheduled every 12±2 weeks, QAS included all patients with at least one HRQoL measurement 12+2 weeks prior to the date of progression or 12+2 weeks after if no HRQoL prior to PD was available.

In responder analysis we studied the proportion of patients with a clinically significant change in HRQoL score on a particular scale compared with their baseline score. An improvement was defined as at least 15% increase with a 10% increase used in sensitivity analysis (Osoba et al, 2005; Stockler et al, 2014). Patients with a HRQoL decrease of 15% or more (10% in sensitivity analysis) were classified as ‘worsened’, whereas a change within those ranges was classified as ‘no change’. Chi-square test was used to determine the statistical significance of differences between the treatment arms. Subgroup analysis was performed for the GI symptom scale in relationship to patients’ severity of GI symptoms at baseline. Category boundaries were determined using median score at baseline for the dividing patients into those with high and low baseline symptoms.

Time-to-event data were analysed using the Kaplan–Meier method, and the log-rank test was used to compare the distributions between groups. Hazard ratios with 95% CIs were estimated using the Cox proportional hazards model. Sensitivity analysis used a stratified log-rank test stratifying for the a priori defined factor of progression during or after discontinuation of chemotherapy.

Toxicity was described for each group separately and proportions compared by the Wilcoxon–Mann–Whitney test.

SAS statistical package, version 9.3, was used for the analysis. Plots were created in Stata 13.0.

Results

Patients and follow-up

Between March 2002 and December 2007, 241 patients were enrolled. Three patients were excluded from the intention-to-treat population. One patient was excluded due to evident brain metastases, one patient had no data ever recorded, and a third patient was randomised twice under two different ID numbers. Two-hundred and thirty-eight patients were included in the intention-to-treat analysis of eligible patients, 82 in the tamoxifen arm and 156 in the chemotherapy arm (Figure 1). Of the patients randomised to chemotherapy, 70 were allocated to weekly paclitaxel and 86 to PLD. Five patients did not receive the allocated treatment: two patients in the chemotherapy arm and three allocated to tamoxifen. Thus, 233 patients were included in the safety analysis. Baseline characteristics are summarised in Table 1. The CONSORT diagram for the population analysed for quality of life is shown in Supplementary Figure S1. The compliance rates with HRQoL assessments in the tamoxifen arm were 85% (47 of 55) and 72% (23 of 32) at weeks 8 and 16, respectively. In the chemotherapy arm, these rates were 89% (119 of 133) and 77% (65 of 84), respectively. For analysis of QAPFS, 2 patients had no QoL information 12+2 weeks prior to PD and thus had QoLs included 4 and 10 days after PD. Median follow-up was 169 weeks (range: 0–326).

Figure 1
figure 1

CONSORT diagram: enrolment, randomization and treatment.

Table 1 Baseline characteristics

Toxicity

There was a statistically significant difference in non-haematological and haematological toxicities between the treatment arms with more nausea, mucositis, motor neuropathy, neurosensory, PPE, anaemia, leukopenia and neutropenia (Supplementary Table S1) in the chemotherapy arm. Two patients had febrile neutropenia in the chemotherapy arm, none in the tamoxifen arm. Patients on PLD had significantly more nausea, mucositis and PPE, but less anaemia compared with paclitaxel. In the chemotherapy arm, 4 (2.6%) patients received G-CSF and 12 patients (7.8%) experienced a dose reduction. Of the four dose reductions on paclitaxel, three were due to haematological toxicity (neutropenia and thrombocytopenia). On PLD, eight patients had a reduction in dose, seven due to non-haematological toxicity (PPE) and one due to haematological reasons (neutropenia). In 37 patients (24%) chemotherapy was delayed for at least one cycle. In the tamoxifen arm, eight patients (10.1%) had a loss in treatment days, ranging from 1 to 52 days.

Deaths on study

Ten (6.4%) patients died in the chemotherapy arm; seven of those were related to the disease. One patient died due to sepsis of unknown origin, and one developed headache and became unconsciousness before she died. No autopsy was performed. The third patient was admitted to hospital with a subarachnoid haemorrhage and died the next day. Eight (10.1%) deaths occurred on tamoxifen, with seven being disease related. One patient died of sepsis of unknown origin.

Efficacy

A total of 203 patients (85.3%) had progressive disease during follow-up. Median PFS in the tamoxifen group was 8.3 weeks (95% CI, 8.0–10.4) compared with 12.7 weeks (95% CI, 9.0–16.3) in the chemotherapy arm, corresponding to a HR of 1.54 (95% CI, 1.16–2.05; log-rank P=0.003) (Figure 2). There was no statistically significant difference in PFS between the two chemotherapy arms (HR of 1.21, 95% CI: 0.85–1.72).

Figure 2
figure 2

Kaplan–Meier estimates of progression-free and OS according to treatment arm.

By the end of follow-up, 204 (85.7%) patients had died. Median OS time was 44.9 weeks (95% CI, 31.1–57.7) in the tamoxifen arm and 46.9 weeks (95% CI, 37.1–51.0) in the chemotherapy arm, corresponding to a HR of 1.10 (95% CI, 0.82–1.47; log-rank P=0.53) (Figure 2). Stratification for type of prior progression did not affect the estimates of PFS or OS.

Two hundred patients (84%) (136 in the chemotherapy arm and 64 in the tamoxifen arm) were evaluable and assessed for response. Of those, 2 patients (1.5%) experienced complete response and 21 (15.4%) partial response in the chemotherapy arm. In the tamoxifen arm, the best response obtained was partial response in three patients (4.7%). Stable disease was achieved in 52 patients (38.2%) in the chemotherapy arm and 20 (31.3%) in the tamoxifen arm. Progression was reported in 61 (44.9%) and 41 patients (64.1%) in the chemotherapy arm and tamoxifen arm, respectively.

Mean quality-of-life scores over time adjusted for the baseline

Over the study period, there was no statistical difference between the two arms for all the functioning scales (Figure 3). A statistically significant difference between the two arms existed at week 16 only for the abdominal GI symptom (P=0.04) and hormonal side effects (P=0.01) subscales, with more symptoms in the tamoxifen arm. Chemotherapy side effects symptoms were more common in the chemotherapy arm; however, the difference did not reach statistical significance.

Figure 3
figure 3

Baseline and scores from a mixed model for health-related quality of life function scales in both treatment arms. Error bars represent 95% confidence interval for the estimated mean values. *Subscales derived from the EORTC_QLQ-C30; Subscales derived from EORTC-OV-28.

Patient response analysis

Figure 4 illustrates the responder analysis (improved, stable or worsened) for each HRQoL scale and at week 8 based on a change of 15%. In the chemotherapy arm, a significantly larger proportion of patients experienced a worsening in their social functioning at week 8. For a change in HRQoL of 10% significantly more patients reported a worsening in both their social and physical functioning on chemotherapy compared with patients on tamoxifen (Supplementary Figure S2). The majority, 200 (90.9%), reported abdominal symptoms at baseline with a median symptom score of 22.2. There was no difference in the proportion of patients who reported improvement of their GI symptoms between the treatment arms. There was further no statistical significant difference between the treatment arms in the proportion of patient with symptom improvement in subgroups of more or less GI symptoms at baseline.

Figure 4
figure 4

HQOL response proportions at 8 weeks (15% cutoff).

Quality-of-life adjusted survival

Table 2 includes the quality-adjusted survival estimates for PFS. There was a significant difference between the utilities in favour of tamoxifen. However, this difference was offset by the difference in PFS and thus there was no difference in quality-adjusted PFS between the treatment arms (P=0.14).

Table 2 Quality-adjusted survival

Discussion

In this study of patients with PROC, single-agent chemotherapy was more effective in prolonging PFS compared with tamoxifen, but this small gain was traded off against poorer quality of life and increased treatment-related toxicity. Treatment with chemotherapy did not yield better control over GI symptoms compared with tamoxifen. There was no difference in OS between the treatment arms.

Our survival data in patients treated with single-agent chemotherapy are disappointing but in line with most previous reports (ten Bokkel Huinink et al, 1997; Gordon et al, 2001; Mutch et al, 2007; Vergote et al, 2010). Only about 17% of the evaluable patients showed complete or partial response on chemotherapy. Similar response rates have previously been reported for tamoxifen in PROC (Markman et al, 1996). Tamoxifen has not previously been directly compared with single-agent chemotherapy in PROC. A randomised study on the alkylating agent treosulfan and the GnRH (gonadotropin-releasing hormone) analoga leuprorelin reported no objective response in either of the treatment arms but significantly longer PFS in the chemotherapy arm (17 weeks vs 10 weeks, P=0.035). In our study only 4.7% responded to tamoxifen. Even though chemotherapy yielded better response rates than tamoxifen, the median PFS just exceeded 3 months. Patients on chemotherapy experienced more haematological and non-haematological toxicities and a deterioration of their social and physical functioning during treatment. We also studied the potential palliative effect of chemotherapy on GI symptoms as HRQoL data from second-line treatment with platinum-based chemotherapy suggested better symptomatic control especially in symptomatic patients (Brundage et al, 2012). The proportion of patients reporting GI-symptom improvement here was the same in both treatment arms, also in patients with higher symptom burden at baseline, suggesting that single-agent chemotherapy is not better than tamoxifen in palliating these symptoms. There is evidence that patients can accept a more toxic regimen if a benefit in survival is expected (Lee et al, 2013). In this study, we tried to assess these trade-offs by analysing QAPFS. There was no difference in QAPFS between the treatment arms, which may mean that the small gain in PFS was traded off against the poorer quality of life on chemotherapy.

The strengths of this study include its randomised design, the assessment of response and progression by RECIST criteria and the high completeness rate of HRQoL data until progression. There are several limitations of this study. The analysis of the primary end point of this study, HRQoL, was poorly defined in the protocol. Still, the primary and prespecified research question was to evaluate quality of survival with either of the regimens. We can therefore interpret the effects of the regimens on the quality-of-life outcomes as assessed here but need to be cautious to conclude from analyses of QAPFS as they may be underpowered. The higher completion rate of HRQoL assessments in the chemotherapy arm may also have biased our results. There was no evaluation of hormone receptor status and better patient selection to endocrine treatment may have led to better outcomes on tamoxifen treatment (Hatch et al, 1991). Also, the time to first response assessment may have been too short to observe an antiprofilerative effect. In the chemotherapy arm patients with unfavourable baseline characteristics were overrepresented. The patients had more previous lines of chemotherapy and more mucinous and clear-cell tumour histologies, which are known to be less chemosensitive compared with other histologies. These differences may have contributed to the slightly poorer results when compared with the results in the chemotherapy arm of the Aurelia study (Pujade-Lauraine et al, 2014). We could not demonstrate any benefit in OS for patients treated with chemotherapy probably in part due to the allowance of crossover between the trial arms post-progression. The tamoxifen and chemotherapy arms, 77 and 62%, respectively, received further chemotherapy. Subsequent treatment after progression may also have had an impact on our estimates of HRQoL at progression as these data in a minority of patients were collected after progression.

Since the study was designed, treatment strategies of platinum-resistant disease have evolved. The combination of single-agent chemotherapy with bevacizumab has been shown to increase response rates and PFS compared with single-agent chemotherapy alone, also with a benefit for GI symptoms (Pujade-Lauraine et al, 2014; Stockler et al, 2014). There was also a marginally significant benefit in OS in patients who had received paclitaxel plus bevacizumab (Poveda et al, 2015). However, due to the strict exclusion criteria with regard to the increased risk of GI perforation on bevacizumab, these findings may not be generalisable to all patients with PROC. Funding restrictions in many countries are another reason why single-agent chemotherapy is still widely used. There is also growing evidence that a subgroup of patients may again respond to platinum-based chemotherapy despite short disease-free survival. Especially patients with germline-BRCA mutation, but possibly also patients with somatic mutations in the DNA homologous repair pathway, may be more likely to respond to re-challenge with carboplatin and thus yield longer PFS (Alsop et al, 2012) compared with non-platinum based chemotherapy. Strategies targeting that pathway directly (i.e., PARP inhibitors) have also shown promising results in PROC (Kaufman et al, 2015) with a more favourable toxicity profile compared with chemotherapy.

In this phase III study in PROC a more toxic but presumably more effective treatment (single-agent chemotherapy) is compared with a potentially less toxic one (tamoxifen). The marginal longer PFS on chemotherapy was traded off against poorer quality of life questioning the palliative effect of single-agent chemotherapy. Our explorative analysis of QAPFS is suggested as a model to assess the trade-offs between prolonged PFS and QoL in this poor prognostic group of patients. The ongoing GCIG (Gynecologic Cancer Intergroup) symptom benefit study will hopefully provide more information on the benefit of chemotherapy on symptom control and quality of life (Friedlander et al, 2009). In this patient population with particularly poor prognosis, potential treatment strategies should also be assessed in terms of their benefit for QAS. Our analyses may serve as a model for further research on QAS estimates. Selection of patients based on molecular profiling may hopefully allow us in the future to better tailor management and ultimately improve survival in patients with PROC.