Beyond D’Amico risk classes for predicting recurrence after external beam radiotherapy for prostate cancer: the Candiolo classifier

Background The aim of this work is to develop an algorithm to predict recurrence in prostate cancer patients treated with radical radiotherapy, getting up to a prognostic power higher than traditional D’Amico risk classification. Methods Two thousand four hundred ninety-three men belonging to the EUREKA-2 retrospective multi-centric database on prostate cancer and treated with external-beam radiotherapy as primary treatment comprised the study population. A Cox regression time to PSA failure analysis was performed in univariate and multivariate settings, evaluating the predictive ability of age, pre-treatment PSA, clinical-radiological staging, Gleason score and percentage of positive cores at biopsy (%PC). The accuracy of this model was checked with bootstrapping statistics. Subgroups for all the variables’ combinations were combined to classify patients into five different “Candiolo” risk-classes for biochemical Progression Free Survival (bPFS); thereafter, they were also applied to clinical PFS (cPFS), systemic PFS (sPFS) and Prostate Cancer Specific Survival (PCSS), and compared to D’Amico risk grouping performances. Results The Candiolo classifier splits patients in 5 risk-groups with the following 10-years bPFS, cPFS, sPFS and PCSS: for very-low-risk 90 %, 94 %, 100 % and 100 %; for low-risk 74 %, 88 %, 94 % and 98 %; for intermediate-risk 60 %, 82 %, 91 % and 92 %; for high-risk 43 %, 55 %, 80 % and 89 % and for very-high-risk 14 %, 38 %, 56 % and 70 %. Our classifier outperforms D’Amico risk classes for all the end-points evaluated, with concordance indexes of 71.5 %, 75.5 %, 80 % and 80.5 % versus 63 %, 65.5 %, 69.5 % and 69 %, respectively. Conclusions Our classification tool, combining five clinical and easily available parameters, seems to better stratify patients in predicting prostate cancer recurrence after radiotherapy compared to the traditional D’Amico risk classes. Electronic supplementary material The online version of this article (doi:10.1186/s13014-016-0599-5) contains supplementary material, which is available to authorized users.


Background
Prostate cancer (PCa) is the most common cancer in men and the second most common cause of death from tumors in males [1]. Several nomograms have been developed to guide therapy and predict outcome following radiotherapy: one of the most popular classifications is the D'Amico risk classifier, dividing patients according to pre-treatment PSA, clinical stage and biopsy-based Gleason score (bGS) into three categories: low-risk (PSA <10 ng/ml and cT1-cT2a and bGS ≤6) , intermediate-risk (PSA 10-20 ng/ml or cT2b or bGS 7) and high-risk (PSA >20 ng/ml or clinical stage ≥ cT2c or bGS ≥8) [2].
Several additional pathological, clinical-radiological and molecular markers have been studied in the past 15 years to implement D' Amico classification [3]: of these, the simplest and more readily available information are age at treatment and the percentage of positive biopsy cores (%PC). In 2000-2001, D' Amico reported the independent prognostic ability of %PC to foresee biochemical recurrence beyond traditional risk factors in a radical prostatectomy cohort of 823 men [4] and in 473 men treated by external-beam radiotherapy (EBRT) [5].
Regarding age, Siddiqui [6] reported in a radical prostatectomy cohort of 5509 men a higher risk of systemic progression and death from prostate cancer in patients younger than 55 compared to patients older than 70. In addition, Ribeiro [7] confirmed that age lower than 60 predicts for a poor outcome (time to progression and overall survival) in metastatic prostate cancer receiving androgen deprivation therapy (ADT).
In our study, based on a large database of thousands of patients affected by prostate cancer and treated with EBRT, we developed a 5-classes nomogram for predicting biochemical Progression Free Survival (bPFS) including not only pre-treatment PSA, stage and bGS, but also age and %PC. Thereafter, we applied it to predict clinical PFS (cPFS), systemic PFS (sPFS) and Prostate Cancer Specific Survival (PCSS) too, and compared the performance of our nomogram with the D' Amico risk classification.

Patient population
Two thousand four hundred ninety three men affected by prostate cancer and treated with EBRT as primary treatment comprised the study population.
This cohort belongs to the EUREKA-2 retrospective multi-centric database, including 3776 cases of radiotreated prostate cancer cases in North-West Italy between 1997 and 2012, approved by FPO-IRCCS Cancer Center of Candiolo Ethical Committee in July 2013 and amended in November 2014. Ten Radiotherapy Divisions participated to EUREKA-2 study, seven of which in Piedmont Italian region, two in Lombardy and one in Tuscany (full information are provided in Additional file 1: Table S1). Inclusion criteria for EUREKA-2 study were: histologic diagnosis of adenocarcinoma of the prostate; Radical Radiotherapy as first-line treatment, performed with conformational technique within the period January 1 st , 1997 to December 31 st , 2012; at least two of the following three pre-treatment parameters: PSA, staging, Gleason Score; case histories, clinical and serological follow-ups available for clinical data collection. Concerning privacy procedures, data were pseudoanonymized by local researchers within the participating Hospitals. The two lists with patients' personal data and clinical data were kept separated and only clinical data, and not personal data, were sent to EUREKA-2 common database. Consequently, outside the hospitals and in the common EUREKA-2 database data are anonymous and by no way amenable to the single patient identity. The files are stored with Microsoft Excel and Microsoft Access data management software (®MicroSoft Corporation, Redmond, Washington, USA) according to different domains: socio-demographic group of data, biopsy and staging, therapies, oncologic outcomes, serologic follow-up and collateral effects.
In particular, from the whole EUREKA-2 database were excluded 1283 patients without complete information regarding established pre-treatment factors (PSA, clinical-radiologic stage and bGS) and the number of total and positive biopsy cores.

Preoperative staging and treatment
In all cases, staging evaluation included anamnesis, physical exam with Digital Rectal Examination (DRE), serum PSA and a trans-rectal ultrasound (TRUS) guided needle biopsy of the prostate with GS histologic grading. Radiological examinations (abdominal CT, endo-coil or pelvic MRI and bone scan) were performed according to the patient risk-class, to the physician's opinion and to the available hospital facilities.
PSA was obtained before biopsy and radiologic studies. Primary, secondary and total GS were mainly attributed according to ISUP 2005 revised Gleason Score system [8]. The clinical-radiologic stage was obtained integrating the clinical AJCC 2010 staging system [9] with all the radiologic information available, while biopsy data were not taken into account. Age at treatment was calculated as the difference between the first day of radiotherapy and the date of birth, rounded to the closest integer number. The %PC was calculated multiplying 100 by the number of positive cores containing prostate cancer, of any length, and dividing by the total number of cores sampled.
All patients were treated with curative 3-Dimensional Conformal EBRT (3D-CRT) or Intensity Modulated Radiation Therapy (IMRT). The fractionation schedules to prostate-GTV (Gross Tumor Volume) varied between traditional fractionation of 1.8-2 Gy per fraction to moderate hypo-fractionation of 2.5-2.7 Gy per fraction; all doses were normalized to Equivalent Dose at 2 Gy per fraction (ED2Gy) using a mean α/β of 2.5 Gy for prostate cancer (according to literature α/β ratio for prostate cancer ranges between 1.5 and 5.7 Gy [10][11][12]). Treatment consisted of radiotherapy alone or radiotherapy combined with ADT in 38 % and 62 % of the cases, respectively.

Follow-up
Median follow-up of the 2493 patients was 50 months. Standard follow-up included PSA and DRE every 3months for 2 years, every 6-months until the fifth year and annually thereafter.
Time 0 was defined as the last day of EBRT for all patients and PSA failure according to Phoenix consensus definition (i.e. a rise by 2 ng/mL or more above the nadir PSA [13]). Clinical relapse was defined as a recurrence in the prostate bed, regional lymph nodes or distant metastasis shown by radiologic examinations (bone scan, choline-PET-CT, MRI, CT, ultrasound) or by physical examination or by biopsy. Systemic relapse was defined as a distant metastasis, including bone or other visceral organs, shown by radiologic examinations or by physical examination. Prostate cancer specific mortality was defined as death because of prostate cancer, checked by a physician through patients' case history reports, cancer regional registries and, if necessary, phone calls to the patient or to a close relative or General Practitioner of him (if the patient was dead).

Statistical analyses
A Cox regression time to PSA failure analysis was performed in univariate and multivariate settings, evaluating the predictive ability of age, pre-treatment PSA, clinicalradiological stage, GS and %PC; besides, the regression algorithm was adjusted for RT dose (as a continuous variable) and combined therapy (RT + ADT or RT alone, as a dichotomous variable). The assumptions of the Cox model were tested and met.
Effects cell coding (i.e. 1 or 0 or −1 coding) was applied to the stratified variables, in order to calculate the relative Hazard Ratios (HRs) of the multivariate analysis compared to the outcome of the mean of the cohort. The accuracy of the model was checked with bootstrapping statistics (2493 patients, resampling with 1000 cases each replication for a total of 10,000 replications).
A 360-cells-table was built by multiplying the HRs of the subgroups for all the variables' combinations; the following combined HRs were classified into 5 different risk classes for biochemical relapse: very-low-risk for HRs 0.17-0.30; low-risk for HRs 0.31-0.55; intermediaterisk for HRs 0.56-1.20; high-risk for HRs 1.21-2.40; and very-high-risk for HRs 2.41-6.60. In addition, an equivalent nomogram was built multiplying 100 by the beta coefficients associated to the HRs, normalizing each reference value of the variables to 0 points (i.e. age ≥70, PSA <7 ng/ml, stage cT1, bGS ≤6 and %PC 1-20 %), and summing up the values of the 5 prognostic factors into a scale ranging in between 0 and 363 total points, corresponding to the previously defined 5 risk classes.
Kaplan-Meier survival curves for bPFS, divided according to our 5-risk-classes and to D' Amico risk classes (for comparison), were graphed, overall and paired logrank tests were performed and Concordance Indexes calculated. In addition, D' Amico risk classification and our 5-groups classification tool were further analyzed for cPFS, sPFS and PCSS and their prediction performances compared.
All statistical analyses were performed with Stata SE 13.1 Software (®StataCorp, Texas, USA).

Results
The stage and treatment characteristics of the 2493 patients are listed in Table 1.
All variables evaluated, i.e. age, pre-treatment PSA, clinical-radiologic stage, bGS and %PC are highly significant predictors of biochemical relapse in both univariate and multivariate Cox models: all Ps are lower than 0.001 except for age, whose values are P = 0.001 and P = 0.019 in univariate and multivariate analyses, respectively ( Table 2). In particular, the risk of recurrence rises in younger patients and increases gradually with higher PSA, wider clinical-radiologic extension in/out prostate, higher bGS and a higher percentage of biopsy cores affected by cancer. Internal validation performed with bootstrapping shows a good reliability of the model as a whole: PSA and bGS remain highly significant (P < 0.001 and P = 0.012, respectively), %PC and clinical-radiologic stage are significant (P = 0.008 and P = 0.031), while age shows a trend but loses its statistical significance (P = 0.16; see Table 2 last column).
The 360-cells-table combining all the possible combinations of the stratified parameters clearly shows a strong trend, going from very-low risk (in blue) on the upper-left corner to very-high-risk (in red) in the lowerright corner; in between can be noticed low-risk (in green), intermediate-risk (in yellow) and high-risk (in orange, see Table 3 and Additional file 1: Table S2). Verylow-risk group includes 529 patients (21 %), low-risk 770 (31 %), intermediate risk 696 (28 %), high-risk 329 (13 %) and very-high risk 169 (7 %); full data on patients' distribution according to model parameters are illustrated in (Additional file 1: Table S3). Besides, the related Candiolo nomogram is displayed in Fig. 1.

Discussion
The Candiolo classifier differs significantly from the D' Amico risk classification. As far as pre-treatment PSA is concerned, D' Amico advises the cut-offs of 10 and 20 ng/ml, while our optimal boundaries are slightly lower: 7 and 15 ng/ml. Probably the difference relies in the cohort, our whole patients pertaining to post-PSA screening era; but also in the introduction of percentage of positive cores as a covariate, carrying a second information about cancer extension and therefore able to shift the best stratification cut-offs.
Concerning bGS, we definitely refined the traditional coarse classification in 3-classes (≤6, 7 or ≥8) dividing    , there is an almost significant time trend concerning total biopsic Gleason Score with bGS means increasing from 6.5 to 6.7, respectively (both medians are equal to 7, P = 0.06 at Wilcoxon-Mann-Whitney test). However, we think this trend does not affect substantially our nomogram because of three main reasons: the difference in absolute terms is quite small, the number of patients treated prior to ISUP 2005 Gleason Score revision is only 23 % of the total, and finally we reduced the under-grading influence in older cases (in which total bGS of 2-3-4 were much more frequent) by combining together all bGS ≤ 6 into a single risk category. Considering staging, the D'amico clinical staging, splitting patients according to their cT2 stage (≤cT2a, cT2b or ≥ cT2c) was replaced by a clinical and radiological staging taking advantage of radiologic information and dividing patients according to more objective criteria: cT1 microscopic cancer, cT2 macroscopic but intraprostatic cancer, cT3-4 macroscopic and extra-prostatic adenocarcinoma. Actually, Epstein [14] found, even if in a radical prostatectomy cohort, that pT2b staging almost doesn't exist and that data are conflicting about the power of pT2 disease sub-staging to further stratify patients. According to Algarra [15] MRI data optimize D' Amico risk groups in predicting bPFS after radical prostatectomy. Unfortunately, only 15 % of our patients were locally staged through MRI (we calculated with logistic regression a strong association between MRI staging and T3 with an OR = 6.98 and a P < 0.001). However, this relationship may be over-estimated because of a selection bias (MRI is performed usually in suspect of extra-prostatic extension) and it may be partially compensated by the widespread execution of CT and ultra-sound in 59 % and 49 % of our patients, respectively; precisely, the percentage of patients having neither CT nor MRI for loco-regional staging decreases in our cohort to 35 %.
In addition, we added %PC and age in the prognostic algorithm. The prognostic significance of tumor volume has been tested both in radical prostatectomy and needle biopsy specimens. Therefore, it is recommended to record at least the number of positive and total cores along with one other more detailed measurement such as the percent of the core involvement or length of cancer [14]. Besides, in 2007 Briganti [16] showed that %PC can improve the ability to predict lymph nodes invasion in patients undergoing radical prostatectomy and extended pelvic lymph node dissection: %PC and bGS were the most informative predictors of Lymph Node Involvement, overcoming PSA and stage. The addition of %PC to a nomogram including traditional risk factors was able to improve overall accuracy from 79.7 % to 83 % (P < 0.001). In the early 2000s, D'Amico clearly proved the independent value of positive prostate biopsies in predicting biochemical outcome after radical prostatectomy [4] or after EBRT [5]. The information   [17] further showed that %PC was superior to traditional risk factors in predicting prostate cancer specific mortality (PCSM) in patients with low or favorable intermediate-risk; the relative risk of PCSM for patients with %PC ≥ 50 % as compared with <50 % was 10.4. However, Kupka [18] reported in a cohort of 249 men treated with RP that even prostate cancer with single positive core is associated with considerable rates of over-grading for the pathological GS, pT2c-pT3 and positive surgical margin. Therefore, %PC should not be used alone to predict patients' prognosis after radical therapy, and treatment plan has always to take into account the traditional prognostic factors, too. Therefore, several reports and our analysis strongly support the statistical significance of %PC as an additional prognostic factor of prostate cancer recurrence.
Literature is more scarce concerning age, showing for older patients a lower rate of distant metastases or prostate cancer deaths, but not an advantage in biochemical relapse [6,7]. The finding is confirmed by our data, where the statistical significance of the variable age was the lowest between the selected prognostic factors of biochemical recurrence and failed the internal validation with bootstrapping. However, its relevance may be much more powerful for late end-points like sPFS and PCSS, and so this could be one of the determinant factors of the higher concordance indexes for these end-points by the Candiolo classifier (around 80 %). In fact, two main reasons can justify the worse prognosis of younger  [19]. It showed that patients older than 70 years have better outcomes compared with younger ones, and precisely that, at 90 months of follow-up, Overall Survival and Disease-Free Survival (biochemical and clinical) were 10 % and 16 % higher in older patients, respectively; besides, there was no significant difference in the distribution of pre-RT risk-classes between age groups. Moreover, the 70 years cut-off is clinically useful because next to the median and mean ages of our cohort, that are 73 and 71.7 years, respectively. Our five classes are statistically different at paired comparisons for bPFS, cPFS, sPFS, while for PCSS three main groups may be identified: very-low and low-risk together (summing up to 52 % of the overall cohort) with almost no dead at 10 years, intermediate and high-risk with a specific mortality at 10-yy around 10 % and veryhigh-risk alone with a 10-yy mortality of 30 %.
The mean prostate cancer specific survival in our cohort at 5 and 10 years are 97.5 % and 92.5 %, respectively. These high values may be justified by the intense therapy, performed both as a first-line treatment (62 % of the cases where treated with the combination RT + ADT) and/or as a salvage or palliative therapy after biochemical and/or clinical recurrence (ADT alone 82 %, chemotherapy 3 %, local therapies i.e. surgery, RT, HIFU and cryotherapy 6 %, ADT plus local therapy 3 %, none 6 %).
The main strengths of our study are: the large numerosity of our sample (2493 patients), widely distributed according to risk factors combinations (e.g. PSA from <1 to >100 ng/ml), and with complete staging and treatment information; the integration of five significant prognostic factors according to all their possible combinations; the five-risk classification instead of three-risk-classes with wider prognostic differences; the Proportional Hazard assumption of the Cox model fulfilled (see in the Additional file 1: Figure  S1); the internal validation with bootstrapping performed; the nomogram, developed with bPFS as setting end-point, was successfully applied to other three following outcomes, i.e. cPFS, sPFS and PCSS.
On the other hand, our study has several limitations, mainly related to its retrospective and multi-centric nature, and to the absence of an external validation. In particular, the biopsy schemes adopted by each center were heterogeneous and there wasn't any central pathologic review (even if by a decade Piedmont Oncology Web fosters pathology quality assessment on prostate cancer through guidelines and periodic meetings [20,21]). Furthermore, radiologic staging protocols changed in time with a low total number of MRI, as well as the RT prescription doses, fractionation schedules and the use of combined therapy with ADT; and median follow-up of 50 months is still short for a "slow killer" like prostate cancer. Besides, the model applied was a simple linear regression with no interaction factors considered: it allows the modelling to be simple and robust, but does not take into consideration more complex modelling. In addition, an external validation is needed as soon as possible.

Conclusions
In conclusion, our study develops a new risk classification tool, that we called the "Candiolo classifier", in a wide cohort of radio-treated patients at higher risk of relapse (46.5 % of our cases are in D' Amico high-risk group versus 21 % of D' Amico early 2000s studies [4,5]) treated with higher doses of RT (median dose of 76 Gy versus 70.2 Gy of the D' Amico studies) and treated with RT + ADT in 62 % of cases. Besides, we subdivided our patients into five different classes instead of three, with Concordance Indexes around 10 % higher in foreseeing bPFS, cPFS, sPFS and PCSS.
We would like to evidence some points about our 5-classes distribution. First, even rare 3 + 4 or 4 + 3 cancers with all other positive factors may be included in very-low-risk group. Second, a quite high number of patients (21 %) can be classified in verylow risk class with 100 % sPFS and PCSS at 10 years, and can be of consequence selected for watchful waiting approach too. Third, only a restricted number of patients (7 % of the total) belong to very-high-risk class, with 30 % of prostate cancer specific deaths at 10 years.
We further recommend the development of a nomogram integrating in this subset of patients also therapeutic information (RT dose and ADT) and the enhancement of a tailored follow-up schedule according to the risk of recurrence.