Introduction

Breast cancer (BC) is the most frequent tumor and the first cause of cancer death among European women in the recent years [1]. In Spain, it has been estimated that 28,000 new BC cases were diagnosed in 2015, with an incidence rate (adjusted for the European population) of 88.3 cases per 100,000 women-years, that leads to an intermediate position in Europe [2]. It is estimated that 1 in 9 Spanish women will develop BC throughout their life [3], with a mean age at diagnosis of 60 years, although BC incidence rates have shown a downturn since 2001 among Spanish women over 45 [4].

Improvements in 5-year BC survival have been observed [5,6,7,8] in parallel with a decline in the risk of BC mortality specially marked in women under 50 years of age [9]. Since life expectancy can lengthen by several decades in young BC patients [10], to provide long-term survival estimates is called for. To our knowledge, population-based survival studies in Spain have been provided up to 5-year survival estimates [5,6,7, 11], but not beyond this follow-up. It is of crucial interest the assessment of the most recent survival estimates, since several international studies have suggested a non-decreasing excess mortality beyond 10 years after BC diagnosis [12,13,14,15,16,17,18].

In a previous study, we provided long-term estimates of the survival probabilities of BC patients diagnosed before 50 years by stage and period of diagnosis using a small cohort (N = 998) from the population-based cancer registry of Girona [7]. In the present study, we provide survival indicators of BC patients based on a larger cohort of N = 10,195 women from two population-based cancer registries, Girona and Tarragona, taking into account the age and BC stage and period of diagnosis. Our aim is in quantifying the contribution of BC to overall mortality, assessing the long-term crude probabilities of death, by age and stage, due to BC and other causes among patients diagnosed before 84 years.

Materials and methods

Data

BC data was obtained from the population-based cancer registries of Girona and Tarragona, which cover a population of 771,854 women (2011 Catalonia census) [19]. Each woman with BC in these provinces has been followed-up to December 31st, 2014. In addition to the active and passive follow-up via hospitals, two passive follow-ups were performed using record linkage: one linking BC data with the Catalonian Mortality Registry (which covers the four Catalan provinces, Girona, Tarragona, Lleida and Barcelona) and another linking data with the National Death Index of the Spanish Ministry of Health. The patients not found to be dead at the end of follow-up were considered as censored.

We included a total of N = 10,195 female patients aged 15–84 years and diagnosed with invasive primary BC (codes 174 and C50 of the 9th and 10th editions of the International Classification of Diseases, ICD-9 and ICD-10, respectively) during the time periods 1985–1994 (N = 4211) and 1995–2004 (N = 5984). From these, data on the stage at diagnosis was extracted through medical records review. Variables considered for the analysis were age, partitioned into ≤ 49, (49–59), (59–74) and (74–84), and stage at diagnosis. Stage classification was based on the TNM classification system unified to the 5th edition of the American Joint Committee on cancer staging manual [20] to compare the survival indicators by stage between 1985–1994 and 1995–2004. Patients were classified as stage I, II, III and IV when staging was available at the moment of diagnosis, and missing stage otherwise.

Statistical analyses

We calculated the observed survival (OS), and the probabilities of death due to BC (PBC) and other causes (POC) by age and stage at diagnosis at 5, 10, 15 and 20 years of follow-up and we compared these three probabilities between time periods at diagnosis 1985–1994 and 1995–2004 and across age groups. Since life expectancy among Catalan women is 90 years [19], we did not estimate these probabilities beyond 5 years of follow-up for patients aged between 74 and 84 years at diagnosis. In the same line, we did not estimate these probabilities beyond 15 years of follow-up for patients aged between 59 and 74 years at diagnosis.

In brief, up to year T, P(t ≤ T) is the cumulative probability to die from any cause in the cohort, whereas OS(T > t) is the cumulative probability to survive in the cohort beyond the year T, where OS(T > t) + P(t ≤ T) = 1. The P(t ≤ T) can be estimated as a sum of two probabilities: PBC(t ≤ T) is the estimated cumulative probability to die due to the disease under study, BC in our case, and POC(t ≤ T), the estimated cumulative probability to die due to causes other than the cancer of interest [21]. These probabilities can be estimated making use of the excess hazard of death [22, 23] under a competing risk’s approach to survival, as described in a previous study [7]. This approach allows providing estimates of PBC and POC without knowing the exact cause of death [7, 22, 23], where it is only needed to know if the patient dies or not because of any cause at the end of study. Therefore, since OS(t > T) + PBC(t ≤ T) + POC(t ≤ T) = 1, we will present these probabilities depicted in a survival graph in the results section. We derived their corresponding 2.5, and 97.5% percentiles using the R-library relsurv [23].

Since the closing date of follow-up was December 31st, 2014, for patients diagnosed during 1995–2004, we estimated the probabilities of interest beyond 10 years of follow-up using different cohorts, as performed in the previous study [7]. Therefore, 11-year follow-up was estimated using the patients diagnosed during the period 1995–2003, whereas 12-year follow-up was estimated using the patients diagnosed during the period 1995–2002 and so on up to 19 years. We used the cohort of patients diagnosed in 1994 to estimate these probabilities at 20-year follow-up.

First we assessed up to 20-year survival estimates between periods of diagnosis across age groups. Second, we compared 10-year OS between 1985–1994 and 1995–2004 across stage and age groups. A Cox proportional hazards model was used to assess hazard ratios of all-cause mortality by age, stage and period of diagnosis. Finally, since stage information was available in 86.2% of the patients diagnosed during 1995–2004, 10-year estimates of the probabilities of interest provide the most recent long-term survival indicators by age and stage in Girona and Tarragona.

The Supplementary material file presents additional tables, figures and an extension of the statistical methods.

Results

Comparison of long-term survival between 1985–1994 and 1995–2004 by age groups

Comparing the time periods 1985–1994 and 1995–2004, there were no significant differences in the age distribution of the patients considered in the cancer registries, see Table 1. Table S1 in the supplementary material presents survival indicators and crude probabilities of death by period of diagnosis across specific age groups. Figure 1 depicts the PBC, POC and the OS comparing survival indicators by period of diagnosis in women ≤ 49 and > 49. This figure shows the improvement in OS in parallel with the reduction in the PBC, whereas POC did not change between diagnostic periods. Worth noting is that, among women ≤ 49, 20-year OS for women was 55% (PBC = 42%) if they were diagnosed during 1985–1994, whereas based on our estimations, it could reach 68% (PBC = 29%) if they were diagnosed during 1995–2004.

Table 1 Age-group distribution of the breast cancer patients included in the study during the time periods 1985–1994 and 1995–2004
Fig. 1
figure 1

Long-term survival of breast cancer patients diagnosed in Girona and Tarragona before 85 years of age: a diagnosed ≤ 49 years during 1985–1994; b diagnosed ≤ 49 years during 1995–2004; c diagnosed > 49 years during 1985–1994; d diagnosed > 49 years during 1995–2004. *Since complete follow-up until December 31st, 2014 for patients diagnosed between 1995–2004 was 10 years, follow-up beyond 10 years was estimated using the cohorts diagnosed between 1995–1993 (11-year probabilities), 1995–1992 (12-year probabilities) and so on

Comparing 10-year probabilities of death due to BC and OC by period of diagnosis and age group (Fig. 2), differences in the PBC between 1985–1994 and 1995–2004 were 11% among women ≤ 49 (1985–1994: 33%;1995–2004: 22%), 17% among women (50–59) (1985–1994: 37%; 1995–2004: 20%), and 14% among women (60–74) (1985–1994: 37%; 1995–2004:23%), and 10% among women aged (74–84). PBC surpassed POC in all age groups except in the oldest age group, where confidence intervals of POC overlap those of PBC beyond 5 years of follow-up. Beyond 10 years of follow-up (Table S1), the largest differences in PBC between periods could have reached 12% among women aged (59–74) at 15 years (PBC 41% in 1985–1994; 29% in 1995–2004) and 20% among women aged (49–59) at 20 years (48% in 1985–1994; 28% in 1995–2004). Differences in the OS between diagnostic periods are similar to those found in the PBC.

Fig. 2
figure 2

Comparison of the 10-year probabilities of death due to breast cancer and other causes between diagnostic periods 1985–1994 and 1995–2004 by age groups: a ≤ 49 years; b (50–59) years; c (60–74) years; d (75–84) years. Shaded bands represent 95% confidence intervals

Comparison of observed survival by age–stage between diagnostic periods

Table 2 presents the stage distribution across age groups by period of diagnosis. Stage was available in 20.6% (N = 867) of the patients diagnosed during the period 1985–1994 and in 86.2% (N = 5160) in those patients diagnosed during 1995–2004. Among patients with staging information in 1985–1994, there was a higher percentage of patients diagnosed in Stage I during 1995–2004 (34.6%) than during 1985–1994 (26.5%) and a higher percentage of patients diagnosed in stage IV (13.3 vs 6.8%) than during 1995–2004 (6.8%). Differences in the mean age at diagnosis were also assessed and not found to be statistically significant. We also compared 10-year OS between time periods across stages: I) hazard rates in periods 1985–1994 and 1995–2004 were of similar magnitude by age and stage (Table 3), but showing that patients diagnosed during 1995–2004 had 19% reduction in risk of death compared to those diagnosed in 1985–2004; II) we found significant differences in survival between periods in stage I patients (Supplementary material: Figure S1 Wilcoxon test p < 0.001; Table S2 10-year OS in 1985–1994: 85%; 1995–2004: 89%) and in stage IV (Wilcoxon test p < 0.05; 10-year OS in 1985–1994: 2%; 1995–2004: 7%).

Table 2 Stage-distribution and mean age at diagnosis within stage between periods 1985–1994 and 1995–1994
Table 3 All-cause mortality adjusted by age and stage using a Cox model

Age-stage observed survival and crude probabilities of death for patients diagnosed during 1995–2004

Table 4 presents the 5-year and 10-year OS and the crude probabilities of death for the cohort of patients diagnosed during 1995–2004 by age and stage at diagnosis. Differences in 5-year OS switched from 7 to 11% comparing stages I and II, and from 13 to 22% comparing stages II and III. We note that the PBC switched from 1 to 3% in stage I, from 9 to 12% in stage II and from 23 to 34% in stage III. The PBC in stage IV patients was estimated to surpass 70% in all age groups, since these patients showed that less than 29% and less than 20% could survive more than 5 years among those diagnosed ≤ 49 years and beyond 49 years, respectively. These figures must be interpreted with caution due to the small sample size (less than 30 patients at risk).

Table 4 Age-specific 5- and 10-year observed survival and crude probabilities of death due to cancer and to other causes among women diagnosed with breast cancer in Girona and Tarragona during the period 1995–2004: results stratified by stage at diagnosis

Similar gradient of differences was found in 10-year OS: 13–18% comparing stages I and II, and 14–23% comparing stages II and III. The 10-year PBC in 1995–2004 switched from 3% (ages (49–59]) to 6% (≤ 49) in stage I, 18% (49–59) to 20% in stage II and from 34% (≤ 49) to 46% (59–74) in stage III. In stage IV patients, PBC surpassed 88% in all age groups and a very small sample size (less than 10 patients) could reach 10-year follow-up.

Discussion

We have estimated that risk of death among BC patients was reduced by 19% in 1995–2004 compared to 1985–1994 and the improvement in 10-year survival between the two periods of diagnosis was mainly due to the decrease in the PBC, since the POC did not change between these time periods. We have estimated that the difference in the PBC between diagnostic periods may continue or even increase beyond 10 years of follow-up among women diagnosed before 60 years. These survival improvements between periods of diagnoses could be due to better survival prospects for stage I and stage IV patients in 1995–2004 compared to 1985–1994. The most recent estimates of 10-year PBC among BC patients could switch from 3 to 6% in stage I, 18 to 20% in stage II and 34 to 46% in stage III, whereas PBC surpasses 70% in stage IV beyond 5 years of diagnoses.

Our approach has several strengths, in comparison with previous survival studies. First, we provide estimates of the crude probabilities of death due to BC and other causes using population-based cancer registry data from a Spanish cohort of BC patients, and, to date, this is the first study of this type carried out in Spain. These probabilities can be used as indicators to assess the improvement in overall OS and in quantifying the contribution of the disease to overall mortality [22]. Our approach makes a major difference with relative survival. Relative survival is used as population-based cancer survival indicator and as estimator for the net survival, but relative survival is not a survival probability measure [7, 21, 22]. Second, we provided the most recent estimates of the 10-year figures of these probabilities, where our study had 86.2% completeness of this information in 1995–2004.

It is worth noting the strength limitation related with staging classification between diagnostic periods in our study, which was based on the 5th edition of the AJCC staging manual [20]. Since stage migration due to different classifications could artifactually inflate cancer survival rates by shifting patients with better prognosis group into a worse prognosis group [24], we used unified staging classification to minimize this bias. However, there was a limitation related with survival outcomes in the classification of patients making use of the AJCC fifth edition, since differences in survival among BC cases classified into stage IV could exist but not detected using this classification. The sixth edition of the AJCC has further amended the staging classification of patients with supraclavicular metastases at diagnosis to include them into the IIIC category [25], since evidence suggests that these patients had similar outcomes than stage IIIB patients and even better outcomes than patients with visceral stage IV disease [26]. There are other limitations to be noted in the survival comparison between periods of diagnosis. Survival estimates beyond 10 year for patients diagnosed during 1995–2004 were estimated using different subcohorts of patients during this time period. Second, availability of stage information in 1985–1994 was limited to N = 867 patients and probability estimates drawn from stage indicators in this time period could not be robust. These could also be biased, since patients with stage information in 1985–1994 had better survival than patients without this information, and this may lead to an underestimation of the survival differences between time periods (Supplementary material: Table S3).

In our study, PBC substantially decreased beyond 1994, but significant excess mortality may remain beyond 10 years after diagnosis. This is in agreement with recent long-term survival studies [7, 18, 27]. Improvements in BC survival between 1985–1994 and 1995–2004 in Spain could be attributable to changes due to BC screening, active in Girona and Tarragona since 1998–1999 [3], management and treatment [6]. In this line, we detected a 19% of risk of death reduction for BC patients diagnosed beyond 1995. Since BC screening is related to lead-time bias, it may increase the number of BC diagnosed at early stages (I) and decrease that number at advanced stages (IV). However, it is also difficult to quantify the contribution of BC screening in the mortality reduction without information about the diagnostic method, clinical or by screening, of tumor [28,29,30].

The introduction of the screening program may imply a reduction in advanced invasive cancers and then patient’s risk of mastectomy or chemotherapy for breast cancer, or side effects of these, may decrease [31]. In 1994, when most of the screening in Catalonia was opportunistic, rates of screening mammography switched from 27% (women aged 50–64) to 43% (women aged 40–49), whereas in 2004, 61.2% of the invited women participated in the national BC screening program and 75.7% either participated in that program or reported that they had received recent mammograms [30]. Overdiagnosis caused by the screening [30] may also be related with the improvement in survival prospects between periods. The influence of a potential stage migration (Will Rogers phenomenon) is a factor that could have influenced changes in survival by stages but not in the global survival [24]. On the other hand, the contribution in the reduction of BC mortality by treatment improvements is also difficult to quantify, since we found that hazard ratios by age and stage were of similar magnitude between diagnostic periods.

Our probability estimates are based in a competing’ risks modeling for which we attribute the excess risk of death of our patients to BC. This can be seen as the death probabilities that the cancer patients could “expect to have” in the hypothetical scenario where cancer patients have the same death risks as the general population [7]. If we compare the 10-year PBC and POC estimates with the cause-specific proportion of deaths obtained making use of the death certificates (Supplementary material: Table S4) considering all age groups, the difference between PBC and the proportion of deaths due to BC is 1.8% in both periods. In this line, the difference between POC and the proportion of deaths due to other causes is also small: − 1.9% in 1985–1994 and − 1.8% in 1995–2004. In our study, we associated these improvements with the decrease in the PBC between periods, since we did not find significant differences in the 10-year POC between time periods.

Future research will continue updating the follow-up up to 20 years for the most recent cohort. On the other hand, the data available did not allow us to investigate survival by molecular subtype of breast cancer (e.g., luminal A or B, triple negative, HER2), but evidence suggests that the prognostic value of molecular subtype persists when adjusting for age, stage and histological grade, and certain subtypes may metastasized even when the tumor itself is localized [32]. However, this information will be retrieved for the cohort diagnosed beyond 2000, since hormone’s evaluation and HER2 status was not the standard of care in Spain before this year.

Conclusion

First, our analysis suggests that women diagnosed with BC have higher PBC than POC during the first 10 years after diagnosis. The reduction of 19% in risk of death beyond 1995 was markedly due to the reduction in the PBC, since the POC did not change between periods. Access to BC screening and the improvements in treatment have had a positive impact on survival prospects of BC patients. To continue the improvement in long-term survival rates of BC patients, continued efforts should be pursued in (1) increasing adherence to screening programmes with better quality and in (2) the introduction of therapeutic innovations.