Introduction

Despite the declining incidence trend over the past decades, gastric cancer accounts for a great many cancer-related deaths and remains a considerable health burden globally [1,2,3]. The prognosis of early gastric cancer after radical surgery is favorable [4, 5]. However, survival outcomes for advanced gastric cancer are dismal [6]. An accurate and uniform staging system for cancer is essential for physicians to tailor appropriate therapeutic decisions, as well as to inform patients regarding their prognosis, and to evaluate different treatment options, thus facilitating continuing investigation [7,8,9].

Currently, the tumour-lymph node-metastasis (TNM) stage system based on the anatomic extent of malignancies remains the benchmark in clinical practice globally. The first edition of the TNM stage system was released by the Union for International Cancer Control (UICC) and American Joint Committee on Cancer (AJCC) in 1977 [10]. During the past 40 years, it has undergone several major revisions, and the current AJCC 7th edition for gastric cancer was published in 2004 [11]. Several important modifications (refinements of pT, pN, stage categories) were introduced into the 7th edition [12], and its validity has been demonstrated by previous studies [13,14,15]. However, the pN3b category was not incorporated into any stage subgroups in the 7th edition, and additional stage subgroups (stages IIB and IIIC) failed to improve predictive ability in stage-based prognosis [16].

In view of these shortcomings, the UICC/AJCC published the 8th edition stage system last year, and this new system will be implemented in 2018. Several small modifications have been incorporated into the new pathological stage classification, including inclusion of pN3b into the stage system and modifications of existing stage groupings [17]. These evidence-based revisions were based on the International Gastric Cancer Association (IGCA) staging project [16]. Patients in the IGCA project were mainly Japanese and Korean (84.8%), and Western cases accounted for only 8.8%. As is known, the incidence and prognosis of gastric cancer present with significant geographic differences [18, 19], which may be subject to selection bias and inevitably raise the concern whether the 8th edition system is appropriate and suitable for Western populations. Therefore, we conducted a population-based study to assess the prognostic performance of this new edition in the USA. In addition, we also wanted to examine whether the changes of the 8th edition resulted in improved discriminatory ability as compared to the 7th edition.

Methods

Study cohort

Data were obtained from the National Cancer Institute’s surveillance, epidemiology, and end results (http://seer.cancer.gov/) by SEER*Stat Software. We identified gastric cancer patients using the following inclusion criteria: patients with primary microscopically confirmed stomach cancer (primary site code, C16.1 through C16.9, exclusion of cardiac localization), aged ≥18 years, histology codes of adenocarcinoma (the International Classification of Disease for Oncology 3rd edition: M-8140/3, M-8142/3 to M-8145/3, M-8210/3, M-8211/3, M-8255/3, M-8260/3 to M-8263/3, M-8310/3, M-8323/3, M-8480/3, M-8481/3, M-8490/3), who underwent surgical resection (Surg Prim Site code, 30–80). The pT, pN, and pM categories of patients were redefined using information available in SEER about collaborative stage extension (2004+), Collaborative Stage (CS) regional nodes positive (1988 +), and CS metastases (mets) at distant sites (dx) (2004+), respectively. Stage subgroups were determined according to the 7th edition and 8th edition staging manuals. We included patients who were diagnosed between 2004 and 2011 to ensure at least 5 years of follow-up. The current study is considered exempt by the institutional review board of Sir Run Run Shaw hospital because all data are public.

Statistical analysis

Overall survival (OS) and cancer-specific survival (CSS) were compared by Kaplan–Meier estimates and examined by log-rank tests. Cox proportional hazards models were adopted to determine the risk factors of OS and CSS. Such variables as age, sex, year, race, marital status, tumour grade, radiation, surgery, chemotherapy, and number of lymph nodes examined were adjusted. The Harrell’s c-index (c-index) [20] was used to compare the relative discriminative abilities of the two staging systems. The c-index measures the general predictive power of a regression mode [21], and a value close to 1 signifies a perfect predictive ability. All analyses were two sided, and a value of P < 0.05 was considered statistically significant. All analyses were carried out with R version 3.4 (http://www.r-project.org) and STATA version 13.0 (Stata Corp, College Station, TX, USA).

Results

From 2004 to 2011, a total of 10,451 patients having undergone surgical resection of gastric cancer fulfilled the inclusion criteria. The median follow-up period of the included cohort was 8.7 years. Detailed characteristics of cases are summarized in Table 1. More than half the patients were male, married, and aged >65 years. One third of cases were located at the gastric antrum, and approximately two thirds of the tumors were poorly differentiated or undifferentiated. In the 8th edition system, most cases were classified into pT3 and pT4 (67.44%). Almost two thirds of patients had lymph node metastases, and at least 15 lymph nodes were examined in fewer than half the cases. In this study, stage III was the most common stage group.

Table 1 Clinicopathological features of patients with gastric cancer

In the 8th edition, both overall survival rate and cancer-specific survival rate decreased in a stepwise fashion with increasing pT and pN categories (Fig. 1). Multivariable Cox analysis also demonstrated that high risks of overall and cancer-specific death were associated with increasing pT and pN categories even after adjusting for other variables (Table 2), suggesting the well-established predication of pT and pN in OS and CSS. The principal modification of the AJCC 8th edition is the separation of the pN3b from the pN3 category and the incorporation of pN3b into IIIB/IIIC stage groups. Significant OS and CSS differences between pN3a and pN3b were also noted in current studies (Supplemental Fig. 1; log-rank test, P < 0.01). It is commonly acknowledged that the number of lymph nodes examined could affect the accuracy of pN staging [22]. Thus, we further examined survival differences in patients for whom more than 15 lymph nodes were examined. In OS and CSS Kaplan–Meier curves, the distance between curves became wider, reflecting the better stratification of mortality risks in patients with more than 15 lymph nodes examined. To specifically evaluate the prognostic utility of pN3a and pN3b, subgroup analyses stratified by pT category were conducted. As the number of patients was limited (N = 38) in the pT1 group, no significant difference between pN3a and pN3b was found. In patients with pT3/pT4a tumours, pN3b nodal status was associated with worse OS and CSS (all P < 0.01). However, survival difference between pN3a and pN3b in pT2 and pT4b groups became nonsignificant (all P > 0.05; Fig. 2, Supplemental Fig. 2). Survival differences between pN3a and pN3b appeared to be influenced by the pT category. Surprisingly, for cases for which more than 15 lymph nodes were examined, significant differences between pN3a and pN3b were also observed, even in the pT2 and pT4b subgroups (data not shown). This result suggested to us that a recommendation of a minimum of 16 examined lymph nodes could improve the prognostication role of pN3a and pN3b.

Fig. 1
figure 1

Kaplan–Meier survival curves of overall and cancer-specific survival of gastric cancer according to pT and pN categories in the American Joint Committee on Cancer (AJCC) 8th edition. a Overall survival stratified by pT category. b Overall survival stratified by pN category. c Cancer-specific survival stratified by pT category. d Cancer-specific survival stratified by pN category

Table 2 Multivariate Cox analysis of T, N, and stage classification on overall and cancer-specific survival in AJCC 8th edition staging system
Fig. 2
figure 2

Comparison of overall survival between N3a and N3b groups stratified by pT category in the AJCC 8th edition. a T1 group. b T2 group. c T3 group. d T4a group. e T4b group. P was calculated by log-rank test

Subsequently, we calculated the short-term (1-year) and long-term (5-year) stage-related survival rates of the two edition systems. As Fig. 3 shows, both the 7th and 8th editions provided good stratification of OS and CSS. Comparing stages IIIA, IIIB, and IIIC in the ASCC 7th edition with the ASCC 8th edition, patients with the same stages had worse OS and CSS. Multivariable Cox analysis also determined the prognostic role of the AJCC 8th stage system. A more advanced stage indicated markedly higher risks of overall and cancer-specific death (Table 2) (stage IA as reference, OS, IB, HR 1.63; IIA, HR 2.56; IIB, HR 4.01; IIIA, HR 5.82; IIIB, HR 8.42; IIIC, HR 13.67; IV, HR 13.07, all P < 0.001; CSS, IB, HR 2.01; IIA, HR 3.98; IIB, HR 7.23; IIIA, HR 10.95; IIIB, HR 16.36; IIIC, HR 27.01; IV, HR 26.32; all P < 0.001.) Similar results were also observed in the AJCC 7th edition (Supplemental Table 2). The OS and CSS c-indices for the 8th edition were 0.701 and 0.736, which are largely comparable to those of the AJCC 7th edition (c-indices: 0.701, 0.736). For patients for whom more than 15 lymph nodes were examined, however, OS and CSS c-indices for the AJCC 8th were 0.717 and 0.744, which were higher than those for the AJCC 7th edition system (c-indices: 0.712 and 0.739). For cases in which fewer than 15 lymph nodes were examined, the OS and CSS c-indices were similar as compared between the 7th and 8th edition systems (data not shown). These findings suggested that better survival stratification of the new staging system might be independent of the number of lymph nodes examined.

Fig. 3
figure 3

Kaplan–Meier survival curves of overall and cancer-specific survival according to the AJCC 7th and 8th edition stage system. a Overall survival according to the 7th edition. b Overall survival according to the 8th edition. c Cancer-specific survival according to the 7th edition. d Cancer-specific survival according to the 8th edition

Finally, we investigated stage migration between the 7th and 8th editions. Supplemental Tables 3 and 4 summarize the main changes and distribution of patients between the two editions. Most modifications occurred in stage III groups relative to their counterparts in the AJCC 7th edition. Overall, downstaging was observed in 12.6% of cases, whereas 2.3% of patients were upstaged. Compared with the 7th edition, the percentage of cases in stage IIIA increased (from 11.1% to 16.9%), whereas the proportion of patients in both stage IIIB and IIIC decreased in the 8th edition (IIIB, from 15.1% to 13.9%; IIIC, from 12.1% to 7.5%). As Supplemental Table 4 shows, patients with T1N3bM0 (AJCC 7th IIB) and T2N3bM0 (AJCC 7th IIIA) shifted to advanced-stage IIIB in the AJCC 8th, whereas cases with T3bN3bM0 (AJCC 7th IIIB) shifted to stage IIIC in the AJCC 8th. Patients in AJCC 7th edition stage IIIB (T4bN0M0, T4aN2M0) and IIIC (T4bN2M0, T4aN3aM0) generally dropped to earlier stages in the AJCC 8th edition (IIIA and IIIB). Furthermore, we analyzed survival differences of altered staging groups. As Fig. 4 and Supplemental Fig. 3 show, OS and CSS in patients with T1N3bM0, T4bN0M0, and T4bN2M0 were similar to that in the new staging categories in the AJCC 8th edition, which supported the rationality of the new staging system. However, other revised staging groups were not validated in the current study. Significant survival differences were noted between T4aN2M0 and stage IIIA (AJCC 8th), which were the opposite of the new stage system.

Fig. 4
figure 4

Comparison of overall survival among groups with stage migration in the AJCC 8th edition. a T1N3b shifts from IIB to IIIB: P1 for T1N3b and IIB, P2 for T1N3b and IIIB. b T2N3b shifts from IIIA to IIIB: P1 for T2N3b and IIIA, P2 for T2N3b and IIIB. c T3N3b shifts from IIIB to IIIC: P1 for T3N3b and IIIB, P2 for T3N3b and IIIC. d T4aN2 shifts from IIIB to IIIA: P1 for T4aN2 and IIIB, P2 for T4aN2 and IIIA. e T4aN3a shifts from IIIC to IIIB: P1 for T4aN3a and IIIB, P2 for T4aN3a and IIIC. f T4bN0 shifts from IIIB to IIIA: P1 for T4bN0 and IIIB, P2 for T4bN0 and IIIA. T4bN2 shifts from IIIC to IIIB: P1 for T4bN2 and IIIB, P2 for T4bN2 and IIIC

Discussion

Based on the population-based cohort of gastric cancer patients from the USA, we observed that the AJCC 8th edition effectively predicted overall and cancer-specific survival outcomes. The principal revision of the new edition was incorporation of pN3b as a separate group in the staging system. Lymph node metastasis was considered as an important prognostic factor for gastric cancer [23,24,25,26]. Although the optimal number of lymph nodes required for examination remains controversial, it is traditionally recommended that at least 16 lymph nodes should be examined during the resection of gastric cancer, especially in Asia [27,28,29,30]. Examination of an adequate number of lymph nodes is required for accurate staging, and insufficient lymph node sampling might be associated with a risk of understaging. An increasing number of surgeons in the Western world are also aware of the prognostic significance of the number of lymph nodes removed. Previously, D2 lymphadenectomy was not considered as the standard procedure in Western countries [31, 32], in contrast to the Eastern counterparts. Several randomized clinical trials (RCTs) reported from Western countries demonstrated that D2 lymphadenectomy associated with high perioperative complications and mortality did not provide additional survival benefits [33,34,35]. However, with improved surgical skills and management of complications, the D2 procedure has been proved to be safe even in Western countries when spleen and pancreas are preserved [36, 37]. Recent RCTs [36, 38] also showed a survival benefit derived from the D2 procedure. In a recent analysis based on the SEER database, Arsoniadis et al. [39] reported that the percentage of patients with more than 15 lymph nodes examined has been consistently increasing and exceeded 50% in 2008–2013. It was speculated that adequate lymph node examination might be associated with decreased mortality [39]. Another study from the US Gastric Cancer Collaborative [40] concluded that the number of lymph nodes removed could be considered as an important predictor for long-term prognosis. In keeping with the previous study [16], pN3b was associated with worse survival outcomes in relationship to pN3a in the current study. This trend was even more obvious in patients from whom more than 15 lymph nodes were examined, suggesting that the introduction of pN3a and pN3b as separate groups in staging had significant clinical value.

Compared with the AJCC 7th edition, only stages IA, IB, IIA, and IV remain unchanged in the new edition, with most changes occurring in stage III. The largest subgroup of stage III in the AJCC 8th is IIIA, whereas the largest subgroup in the AJCC 7th was IIIB. In the current study, we validated the staging migration of T1N3bM0, T4bN0M0, and T4bN2M0. Justification for other staging changes might require further investigation. Comparable OS and CSS c-indices in the entire cohort indicated that the AJCC 8th performed as well as the AJCC 7th edition stage system in the USA. Importantly, it was noteworthy that the new edition system provided a slight improvement in prognostic discrimination for patients for whom more than 15 lymph nodes were examined, which indicated the significance of examining an adequate number of lymph nodes. The lack of modest improvement in the discriminatory ability of the AJCC 8th relative to the AJCC 7th version in the Western cohort might be the result of examining an insufficient number of lymph nodes.

Several limitations of our study also should be mentioned. First, inherent selection biases were inevitable in the current study. Also, the SEER dataset was unable to provide detailed surgery procedures and postoperative treatment, which might affect the prognosis. Second, we were unable to determine whether patients received neoadjuvant therapy that might affect pathological judgment for staging. Third, for more than half the patients fewer than 15 lymph nodes were examined, which might be associated with the risk of pN being understaged. When we included those populations, we observed an improvement in prognostic discrimination of the new stage system.

In summary, the AJCC 8th edition stage system refines risk stratification according to the pN3b and provides modest to good survival stratification for gastric cancer in the Western cohort as well as does the 7th edition system. Importantly, an improvement in discriminatory performance of the new edition was observed for cases in which more than 15 lymph nodes were examined.