Introduction

We recently reported the new concepts of metabolic nodal stage (mN) and response (mNR) of esophageal and gastro-esophageal junction (GEJ) cancer to neoadjuvant chemotherapy (NAC) [1, 2]. We found that patients with FDG-avid nodes within a standard lymphadenectomy field on 18F-fluorodeoxyglucose (FDG) positron emission tomography-computed tomography (PET-CT) that persisted despite NAC (i.e. lacking a complete metabolic nodal response, CMR) were more likely to suffer disease progression during NAC which prevented resection (i.e., distant metastases, or locally unresectable disease). Furthermore, if resection was performed successfully, these patients were more likely to suffer disease recurrence. This was independent of pathological stage, and both pathological and metabolic regression of the primary tumor [1, 2], suggesting mNR to be a surrogate of regression of nodal metastases to NAC. Indeed, in some patients notable disparity was seen between primary and nodal response, suggesting mNR might represent a surrogate of the chemosensitivity of different clones, and perhaps by extrapolation occult distant metastatic clones.

mNR therefore represents a potentially novel and actionable biomarker of the success both of NAC and radical lymphadenectomy, for which there is an urgent need [3,4,5,6]. Such a marker might have the potential to monitor response to NAC and perhaps personalize therapy, while also better prognosticating before surgery, and stratifying the risk of recurrence thereafter. To our knowledge, no studies have subsequently assessed mNR as a marker of disease progression and recurrence. We aimed to perform the first validation study of mNR as a marker of disease progression and recurrence, using a temporal cohort of patients immediately consecutive to our discovery cohort.

Methods

Study design and approval

We routinely stage and restage patients with esophageal or GEJ cancer using 18F-FDG PET-CT, and retrospectively identified all such patients receiving NAC from January 2014 to June 2017. The study was approved by the Oxford University Hospitals NHS Foundation Trust Research and Development committee.

Inclusion and exclusion criteria

We included all patients with histologically confirmed cancer of the esophagus or GEJ, staged in our institution with 18F-FDG PET-CT, undergoing NAC before being restaged with 18F-FDG PET-CT. We excluded patients treated with neoadjuvant chemoradiotherapy, as well as those undergoing PET-CT in other centers.

Staging and restaging

All examinations were reported by subspecialist consultant gastrointestinal radiologists and reviewed at a specialist esophagogastric cancer multidisciplinary team (MDT) meeting, using the TNM 7th edition [7]. Patients underwent intravenous and oral contrast-enhanced CT chest abdomen and pelvis, as previously reported [1, 2]. In the absence of unequivocal incurable disease, patients then underwent PET-CT (see below), followed by laparoscopy (without peritoneal cytology) for tumors extending below the diaphragm. Endoscopic ultrasound (EUS) was performed selectively. Examinations were reported. Patients were restaged 4–6 weeks after NAC.

PET-CT was performed from January 2014 using a General Electric (GE) Discovery 690 64-slice system, then also a GE Discovery 710 from September 2014 (both 90 min post 4 MBq/Kg FDG), all using ordered subset expectation maximization (OSEM) reconstruction; after 1 November 2014, a Bayesian penalized likelihood (BPL) reconstruction technique was used (Q.Clear) without intravenous contrast. Any staging studies initially reconstructed with OSEM and undergoing a subsequent restaging examination with BPL had their baseline studies reconstructed with BPL to ensure validity. Examinations were independently reported by two dedicated PET-CT radiologists, dual trained in nuclear medicine and clinical radiology.

Data

PET-CT data comprised primary tumor maximum standardized uptake value (SUVmax), the presence and number of FDG-avid nodes (mN, as previously defined: mN1 1–2 avid nodes, mN2 > 2), in both cases, a target SUVmax greater than mediastinal blood pool being avid [1]; restaging SUVmax (used to calculate metabolic tumor response [mTR], % reduction SUVmax normalized for background SUVmax [2.5], and PERCIST classification [8]); metastatic disease; mNR as previously defined (complete metabolic response [CMR] to background mediastinal blood pool avidity, partial metabolic response [PMR; reduction in mN stage or % reduction in nodal SUVmax > 30%], stable metabolic disease [SMD; no change in mN stage, or nodal SUVmax < 30%], and progressive metabolic disease [PMD; progression in mN stage, or increase nodal SUVmax > 30%]).

Pathological data comprised pathological stage [7], and primary tumor regression (Mandard tumor regression grade 1–3 being response, 4–5 no response [2, 9]). Survival data comprised recurrence and death (censored on 23 March 2018). Disease progression to incurable disease during NAC was defined as either metastatic disease on imaging (confirmed by MDT discussion ± confirmatory imaging/histopathology) or unresectable disease at surgery (confirmed intra-operatively by frozen section histopathology, or consensus of two consultant surgeons).

Neoadjuvant chemotherapy

Patients received a variety of regimens based on MDT consensus and trial enrolment: either 2 cycles of doublet therapy (cisplatin/oxaliplatin and 5-fluorouracil/capecitabine) or 3 or 4 cycles of pre- ± post-operative triplet therapy (epirubicin, cisplatin/oxaliplatin and 5-fluorouracil/capecitabine) as previously described [10,11,12,13,14].

Surgery

Esophagectomy (for esophageal or GEJ type I–II cancers) or extended total gastrectomy (types II–III) was performed typically 2 weeks after restaging, via left thoracoabdominal, Ivor-Lewis (right thoracotomy plus laparotomy/laparoscopy), and three-stage approaches (laparotomy/laparoscopy, thoracotomy plus neck dissection). A minimum two-field lymphadenectomy was performed; for mid/distal esophageal/type 1 GEJ tumors, this included subcarinal, aortopulmonary window, and left gastric nodes, and for distal/GEJ tumors, left gastric and common hepatic artery nodes.

Follow-up

Patients were reviewed clinically at 2 and 6 weeks after surgery, 3 months to 1 year and 6–12 months thereafter for at least 5 years. Investigations for recurrence were performed only on the basis of clinical suspicion, typically CT or PET-CT for extra-luminal recurrence, and endoscopy for luminal.

Statistical analysis

Analysis was performed using R v3.0.2 [15]. Groups were compared using Fisher’s exact test. Univariate binary logistic and Cox regression analysis were performed using all individual data fields. Multivariate binary logistic and Cox regression were performed using variables with p < 0.1 on univariate analysis for inclusion in the final multivariate model. The sole patient with an adenosquamous carcinoma was excluded from univariate analysis of cell type, being a perfect separator. The final multivariate binary logistic regression models for disease progression comprised mTR and restaging mN; for unresectable disease at surgery, restaging mN and pre-treatment grade of differentiation; and for disease-progression overall, restaging mN and clinical N stage. The final multivariate Cox models for survival comprised mNR, ypT, ypN, ypV (except for overall survival, as p > 0.1 on univariate analysis), ypL, resection margin status, pTR, and mTR. For the model assessing a pNmN hybrid, the models were as above, having excluded mNR and ypN due to multicollinearity. The final multivariate binary logistic regression model for early recurrence/death comprised of mnR, ypT, ypN, ypL, resection margin status, pTR, and mTR. Similarly, the model assessing a pNmN hybrid excluded mNR and ypN. Survival metrics were calculated using the Kaplan-Meier method, and follow-up using the reverse Kaplan-Meier method, excluding patients dying in hospital due to post-operative complications. Kaplan-Meier curves were generated using survfit (Survival Package v2.42-3) using the log-rank test.

Results

mN stage and response

Two hundred patients were included (Table 1). Eighty-one (40.5%) had avid nodes before NAC. Thirty-eight (46.9%) underwent CMR. Eight (4.00% overall) previously mN0 patients developed avid nodes. In total, 51 patients (25.5%) had avid nodes following NAC. This was consistent with the development cohort: 64/280 (22.9%; p = 0.617). PET reconstruction algorithms did not influence mNR, mTR, and detection of metastatic disease (Supplementary Table 1).

Table 1 Patients

Progression to incurable disease on restaging PET-CT

Incurable disease was identified in 10 (5.00%) patients. No pre-treatment factors were associated with this. On multivariate binary logistic regression, both the presence of avid nodes at restaging (mN stage 2) and lack of mTR (% reduction in primary tumor SUVmax) were associated with coexistent metastatic disease (Table 2). Five of 51 (9.80%) patients with avid nodes after NAC had metastatic disease, compared with 5 of 149 (3.36%) without (p = 0.127).

Table 2 Disease progression: multivariate binary logistic regression

Progression to unresectable disease at surgery

A further 2 patients without metastases evident on restaging PET-CT did not proceed to surgery for medical reasons (both had avid nodes). One hundred eighty-eight proceeded to surgery. Unsuspected unresectable disease was encountered in 10 (5.32%). On multivariate analysis, this was associated with the presence of avid nodes following NAC: OR 7.67 (1.94–30.4; p = 0.004; Table 2).

Disease progression overall

Overall, avid nodes following NAC predicted disease progression: OR 3.84 (1.46–10.1; p = 0.006), independent of all variables including clinical N stage. As with progression on PET-CT, this association was primarily seen for patients with mN2 disease after NAC.

Predicting unresectable disease at surgery

Before surgery, 44 of 188 patients (23.4%) had avid nodes following NAC. Of these, 6 (14.0%) had an abandoned resection, compared with 4 of 144 without (2.78%; p = 0.011; 6 true positives, 38 false positives, 4 false negatives, and 140 true negatives) The presence of avid nodes despite NAC was 60.0% sensitive (26.2–87.8) and 78.7% specific (71.9–84.4) for predicting unsuspected unresectable disease. Positive predictive value (PPV) was 13.6% (8.13–22.0), and negative predictive value (NPV) was 97.2% (94.2–98.9%).

These proportions were less than the development cohort, in which 11 of 43 (25.6%) patients with avid nodes had an abandoned resection, compared with 15 of 197 (7.61%) without. Rates of abandoned resections halved from 10.5 to 5.32%.

Prognosis

Two patients (1.06%) died in hospital of post-operative complications and were excluded. Of the remaining 176 patients (Table 1), 55 died (31.3%) and 59 (33.5%) developed recurrence during follow-up. Median OS, DFS, and recurrence were not reached (respective 95% CI 1067 days—not reached; 770—not reached; 872—not reached). Median follow-up was 783 days (737–886).

On multivariate Cox analysis, the presence of avid nodes following NAC was associated with death and recurrence following resection: OS HR 2.46 (1.34–4.51; p = 0.004), DFS HR 1.90 (1.10–3.31; p = 0.022), and time to recurrence HR 2.02 (1.11–3.66; p = 0.021; Table 3; Fig. 1). This was independent of similar negative associations with progressive ypN stage. No other factors, including mTR (either reduction in SUVmax, or PERCIST), were independently associated with prognosis. There were no clinically or statistically significant differences when mNR was quantified using mN stage after NAC (i.e., the number of avid nodes). These HR were similar to the development cohort: OS 1.75 (0.99–3.07; p = 0.053), DFS 2.03 (1.16–3.55; p = 0.013), and recurrence 2.06 (1.10–3.83; p = 0.023).

Table 3 Prognosis: multivariate Cox regression analysis
Fig. 1
figure 1

Disease-free survival and mNR after neoadjuvant chemotherapy; p values generated using log-rank test

Hybrid pathological metabolic nodal stage

As both mNR and ypN independently predicted prognosis, we considered a composite. Overall 109 patients had nodal metastases (ypN+), while 38 had avid nodes (mN+). Considering post-NAC mN, a predictor of ypN, there were 31 true positives, 7 false positives, 78 false negatives, and 60 true negatives. mNR was 28.4% sensitive (20.2–37.9), and 89.5% specific (79.7–95.7), PPV 81.6% (76.4–90.5), and NPV 43.5% (40.0–47.0).

This ypNmN stage was associated with prognosis (Table 3; Fig. 2): relative to pN0mN0 disease, patients with non-avid nodal metastases (ypN + mN−) had a worse prognosis. However, prognosis was even worse for patients with avid nodal metastases (ypN + mN+). Interestingly, the small number with ypN-mN+ disease (i.e., presumed false avid positive) demonstrated a trend towards worse prognosis for DFS (p = 0.085) with significantly worse recurrence, suggesting the possibility of pathologically missed micro-metastases.

Fig. 2
figure 2

Disease-free survival pN mN hybrid stage after neoadjuvant chemotherapy; p values generated using log-rank test

Predicting early recurrence and death

One hundred twenty-nine of 161 (78.2%) patients were alive and disease-free at 1 year, reducing to 68 of 128 (53.1%) at 2 years. On multivariate regression, lack of complete mNR was associated with recurrence and death at both time points: OR 4.18 (1.56–11.2; p = 0.004) and 3.63 (1.26–10.5; p = 0.017) respectively. This was again independent of associations with ypN (Table 4). The associations with ypN were primarily for recurrence/death within 1 year, rather than 2 years. At 2 years, recurrence and death were associated with hybrid pNmN stage.

Table 4 Predicting early recurrence and death: multivariate binary logistic regression

At 1 year, 16 of 36 (44.4%) patients with persistent avid nodes had either died or developed recurrence, compared with 20 of 129 (15.5%) without (p < 0.001 Fisher’s exact test). Lack of a nodal CMR was 44.4% sensitive (27.9–61.9) and 84.5% specific (77.1–90.3), with PPV 44.4% (31.7–57.8) and NPV 84.5 (80.1–88.1).

At 2 years, 20 of 27 (74.1%) patients with persistent avid nodes had either died or developed recurrence, compared with 38 of 99 without (38.4%; p = 0.001). Lack of nodal CMR was 34.5% sensitive (22.5–48.1) and 89.7% specific (79.9–95.8), with PPV 74.1% (56.6–86.3) and NPV 61.6% (56.7–66.3).

Discussion

We previously reported lack of complete metabolic nodal response of esophageal cancer to NAC to be a new and independent predictor of disease progression to unresectable disease [1], death, and recurrence [2]. In this validation study of an immediately consecutive cohort of 200 patients, we confirmed these associations and were able to risk stratify recurrence and death within 1 and 2 years. Compared with our development cohort, effect sizes were entirely consistent for prognosis. That for unresectable disease while significant and consistent, was less clear, reflective of a reduction in event rate overall with time.

These findings have a number of implications. Most immediately, mNR appears to a simple way to risk stratify patients before surgery, firstly identifying patients at higher risk of occult disease resulting in an abandoned resection. Guidelines presently recommend restaging with CT to exclude disease progression before surgery [16]. We previously reported PET-CT to be more sensitive [1]; however, this still failed to identify incurable disease in 5% of our patients. Identifying lack of complete mNR might allow these patients to undergo additional investigations (such as laparoscopy, thoracoscopy, or magnetic resonance imaging). These patients may also benefit from more personalized and realistic information about their immediate risk of an abandoned resection, and medium- and long-term prognosis if surgery is successful. While this may not routinely alter practice—other than for patients unclear as to whether to proceed with surgery—patients can receive more realistic information before consenting to surgery. While there is little evidence or consensus as to follow-up, mNR might be incorporated into decision support tools for targeted surveillance, and future editions of the TNM classification. Finally, the suggestion that patients with avid nodal disease, seemingly in the absence of pathological nodal disease, have a worse prognosis than those without either is notable. Such instances might be dismissed as false positives on PET-CT, but the converse might be true, with these instances identifying occult but viable nodal micrometastases.

Biologically, this statistical independence of pN and mNR is intriguing. As evidenced by the prognostic significance of a novel pNmN hybrid stage, avid nodal metastases confer a worse prognosis than non-avid metastases. This suggests persistent nodal avidity to be a surrogate of a worse cancer phenotype. While the reasons for nodal avidity are complex and incompletely understood (perhaps reflecting mutational burden disrupting glucose utilization, in conjunction with tumor micro-environment, perfusion, and hypoxia), at a simplistic level, it may be possible to infer chemo(in)sensitivity. We previously reported primary tumor response on PET-CT (mTR) to be an imperfect surrogate of pathological response (pTR) [17], in agreement with other groups [18], including its successful use within the MUNICON trial [19]. Esophageal cancer comprises a limited number of cancer clones, which vary genomically and probably phenotypically, including their chemosensitivity and propensity to metastasize [20]. Nodal metastases may therefore comprise just one or two selected oligoclones [21, 22]. We, and other groups, have previously reported dramatic clonal evolution within the primary tumor during the selection pressure of NAC [23, 24], with chemo-resistant clones tending to persist or emerge, while chemosensitive clones are lost. Lack of mNR may therefore be an indicator of chemo-resistant clones within resectable lymph nodes, and therefore by extrapolation, a surrogate for the occult distant nodal or hematogenous micrometastases that are responsible for recurrence. This hypothesis is supported by evidence that nodal downstaging, evidenced by imaging in conjunction with pN stage, represents an independent prognostic marker, likely a surrogate of pathological nodal regression [25, 26]. Whether this plausibly might be used to guide neoadjuvant and adjuvant therapy is unknown.

While this study provides temporal validation of mNR as a biomarker, we acknowledge a number of limitations. Firstly, this represents a single-center study with a degree of subjectivity inherent to all imaging studies (although one subject to a rigorous MDT process); whether these findings can be generalized to other centers and imaging platforms is not clear. However, similar results have been reported from Japan [27] following largely platinum-based NAC as a marker of overall survival (although recurrence was not assessed). While we previously found no differences across reconstruction platforms [2], these, and other variables (such as NAC regimen) inevitable of evolving practice, may exert subtle effects we were unable to identify. We were also unable to reliably adjust for adjuvant therapy, and there may be additional factors that we have failed to identify. Secondly, detection biases have been introduced by “on-demand” investigations for recurrence rather than surveillance imaging. Thirdly, while we found consistent direction and sizes of effect for prognosis compared with our development cohort, we saw fewer patients overall with occult unresectable disease resulting in an abandoned resection. This probably reflects improvements in care with time, and hence while the association persisted, its effect size is likely to be less accurate, and requires further validation.

In conclusion, in a cohort of 200 patients immediately consecutive to our discovery cohort, we found lack of a complete mNR (i.e., the development or persistence of avid nodes despite NAC) to again be an independent and seemingly consistent marker of disease progression, recurrence, and death after surgery. This temporal validation provides supportive evidence justifying the assessment of mNR as a new candidate biomarker for these endpoints in external validation sets. If supported, mNR might represent a new mechanism by which to both risk stratify patients and personalize therapy with esophageal and GEJ cancer.