Introduction

Despite decreasing incidence and mortality rates in developed countries, gastric cancer remains the sixth most commonly diagnosed cancer and the third most common cause of cancer mortality globally [1]. D2 lymph node dissection is recommended as a surgical approach for patients with curable gastric cancer [24] Two European randomized controlled trials (Dutch and MRC trials) that compared D1 and D2 lymph node dissection revealed that D2 lymph node dissection should be restricted to specialized centers where adequate training and supervision can be provided [5, 6].

The most important outcome of cancer surgery is survival. The outcome of surgery is generally thought to depend on the experience of the surgeon [7]. Actual learning curves based on survival data are rarely presented. Recent reports from prostate cancer have shown that as a surgeon’s experience increases, cancer control after radical prostatectomy improves [8, 9]. However, differences in cases seen by surgeons might contribute to residual confounding findings, and the endpoint was not actual survival but instead biochemical recurrence in those studies; therefore, findings of related reports are of uncertain clinical relevance to patients.

Thus, the aim of the present study was to investigate learning curves for surgeons performing D2 lymph node dissection based on actual survival. In this study, we could observe a trend of learning curves in cancer surgery.

Patients and methods

Study cohort and data sources

The study cohort consisted of 3,284 patients with gastric cancer who were treated by open gastric cancer surgery between January 2001 and December 2006 at one of eight institutions in Korea: Chonbuk National University, Dong-a University, Jaeseng Hospital, Kemyeong University, Seoul National University, Soonchunhayng University, Yonsei University, and National Cancer Center. All patients were treated by participating surgeons as a staff surgeon. Patients who (1) had a histologically confirmed gastric adenocarcinoma and (2) had newly diagnosed cancer without previous treatment were included in the analysis. Patients who underwent R2 resection (n = 122) or who underwent exploratory laparotomy only or bypass surgery (n = 102) were excluded. Finally, 3,060 patients who underwent potentially curative resection for gastric cancer were included in this study. All information was obtained with appropriate Institutional Review Board waivers, and data were collected without revealing any personal information.

Personal characteristics and clinical data

All characteristics of patients were obtained from a review of medical records. Demographic characteristics included age, sex, and body mass index. Clinicopathological characteristics included tumor location, tumor size, differentiation, Lauren classification, gross type, depth of invasion, lymph node metastasis, stage at diagnosis, operation method, and extent of lymph node dissection. Pathological diagnosis was determined with adherence to the sixth edition of the Union International Centre Cancer (UICC)/American Joint Committee on Cancer classification system [10]. In patients with multiple synchronous gastric cancers, the lesion with the deepest infiltration of the gastric wall was regarded as the main lesion and any others were regarded as accessory lesions. The clinicopathological characteristics of the main lesion were used for the analysis.

Requirements for surgeon

Each participating surgeon had an appointment as a staff surgeon during the study period, in which each surgeon had to perform at least 150 cases of open gastric cancer surgery to be eligible. Before the study period, all surgeons had completed a 4-year-long residency course on general surgery at a domestic teaching hospital followed by a 2-year-long fellowship course specific to gastric cancer surgery. The fellowship course had to be attended at the institutions where the surgeons resumed their career as a staff surgeon during the study period. Surgeons who had previously conducted gastric cancer surgeries either as a fellow or as a staff surgeon at other institutions were not included in this study because of difficulties in accurate data collection. Eligible patients were treated by one of nine surgeons, all of whom saw patients only at a study institution while on the staff there.

Operative procedures

The surgical procedures and reconstruction methods were as follows. Gastrectomy was performed if a tumor-free margin of 4 cm was present in cases with advanced gastric carcinoma or of 2 cm in cases with early gastric carcinoma. Extent of lymph node dissection was determined using the recommendations of the Japanese Research Society for Gastric Carcinoma [11]. After laparotomy, surgeons examined intraabdominal cavities and inspected the peritoneum, diaphragm, liver capsule, and pelvic cavity. All patients enrolled in the present study underwent gastrectomy with D1 + β or more lymph node dissection [11]. Patients with early gastric cancer underwent D1 + β lymph node dissection and those with advanced gastric cancer underwent D2 lymph node dissection. Tumors were staged in accordance with the 6th UICC tumor node metastasis (TNM) classification [10].

Chemotherapy

Not all patients were given postoperative chemotherapy. The indication for adjuvant treatment was not standardized among the various institutions. However, most patients with advanced gastric cancer more than TNM stage I were given 5-fluorouracil-based postoperative chemotherapy [12]. No patients were given postoperative or preoperative radiotherapy.

Follow-up schedule

Patient follow-up was conducted according to accepted clinical practice procedures at each institution. In general, follow-up consisted of an abdominopelvic computed tomogram every 6 months for 5 years after surgery and esophagogastro-duodenoscopy annually for 5 years after surgery. Cancer recurrence was defined as positive radiologic evidence.

Statistical methods and analyses

Follow-up of patients was completed until death or until the cutoff date of December 30, 2010. At the time of the last follow-up, 97 patients (3.2 %) had been lost to follow-up. The median follow-up interval for 3,060 patients alive at the cutoff date was 52 months (range, 0–106 months). The lost cases were treated as censored data for the analysis of survival rates. Overall survival was defined as the time from surgery to death (resulting from any cause). The Kaplan–Meier method [13] was used for estimating survival. The difference between survival curves was assessed using the log-rank test. The accepted level of statistical significance was P < 0.05.

Each surgeon’s experience was coded as the number of D1 + β or more gastrectomies performed before each patient, which indicates the surgeon’s total number of prior surgical experiences. Surgeon experience was grouped into two sets of categories, which included four groups of experience (≤50, 51–100, 101–200, and >200 applicable operations).

To generate surgical learning curves, a stratified Cox regression model was used. A quadratic function of surgical experience was used in the model. The median predicted survival probability was calculated from the model for each level of experience.

Results

Patient characteristics by surgeon experience

Clinical and pathological characteristics of the tumors are shown in Table 1. The median age of the 3,060 patients was 54 years (range, 20–91 years). Of the total patients, 2,113 (69.1 %) were men and 947 (30.9 %) were women. In terms of location, 1,662 gastric cancers (54.3 %) were in the lower one third of the stomach and the other 1,398 cancers were located in the middle or upper one third of the stomach. Among the cancers, 1,275 cancers (41.7 %) were early gastric cancers. Clinical and pathological characteristics were comparable throughout the periods.

Table 1 Patient characteristics

Surgical outcomes by surgeon experience

Surgical outcomes based on the number of surgical experiences are summarized in Table 2. Two of nine surgeons had performed fewer than 200 cases. As surgical experience increased, operating times (t test, P < 0.001) and duration of hospital stay (t test, P = 0.001) were shortened and the number of dissected lymph nodes (t test, P < 0.001) was increased. The rate of R0 resection was the lowest at the second period (χ2 test, P = 0.011). R1 resection represented cases of positive resection margin. Postoperative mortalities were not different throughout the periods (χ2 test, P = 0.237).

Table 2 Surgical outcomes by surgeon experience

Effect of surgeon experience on the overall survival of gastric cancer patients

The more experience the surgeons had the greater was the overall survival of patients [≤50 versus 100< ≤200; hazard ratio (HR) 0.775, 95 % confidence interval (CI) 0.617–0.914, P = 0.029]. Figure 1a shows the 5-year overall survival based on surgeon experience. There was an improvement in survival with increasing surgeon experience up to 100 prior surgeon experiences, and no further improvement after that. Analysis of patients who underwent R0 resection only showed that there was a significant association between overall survival and surgical experience (Fig. 1b). These trends were shown in stage II, III, and IV tumors (Fig. 2a–d).

Fig. 1
figure 1

a Survival curves by surgical experience for all patients. As surgical experience increased, the survival of patients improved. Overall survival (OS) rate at 5 years after 100 surgical experiences remained steady. b Survival curves based on surgical experience for patients who underwent R0 resection only. As surgical experience increased, the survival of patients improved. Overall survival rates here were higher than those of patients who underwent R0 and R1 resection in each period. Overall survival rate at 5 years after 100 surgical experiences was steady

Fig. 2
figure 2

ad Overall survival (OS) curves based on surgical experience for stages I–IV, respectively. The 5-year overall survival rates after 100 surgical experiences were higher than those at 100 or fewer surgical experiences at each TNM stage except at stage I

Univariate and multivariate analyses

Univariate and multivariate analyses showed that the ages of patients (HR 1.526, 95 % CI 1.310–1.778, P < 0.001), macroscopic type (HR 1.329, 95 % CI 1.084–1.629, P < 0.001), TNM stage (HR 2.241, 95 % CI 1.511–3.322, P < 0.001), and classification of residual tumor (HR 1.840, 95 % CI 1.362–2.485, P = 0.001) were significantly associated with survival (Table 3). After adjusting for well-known factors for survival including age, sex, tumor size, macroscopic type, and TNM stages, surgeon experience was a significant, independent risk factor for survival of patients with gastric cancer (<51 versus ≥ 201, HR 0.797, 95 % CI 0.642–0.989, P = 0.040).

Table 3 Univariate and multivariate analysis for overall survival

Discussion

The main findings of the present study are (1) the overall survival after gastrectomy with D1 + β or more lymph node dissection increased as the experience of the operating surgeon increased; (2) overall survival does not steadily increase but decreased after 50 experiences and then steadily again increased after 100 prior experiences; and (3) short-term surgical outcomes including operating time, duration of hospital stay, and operative mortality seemed to improve as operative experience increased.

A previous publication regarding the learning curve of gastric cancer surgeries that used mortality and morbidity rates as outcome measures suggested that the learning curve in gastrectomy with D2 lymph node dissection ranges from 15 to 25 cases in a study including 38 patients [15]. This previous study focused primarily on the feasibility of D2 lymph node dissection, used a restricted sample size, and presented limited information on oncological outcome. However, in the case of cancer surgery, oncological outcome should be considered so that clinical trial results are correctly interpreted. Another report of the learning curve for total gastrectomy in patients with gastric cancer analyzed the number of retrieved lymph nodes as a main outcome in terms of oncological surgery. However, the endpoint of this trial still was not actual survival [15].

We observed that overall survival after gastrectomy with D2 lymph node dissection increases with surgical experience. This finding was consistent when we compared the overall survival in each of the TNM stages except stage I. Recent reports of prostate cancer surgery have shown that as a surgeon’s experience increases, cancer control after radical prostatectomy improves [8]. However, differences in the mix of cases seen by surgeons might have contributed to residual confounding. Further, the endpoint of this study was biochemical recurrence, so the findings are of uncertain clinical relevance to patients.

Although this study included patients with a predominance of early gastric cancers, overall survival after gastrectomy with D2 lymph node dissection increased with surgical experience. Improved quality of D2 lymph node dissection might be one reason because the number of retrieved lymph nodes was increased and operating times were decreased with surgical experience. Although two European randomized controlled trials (Dutch and MRC trials) that compared D1 and D2 lymph node dissection failed to show a benefit of D2 lymph node dissection, 10-year survival follow-up data have shown the beneficial effect of D2 lymph node dissection [5, 6].

Our study here is the first report using actual survival of cancer patients. We found that the overall survival was not steadily increased but decreased after the initial experiences and then steadily increased again. This blended pattern of survival might be caused by the selection bias of the surgeons themselves. After initial experiences (specifically, the initial 50 experiences), surgeons tried to do more aggressive surgeries. The highest postoperative mortalities and the rates of R1 resection after initial experiences might be the result of less selection of patients by surgeons. However, this selection bias seemed to be overcome as experience further increased.

Surgical experience is hard to assess. We used the number of operations as surgical experience. However, we can expect that as the number of operations is increased, much operation quality, including not only the number of dissected lymph nodes but also operating times, blood loss during the operation, and postoperative patient care, might be increased. We assumed that surgical experience is the sum of all those factors. Therefore, improved survival after 100 operations in this study might be results of systematic improvement in care of patients including operation skills.

The major limitation of this study is that it is a retrospective study and as such the treatment strategy was not based on random assignment. In other words, a selection bias for choosing treatment modality might have occurred. However, a randomized clinical trial for the comparison of survival based on a learning curve is not possible. Another limitation of this study was the limited data of date of recurrence. Therefore, we could not clearly show the relationship between experience and disease-specific survival. This study analyzed data from high-volume centers in Korea and included surgeons who were trained for upper gastrointestinal surgery for at least 2 years. These strict inclusion criteria for surgeons and institutions might have influenced the results and might have reduced variation among surgeons. Postoperative chemotherapy might have also influenced patients’ survival. Different chemotherapeutic regimens and indications of adjuvant chemotherapy from different institutions might have influenced survival of patients. Intervals between experiences might also influence results. Although high-volume centers were included in this study, the number of operations during the study period could not be the same. However, this study clearly showed a trend of learning curve in cancer surgery. During the study period, laparoscopic gastric cancer surgeries began to be done in Korea, especially by young surgeons. Experiences of D2 lymph node dissection in open surgeries might help adequate lymph node dissection during laparoscopic surgery, and the experience of laparoscopic surgeries might help to understand the anatomy of the stomach and thus help to raise the quality of lymph node dissection during open surgeries. Experiences of laparoscopic surgeries were variable among the surgeons included in this study, which might have influenced the results.

In summary, as a surgeon’s experience increases, actual survival after gastric cancer surgery improves. Further research is needed to develop specific techniques to reduce the learning curve, including imaginary exercises or computer simulations. Interestingly, not the initial period (<50 experiences) but the second period (50–100 experiences) might be the most dangerous period, based on the fact that survival of patients was the poorest and postoperative morbidities and mortalities were the highest during the second period.