Comparison between artificial neural network and Cox regression model in predicting the survival rate of gastric cancer patients

  • Authors:
    • Lucheng Zhu
    • Wenhua Luo
    • Meng Su
    • Hangping Wei
    • Juan Wei
    • Xuebang Zhang
    • Changlin Zou
  • View Affiliations

  • Published online on: July 18, 2013     https://doi.org/10.3892/br.2013.140
  • Pages: 757-760
Metrics: Total Views: 0 (Spandidos Publications: | PMC Statistics: )
Total PDF Downloads: 0 (Spandidos Publications: | PMC Statistics: )


Abstract

The aim of this study was to determine the prognostic factors and their significance in gastric cancer (GC) patients, using the artificial neural network (ANN) and Cox regression hazard (CPH) models. A retrospective analysis was undertaken, including 289 patients with GC who had undergone gastrectomy between 2006 and 2007. According to the CPH analysis, disease stage, peritoneal dissemination, radical surgery and body mass index (BMI) were selected as the significant variables. According to the ANN model, disease stage, radical surgery, serum CA19‑9 levels, peritoneal dissemination and BMI were selected as the significant variables. The true prediction of the ANN was 85.3% and of the CPH model 81.9%. In conclusion, the present study demonstrated that the ANN model is a more powerful tool in determining the significant prognostic variables for GC patients, compared to the CPH model. Therefore, this model is recommended for determining the risk factors of such patients.

Introduction

Although the global incidence of gastric cancer (GC) is on the decrease, it remains high in eastern Asia (1). GC remains one of the leading causes of cancer-related mortality worldwide, being the second most common type of cancer and the second most common cause of cancer-related mortality in China (2). It is the most frequently diagnosed cancer in rural areas and the incidence of GC among males and females is estimated to be 49.6 and 22.5, respectively, per 100,000 individuals (2).

The determination of prognostic factors and survival rate of patients is crucial. Over the last decades, data analysts have used various survival methods, such as the Cox regression hazard (CPH) or the parametric regression models, for analyzing survival data sets. However, the human body is a complex biological system and the majority of the clinical characteristics exhibit a multidimensional and non-linear relationship. Thus, it is difficult to predict the prognosis of gastric carcinomas with a conventional statistical technique. The artificial neural network (ANN) is a novel computer model inspired by the function of the human brain. It is able to build non-linear statistical models to assess complex biological systems. Over the last few years, ANN models have been introduced in clinical medicine for clinical validations (36).

In this study, we developed an ANN model to determine the risk factors for GC patients.

Materials and methods

Patients

A total of 289 patients, including 218 men and 71 women, with a mean age of 63.20±10.75 years, with histologically proven gastric carcinoma who underwent surgery between March, 2006 and December, 2007 were enrolled in this study. Of these 289 patients, radical total gastrectomy was performed in 76 (26.3%), radical subtotal gastrectomy in 168 (58.1%) and palliative gastrectomy in 41 patients (14.2%). According to the American Joint Committee on Cancer TNM classification (7), 56 patients had stage I, 86 had stage II, 116 had stage III and 31 had stage IV disease. Peripheral blood samples were obtained from each patient within 1 week prior to surgery. The cut-off values for serum CEA and CA19-9 were 5 ng/ml and 37 U/ml, respectively. In this historical cohort study, the required information for each patient, including age at diagnosis, serum CEA and CA19-9 levels, ascites, peritoneal dissemination, curability of surgery, body mass index (BMI), histological grade and disease stage, was gathered from the registered patient documents of the First Affiliated Hospital of Wenzhou Medical College. The survival time of each patient (in months) following surgery was also registered.

This study was approved by the ethics committee of the First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China. Prior to this study, written informed consent was obtained from patients and their families.

Statistical analysis and neural network analysis

In the first step of data analysis, due to the survival nature of the outcome data, preliminary analyses such as the Kaplan-Meier and log-rank tests were performed. Subsequently, we utilized more complex statistical methods, including CPH and ANN models, for predicting patient survival rate. In the modeling process, we randomly divided data into two subsets: 173 patients were used for constructing the models (training subset) and the remaining 116 patients for assessing the validation of the models (testing subset). After evaluating the validation, the receiver operation characteristic (ROC) curve and the concordance index were applied to compare the prediction power of the described models. Of note, in fitting the ANN model we used a three-layer back-propagation neural network with 11 nodes in the input layer, 6 nodes in the hidden layer and 1 node in the output layer (Fig. 1). Since each patient’s status was a binary response variable (deceased or censored), the sigmoid function was utilized as the activation function in the hidden and output layers. For net training, a back-propagation learning algorithm with a learning rate of 0.05 and a momentum of 0.9 was utilized. The learning process was discontinued when the average error (mean square error) in the training set decreased to 0.0001. We also applied the backward selection method (with a significant level of entry of 0.10 and a significant level of removal of 0.15) to fit the CPH model.

Results

Patient characteristics

The study sample comprised 289 GC patients, including 218 men (75.4%) and 71 women (24.6%). The mean age of the patients was 63.20±10.75 years. The characteristics of the patients under study are provided in Table I.

Table I

Patient characteristics.

Table I

Patient characteristics.

VariablesNo.%
Age (years)
 ≤6514851.2
 >6514148.8
Gender
 Male21875.4
 Female7124.6
BMI
 <18.53411.8
 18.5–23.919968.9
 27–29.94716.3
 ≥3093.1
Stagea
 IA3311.4
 IB238.0
 IIA3712.8
 IIB4917.0
 IIIA3411.8
 IIIB4615.9
 IIIC3612.5
 IV3110.7
Peritoneal dissemination
 Absent26892.7
 Present217.3
Ascites
 Absent24283.7
 Present4716.3
Histological grade
 G172.4
 G25519.0
 G320570.9
 G4227.6
Histological type
 Adenocarcinoma22477.5
 Other6522.5
Radical surgery
 No4114.2
 Yes24885.8
CEA level
 Normal24083.0
 Elevated4917.0
CA19-9 level
 Normal22678.2
 Elevated6321.8

a AJCC Cancer Staging Manual Seventh Edition.

{ label (or @symbol) needed for fn[@id='tfn2-br-01-05-0757'] } BMI, body mass index; CEA, carcinoembryonic antigen; CA, carbohydrate antigen.

Modeling process

In the first step of the modeling process, data were divided in training (~60% of patients) and testing (~40% of patients) subsets. The Mantel-Cox test demonstrated that the estimated survival curves using the training and testing subsets exhibited no significant differences (P=0.650). In the following step, based on the validation set, the CPH and ANN models were used to determine the risk factors. The results are shown in Table II. To compare the accuracy of prediction between the models, we used true classification (the proportion of patients that were accurately classified in the ‘deceased’ and ‘survived’ groups) of the patients in the testing subset. The obtained results are shown in Table III. The ANN model led to more accurate predictions compared to the CPH model (true prediction of 85.3 vs. 81.9%). The area under the ROC curve, calculated from testing data, was 0.891 for the ANN model and 0.824 for the CPH model.

Table II

CPH and ANN modeling results of prognostic factors on gastric cancer patient survival.

Table II

CPH and ANN modeling results of prognostic factors on gastric cancer patient survival.

CPH model

Ordered factorsP-value
Stage<0.001
Peritoneal dissemination0.004
Radical surgery0.022
BMI0.092
Age0.134
Histological grade0.176
CEA0.275
CA19-90.401
Ascites0.738
Gender0.758
Adenocarcinoma0.814

ANN model

Ordered factorsNormalized importance

Stage0.241
Radical surgery0.222
CA19-90.115
Peritoneal dissemination0.074
BMI0.073
Histological grade0.067
Adenocarcinoma0.062
CEA0.043
Age0.042
Gender0.032
Ascites0.029

[i] CPH, Cox proportional hazard; ANN, artificial neural network; BMI, body mass index; CEA, carcinoembryonic antigen; CA, carbohydrate antigen.

Table III

Classification accuracy of ANN and CPH models in the testing subset.

Table III

Classification accuracy of ANN and CPH models in the testing subset.

GroupsObserved (no.)True prediction by ANN no. (%)True prediction by CPH no. (%)
Deceased5750 (87.7)48 (84.2)
Survived5948 (81.4)47 (79.7)
Total11698 (85.3)95 (81.9)

[i] ANN, artificial neural network; CPH, Cox proportional hazard.

Discussion

GC is the most prevalent malignancy in China and worldwide. The aim of this study was to identify the most significant prognostic factors of GC and compare the ability of CPH and ANN models in predicting the survival of GC patients.

ANN models exhibit several advantages over conventional statistical methods. They may rapidly recognize linear patterns, non-linear patterns with threshold impacts, categorical and stepwise linear patterns, or even contingency effects (8). ANN analyses need not start with a hypothesis or a priori identification of potential key variables. Therefore, undocumented or quantified potential prognostic factors may be determined if they already exist in the various datasets, although they may have been overlooked in the past.

In this study, the CPH analysis demonstrated that the survival time of the patients was associated with disease stage, peritoneal dissemination, radical surgery, BMI, age at diagnosis, histological grade, serum CEA level, serum CA19-9 level, ascites, gender and adenocarcinoma, in that order of importance. In this analysis, disease stage, peritoneal dissemination, radical surgery and BMI were significantly associated with survival time. Based on the ANN model, disease stage, radical surgery, serum CA19-9 level, peritoneal dissemination, BMI, histological grade, adenocarcinoma, serum CEA level, age at diagnosis, gender and ascites were identified as the significant variables, in that order of importance. Of these variables, disease stage, radical surgery, serum CA19-9 level, peritoneal dissemination and BMI were the most significant. This result may be attributed to the interaction terms between the variables considered in the ANN model.

Previously published studies reported that disease stage is the most important prognostic factor in GC patients (911). Additional studies identified other risk indicators, such as gender, number of involved lymph nodes, histological type and type of complementary treatment, as the significant effective factors for survival of GC patients (1217). Lai et al(18) conducted an ANN-based study for the prediction of tumor staging in GC patients. They reported an accuracy of 81.8% in predicting tumor stage in primary GC patients. In another study conducted by Chien et al(19), the ordinary logistic regression, ANN and decision tree methods were used for predicting postoperative complications of GC patients. The results of that study indicated that the ANN was a more accurate technique for predicting postoperative complications, compared to the logistic regression and decision tree methods.

In the present study, we compared the results of the CPH and ANN models in determining significant risk factors and true prediction of GC patients. Our findings indicated that the ANN is an appropriate technique for this purpose.

In conclusion, the ANN model appears to be more efficient in determining the prognostic factors of GC patients compared to the CPH model. Therefore, it is recommended for determining the significant risk factors and survival of GC patients.

References

1 

Jemal A, Siegel R, Xu J and Ward E: Cancer statistics. CA Cancer J Clin. 60:277–300. 2010.

2 

Chen W, Zheng R, Zhang S, et al: Report of incidence and mortality in China cancer registries, 2009. Chin J Cancer Res. 25:10–21. 2013.

3 

Zheng MH, Shi KQ, Lin XF, et al: A model to predict 3-month mortality risk of acute-on-chronic hepatitis B liver failure using artificial neural network. J Viral Hepat. 20:248–255. 2013. View Article : Google Scholar : PubMed/NCBI

4 

Biglarian A, Hajizadeh E, Kazemnejad A and Zali M: Application of artificial neural network in predicting the survival rate of gastric cancer patients. Iran J Public Health. 40:80–86. 2011.PubMed/NCBI

5 

Biglarian A, Hajizadeh E, Kazemnejad A and Zayeri F: Determining of prognostic factors in gastric cancer patients using artificial neural networks. Asian Pac J Cancer Prev. 11:533–536. 2010.PubMed/NCBI

6 

Amiri Z, Mohammad K, Mahmoudi M, Zeraati H and Fotouhi A: Assessment of gastric cancer survival: using an artificial hierarchical neural network. Pak J Biol Sci. 11:1076–1084. 2008. View Article : Google Scholar : PubMed/NCBI

7 

Egner JR: AJCC cancer staging manual. JAMA. 304:1726–1727. 2010. View Article : Google Scholar

8 

Levine RF: Clinical problems, computational solutions: a vision for a collaborative future. Cancer. 91:1595–1602. 2001. View Article : Google Scholar : PubMed/NCBI

9 

Bozzetti F, Bonfanti G, Morabito A, et al: A multifactorial approach for the prognosis of patients with carcinoma of the stomach after curative resection. Surg Gynecol Obstet. 162:229–234. 1986.PubMed/NCBI

10 

Allgayer H, Heiss M and Schildberg F: Prognostic factors in gastric cancer. Br J Surg. 84:1651–1664. 1997. View Article : Google Scholar

11 

Maruyama K: The most important prognostic factors for gastric cancer patients: a study using univariate and multivariate analyses. Scand J Gastroenterol. 22:63–68. 1987. View Article : Google Scholar

12 

Ghiandoni G, Rocchi M, Signoretti P and Belbusti F: Prognostic factors in gastric cancer evaluated by using Cox regression model. Minerva Chir. 53:497–504. 1998.PubMed/NCBI

13 

Borie F, Rigau V, Fingerhut A and Millat B: Prognostic factors for early gastric cancer in France: Cox regression analysis of 332 cases. World J Surg. 28:686–691. 2004. View Article : Google Scholar : PubMed/NCBI

14 

Toneri M, Mitsumoto Y, Shimabukuro M, et al: Prognosis of patients with gastric carcinoma treated in remote island hospitals in Japan: a multivariable analysis. Anticancer Res. 26:3621–3625. 2006.PubMed/NCBI

15 

Liu C, Lu P, Lu Y, Xu H, Wang S and Chen J: Clinical implications of metastatic lymph node ratio in gastric cancer. BMC Cancer. 7:2002007. View Article : Google Scholar : PubMed/NCBI

16 

Peyre CG, Hagen JA, DeMeester SR, et al: The number of lymph nodes removed predicts survival in esophageal cancer: an international study on the impact of extent of surgical resection. Ann Surg. 248:549–556. 2008.PubMed/NCBI

17 

Ozgüç H, Sönmez Y and Yerci O: Metastatic/resected lymph nodes ratio-based classification in gastric cancer. Turk J Gastroenterol. 19:2–7. 2008.PubMed/NCBI

18 

Lai KC, Chiang HC, Chen WC, Tsai FJ and Jeng LB: Artificial neural network-based study can predict gastric cancer staging. Hepatogastroenterology. 55:1859–1863. 2008.PubMed/NCBI

19 

Chien CW, Lee YC, Ma T, et al: The application of artificial neural networks and decision tree model in predicting post-operative complication for gastric cancer patients. Hepatogastroenterology. 55:1140–1145. 2008.PubMed/NCBI

Related Articles

Journal Cover

September-October 2013
Volume 1 Issue 5

Print ISSN: 2049-9434
Online ISSN:2049-9442

Sign up for eToc alerts

Recommend to Library

Copy and paste a formatted citation
x
Spandidos Publications style
Zhu L, Luo W, Su M, Wei H, Wei J, Zhang X and Zou C: Comparison between artificial neural network and Cox regression model in predicting the survival rate of gastric cancer patients. Biomed Rep 1: 757-760, 2013
APA
Zhu, L., Luo, W., Su, M., Wei, H., Wei, J., Zhang, X., & Zou, C. (2013). Comparison between artificial neural network and Cox regression model in predicting the survival rate of gastric cancer patients. Biomedical Reports, 1, 757-760. https://doi.org/10.3892/br.2013.140
MLA
Zhu, L., Luo, W., Su, M., Wei, H., Wei, J., Zhang, X., Zou, C."Comparison between artificial neural network and Cox regression model in predicting the survival rate of gastric cancer patients". Biomedical Reports 1.5 (2013): 757-760.
Chicago
Zhu, L., Luo, W., Su, M., Wei, H., Wei, J., Zhang, X., Zou, C."Comparison between artificial neural network and Cox regression model in predicting the survival rate of gastric cancer patients". Biomedical Reports 1, no. 5 (2013): 757-760. https://doi.org/10.3892/br.2013.140