Skip to main content

Advertisement

Log in

Predictive models for patients with lung carcinomas to identify EGFR mutation status via an artificial neural network based on multiple clinical information

  • Original Article – Clinical Oncology
  • Published:
Journal of Cancer Research and Clinical Oncology Aims and scope Submit manuscript

Abstract

Purpose

Epidermal growth factor receptor (EGFR) mutation testing has several limitations. Therefore, we built predictive models to determine the EGFR mutation status of patients and guide therapeutic decision-making.

Methods

We collected data from 320 patients with lung carcinoma, including sex, age, smoking history, serum tumour marker levels, maximum standardized uptake value, pathological results, computed tomography images, and EGFR mutation status. Artificial neural network (ANN) models based on multiple clinical characteristics were proposed to predict EGFR mutation status.

Results

A training set (n = 200) was used to develop predictive models of the EGFR mutation status (Model 1: area under the receiver operating characteristic curve [AUROC] = 0.910, 95% CI 0.861–0.945; Model 2: AUROC = 0.859, 95% CI 0.803–0.904; Model 3: AUROC = 0.711, 95% CI 0.643–0.773). A testing set (n = 50) and temporal validation data set (n = 70) were used to evaluate the generalisation performance of the established models (testing set: Model 1, AUROC = 0.845, 95% CI 0.715–0.932; Model 2, AUROC = 0.882, 95% CI 0.759–0.956; Model 3, AUROC = 0.817, 95% CI 0.682–0.912; temporal validation dataset: Model 1, AUROC = 0.909, 95% CI 0.816–0.964; Model 2, AUROC = 0.855, 95% CI 0.751–0.928; Model 3, AUROC = 0.831, 95% CI 0.723–0.910). The predictive abilities of the three ANN models were superior to that of a previous logistic regression model (P < 0.001, 0.027, and 0.050, respectively).

Conclusions

ANN models provide a non-invasive and readily available method for EGFR mutation status prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Data availability

The data sets analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ANN:

Artificial neural network

AUROC:

Area under receiver operating characteristic curve

CI:

Confidence interval

CT:

Computed tomography

CEA:

Carcinoembryonic antigen

CYFRA21-1:

Cytokeratin 19 fragment

EGFR:

Epidermal growth factor receptor

EGFR-TKIs:

Epidermal growth factor receptor tyrosine kinase inhibitors

18F-FDG:

18F-fluorodeoxyglucose

NSE:

Neuron-specific enolase

OR:

Odds ratio

PET/CT:

Positron emission tomography/computed tomography

ROC:

Receiver operating characteristic

SCCA:

Squamous cell carcinoma antigen

SUVmax :

Maximum standardized uptake value

References

Download references

Acknowledgements

We appreciate the help from two respiratory physicians (L. Yang and Y. Lu) for CT image interpretation, and the staff of the record retrieval department from the First Affiliated Hospital of Wenzhou Medical University for their efforts.

Funding

No external funding for this research need be declared.

Author information

Authors and Affiliations

Authors

Contributions

XYQ and WZ conceived and designed this study; HLW, XH and XLG helped with the collection and assembly of data. All authors contributed toward data analysis, drafting and critically revising the paper and agree to be accountable for all aspects of the work. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Wei Zhou.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval and consent to participate

This study was approved by the Ethical Committee of the First Affiliated Hospital of Wenzhou Medical University. All study participants provided written informed consent, and their data confidentiality were protected.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 Table S1. The detailed definitions of tumour characteristics (DOC 26 kb)

Supplementary material 2 Fig. S1. Structure diagram of artificial neural network (ANN) (TIFF 1302 kb)

432_2019_3103_MOESM3_ESM.tif

Supplementary material 3 Fig. S2. The confusion matrices for the three artificial neural network (ANN) models to predict epidermal growth factor receptor (EGFR) mutation status in the training set and testing set of modelling data set. a, all-variable model in the training set; b, significant-variable model in the training set; c, traditional-variable model in the training set; d, all-variable model in the testing set; e, significant-variable model in the testing set; f, traditional-variable model in the testing set (TIFF 4377 kb)

432_2019_3103_MOESM4_ESM.tif

Supplementary material 4 Fig. S3. The confusion matrices for the three artificial neural network (ANN) models to predict epidermal growth factor receptor (EGFR) mutation status in the temporal validation data set. a, all-variable model; b, significant-variable model; c, traditional-variable model (TIFF 2177 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qin, X., Wang, H., Hu, X. et al. Predictive models for patients with lung carcinomas to identify EGFR mutation status via an artificial neural network based on multiple clinical information. J Cancer Res Clin Oncol 146, 767–775 (2020). https://doi.org/10.1007/s00432-019-03103-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00432-019-03103-x

Keywords

Navigation