Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 20 September 2022
Sec. Digital Public Health
This article is part of the Research Topic Machine Learning in Disease Screening, Diagnosis, and Surveillance View all 20 articles

Predictive models based on machine learning for bone metastasis in patients with diagnosed colorectal cancer

  • 1Tianjin Union Medical Center, Tianjin Medical University, Tianjin, China
  • 2Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
  • 3Department of General Surgery, Tianjin Hongqiao Hospital, Tianjin, China
  • 4Department of Colorectal Surgery, Tianjin Union Medical Center, Tianjin, China
  • 5Tianjin Institute of Coloproctology, Tianjin, China
  • 6Department of Spinal Surgery, Tianjin Union Medical Center, Tianjin, China
  • 7The Institute of Translational Medicine, Tianjin Union Medical Center of Nankai University, Tianjin, China
  • 8Nankai University School of Medicine, Nankai University, Tianjin, China

Background: This study aimed to develop an artificial intelligence predictive model for predicting the probability of developing BM in CRC patients.

Methods: From SEER database, 50,566 CRC patients were identified between January 2015 and December 2019 without missing data. SVM and LR models were trained and tested on the dataset. Accuracy, area under the curve (AUC), and IDI were used to evaluate and compare the models.

Results: For bone metastases in the entire cohort, SVM model with poly as kernel function presents the best performance, whose accuracy is 0.908, recall is 0.838, and AUC is 0.926, outperforming LR model. The top three most important factors affecting the model's prediction of BM include extraosseous metastases (EM), CEA, and size.

Conclusion: Our study developed an SVM model with poly as kernel function for predicting BM in CRC patients. SVM model could improve personalized clinical decision-making, help rationalize the bone metastasis screening process, and reduce the burden on healthcare systems and patients.

Introduction

Colorectal cancer (CRC) is a common malignant tumor, ranked the third most malignant tumor worldwide (1, 2). Distant metastasis is the leading cause of death in CRC patients (3), accounting for approximately 50% of patients after CRC surgery (4). The most common metastatic site of CRC is the liver or lung, while bone metastases are rare with an incidence of only 3–7% (5, 6). Patients with bone metastases have a poor prognosis, with a 5-year survival rate of < 5% and a median survival of 5–21 months (79).

Due to the low incidence and insignificant initial symptoms, bone metastases of CRC are difficult to diagnose at an early stage. On the one hand, compared with the low incidence of bone metastases in CRC patients, the incidence at autopsy is higher, reaching 10.7–23.7% (10). On the other hand, bone metastases are identified by further imaging or pathological examination after the occurrence of skeletal-related events (SREs) in CRC patients (11), but the median time to SREs is 2 months after the onset of bone metastases (7). Therefore, bone metastases may not be diagnosed on time in many CRC patients. Due to delayed diagnosis, patients may miss the optimal treatment time, leading to further disease progression and poor prognosis. Therefore, it is significant to predict the occurrence of bone metastasis in CRC patients.

Several predictive models for developing bone metastasis in CRC patients have been reported in previous studies (1214). However, the performance of these models is hardly satisfactory because they are based on simple LR regression models, which may be unsuitable for predicting bone metastases. In addition, these models only identified independent risk factors associated with developing bone metastasis from CRC but did not assess the importance of each factor. Recently, artificial intelligence (AI) models based on machine learning (ML) algorithms have been increasingly used in clinical practice (15, 16). Among them, support vector machine (SVM) and other prediction models based on machine learning are better at predicting the distant metastasis of tumors, such as gastric cancer, thyroid cancer, and prostate cancer (17). SVM used in this study is a binary classification model whose basic model is a linear classifier defined by maximizing the interval on the feature space. SVM can be transformed into a non-linear classifier using the kernel method. SVM learning strategy is to maximize the interval, which can be translated into a convex quadratic programming problem and SVM learning algorithm is the optimization algorithm for solving the convex quadratic programming (18). Notably, SVM has some advantages for solving small sample high-dimensional problems. However, there are remain no studies using artificial intelligence models to predict bone metastasis in CRC patients.

Therefore, this study used population-based data to identify risk factors associated with bone metastasis in CRC patients and then build an artificial intelligence model to predict disease occurrence and help clinicians detect bone metastases in a timely manner. This can provide patients with personalized clinical strategies and promote rational allocation of healthcare resources.

Materials and methods

Study population

This study was based on SEER database, and patient data were collected from “SEER Research Plus Data, 17 Registries, Nov 2021 Sub (2000–2019)” using SEER*stat 8.4.0 software and then extracted from the database between January 2015 and December 2019. SEER database covers 28% of the US population and includes information regarding cancer incidence, survival outcome, and treatment strategy from 17 population-based cancer registries. The patient selection procedure is displayed in Figure 1, and informed consent was not required as the patients were anonymized before publication. This study was approved by the Ethics Committee of Tianjin Union Medical Center.

FIGURE 1
www.frontiersin.org

Figure 1. The analytical cohort and exclusion criteria.

The inclusion criteria were 1) primary CRC cases with histological confirmation, 2) histological classification: adenocarcinoma (icd-o-3:8140 to 8144, 8210 to 8213, 8220 to 8221, 8260 to 8263, 8551–8574) mucinous adenocarcinoma (MC, icd-o-3: 8480, 8481), seal ring cell carcinoma (SRCC, icd-o-3:8490), and 3) with a clear record of bone metastases. The exclusion criteria were (1) unknown information about the size, location, grade, The American Joint Committee on Cancer (AJCC) TNM stage(8th), T stage, N stage, surgery information, extraosseous metastasis, and bone metastatic status, and (2) CRC was not the first tumor.

Data selection

All CRC patients were definitively diagnosed by pathologic examination, and BM was confirmed by imaging examination and/or pathologic examination. A total of 17 population, clinicopathological, serological indicator, extraosseous metastasis, and treatment variables were included. Population variables included age and sex, clinicopathology variables included site, size, grade, histology, AJCC TNM stage, T stage, N stage, and M stage, and serological indicators included carcinoembryonic antigen (CEA) levels. Extraosseous metastasis involves bone, brain, liver, and lung metastasis. All methods were conducted according to SEER database relevant guidelines.

Model establishment

All statistics were calculated using python (version 3.8). First, the initial data were preprocessed (12, 19). (1) Continuous variables: “Age” was divided into “>60 years” and “ <60 years”; “Size” was divided into “>2 cm,” “2–5 cm” and “>5 cm.” (2) Categorical variables: “Grade” was divided into “Grade I-II”, “Grade III-IV”; “T stage” was divided into “T1/2” and “T3/4”; “N stage” was divided into “N0” and “N1/2.” (3) Due to the small sample size and unbalanced distribution of the original distant metastasis variables (including lung metastasis, liver metastasis and brain metastasis) in SEER database, we added the variable of extraosseous metastasis for later model calculation. Pearson correlations between ten variables were calculated, and heatmaps were drawn. As Figure 2 displays, T stage strongly correlates with tumor size. For the features involved in the calculation to have low correlation, it is necessary to remove T stage or size, and the principle of feature removal is to remove the feature with less weight in the model calculation. The weight of each feature calculated by the random forest is presented in Figure 3. The figure shows that T stage occupies the smallest weight, implying that it is the is least important feature in the model analysis, so it is reasonable to remove T stage feature. To sum, nine features were included: age, sex, primary site, histologic type, CEA, size, N stage, extraosseous metastases (EM), and grade. Considering that the extreme imbalance of this sample (200:1) is likely to affect the model performance, it is necessary to adopt some sampling strategies. SMOTE Tomek was used in the training set as an Integrated Sampling method, and then the dataset was divided into a training set and a test set according to a ratio of 8:2.

FIGURE 2
www.frontiersin.org

Figure 2. Feature correlation heatmap after initial preprocessing.

FIGURE 3
www.frontiersin.org

Figure 3. The influence weight of each factor calculated by the random forest algorithm.

SVM, LR, decision tree (DT), random forest (RF), and Extreme Gradient Boosting (XGB) models were used to analyze the data. To select a model with good results, we also include model comparison as part of the study. As a binary classification model, SVM aims to find the optimal hyperplane to partition the samples (Figure 4), the learning strategy is to maximize the interval, and the solution of the model must be transformed into a convex optimization problem. The basic principle is to map the sample training data from the low-dimensional space to the high-dimensional space. Consequently, the sample training data is linearly separable and then the boundaries are linearly partitioned. For the sample (xi, yi) and the hyperplane(ω,b), the geometric interval is defined as follows.

γi=yi(ωω·xi+bω)
FIGURE 4
www.frontiersin.org

Figure 4. Schematic diagram of SVM.

Under the premise of correctly classifying the samples, when the geometric distance is the largest, the obtained separation hyperplane is optimal. The constraints are as follows:

maxω,b Υ             s.t.yi(ω||ω||·xi+b||ω||)Υ,i=1,2,,N

Using the decision boundary function, it can be transformed into the following:

                                  minw,bω22s.t.yi(ω · xi+b)1, i=1, 2,,N

After introducing the Lagrange operatorαi, it can be transformed into the following:

L(ω,b,α)=12ω2-i=1Nαi(yi(ω·xi+b)-1)(αi0)

By taking the partial derivative of a function L(ω,b,α) with respect toωandb, we can obtain a function aboutαi.Let this function be 0 to find the optimal solution, and the optimal hyperplane formula can be obtained as follows.

ω*·x+b*=0

Its corresponding Lagrange operator is optimal, denoted asαi*. At this point the classification decision function is listed below.

f(x)=sign(ω*X+b*)=sign(i=1Nαi*yixi·x+b*)

Using classification decision functions, samples can be classified, which is known as SVM. However, for non-linear problems, a kernel function is required. The function of kernel function is mainly to realize the mapping from a feature space in the support vector machine to another feature space and convert the inner product of high-dimensional vectors into the inner product of low-dimensional vectors.

In the model of LR, the goal of training is to find the best weight and bias for each feature so that the error is minimized. DT is a supervised machine learning algorithm based on if-then-else rules. In the model of RF, it is trained to obtain multiple decision trees by randomly putting back the samples sampled, and finally the results of each decision tree are summed using Bagging algorithm. The above model is built by python to learn the dataset's features.

Model improvement

Linear, poly, rbf, and sigmoid were used as kernel functions for SVM model, and the results are demonstrated in Figure 5. SVM model with poly as the kernel function showed the best performance, with an accuracy of 0.908048, recall of 0.837838, and AUC of 0.908236. Therefore, the linear kernel function was selected to build the SVM model to predict whether CRC patients have bone metastasis.

FIGURE 5
www.frontiersin.org

Figure 5. Performance of SVM models with different kernel functions.

To achieve better model performance, a random search method was used for parameter optimization. After parameter optimization, SVM's Accuracy is 0.908, Recall is 0.838, and AUC is 0.926, demonstrating superior performance to previous models.

Results

Demographic and pathological characteristics

A total of 50,566 CRC patients were included in this study. At initial diagnosis, 50,325 patients (99.5%) had no bone metastases and 241 (0.5%) had bone metastases. All patients were randomized into a training set (n = 40,452) and a test set (n = 10,114) in a ratio of 8:2, and their clinical and pathological characteristics are listed in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Clinical and pathological characteristics of training and test sets.

Model analysis and variable influence on prediction

Pearson correlations between all variables were calculated, and heatmaps were drawn, revealing no significant correlations between variables (Figure 5). For the multivariate LR model with an enter variable selection method, six characteristics were identified as independent risk factors (Table 2), including primary tumor site (p < 0.001), size (p = 0.042), histology (p = 0.018), grade (p < 0.001), CEA level (p < 0.001), EM (p < 0.001). According to RF results (Figure 4), the top three most important factors affecting model prediction of BM are EM, CEA, and size. Notably, the influence weight of EM accounts for 42.32%, which may provide some basis for diagnosing clinical auxiliary BM.

TABLE 2
www.frontiersin.org

Table 2. Multivariable logistic regression model with enter variable selection.

Model performance

The training set was used to train the model, and the test set was used to test the accuracy and generalization ability of the model. The performance indicators of the evaluation model were AUC, Accuracy, and Recall. After comparing the performance of different kernel functions in the SVM model (Figure 6), the linear kernel function was selected. The results were compared and analyzed using SVM, LR, DT, RF, and XGB models. The performance comparison of different models is provided in Table 3, showing that SVM model is better than the other models and may be used clinically. Previous models have mostly used LR, and to better compare the improvements brought about by SVM model, ROC curves were plotted, and Integrated Discrimination Improvement (IDI) was calculated. As displayed in Figure 7, LR AUC is 0.92, and SVM AUC is 0.93, with an IDI of 22.66% (Figure 8), confirming that SVM model outperforms LR in this scenario.

FIGURE 6
www.frontiersin.org

Figure 6. Results of Pearson correlation analysis between all variables. The heatmap shows the correlation between the variables.

TABLE 3
www.frontiersin.org

Table 3. Comparing the prediction performances of different models for BM.

FIGURE 7
www.frontiersin.org

Figure 7. ROC curve, in which the new model refers to SVM and the old one refers to LR.

FIGURE 8
www.frontiersin.org

Figure 8. IDI curve, in which the new model refers to SVM and the old one refers to LR.

Discussion

The incidence of bone metastasis in CRC patients is only 3–7% (6), but these patients have a poor clinical prognosis and often suffer from SREs, such as pathological fractures, severe bone pain, spinal cord compression, and hypercalcemia (20) which can seriously impair their function and quality of life, and even further affect the outcomes. Therefore, early identification and clinical intervention of bone metastasis are critical to prevent SREs and improve the clinical prognosis.

There remains a lack of accurate and effective methods to predict bone metastasis in CRC patients. Pathological diagnosis is the gold standard, but if the pathological diagnosis is unclear, the identification of bone metastasis in CRC patients relies on SREs and imaging examinations such as X-ray, CT, MRI, emission computed tomography (ECT), and positron emission tomography/computed tomography (PET/CT) (21, 22). However, these imaging modalities are expensive and associated with radiation risks, so they are not recommended as routine screening for CRC patients until SREs occur (12). For this reason, we developed an artificial intelligence model based on SVM algorithm to predict bone metastasis in CRC patients.

The advantage of this model is that it can effectively deal with the imbalance of medical data, as SVM algorithm can effectively solve the problem of inaccurate judgment results caused by small sample data in machine learning, which has stronger practicability (18, 23). In this study, this model displayed better accuracy and generalization than other models (LR, DT, and RF) and can be used to predict the occurrence of bone metastasis in CRC patients, which is helpful for doctors to make timely and effective clinical decisions.

Previous studies have reported risk factors associated with bone metastasis in CRC. Zheng et al. (21) conducted a retrospective study of 106 patients with bone metastasis of CRC, indicating that primary tumor location, lung metastasis, and serum CEA are independent risk factors. Moreover, compared with colon cancer and liver metastasis, colorectal cancer and lung metastasis were more likely to predict disease progression to bone metastasis. Wang et al. (13) determined that the degree of tumor differentiation, N stage, serum alkaline phosphatase (ALP), lactate dehydrogenase (LDH), CEA, liver and lung metastasis were risk factors for bone metastasis of CRC, and further developed a nomogram to evaluate the risk of bone metastasis in CRC patients. In addition, studies have shown that the most common risk factors for BM in CRC patients include cancer site, lymph node invasion, and lung metastasis (6).

In this study, EM, followed by CEA level and tumor size, were the top three most important factors for developing bone metastasis in CRC patients. Notably, EM has an influence weight of 42.32%, an important predictor of bone metastasis in CRC patients. Studies have depicted that about 25% of CRC patients have distant metastases at the time of diagnosis (24). In our study, EM occurred in 10% of CRC patients and was an independent predictor of bone metastasis in CRC patients, consistent with previous findings (12). Due to the low incidence and insidious symptoms of bone metastasis, it is often identified after the occurrence of SREs, when the disease has already advanced, so the treatment effect and prognosis are poor (25). In addition, due to the environment of a specific organ and its effect on tumor cell adhesion, CRC tends to metastasize first to the liver or lungs before the bones (8, 9, 26). Therefore, for CRC patients with extraosseous metastasis, regular health monitoring and follow-up may be helpful for the early identification of bone metastases.

Serum CEA is considered a specific biomarker for CRC, and its concentration is significantly elevated in patients with metastatic colon cancer (2729). In this study, CEA level was an independent predictor of bone metastasis in CRC patients, consistent with previous findings (21). Higher CEA levels may be associated with distant metastasis of CRC and nerve infiltration (30). In addition, in the current AJCC TNM staging of CRC, T staging is determined by the depth of the tumor invading the intestinal wall rather than the tumor size, but previous studies have shown that solid tumors, including those of the gastrointestinal tract, exhibit the potential to spread not only during the vertical invasion but also during horizontal growth; (31). As the tumor size increases, the potential for metastasis is higher (32). A retrospective study by Luo et al. showed that tumor size was positively correlated with distant metastasis of rectal cancer (33). Similarly, our study depicted that size was an independent risk factor for bone metastasis of CRC, with a significantly higher incidence of bone metastases in tumors larger than 2 cm. This may provide some basis for diagnosing CRC patients with bone metastases in the clinic.

Nonetheless, this study has some limitations. First, since the model was not externally validated and was based on retrospective data, prospective cohort studies are needed to validate its accuracy and stability. Second, the model is based on an SVM algorithm, so it may be clinically difficult to interpret key features screened out by the model. In addition, since all study subjects were representative of the US population, the application of this risk model to other countries and ethnicities is limited.

Nowadays, with the rapid development of artificial intelligence technology, deep learning is widely applied in the detection and treatment of various diseases, such as cancer, diabetes, Alzheimer's disease and Parkinson's disease, and better results have been obtained (34, 35). In future research, it is planned to apply deep learning techniques in the prediction of bone metastasis occurring in colorectal cancer.

Conclusion

This study developed and validated an artificial intelligence model based on machine learning algorithms to individually predict the occurrence of bone metastasis in CRC patients by using clinical characteristics and quantifying the major factors leading to the increased risk of bone metastases. Among them, EM, followed by CEA level and size, were the top three most important factors for bone metastasis in CRC patients. Compared with the traditional LR model, the prediction performance of SVM algorithm is better (IDI: 22.66%); consequently, it could be used to timely detect bone metastases providing patients with personalized treatment and allocating health resources more effectively.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of Tianjin Union Medical Center. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

XZ, MZ, and LL conceived the idea for the study. LL and MZ were involved in planning and supervised data collection. HJ and SZ performed data collection. SZ, YZ, and HJ conducted data analysis. TL and HH drafted the manuscript. MZ, LL, and TS contributed to writing of manuscript. All authors have discussed and decided that this manuscript is the final version and agreed to publish it.

Funding

This study was funded by Foundation of Tianjin Union Medical Center (grant number: 2016RMNK002 and 2019ZDXK01). This work was funded by Tianjin Key Medical Discipline (Specialty) Construction Project.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Wong MCS, Huang J, Lok V, Wang J, Fung F, Ding H, et al. Differences in incidence and mortality trends of colorectal cancer worldwide based on sex, age, and anatomic location. Clin Gastroenterol Hepatol. (2021) 19:955–66 e61. doi: 10.1016/j.cgh.2020.02.026

PubMed Abstract | CrossRef Full Text | Google Scholar

3. de Krijger I, Mekenkamp LJ, Punt CJ, Nagtegaal ID. MicroRNAs in colorectal cancer metastasis. J Pathol. (2011) 224:438–47. doi: 10.1002/path.2922

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Kim HK, Cho JH, Lee HY, Lee J, Kim J. Pulmonary metastasectomy for colorectal cancer: how many nodules, how many times? World J Gastroenterol. (2014) 20:6133–45. doi: 10.3748/wjg.v20.i20.6133

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Weiss L, Grundmann E, Torhorst J, Hartveit F, Moberg I, Eder M, et al. Haematogenous metastatic patterns in colonic carcinoma: an analysis of 1541 necropsies. J Pathol. (1986) 150:195–203. doi: 10.1002/path.1711500308

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Christensen TD, Jensen SG, Larsen FO, Nielsen DL. Systematic review: Incidence, risk factors, survival and treatment of bone metastases from colorectal cancer. J Bone Oncol. (2018) 13:97–105. doi: 10.1016/j.jbo.2018.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Santini D, Tampellini M, Vincenzi B, Ibrahim T, Ortega C, Virzi V, et al. Natural history of bone metastasis in colorectal cancer: final results of a large Italian bone metastases study. Ann Oncol. (2012) 23:2072–7. doi: 10.1093/annonc/mdr572

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Riihimaki M, Hemminki A, Sundquist J, Hemminki K. Patterns of metastasis in colon and rectal cancer. Sci Rep. (2016) 6:29765. doi: 10.1038/srep29765

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Sundermeyer ML, Meropol NJ, Rogatko A, Wang H, Cohen SJ. Changing patterns of bone and brain metastases in patients with colorectal cancer. Clin Colorectal Cancer. (2005) 5:108–13. doi: 10.3816/CCC.2005.n.022

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Katoh M, Unakami M, Hara M, Fukuchi S. Bone metastasis from colorectal cancer in autopsy cases. J Gastroenterol. (1995) 30:615–8. doi: 10.1007/BF02367787

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Farooki A, Leung V, Tala H, Tuttle RM. Skeletal-Related events due to bone metastases from differentiated thyroid cancer. J Clin Endocrinol Metab. (2012) 97:2433–9. doi: 10.1210/jc.2012-1169

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Guan X, Ma CX, Quan JC Li S, Zhao ZX, Chen HP, et al. A clinical model to predict the risk of synchronous bone metastasis in newly diagnosed colorectal cancer: a population-based study. BMC Cancer. (2019) 19:704. doi: 10.1186/s12885-019-5912-x

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wang N, Liu F, Xi W, Jiang J, Xu Y, Guan B, et al. Development and validation of risk and prognostic nomograms for bone metastases in Chinese advanced colorectal cancer patients. Ann Transl Med. (2021) 9:875. doi: 10.21037/atm-21-2550

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Xu W, He Y, Wang Y, Li X, Young J, Ioannidis JPA, et al. Risk factors and risk prediction models for colorectal cancer metastasis and recurrence: an umbrella review of systematic reviews and meta-analyses of observational studies. BMC Med. (2020) 18:172. doi: 10.1186/s12916-020-01618-6

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Jones OT, Matin RN, van der Schaar M, Prathivadi Bhayankaram K, Ranmuthu CKI, Islam MS, et al. Artificial intelligence and machine learning algorithms for early detection of skin cancer in community and primary care settings: a systematic review. Lancet Digit Health. (2022) 4:e466–76. doi: 10.1016/S2589-7500(22)00023-1

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Wang X, Zheng Z, Xie Z, Yu Q, Lu X, Zhao Z, et al. Development and validation of artificial intelligence models for preoperative prediction of inferior mesenteric artery lymph nodes metastasis in left colon and rectal cancer. Eur J Surg Oncol. (2022). doi: 10.1016/j.ejso.2022.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Liu WC Li MX, Qian WX, Luo ZW, Liao WJ, Liu ZL, et al. Application of machine learning techniques to predict bone metastasis in patients with prostate cancer. Cancer Manag Res. (2021) 13:8723–36. doi: 10.2147/CMAR.S330591

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Zhou S. Sparse SVM for sufficient data reduction. IEEE Trans Pattern Anal Mach Intell. (2021) 44:5560–71. doi: 10.1109/TPAMI.2021.3075339

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Li X, Hu W, Sun H, Gou H. Survival outcome and prognostic factors for colorectal cancer with synchronous bone metastasis: a population-based study. Clin Exp Metastasis. (2021) 38:89–95. doi: 10.1007/s10585-020-10069-5

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Coleman RE. Skeletal complications of malignancy. Cancer. (1997) 80:1588–doi: 10.1002/(SICI)1097-0142(19971015)80:8+<1588::AID-CNCR9>3.0.CO;2-G

CrossRef Full Text | Google Scholar

21. Zheng H, Zhu ZH, Guo WJ, Zhang N, Cai YY, Ying JS, et al. Retrospective study of predictors of bone metastasis in colorectal cancer patients. J Bone Oncol. (2017) 9:25–8. doi: 10.1016/j.jbo.2017.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Park HS, Chun YJ, Kim HS, Kim JH, Lee CK, Beom SH, et al. Clinical features and KRAS mutation in colorectal cancer with bone metastasis. Sci Rep. (2020) 10:21180. doi: 10.1038/s41598-020-78253-x

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. (2000) 16:906–14. doi: 10.1093/bioinformatics/16.10.906

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Liu C, Wang T, Yang J, Zhang J, Wei S, Guo Y, et al. Distant metastasis pattern and prognostic prediction model of colorectal cancer patients based on big data mining. Front Oncol. (2022) 12:878805. doi: 10.3389/fonc.2022.878805

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Baek SJ, Hur H, Min BS, Baik SH, Lee KY, Kim NK. The characteristics of bone metastasis in patients with colorectal cancer: a long-term report from a single institution. World J Surg. (2016) 40:982–6. doi: 10.1007/s00268-015-3296-x

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Roth ES, Fetzer DT, Barron BJ, Joseph UA, Gayed IW, Wan DQ. Does colon cancer ever metastasize to bone first? a temporal analysis of colorectal cancer progression. BMC Cancer. (2009) 9:274. doi: 10.1186/1471-2407-9-274

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Min YL, Gong YX, Zhu PW, Lin Q, Li B, Shi WQ, et al. CEA as a risk factor in predicting ocular metastasis from colorectal cancer. J Cancer. (2020) 11:51–6. doi: 10.7150/jca.31196

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Pakdel A, Malekzadeh M, Naghibalhossaini F. The association between preoperative serum CEA concentrations and synchronous liver metastasis in colorectal cancer patients. Cancer Biomark. (2016) 16:245–52. doi: 10.3233/CBM-150561

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Kanellos I, Zacharakis E, Kanellos D, Pramateftakis MG, Tsahalis T, Altsitsiadis E, et al. Prognostic significance of CEA levels and detection of CEA mRNA in draining venous blood in patients with colorectal cancer. J Surg Oncol. (2006) 94:3–8. doi: 10.1002/jso.20549

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Gao Y, Wang J, Zhou Y, Sheng S, Qian SY, Huo X. Evaluation of Serum CEA, CA19-9, CA72-4, CA125 and ferritin as diagnostic markers and factors of clinical parameters for colorectal cancer. Sci Rep. (2018) 8:2732. doi: 10.1038/s41598-018-21048-y

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Li F, Kishida T, Kobayashi M. Serum iron and ferritin levels in patients with colorectal cancer in relation to the size, site, and disease stage of cancer. J Gastroenterol. (1999) 34:195–9. doi: 10.1007/s005350050243

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Norton L, Massague J. Is cancer a disease of self-seeding? Nat Med. (2006) 12:875–8. doi: 10.1038/nm0806-875

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Luo D, Shan Z, Liu Q, Cai S, Ma Y, Li Q, et al. The correlation between tumor size, lymph node status, distant metastases and mortality in rectal cancer patients without neoadjuvant therapy. J Cancer. (2021) 12:1616–22. doi: 10.7150/jca.52165

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Schneider L, Laiouar-Pedari S, Kuntz S, Krieghoff-Henning E, Hekler A, Kather JN, et al. Integration of deep learning-based image analysis and genomic data in cancer pathology: a systematic review. Eur J Cancer. (2022) 160:80–91. doi: 10.1016/j.ejca.2021.10.007

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Qu C, Zou Y, Ma Y, Chen Q, Luo J, Fan H, et al. Diagnostic performance of generative adversarial network-based deep learning methods for alzheimer's disease: a systematic review and meta-analysis. Front Aging Neurosci. (2022) 14:841696. doi: 10.3389/fnagi.2022.841696

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: predictive model, artificial intelligence, colorectal cancer, machine learning, bone metastasis

Citation: Li T, Huang H, Zhang S, Zhang Y, Jing H, Sun T, Zhang X, Lu L and Zhang M (2022) Predictive models based on machine learning for bone metastasis in patients with diagnosed colorectal cancer. Front. Public Health 10:984750. doi: 10.3389/fpubh.2022.984750

Received: 02 July 2022; Accepted: 25 August 2022;
Published: 20 September 2022.

Edited by:

Yu-Hsiu Lin, National Chung Cheng University, Taiwan

Reviewed by:

Weihua Gong, Zhejiang University, China
Mohammed Ambusaidi, UTAS, Oman

Copyright © 2022 Li, Huang, Zhang, Zhang, Jing, Sun, Zhang, Lu and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xipeng Zhang, zhxp1011@163.com; Liangfu Lu, liangfulv@tju.edu.cn; Mingqing Zhang, zmq@nankai.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.