Deep learning for evaluation of microvascular invasion in hepatocellular carcinoma from tumor areas of histology images

Chen, Qiaofeng; Xiao, Han; Gu, Yunquan; Weng, Zongpeng; Wei, Lihong; Li, Bin; Liao, Bing; Li, Jiali; Lin, Jie; Hei, Mengying; Peng, Sui; Wang, Wei; Kuang, Ming; Chen, Shuling

doi:10.1007/s12072-022-10323-w

Deep learning for evaluation of microvascular invasion in hepatocellular carcinoma from tumor areas of histology images

Original Article
Open access
Published: 28 March 2022

Volume 16, pages 590–602, (2022)
Cite this article

Download PDF

You have full access to this open access article

Hepatology International Aims and scope Submit manuscript

Deep learning for evaluation of microvascular invasion in hepatocellular carcinoma from tumor areas of histology images

Download PDF

Qiaofeng Chen¹^na1,
Han Xiao²^na1,
Yunquan Gu³^na1,
Zongpeng Weng³,
Lihong Wei⁴,
Bin Li³,
Bing Liao⁴,
Jiali Li⁵,
Jie Lin⁶,
Mengying Hei⁴,
Sui Peng^1,3,
Wei Wang²,
Ming Kuang^2,7 &
…
Shuling Chen²

3879 Accesses
10 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Microvascular invasion (MVI) is essential for the management of hepatocellular carcinoma (HCC). However, MVI is hard to evaluate in patients without sufficient peri-tumoral tissue samples, which account for over a half of HCC patients.

Methods

We established an MVI deep-learning (MVI-DL) model with a weakly supervised multiple-instance learning framework, to evaluate MVI status using only tumor tissues from the histological whole slide images (WSIs). A total of 350 HCC patients (2917 WSIs) from the First Affiliated Hospital of Sun Yat-sen University (FAHSYSU cohort) were divided into a training and test set. One hundred and twenty patients (504 WSIs) from Dongguan People’s Hospital and Shunde Hospital of Southern Medical University (DG-SD cohort) formed an external test set. Unsupervised clustering and class activation mapping were applied to visualize the key histological features.

Results

In the FAHSYSU and DG-SD test set, the MVI-DL model achieved an AUC of 0.904 (95% CI 0.888–0.920) and 0.871 (95% CI 0.837–0.905), respectively. Visualization results showed that macrotrabecular architecture with rich blood sinus, rich tumor stroma and high intratumor heterogeneity were identified as the key features associated with MVI ( +), whereas severe immune infiltration and highly differentiated tumor cells were associated with MVI (−). In the simulation of patients with only one WSI or biopsies only, the AUC of the MVI-DL model reached 0.875 (95% CI 0.855–0.895) and 0.879 (95% CI 0.853–0.906), respectively.

Conclusion

The effective, interpretable MVI-DL model has potential as an important tool with practical clinical applicability in evaluating MVI status from the tumor areas on the histological slides.

Graphical abstract

Predicting microvascular invasion in hepatocellular carcinoma: a deep learning model validated across hospitals

Article Open access 09 October 2021

Using deep learning to predict microvascular invasion in hepatocellular carcinoma based on dynamic contrast-enhanced MRI combined with clinical parameters

Article 10 April 2021

Deep-learning-based analysis of preoperative MRI predicts microvascular invasion and outcome in hepatocellular carcinoma

Article Open access 08 June 2022

Introduction

Microvascular invasion (MVI) is one of the most important histological features of the prognosis and treatment management of hepatocellular carcinoma (HCC) [1, 2]. Postoperative adjuvant transarterial chemoembolization for HCC patients with MVI significantly reduced tumor recurrence and improved survival [3, 4]. MVI is also the only histological feature that has been proven to be predictive of the efficacy of adjuvant therapy by a clinical trial [5]. Additionally, MVI status can also be an indicator for therapeutic decision-making in recurrent HCC [6]. Therefore, an accurate histological diagnosis of MVI is highly critical to the precise management of HCC.

Postoperative histological assessment is the gold standard for the diagnosis of MVI. However, MVI is commonly scattered in the adjacent peri-tumor liver tissues, leading to difficulties in its evaluation. Accordingly, 83.3% of the MVIs were located within 1 cm from the tumor boundary, but also approximately 8.4% were located beyond 2 cm or even further [7]. Therefore, multipoint sampling at the peri-tumor region is necessary to ensure the detection rate of MVI [8, 9]. Nevertheless, in clinical practice, not all patients could obtain sufficient sampling tissues for MVI evaluation. More than 60% of HCC patients received nonsurgical treatment with only biopsy specimens [10]. For patients who receive surgical treatment, the background cirrhosis leads to a considerable portion of them having narrow surgical margins to maintain sufficient remnant liver volume [11]. According to the previous studies, the proportion of HCC patients with margins < 0.5 cm ranged from 43.6 to 44.2% [7]. The MVI status could hardly be evaluated in these patients with limited information on the peri-tumor region, ultimately affecting the treatment decisions and clinical outcomes.

Histological whole slide images (WSIs) of HCC contain a massive amount of biological information. Recent studies demonstrated that some histological features such as macrotrabecular-massive type, cholangiocarcinoma-like and stem cell-like traits were positively correlated with the incidence of vascular invasion [12, 13]. Therefore, quantitative analysis of this information in tumor tissue has the potential to help diagnose MVI without sampling from the peri-tumor region. However, quantitative evaluation of this information is challenging with the naked eyes of pathologists. Deep learning could automatically extract imaging features which are invisible to human observers and potentially provide important clinical, biological and molecular-morphologic information [14]. Previous studies on deep-learning models in other tumors also indicated the possibility of predicting features outside the tumor (such as lymph node metastasis) by evaluating information from tumor areas only [15, 16]. Therefore, constructing a deep-learning model based on the imaging information of tumor areas might have the potential to evaluate MVI status effectively and assist with the clinical management of HCC patients.

In this study, we developed MVI deep-learning (MVI-DL) prediction model in multicenter HCC cohorts, by learning the characteristic information from the tumor areas of histological WSIs, to automatically and accurately evaluate the MVI status.

Materials and methods

Patient cohorts and data preparation

This study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki as reflected in a priori approval by the institution's Human Research Committee. Informed consent was waived since this was a retrospective cohort study.

We retrospectively collected 368 patients from the First Affiliated Hospital of Sun Yat-sen University (FAHSYSU) and 120 patients from the Dongguan People’s Hospital and Shunde Hospital of Southern Medical University (DG-SD) who underwent curative hepatectomy from January 2016 to December 2018. All these patients were pathologically diagnosed with HCC, and all hematoxylin and eosin histological slides were collected to train and validate the MVI prediction model. Slides with poor staining quality or images with artifacts after scanning were excluded. Finally, a total of 2917 WSIs of 350 patients from FAHSYSU were included and were randomly divided into a training set and an independent test set. A total of 504 WSIs of 120 HCC patients from DG-SD formed an external test set (Fig. 1a). In order to ensure the high-quality ground truth labels of the data used for model development and validation, all the resected HCC specimens from the FAHSYSU and DG-SD cohorts had surgical margins over 2 cm, and sufficient postoperative sampling tissues were obtained for MVI evaluation. According to our previous large retrospective study, a threshold of four, six, eight and eight sampling tissues within peri-tumor liver parenchyma were required for evaluating MVI in solitary tumors measuring 1.0–3.0 cm, 3.1–4.9 cm and ≥ 5.0 cm and multiple tumors [8]. Additionally, the histological diagnosis of MVI for each slide was prospectively evaluated based on the consensus of three pathologists with over 5 years of experience in liver pathology.

Additionally, a total of 376 WSIs from 376 HCC patients were obtained from the TCGA database via the Genomic Data Commons (https://gdc.cancer.gov/). Information on recurrence-free survival (RFS) and overall survival (OS) was collected from these patients. Patients without survival data or slides with poor image quality were excluded and finally 304 WSIs of 304 HCC patients were included to evaluate the correlation of predicted MVI results by the MVI-DL model with patients’ survival outcomes (Fig. 1a).

All WSIs were scanned at 40 × magnification by a KF-PRO-020 type of scanning machine (KFBIO, Ningbo, China) and were stored in SVS file format. A pathologist with over 1 year of working experience in liver pathology performed the image quality control to screen out poorly stained slides or had obvious artifacts. To prevent overlaps between datasets, the WSIs from a given patient were kept together in the same set. To extract the information under different magnifications, we divided the WSIs into nonoverlapping 512 × 512 pixel patches at magnifications of 5 × , 10 × , 20 × and 40 × , respectively. Patches with over 50% of background coverage were excluded (Supplementary Methods).

Development of the MVI-DL model

Image sampling and magnification selection

To clarify the contribution of different tissue areas in the WSI to the prediction of MVI, and to further confirm the sampling strategy for model training, we compared the performances of models developed based on different tissue areas. We first trained a segmentation network, details in Supplementary Methods. We then applied the segmentation model to all other WSIs in the FAHSYSU and DG-SD cohorts for automatic segmentation. Prediction models were constructed based on tumor area, peri-tumor area and the whole WSI, respectively, to compare models’ performance with different sampling strategies.

To further determine the optimal number of sampling patches under different magnifications for the prediction model, we performed sensitivity analyses of the number of sampling patches under different magnifications. We also compared the performances of the models under different magnifications with the ensemble model integrating different magnifications to determine the network structure of the final prediction model (MVI-DL model).

Training of the MVI-DL model

The MVI-DL model was constructed based on a weakly supervised multiple-instance learning (MIL) framework [17]. The framework consists of a convolutional neural network (CNN) feature extraction layer, a MIL pooling layer and a fully connected layer. Each WSI obtained a patch bag after tiling, the label of which was the patient’s MVI status (Fig. 1b). We used patch bags and their corresponding labels as the input to train the prediction network. A pre-trained Inception-v4 model was used as the backbone to extract the features of patches. In the MIL pooling layer, we introduced the attention mechanism, aggregated the patch features through the attention score, and finally output the predicted value of the WSI through the fully connected layer (Fig. 1c and Supplementary Methods). We used a fine-tuned set (a part of the training set) to select five optimal prediction models before overfitting and took the average of the five models as the final prediction score (Fig. 2a and Fig. S1, S2). The average of the prediction scores under different magnifications formed the prediction scores of the MVI-DL model.

Visualization of the MVI prediction

To further understand the key histological features that contribute the most to the model prediction of MVI, we extracted the top 4000 and the bottom 4000 patches based on the MVI predictive attention score and then clustered and visualized them using t-SNE and DCCS algorithms [18, 19]. Pathologists reviewed the pathological features in each cluster of patches without being informed of the label or prediction score for each patch. We also applied gradient-weighted class activation mapping (Grad-CAM) [20] to provide an insight into regions within each patch of the corresponding cluster that the MVI-DL model used to generate predictions (Supplementary Methods).

Validation of the MVI-DL model performance

The predictive performance of the MVI-DL model was evaluated in the FAHSYSU test cohort and DG-SD cohort. Clinical information (age, gender, serum AFP level, tumor number and size, BCLC, Edmondson grade and tumor encapsulation) was collected and analysed by multivariable logistic analysis to determine the MVI-associated clinical characteristics. A clinical-MVI-DL model was constructed based on the MVI-DL prediction score and the clinical MVI-associated characteristics. We further compared the performance of the MVI-DL model with MVI-associated clinical characteristics and the clinical-MVI-DL model.

For patients in the FAHSYSU test cohort, the DG-SD cohort and the TCGA cohort, we further divided them by the predicted MVI status from the MVI-DL model. Subsequently, we compared the RFS and OS between patients predicted to be MVI ( +) and MVI (−) to evaluate the correlation between the predicted MVI status and patient’s survival outcomes in these three cohorts independently.

Simulation of the clinical application of the MVI-DL model

Patients with insufficient surgical margin for MVI evaluation

Considering that some HCC patients with narrow surgical margins had insufficient histological sections for MVI evaluation, we simulated a clinical scenario where patients had only one WSI. We randomly selected one WSI for each patient in the FAHSYSU and DG-SD test set as the input and then analysed the predictive performance of the MVI-DL model. Additionally, to further investigate the impact of different number of the WSIs randomly selected from each patient on the prediction performance of the model, we performed 100 rounds of iteration for each point of the number of the WSIs and then performed a sensitivity analysis with these mean values (standard deviation) of each point that calculated from the generated 100 results.

Patients with biopsies only

Considering that a large part of HCC patients could only acquire biopsy specimens, we simulated a clinical scenario where the tissue size of the WSIs were similar to liver biopsy specimens. Therefore, we randomly selected one WSI for each patient in the FAHSYSU and DG-SD test set and then randomly sampled adjacent patches with a similar area of a liver biopsy from each WSI as the one simulated biopsy. Clinically, three biopsies at most were routinely acquired for HCC patients. Therefore, we analysed the predictive performance of the MVI-DL model with one to three simulated biopsies. The detailed simulation method is described in Supplementary Methods.

Statistical analysis

Receiver operating characteristic (ROC) curves were generated to evaluate the performance of the MVI-DL model. Subsequently, the area under the receiver operating characteristic curve (AUC) values were calculated accordingly. A two-sided DeLong test was used to compare the AUCs. An optimal cut-off was determined by the ROC curve to reach the best accuracy, which was 0.58 in this study. The accuracy, sensitivity and specificity were then calculated according to this cut-off for the prediction results (≥ 0.58 as positive, < 0.58 as negative). RFS and OS curves were analysed using the Kaplan–Meier method and compared using the Mantel–Cox log-rank test. Logistic regression analyses were performed to select the MVI-associated characteristics. Each variable was assessed by univariate logistic regression analysis, and variables with a p < 0.05 were enrolled in a stepwise multivariate analysis. The clinical-MVI-DL model was then constructed based on the results of multivariable logistic regression analysis. A two-sided p value less than 0.05 was considered statistically significant. Scikit-learn was used for ROC curve analysis and the calculation of the confusion matrix.

Results

Patient characteristics

We initially obtained 3568 slides from 488 HCC patients across two independent cohorts. A total of 147 (4.1%) slides from 22 patients who did not meet the inclusion criteria were excluded. Finally, in the FAHSYSU cohort, a total of 2917 slides from 350 HCC patients were enrolled, 180 (51.4%) of whom were MVI ( +), while in the DG-SD cohort, 504 slides from 120 HCC patients were enrolled, 44 (36.7%) of whom were MVI ( +). Patients from the FAHSYSU cohort were randomly divided into a training set and an independent test set. Patients from the DG-SD cohort were used as the external test set (Fig. 1a). Baseline clinical and demographic characteristics were generally well balanced between the two cohorts, except for a significantly higher Edmondson grade (p = 0.009) and higher incidence of MVI ( +) (p = 0.015) in the FAHSYSU cohort (Table S1).

Development of the MVI-DL model

Considering that there is a huge amount of information on a single WSI, we first trained a tissue segmentation network to automatically segment all WSIs in the two cohorts (Table S2 and S3) and compared the contribution of different tissue areas on the WSI to MVI prediction to reduce the redundant information. One WSI was divided into the tumor area and peri-tumoral area by a segmentation network (Fig. S3). We tested the performance of our segmentation network, achieving accuracies of 0.960, 0.958, 0.934 and 0.932, and AUCs of 0.991, 0.986, 0.984 and 0.981, under 5 × , 10 × , 20 × and 40 × magnification scales, respectively (Fig. S3).

Then, we used patches from the tumor area, peri-tumoral area and the whole WSI as inputs to construct prediction models for MVI and compared the performances between different tissue categories and different magnification scales. The predictive performances of the models under 40 × magnification were significantly lower than those of other magnifications, with the best AUC of only 0.68, and were excluded from the following analysis. The models using patches from the tumor area as the inputs had significantly higher AUCs than those using patches from the peri-tumoral area or the whole WSI at every magnification scale (all p < 0.001, Fig. 2b).

Next, we evaluated the optimate sample counts for the input patches and select the sample count with the best training efficacy under each magnification scales. We chose 8 patches for the 5 × , 32 patches for the 10 × and 64 patches for the 20 × magnification scales to achieve the best training efficacy and sufficient sampling numbers to represent the features of the tumor area (Fig. 2c). The ensemble model integrating different magnification scales reached an AUC of 0.831 for tumor area and was significantly higher than those under a single magnification scale (p = 0.004, Fig. 2d). Therefore, we took the ensemble model for the tumor area as the backbone model for the following training and validation of the MVI-DL model (Fig. 2a).

Validation of the MVI-DL model

We validated the performances of the MVI-DL model on the FAHSYSU and DG-SD test set (Fig. 3a, b). On the FAHSYSU test set, the AUC of the MVI-DL model was 0.904 (95% CI 0.888–0.920), and the accuracy, sensitivity and specificity were 83.3%, 92.6% and 71.0%, respectively. For the DG-SD cohort, the AUC reached 0.871 (95% CI 0.837–0.905), and its accuracy, sensitivity and specificity were 79.1%, 90.0% and 69.8%, respectively (Table S4).

Furthermore, we also compared the RFS and OS between patients predicted with different MVI status, and the results showed that patients who were predicted to be MVI ( +) had significantly worse survival outcomes than patients with MVI (−) (median RFS: 9.39 vs. 66.15 months, p < 0.001; 13.09 vs. 40.20 months, p < 0.001; 7.23 vs. 36.70 months, p < 0.001 in the FAHSYSU test set, DG-SD cohort and TCGA cohort, respectively) (Fig. 3c). The OS analysis also indicated the similar results (Fig. S4).

To compare the predictive performance of the MVI-DL model to the clinical characteristics and the combined clinical-MVI-DL model, we performed a univariable and multivariable logistic regression analysis of factors associated with MVI in the training set (Table S5). Multivariable analysis revealed that the MVI-DL prediction score (OR 1.07, 95% CI 1.05–1.09, p < 0.0001) was an independent predictive factor of MVI and was higher than the combined clinical score (p < 0.001, Fig. S5). The combination of the clinical score with the MVI-DL model did not improve the predictive performance of the MVI-DL model (p = 0.304 and 0.289 for the FAHSYSU and DG-SD test set, respectively).

Visualization and interpretability of the MVI-DL model

Figure 4a shows two WSIs predicted with MVI ( +) and MVI (−) by the model and the corresponding heatmaps as examples. The unsupervised classification classified the patches into 8 clusters (Fig. 4b). Clusters with over 60% patches from MVI ( +) patients were defined as MVI ( +)-related clusters, while those with over 60% patches from MVI (−) were defined as MVI (−)-related clusters (Fig. 4c). We found that high intratumor heterogeneity (Cluster 2), rich tumor stroma (Cluster 7), and macrotrabecular architecture with a rich blood sinus (Cluster 8) were associated with MVI ( +). In contrary, severe immune infiltration (Cluster 4) and highly differentiated tumor cells (Cluster 5) were associated with MVI (−) (Fig. S6). Grad-CAMs also showed that regions occupied by these features received higher or lower weights (Fig. 4d).

Clinical implementation of the MVI-DL model

Clinically, patients with limited surgical margins or patients who underwent ablation therapy could hardly acquire sufficient histological sections to evaluate MVI. Therefore, we next simulated the implementation of the MVI-DL model in these two clinical scenarios (Fig. 5a). First, we validated the performance of the MVI-DL model using only one WSI from each patient in the FAHSYSU and DG-SD test set to simulate patients with limited surgical margins. The results showed that the AUC was 0.875 (95% CI 0.855–0.895) and 0.837 (95% CI 0.800–0.874), respectively (Fig. 5b). The detailed evaluation metrics are shown in Table S6. Additionally, we also performed a sensitivity analysis to explore the impact of the number of the WSIs per patient fed to the model on the performance of the MVI-DL model, and the results showed that the predictive accuracy of the model did not increase significantly as the number of the input WSIs increased (the AUCs range from 0.849 to 0.878, Fig. S7). Then, we validated the performance of MVI-DL model in simulated biopsy specimens to evaluate the performance of the MVI-DL model in patients with only biopsy tissues. The results showed that the predictive performance positively correlated with the number and length of simulated biopsy tissues (Fig. 5c, Table S6 and Fig. S8), and the AUC reached 0.879 (95% CI 0.853–0.906) by three biopsies and similar trends were shown in the external test set.

Discussion

In this study, we constructed MVI-DL model for MVI evaluation in HCC patients. The MVI-DL model was well-validated in the independent external cohort. We found that the tumor areas contributed the most to the MVI-DL model, indicating that some imaging features of the tumor area were strongly associated with the existence of peri-tumoral MVI. We then identified these histological features by clustering and visualization. We further simulated the clinical scenarios where tissue samples were insufficient for MVI evaluation, and the results showed that the MVI-DL model could accurately diagnose MVI in these scenarios.

The precise histological diagnosis of MVI is critical for managing HCC patients but faces two main challenges in clinical circumstances. On the one hand, multipoint sampling of peri-tumor is essential for accurate diagnosis of MVI [8, 9] but multiplies the diagnosis time of pathologists. On the other hand, a considerable proportion of patients had insufficient peri-tumor regions to evaluate MVI status [10]. To solve these clinical dilemmas, we first retrospectively collected all the WSIs of those HCC patients from two cohorts with high-quality MVI labels, and we then established an MVI-DL model for MVI evaluation with high accuracy and was well validated in external cohorts. By automatically analysing WSIs, the MVI-DL model can reduce the workload of pathologists in MVI evaluation greatly. Furthermore, we verified the model in two simulated clinical scenarios, where the patient had only one single section or only a biopsy specimen and achieved similar performances. Additionally, we also confirmed that the overall predictive accuracy did not increase significantly as the number of the input WSIs increased. These results not only indicated the possibility of predicting MVI status in patients who could not be evaluated before, but also greatly reduce the increased workload caused by multipoint tissue sampling. By applying the MVI-DL model, a pretreatment biopsy could reflect the MVI status and further contribute to various therapeutic decision-making. Physicians could plan to achieve a surgical margin > 1 cm for MVI ( +) patients to reduce postoperative recurrence [7], design safe ablation margins for RFA in MVI ( +) patients [21], and rearrange the priority of the liver transplantation list [22].

Although MVIs are located in the peri-tumoral area, our results showed that the tumor area-based model had the best performance. The scattered distribution of MVI could cause difficulties in constructing a detection model directly from the peri-tumoral area. Not all slides from MVI ( +) patients contain MVIs leads to a high false-positive rate in MVI labels. The tumor area-based MVI-DL model we constructed predicts the existence of MVI by certain MVI-associated histological features in the tumor area instead of trying to find the location of MVI, avoiding the inconsistent labels in different slides. Interestingly, we found that the performances of tumor area-based models were diverse under different magnifications. The 40 × model showed limited predictive efficacy compared with the lower magnifications. The reason could be that images under different magnifications reveal different types of histological features [23]. Images under 40 × magnification better reflect the characteristics of tumor cell morphology and internal cell structure, while images under lower magnifications reflect the tumor cell morphology and the relationship between tumor cells and their surrounding microenvironments, such as tumor-associated stromal cells and tumor-infiltrated lymphocytes. Therefore, we speculate that the relationship of tumor cells and their surrounding microenvironments may be more important for predicting MVI. By assembling models under different magnifications, the final MVI-DL model better combined cell morphology, tumor microenvironment and intercell relationships, and achieved better predictive performance.

This is confirmed by the unsupervised clusters and heatmaps of our study. We found that macrotrabecular architecture with a rich blood sinus, rich tumor stroma, high intratumor heterogeneity, severe immune cell infiltration and highly differentiated tumor cells strongly contributed to the MVI prediction, all of which were features of cell morphology, tumor microenvironment and intercell relationships. Previous studies have shown that macrotrabecular architecture in HCC indicates stronger tumor invasiveness [12, 24]. This type of HCC can express high levels of angiopoietin 2 and vascular endothelial growth factor A to regulate angiogenesis and vascular remodeling and is more likely to develop vascular invasion and metastasis [12, 25]. A rich tumor stroma may promote the production of TGF-β, which directly upregulates the expression of tumor stem cell markers (EpCAM, K19, CD133, etc.), thereby promoting vascular invasion [13]. High intratumoral heterogeneity may also be related to stronger tumor invasiveness. Studies have revealed that high intratumoral heterogeneity affects key cancer pathways and drives phenotypic variation [26], and ultimately promotes tumor progression and metastasis through a complex intercell competition mechanism [27]. In contrast, highly differentiated tumor cells and immune cell infiltration are closely related to the reduction of postoperative tumor recurrence and better prognosis [28,29,30]. Tumor immune cell infiltration has also been proven to be negatively correlated with vascular invasion in colorectal cancer [31]. These results indicated high interpretability and clinical reliability of the MVI-DL model, which are essential for clinical acceptability.

There are still some limitations in this study. First, the MVI-DL model was constructed and validated in three medical centers, all of which were from China. Therefore, population of this study was populated mainly with HBV-related HCC patients. The generalizability of this model to HCC with other etiologies needs further validation. Second, the performance of this model in biopsy specimens was in a simulated scenario. Since that the diagnosis of HCC did not require preoperative biopsy, patients with both preoperative biopsy tissue and confirmed MVI diagnosis from postoperative histology were limited, making it hard to evaluate in this population. Third, this study is a retrospective cohort study, and a large prospective clinical trial is necessary for the implementation of the MVI-DL model in clinical practice.

Conclusions

The efforts presented in our work highlighted the possibility of accurately evaluating the MVI status of HCC patients from the tumor area on the histological slides using a deep-learning model. With the validations on multicenter cohort, the MVI-DL model we developed exhibited excellent accuracy, robustness and considerable clinical interpretability, which might provide an important tool with practical clinical applicability for better patient management.

Availability of data and materials

The external validation of TCGA dataset is publicly available at the TCGA portal (https://portal.gdc.cancer.gov). All other data generated in this study are not publicly available due to patient privacy constraints, but are available upon reasonable request from the corresponding author (Kuang).

Code availability

The codes that were used to train and validate the deep-learning model in the manuscript were available from the corresponding author (Kuang) upon reasonable request.

References

Lim KC, Chow PK, Allen JC, et al. Microvascular invasion is a better predictor of tumor recurrence and overall survival following surgical resection for hepatocellular carcinoma compared to the Milan criteria. Ann Surg. 2011;254:108–113
Article PubMed Google Scholar
Mazzaferro V, Llovet JM, Miceli R, et al. Predicting survival after liver transplantation in patients with hepatocellular carcinoma beyond the Milan criteria: a retrospective, exploratory analysis. Lancet Oncol. 2009;10:35–43
Article PubMed Google Scholar
Sun JJ, Wang K, Zhang CZ, et al. Postoperative adjuvant transcatheter arterial chemoembolization after R0 hepatectomy improves outcomes of patients who have hepatocellular carcinoma with microvascular invasion. Ann Surg Oncol. 2016;23:1344–1351
Article PubMed Google Scholar
Wei W, Jian PE, Li SH, et al. Adjuvant transcatheter arterial chemoembolization after curative resection for hepatocellular carcinoma patients with solitary tumor and microvascular invasion: a randomized clinical trial of efficacy and safety. Cancer Commun. 2018;38:61
Article Google Scholar
Wang Z, Ren Z, Chen Y, et al. Adjuvant transarterial chemoembolization for HBV-related hepatocellular carcinoma after resection: a randomized controlled study. Clin Cancer Res. 2018;24:2074–2081
Article CAS PubMed Google Scholar
Peng Z, Chen S, Xiao H, et al. Microvascular invasion as a predictor of response to treatment with sorafenib and transarterial chemoembolization for recurrent intermediate-stage hepatocellular carcinoma. Radiology. 2019;292:237–247
Article PubMed Google Scholar
Zhou KQ, Sun YF, Cheng JW, et al. Effect of surgical margin on recurrence based on preoperative circulating tumor cell status in hepatocellular carcinoma. EBioMedicine. 2020;62:103107
Article PubMed PubMed Central Google Scholar
Chen L, Chen S, Zhou Q, et al. Microvascular invasion status and its survival impact in hepatocellular carcinoma depend on tissue sampling protocol. Ann Surg Oncol. 2021;28:6747–6757
Article PubMed Google Scholar
Sheng X, Ji Y, Ren GP, et al. A standardized pathological proposal for evaluating microvascular invasion of hepatocellular carcinoma: a multicenter study by LCPGC. Hepatol Int. 2020;14:1034–1047
Article PubMed Google Scholar
Park JW, Chen M, Colombo M, et al. Global patterns of hepatocellular carcinoma management from diagnosis to death: the BRIDGE Study. Liver Int. 2015;35:2155–2166
Article PubMed PubMed Central Google Scholar
Fattovich G, Stroffolini T, Zagni I, et al. Hepatocellular carcinoma in cirrhosis: incidence and risk factors. Gastroenterology. 2004;127:S35–S50
Article PubMed Google Scholar
Ziol M, Pote N, Amaddeo G, et al. Macrotrabecular-massive hepatocellular carcinoma: a distinctive histological subtype with clinical relevance. Hepatology. 2018;68:103–112
Article PubMed Google Scholar
Seok JY, Na DC, Woo HG, et al. A fibrous stromal component in hepatocellular carcinoma reveals a cholangiocarcinoma-like gene expression trait and epithelial-mesenchymal transition. Hepatology. 2012;55:1776–1786
Article CAS PubMed Google Scholar
Niazi M, Parwani AV, Gurcan MN. Digital pathology and artificial intelligence. Lancet Oncol. 2019;20:e253–e261
Article PubMed PubMed Central Google Scholar
Wessels F, Schmitt M, Krieghoff-Henning E, et al. Deep learning approach to predict lymph node metastasis directly from primary tumor histology in prostate cancer. BJU Int. 2021;128:352–360
Article CAS PubMed Google Scholar
Kwak MS, Lee HH, Yang JM, et al. Deep convolutional neural network-based lymph node metastasis prediction for colon cancer using histopathological images. Front Oncol. 2020;10:619803
Article PubMed Google Scholar
Campanella G, Hanna MG, Geneslaw L, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med. 2019;25:1301–1309
Article CAS PubMed PubMed Central Google Scholar
Junjie Z, Donghuan L, Kai M, et al. Deep image clustering with category-style representation. In: Vedaldi A, Bischof H, Brox T, Frahm JM(eds), 2020;54–70.
van der Maaten L. Accelerating t-SNE using tree-based algorithms. J Mach Learn Res. 2014;15:3221–3245
Google Scholar
Selvaraju RR, Cogswell M, Das A et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. 2017 IEEE International Conference on Computer Vision (ICCV) 2017; 618–626.
Santambrogio R, Barabino M, D’Alessandro V, et al. Micronvasive behaviour of single small hepatocellular carcinoma: which treatment? Updates Surg. 2021;73:1359–1369
Article PubMed Google Scholar
Kluger MD, Salceda JA, Laurent A, et al. Liver resection for hepatocellular carcinoma in 313 Western patients: tumor biology and underlying liver rather than tumor size drive prognosis. J Hepatol. 2015;62:1131–1140
Article PubMed Google Scholar
Coudray N, Ocampo PS, Sakellaropoulos T, et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med. 2018;24:1559–1567
Article CAS PubMed Google Scholar
Calderaro J, Couchy G, Imbeaud S, et al. Histological subtypes of hepatocellular carcinoma are related to gene mutations and molecular tumour classification. J Hepatol. 2017;67:727–738
Article CAS PubMed Google Scholar
Hashizume H, Falcon BL, Kuroda T, et al. Complementary actions of inhibitors of angiopoietin-2 and VEGF on tumor angiogenesis and growth. Cancer Res. 2010;70:2213–2223
Article CAS PubMed PubMed Central Google Scholar
Burrell RA, McGranahan N, Bartek J, et al. The causes and consequences of genetic heterogeneity in cancer evolution. Nature. 2013;501:338–345
Article CAS PubMed Google Scholar
Parker TM, Henriques V, Beltran A, et al. Cell competition and tumor heterogeneity. Semin Cancer Biol. 2020;63:1–10
Article PubMed Google Scholar
Zhang Q, Lou Y, Yang J, et al. Integrated multiomic analysis reveals comprehensive tumour heterogeneity and novel immunophenotypic classification in hepatocellular carcinomas. Gut. 2019;68:2019–2031
Article CAS PubMed Google Scholar
Wada Y, Nakashima O, Kutami R, et al. Clinicopathological study on hepatocellular carcinoma with lymphocytic infiltration. Hepatology. 1998;27:407–414
Article CAS PubMed Google Scholar
Unitt E, Marshall A, Gelson W, et al. Tumour lymphocytic infiltrate and recurrence of hepatocellular carcinoma following liver transplantation. J Hepatol. 2006;45:246–253
Article CAS PubMed Google Scholar
Teng MW, Swann JB, Koebel CM, et al. Immune-mediated dormancy: an equilibrium with cancer. J Leukoc Biol. 2008;84:988–993
Article CAS PubMed Google Scholar

Download references

Funding

This work was supported by the National Key Research and Development Program of China (2020AAA0109504 to Kuang), the National Science Fund for Distinguished Young Scholars (81825013 to Kuang), the National Natural Science Foundation of China (81801703 to S. Chen and 81771958 to Kuang), the Guangdong Natural Science Fund for Distinguished Young Scholars (2021B1515020054 to Wang), the Guangdong Natural Science Foundation (2021A1515010450 to S. Chen) and Guangdong Basic and Applied Basic Research Foundation (2019A1515111168 to Xiao).

Author information

Qiaofeng Chen, Han Xiao and Yunquan Gu contributed equally to this work.

Authors and Affiliations

Department of Gastroenterology, the First Affiliated Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, China
Qiaofeng Chen & Sui Peng
Department of Medical Ultrasonics, Institute of Diagnostic and Interventional Ultrasound, the First Affiliated Hospital of Sun Yat-Sen University, No. 58, Zhongshan 2nd Road, Guangzhou, 510080, Guangdong, China
Han Xiao, Wei Wang, Ming Kuang & Shuling Chen
Clinical Trials Unit, the First Affiliated Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, China
Yunquan Gu, Zongpeng Weng, Bin Li & Sui Peng
Department of Pathology, the First Affiliated Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, China
Lihong Wei, Bing Liao & Mengying Hei
Department of Liver and Pancreatobiliary Surgery, Dongguan People’s Hospital, Dongguan, Guangdong, China
Jiali Li
Department of Liver and Pancreatobiliary Surgery, Shunde Hospital of Southern Medical University, Shunde, Guangdong, China
Jie Lin
Department of Liver Surgery, Cancer Center, Institute of Precision Medicine, the First Affiliated Hospital of Sun Yat-Sen University, No. 58, Zhongshan 2nd Road, Guangzhou, 510080, Guangdong, China
Ming Kuang

Authors

Qiaofeng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Han Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Yunquan Gu
View author publications
You can also search for this author in PubMed Google Scholar
Zongpeng Weng
View author publications
You can also search for this author in PubMed Google Scholar
Lihong Wei
View author publications
You can also search for this author in PubMed Google Scholar
Bin Li
View author publications
You can also search for this author in PubMed Google Scholar
Bing Liao
View author publications
You can also search for this author in PubMed Google Scholar
Jiali Li
View author publications
You can also search for this author in PubMed Google Scholar
Jie Lin
View author publications
You can also search for this author in PubMed Google Scholar
Mengying Hei
View author publications
You can also search for this author in PubMed Google Scholar
Sui Peng
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ming Kuang
View author publications
You can also search for this author in PubMed Google Scholar
Shuling Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SC, WW and MK had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. All authors read and approved the final manuscript. Concept and design: SP, SC and MK. Acquisition, analysis, or interpretation of data: QC, HX, YG, ZW, BL, JL, JL and SP. Drafting of the manuscript: QC, HX and YG. Critical revision of the manuscript for important intellectual content: SP, WW, SC and MK. Statistical analysis: QC, HX, YG, ZW and BL. Obtained funding: MK, WW, SC and HX. Administrative, technical, or material support: MK, LW, BL and MH. Supervision: WW, SC and MK.

Corresponding authors

Correspondence to Wei Wang, Ming Kuang or Shuling Chen.

Ethics declarations

Conflict of interest

QC, HX, YG, ZW, LW, BL, BL, JL, JL, MH, SP, WW, MK and SC have declared that no conflict of interest exists.

Ethics approval and consent to participate

The study complied with the Declaration of Helsinki and was reviewed and centrally approved by the ethics committees of the First Affiliated Hospital of Sun Yat-sen University (Approval number: [2021]152).

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 53 KB)

Supplementary file2 (TIF 685 KB)

Supplementary file3 (EPS 1018 KB)

Supplementary file4 (TIF 3592 KB)

Supplementary file5 (EPS 1216 KB)

Supplementary file6 (EPS 959 KB)

Supplementary file7 (TIF 8307 KB)

Supplementary file8 (EPS 1289 KB)

Supplementary file9 (EPS 887 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, Q., Xiao, H., Gu, Y. et al. Deep learning for evaluation of microvascular invasion in hepatocellular carcinoma from tumor areas of histology images. Hepatol Int 16, 590–602 (2022). https://doi.org/10.1007/s12072-022-10323-w

Download citation

Received: 22 October 2021
Accepted: 16 February 2022
Published: 28 March 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s12072-022-10323-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Deep learning for evaluation of microvascular invasion in hepatocellular carcinoma from tumor areas of histology images

Abstract

Background

Methods

Results

Conclusion

Graphical abstract

Similar content being viewed by others

Introduction

Materials and methods

Patient cohorts and data preparation

Development of the MVI-DL model

Image sampling and magnification selection

Training of the MVI-DL model

Visualization of the MVI prediction

Validation of the MVI-DL model performance

Simulation of the clinical application of the MVI-DL model

Patients with insufficient surgical margin for MVI evaluation

Patients with biopsies only

Statistical analysis

Results

Patient characteristics

Development of the MVI-DL model

Validation of the MVI-DL model

Visualization and interpretability of the MVI-DL model

Clinical implementation of the MVI-DL model

Discussion

Conclusions

Availability of data and materials

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethics approval and consent to participate

Consent for publication

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation