BM-SMIL: A Breast Cancer Molecular Subtype Prediction Framework from H&E Slides with Self-supervised Pretraining and Multi-instance Learning

Shang, Zihao; Liu, Hong; Wang, Kuansong; Wang, Xiangdong

doi:10.1007/978-3-031-45087-7_9

Zihao Shang^13,15,
Hong Liu¹³,
Kuansong Wang¹⁴ &
…
Xiangdong Wang¹³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14243))

Included in the following conference series:

International Workshop on Computational Mathematics Modeling in Cancer Analysis

259 Accesses

Abstract

Breast cancer is the most commonly diagnosed cancer, and accurate molecular subtype prediction plays a crucial role in determining treatment strategies. While immunohistochemistry (IHC) is commonly used for molecular subtype diagnosis, it suffers from cost and labor limitations. The prediction of molecular subtypes using hematoxylin and eosin (H&E) stained slides has gained importance. However, the task is challenged by limited samples, weak annotations, and strong tumor heterogeneity. This paper proposes a scalable framework, BM-SMIL, for molecular subtype prediction of breast cancer based on self-supervised pretraining and multi-instance learning. Firstly, a self-supervised pretraining framework utilizing multi-scale knowledge distillation is introduced to obtain a representative patch encoder. Then, an attention-based instance selection strategy is employed to filter out noise instances. Finally, a Transformer integrated with subtype contrastive loss is proposed for effective aggregation and WSI-level prediction. Experimental results on the dataset from cooperative hospital demonstrate the effectiveness of our proposed framework. The BM-SMIL framework has the potential to enhance molecular subtype prediction performance and can be extended to other pathology image classification tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Pusztai, L., Mazouni, C., Anderson, K., et al.: Molecular classification of breast cancer: limitations and potential. Oncologist 11(8), 868–877 (2006)
Article Google Scholar
Sengal, A.T., Haj-Mukhtar, N.S., Elhaj, A.M., et al.: Immunohistochemistry defined subtypes of breast cancer in 678 Sudanese and Eritrean women; hospitals based case series. BMC Cancer 17(1), 1–9 (2017)
Article Google Scholar
Rawat, R.R., Ortega, I., Roy, P., et al.: Deep learned tissue “fingerprints” classify breast cancers by ER/PR/Her2 status from H&E images. Sci. Rep. 10(1), 1–13 (2020)
Article Google Scholar
Jaber, M.I., Song, B., Taylor, C., et al.: A deep learning image-based intrinsic molecular subtype classifier of breast tumors reveals tumor heterogeneity that may affect survival. Breast Cancer Res. 22(1), 1–10 (2020)
Article Google Scholar
Liu, H., Xu, W.D., Shang, Z.H., et al.: Breast cancer molecular subtype prediction on pathological images with discriminative patch selection and multi-instance learning. Front. Oncol. 12, 858453 (2022)
Article Google Scholar
Han, B., Yao, Q., Yu, X., et al.: Co-teaching: robust training of deep neural networks with extremely noisy labels. In: Advances in Neural Information Processing Systems, vol. 31 (2018)
Google Scholar
Breunig, M.M., Kriegel, H.P., Ng, R.T., et al.: LOF: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 93–104 (2000)
Google Scholar
He, K., Fan, H., Wu, Y., et al.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020)
Google Scholar
Chen, T., Kornblith, S., Norouzi, M., et al.: A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning, vol. 119, pp. 1597–1607. JMLR.org, (2020)
Google Scholar
Caron, M., Touvron, H., Misra, I., et al.: Emerging properties in self-supervised vision transformers. In: ICCV 2021 - International Conference on Computer Vision, p. 1 (2021)
Google Scholar
Grill, J.B., Strub, F., Altché, F., et al.: Bootstrap your own latent-a new approach to self-supervised learning. In: Advances in Neural Information Processing Systems, vol. 33, pp. 21271–21284 (2020)
Google Scholar
He, K., Chen, X., Xie, S., et al.: Masked autoencoders are scalable vision learners. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16000–16009 (2022)
Google Scholar
Doersch, C., Gupta, A., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1422–1430 (2015)
Google Scholar
Li, B., Li, Y., Eliceiri, K.W.: Dual-stream multiple instance learning network for whole slide image classification with self-supervised contrastive learning. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14313–14323 (2021)
Google Scholar
Li, J., Lin, T., Xu, Y.: SSLP: spatial guided self-supervised learning on pathological images. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12902, pp. 3–12. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87196-3_1
Chapter Google Scholar
Wang, X., Yang, S., Zhang, J., et al.: Transformer-based unsupervised contrastive learning for histopathological image classification. Med. Image Anal. 81, 102559 (2022)
Article Google Scholar
Lerousseau, M., et al.: Weakly supervised multiple instance learning histopathological tumor segmentation. In: Martel, A.L., et al. (eds.) MICCAI 2020. LNCS, vol. 12265, pp. 470–479. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59722-1_45
Chapter Google Scholar
Campanella, G., Hanna, M., Geneslaw, L., et al.: Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 25, 1 (2019)
Article Google Scholar
Tomita, N., Abdollahi, B., Wei, J., et al.: Attention-based deep neural networks for detection of cancerous and precancerous esophagus tissue on histopathological slides. JAMA Netw. OpenNetw. Open 2(11), e1914645 (2019)
Article Google Scholar
Hashimoto, N., Fukushima, D., Koga, R., et al.: Multi-scale domain-adversarial multiple-instance CNN for cancer subtype classification with unannotated histopathological images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3852–3861 (2020)
Google Scholar
Ilse, M., Tomczak, J., Welling, M.: Attention-based deep multiple instance learning. In: Dy, J., Krause, A.: Proceedings of the 35th International Conference on Machine Learning, vol. 80, pp. 2127–2136. PMLR (2018)
Google Scholar
Shao, Z., Bian, H., Chen, Y., et al.: TransMIL: transformer based correlated multiple instance learning for whole slide image classification. In: Advances in Neural Information Processing Systems (2022)
Google Scholar
Lu, M.Y., Williamson, D.F., Chen, T.Y., et al.: Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 5(6), 555–570 (2021)
Article Google Scholar
Yu, S., et al.: Mil-vt: Multiple instance learning enhanced vision transformer for fundus image classification. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12908, pp. 45–54. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87237-3_5
Chapter Google Scholar
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 815–823 (2015)
Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern.Cybern. 9(1), 62–66 (1979)
Article Google Scholar
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. In: International Conference on Learning Representations (2022)
Google Scholar

Download references

Acknowledgement

This work was supported by the National Natural Science Foundation of China (62276250).

Author information

Authors and Affiliations

Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Zihao Shang, Hong Liu & Xiangdong Wang
Department of Pathology, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, People’s Republic of China
Kuansong Wang
University of Chinese Academy of Sciences, Beijing, 100086, China
Zihao Shang

Authors

Zihao Shang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Kuansong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiangdong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hong Liu .

Editor information

Editors and Affiliations

Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Wenjian Qin
College of Information Technology, United Arab Emirates University, Al Ain, United Arab Emirates
Nazar Zaki
Beijing Institute of Technology, Beijing, China
Fa Zhang
Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Jia Wu
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Fan Yang
University of Cambridge, Cambridge, UK
Chao Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shang, Z., Liu, H., Wang, K., Wang, X. (2023). BM-SMIL: A Breast Cancer Molecular Subtype Prediction Framework from H&E Slides with Self-supervised Pretraining and Multi-instance Learning. In: Qin, W., Zaki, N., Zhang, F., Wu, J., Yang, F., Li, C. (eds) Computational Mathematics Modeling in Cancer Analysis. CMMCA 2023. Lecture Notes in Computer Science, vol 14243. Springer, Cham. https://doi.org/10.1007/978-3-031-45087-7_9

Download citation

DOI: https://doi.org/10.1007/978-3-031-45087-7_9
Published: 08 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45086-0
Online ISBN: 978-3-031-45087-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)