Abstract
Prostate cancer (PCa) is the second most common cancer in men worldwide. The Gleason score, determined by pathologists through microscopic examination of pathological tissue, is the most powerful prognostic indicator for patients diagnosed with prostate cancer. However, there is inter- and intra-observer variability among pathologists, and the scoring process imposes a significant workload on them. This study presents a deep learning model that utilizes gigapixel pathology images and slide-level labels for prostate cancer detection and Gleason grading. WSIs are first cropped into small patches, and a deep learning model trained using self-supervised learning is used to extract features from the patches. An attention-LSTM is then used to aggregate features and perform the final classification. Our pipeline for Gleason grading, which utilizes a 6-level classification system, achieves a quadratic weighted kappa of 0.930 on the internal test dataset (n = 531). Furthermore, it demonstrates good generalization on the external test dataset (n = 155) with a quadratic weighted kappa of 0.8668. The proposed model integrating a self-supervised feature extraction model and an attention LSTM aggregator, is one of the most advanced scoring methods for grading prostate cancer using only slide-level labels. It represents a significant advancement in automating the extraction of prostate cancer grading information in digital diagnostic pathology and will help to address the critical shortage of pathologists.
Similar content being viewed by others
Data Availibility
Data used in the present study are publicly available, and ethical approval and informed consent were obtained in each original study.
Code Availability
References
Siegel RL, Miller KD, Fuchs HE, Jemal A (2022) Cancer statistics, 2022. CA: A Cancer Journal for Clinicians 72:7–33. https://doi.org/10.3322/caac.21708
Ryu HS, Jin M-S, Park JH, Lee S, Cho J, Oh S, Kwak T-Y, Woo JI, Mun Y, Kim SW, Hwang S, Shin S-J, Chang H (2019) Automated gleason scoring and tumor quantification in prostate core needle biopsy images using deep neural networks and its comparison with pathologist-based assessment. Cancers 11:1860. https://doi.org/10.3390/cancers11121860
Bulten W, Pinckaers H, Boven H, Vink R, Bel T, Ginneken B, Laak J, Kaa CH-v, Litjens G (2020) Automated deep-learning system for gleason grading of prostate cancer using biopsies: a diagnostic study. Lancet Oncol 21:233–241. https://doi.org/10.1016/S1470-2045(19)30739-9
Tellez D, Litjens G, Laak J, Ciompi F (2021) Neural image compression for gigapixel histopathology image analysis. IEEE Transactions on pattern analysis and machine intelligence 43:567–578. https://doi.org/10.1109/TPAMI.2019.2936841
Wang J, Mao X, Wang Y, Tao X, Chu J, Li Q (2023) Automatic generation of pathological benchmark dataset from hyperspectral images of double stained tissues. OpticsOptics & Laser Technology 163:109331. https://doi.org/10.1016/j.optlastec.2023.109331
Li J, Weng Z, Xu H, Zhang Z, Miao H, Chen W, Liu Z, Zhang X, Wang M, Xu X, Ye Q (2018) Support vector machines (svm) classification of prostate cancer gleason score in central gland using multiparametric magnetic resonance images: a cross-validated study. European J Radiol 98:61–67. https://doi.org/10.1016/j.ejrad.2017.11.001
Toth R, Schiffmann H, Hube-Magg C, Büscheck F, Gerhuser C (2019) Random forest-based modelling to detect biomarkers for prostate cancer progression. Clin Epigenetics 11(1):148
Karimi D, Nir G, Fazli L, Black PC, Goldenberg L, Salcudean SE (2020) Deep learning-based gleason grading of prostate cancer from histopathology images-role of multiscale decision aggregation and data augmentation. IEEE J Biomed Health Inform 24(5):1413–1426
Duong LT, Le NH, Tran TB, Ngo VM, Nguyen PT (2021) Detection of tuberculosis from chest x-ray images: boosting the performance with vision transformer and transfer learning. Expert Syst Appl 184:115519. https://doi.org/10.1016/j.eswa.2021.115519
Lucas M, Jansen I, Savci-Heijink CD, Meijer SL, Boer OJ, Leeuwen TG, Bruin DM, Marquering HA (2019) Deep learning for automatic gleason pattern classification for grade group determination of prostate biopsies. Virchows Archiv 475:77–83. https://doi.org/10.1007/s00428-019-02577-x
Otálora S, Marini N, Müller H, Atzori M (2021) Combining weakly and strongly supervised learning improves strong supervision in gleason pattern classification. BMC Med Imaging 21:77. https://doi.org/10.1186/s12880-021-00609-0
Silva-Rodríguez J, Colomer A, Dolz J, Naranjo V (2021) Self-learning for weakly supervised gleason grading of local patterns. IEEE J Biomed Health Inform 25:3094–3104
Nagpal K, Foote D, Liu Y, Po-Hsuan Chen Wulczyn E, Tan F, Olson N, Smith JL, Mohtashamian A (2018) Development and validation of a deep learning algorithm for improving gleason scoring of prostate cancer
Arvaniti E, Fricker KS, Moret M, Rupp N, Fankhauser C, Hermanns T, Wey N, Wild P, Rüschoff J, Claassen M (2018) Automated gleason grading of prostate cancer tissue microarrays via deep learning. European Urology Supplements 17(14):3020–3021
Bulten W, Balkenhol M, Belinga J-JA, Brilhante A, Çakıc A, Egevad L, Eklund M, Farré X, Geronatsiou K, Molinié V, Pereira G, Roy P, Saile G, Salles P, Schaafsma E, Tschui J, Vos A-M, Panel IPIE, Boven H, Vink R, Laak J, Kaa CH-v, Litjens G, Delahunt B, Samaratunga H, Grignon DJ, Evans AJ, M.Berney D, Pan C-C, Kristiansen G, Kench JG, Oxley J, Leite KRM, McKenney JK, Humphrey PA, Fine SW, Tsuzuki T, Varma M, Zhou M, Comperat E, Bostwick DG, Iczkowski KA, Magi-Galluzzi C, Srigley JR, Takahashi H, Kwast T (2021) Artificial intelligence assistance significantly improves gleason grading of prostate biopsies by pathologists. Modern Pathology 34:660–671. https://doi.org/10.1038/s41379-020-0640-y
Silva-Rodríguez J, Colomer A, Sales MA, Molina R, Naranjo V (2020) Going deeper through the gleason scoring scale: an automatic end-to-end system for histology prostate grading and cribriform pattern detection. Comput Methods Programs Biomed 195:105637 https://doi.org/10.1016/j.cmpb.2020.105637
Bulten W, Kartasalo K, Chen P-HC, Ström P, Pinckaers H, Nagpal K, Cai Y, Steiner DF, Boven H, Vink R, Kaa CH-v, Laak J, Amin MB, Evans AJ, Kwast T, Allan R, Humphrey PA, Grönberg H, Samaratunga H, Delahunt B, Tsuzuki T, Häkkinen T, Egevad L, Demkin M, Dane S, Tan F, Valkonen M, Corrado GS, Peng L, Mermel CH, Ruusuvuori P, Litjens G, Eklund M, Brilhante A, Çakır A, Farré X, Geronatsiou K, Molinié V, Pereira G, Roy P, Saile G, Salles PGO, Schaafsma E, Tschui J, Billoch-Lima J, Pereira EM, Zhou M, He S, Song S, Sun Q, Yoshihara H, Yamaguchi T, Ono K, Shen T, Ji J, Roussel A, Zhou K, Chai T, Weng N, Grechka D, Shugaev MV, Kiminya R, Kovalev V, Voynov D, Malyshev V, Lapo E, Campos M, Ota N, Yamaoka S, Fujimoto Y, Yoshioka K, Juvonen J, Tukiainen M, Karlsson A, Guo R, Hsieh C-L, Zubarev I, Bukhar HST, Li W, Li J, Speier W, Arnold C, Kim K, Bae B, Kim YW, Lee H-S, Park J (2022) Artificial intelligence for diagnosis and gleason grading of prostate cancer: the panda challenge. Nature Medicine 28:154–163. https://doi.org/10.1038/s41591-021-01620-2
Mun Y, Paik I, Shin S-J, Kwak T-Y, Chang H (2021) Yet another automated gleason grading system (yaaggs) by weakly supervised deep learning. npj Digital Medicine 4:99. https://doi.org/10.1038/s41746-021-00469-6
Hammouda K, Khalifa F, Alghamdi NS, Darwish HE, El-Baz AS (2022) Multi-stage classification-based deep learning for gleason system grading using histopathological images. Cancers 14
Li W, Li J, Sarma KV, Ho KC, Shen S, Knudsen BS, Gertych A, Arnold CW (2019) Path r-cnn for prostate cancer diagnosis and gleason grading of histological images. IEEE Transactions on Medical Imaging 38:945–954 https://doi.org/10.1109/TMI.2018.2875868
Duran-Lopez L, Dominguez-Morales JP, Gutierrez-Galan D, Rios-Navarro A, Jimenez-Fernandez A, Vicente-Diaz S, Linares-Barranco A (2021) Wide & deep neural network model for patch aggregation in cnn-based prostate cancer detection systems. Comput Biol Med 136:104743. https://doi.org/10.1016/j.compbiomed.2021.104743
Xiang J, Wang X, Wang X, Zhang J, Yang S, Yang W, Han X, Liu Y (2023) Automatic diagnosis and grading of prostate cancer with weakly supervised learning on whole slide images. Comput Biol Med 152:106340. https://doi.org/10.1016/j.compbiomed.2022.106340
Pinckaers H, Bulten W, Laak J, Litjens G (2021) Detection of prostate cancer in whole-slide images through end-to-end training with image-level labels. IEEE Transactions on medical imaging 40:1817–1826. https://doi.org/10.1109/TMI.2021.3066295
Campanella G, Hanna MG, Geneslaw L, Miraflor AP, Silva VWK, Busam KJ, Brogi E, Reuter VE, Klimstra DS, Fuchs TJ (2019) Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med pp 1–9
Yao J, Zhu X, Jonnagaddala J, Hawkins N, Huang J (2020) Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks. Med Image Anal 65: 101789. https://doi.org/10.1016/j.media.2020.101789
Lu MY, Williamson DFK, Chen TY, Chen RJ, Mahmood F (2021) Data-efficient and weakly supervised computational pathology on whole-slide images. Nat Biomed Eng pp 1–16
Pasa F, Golkov V, Pfeiffer F, Cremers D, Pfeiffer D (2019) Efficient deep network architectures for fast chest x-ray tuberculosis screening and visualization. Scientific Reports 9:6268. https://doi.org/10.1038/s41598-019-42557-4
Foysal Haque K, Farhan Haque F, Gandy L, Abdelgawad A (2020) Automatic detection of covid-19 from chest x-ray images with convolutional neural networks. In: 2020 International conference on computing, electronics & communications engineering (iCCECE), pp 125–130. https://doi.org/10.1109/iCCECE49321.2020.9231235
Wang X, Du Y, Yang S, Zhang J, Wang M, Zhang J, Yang W, Huang J, Han X (2023) Retccl: clustering-guided contrastive learning for whole-slide image retrieval. Med Image Anal 83:102645. https://doi.org/10.1016/j.media.2022.102645
Wang X, Yang S, Zhang J, Wang M, Zhang J, Yang W, Huang J, Han X (2022) Transformer-based unsupervised contrastive learning for histopathological image classification. Med Image Anal 81: 102559. https://doi.org/10.1016/j.media.2022.102559
Sun L, Zhou M, Li Q, Hu M, Wen Y, Zhang J, Lu Y, Chu J (2022) Diagnosis of cholangiocarcinoma from microscopic hyperspectral pathological dataset by deep convolution neural networks. Methods 202:22–30. https://doi.org/10.1016/j.ymeth.2021.04.005
Wang Q, Sun L, Wang Y, Zhou M, Hu M, Chen J, Wen Y, Li Q (2021) Identification of melanoma from hyperspectral pathology image using 3d convolutional networks. IEEE Transactions on medical imaging 40:218–227. https://doi.org/10.1109/TMI.2020.3024923
Lu MY, Williamson DFK, Chen TY, Chen RJ, Barbieri M, Mahmood F (2021) Data-efficient and weakly supervised computational pathology on whole-slide images. Nat Biomed Eng 5:555–570. https://doi.org/10.1038/s41551-020-00682-w
Ilse M, Tomczak JM, Welling M (2018) Attention-based deep multiple instance learning. In: International conference on machine learning. https://api.semanticscholar.org/CorpusID:3537028
Bilal M, Jewsbury R, Wang R, AlGhamdi HM, Asif A, Eastwood M, Rajpoot N (2023) An aggregation of aggregation methods in computational pathology. Med Image Anal 88:102885. https://doi.org/10.1016/j.media.2023.102885
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Li B, Li Y, Eliceiri KW (2021) Dual-stream multiple instance learning network for whole slide image classification with self-supervised contrastive learning. In: 2021 IEEE/CVF Conference on computer vision and pattern recognition (CVPR), pp 14313–14323. https://doi.org/10.1109/CVPR46437.2021.01409
Singhal N, Soni S, Bonthu S, Chattopadhyay N, Samanta P, Joshi U, Jojera A, Chharchhodawala T, Agarwal A, Desai M, Ganpule A (2022) A deep learning system for prostate cancer diagnosis and grading in whole slide images of core needle biopsies. Sci Rep 12(1):3383. https://doi.org/10.1038/s41598-022-07217-0
Pati P, Jaume G, Ayadi Z, Thandiackal K, Bozorgtabar B, Gabrani M, Goksel O (2023) Weakly supervised joint whole-slide segmentation and classification in prostate cancer. Med Image Anal pp102915. https://doi.org/10.1016/j.media.2023.102915
Bilal M, Raza SEA, Azam A, Graham S, Ilyas M, Cree IA, Snead D, Minhas F, Rajpoot NM (2021) Development and validation of a weakly supervised deep learning framework to predict the status of molecular pathways and key mutations in colorectal cancer from routine histology images: a retrospective study. The Lancet. Digital health 3(12):763–772
Acknowledgements
The authors would like to thank the Radboud University Medical Center and Karolinska Institute for making the PANDA dataset publicly available. This work was not supported by any research funding. The authors would also like to express their gratitude to Mr. Lu and Mr. Jiang for their meticulous guidance and outstanding technical support. I also need to thank my colleague, Mrs. Xie, for her guidance and valuable advice on my writing.
Author information
Authors and Affiliations
Contributions
Liang Ma and Hao chen performed the data analysis and wrote the manuscript. Liang Ma, Hao chen and Ming Gong conduct the experiment together.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liang, M., Hao, C. & Ming, G. Prostate cancer grade using self-supervised learning and novel feature aggregator based on weakly-labeled gbit-pixel pathology images. Appl Intell 54, 871–885 (2024). https://doi.org/10.1007/s10489-023-05224-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-023-05224-w