H^2-MIL: Exploring Hierarchical Representation with Heterogeneous Multiple Instance Learning for Whole Slide Image Analysis

Authors

  • Wentai Hou Xiamen University
  • Lequan Yu The University of Hong Kong
  • Chengxuan Lin Xiamen University
  • Helong Huang Xiamen University
  • Rongshan Yu Xiamen University
  • Jing Qin The Hong Kong Polytechnic University
  • Liansheng Wang Xiamen University

DOI:

https://doi.org/10.1609/aaai.v36i1.19976

Keywords:

Computer Vision (CV), Machine Learning (ML)

Abstract

Current representation learning methods for whole slide image (WSI) with pyramidal resolutions are inherently homogeneous and flat, which cannot fully exploit the multiscale and heterogeneous diagnostic information of different structures for comprehensive analysis. This paper presents a novel graph neural network-based multiple instance learning framework (i.e., H^2-MIL) to learn hierarchical representation from a heterogeneous graph with different resolutions for WSI analysis. A heterogeneous graph with the “resolution” attribute is constructed to explicitly model the feature and spatial-scaling relationship of multi-resolution patches. We then design a novel resolution-aware attention convolution (RAConv) block to learn compact yet discriminative representation from the graph, which tackles the heterogeneity of node neighbors with different resolutions and yields more reliable message passing. More importantly, to explore the task-related structured information of WSI pyramid, we elaborately design a novel iterative hierarchical pooling (IHPool) module to progressively aggregate the heterogeneous graph based on scaling relationships of different nodes. We evaluated our method on two public WSI datasets from the TCGA project, i.e., esophageal cancer and kidney cancer. Experimental results show that our method clearly outperforms the state-of-the-art methods on both tumor typing and staging tasks.

Downloads

Published

2022-06-28

How to Cite

Hou, W., Yu, L., Lin, C., Huang, H., Yu, R., Qin, J., & Wang, L. (2022). H^2-MIL: Exploring Hierarchical Representation with Heterogeneous Multiple Instance Learning for Whole Slide Image Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 933-941. https://doi.org/10.1609/aaai.v36i1.19976

Issue

Section

AAAI Technical Track on Computer Vision I