Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

Authors

  • Xiaoyi Bao School of Artificial Intelligence, University of Chinese Academy of Sciences Institute of Automation, Chinese Academy of Sciences Alibaba Group
  • Jie Qin School of Artificial Intelligence, University of Chinese Academy of Sciences Institute of Automation, Chinese Academy of Sciences
  • Siyang Sun Alibaba Group
  • Xingang Wang Institute of Automation, Chinese Academy of Sciences
  • Yun Zheng Alibaba Group

DOI:

https://doi.org/10.1609/aaai.v38i2.27834

Keywords:

CV: Segmentation

Abstract

For few-shot semantic segmentation, the primary task is to extract class-specific intrinsic information from limited labeled data. However, the semantic ambiguity and inter-class similarity of previous methods limit the accuracy of pixel-level foreground-background classification. To alleviate these issues, we propose the Relevant Intrinsic Feature Enhancement Network (RiFeNet). To improve the semantic consistency of foreground instances, we propose an unlabeled branch as an efficient data utilization method, which teaches the model how to extract intrinsic features robust to intra-class differences. Notably, during testing, the proposed unlabeled branch is excluded without extra unlabeled data and computation. Furthermore, we extend the inter-class variability between foreground and background by proposing a novel multi-level prototype generation and interaction module. The different-grained complementarity between global and local prototypes allows for better distinction between similar categories. The qualitative and quantitative performance of RiFeNet surpasses the state-of-the-art methods on PASCAL-5i and COCO benchmarks.

Published

2024-03-24

How to Cite

Bao, X., Qin, J., Sun, S., Wang, X., & Zheng, Y. (2024). Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 765-773. https://doi.org/10.1609/aaai.v38i2.27834

Issue

Section

AAAI Technical Track on Computer Vision I