Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection

Authors

  • Yucheng Han Nanyang Technological University
  • Na Zhao Singapore University of Technology and Design
  • Weiling Chen Hyundai Motor Group Innovation Center in Singapore
  • Keng Teck Ma Hyundai Motor Group Innovation Center in Singapore
  • Hanwang Zhang Nanyang Technological University

DOI:

https://doi.org/10.1609/aaai.v38i3.27976

Keywords:

CV: 3D Computer Vision, CV: Object Detection & Categorization, CV: Representation Learning for Vision, ML: Semi-Supervised Learning

Abstract

Semi-supervised 3D object detection is a promising yet under-explored direction to reduce data annotation costs, especially for cluttered indoor scenes. A few prior works, such as SESS and 3DIoUMatch, attempt to solve this task by utilizing a teacher model to generate pseudo-labels for unlabeled samples. However, the availability of unlabeled samples in the 3D domain is relatively limited compared to its 2D counterpart due to the greater effort required to collect 3D data. Moreover, the loose consistency regularization in SESS and restricted pseudo-label selection strategy in 3DIoUMatch lead to either low-quality supervision or a limited amount of pseudo labels. To address these issues, we present a novel Dual-Perspective Knowledge Enrichment approach named DPKE for semi-supervised 3D object detection. Our DPKE enriches the knowledge of limited training data, particularly unlabeled data, from two perspectives: data-perspective and feature-perspective. Specifically, from the data-perspective, we propose a class-probabilistic data augmentation method that augments the input data with additional instances based on the varying distribution of class probabilities. Our DPKE achieves feature-perspective knowledge enrichment by designing a geometry-aware feature matching method that regularizes feature-level similarity between object proposals from the student and teacher models. Extensive experiments on the two benchmark datasets demonstrate that our DPKE achieves superior performance over existing state-of-the-art approaches under various label ratio conditions. The source code and models will be made available to the public.

Published

2024-03-24

How to Cite

Han, Y., Zhao, N., Chen, W., Ma, K. T., & Zhang, H. (2024). Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2049-2057. https://doi.org/10.1609/aaai.v38i3.27976

Issue

Section

AAAI Technical Track on Computer Vision II