Hypercorrelation Evolution for Video Class-Incremental Learning

Authors

  • Sen Liang University of Science and Technology of China
  • Kai Zhu University of Science and Technology of China
  • Wei Zhai University of Science and Technology of China
  • Zhiheng Liu University of Science and Technology of China
  • Yang Cao University of Science and Technology of China Institute of Artificial Intelligence, Hefei Comprehensive National Science Center

DOI:

https://doi.org/10.1609/aaai.v38i4.28117

Keywords:

CV: Video Understanding & Activity Analysis, ML: Life-Long and Continual Learning, ML: Transfer, Domain Adaptation, Multi-Task Learning

Abstract

Video class-incremental learning aims to recognize new actions while restricting the catastrophic forgetting of old ones, whose representative samples can only be saved in limited memory. Semantically variable subactions are susceptible to class confusion due to data imbalance. While existing methods address the problem by estimating and distilling the spatio-temporal knowledge, we further explores that the refinement of hierarchical correlations is crucial for the alignment of spatio-temporal features. To enhance the adaptability on evolved actions, we proposes a hierarchical aggregation strategy, in which hierarchical matching matrices are combined and jointly optimized to selectively store and retrieve relevant features from previous tasks. Meanwhile, a correlation refinement mechanism is presented to reinforce the bias on informative exemplars according to online hypercorrelation distribution. Experimental results demonstrate the effectiveness of the proposed method on three standard video class-incremental learning benchmarks, outperforming state-of-the-art methods. Code is available at: https://github.com/Lsen991031/HCE

Published

2024-03-24

How to Cite

Liang, S., Zhu, K., Zhai, W., Liu, Z., & Cao, Y. (2024). Hypercorrelation Evolution for Video Class-Incremental Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3315-3323. https://doi.org/10.1609/aaai.v38i4.28117

Issue

Section

AAAI Technical Track on Computer Vision III