A Unified Knowledge Transfer Network for Generalized Category Discovery

Authors

  • Wenkai Shi School of Automation Science and Engineering, Xi’an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Wenbin An School of Automation Science and Engineering, Xi’an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Feng Tian School of Computer Science and Technology, Xi’an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Yan Chen School of Computer Science and Technology, Xi’an Jiaotong University
  • Yaqiang Wu Lenovo Research
  • Qianying Wang Lenovo Research
  • Ping Chen Department of Engineering, University of Massachusetts Boston

DOI:

https://doi.org/10.1609/aaai.v38i17.29862

Keywords:

NLP: Text Classification, NLP: Applications

Abstract

Generalized Category Discovery (GCD) aims to recognize both known and novel categories in an unlabeled dataset by leveraging another labeled dataset with only known categories. Without considering knowledge transfer from known to novel categories, current methods usually perform poorly on novel categories due to the lack of corresponding supervision. To mitigate this issue, we propose a unified Knowledge Transfer Network (KTN), which solves two obstacles to knowledge transfer in GCD. First, the mixture of known and novel categories in unlabeled data makes it difficult to identify transfer candidates (i.e., samples with novel categories). For this, we propose an entropy-based method that leverages knowledge in the pre-trained classifier to differentiate known and novel categories without requiring extra data or parameters. Second, the lack of prior knowledge of novel categories presents challenges in quantifying semantic relationships between categories to decide the transfer weights. For this, we model different categories with prototypes and treat their similarities as transfer weights to measure the semantic similarities between categories. On the basis of two treatments, we transfer knowledge from known to novel categories by conducting pre-adjustment of logits and post-adjustment of labels for transfer candidates based on the transfer weights between different categories. With the weighted adjustment, KTN can generate more accurate pseudo-labels for unlabeled data, which helps to learn more discriminative features and boost model performance on novel categories. Extensive experiments show that our method outperforms state-of-the-art models on all evaluation metrics across multiple benchmark datasets. Furthermore, different from previous clustering-based methods that can only work offline with abundant data, KTN can be deployed online conveniently with faster inference speed. Code and data are available at https://github.com/yibai-shi/KTN.

Published

2024-03-24

How to Cite

Shi, W., An, W., Tian, F., Chen, Y., Wu, Y., Wang, Q., & Chen, P. (2024). A Unified Knowledge Transfer Network for Generalized Category Discovery. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18961-18969. https://doi.org/10.1609/aaai.v38i17.29862

Issue

Section

AAAI Technical Track on Natural Language Processing II