AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation

Tang, Zihao; Lv, Zheqi; Zhang, Shengyu; Zhou, Yifan; Duan, Xinyu; Wu, Fei; Kuang, Kun

Computer Science > Machine Learning

arXiv:2403.07030 (cs)

[Submitted on 11 Mar 2024 (v1), last revised 18 Mar 2024 (this version, v2)]

Title:AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation

Authors:Zihao Tang, Zheqi Lv, Shengyu Zhang, Yifan Zhou, Xinyu Duan, Fei Wu, Kun Kuang

View PDF HTML (experimental)

Abstract:Due to privacy or patent concerns, a growing number of large models are released without granting access to their training data, making transferring their knowledge inefficient and problematic. In response, Data-Free Knowledge Distillation (DFKD) methods have emerged as direct solutions. However, simply adopting models derived from DFKD for real-world applications suffers significant performance degradation, due to the discrepancy between teachers' training data and real-world scenarios (student domain). The degradation stems from the portions of teachers' knowledge that are not applicable to the student domain. They are specific to the teacher domain and would undermine students' performance. Hence, selectively transferring teachers' appropriate knowledge becomes the primary challenge in DFKD. In this work, we propose a simple but effective method AuG-KD. It utilizes an uncertainty-guided and sample-specific anchor to align student-domain data with the teacher domain and leverages a generative method to progressively trade off the learning process between OOD knowledge distillation and domain-specific information learning via mixup learning. Extensive experiments in 3 datasets and 8 settings demonstrate the stability and superiority of our approach. Code available at this https URL .

Comments:	Accepted to ICLR 2024
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.07030 [cs.LG]
	(or arXiv:2403.07030v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.07030

Submission history

From: Zihao Tang [view email]
[v1] Mon, 11 Mar 2024 03:34:14 UTC (1,092 KB)
[v2] Mon, 18 Mar 2024 02:45:04 UTC (1,092 KB)

Computer Science > Machine Learning

Title:AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators