Divergence-Based Domain Transferability for Zero-Shot Classification

Pugantsov, Alexander; McCreadie, Richard

Computer Science > Computation and Language

arXiv:2302.05735 (cs)

[Submitted on 11 Feb 2023 (v1), last revised 28 Feb 2023 (this version, v2)]

Title:Divergence-Based Domain Transferability for Zero-Shot Classification

Authors:Alexander Pugantsov, Richard McCreadie

View PDF

Abstract:Transferring learned patterns from pretrained neural language models has been shown to significantly improve effectiveness across a variety of language-based tasks, meanwhile further tuning on intermediate tasks has been demonstrated to provide additional performance benefits, provided the intermediate task is sufficiently related to the target task. However, how to identify related tasks is an open problem, and brute-force searching effective task combinations is prohibitively expensive. Hence, the question arises, are we able to improve the effectiveness and efficiency of tasks with no training examples through selective fine-tuning? In this paper, we explore statistical measures that approximate the divergence between domain representations as a means to estimate whether tuning using one task pair will exhibit performance benefits over tuning another. This estimation can then be used to reduce the number of task pairs that need to be tested by eliminating pairs that are unlikely to provide benefits. Through experimentation over 58 tasks and over 6,600 task pair combinations, we demonstrate that statistical measures can distinguish effective task pairs, and the resulting estimates can reduce end-to-end runtime by up to 40%.

Comments:	Accepted at EACL 2023, Findings. Figure 1 caption corrected to describe NDCG@K graph (Figure 1 caption was mistakenly describing Figure 2 before correction)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2302.05735 [cs.CL]
	(or arXiv:2302.05735v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.05735

Submission history

From: Alexander Pugantsov [view email]
[v1] Sat, 11 Feb 2023 16:04:38 UTC (6,728 KB)
[v2] Tue, 28 Feb 2023 11:26:32 UTC (6,728 KB)

Computer Science > Computation and Language

Title:Divergence-Based Domain Transferability for Zero-Shot Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Divergence-Based Domain Transferability for Zero-Shot Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators