Measuring Task Similarity and Its Implication in Fine-Tuning Graph Neural Networks

Authors

  • Renhong Huang Zhejiang University Fudan University
  • Jiarong Xu Fudan University
  • Xin Jiang Lehigh University
  • Chenglu Pan Zhejiang University
  • Zhiming Yang Fudan University
  • Chunping Wang FinVolution
  • Yang Yang Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v38i11.29156

Keywords:

ML: Graph-based Machine Learning, ML: Transfer, Domain Adaptation, Multi-Task Learning

Abstract

The paradigm of pre-training and fine-tuning graph neural networks has attracted wide research attention. In previous studies, the pre-trained models are viewed as universally versatile, and applied for a diverse range of downstream tasks. In many situations, however, this practice results in limited or even negative transfer. This paper, for the first time, emphasizes the specific application scope of graph pre-trained models: not all downstream tasks can effectively benefit from a graph pre-trained model. In light of this, we introduce the measure task consistency to quantify the similarity between graph pre-training and downstream tasks. This measure assesses the extent to which downstream tasks can benefit from specific pre-training tasks. Moreover, a novel fine-tuning strategy, Bridge-Tune, is proposed to further diminish the impact of the difference between pre-training and downstream tasks. The key innovation in Bridge-Tune is an intermediate step that bridges pre-training and downstream tasks. This step takes into account the task differences and further refines the pre-trained model. The superiority of the presented fine-tuning strategy is validated via numerous experiments with different pre-trained models and downstream tasks.

Published

2024-03-24

How to Cite

Huang, R., Xu, J., Jiang, X., Pan, C., Yang, Z., Wang, C., & Yang, Y. (2024). Measuring Task Similarity and Its Implication in Fine-Tuning Graph Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12617-12625. https://doi.org/10.1609/aaai.v38i11.29156

Issue

Section

AAAI Technical Track on Machine Learning II