A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling

Authors

  • Ye Wang East China Normal University
  • Huazheng Pan East China Normal University
  • Tao Zhang Tsinghua University
  • Wen Wu East China Normal University
  • Wenxin Hu East China Normal University

DOI:

https://doi.org/10.1609/aaai.v38i17.29888

Keywords:

NLP: Information Extraction, ML: Semi-Supervised Learning

Abstract

The goal of document-level relation extraction (RE) is to identify relations between entities that span multiple sentences. Recently, incomplete labeling in document-level RE has received increasing attention, and some studies have used methods such as positive-unlabeled learning to tackle this issue, but there is still a lot of room for improvement. Motivated by this, we propose a positive-augmentation and positive-mixup positive-unlabeled metric learning framework (P3M). Specifically, we formulate document-level RE as a metric learning problem. We aim to pull the distance closer between entity pair embedding and their corresponding relation embedding, while pushing it farther away from the none-class relation embedding. Additionally, we adapt the positive-unlabeled learning to this loss objective. In order to improve the generalizability of the model, we use dropout to augment positive samples and propose a positive-none-class mixup method. Extensive experiments show that P3M improves the F1 score by approximately 4-10 points in document-level RE with incomplete labeling, and achieves state-of-the-art results in fully labeled scenarios. Furthermore, P3M has also demonstrated robustness to prior estimation bias in incomplete labeled scenarios.

Published

2024-03-24

How to Cite

Wang, Y., Pan, H., Zhang, T., Wu, W., & Hu, W. (2024). A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19197-19205. https://doi.org/10.1609/aaai.v38i17.29888

Issue

Section

AAAI Technical Track on Natural Language Processing II