SENCR: A Span Enhanced Two-Stage Network with Counterfactual Rethinking for Chinese NER

Authors

  • Hang Zheng School of Big Data and Software Engineering, Chongqing University, China
  • Qingsong Li School of Big Data and Software Engineering, Chongqing University, China
  • Shen Chen School of Big Data and Software Engineering, Chongqing University, China
  • Yuxuan Liang The Hong Kong University of Science and Technology (Guangzhou), China
  • Li Liu School of Big Data and Software Engineering, Chongqing University, China

DOI:

https://doi.org/10.1609/aaai.v38i17.29941

Keywords:

NLP: Information Extraction, NLP: Ethics -- Bias, Fairness, Transparency & Privacy

Abstract

Recently, lots of works that incorporate external lexicon information into character-level Chinese named entity recognition(NER) to overcome the lackness of natural delimiters of words, have achieved many advanced performance. However, obtaining and maintaining high-quality lexicons is costly, especially in special domains. In addition, the entity boundary bias caused by high mention coverage in some boundary characters poses a significant challenge to the generalization of NER models but receives little attention in the existing literature. To address these issues, we propose SENCR, a Span Enhanced Two-stage Network with Counterfactual Rethinking for Chinese NER, that contains a boundary detector for boundary supervision, a convolution-based type classifier for better span representation and a counterfactual rethinking(CR) strategy for debiased boundary detection in inference. The proposed boundary detector and type classifier are jointly trained with the same contextual encoder and then the trained boundary detector is debiased by our proposed CR strategy without modifying any model parameters in the inference stage. Extensive experiments on four Chinese NER datasets show the effectiveness of our proposed approach.

Published

2024-03-24

How to Cite

Zheng, H., Li, Q., Chen, S., Liang, Y., & Liu, L. (2024). SENCR: A Span Enhanced Two-Stage Network with Counterfactual Rethinking for Chinese NER. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19679-19687. https://doi.org/10.1609/aaai.v38i17.29941

Issue

Section

AAAI Technical Track on Natural Language Processing II