skip to main content
research-article

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

Published:20 July 2023Publication History
Skip Abstract Section

Abstract

Named entity recognition (NER) is a fundamental task for information extraction applications. NER is challenging because of semantic ambiguities in academic literature, especially for non-Latin languages. Besides word semantic information, recognizing Chinese named entities needs to consider word boundary information, as words contained in Chinese texts are not separated with spaces. Leveraging word boundary information could help to determine entity boundaries and thus improve entity recognition performance. In this article, we propose to combine word boundary information and semantic information for named entity recognition based on multi-task adversarial learning. Specifically, we learn commonly shared boundary information of entities from multiple kinds of tasks, including Chinese word segmentation (CWS), part-of-speech (POS) tagging, and entity recognition, with adversarial learning. We learn task-specific semantic information of words from these tasks and combine the learned boundary information with the semantic information to improve entity recognition with multi-task learning. We then propose a compression method based on improved clustering to accelerate the proposed model. We conduct extensive experiments on four public benchmark datasets and two private datasets, compared with state-of-the-art baseline models, and the experimental results demonstrate that our model achieves considerable performance improvements on various evaluation datasets.

REFERENCES

  1. [1] Bojanowski Piotr, Grave Edouard, Joulin Armand, and Mikolov Tomas. 2017. Enriching word vectors with subword information. Trans. Assoc. Computat. Ling. 5 (2017), 135146.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Bunescu Razvan C. and Mooney Raymond J.. 2005. A shortest path dependency kernel for relation extraction. In HLT-EMNLP. 724731.Google ScholarGoogle Scholar
  3. [3] Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun, and Liu Shengping. 2018. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In EMNLP. 182192.Google ScholarGoogle Scholar
  4. [4] Che Wanxiang, Wang Mengqiu, Manning Christopher D., and Liu Ting. 2013. Named entity recognition with bilingual constraints. In NAACL. 5262.Google ScholarGoogle Scholar
  5. [5] Chen Aitao, Peng Fuchun, Shan Roy, and Sun Gordon. 2006. Chinese named entity recognition with conditional probabilistic models. In SIGHAN. 173176.Google ScholarGoogle Scholar
  6. [6] Cheng Dawei, Niu Zhibin, and Zhang Liqing. 2020. Delinquent events prediction in temporal networked-guarantee loans. IEEE Trans. Neural Netw. Learn. Syst. 34, 4 (2020).Google ScholarGoogle Scholar
  7. [7] Chiu Jason P. C. and Nichols Eric. 2016. Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Computat. Ling. 4 (2016), 357370.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Ding Ruixue, Xie Pengjun, Zhang Xiaoyan, Lu Wei, Li Linlin, and Si Luo. 2019. A neural multi-digraph model for Chinese NER with gazetteers. In ACL. 14621467.Google ScholarGoogle Scholar
  9. [9] Dong Chuanhai, Zhang Jiajun, Zong Chengqing, Hattori Masanori, and Di Hui. 2016. Character-based LSTM-CRF with radical-level features for Chinese named entity recognition. In NLPCC. 239250.Google ScholarGoogle Scholar
  10. [10] Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2019. Neural architecture search: A survey. J. Mach. Learn. Res. 20, 1 (2019), 1997–2017.Google ScholarGoogle Scholar
  11. [11] Fader Anthony, Zettlemoyer Luke, and Etzioni Oren. 2013. Paraphrase-driven learning for open question answering. In ACL. 16081618.Google ScholarGoogle Scholar
  12. [12] Fan Mengzhen, Cheng Dawei, Yang Fangzhou, Luo Siqiang, Luo Yifeng, Qian Weining, and Zhou Aoying. 2020. Fusing global domain information and local semantic information to classify financial documents. In CIKM. 24132420.Google ScholarGoogle Scholar
  13. [13] Jianping Gou, Baosheng Yu, Stephen J. Maybank, and Dacheng Tao. 2021. Knowledge distillation: A survey. Int. J. Comput. Vis. 129, 6 (2021), 1789–1819.Google ScholarGoogle Scholar
  14. [14] Gui Tao, Ma Ruotian, Zhang Qi, Zhao Lujun, Jiang Yu-Gang, and Huang Xuanjing. 2019. CNN-based Chinese NER with lexicon rethinking. In IJCAI. 49824988.Google ScholarGoogle Scholar
  15. [15] Gui Tao, Zou Yicheng, Zhang Qi, Peng Minlong, Fu Jinlan, Wei Zhongyu, and Huang Xuanjing. 2019. A lexicon-based graph neural network for Chinese NER. In EMNLP-IJCNLP. 10391049.Google ScholarGoogle Scholar
  16. [16] Han Song, Mao Huizi, and Dally William J.. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv preprint arXiv:1510.00149 (2015).Google ScholarGoogle Scholar
  17. [17] He Hangfeng and Sun Xu. 2017. F-score driven max margin neural network for named entity recognition in Chinese social media. In EACL. 713718.Google ScholarGoogle Scholar
  18. [18] He Hangfeng and Sun Xu. 2017. A unified model for cross-domain and semi-supervised named entity recognition in Chinese social media. In AAAI. 32163222.Google ScholarGoogle Scholar
  19. [19] Huang Zhiheng, Xu Wei, and Yu Kai. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015).Google ScholarGoogle Scholar
  20. [20] Ji Yu, Liang Ling, Deng Lei, Zhang Youyang, Zhang Youhui, and Xie Yuan. 2018. TETRIS: Tile-matching the tremendous irregular sparsity. In NIPS. 41194129.Google ScholarGoogle Scholar
  21. [21] Levow Gina-Anne. 2006. The Third International Chinese Language Processing Bakeoff: Word segmentation and named entity recognition. In SIGHAN. 108117.Google ScholarGoogle Scholar
  22. [22] Li Xiaonan, Yan Hang, Qiu Xipeng, and Huang Xuanjing. 2020. FLAT: Chinese NER using flat-lattice transformer. In ACL. 68366842.Google ScholarGoogle Scholar
  23. [23] Liang Xin, Cheng Dawei, Yang Fangzhou, Luo Yifeng, Qian Weining, and Zhou Aoying. 2020. F-HMTC: Detecting financial events for investment decisions based on neural hierarchical multi-label text classification. In IJCAI. 44904496.Google ScholarGoogle Scholar
  24. [24] Liu Wei, Xu Tongge, Xu Qinghua, Song Jiayu, and Zu Yueran. 2019. An encoding strategy based word-character LSTM for Chinese NER. In NAACL. 23792389.Google ScholarGoogle Scholar
  25. [25] Lu Yanan, Zhang Yue, and Ji Donghong. 2016. Multi-prototype chinese character embedding. In LREC. 855859.Google ScholarGoogle Scholar
  26. [26] Mengge Xue, Bowen Yu, Tingwen Liu, Bin Wang, Erli Meng, and Quangang Li. 2019. Porous lattice-based transformer encoder for Chinese NER. arXiv preprint arXiv:1911.02733 (2019).Google ScholarGoogle Scholar
  27. [27] Andriy Mnih and Russ R. Salakhutdinov. 2007. Probabilistic matrix factorization. In NIPS. 1257–1264.Google ScholarGoogle Scholar
  28. [28] Peng Minlong, Ma Ruotian, Zhang Qi, and Huang Xuanjing. 2020. Simplify the usage of lexicon in Chinese NER. In ACL. 59515960.Google ScholarGoogle Scholar
  29. [29] Peng Nanyun and Dredze Mark. 2015. Named entity recognition for Chinese social media with jointly trained embeddings. In EMNLP. 548554.Google ScholarGoogle Scholar
  30. [30] Peng Nanyun and Dredze Mark. 2016. Improving named entity recognition for Chinese social media with word segmentation representation learning. In ACL. 149155.Google ScholarGoogle Scholar
  31. [31] Cheng Fangzhou Yang Yifeng Luo Weining Qian andAoying Zhou Peng Zhu, Dawei. 2021. ZH-NER: Chinese named entity recognition with adversarial multi-task learning and self-attentions. In DASFAA. 18.Google ScholarGoogle Scholar
  32. [32] Sui Dianbo, Chen Yubo, Liu Kang, Zhao Jun, and Liu Shengping. 2019. Leverage lexical knowledge for Chinese named entity recognition via collaborative graph network. In EMNLP-IJCNLP. 38213831.Google ScholarGoogle Scholar
  33. [33] Sun Changzhi and Wu Yuanbin. 2019. Distantly supervised entity relation extraction with adapted manual annotations. In AAAI. 70397046.Google ScholarGoogle Scholar
  34. [34] Ullrich Karen, Meeds Edward, and Welling Max. 2017. Soft weight-sharing for neural network compression. arXiv preprint arXiv:1702.04008 (2017).Google ScholarGoogle Scholar
  35. [35] Wang Mengqiu, Che Wanxiang, and Manning Christopher D.. 2013. Effective bilingual constraints for semi-supervised learning of named entity recognizers. In AAAI. 5262.Google ScholarGoogle Scholar
  36. [36] Weischedel Ralph, Palmer Martha, Marcus Mitchell, Hovy Eduard, Pradhan Sameer, Ramshaw Lance, Xue Nianwen, Taylor Ann, Kaufman Jeff, Franchini Michelle et al. 2011. OntoNotes 4.0. Linguistic Data Consortium LDC2011T03 (2011).Google ScholarGoogle Scholar
  37. [37] Wu Fangzhao, Liu Junxin, Wu Chuhan, Huang Yongfeng, and Xie Xing. 2019. Neural Chinese named entity recognition via CNN-LSTM-CRF and joint training with word segmentation. In WWW. 33423348.Google ScholarGoogle Scholar
  38. [38] Xin Ji, Lin Yankai, Liu Zhiyuan, and Sun Maosong. 2018. Improving neural fine-grained entity typing with knowledge attention. In AAAI. 59976004.Google ScholarGoogle Scholar
  39. [39] Yan Hang, Deng Bocao, Li Xiaonan, and Qiu Xipeng. 2019. TENER: Adapting transformer encoder for name entity recognition. arXiv preprint arXiv:1911.04474 (2019).Google ScholarGoogle Scholar
  40. [40] Yang Fan, Zhang Jianhu, Liu Gongshen, Zhou Jie, Zhou Cheng, and Sun Huanrong. 2018. Five-stroke based CNN-BiRNN-CRF network for Chinese named entity recognition. In NLPCC. 184195.Google ScholarGoogle Scholar
  41. [41] Yang Jie, Teng Zhiyang, Zhang Meishan, and Zhang Yue. 2016. Combining discrete and neural features for sequence labeling. In CICLing. 140154.Google ScholarGoogle Scholar
  42. [42] Ye Zhixiu and Ling Zhen-Hua. 2018. Hybrid semi-Markov CRF for neural sequence labeling. In ACL. 235240.Google ScholarGoogle Scholar
  43. [43] Zhang Suxiang, Qin Ying, Wen Juan, and Wang Xiaojie. 2006. Word segmentation and named entity recognition for SIGHAN Bakeoff3. In SIGHAN. 158161.Google ScholarGoogle Scholar
  44. [44] Zhang Yue and Yang Jie. 2018. Chinese NER using lattice LSTM. In ACL. 15541564.Google ScholarGoogle Scholar
  45. [45] Zhou Junsheng, Qu Weiguang, and Zhang Fen. 2013. Chinese named entity recognition via joint identification and categorization. Chinese J. Electron. 22, 2 (2013), 225230.Google ScholarGoogle Scholar
  46. [46] Zhu Peng, Cheng Dawei, Yang Fangzhou, Luo Yifeng, Huang Dingjiang, Qian Weining, and Zhou Aoying. 2022. Improving Chinese named entity recognition by large-scale syntactic dependency graph. IEEE/ACM Trans. Audio, Speech Lang. Process. 30 (2022), 979991.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Zhu Yuying, Wang Guoxin, and Karlsson Börje F.. 2019. CAN-NER: Convolutional attention network for Chinese named entity recognition. In NAACL. 33843393.Google ScholarGoogle Scholar

Index Terms

  1. Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 7
      July 2023
      422 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3610376
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 July 2023
      • Online AM: 6 June 2023
      • Accepted: 2 June 2023
      • Revised: 6 October 2022
      • Received: 26 April 2021
      Published in tallip Volume 22, Issue 7

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)273
      • Downloads (Last 6 weeks)23

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text