research-article

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

Authors:
Yibo Yan

Department of Computer Science and Technology, Tongji University, China

Department of Computer Science and Technology, Tongji University, China

0009-0005-2793-0579
View Profile

,
Peng Zhu

School of Data Science and Engineering, East China Normal University, China

School of Data Science and Engineering, East China Normal University, China

0000-0001-9558-3787
View Profile

,
Dawei Cheng

Department of Computer Science and Technology, Tongji University, China

Department of Computer Science and Technology, Tongji University, China

0000-0002-5877-7387
View Profile

,
Fangzhou Yang

Group of Artificial Intelligence and Big Data, Seek Data Inc., China

Group of Artificial Intelligence and Big Data, Seek Data Inc., China

0009-0001-7293-4975
View Profile

,
Yifeng Luo

School of Data Science and Engineering, East China Normal University, China

School of Data Science and Engineering, East China Normal University, China

0000-0003-4863-3432
View Profile

ACM Transactions on Asian and Low-Resource Language Information Processing Volume 22 Issue 7Article No.: 193pp 1–19https://doi.org/10.1145/3603626

Published:20 July 2023Publication History

ACM Transactions on Asian and Low-Resource Language Information Processing

Abstract

Named entity recognition (NER) is a fundamental task for information extraction applications. NER is challenging because of semantic ambiguities in academic literature, especially for non-Latin languages. Besides word semantic information, recognizing Chinese named entities needs to consider word boundary information, as words contained in Chinese texts are not separated with spaces. Leveraging word boundary information could help to determine entity boundaries and thus improve entity recognition performance. In this article, we propose to combine word boundary information and semantic information for named entity recognition based on multi-task adversarial learning. Specifically, we learn commonly shared boundary information of entities from multiple kinds of tasks, including Chinese word segmentation (CWS), part-of-speech (POS) tagging, and entity recognition, with adversarial learning. We learn task-specific semantic information of words from these tasks and combine the learned boundary information with the semantic information to improve entity recognition with multi-task learning. We then propose a compression method based on improved clustering to accelerate the proposed model. We conduct extensive experiments on four public benchmark datasets and two private datasets, compared with state-of-the-art baseline models, and the experimental results demonstrate that our model achieves considerable performance improvements on various evaluation datasets.

REFERENCES

[1] Bojanowski Piotr, Grave Edouard, Joulin Armand, and Mikolov Tomas. 2017. Enriching word vectors with subword information. Trans. Assoc. Computat. Ling. 5 (2017), 135–146.Google ScholarCross Ref
[2] Bunescu Razvan C. and Mooney Raymond J.. 2005. A shortest path dependency kernel for relation extraction. In HLT-EMNLP. 724–731.Google Scholar
[3] Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun, and Liu Shengping. 2018. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In EMNLP. 182–192.Google Scholar
[4] Che Wanxiang, Wang Mengqiu, Manning Christopher D., and Liu Ting. 2013. Named entity recognition with bilingual constraints. In NAACL. 52–62.Google Scholar
[5] Chen Aitao, Peng Fuchun, Shan Roy, and Sun Gordon. 2006. Chinese named entity recognition with conditional probabilistic models. In SIGHAN. 173–176.Google Scholar
[6] Cheng Dawei, Niu Zhibin, and Zhang Liqing. 2020. Delinquent events prediction in temporal networked-guarantee loans. IEEE Trans. Neural Netw. Learn. Syst. 34, 4 (2020).Google Scholar
[7] Chiu Jason P. C. and Nichols Eric. 2016. Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Computat. Ling. 4 (2016), 357–370.Google ScholarCross Ref
[8] Ding Ruixue, Xie Pengjun, Zhang Xiaoyan, Lu Wei, Li Linlin, and Si Luo. 2019. A neural multi-digraph model for Chinese NER with gazetteers. In ACL. 1462–1467.Google Scholar
[9] Dong Chuanhai, Zhang Jiajun, Zong Chengqing, Hattori Masanori, and Di Hui. 2016. Character-based LSTM-CRF with radical-level features for Chinese named entity recognition. In NLPCC. 239–250.Google Scholar
[10] Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2019. Neural architecture search: A survey. J. Mach. Learn. Res. 20, 1 (2019), 1997–2017.Google Scholar
[11] Fader Anthony, Zettlemoyer Luke, and Etzioni Oren. 2013. Paraphrase-driven learning for open question answering. In ACL. 1608–1618.Google Scholar
[12] Fan Mengzhen, Cheng Dawei, Yang Fangzhou, Luo Siqiang, Luo Yifeng, Qian Weining, and Zhou Aoying. 2020. Fusing global domain information and local semantic information to classify financial documents. In CIKM. 2413–2420.Google Scholar
[13] Jianping Gou, Baosheng Yu, Stephen J. Maybank, and Dacheng Tao. 2021. Knowledge distillation: A survey. Int. J. Comput. Vis. 129, 6 (2021), 1789–1819.Google Scholar
[14] Gui Tao, Ma Ruotian, Zhang Qi, Zhao Lujun, Jiang Yu-Gang, and Huang Xuanjing. 2019. CNN-based Chinese NER with lexicon rethinking. In IJCAI. 4982–4988.Google Scholar
[15] Gui Tao, Zou Yicheng, Zhang Qi, Peng Minlong, Fu Jinlan, Wei Zhongyu, and Huang Xuanjing. 2019. A lexicon-based graph neural network for Chinese NER. In EMNLP-IJCNLP. 1039–1049.Google Scholar
[16] Han Song, Mao Huizi, and Dally William J.. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv preprint arXiv:1510.00149 (2015).Google Scholar
[17] He Hangfeng and Sun Xu. 2017. F-score driven max margin neural network for named entity recognition in Chinese social media. In EACL. 713–718.Google Scholar
[18] He Hangfeng and Sun Xu. 2017. A unified model for cross-domain and semi-supervised named entity recognition in Chinese social media. In AAAI. 3216–3222.Google Scholar
[19] Huang Zhiheng, Xu Wei, and Yu Kai. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015).Google Scholar
[20] Ji Yu, Liang Ling, Deng Lei, Zhang Youyang, Zhang Youhui, and Xie Yuan. 2018. TETRIS: Tile-matching the tremendous irregular sparsity. In NIPS. 4119–4129.Google Scholar
[21] Levow Gina-Anne. 2006. The Third International Chinese Language Processing Bakeoff: Word segmentation and named entity recognition. In SIGHAN. 108–117.Google Scholar
[22] Li Xiaonan, Yan Hang, Qiu Xipeng, and Huang Xuanjing. 2020. FLAT: Chinese NER using flat-lattice transformer. In ACL. 6836–6842.Google Scholar
[23] Liang Xin, Cheng Dawei, Yang Fangzhou, Luo Yifeng, Qian Weining, and Zhou Aoying. 2020. F-HMTC: Detecting financial events for investment decisions based on neural hierarchical multi-label text classification. In IJCAI. 4490–4496.Google Scholar
[24] Liu Wei, Xu Tongge, Xu Qinghua, Song Jiayu, and Zu Yueran. 2019. An encoding strategy based word-character LSTM for Chinese NER. In NAACL. 2379–2389.Google Scholar
[25] Lu Yanan, Zhang Yue, and Ji Donghong. 2016. Multi-prototype chinese character embedding. In LREC. 855–859.Google Scholar
[26] Mengge Xue, Bowen Yu, Tingwen Liu, Bin Wang, Erli Meng, and Quangang Li. 2019. Porous lattice-based transformer encoder for Chinese NER. arXiv preprint arXiv:1911.02733 (2019).Google Scholar
[27] Andriy Mnih and Russ R. Salakhutdinov. 2007. Probabilistic matrix factorization. In NIPS. 1257–1264.Google Scholar
[28] Peng Minlong, Ma Ruotian, Zhang Qi, and Huang Xuanjing. 2020. Simplify the usage of lexicon in Chinese NER. In ACL. 5951–5960.Google Scholar
[29] Peng Nanyun and Dredze Mark. 2015. Named entity recognition for Chinese social media with jointly trained embeddings. In EMNLP. 548–554.Google Scholar
[30] Peng Nanyun and Dredze Mark. 2016. Improving named entity recognition for Chinese social media with word segmentation representation learning. In ACL. 149–155.Google Scholar
[31] Cheng Fangzhou Yang Yifeng Luo Weining Qian andAoying Zhou Peng Zhu, Dawei. 2021. ZH-NER: Chinese named entity recognition with adversarial multi-task learning and self-attentions. In DASFAA. 1–8.Google Scholar
[32] Sui Dianbo, Chen Yubo, Liu Kang, Zhao Jun, and Liu Shengping. 2019. Leverage lexical knowledge for Chinese named entity recognition via collaborative graph network. In EMNLP-IJCNLP. 3821–3831.Google Scholar
[33] Sun Changzhi and Wu Yuanbin. 2019. Distantly supervised entity relation extraction with adapted manual annotations. In AAAI. 7039–7046.Google Scholar
[34] Ullrich Karen, Meeds Edward, and Welling Max. 2017. Soft weight-sharing for neural network compression. arXiv preprint arXiv:1702.04008 (2017).Google Scholar
[35] Wang Mengqiu, Che Wanxiang, and Manning Christopher D.. 2013. Effective bilingual constraints for semi-supervised learning of named entity recognizers. In AAAI. 52–62.Google Scholar
[36] Weischedel Ralph, Palmer Martha, Marcus Mitchell, Hovy Eduard, Pradhan Sameer, Ramshaw Lance, Xue Nianwen, Taylor Ann, Kaufman Jeff, Franchini Michelle et al. 2011. OntoNotes 4.0. Linguistic Data Consortium LDC2011T03 (2011).Google Scholar
[37] Wu Fangzhao, Liu Junxin, Wu Chuhan, Huang Yongfeng, and Xie Xing. 2019. Neural Chinese named entity recognition via CNN-LSTM-CRF and joint training with word segmentation. In WWW. 3342–3348.Google Scholar
[38] Xin Ji, Lin Yankai, Liu Zhiyuan, and Sun Maosong. 2018. Improving neural fine-grained entity typing with knowledge attention. In AAAI. 5997–6004.Google Scholar
[39] Yan Hang, Deng Bocao, Li Xiaonan, and Qiu Xipeng. 2019. TENER: Adapting transformer encoder for name entity recognition. arXiv preprint arXiv:1911.04474 (2019).Google Scholar
[40] Yang Fan, Zhang Jianhu, Liu Gongshen, Zhou Jie, Zhou Cheng, and Sun Huanrong. 2018. Five-stroke based CNN-BiRNN-CRF network for Chinese named entity recognition. In NLPCC. 184–195.Google Scholar
[41] Yang Jie, Teng Zhiyang, Zhang Meishan, and Zhang Yue. 2016. Combining discrete and neural features for sequence labeling. In CICLing. 140–154.Google Scholar
[42] Ye Zhixiu and Ling Zhen-Hua. 2018. Hybrid semi-Markov CRF for neural sequence labeling. In ACL. 235–240.Google Scholar
[43] Zhang Suxiang, Qin Ying, Wen Juan, and Wang Xiaojie. 2006. Word segmentation and named entity recognition for SIGHAN Bakeoff3. In SIGHAN. 158–161.Google Scholar
[44] Zhang Yue and Yang Jie. 2018. Chinese NER using lattice LSTM. In ACL. 1554–1564.Google Scholar
[45] Zhou Junsheng, Qu Weiguang, and Zhang Fen. 2013. Chinese named entity recognition via joint identification and categorization. Chinese J. Electron. 22, 2 (2013), 225–230.Google Scholar
[46] Zhu Peng, Cheng Dawei, Yang Fangzhou, Luo Yifeng, Huang Dingjiang, Qian Weining, and Zhou Aoying. 2022. Improving Chinese named entity recognition by large-scale syntactic dependency graph. IEEE/ACM Trans. Audio, Speech Lang. Process. 30 (2022), 979–991.Google ScholarDigital Library
[47] Zhu Yuying, Wang Guoxin, and Karlsson Börje F.. 2019. CAN-NER: Convolutional attention network for Chinese named entity recognition. In NAACL. 3384–3393.Google Scholar

Index Terms

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction

Recommendations

ZH-NER: Chinese Named Entity Recognition with Adversarial Multi-task Learning and Self-Attentions
Database Systems for Advanced Applications
Abstract
NER is challenging because of the semantic ambiguities in academic literature, especially for non-Latin languages. Besides, recognizing Chinese named entities needs to consider word boundary information, as words contained in Chinese texts are not ...
Read More
Learning multilingual named entity recognition from Wikipedia

We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...
Read More
Semi-joint labeling for chinese named entity recognition
AIRS'08: Proceedings of the 4th Asia information retrieval conference on Information retrieval technology

Named entity recognition (NER) is an essential component of text mining applications. In Chinese sentences, words do not have delimiters; thus, incorporating word segmentation information into an NER model can improve its performance. Based on the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Asian and Low-Resource Language Information Processing Volume 22, Issue 7
July 2023
422 pages
ISSN:2375-4699
EISSN:2375-4702
DOI:10.1145/3610376
Editor:
Imed Zitouni
Google, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 20 July 2023
- Online AM: 6 June 2023
- Accepted: 2 June 2023
- Revised: 6 October 2022
- Received: 26 April 2021
Published in tallip Volume 22, Issue 7

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Named entity recognition
Chinese word segmentation
part-of-speech tagging
adversarial learning
multi-task learning
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 273
  Total Downloads
- Downloads (Last 12 months)273
- Downloads (Last 6 weeks)23
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

ACM Transactions on Asian and Low-Resource Language Information Processing

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

ZH-NER: Chinese Named Entity Recognition with Adversarial Multi-task Learning and Self-Attentions

Learning multilingual named entity recognition from Wikipedia

Semi-joint labeling for chinese named entity recognition

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Full Text

Caption

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

ACM Transactions on Asian and Low-Resource Language Information Processing

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

ZH-NER: Chinese Named Entity Recognition with Adversarial Multi-task Learning and Self-Attentions

Learning multilingual named entity recognition from Wikipedia

Semi-joint labeling for chinese named entity recognition

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Full Text

Share this Publication link

Share on Social Media