Chinese Biomedical NER Based on Self-attention and Word-Relation Decoding Strategy

Mu, Wenxuan; Zhao, Di; Meng, Jiana

doi:10.1007/978-981-97-1717-0_8

Part of the book series: Communications in Computer and Information Science (CCIS, volume 2080)

Included in the following conference series:

China Health Information Processing Conference

Abstract

Biomedical named entity recognition plays a crucial role in advancing smart healthcare tasks. However, biomedical data is scarce and must be annotated extensively by professionals, which makes strong model performance difficult and expensive to achieve. Few-shot learning focuses on improving a model's performance and generalization when labeled data is limited, providing effective solutions for biomedical information mining. This paper therefore proposes a Chinese biomedical named entity recognition method based on self-attention and a word-relation decoding strategy, aiming to address Chinese biomedical named entity recognition in few-shot scenarios. Our work is based on Task 2 of the 9th China Health Information Processing Conference and ranked third among all teams. In the final results on the query sets of the test data, the F1 score is 0.85 on the testA dataset and 0.87 on the testB dataset.



Author information

Authors and Affiliations

School of Computer Science and Engineering, Dalian Minzu University, Dalian, 116000, Liaoning, China
Wenxuan Mu, Di Zhao & Jiana Meng

Corresponding author

Correspondence to Di Zhao.

Editor information

Editors and Affiliations

The University of Texas Health Science Center at Houston, Houston, TX, USA
Hua Xu
Harbin Institute of Technology, Shenzhen, China
Qingcai Chen
Dalian University of Technology, Dalian, China
Hongfei Lin
Zhejiang University, Hangzhou, China
Fei Wu
Fudan University, Shanghai, China
Lei Liu
Harbin Institute of Technology, Shenzhen, China
Buzhou Tang
South China Normal University, Guangzhou, China
Tianyong Hao
Zhejiang University, Hangzhou, China
Zhengxing Huang
Medical Informatics Center of Peking University, Beijing, China
Jianbo Lei
Takeda Co. Ltd, Shanghai, China
Zuofeng Li
West China Hospital of Sichuan University, Chengdu, China
Hui Zong

About this paper

Cite this paper

Mu, W., Zhao, D., Meng, J. (2024). Chinese Biomedical NER Based on Self-attention and Word-Relation Decoding Strategy. In: Xu, H., et al. Health Information Processing. Evaluation Track Papers. CHIP 2023. Communications in Computer and Information Science, vol 2080. Springer, Singapore. https://doi.org/10.1007/978-981-97-1717-0_8

DOI: https://doi.org/10.1007/978-981-97-1717-0_8
Published: 20 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1716-3
Online ISBN: 978-981-97-1717-0
eBook Packages: Computer Science, Computer Science (R0)
