Abstract
Biomedical named entity recognition plays a crucial role in advancing smart healthcare tasks. However, the scarcity of biomedical data and the extensive annotation required by professionals make achieving remarkable model performance challenging and expensive. Few-shot learning focuses on improving the model’s performance and generalization under limited labeled data, providing effective solutions for biomedical information mining. Therefore, this paper proposes a Chinese biomedical named entity recognition method based on self-attention and word-relation decoding strategy. The aim is to effectively address the task of Chinese biomedical named entity recognition in few-shot scenarios. Our work is based on the 9th China Health Information Processing Conference task 2 and ranked third among all the teams. In the final results of the query set in the test dataset, the F1 score on testA dataset is 0.85, and on testB dataset, it is 0.87.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bach, N., Badaskar, S.: A review of relation extraction. Lit. Rev. Lang. Stat. II(2), 1–15 (2007)
Leng, J., Jiang, P.: A deep learning approach for relationship extraction from interaction context in social manufacturing paradigm. Knowl.-Based Syst. 100, 188–199 (2016)
Li, L., et al.: Real-world data medical knowledge graph: construction and applications. Artif. Intell. Med. 103, 101817 (2020)
Allam, A., Haggag, M.: The question answering systems: a survey. Int. J. Res. Rev. Inf. Sci. (IJRRIS) 2(3) (2012)
Mishra, A., Jain, S.: A survey on question answering systems with classification. J. King Saud Univ. Comput. Inf. Sci. 28(3), 345–361 (2016)
Dang, T., Le, H., Nguyen, T., Vu, S.: D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information. Bioinformatics 34(20), 3539–3546 (2018)
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1423
Tian, Y., Shen, W., Song, Y., Xia, F., He, M., Li, K.: Improving biomedical named entity recognition with syntactic information. BMC Bioinform. 21(1), 1–17 (2020)
Li, D., Yan, L., Yang, J., Ma, Z.: Dependency syntax guided BERT-BiLSTM-GAM-CRF for Chinese NER. Expert Syst. Appl. 196, 116682 (2022)
Yamada, I., Asai, A., Shindo, H., Takeda, H., Matsumoto, Y.: LUKE: deep contextualized entity representations with entity-aware self-attention. In: Webber, B., Cohn, T., He, Y., Liu, Y. (eds.) Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6442–6454. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.emnlp-main.523
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann Publishers Inc (2001). Not Available
Liu, L., Ding, B., Bing, L., Joty, S., Si, L., Miao, C.: MulDA: a multilingual data augmentation framework for low-resource cross-lingual NER. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, vol. 1, pp. 5834–5846 (2021)
Ding, B., et al.: DAGA: data augmentation with a generation approach for low-resource tagging tasks. In: Webber, B., Cohn, T., He, Y., Liu, Y. (eds.) Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6045–6057. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.emnlp-main.488
Li, J., et al.: Unified named entity recognition as word-word relation classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 10965–10973 (2022)
Finkel, J., Grenager, T., Manning, C.: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, pp. 363–370 (2005)
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)
Strubell, E., Verga, P., Belanger, D., McCallum, A.: Fast and accurate entity recognition with iterated dilated convolutions. In: Palmer, M., Hwa, R., Riedel, S. (eds.) Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2670–2680. Association for Computational Linguistics (2017). https://doi.org/10.18653/v1/D17-1283
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Knight, K., Nenkova, A., Rambow, O. (eds.) Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/N16-1030
Yan, H., Deng, B., Li, X., Qiu, X.: TENER: adapting transformer encoder for named entity recognition. ArXiv (2019)
Li, X., Yan, H., Qiu, X., Huang, X.: FLAT: Chinese NER using flat-lattice transformer. In: Jurafsky, D., Chai, J., Schluter, N., Tetreault, J. (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6836–6842. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.acl-main.611
Xu, M., Jiang, H., Watcharawittayakul, S.: A local detection approach for named entity recognition and mention detection. In: Barzilay, R., Kan, M. (eds.) Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1237–1247. Association for Computational Linguistics (2017). https://doi.org/10.18653/v1/P17-1114
Luan, Y., Wadden, D., He, L., Shah, A., Ostendorf, M., Hajishirzi, H.: A general framework for information extraction using dynamic span graphs. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 3036–3046. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1308
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mu, W., Zhao, D., Meng, J. (2024). Chinese Biomedical NER Based on Self-attention and Word-Relation Decoding Strategy. In: Xu, H., et al. Health Information Processing. Evaluation Track Papers. CHIP 2023. Communications in Computer and Information Science, vol 2080. Springer, Singapore. https://doi.org/10.1007/978-981-97-1717-0_8
Download citation
DOI: https://doi.org/10.1007/978-981-97-1717-0_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1716-3
Online ISBN: 978-981-97-1717-0
eBook Packages: Computer ScienceComputer Science (R0)