A Term Normalization Method for Better Performance of Terminology Construction

Hwang, Myunggwon; Jeong, Do-Heon; Jung, Hanmin; Sung, Won-Kyoung; Shin, Juhyun; Kim, Pankoo

doi:10.1007/978-3-642-29347-4_79

A Term Normalization Method for Better Performance of Terminology Construction

Myunggwon Hwang²³,
Do-Heon Jeong²³,
Hanmin Jung²³,
Won-Kyoung Sung²³,
Juhyun Shin²⁴ &
…
Pankoo Kim²⁴

Conference paper

2206 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7267))

Abstract

The importance of research on knowledge management is growing due to recent issues with big data. The most fundamental steps in knowledge management are the extraction and construction of terminologies. Terms are often expressed in various forms and the term variations play a negative role, becoming an obstacle which causes knowledge systems to extract unnecessary knowledge. To solve the problem, we propose a method of term normalization which finds a normalized form (original and standard form defined in dictionaries) of variant terms. The method employs a couple of characteristics of terms: one is appearance similarity, which measures how similar terms are, and the other is context similarity which measures how many clue words they share. Through experiment, we show its positive influence of both similarities in the term normalization.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dowdal, J., Rinaldi, F., Ibekwe-SanJuan, F., SanJuan, E.: Complex Structuring of Term Variants for Question Answering. In: Proc. of the ACM Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, vol. 18, pp. 1–8 (2003)
Google Scholar
Ibekwe-Sanjuan, F.: Terminological Variation, a Means of Identifying Research Topics from Texts. In: Proc. of Intl. Conf. on Computational Linguistics, vol. 1, pp. 564–570 (1998)
Google Scholar
Porter, M.F.: An algorithm for suffix stripping. J. of Program 14(3), 130–137 (1980)
Article Google Scholar
Toutanova, K., Manning, C.: Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In: Proc. Joint SIGDAT Conf. Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 63–70 (2000)
Google Scholar
Hwang, M., Kim, P.: A New Similarity Measure for Automatic Construction of the Unknown Word Lexical Dictionary. Intl. J. on Semantic Web and Information Systems (IJSWIS) 5(1), 48–64 (2009)
Article Google Scholar
Hwang, M., Choi, C., Kim, P.: Automatic Enrichment of Semantic Relation Networks and its Application to Word Sense Disambiguation. IEEE Transactions on Knowledge and Data Engineering 23(6), 845–858 (2011)
Article Google Scholar
Brank, J., Mladenic, D., Grobelnik, M., Milic-Frayling, N.: Feature Selection for the Classification of Large Document Collections. Journal of Universal Computer Science 14(10), 1562–1596 (2008)
MathSciNet Google Scholar
Duong, T.H., Jo, G., Jung, J.J., Nguyen, N.T.: Complexity Analysis of Ontology Integration Methodologies: A Comparative Study. Journal of Universal Computer Science 15(4), 877–897 (2009)
MathSciNet MATH Google Scholar
Jung, J.J.: Semantic business process integration based on ontology alignment. Expert Systems with Applications 36(8), 11013–11020 (2009)
Article Google Scholar
Hwang, M., Choi, D., Choi, J., Kim, H., Kim, P.: Similarity Measure for Semantic Document Interconnections. Information-An International Interdisciplinary Journal 13(2), 253–267 (2010)
Google Scholar
Hwang, M., Choi, D., Kim, P.: A Method for Knowledge Base Enrichment using Wikipedia Document Information. Information-An International Interdisciplinary Journal 13(5), 1599–1612 (2010)
Google Scholar
Bawakid, A., Oussalah, M.: Using features extracted from Wikipedia for the task of Word Sense Disambiguation. In: Proc. of IEEE Intl. Conf. on Cybernetic Intelligent Systems, pp. 1–6 (2010)
Google Scholar
Fogarolli, A.: Word Sense Disambiguation Based on Wikipedia Link Structure. In: Proceedings of IEEE Intl. Conf. on Semantic Computing, pp. 77–82 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Korea Institute of Science and Technology Information (KISTI), 245 Daehak-ro, Yuseong-gu, Daejeon, South Korea
Myunggwon Hwang, Do-Heon Jeong, Hanmin Jung & Won-Kyoung Sung
Chosun University, 375 Seoseok-dong, Dong-gu, Gwangju, South Korea
Juhyun Shin & Pankoo Kim

Authors

Myunggwon Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Do-Heon Jeong
View author publications
You can also search for this author in PubMed Google Scholar
Hanmin Jung
View author publications
You can also search for this author in PubMed Google Scholar
Won-Kyoung Sung
View author publications
You can also search for this author in PubMed Google Scholar
Juhyun Shin
View author publications
You can also search for this author in PubMed Google Scholar
Pankoo Kim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Częstochowa University of Technology, Armii Krajowej 36, 42-200, Częstochowa, Poland
Leszek Rutkowski , Marcin Korytkowski & Rafał Scherer , &
AGH University of Science and Technology, Mickiewicza 30, 30-059, Kraków, Poland
Ryszard Tadeusiewicz
Department of Electrical Engineering and Computer Sciences, Computer Science Division, University of California Berkeley, 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh
Computational Intelligence Laboratory, Electrical and Computer Engineering, University of Louisville, 405 Lutz Hall, 40292, Louisville, KY, USA
Jacek M. Zurada

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hwang, M., Jeong, DH., Jung, H., Sung, WK., Shin, J., Kim, P. (2012). A Term Normalization Method for Better Performance of Terminology Construction. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2012. Lecture Notes in Computer Science(), vol 7267. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29347-4_79

Download citation

DOI: https://doi.org/10.1007/978-3-642-29347-4_79
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29346-7
Online ISBN: 978-3-642-29347-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics