Skip to main content

Towards Vietnamese Entity Disambiguation

  • Conference paper
  • 1014 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 245))

Abstract

Entity Disambiguation (ED) is a fundamental task in Natural Language Processing (NLP). The term Entity is used to mean either a Named Entity or an Abstract Concept. Although there have been many works on the ED task for English and some for Vietnamese, this is the first time this paper tackles the general ED task for Vietnamese that deal with both named entities and abstract concepts. In this paper, we propose a method for linking named entities and abstract concepts in Vietnamese documents to the corresponding articles in the Vietnamese Wikipedia. In particular, it first has to recognize Vietnamese entity mentions, i.e., phrases that represent named entities or abstract concepts. Experimental evaluation is also presented to demonstrate the performance of the proposed method.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Nguyen, H.T., Cao, T.H., Nguyen, T.T., Vo-Thi, T.-L.: Heuristics and Statistics-based Wikification. In: Anthony, P., Ishizuka, M., Lukose, D. (eds.) PRICAI 2012. LNCS, vol. 7458, pp. 879–882. Springer, Heidelberg (2012)

    Google Scholar 

  2. Mihalcea, R., Csomai, A.: Wikify!: Linking Documents to Encyclopedic Knowledge. In: Proc. of the 16th ACM International Conference on Information and Knowledge Management, pp. 233–242 (2007)

    Google Scholar 

  3. Milne, D., Witten, I.H.: Learning to Link with Wikipedia. In: Proc. of the 17th ACM International Conference on Information and Knowledge Management, pp. 509–518 (2008)

    Google Scholar 

  4. Nguyen, H.T., Cao, T.H.: A Knowledge-based Method to Resolve Name Ambiguity in Vietnamese Texts. In: Addendum Contributions of the 5th International Conference on Research, Innovation and Vision for the Future, Studia Informatica Universalis, pp. 83–88 (2007)

    Google Scholar 

  5. Ji, H., Grishman, R., Dang, H.T.: An Overview of the TAC 2011 Knowledge Base Population Track. In: Proc. of Text Analysis Conference (2011)

    Google Scholar 

  6. Ji, H., Grishman, R.: Knowledge Base Population Successful Approaches and Challenge. In: Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 1148–1158 (2011)

    Google Scholar 

  7. Zhang, W., Su, J., Tan, C.L., Wang, W.: Entity Linking Leveraging Automatically Genrated Annotation. In: Proc. of 23rd International Conference on Computational Linguistics, pp. 1290–1298 (2010)

    Google Scholar 

  8. Han, X., Sun, L., Zhao, J.: Collective Entity Linking in Web Text: A Graph Based Method. In: Proc. of the 34th Annual ACM Special Interest Group on Information Retrieval Conference, pp. 765–774 (2011)

    Google Scholar 

  9. Pham, T.X.T., Tran, T.Q., Dinh, D., Collier, N.: Named Entity Recognition in Vietnamese Using Classifier Voting. ACM Transactions on Asian Language Information Processing 6(4) (2007)

    Google Scholar 

  10. Dinh, D.: Natural Language Processing. VNU-Ho Chi Minh Publisher (2006) (in Vietnamese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Truong, L.M., Cao, T.H., Dinh, D. (2014). Towards Vietnamese Entity Disambiguation. In: Huynh, V., Denoeux, T., Tran, D., Le, A., Pham, S. (eds) Knowledge and Systems Engineering. Advances in Intelligent Systems and Computing, vol 245. Springer, Cham. https://doi.org/10.1007/978-3-319-02821-7_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-02821-7_26

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02820-0

  • Online ISBN: 978-3-319-02821-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics