Skip to main content

Simple Yet Effective Method for Entity Linking in Microblog-Genre Text

  • Conference paper
Book cover Natural Language Processing and Chinese Computing (NLPCC 2013)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 400))

Abstract

Semantic analysis microblog data is a challenging, emerging research area. Unlike news text, microblogs pose several new challenges, due to their short, noisy, contextualized and real-time nature. In this paper, we investigate how to link entities in microblog posts with knowledge base and adopt a cascade linking approach. In particular, we first use a mention expansion model to identify all possible entities in the knowledge base for a mention based on a variety of sources. Then we link the mentions with the corresponding entities in the knowledge base by collectively considering lexical matching, popularity probability and textual similarity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Meij, E., Weerkamp, W., Rijke, M.D.: Adding Semantics to Microblog Posts. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 563–572 (2012)

    Google Scholar 

  2. Guo, S., Chang, M.W., Kıcıman, E.: To Link or Not to Link? A Study on End-to-End Tweet Entity Linking. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2013)

    Google Scholar 

  3. Derczynski, L., Maynard, D., Aswani, N., Bontcheva, K.: Microblog-Genre Noise and Impact on Semantic Annotation Accuracy. In: Proceedings of the 24th ACM Conference on Hypertext and Social Media, pp. 21–30 (2013)

    Google Scholar 

  4. Cassidy, T., Ji, H., Ratinov, L., Zubiaga, A., Huang, H.Z.: Analysis and Enhancement of Wikification for Microblogs with Context Expansion. In: Proceedings of 24th International Conference on Computational Linguistics, pp. 441–456 (2012)

    Google Scholar 

  5. Bontcheva, K., Rout, D.: Making Sense of Social Media Streams through Semantics: a Survey. Semantic Web Journal (2012)

    Google Scholar 

  6. Kiryakov, A., Popov, B., Ognyanoff, D., Manov, D., Kirilov, A., Goranov, M.: Semantic Annotation, Indexing and Retrieval. Journal of Web Semantics 1(2), 49–79 (2004)

    Article  Google Scholar 

  7. Mihalcea, R., Csomai, A.: Wikify! Linking Documents to Encyclopedic Knowledge. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 233–242 (2007)

    Google Scholar 

  8. Milne, D., Witten, I.H.: Learning to Link with Wikipedia. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 509–518 (2008)

    Google Scholar 

  9. Ferragina, P., Scaiella, U.: TAGME: On-the-fly Annotation of Short Text Fragments. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 1625–1628 (2010)

    Google Scholar 

  10. Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia Spotlight: Shedding Light on the Web of Documents. In: Proceedings of the 7th International Conference on Semantic Systems, pp. 1–8 (2011)

    Google Scholar 

  11. Ratinov, L., Roth, D.: Design Challenges and Misconceptions in Named Entity Recognition. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning, pp. 147–155 (2009)

    Google Scholar 

  12. Yosef, M.A., Hoffart, J., Bordino, I., Spaniol, M., Weikum, G.: AIDA: an Online Tool for Accurate Disambiguation of Named Entities in Text and Tables. In: Proceedings of the PVLDB 2011, pp. 1450–1453 (2011)

    Google Scholar 

  13. Varma, V., Bharat, V., Kovelamudi, S., Bysani, P.: GSK, S., Kumar, N. K., Reddy, K., Kumar, K., Maganti, N.: IIIT Hyderabad at TAC 2009. In: Proceedings of Text Analysis Conference, TAC (2009)

    Google Scholar 

  14. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia. In: Proceedings of the International World Wide Web Conference, pp. 697–706 (2007)

    Google Scholar 

  15. Han, X.P., Sun, L.: A Generative Entity-Mention Model for Linking Entities with Knowledge Base. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 945–954 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Miao, Q., Lu, H., Zhang, S., Meng, Y. (2013). Simple Yet Effective Method for Entity Linking in Microblog-Genre Text. In: Zhou, G., Li, J., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2013. Communications in Computer and Information Science, vol 400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41644-6_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41644-6_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41643-9

  • Online ISBN: 978-3-642-41644-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics