ABSTRACT
Named Entity Linking (NEL) in microblogs is a challenging task due to the use of cryptic abbreviations, insufficient contextual information, and the time-varying importance of entities. We propose three techniques to address these challenges: Mention Normalization, Contextual Enrichment, and Temporal Entity Importance. By combining these novel techniques, we achieve a 13% improvement in precision over a state-of-the-art NEL tool.
Index Terms
- AIDA-Social: Entity Linking on the Social Stream