ABSTRACT
We extend previous work on tree kernels to estimate the similarity between the dependency trees of sentences. Using this kernel within a Support Vector Machine, we detect and classify relations between entities in the Automatic Content Extraction (ACE) corpus of news articles. We examine the utility of different features such as Wordnet hypernyms, parts of speech, and entity types, and find that the dependency tree kernel achieves a 20% F1 improvement over a "bag-of-words" kernel.
- Eugene Agichtein and Luis Gravano. 2000. Snowball: Extracting relations from large plain-text collections. In Proceedings of the Fifth ACM International Conference on Digital Libraries. Google ScholarDigital Library
- Sergey Brin. 1998. Extracting patterns and relations from the world wide web. In WebDB Workshop at 6th International Conference on Extending Database Technology, EDBT'98. Google ScholarDigital Library
- M. Collins and N. Duffy. 2002. Convolution kernels for natural language. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems 14, Cambridge, MA. MIT Press.Google Scholar
- Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning, 20(3):273--297. Google ScholarDigital Library
- N. Cristianini and J. Shawe-Taylor. 2000. An introduction to support vector machines. Cambridge University Press. Google ScholarDigital Library
- Chad M. Cumby and Dan Roth. 2003. On kernel methods for relational learning. In Tom Fawcett and Nina Mishra, editors, Machine Learning, Proceedings of the Twentieth International Conference (ICML 2003), August 21--24, 2003, Washington, DC, USA. AAAI Press.Google Scholar
- K. Fukunaga. 1990. Introduction to Statistical Pattern Recognition. Academic Press, second edition. Google ScholarDigital Library
- D. Haussler. 1999. Convolution kernels on discrete structures. Technical Report UCS-CRL-99-10, University of California, Santa Cruz.Google Scholar
- Thorsten Joachims, Nello Cristianini, and John Shawe-Taylor. 2001. Composite kernels for hypertext categorisation. In Carla Brodley and Andrea Danyluk, editors, Proceedings of ICML-01, 18th International Conference on Machine Learning, pages 250--257, Williams College, US. Morgan Kaufmann Publishers, San Francisco, US. Google ScholarDigital Library
- Huma Lodhi, John Shawe-Taylor, Nello Cristianini, and Christopher J. C. H. Watkins. 2000. Text classification using string kernels. In NIPS, pages 563--569.Google Scholar
- A. McCallum and B. Wellner. 2003. Toward conditional models of identity uncertainty with application to proper noun coreference. In IJCAI Workshop on Information Integration on the Web.Google Scholar
- S. Miller, H. Fox, L. Ramshaw, and R. Weischedel. 2000. A novel use of statistical parsing to extract information from text. In 6th Applied Natural Language Processing Conference. Google ScholarDigital Library
- H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. 2002. Identity uncertainty and citation matching.Google Scholar
- Dan Roth and Wen-tau Yih. 2002. Probabilistic reasoning for entity and relation recognition. In 19th International Conference on Computational Linguistics. Google ScholarDigital Library
- Sam Scott and Stan Matwin. 1999. Feature engineering for text classification. In Proceedings of ICML-99, 16th International Conference on Machine Learning. Google ScholarDigital Library
- Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In Walter Daelemans and Miles Osborne, editors, Proceedings of CoNLL-2003, pages 142--147. Edmonton, Canada. Google ScholarDigital Library
- Vladimir Vapnik. 1998. Statistical Learning Theory. Whiley, Chichester, GB.Google Scholar
- D. Zelenko, C. Aone, and A. Richardella. 2003. Kernel methods for relation extraction. Journal of Machine Learning Research, pages 1083--1106. Google ScholarDigital Library
Recommendations
Semantic Compositionality in Tree Kernels
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementKernel-based learning has been largely applied to semantic textual inference tasks. In particular, Tree Kernels (TKs) are crucial in the modeling of syntactic similarity between linguistic instances in Question Answering or Information Extraction tasks. ...
Dependency tree kernels for relation extraction from natural language text
ECMLPKDD'09: Proceedings of the 2009th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part IIThe automatic extraction of relations from unstructured natural text is challenging but offers practical solutions for many problems like automatic text understanding and semantic retrieval. Relation extraction can be formulated as a classification ...
Dependency trigram model for social relation extraction from news articles
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalWe propose a kernel-based model to automatically extract social relations such as economic relations and political relations between two people from news articles. To determine whether two people are structurally associated with each other, the proposed ...
Comments