skip to main content
10.3115/1218955.1219009dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free Access

Dependency tree kernels for relation extraction

Published:21 July 2004Publication History

ABSTRACT

We extend previous work on tree kernels to estimate the similarity between the dependency trees of sentences. Using this kernel within a Support Vector Machine, we detect and classify relations between entities in the Automatic Content Extraction (ACE) corpus of news articles. We examine the utility of different features such as Wordnet hypernyms, parts of speech, and entity types, and find that the dependency tree kernel achieves a 20% F1 improvement over a "bag-of-words" kernel.

References

  1. Eugene Agichtein and Luis Gravano. 2000. Snowball: Extracting relations from large plain-text collections. In Proceedings of the Fifth ACM International Conference on Digital Libraries. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Sergey Brin. 1998. Extracting patterns and relations from the world wide web. In WebDB Workshop at 6th International Conference on Extending Database Technology, EDBT'98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Collins and N. Duffy. 2002. Convolution kernels for natural language. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems 14, Cambridge, MA. MIT Press.Google ScholarGoogle Scholar
  4. Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning, 20(3):273--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Cristianini and J. Shawe-Taylor. 2000. An introduction to support vector machines. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chad M. Cumby and Dan Roth. 2003. On kernel methods for relational learning. In Tom Fawcett and Nina Mishra, editors, Machine Learning, Proceedings of the Twentieth International Conference (ICML 2003), August 21--24, 2003, Washington, DC, USA. AAAI Press.Google ScholarGoogle Scholar
  7. K. Fukunaga. 1990. Introduction to Statistical Pattern Recognition. Academic Press, second edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Haussler. 1999. Convolution kernels on discrete structures. Technical Report UCS-CRL-99-10, University of California, Santa Cruz.Google ScholarGoogle Scholar
  9. Thorsten Joachims, Nello Cristianini, and John Shawe-Taylor. 2001. Composite kernels for hypertext categorisation. In Carla Brodley and Andrea Danyluk, editors, Proceedings of ICML-01, 18th International Conference on Machine Learning, pages 250--257, Williams College, US. Morgan Kaufmann Publishers, San Francisco, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Huma Lodhi, John Shawe-Taylor, Nello Cristianini, and Christopher J. C. H. Watkins. 2000. Text classification using string kernels. In NIPS, pages 563--569.Google ScholarGoogle Scholar
  11. A. McCallum and B. Wellner. 2003. Toward conditional models of identity uncertainty with application to proper noun coreference. In IJCAI Workshop on Information Integration on the Web.Google ScholarGoogle Scholar
  12. S. Miller, H. Fox, L. Ramshaw, and R. Weischedel. 2000. A novel use of statistical parsing to extract information from text. In 6th Applied Natural Language Processing Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. 2002. Identity uncertainty and citation matching.Google ScholarGoogle Scholar
  14. Dan Roth and Wen-tau Yih. 2002. Probabilistic reasoning for entity and relation recognition. In 19th International Conference on Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Sam Scott and Stan Matwin. 1999. Feature engineering for text classification. In Proceedings of ICML-99, 16th International Conference on Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In Walter Daelemans and Miles Osborne, editors, Proceedings of CoNLL-2003, pages 142--147. Edmonton, Canada. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Vladimir Vapnik. 1998. Statistical Learning Theory. Whiley, Chichester, GB.Google ScholarGoogle Scholar
  18. D. Zelenko, C. Aone, and A. Richardella. 2003. Kernel methods for relation extraction. Journal of Machine Learning Research, pages 1083--1106. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image DL Hosted proceedings
    ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
    July 2004
    729 pages

    Publisher

    Association for Computational Linguistics

    United States

    Publication History

    • Published: 21 July 2004

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate85of443submissions,19%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader