skip to main content
research-article
Free Access

Open information extraction from the web

Published:01 December 2008Publication History
Skip Abstract Section

Abstract

Targeted IE methods are transforming into open-ended techniques.

References

  1. Agichtein, E. and Gravano, L. Snowball: Extracting relations from large plain-text collections. In Proceedings of the 5th ACM International Conference on Digital Libraries (2000). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. ARPA. Proceedings of the 3rd Message Understanding Conference (1991).Google ScholarGoogle Scholar
  3. Banko, M., Cafarella, M., Soderland, S., Broadhead, M. and Etzioni, O. Open information extraction from the Web. In Proceedings of the International Joint Conference on Artificial Intelligence (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Banko, M. and Etzioni, O. The tradeoffs between traditional and open relation extraction. In Proceedings of the Association of Computational Linguistics (2008).Google ScholarGoogle Scholar
  5. Brin, S. Extracting patterns and relations from the World Wide Web. In Proceedings of the Workshop at the 6th International Conference on Extending Database Technology, (Valencia, Spain, 1998), 172--183. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Bunescu, R. and Mooney, R. Learning to extract relations from the Web using minimal supervision In Proceedings of the Association of Computational Linguistics (2007).Google ScholarGoogle Scholar
  7. Downey, D., Etzioni, O. and Soderland, S. A probabilistic model of redundancy in information extraction. In Proceedings of the International Joint Conference on Artificial Intelligence (2005). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Downey, D., Schoenmackers, S. and Etzioni, O. Sparse information extraction: Unsupervised language models to the rescue. In Proceedings of the Association of Computational Linguistics (2007).Google ScholarGoogle ScholarCross RefCross Ref
  9. Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A., Shaked, T., Soderland, S., Weld, D. and Yates, A. Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence 165,1 (2005), 91--134. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Feldman, R., Rosenfeld, B., Soderland, S. and Etzioni, O. Self-supervised relation extraction from the Web. In Proceedings of the International Symposium on Methodologies for Intelligent Systems (2006) 755--764. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kim, J. and Moldovan, D. Acquisition of semantic patterns for information extraction from corpora. In Proceedings of the 9th IEEE Conference on Artificial Intelligence for Applications (1993), 171--176.Google ScholarGoogle ScholarCross RefCross Ref
  12. Lafferty, J., McCallum, A. and Pereira, F. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 2001 International Conference on Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. McCallum, A. Efficiently inducing features of conditional random fields. In Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence (Acapulco, 2003), 403--410. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Poon, H. and Domingos, P. Joint inference in information extraction. In Proceedings of the 22nd National Conference on Artificial Intelligence (2007) 913--918. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Popescu, A. and Etzioni, O. Extracting product features and opinions from reviews. In Proceedings of the Empirical Methods on Natural Language Processing Conference (2005). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Popescu. A-M. Information extraction from unstructured Web text. Ph.D. thesis, University of Washington (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Riloff, E. Automatically constructing extraction patterns from untagged text. In Proceedings of the 13th National Conference on Artificial Intelligence (1996), 1044--1049. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Riloff, E. and Jones, R. Learning dictionaries for information extraction by multi-level bootstrapping. In Proceedings of the AAAI-99 Conference (1999), 1044--1049. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Schubert, L. Can we derive general world knowledge from texts? In Proceedings of the Human Language Technology Conference (2002). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Shinyama, Y. and Sekine, S. Preemptive information extraction using unrestricted relation discovery. In Proceedings of the Human Language Technology/NAACL Conference (2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Soderland, S. Learning information extraction rules for semi-structured and free text. Machine Learning 34, 1--3(1999), 233--272. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Soderland, S., Fisher, D., Aseltine, J. and Lehnert, W. CRYSTAL: Inducing a conceptual dictionary. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (1995), 1314--1321. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Weld, D., Wu, F., Adar, E., Amershi, S., Fogarty, J., Hoffmann, R., Patel, K. and Skinner, M. Intelligence in Wikipedia. In Proceedings of the 23rd Conference on Artificial Intelligence (2008). Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Yates, A. and Etzioni, O. Unsupervised resolution of objects and relations on the Web. In Proceedings of the Human-Language Technology Conference (2007).Google ScholarGoogle Scholar

Index Terms

  1. Open information extraction from the web

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Communications of the ACM
            Communications of the ACM  Volume 51, Issue 12
            Surviving the data deluge
            December 2008
            126 pages
            ISSN:0001-0782
            EISSN:1557-7317
            DOI:10.1145/1409360
            Issue’s Table of Contents

            Copyright © 2008 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 December 2008

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Popular
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format