skip to main content
10.3115/1220355.1220382dlproceedingsArticle/Chapter ViewAbstractPublication PagescolingConference Proceedingsconference-collections
Article
Free Access

Learning theories from text

Published:23 August 2004Publication History

ABSTRACT

In this paper we describe a method of automatically learning domain theories from parsed corpora of sentences from the relevant domain and use FSA techniques for the graphical representation of such a theory. By a 'domain theory' we mean a collection of facts and generalisations or rules which capture what commonly happens (or does not happen) in some domain of interest. As language users, we implicitly draw on such theories in various disambiguation tasks, such as anaphora resolution and prepositional phrase attachment, and formal encodings of domain theories can be used for this purpose in natural language processing. They may also be objects of interest in their own right, that is, as the output of a knowledge discovery process. The approach is generizable to different domains provided it is possible to get logical forms for the text in the domain.

References

  1. A. H. Aho, J. E. Hopcroft, and J. D. Ullman. 1974. The Design and Analysis of Computer Algorithms. Addison-Wesley Publishing Company. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. H. Aho, R. Sethi, and J. D. Ullman. 1986. Compilers - Principles, Techniques, and Tools. Addison-Wesley, Reading, Massachusetts, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. O. Collin, F. Duclaye, and F. Yvon. 2002. Learning Paraphrases to Improve a Question-Answering System. staff.science.uva.nl/mdr/NLP4QA/10duclaye-et-al.pdf.Google ScholarGoogle Scholar
  4. Luc DeHaspe. 1998. Frequent Pattern Discovery in First-Order Logic. Ph.D. thesis, Katholieke Universiteit Leuven.Google ScholarGoogle Scholar
  5. G. Doddington and C. H. J. Godfrey. 1990. The ATIS Spoken Language Systems Pilot Corpus. In Speech and Natural Language Workshop, Hidden Valley, Pennsylvania. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. Grishman and B. Sundheim. 1995. "Message Understanding Conference-6: A Brief History". www.cs.nyu.edu/cs/projects/proteus/muc/muc6-history-coling.ps.Google ScholarGoogle Scholar
  7. M. Liakata and S. Pulman. 2002. From Trees to Predicate-Argument Structures. In International Conference for Computational Linguistics (COLING), pages 563--569, Taipei, Taiwan. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Lin and P. Pantel. 2001. Dirt-Discovery of Inference Rules from Text. In In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 323--328. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Marcus, G. Kim, M. Marcinkiewicz, R. Mac-Intyre, A. Bies, M. Ferguson, K. Katz, and B. Schasberger. 1994. The Penn Treebank: Annotating predicate argument structure. In ARPA Human Language Technology Workshop. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Stephen Muggleton. 1995. Inverse Entailment and Progol. New Generation Computing, special issue on Inductive Logic Programming, 13(3--4):245--286.Google ScholarGoogle Scholar
  11. Stephen Pulman. 2000. Statistical and Logical Reasoning in Disambiguation. Philosophical Transactions of the Royal Society, 358 number 1769:1267--1279.Google ScholarGoogle Scholar
  12. Ashwin Srinivasan. 1999. "the Aleph Manual". www.comlab.ox.ac.uk/oucl/research/areas/machlearn/Aleph/.Google ScholarGoogle Scholar
  13. M. Subramani and G. F. Cooper. 1999. Causal Discovery from Medical Textual Data. http://www.amia.org/pubs/symposia/D200558.PDF.Google ScholarGoogle Scholar
  14. Gertjan van Noord. 2002. FSA6 Reference Manual. http://odur.let.rug.nl/vannoord/Fsa/.Google ScholarGoogle Scholar
  1. Learning theories from text

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image DL Hosted proceedings
          COLING '04: Proceedings of the 20th international conference on Computational Linguistics
          August 2004
          1411 pages

          Publisher

          Association for Computational Linguistics

          United States

          Publication History

          • Published: 23 August 2004

          Qualifiers

          • Article

          Acceptance Rates

          COLING '04 Paper Acceptance Rate1,411of1,411submissions,100%Overall Acceptance Rate1,537of1,537submissions,100%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader