Skip to main content

Text Onto Miner – A Semi Automated Ontology Building System

  • Conference paper
Foundations of Intelligent Systems (ISMIS 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4994))

Included in the following conference series:

Abstract

This paper presents an overview of the results of the project undertaken by the Warsaw University of Technology Institute of Computer Science as a part of research agreement with France Telecom. The project goal was to create a set of tools – both software and methods, that could be used to speed up and improve a process of creating ontologies. In the course of the project a new ontology building methodology has been devised, new text mining algorithms optimized for extracting information useful for building an ontology from text corpora have been proposed and an universal text mining toolkit – TOM Platform – have been implemented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. of the 20th Int’l. Conf. on VLDB, Santiago, Chile, Morgan Kaufmann, San Francisco (1994)

    Google Scholar 

  2. Ahonen-Myka, H.: Finding all frequent maximal sequences in text. In: Mladenic, D., Grobelnik, M. (eds.) Proc. of the 16th Int. Con. on Machine Learning ICML 1999 Workshop on Machine Learning in Text Data Analysis, pp. 11–17 (1999)

    Google Scholar 

  3. Beil, F., Ester, M., Xu, X.: Frequent term-based text clustering. In: KDD 2002 (2002)

    Google Scholar 

  4. Byrd, R., Ravin, Y.: Identifying and extracting relations from text. In: NLDB 1999 - 4th Int. Con. on Applications of Natural Language to Information Systems (1999)

    Google Scholar 

  5. Faure, D., Nedellec, C.: A corpus-based conceptual clustering method for verb frames and ontology acquisition. In: LREC Workshop on Adapting Lexical and Corpus Resources to Sublanguages and Applications, Granada, Spain (1998)

    Google Scholar 

  6. Fung, B.C.M., Wan, K., Ester, M.: Hierarchical document clustering Using Frequent Item-sets. In: SDM 2003 (2003)

    Google Scholar 

  7. Grefenstette, G.: Evaluation Techniques for Automatic Semantic Extraction: Comparing Syntatic and Window Based Approaches. In: Boguraev, B., Pustejovsky, J. (eds.) Corpus processing for Lexical Acquisition, pp. 205–216. MIT Press, Cambridge (1995)

    Google Scholar 

  8. Guarino, N., Welty, C.: Evaluating ontological decisions with Ontoclean. Comm. of ACM 45(2) (2002)

    Google Scholar 

  9. Hamon, T., Nazarenko, A., Gros, C.: A step towards the detection of semantic variants of terms in technical documents. In: Proc. 36th Ann. Meeting of ACL (1998)

    Google Scholar 

  10. Harris, Z.: Distributional structure. Word 10(23), 146–162 (1954)

    Google Scholar 

  11. Skonieczny, K.M.: Hierarchical document clustering using frequent closed sets. In. Proc. IIPWM (2006)

    Google Scholar 

  12. Lame, G.: Using text analysis techniques to identify legal ontologie’s components. In: ICAIL 2003, Workshop on Legal Ontologies & Web Based Legal Inf. Manag. (2003)

    Google Scholar 

  13. Lucene home page, http://www.apache.org/lucene

  14. Maedche, A., Staab, S.: Ontology Learning, Handbook on Ontologies. Springer Series on Handbooks in Information Systems. Springer, Heidelberg (2003)

    Google Scholar 

  15. Maedche, A., Staab, S.: Mining Ontologies from Text. In: Dieng, R., Corby, O. (eds.) EKAW 2000. LNCS (LNAI), vol. 1937, pp. 189–202. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  16. Morin, E.: Automatic acquisition of semantic relations between terms from technical corpora. In: Proc. 5th Int’l. Congress on TKE (1999)

    Google Scholar 

  17. Noy, F.N., McGuinness, D.L.: Ontology Development 101: A Guide to Creating Your First Ontology. Stanford Knowledge Systems Laboratory Technical Report KSL-01-05 and Stanford Medical Informatics Techn. Rep. SMI-2001-0880

    Google Scholar 

  18. Protaziuk, G., et al.: TOM Platform Reference Manual, Techn. Rep., WUT (2006)

    Google Scholar 

  19. Protaziuk, G., et al.: Discovering Compound and Proper Nouns. In: Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A. (eds.) RSEISP 2007. LNCS (LNAI), vol. 4585, Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  20. Protaziuk, G., et al.: State of The Art on Ontology and Vocabulary Building & Maintenance Research And Applications, Techn. Rep., WUT (2006)

    Google Scholar 

  21. Rybinski, H., et al.: Discovering Synonyms based on Frequent Termsets. In: Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A. (eds.) RSEISP 2007. LNCS (LNAI), vol. 4585, Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  22. Rybinski, H., et al.: Discovering Word Meanings Based on Frequent Termsets. In: MCD Workshop, PKDD, Warsaw (2007)

    Google Scholar 

  23. Velardi, P., Fabriani, P., Missikoff, M.: Using text processing techniques to automatically enrich a domain ontology. In: Proc. Int’l. Conf. on FOIS (2001)

    Google Scholar 

  24. Wu, H., Zhou, M.: Optimizing Synonym Extraction Using Monolingual and Bilingual Resources. In: Ann. Meeting ACL, Proc. 2nd Int’l Workshop on Paraphrasing, vol. 16, pp. 72–79 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Aijun An Stan Matwin Zbigniew W. Raś Dominik Ślęzak

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gawrysiak, P., Protaziuk, G., Rybinski, H., Delteil, A. (2008). Text Onto Miner – A Semi Automated Ontology Building System. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds) Foundations of Intelligent Systems. ISMIS 2008. Lecture Notes in Computer Science(), vol 4994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68123-6_61

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68123-6_61

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68122-9

  • Online ISBN: 978-3-540-68123-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics