Text Onto Miner – A Semi Automated Ontology Building System

Gawrysiak, Piotr; Protaziuk, Grzegorz; Rybinski, Henryk; Delteil, Alexandre

doi:10.1007/978-3-540-68123-6_61

Piotr Gawrysiak¹,
Grzegorz Protaziuk¹,
Henryk Rybinski¹ &
…
Alexandre Delteil²

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4994)

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

Abstract

This paper presents an overview of the results of a project undertaken by the Institute of Computer Science at the Warsaw University of Technology as part of a research agreement with France Telecom. The goal of the project was to create a set of tools, both software and methods, that could be used to speed up and improve the process of creating ontologies. In the course of the project, a new ontology building methodology was devised, new text mining algorithms optimized for extracting information useful for building an ontology from text corpora were proposed, and a universal text mining toolkit, the TOM Platform, was implemented.

Author information

Authors and Affiliations

ICS, Warsaw University of Technology,
Piotr Gawrysiak, Grzegorz Protaziuk & Henryk Rybinski
France Telecom R&D,
Alexandre Delteil

Editor information

Aijun An, Stan Matwin, Zbigniew W. Raś, Dominik Ślęzak

About this paper

Cite this paper

Gawrysiak, P., Protaziuk, G., Rybinski, H., Delteil, A. (2008). Text Onto Miner – A Semi Automated Ontology Building System. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds) Foundations of Intelligent Systems. ISMIS 2008. Lecture Notes in Computer Science, vol 4994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68123-6_61

DOI: https://doi.org/10.1007/978-3-540-68123-6_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68122-9
Online ISBN: 978-3-540-68123-6
eBook Packages: Computer Science, Computer Science (R0)
