Skip to main content

Weighted Pseudo-distances for Categorization in Semantic Hierarchies

  • Conference paper
Conceptual Structures: Common Semantics for Sharing Knowledge (ICCS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3596))

Included in the following conference series:

Abstract

Ontologies, taxonomies, and other semantic hierarchies are increasingly necessary for organizing large quantities of data. We continue our development of knowledge discovery techniques based on combinatorial algorithms rooted in order theory by aiming to supplement the pseudo-distances previously developed as structural measures of vertical height in poset-based ontologies with quantitative measures of vertical distance based on additional statistical information. In this way, we seek to accommodate weighting of different portions of the underlying ontology according to this external information source. We also wish to improve on the deficiencies of existing such measures, in particular Resnik’s measure of semantic similarity in lexical databases such as Wordnet. We begin by recalling and developing some basic concepts for ordered data objects, including our pseudo-distances and the operation of probability distributions as weights on posets. We then discuss and critique Resnik’s measure before introducing our own sense of links weights and weighted normalized pseudo-distances among comparable nodes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aho, A.V., Garey, M.R., Ullman, J.D.: The Transitive Reduction of a Directed Graph. SIAM Journal of Computing 1(2), 131–137 (1972)

    Article  MATH  MathSciNet  Google Scholar 

  2. Bodenreider, O., Mitchell, J.A., McCray, A.T.: Evaluation of the UMLS As a Terminology and Knowledge Resource for Biomedical Informatics. In: AMIA 2002 Annual Symposium, pp. 61–65 (2002)

    Google Scholar 

  3. Davis, A.R.: Types and Constraints for Lexical Semantics and Linking, Cambridge, UP (2000)

    Google Scholar 

  4. Ganter, B., Wille, R.: Formal Concept Analysis. Springer, Heidelberg (1999)

    MATH  Google Scholar 

  5. Gene Ontology Consortium: Gene Ontology: Tool For the Unification of Biology. Nature Genetics 25(1), 25–29 (2000)

    Google Scholar 

  6. Joslyn, C.A.: Poset Ontologies and Concept Lattices as Semantic Hierarchies. In: Wolff, K.E., Pfeiffer, H.D., Delugach, H.S. (eds.) ICCS 2004. LNCS (LNAI), vol. 3127, pp. 287–302. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  7. Joslyn, C., Mniszewski, S., Fulmer, A., Heaton, G.G.: The Gene Ontology Categorizer. Bioinformatics 20(s1), 169–177 (2004)

    Article  Google Scholar 

  8. Joslyn, C., Oliverira, J., Scherrer, C.: Order Theoretical Knowledge Discovery: A White Paper, LAUR = 04-5812 (2004), ftp://ftp.c3.lanl.gov/pub/users/joslyn/white.pdf

  9. Joslyn, C., Cohn, J.D., Verspoor, K.M., Mniszewski, S.M.: Automating Ontological Function Annotation: Towards a Common Methodological Framework. Submitted to 2005 Bio-Ontologies Meeting, ISMB 2005 (2005)

    Google Scholar 

  10. Klir, G., Elias, D.: Architecture of Systems Problem Solving, 2nd edn. Plenum, New York (2003)

    MATH  Google Scholar 

  11. Klir, G., Yuan, B.: Fuzzy Sets and Fuzzy Logic. Prentice-Hall, New York (1995)

    MATH  Google Scholar 

  12. Knoblock, Todd, B., Rehof, J.: Type Elaboration and Subtype Completion for Java Bytecode. In: Proc. 27th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (2000)

    Google Scholar 

  13. Lord, P.W., Stevens, R., Brass, A., Goble, C.: Investigating Semantic Similarity Measures Across the Gene Ontology: the Relationship Between Sequence and Annotation. Bioinformatics 10, 1275–1283 (2003)

    Article  Google Scholar 

  14. Monjardet, B.: Metrics on Partially Ordered Sets - A Survey. Discrete Mathematics 35, 173–184 (1981)

    Article  MATH  MathSciNet  Google Scholar 

  15. Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: Int. Joint Conf. on Artificial Intelligence, pp. 448–452. Morgan Kaufmann, San Francisco (1995)

    Google Scholar 

  16. Schröder, Bernd, S.W.: Ordered Sets. Birkhauser, Boston (2003)

    MATH  Google Scholar 

  17. Verspoor, K., Cohn, J., Joslyn, C., Mniszewski, S.M., Rechtsteiner, A., Rocha, L.M., Simas, T.: Protein Annotation as Term Categorization in the Gene Ontology Using Word Proximity Networks. BMC Bioinformatics 6(suppl. 1) (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Joslyn, C.A., Bruno, W.J. (2005). Weighted Pseudo-distances for Categorization in Semantic Hierarchies. In: Dau, F., Mugnier, ML., Stumme, G. (eds) Conceptual Structures: Common Semantics for Sharing Knowledge. ICCS 2005. Lecture Notes in Computer Science(), vol 3596. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11524564_26

Download citation

  • DOI: https://doi.org/10.1007/11524564_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27783-5

  • Online ISBN: 978-3-540-31885-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics