skip to main content
10.1145/2536146.2536147acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedesConference Proceedingsconference-collections
research-article

Structural and semantic similarity for XML comparison

Authors Info & Claims
Published:28 October 2013Publication History

ABSTRACT

XML has experimented a rapid growth mostly because of its application on the Web. Application varies from version control management, data storage to clustering and information retrieval. In this context, it is necessary to develop efficient techniques for comparing XML documents. Many method proposed are based only on structural commonalities, ignoring semantics. In this paper, we propose a new method for comparing XML documents based on LevelEdge combining tag structural and semantic similarities.

References

  1. P. Antonellis, C. Makris, and N. Tsirakis. Xedge: clustering homogeneous and heterogeneous xml documents using edge summaries. In Proceedings of the 2008 ACM symposium on Applied computing, SAC '08, pages 1081--1088, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. S. Chawathe. Comparing hierarchical data in external memory. In Proceedings of the 25th International Conference on Very Large Data Bases, VLDB '99, pages 90--101, San Francisco, CA, USA, 1999. Morgan Kaufmann Publishers Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Lin. An information-theoretic definition of similarity. In Proceedings of the Fifteenth International Conference on Machine Learning, ICML '98, pages 296--304, San Francisco, CA, USA, 1998. Morgan Kaufmann Publishers Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Nayak and S. Xu. Xcls: A fast and effective clustering algorithm for heterogenous xml documents. Lecture Notes in Computer Science, pages 292--302, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. R. Sibson. Slink: An optimally efficient algorithm for the single-link cluster method. The Computer Journal, 16(1): 30--34, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  6. J. Tekli and R. Chbeir. A novel xml document structure comparison framework based-on sub-tree commonalities and label semantics. Web Semant., 11: 14--40, Mar. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Tekli, R. Chbeir, and K. Yetongnon. Structural similarity evaluation between xml documents and dtds. In Proceedings of the 8th international conference on Web information systems engineering, WISE'07, pages 196--211, Berlin, Heidelberg, 2007. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Q. Wang, Z. Ren, L. Dong, and Z. Sheng. Path-based xml relational storage approach. Physics Procedia, 33(0): 1621--1625, 2012. 2012 International Conference on Medical Physics and Biomedical Engineering (ICMPBE2012).Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Structural and semantic similarity for XML comparison

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      MEDES '13: Proceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems
      October 2013
      358 pages
      ISBN:9781450320047
      DOI:10.1145/2536146
      • Conference Chairs:
      • Latif Ladid,
      • Antonio Montes,
      • General Chair:
      • Peter A. Bruck,
      • Program Chairs:
      • Fernando Ferri,
      • Richard Chbeir

      Copyright © 2013 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 October 2013

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MEDES '13 Paper Acceptance Rate56of122submissions,46%Overall Acceptance Rate267of682submissions,39%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader