skip to main content
10.1145/1247480.1247518acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Identifying meaningful return information for XML keyword search

Published:11 June 2007Publication History

ABSTRACT

Keyword search enables web users to easily access XML data without the need to learn a structured query language and to study possibly complex data schemas. Existing work has addressed the problem of selecting qualified data nodes that match keywords and connecting them in a meaningful way, in the spirit of inferring a where clause in XQuery. However, how to infer the return clause for keyword search is an open problem.

To address this challenge, we present an XML keyword search engine, XSeek, to infer the semantics of the search and identify return nodes effectively. XSeek recognizes possible entities and attributes inherently represented in the data. It also distinguishes between search predicates and return specifications in the keywords. Then based on the analysis of both XML data structures and keyword match patterns, XSeek generates return nodes. Extensive experimental studies show the effectiveness of XSeek.

References

  1. XQuery 1.0: An XML query language, June 2001. http://www.w3.org/XML/Query.Google ScholarGoogle Scholar
  2. S. Agrawal, S. Chaudhuri, and G. Das. DBXplorer: A System for Keyword-Based Search over Relational Databases. In Proceedings of ICDE, pages 5--16, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Amer-Yahia, C. Botev, J. Dorre, and J. Shanmugasundaram. XQuery Full-Text extensions explained. In IBM Systems Journal, pages 335--352, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Barg and R. K. Wong. Structural proximity searching for large collections of semi-structured data. In Proceedings of CIKM, pages 175--182, New York, NY, USA, 2001. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Bhalotia, C. Nakhe, A. Hulgeri, S. Chakrabarti, and S. Sudarshan. Keyword Searching and Browsing in Databases using BANKS. In ICDE, 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Carmel, Y. Maarek, Y. Mass, N. Efraty, and G. Landau. An extension of the vector space model for querying xml documents via xml fragments. In ACM SIGIR 2002: Workshop on XML and Information Retrieval, 2002.Google ScholarGoogle Scholar
  7. J. Clark and S. DeRose. XML Path language (XPath) 1.0, November 1999. http://www.w3.org/TR/xpath.Google ScholarGoogle Scholar
  8. S. Cohen, J. Mamou, Y. Kanza, and Y. Sagiv. XSEarch: A semantic Search Engine for XML, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Deutsch, M. Fernandez, and D. Suciu. Storing semistructured data with STORED. In SIGMOD '99: Proceedings of the 1999 ACM SIGMOD international conference on Management of data, pages 431--442, New York, NY, USA, 1999. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Florescu, D. Kossmann, and I. Manolescu. Integrating Keyword Search into XML Query Processing. Computer Networks( Amsterdam, Netherlands: 1999), 33(1--6):119--135, 2000.Google ScholarGoogle Scholar
  11. N. Fuhr and K. GroBjohann. XIRQL: A Query Language for Information Retrieval in XML Documents. In Proceedings of SIGIR, pages 172--180, New York, NY, USA, 2001. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. XRANK: Ranked Keyword Search over XML Documents. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pages 16--27, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. V. Hristidis, N. Koudas, Y. Papakonstantinou, and D. Srivastava. Keyword Proximity Search in XML Trees. IEEE Transactions on Knowledge and Data Engineering, 18(4), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. Hristidis and Y. Papakonstantinou. Discover: Keyword search in relational databases. In Procs. VLDB, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. V. Hristidis, Y. Papakonstantinou, and A. Balmin. Keyword Proximity Search on XML Graphs, 2003. In ICDE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Koutrika, A. Simitsis, and Y. E. Ioannidis. Précis: The essence of a query answer. In ICDE, page 69, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Y. Li, C. Yu, and H. V. Jagadish. Schema-Free XQuery. In VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. F. Liu, C. Yu, W. Meng, and A. Chowdhury. Effective keyword search in relational databases. In SIGMOD '06: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pages 563--574, New York, NY, USA, 2006. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. V. Vesper. Let's Do Dewey. http://www.mtsu.edu/vvesper/dewey.html.Google ScholarGoogle Scholar
  20. Extensible markup language (xml) 1.0, 2004. http://www.w3.org/TR/REC-xml/.Google ScholarGoogle Scholar
  21. Y. Xu and Y. Papakonstantinou. Efficient Keyword Search for Smallest LCAs in XML Databases. In Proceedings of the 2005 ACMS IGMOD International Conference on Management of Data, pages 527--538, New York, NY, USA, 2005. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. Yu and H. V. Jagadish. Schema Summarization. In Proceedings of VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Identifying meaningful return information for XML keyword search

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
      June 2007
      1210 pages
      ISBN:9781595936868
      DOI:10.1145/1247480
      • General Chairs:
      • Lizhu Zhou,
      • Tok Wang Ling,
      • Program Chair:
      • Beng Chin Ooi

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 June 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader