Abstract
XML-based Selective Dissemination of Information (SDI) systems aims to quickly deliver useful information to the users based on their profiles or user subscriptions. These subscriptions are specified in the form of XML queries. This paper investigates how clustering and aggregation of user queries can help scale SDI systems by reducing the number of document-subscription matchings required. We design a new distance function to measure the similarity of query patterns, and develop a filtering technique called YFilter* that is based on YFilter. Experiment results show that the proposed approach is able to achieve high precision, high recall, while reducing runtime requirement.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Altinel, M., Franklin, M.J.: Efficient filtering ofXMLdocuments for selective dissemination of information. In: 26th Int. Conference on Very Large Data Bases (2000)
Busse, R., Carey, M.: Benchmark DTD for XMark, an XML Benchmark project (2002), http://monetdb.cwi.nl/xml/downloads.html
Chan, C.-Y., Fan, W., Felber, P., Garofalakis, M., Rastogi, R.: Tree pattern aggregation for scalable XML data dissemination. In: 28th Int. Conference on Very Large Data Bases (2002)
Chan, C.-Y., Felber, P., Garofalakis, M., Rastogi, R.: Efficient filtering of XML documents with XPath expressions. In: 18th IEEE Int. Conf. on Data Engineering (2002)
Diao, Y., Franklin, M.J.: High-Performance XML Filtering: An Overview of YFilter. IEEE Data Engineering Bulletin 26(1), 41–48 (2003)
Diaz, A.L., Lovell, D.: XML Generator (1999), http://www.alphaworks.ibm.com/tech/xmlgenerator
Shasha, D., Zhang, K.: Pattern Matching in Strings, Trees and Arrays. Oxford University Press, Oxford (1995)
W3C. XML Path Language (XPath) Version 1.0. (November 1999), http://www.w3.org/TR/xpath
W3C. XQuery 1.0: An XML Query Language (May 2003), http://www.w3.org/TR/xquery
Yang, L.H., Lee, M.L., Hsu, W., Acharya, S.: Mining frequent query patterns from XML queries. In: 8th Int. Symposium on Database Systems for Advanced Applications (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Yang, L.H., Lee, M.L., Hsu, W. (2004). Scaling SDI Systems via Query Clustering and Aggregation. In: Lee, Y., Li, J., Whang, KY., Lee, D. (eds) Database Systems for Advanced Applications. DASFAA 2004. Lecture Notes in Computer Science, vol 2973. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24571-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-24571-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21047-4
Online ISBN: 978-3-540-24571-1
eBook Packages: Springer Book Archive