skip to main content
10.1145/1142473.1142509acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Continuous query processing in data streams using duality of data and queries

Published:27 June 2006Publication History

ABSTRACT

Recent data stream systems such as TelegraphCQ have employed the well-known property of duality between data and queries. In these systems, query processing methods are classified into two dual categories -- data-initiative and query-initiative -- depending on whether query processing is initiated by selecting a data element or a query. Although the duality property has been widely recognized, previous data stream systems do not fully take advantages of this property since they use the two dual methods independently: data-initiative methods only for continuous queries and query-initiative methods only for ad-hoc queries. We contend that continuous query processing can be better optimized by adopting an approach that integrates the two dual methods. Our primary contribution is based on the observation that spatial join is a powerful tool for achieving this objective. In this paper, we first present a new viewpoint of transforming the continuous query processing problem to a multi-dimensional spatial join problem. We then present a continuous query processing algorithm based on spatial join, which we name Spatial Join CQ. This algorithm processes continuous queries by finding the pairs of overlapping regions from a set of data elements and a set of queries, both defined as regions in the multi-dimensional space. The algorithm achieves the advantages of the two dual methods simultaneously. Experimental results show that the proposed algorithm outperforms earlier algorithms by up to 36 times for simple selection continuous queries and by up to 7 times for sliding window join queries.

References

  1. {1} Babcock, B. et al., "Models and Issues in Data Stream Systems," In Proc. the 21st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems (PODS), Madison, Wisconsin, pp. 1-16, June 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. {2} Berchtold, S., Bohm, C., and Kriegel, H.-P., "The Pyramid-Technique: Towards Breaking the Curse of Dimensionality," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Seattle, Washington, pp. 142-153, June 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. {3} Brinkhoff, T., Kriegel, H.-P., and Seeger, B., "Efficient Processing of Spatial Join Using R-trees," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Washington, DC., pp. 237-246, May 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. {4} Chandrasekaran, S. and Franklin, M. J., "Streaming Queries over Streaming Data," In Proc. the 28th Int'l Conf. on Very Large Data Bases, Hong Kong, China, pp. 203-214, Aug. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. {5} Chandrasekaran, S. et al., "TelegraphCQ: Continuous Dataflow Processing for an Uncertain World," In Proc. the First Biennial Conf. on Innovative Data Systems Research, Asiloma, Califonia, pp. 269-280, Jan. 2003.Google ScholarGoogle Scholar
  6. {6} Chen, J. et al., "NiagaraCQ: A Scalable Continuous Query System for Internet Databases," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Dallas, Texas, pp. 379-390, June 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. {7} Faloutsos, C. and Roseman, S., "Fractals for Secondary Key Retrieval," In Proc. the Eighth ACM SIGACT-SIGMOD Symp. on Principles of Database Systems(PODS), Philadelphia, Pennsylvania, pp. 247-252, Mar. 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. {8} Fox, E. A. et al., "Order-preserving minimal perfect hash functions and information retrieval," ACM Trans. on Information Systems, Vol. 9, No. 3, pp. 281-308, July 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. {9} Golab, L. and Ozsu, M. T., "Issues in Data Stream Management," ACM SIGMOD Record, Vol. 32, No. 2, pp. 5-14, June 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. {10} Hanson, E. N. et al., "A Predicate Matching Algorithm for Database Rule Systems," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Atlantic City, New Jersey, pp. 271-280, June 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. {11} Hinrichs, K. and Nievergelt, J., "The Grid File: A Data Structure Designed to Support Proximity Queries on Spatial Objects," In Proc. Int'l Workshop on Graphtheoretic Concepts in Computer Science, Linz, Austria, pp. 100-113, Aug. 1983.Google ScholarGoogle Scholar
  12. {12} Huang, Y.-W., Jing, N., and Rundensteiner, E. A., "Spatial Joins Using R-trees: Breadth-First Traversal with Global Optimizations," In Proc. the 23rd Int'l Conf. on Very Large Data Bases, Athens, Greece, pp. 396-405, Aug. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. {13} Kang, J., Naughton, J. F., and Viglas, S. D., "Evaluating Window Joins over Unbounded Streams," In Proc. the 19th IEEE Int'l Conf. on Data Engineering(ICDE), Bangalore, India, pp. 341-352, Mar. 2003.Google ScholarGoogle Scholar
  14. {14} Kriegel, H.-P. et al., "Spatial Query Processing for High Resolutions," In Proc. the Eighth Int'l Conf. on Database Systems for Advanced Applications, Tokyo, Japan, pp. 17-26, Mar. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. {15} Mokbel, M. F., Xiong, X., and Aref, W. G., "SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Paris, France, pp. 623-634, June 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. {16} Motwani, R. et al., "Query Processing, Approximation, and Resource Management in a Data Stream Management System," In Proc. the First Biennial Conf. on Innovative Data Systems Research, Asiloma, California, pp. 245-256, Jan. 2003.Google ScholarGoogle Scholar
  17. {17} Orenstein, J. A. and Merrett, T. H., "A Class of Data Structures for Associative Searching," In Proc. the Third ACM SIGACT-SIGMOD Symp. on Principles of Database Systems(PODS), Waterloo, Canada, pp. 181-190, Apr. 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. {18} Seeger, B. and Kriegel, H.-P., "Techniques for Design and Implementation of Efficient Spatial Access Methods," In Proc. the 14th Int'l Conf. on Very Large Data Bases, Los Angeles, California, pp. 360-371, Aug. 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. {19} Song, J.-W., Whang, K.-Y., Lee, Y.-K., and Kim, S.-W, "Spatial Join Processing Using Corner Transformation," IEEE Trans. on Knowledge and Data Engineering, Vol. 11, No. 4, July 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. {20} Terry, D. et al., "Continuous Queries over Append-Only Databases," In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, San Diego, California, pp. 321-330, June 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. {21} Weber, R., Schek, H.-J., and Blott, S., "A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces," In Proc. the 24th Int'l Conf. on Very Large Data Bases, New York City, New York, pp. 194-205, Aug. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. {22} Whang, K.-Y. and Krishnamurthy, R., Multilevel Grid Files, IBM Research Report RC11516, IBM Thomas J. Watson Research Center, Yorktown Heights, New York, Nov. 1985.Google ScholarGoogle Scholar
  23. {23} Whang, K.-Y. and Krishnamurthy, R., "The Multilevel Grid File - a Dynamic Hierarchical Multidimensional File Structure," In Proc. Int'l Conf. on Database Systems for Advanced Applications, Tokyo, Japan, pp. 449-459, Apr. 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. {24} Zdonik, S. et al., "The Aurora and Medusa Projects," IEEE Data Engineering Bulletin, Vol. 26, No. 1, pp. 3-10, Mar. 2003.Google ScholarGoogle Scholar

Index Terms

  1. Continuous query processing in data streams using duality of data and queries

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
        June 2006
        830 pages
        ISBN:1595934340
        DOI:10.1145/1142473

        Copyright © 2006 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 June 2006

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate785of4,003submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader