Abstract
This paper introduces a class of join algorithms, termed W-join, for joining multiple infinite data streams. W-join addresses the infinite nature of the data streams by joining only those stream data items that lie within a sliding window and that match a certain join condition. In addition to its general applicability in stream query processing, W-join can be used to track the motion of a moving object or to detect the propagation of clouds of hazardous material or pollution spills over time in a sensor network environment. We describe two new algorithms for W-join and address variations and local/global optimizations that depend on how the window constraints are specified for the posed queries. The performance of the proposed algorithms is studied experimentally in a prototype stream database system, using synthetic data streams and real time-series data. The tradeoffs among the proposed algorithms, including their advantages and disadvantages, are highlighted under variations in the aggregate arrival rates of the input data streams and the desired response times per query.
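The core idea of joining only those items that fall within a sliding window can be illustrated with a minimal sketch. This is not either of the two algorithms proposed in the paper; it is a naive nested-loop illustration under stated assumptions: a time-based window of width `window`, an equality join condition on a single `key`, and tuples arriving in non-decreasing timestamp order across all streams. The class name `WindowJoin` and its method `insert` are hypothetical names chosen for this example.

```python
from collections import deque
from itertools import product


class WindowJoin:
    """Toy multi-way sliding-window equi-join (illustrative assumption only)."""

    def __init__(self, num_streams, window):
        self.window = window
        # One buffer per stream; each holds (timestamp, key) tuples in arrival order.
        self.buffers = [deque() for _ in range(num_streams)]

    def insert(self, stream_id, timestamp, key):
        """Add a tuple to one stream and return the join results it produces."""
        # Expire tuples that are too old to share a window with the new arrival.
        # Assumes timestamps never decrease, so the new tuple is the newest seen.
        for buf in self.buffers:
            while buf and buf[0][0] < timestamp - self.window:
                buf.popleft()

        # Collect, for every stream, the tuples that match the join condition.
        candidates = []
        for sid, buf in enumerate(self.buffers):
            if sid == stream_id:
                candidates.append([(timestamp, key)])
            else:
                candidates.append([t for t in buf if t[1] == key])

        # One output per combination; an empty candidate list yields no output.
        results = list(product(*candidates))
        self.buffers[stream_id].append((timestamp, key))
        return results


join = WindowJoin(num_streams=3, window=5)
join.insert(0, 1, "a")
join.insert(1, 2, "a")
matches = join.insert(2, 4, "a")   # all three tuples lie within 5 time units
late = join.insert(0, 10, "a")     # earlier tuples have expired by now
```

Because every surviving tuple is at most `window` time units older than the newest arrival, any two tuples in an output combination are also within `window` of each other, which is the constraint this sketch assumes.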
Additional information
This is an extended version of the paper published in the Proceedings of the 15th International Conference on Scientific and Statistical Database Management, SSDBM 2003, Boston, U.S.A., pp. 75–84.
About this article
Cite this article
Hammad, M.A., Aref, W.G. & Elmagarmid, A.K. Query processing of multi-way stream window joins. The VLDB Journal 17, 469–488 (2008). https://doi.org/10.1007/s00778-006-0017-y
DOI: https://doi.org/10.1007/s00778-006-0017-y