Skip to main content
Log in

Aurora: a new model and architecture for data stream management

  • Original Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract.

This paper describes the basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications. Monitoring applications differ substantially from conventional business data processing. The fact that a software system must process and react to continual inputs from many sources (e.g., sensors) rather than from human operators requires one to rethink the fundamental architecture of a DBMS for this application area. In this paper, we present Aurora, a new DBMS currently under construction at Brandeis University, Brown University, and M.I.T. We first provide an overview of the basic Aurora model and architecture and then describe in detail a stream-oriented set of operators.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • 1. Informix White Paper. (2003) Time series: the next step for telecommunications data management. http://www-3.ibm.com/software/data/informix/pubs/whitepapers/nextstep_wp.pdf

  • 2. Lojack.com (2003) http://www.lojack.com/

  • 3. Mitre Corporation (2003) http://www.mitre.org/

  • 4. Altinel M, Franklin MJ (2000) Efficient filtering of XML documents for selective dissemination of information. In: Proceedings of the 26th international conference on very large data bases (VLDB), Cairo, 10--14 September 2000, pp 53--64

  • 5. Avnur R, Hellerstein J (2000) Eddies: continuously adaptive query processing. In: Proceedings of the 2000 ACM SIGMOD international conference on management of data, Dallas, pp 261--272

  • 6. Babcock B, Babu S, Datar M, Motwani R (2003) Chain: operator scheduling for memory minimization in stream systems. In: Proceedings of the international SIGMOD conference, San Diego, 9--12 June 2003 (in press)

  • 7. Babu S, Widom J (2001) Continuous queries over data streams. SIGMOD Record 30(3):109--120

    Google Scholar 

  • 8. Barbara D, DuMouchel W, Faloutsos C, Haas PJ, Hellerstein JM, Ioannidis YE, Jagadish HV, Johnson T, Ng RT, Poosala V, Ross KA, Sevcik KC (1997) The New Jersey data reduction report. IEEE Data Eng Bull 20(4):3--45

    Google Scholar 

  • 9. Chen J, DeWitt DJ, Tian F, Wang Y (2000) NiagaraCQ: a scalable continuous query system for internet databases. In: Proceedings of the 2000 ACM SIGMOD international conference on management of data, Dallas, 14--19 May 2000, pp 379--390

  • 10. Garcia-Molina H, Salem K (1992) Main memory database systems: an overview. IEEE Trans Knowledge Data Eng 4(6):509--516

    Google Scholar 

  • 11. Gehrke J, Korn F, Srivastava D (2001) On computing correlated Aggregates over continual data streams. In: Proceedings of the 2001 ACM SIGMOD international conference on management of data, Santa Barbara, CA, 21--24 May 2001, pp 13--24

  • 12. Gupta A, Mumick IS (1995) Maintenance of materialized views: problems, techniques, and applications. IEEE Data Eng Bull 18(2):3--18

    Google Scholar 

  • 13. Hanson EN, Carnes C, Huang L, Konyala M, Noronha L, Parthasarathy S, Park JB, Vernon A (1999) Scalable trigger processing. In: Proceedings of the 15th international conference on data engineering, Sydney, 23--26 March 1999, pp 266--275

  • 14. Hellerstein JM, Haas PJ, Wang HJ (1997) Online aggregation. In: Proceedings of the 1997 ACM SIGMOD international conference on management of data, Tucson, 13--15 May 1997, pp 171--182

  • 15. Ives ZG, Florescu D, Friedman M, Levy A, Weld DS (1999) An adaptive query execution system for data integration. In: Proceedings of the 1999 ACM SIGMOD international conference on management of data, Philadelphia, 1--3 June 1999, pp 299--310

  • 16. Kao B, Garcia-Molina H (1994) An overview of realtime database systems. In: Stoyenko AD (ed) Real time computing. Springer, Berlin Heidelberg New York

  • 17. Madden S, Franklin MJ (2002) Fjording the stream: an architecture for queries over streaming sensor data. In: Proceedings of the 18th international conference on data engineering, San Jose, 26 February--1 March 2002

  • 18. Madden SR, Shaw MA, Hellerstein JM, Raman V (2002) Continuously adaptive continuous queries over streams. In: Proceedings of the ACM SIGMOD international conference on management of data, Madison, WI, 3--6 June 2002, pp 49--60

  • 19. Mohan C, Agrawal D, Alonso G, Abbadi AE, Gunther R, Kamath M (1995) Exotica: a project on advanced transaction management and workflow systems. SIGOIS Bull 16(1):45--50

    Google Scholar 

  • 20. Motwani R, Widom J, Arasu A, Babcock B, Babu S, Datar M, Manku G, Olston C, Rosenstein J, Varma R (2002) Query processing, approximation, and resource management in a data stream management system. Stanford University, Computer Science Department 2002-41, August 2002

  • 21. Nieh J, Lam MS (1997) The design, implementation and evaluation of SMART: a scheduler for multimedia applications. In: Proceedings of the 16th ACM symposium on operating systems principles, Saint-Malo, France, 5--8 October 1997, pp 184--197

  • 22. Ozsoyoglu G, Snodgrass RT (1995) Temporal and real-time databases: a survey. IEEE Trans Knowledge Data Eng 7(4):513--532

    Google Scholar 

  • 23. Paton N, Diaz O (1999) Active database systems. ACM Comput Surv 31(1):63--103

    Google Scholar 

  • 24. Schreier U, Pirahesh H, Agrawal R, Mohan C (1991) Alert: an architecture for transforming a passive DBMS into an active DBMS. In: Proceedings of the 17th international conference on very large data bases, Barcelona, 3--6 September 1991, pp 469--478

  • 25. Sellis TK, Ghosh S (1990) On the multiple-query optimization problem. IEEE Trans Knowledge Data Eng 2(2):262--266

    Google Scholar 

  • 26. Seshadri P, Livny M, Ramakrishnan R (1995) SEQ: a model for sequence databases. In: Proceedings of the 11th international conference on data engineering, Taipei, Taiwan, 6--10 March 1995, pp 232--239

  • 27. Sullivan M, Heybey A (1998) Tribeca: a system for managing large databases of network traffic. In: Proceedings of the USENIX annual technical conference, New Orleans, 15--19 June 1998, pp 13--24

  • 28. Urhan T, Franklin MJ (2001) Dynamic pipeline scheduling for improving interactive query performance. In: Proceedings of the 27th international conference on very large data bases, Rome, 11--14 September 2001, pp 501--510

  • 29. Viglas S, Naughton JF (2002) Rate-based query optimization for streaming information sources. In: Proceedings of the ACM SIGMOD international conference on management of data, Madison, WI, 3--6 June 2002, pp 37--48

  • 30. Yang C, Reddy AVS (1995) A taxonomy for congestion control algorithms in packet switching networks. IEEE Netw 9(5):34--44

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel J. Abadi.

Additional information

Received: 12 September 2002, Accepted: 26 March 2003, Published online: 21 July 2003

Edited by Y. Ioannidis

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abadi, D.J., Carney, D., Çetintemel, U. et al. Aurora: a new model and architecture for data stream management. VLDB 12, 120–139 (2003). https://doi.org/10.1007/s00778-003-0095-z

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-003-0095-z

Keywords:

Navigation