Fault Tolerance and High Availability in Data Stream Management Systems

Balazinska, Magdalena; Hwang, Jeong-Hyon; Shah, Mehul A.

doi:10.1007/978-1-4899-7993-3_160-2

Magdalena Balazinska³,
Jeong-Hyon Hwang⁴ &
Mehul A. Shah⁵

67 Accesses
1 Citations

Definition

Just like any other software system, a data stream management system (DSMS) can experience failures of its different components. Failures are especially common in distributed DSMSs, where query operators are spread across multiple processing nodes, i.e., independent processes typically running on different physical machines in a local-area network (LAN) or in a wide area network (WAN). Failures of processing nodes or failures in the underlying communication network can cause continuous queries (CQ) in a DSMS to stall or produce erroneous results. These failures can adversely affect critical client applications relying on these queries.

Traditionally, availability has been defined as the fraction of time that a system remains operational and properly services requests. In DSMSs, however, availability often also incorporates end-to-end latencies as applications need to quickly react to real-time events and thus can tolerate only small delays. A DSMS can handle failures using a...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Author information

Authors and Affiliations

University of Washington, Seattle, WA, USA
Magdalena Balazinska
Department of Computer Science, State University of New York at Albany, Albany, NY, USA
Jeong-Hyon Hwang
Amazon Web Services (AWS), Seattle, WA, USA
Mehul A. Shah

Authors

Magdalena Balazinska
View author publications
You can also search for this author in PubMed Google Scholar
Jeong-Hyon Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Mehul A. Shah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Magdalena Balazinska .

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, Georgia, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, Ontario, Canada
M. Tamer Özsu

Section Editor information

Department of Computer Science, Brown University, Providence, RI, USA
Ugur Cetintemel

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Balazinska, M., Hwang, JH., Shah, M.A. (2017). Fault Tolerance and High Availability in Data Stream Management Systems. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_160-2

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7993-3_160-2
Received: 17 September 2014
Accepted: 14 March 2017
Published: 03 April 2017
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4899-7993-3
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics

Fault Tolerance and High Availability in Data Stream Management Systems

Definition

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Navigation

Fault Tolerance and High Availability in Data Stream Management Systems

Definition

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Search

Navigation