Abstract
Dynamic resource management is a crucial part of the infrastructure for emerging mission-critical distributed real-time embedded systems. Because of this, the resource manager must be fault-tolerant, with nearly continuous operation. This paper describes an ongoing effort to develop a fault-tolerant multi-layer dynamic resource management capability and the challenges we have encountered, including multi-tiered structure, rapid recovery, the characteristics of component middleware, and the co-existence of replicated and non-replicated elements. While some of these challenges have been investigated before, this work exhibits all of them simultaneously, presenting a significant fault-tolerance research challenge.
This work was supported by the Defense Advanced Research Projects Agency (DARPA) under contract NBCHC030119.
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-3-540-35127-6_28
Copyright information
© 2006 IFIP International Federation for Information Processing
About this paper
Cite this paper
Rubel, P., Loyall, J., Schantz, R., Gillen, M. (2006). Adding Fault-Tolerance to a Hierarchical DRE System. In: Eliassen, F., Montresor, A. (eds) Distributed Applications and Interoperable Systems. DAIS 2006. Lecture Notes in Computer Science, vol 4025. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11773887_23
DOI: https://doi.org/10.1007/11773887_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35126-9
Online ISBN: 978-3-540-35127-6
eBook Packages: Computer Science, Computer Science (R0)