Abstract
Modern complex information systems require management mechanisms that operate to a large extent independently and autonomously. One such mechanism is the restart of components or transactions in case a failure in the system occurs. In this paper we introduce a pragmatic algorithm to determine close to optimal restart times on-line.
We present a method for choosing best restart times based on empirical data, if no theoretical distribution is known. The best restart time is determined based on the empirical hazard rate. We study the sample size required to come to a reasonably good estimate, the effect of the failure probability of a job and issues of parameter selection for the hazard rate estimation. The application considered in this paper is the connection setup time in HTTP GET necessary for the download of web pages.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Klein, J.P., Moeschberger, M.L.: Survival Analysis, Techniques for Censored and Truncated Data. Springer, Heidelberg (1997)
Maurer, S.M., Huberman, B.A.: Restart strategies and Internet congestion. Journal of Economic Dynamics and Control 25, 641–654 (2001)
van Moorsel, A., Wolter, K.: Analysis and Algorithms for Restart. In: Proc. 1st International Conference on the Quantitative Evaluation of Systems (QEST), Twente, Netherlands, September 2004, pp. 195–204 (2004)
van Moorsel, A., Wolter, K.: Making Deadlines through Restart. In: Proc. 12th GI/ITG Conference on Measuring, Modelling and Evaluation of Computer and Communication Systems (MMB 2004), Dresden, Germany, September 2004, pp. 155–160 (2004)
Reinecke, P., van Moorsel, A., Wolter, K.: A Measurement Study of the Interplay between Application Level Restart and Transport Protocol. In: Malek, M., Reitenspiess, M., Kaiser, J. (eds.) ISAS 2004. LNCS, vol. 3335, pp. 86–100. Springer, Heidelberg (2005)
Schroeder, M., Buro, L.: Does the Restart Method Work? Preliminary Results on Efficiency Improvements for Interactions of Web-Agents. In: Wagner, T., Rana, O. (eds.) Proceedings of the Workshop on Infrastructure for Agents, MAS, and Scalable MAS at the Conference Autonomous Agents 2001. Springer, Montreal (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wolter, K. (2005). Self-Management of Systems Through Automatic Restart. In: Babaoglu, O., et al. Self-star Properties in Complex Information Systems. SELF-STAR 2004. Lecture Notes in Computer Science, vol 3460. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11428589_13
Download citation
DOI: https://doi.org/10.1007/11428589_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26009-7
Online ISBN: 978-3-540-32013-5
eBook Packages: Computer ScienceComputer Science (R0)