Abstract
Understanding and resolving failure causalities in modern enterprise applications is one of the main challenges daily faced by application administrators. Such applications indeed integrate multiple heterogeneous components, and identifying which components are causing the failure of which other components requires to delve through distributed application logs. In this work-in-progress paper, we present our idea of devising an analysis approach based on management protocols, a fault-aware compositional modelling for the management of multi-component applications. We discuss how they can be used to identify causalities of failures in multi-component applications, and to design countermeasures to avoid (or, at least, limit) failure propagation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
A well-formed management protocol ensures that the fault handling transition are only handling the fault of requirements that were assumed in the source state, while a deterministic management protocol ensures that the effects of executing a management operation or handling faulted requirements are deterministic [4].
References
Bergmayr, A., et al.: A systematic review of cloud modeling languages. ACM Comput. Surv. 51(1), 22:1–22:38 (2018)
Betin Can, A., Bultan, T., Lindvall, M., Lux, B., Topp, S.: Eliminating synchronization faults in air traffic control software via design for verification with concurrency controllers. Autom. Softw. Eng. 14(2), 129–178 (2007)
Breitenbücher, U., et al.: The OpenTOSCA Ecosystem - Concepts & Tools, pp. 112–130. SciTePress (2016)
Brogi, A., Canciani, A., Soldani, J.: Fault-aware management protocols for multi-component applications. J. Syst. Softw. 139, 189–210 (2018)
Brogi, A., Canciani, A., Soldani, J.: True concurrent management of multi-component applications. In: Kritikos, K., Plebani, P., de Paoli, F. (eds.) ESOCC 2018. LNCS, vol. 11116, pp. 17–32. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99819-0_2
Butler, M., Jones, C.B., Romanovsky, A., Troubitsyna, E. (eds.): Rigorous Development of Complex Fault-Tolerant Systems. LNCS, vol. 4157. Springer, Heidelberg (2006). https://doi.org/10.1007/11916246
Candea, G., Brown, A.B., Fox, A., Patterson, D.: Recovery-oriented computing: building multitier dependability. Computer 37(11), 60–67 (2004)
Console, L., Picardi, C., Dupré, D.T.: A framework for decentralized qualitative model-based diagnosis. In: Veloso, M.M. (ed.) IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India, January 6–12, 2007, pp. 286–291 (2007)
Durán, F., Salaün, G.: Robust and reliable reconfiguration of cloud applications. J. Syst. Softw. 122(C), 524–537 (2016)
Etchevers, X., Salaün, G., Boyer, F., Coupaye, T., De Palma, N.: Reliable self-deployment of distributed cloud applications. Softw.: Practice Exp. 47(1), 3–20 (2017)
Friedrich, G., Fugini, M.G., Mussi, E., Pernici, B., Tagni, G.: Exception handling for repair in service-based processes. IEEE Trans. Softw. Eng. 36(2), 198–215 (2010)
Genc, S., Lafortune, S.: Predictability of event occurrences in partially-observed discrete-event systems. Automatica 45(2), 301–311 (2009)
Jamshidi, P., Pahl, C., Mendonca, N., Lewis, J., Tilkov, S.: Microservices: the journey so far and challenges ahead. IEEE Softw. 35(3), 24–35 (2018)
Johnsen, E., Owe, O., Munthe-Kaas, E., Vain, J.: Incremental fault-tolerant design in an object-oriented setting. In: Proceedings of the Second Asia-Pacific Conference on Quality Software, p. 223. APAQS, IEEE Computer Society (2001)
Qiang, W., Yan, L., Bliudze, S., Xiaoguang, M.: Automatic fault localization for BIP. In: Li, X., Liu, Z., Yi, W. (eds.) SETTA 2015. LNCS, vol. 9409, pp. 277–283. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25942-0_18
Soldani, J., Tamburri, D.A., Van Den Heuvel, W.J.: The pains and gains of microservices: a systematic grey literature review. J. Syst. Softw. 146, 215–232 (2018)
Zaytoon, J., Lafortune, S.: Overview of fault diagnosis methods for discrete event systems. Ann. Rev. Control 37(2), 308–320 (2013)
Acknowledgments
This work is partly funded by the projects AMaCA (POR-FSE, Regione Toscana) and DECLware (PRA_2018_66, University of Pisa).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Brogi, A., Soldani, J. (2020). Identifying Failure Causalities in Multi-component Applications. In: Camara, J., Steffen, M. (eds) Software Engineering and Formal Methods. SEFM 2019. Lecture Notes in Computer Science(), vol 12226. Springer, Cham. https://doi.org/10.1007/978-3-030-57506-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-57506-9_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-57505-2
Online ISBN: 978-3-030-57506-9
eBook Packages: Computer ScienceComputer Science (R0)