Fault tolerance mechanisms for virtual data center architectures

Joshi, Sagar C.; Sivalingam, Krishna M.

doi:10.1007/s11107-014-0463-1

Fault tolerance mechanisms for virtual data center architectures

Published: 21 August 2014

Volume 28, pages 154–164, (2014)
Cite this article

Photonic Network Communications Aims and scope Submit manuscript

Sagar C. Joshi¹ &
Krishna M. Sivalingam¹

355 Accesses
8 Citations
Explore all metrics

Abstract

A virtual data center (VDC) is a combination of interconnected virtual servers hosted on a physical data center that hosts multiple such VDCs. This enables efficient sharing of the data center’s resources while handling dynamic resource requirements of the clients. The SecondNet architecture (Guo et al. in Proceedings of ACMSIGCOMM conference on data communication, Barcelona, pp 63–74, 2009) realizes this VDC concept and includes a centralized VDC resource-mapping (virtual to physical) algorithm. Fault tolerance is an important requirement in data center-based services, in order to increase reliability and availability. In this paper, we propose a fault tolerance mechanism to handle server failures by efficiently migrating the virtual machines (VMs) hosted on the failed server to a new location. Using our mechanism, it is shown that recovery from all the faults is possible, even for a server utilization of 90 %. In order to reduce the impact of server failures on the VDCs hosted in the data center, we then present a new load balancing scheme based on clustering that efficiently allocates the VDCs on the data center. Using this scheme, we were able to reduce the affected number of VMs per server failure by 63 %, in case of a BCube network of size 625 nodes, and by 86 %, in case of a BCube network of size 1,296 nodes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Effect of Fault Tolerance in the Field of Cloud Computing

Using Virtualization Technology for Fault-Tolerant Replication in LAN

Real-Time Fault-Tolerant Scheduling Algorithm in Virtualized Clouds

References

Amazon elastic compute cloud (Amazon EC2), Sep. 2012, http://aws.amazon.com/ec2/ (2012)
Windows Azure: Microsoft’s Cloud Platform, Sep. 2012, http://www.windowsazure.com/en-us/ (2012)
Guo, C., Lu, G., Wang, H. J., Yang, S., Kong, C., Sun, P., Wu, W., Zhang, Y.: SecondNet: a data center network virtualization architecture with bandwidth guarantees. In: Proceedings of ACM Co-NEXT, Philadelphia, USA, Dec. 2010, pp. 15:1–15:12 (2010)
Guo, C., Lu, G., Wang, H. J., Yang, S., Kong, C., Sun, P., Wu, W., Zhang, Y.: SecondNet: a data center network virtualization architecture with bandwidth guarantees. Microsoft Research, Tech. Rep. MSR-TR-2010-81 (2010)
Guo, C., Lu, G., Li, D., Wu, H., Zhang, X., Shi, Y., Tian, C., Zhang, Y., Lu, S.: BCube: a high performance, server-centric network architecture for modular data centers. In: Proceedings of ACM SIGCOMM Conference on Data Communication, Barcelona, Spain, Aug. 2009, pp. 63–74 (2009)
Guo, C., Wu, H., Tan, K., Shi, L., Zhang, Y., Lu, S.: DCell: a scalable and fault-tolerant network structure for data centers. In: Proceedings of ACM SIGCOMM Conference on Data Communication, Seattle, USA, Aug. 2008, pp. 75–86 (2008)
Metzler, J.: A guide for understanding cloud computing, Apr. 2009, http://www.webtorials.com/content/2009/11/a-guide-for-understanding-cloud-computing.html (2009)
Armbrust, M., Fox, A., Griffith, R., Joseph, A. D., Katz, R. H., Konwinski, A., Lee, G., Patterson, D. A., Rabkin, A., Stoica, I., Zaharia, M.: Above the Clouds: A Berkeley View of Cloud Computing, Feb. 2009 (2009)
Chowdhury, N., Boutaba, R.: Network virtualization: state of the art and research challenges. IEEE Commun. Mag. 47(7), 20–26 (2009)
Carapinha, J., Jiménez, J.: Network virtualization: a view from the bottom. In: Proceedings of ACM Workshop on Virtualized Infrastructure Systems and Architectures, Barcelona, Spain, Aug. 2009, pp. 73–80 (2009)
Hopcroft, J.E., Karp, R.M.: A \(n^{5/2}\) algorithm for maximum matchings in bipartite graphs. In: 12th Annual Symposium on Switching and Automata Theory. Michigan, USA, Oct. 1971, pp. 122–125 (1971)
Joshi, S. C.: Improvement on SecondNet: Data Center Network Virtualization Architecture, Master’s Thesis, Indian Institute of Technology Madras, Jul. 2012 (2012)
Library for efficient modeling and optimization in networks (lemon) graph library, http://lemon.cs.elte.hu/trac/lemon, Dec. 2011 (2011)
Google cluster data, Dec. 2011, http://code.google.com/p/googleclusterdata/ (2011)

Download references

Acknowledgments

This work was supported in part by DST-EPSRC funded India-UK Advanced Technology Centre of Excellence in Next Generation Networks, Systems and Services (IU-ATC).

Author information

Authors and Affiliations

Department of CSE, Indian Institute of Technology Madras, Chennai, 600036, India
Sagar C. Joshi & Krishna M. Sivalingam

Authors

Sagar C. Joshi
View author publications
You can also search for this author in PubMed Google Scholar
Krishna M. Sivalingam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Krishna M. Sivalingam.

Additional information

Part of this paper was presented at the IEEE International Conference on Advanced Networks and Telecommunication Systems (ANTS) (Chennai, India), December 2013.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Joshi, S.C., Sivalingam, K.M. Fault tolerance mechanisms for virtual data center architectures. Photon Netw Commun 28, 154–164 (2014). https://doi.org/10.1007/s11107-014-0463-1

Download citation

Received: 15 April 2014
Accepted: 23 July 2014
Published: 21 August 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s11107-014-0463-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fault tolerance mechanisms for virtual data center architectures

Abstract

Access this article

Similar content being viewed by others

Effect of Fault Tolerance in the Field of Cloud Computing

Using Virtualization Technology for Fault-Tolerant Replication in LAN

Real-Time Fault-Tolerant Scheduling Algorithm in Virtualized Clouds

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fault tolerance mechanisms for virtual data center architectures

Abstract

Access this article

Similar content being viewed by others

Effect of Fault Tolerance in the Field of Cloud Computing

Using Virtualization Technology for Fault-Tolerant Replication in LAN

Real-Time Fault-Tolerant Scheduling Algorithm in Virtualized Clouds

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation