Skip to main content
Log in

Fault tolerance mechanisms for virtual data center architectures

  • Published:
Photonic Network Communications Aims and scope Submit manuscript

Abstract

A virtual data center (VDC) is a combination of interconnected virtual servers hosted on a physical data center that hosts multiple such VDCs. This enables efficient sharing of the data center’s resources while handling dynamic resource requirements of the clients. The SecondNet architecture (Guo et al. in Proceedings of ACMSIGCOMM conference on data communication, Barcelona, pp 63–74, 2009) realizes this VDC concept and includes a centralized VDC resource-mapping (virtual to physical) algorithm. Fault tolerance is an important requirement in data center-based services, in order to increase reliability and availability. In this paper, we propose a fault tolerance mechanism to handle server failures by efficiently migrating the virtual machines (VMs) hosted on the failed server to a new location. Using our mechanism, it is shown that recovery from all the faults is possible, even for a server utilization of 90 %. In order to reduce the impact of server failures on the VDCs hosted in the data center, we then present a new load balancing scheme based on clustering that efficiently allocates the VDCs on the data center. Using this scheme, we were able to reduce the affected number of VMs per server failure by 63 %, in case of a BCube network of size 625 nodes, and by 86 %, in case of a BCube network of size 1,296 nodes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Amazon elastic compute cloud (Amazon EC2), Sep. 2012, http://aws.amazon.com/ec2/ (2012)

  2. Windows Azure: Microsoft’s Cloud Platform, Sep. 2012, http://www.windowsazure.com/en-us/ (2012)

  3. Guo, C., Lu, G., Wang, H. J., Yang, S., Kong, C., Sun, P., Wu, W., Zhang, Y.: SecondNet: a data center network virtualization architecture with bandwidth guarantees. In: Proceedings of ACM Co-NEXT, Philadelphia, USA, Dec. 2010, pp. 15:1–15:12 (2010)

  4. Guo, C., Lu, G., Wang, H. J., Yang, S., Kong, C., Sun, P., Wu, W., Zhang, Y.: SecondNet: a data center network virtualization architecture with bandwidth guarantees. Microsoft Research, Tech. Rep. MSR-TR-2010-81 (2010)

  5. Guo, C., Lu, G., Li, D., Wu, H., Zhang, X., Shi, Y., Tian, C., Zhang, Y., Lu, S.: BCube: a high performance, server-centric network architecture for modular data centers. In: Proceedings of ACM SIGCOMM Conference on Data Communication, Barcelona, Spain, Aug. 2009, pp. 63–74 (2009)

  6. Guo, C., Wu, H., Tan, K., Shi, L., Zhang, Y., Lu, S.: DCell: a scalable and fault-tolerant network structure for data centers. In: Proceedings of ACM SIGCOMM Conference on Data Communication, Seattle, USA, Aug. 2008, pp. 75–86 (2008)

  7. Metzler, J.: A guide for understanding cloud computing, Apr. 2009, http://www.webtorials.com/content/2009/11/a-guide-for-understanding-cloud-computing.html (2009)

  8. Armbrust, M., Fox, A., Griffith, R., Joseph, A. D., Katz, R. H., Konwinski, A., Lee, G., Patterson, D. A., Rabkin, A., Stoica, I., Zaharia, M.: Above the Clouds: A Berkeley View of Cloud Computing, Feb. 2009 (2009)

  9. Chowdhury, N., Boutaba, R.: Network virtualization: state of the art and research challenges. IEEE Commun. Mag. 47(7), 20–26 (2009)

  10. Carapinha, J., Jiménez, J.: Network virtualization: a view from the bottom. In: Proceedings of ACM Workshop on Virtualized Infrastructure Systems and Architectures, Barcelona, Spain, Aug. 2009, pp. 73–80 (2009)

  11. Hopcroft, J.E., Karp, R.M.: A \(n^{5/2}\) algorithm for maximum matchings in bipartite graphs. In: 12th Annual Symposium on Switching and Automata Theory. Michigan, USA, Oct. 1971, pp. 122–125 (1971)

  12. Joshi, S. C.: Improvement on SecondNet: Data Center Network Virtualization Architecture, Master’s Thesis, Indian Institute of Technology Madras, Jul. 2012 (2012)

  13. Library for efficient modeling and optimization in networks (lemon) graph library, http://lemon.cs.elte.hu/trac/lemon, Dec. 2011 (2011)

  14. Google cluster data, Dec. 2011, http://code.google.com/p/googleclusterdata/ (2011)

Download references

Acknowledgments

This work was supported in part by DST-EPSRC funded India-UK Advanced Technology Centre of Excellence in Next Generation Networks, Systems and Services (IU-ATC).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Krishna M. Sivalingam.

Additional information

Part of this paper was presented at the IEEE International Conference on Advanced Networks and Telecommunication Systems (ANTS) (Chennai, India), December 2013.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Joshi, S.C., Sivalingam, K.M. Fault tolerance mechanisms for virtual data center architectures. Photon Netw Commun 28, 154–164 (2014). https://doi.org/10.1007/s11107-014-0463-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11107-014-0463-1

Keywords

Navigation