ABSTRACT
Planned maintenance is a fact of life in IP networks. Examples of maintenance activities include updating router software as well as processor upgrades, memory upgrades, installation of additional line cards, and other hardware upgrades. While planned maintenance is clearly necessary, it is also costly. Software upgrades, for example, require rebooting the router. Due to the time required to reboot the router, and then synchronize state (such as BGP routing information) with network neighbors, the upgrade process can yield outages of 10--15 minutes.
- M. Reardon, "IP reliability," Light Reading, March 2003.Google Scholar
- Cisco Systems, "A brief overview of packet over SONET APS." Cisco website, Document ID 13566, July 2004.Google Scholar
- Cisco Systems, "Cisco IOS software: guide to performing in-service software upgrades," March 2006.Google Scholar
- J. V. der Merwe et. al., "Dynamic Connectivity Management with an Intelligent Route Service Control Point." Sigcomm INM Workshop, September 2006. Google ScholarDigital Library
- P. Sebos, J. Yates, G. Li, D. Rubenstein, and M. Lazer, "An integrated IP/optical approach for efficient access router failure recovery," in Optical Fiber Communications Conference, IEEE, 2003.Google Scholar
Index Terms
- RouterFarm: towards a dynamic, manageable network edge
Recommendations
Resilience and survivability in communication networks: Strategies, principles, and survey of disciplines
The Internet has become essential to all aspects of modern life, and thus the consequences of network disruption have become increasingly severe. It is widely recognised that the Internet is not sufficiently resilient, survivable, and dependable, and ...
Redundancy, diversity, and connectivity to achieve multilevel network resilience, survivability, and disruption tolerance invited paper
Communication networks are constructed as a multilevel stack of infrastructure, protocols, and mechanisms: links and nodes, topology, routing paths, interconnected realms (ASs), end-to-end transport, and application interaction. The resilience of each ...
An OS-Hypervisor Infrastructure for Automated OS Crash Diagnosis and Recovery in a Virtualized Environment
SBAC-PAD '12: Proceedings of the 2012 IEEE 24th International Symposium on Computer Architecture and High Performance ComputingRecovering from OS crashes has traditionally been done using reboot or checkpoint-restart mechanisms. Such techniques either fail to preserve the state before the crash happens or require modifications to applications. To eliminate these problems, we ...
Comments