Replication for Load Balancing and Hot-Spot Relief on Proxy Web Caches with Hash Routing

Distributed and Parallel Databases

Abstract

Hash routing is an emerging approach to coordinating a collection of collaborative proxy caches. It partitions the entire URL space among the proxy caches and assigns each partition to one cache server, which eliminates duplication of cache contents. A client request that reaches a cache server for an object outside that server's assigned partition is forwarded to the sibling cache that owns the object. In the presence of access skew, however, the load levels of the cache servers can be quite unbalanced, limiting the benefits of hash routing.
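As a rough illustration of the routing rule described above, the following Python sketch maps each URL to a single owning cache and counts how requests land on each server. It is a minimal sketch under stated assumptions, not the scheme evaluated in the article: the MD5-modulo mapping, the function names (`owner`, `route`, `load_per_cache`), and the example URLs are all hypothetical.

```python
import hashlib
from collections import Counter


def owner(url: str, num_caches: int) -> int:
    """Return the index of the cache whose partition contains this URL.

    Assumption: a plain hash-modulo partition of the URL space; the
    abstract does not say which hash function is used.
    """
    digest = hashlib.md5(url.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_caches


def route(url: str, receiving_cache: int, num_caches: int) -> str:
    """Decide what the cache that received a client request does with it."""
    home = owner(url, num_caches)
    if home == receiving_cache:
        return "serve locally"
    # Object belongs to another partition: hand it to the owning sibling.
    return f"forward to sibling {home}"


def load_per_cache(requests, num_caches: int):
    """Count how many requests each cache ends up serving as owner."""
    counts = Counter(owner(url, num_caches) for url in requests)
    return [counts.get(i, 0) for i in range(num_caches)]


if __name__ == "__main__":
    # Hypothetical skewed workload: one hot URL dominates the trace.
    trace = ["http://example.com/hot"] * 90
    trace += [f"http://example.com/page{i}" for i in range(10)]
    print(load_per_cache(trace, 4))
```

Because every request for the hot URL hashes to the same owner, that one server receives at least 90 of the 100 requests in this toy trace, which is the kind of imbalance the abstract attributes to access skew.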

We examine an adaptable controlled replication (ACR) of non-assigned-partition objects in each cache server to reduce the load imbalance and relieve the problem of hot-spot references. Trace-driven simulations are conducted to study the effectiveness of ACR. The results show that (1) access skew exists, and the load of the cache servers tends to be unbalanced in hash routing; (2) with a relatively small amount of ACR, say 10% of the cache size, significant improvements in load balance can be achieved; (3) ACR provides a very effective remedy for load imbalance due to hot-spot references; and (4) increasing the cache size does not improve load balance unless replication is allowed.
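The abstract does not spell out how ACR decides what to replicate, so the following Python sketch only illustrates the general idea of bounding replication to a fraction of the cache. Everything in it is an assumption made for illustration: the class name `ReplicatingCache`, the split into two LRU areas, the capacity counted in objects rather than bytes, and the default 10% replica share (taken from the figure quoted above).

```python
from collections import OrderedDict


class ReplicatingCache:
    """Cache with a bounded area for non-assigned-partition objects.

    Hypothetical sketch: both areas use LRU eviction, and capacity is
    measured in number of objects.
    """

    def __init__(self, capacity: int, replica_fraction: float = 0.10):
        self.replica_capacity = int(capacity * replica_fraction)
        self.home_capacity = capacity - self.replica_capacity
        self.home = OrderedDict()      # objects in the assigned partition
        self.replicas = OrderedDict()  # copies of sibling-owned objects

    @staticmethod
    def _get(area, url):
        if url in area:
            area.move_to_end(url)  # mark as most recently used
            return area[url]
        return None

    @staticmethod
    def _put(area, limit, url, body):
        if limit <= 0:
            return  # this area is disabled
        area[url] = body
        area.move_to_end(url)
        while len(area) > limit:
            area.popitem(last=False)  # evict least recently used

    def lookup(self, url, is_assigned: bool):
        """Return the cached body, or None on a miss."""
        area = self.home if is_assigned else self.replicas
        return self._get(area, url)

    def store(self, url, body, is_assigned: bool):
        """Insert an object into the area matching its partition status."""
        if is_assigned:
            self._put(self.home, self.home_capacity, url, body)
        else:
            self._put(self.replicas, self.replica_capacity, url, body)
```

With a layout like this, a hot object owned by a sibling can be answered from the local replica area instead of being forwarded every time, while the cap keeps replicas from crowding out the server's own partition.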

Cite this article

Wu, KL., Yu, P.S. Replication for Load Balancing and Hot-Spot Relief on Proxy Web Caches with Hash Routing. Distributed and Parallel Databases 13, 203–220 (2003). https://doi.org/10.1023/A:1021519509203
