Abstract
Web caching is a widely deployed technique to reduce the load to web servers and to reduce the latency for web browsers. Peer-to-Peer (P2P) web caching has been a hot research topic in recent years as it can create scalable and robust designs for decentralized internet-scale applications. However, many P2P web caching systems suffer expensive overheads such as lookup and publish messages, and lack locality awareness. In this paper, we present the development of a locality aware cache diffusion system that makes use of routing table locality, aggregation, and soft state to overcome these limitations. The analysis and experiments show that our cache diffusion system reduces the amount of information processed by nodes, reduces the number of index messages sent by nodes, and improves the locality of cache pointers.
Similar content being viewed by others
References
Barford P, Crovella M (1998) Generating representative web workloads for network and server performance evaluation. In: Measurement and modeling of computer systems, 1998, pp 151–160
Busari M, Williamson CL (2001) On the sensitivity of web proxy cache performance to workload characteristics. In: INFOCOM, 2001, pp 1225–1234
Casey J, Zhou W (2005) Reducing the bandwidth requirements of P2P keyword indexing. In: Lecture notes in computer science, vol 3719. Springer, Berlin, pp 50–59
Casey J, Chonka A, Zhou W (2007) Adjacency cache: fast web cache indexing and lookup. In: 6th IEEE/ACIS international conference on computer and information science (ICIS 2007), 2007, pp 616–621
Castro M et al (2002) Exploiting network proximity in distributed hash tables. In: International workshop on future directions in distributed computing, 2002, pp 52–55
Castro M (2002) SCRIBE: A large-scale and decentralized application-level multicast infrastructure. IEEE J Sel Areas Commun (JSAC) 20(8):1489–1499
Castro M et al (2003) SplitStream: high-bandwidth multicast in cooperative environments. In: ACM symposium on operating systems principles, 2003, pp 298–313
Chandramouli B, Yang J, Vahdat A (2006) Distributed network querying with bounded approximate caching. In: Lecture notes in computer science, vol 3882. Springer, Berlin, pp 374–388
Chun B-G, Zhao BY, Kubiatowicz JD (2005) Impact of neighbor selection on performance and resilience of structured P2P networks. In: 4th International workshop on peer-to-peer systems, 2005, pp 264–274
Chun B-G et al (2006) ChunkCast: an anycast service for large content distribution. In: The 5th international workshop on peer-to-peer systems (IPTPS’06), 2006
Cutting D (2004) Lucene, http://lucene.apache.org/java/docs/
Dabek F et al (2004) Vivaldi: a decentralized network coordinate system. In: ACM SIGCOMM’04, 2004, pp 15–26
Dolev D, Mokryn O, Shavitt Y (2003) On multicast trees: structure and size estimation. In: IEEE INFOCOM’03, 2003, pp 1011–1021
El-Ansary S et al (2003) Efficient broadcast in structured P2P networks. In: IPTPS’03, 2003, pp 304–314
Gummadi KP, Saroiu S, Gribble SD (2002) King: estimating latency between arbitrary Internet end hosts. In: Proceedings of the 2nd ACM SIGCOMM workshop on Internet measurement, 2002, pp 5–18
Knuth DE (1973) The art of computer programming, 2nd edn. Addison-Wesley, Reading
Loguinov D et al (2003) Graph-theoretic analysis of structured peer-to-peer systems: routing distances and fault resilience. In: ACM SIGCOMM, 2003, pp 1107–1120
Michel BS et al (2000) URL forwarding and compression in adaptive web caching. In: IEEE INFOCOM, 2000, vol 2, pp 670–678
Ng TSE, Zhang H (2001) Towards global network positioning. In: ACM SIGCOMM Internet measurement workshop, pp 25–29
Oppenheimer D et al (2004) Scalable wide-area resource discovery. Technical Report CSD04-1334. University of California Berkeley, Berkeley, CA, USA
Pappas V et al (2006) A comparative study of the DNS design with DHT-based alternatives. In: IEEE INFOCOM’06, 2006, pp 1–13
Renesse RV, Binnan KP (2001) Astrolabe: a robust and scalable technology for distributed system monitoring, management, and data mining. ACM Trans Comput Syst 15:164–206
Rhea S et al (2004) Handling churn in a DHT. In: USENIX annual technical conference, 2004
Serpanos DN, Karakostas G, Wolf WH (2000) Effective caching of web objects using Zipf’s law. In: The 2000 IEEE international conference on multimedia and expo, 2000, pp 727–730
Shi W, Mao Y (2006) Performance evaluation of peer-to-peer web caching systems. J Syst Softw 79(5):714–726
Sripanidkulchai K, Maggs B, Zhang H (2003) Efficient content location using interest-based locality in peer-to-peer systems. In: INFOCOM, 2003, pp 2166–2176
Tati K, Voelker GM (2004) ShortCuts: using soft state to improve DHT routing. In: Lecture notes in computer science, vol 3293. Springer, Berlin, pp 44–62
Wessels D (2001) Web caching, 1st edn. O’Reilly & Associates, Sebastopol, pp xv, 300
Witten IH, Moffat A, Bell TC (1999) Managing gigabytes: compressing and indexing documents and images, 2nd edn. In: The Morgan Kaufmann series in multimedia information and systems. Morgan Kaufmann, San Francisco, pp xxxi, 519
Zegura EW, Calvert K, Donahoo MJ (1997) A quantitative comparison of graph-based models for internet topology. IEEE/ACM Trans Netw 5(6):770–783
Zhou S, Ganger GR, Steenkiste P (2003) Balancing locality and randomness in DHTs. Carnegie Mellon University Technical Report CMU-CS-03-203
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Casey, J., Zhou, W. A locality aware cache diffusion system. J Supercomput 52, 1–22 (2010). https://doi.org/10.1007/s11227-009-0264-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-009-0264-y