ABSTRACT
With the advent of 5G networks and the rise of the Internet of Things (IoT), Content Delivery Networks (CDNs) are increasingly extending into the network edge. This shift introduces unique challenges, particularly due to the limited cache storage and the diverse request patterns at the edge. These edge environments can host traffic classes characterized by varied object-size distributions and object-access patterns. Such complexity makes it difficult for traditional caching strategies, which often rely on metrics like request frequency or time intervals, to be effective. Despite these complexities, the optimization of edge caching is crucial. Improved byte hit rates at the edge not only alleviate the load on the network backbone but also minimize operational costs and expedite content delivery to end-users. In this paper, we introduce HR-Cache, a comprehensive learning-based caching framework grounded in the principles of Hazard Rate (HR) ordering, a rule originally formulated to compute an upper bound on cache performance. HR-Cache leverages this rule to guide future object eviction decisions. It employs a lightweight machine learning model to learn from caching decisions made based on HR ordering, subsequently predicting the "cache-friendliness'' of incoming requests. Objects deemed "cache-averse'' are placed into cache as priority candidates for eviction. Through extensive experimentation, we demonstrate that HR-Cache not only consistently enhances byte hit rates compared to existing state-of-the-art methods but also achieves this with minimal prediction overhead. Our experimental results, using three real-world traces and one synthetic trace, indicate that HR-Cache consistently achieves 2.2-14.6% greater WAN traffic savings than LRU. It outperforms not only heuristic caching strategies but also the state-of-the-art learning-based algorithm.
- Alfred V Aho, Peter J Denning, and Jeffrey D Ullman. 1971. Principles of optimal page replacement. Journal of the ACM (JACM), Vol. 18, 1 (1971), 80--93.Google ScholarDigital Library
- Martin Arlitt, Ludmila Cherkasova, John Dilley, Rich Friedrich, and Tai Jin. 2000. Evaluating content management techniques for web proxy caches. ACM SIGMETRICS Performance Evaluation Review, Vol. 27, 4 (2000), 3--11.Google ScholarDigital Library
- Laszlo A. Belady. 1966. A study of replacement algorithms for a virtual-storage computer. IBM Systems journal, Vol. 5, 2 (1966), 78--101.Google ScholarDigital Library
- Daniel S Berger. 2018. Towards lightweight and robust machine learning for cdn caching. In Proceedings of the 17th ACM Workshop on Hot Topics in Networks. 134--140.Google ScholarDigital Library
- Daniel S Berger, Nathan Beckmann, and Mor Harchol-Balter. 2018. Practical bounds on optimal caching with variable object sizes. Proceedings of the ACM on Measurement and Analysis of Computing Systems, Vol. 2, 2 (2018), 1--38.Google ScholarDigital Library
- Renato Costa and Jose Pazos. 2017. Mlcache: A multi-armed bandit policy for an operating system page cache. Technical Report. Technical report, University of British Columbia.Google Scholar
- Daryl J Daley. 2014. Introduction to the Theory of Point Processes: Elementary Theory and Methods. Springer.Google Scholar
- Andrés Ferragut, Ismael Rodr'iguez, and Fernando Paganini. 2016. Optimizing TTL caches under heavy-tailed demands. ACM SIGMETRICS Performance Evaluation Review, Vol. 44, 1 (2016), 101--112.Google ScholarDigital Library
- Giulio Garbi, Emilio Incerto, and Mirco Tribastone. 2020. Learning queuing networks by recurrent neural networks. In Proceedings of the ACM/SPEC International Conference on Performance Engineering. 56--66.Google ScholarDigital Library
- Yu Guan, Xinggong Zhang, and Zongming Guo. 2019. Caca: Learning-based content-aware cache admission for video content in edge caching. In Proceedings of the 27th ACM International Conference on Multimedia. 456--464.Google ScholarDigital Library
- Syed Hasan, Sergey Gorinsky, Constantine Dovrolis, and Ramesh K Sitaraman. 2014. Trade-offs in optimizing the cache deployments of CDNs. In IEEE INFOCOM 2014-IEEE conference on computer communications. IEEE, 460--468.Google ScholarCross Ref
- Xinyue Hu, Eman Ramadan, Wei Ye, Feng Tian, and Zhi-Li Zhang. 2022. Raven: belady-guided, predictive (deep) learning for in-memory and content caching. In Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies. 72--90.Google ScholarDigital Library
- Qi Huang, Ken Birman, Robbert Van Renesse, Wyatt Lloyd, Sanjeev Kumar, and Harry C Li. 2013. An analysis of Facebook photo caching. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles. 167--181.Google ScholarDigital Library
- Akanksha Jain and Calvin Lin. 2016. Back to the future: Leveraging Belady's algorithm for improved cache replacement. ACM SIGARCH Computer Architecture News, Vol. 44, 3 (2016), 78--89.Google ScholarDigital Library
- Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems , Vol. 30 (2017).Google Scholar
- Vadim Kirilin, Aditya Sundarrajan, Sergey Gorinsky, and Ramesh K Sitaraman. 2019. Rl-cache: Learning-based cache admission for content delivery. In Proceedings of the 2019 Workshop on Network Meets AI & ML. 57--63.Google ScholarDigital Library
- Dong Liu, Binqiang Chen, Chenyang Yang, and Andreas F Molisch. 2016. Caching at the wireless edge: design aspects, challenges, and future directions. IEEE Communications Magazine, Vol. 54, 9 (2016), 22--28.Google ScholarCross Ref
- Evan Liu, Milad Hashemi, Kevin Swersky, Parthasarathy Ranganathan, and Junwhan Ahn. 2020. An imitation learning approach for cache replacement. In International Conference on Machine Learning. PMLR, 6237--6247.Google Scholar
- Ryan Marcus, Parimarjan Negi, Hongzi Mao, Nesime Tatbul, Mohammad Alizadeh, and Tim Kraska. 2020. Bao: Learning to steer query optimizers. arXiv preprint arXiv:2004.03814 (2020).Google Scholar
- Kianoosh Mokhtarian and Hans-Arno Jacobsen. 2014. Caching in video CDNs: Building strong lines of defense. In Proceedings of the ninth European conference on computer systems. 1--13.Google ScholarDigital Library
- Hans-Georg Muller and Jane-Ling Wang. 1994. Hazard rate estimation under random censoring with varying kernels and bandwidths. Biometrics (1994), 61--76.Google Scholar
- Vikram Nathan, Jialin Ding, Mohammad Alizadeh, and Tim Kraska. 2020. Learning multi-dimensional indexes. In Proceedings of the 2020 ACM SIGMOD international conference on management of data. 985--1000.Google ScholarDigital Library
- Erik Nygren, Ramesh K Sitaraman, and Jennifer Sun. 2010. The akamai network: a platform for high-performance internet applications. ACM SIGOPS Operating Systems Review, Vol. 44, 3 (2010), 2--19.Google ScholarDigital Library
- Elizabeth J O'neil, Patrick E O'neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. Acm Sigmod Record, Vol. 22, 2 (1993), 297--306.Google ScholarDigital Library
- Nitish Kumar Panigrahy. 2021. Resource Allocation in Distributed Service Networks. (2021).Google Scholar
- Nitish K Panigrahy, Philippe Nain, Giovanni Neglia, and Don Towsley. 2022. A new upper bound on cache hit probability for non-anticipative caching policies. ACM Transactions on Modeling and Performance Evaluation of Computing Systems, Vol. 7, 2--4 (2022), 1--24.Google ScholarDigital Library
- Georgios Paschos, Ejder Bastug, Ingmar Land, Giuseppe Caire, and Mérouane Debbah. 2016. Wireless caching: Technical misconceptions and business barriers. IEEE Communications Magazine, Vol. 54, 8 (2016), 16--22.Google ScholarDigital Library
- Anirudh Sabnis and Ramesh K Sitaraman. 2021. TRAGEN: a synthetic trace generator for realistic cache simulations. In Proceedings of the 21st ACM Internet Measurement Conference. 366--379.Google ScholarDigital Library
- Anirudh Sabnis and Ramesh K Sitaraman. 2022. JEDI: model-driven trace generation for cache simulations. In Proceedings of the 22nd ACM Internet Measurement Conference. 679--693.Google ScholarDigital Library
- Alireza Sadeghi, Gang Wang, and Georgios B Giannakis. 2019. Deep reinforcement learning for adaptive caching in hierarchical content delivery networks. IEEE Transactions on Cognitive Communications and Networking, Vol. 5, 4 (2019), 1024--1033.Google ScholarCross Ref
- Zhan Shi, Xiangru Huang, Akanksha Jain, and Calvin Lin. 2019. Applying deep learning to the cache replacement problem. In Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture. 413--425.Google ScholarDigital Library
- Junaid Shuja, Kashif Bilal, Waleed Alasmary, Hassan Sinky, and Eisa Alanazi. 2021. Applying machine learning techniques for caching in next-generation edge networks: A comprehensive survey. Journal of Network and Computer Applications , Vol. 181 (2021), 103005.Google ScholarCross Ref
- Jiongjiong Song, Min Sheng, Tony QS Quek, Chao Xu, and Xijun Wang. 2017. Learning-based content caching and sharing for wireless networks. IEEE Transactions on Communications, Vol. 65, 10 (2017), 4309--4324.Google Scholar
- Zhenyu Song, Daniel S Berger, Kai Li, Anees Shaikh, Wyatt Lloyd, Soudeh Ghorbani, Changhoon Kim, Aditya Akella, Arvind Krishnamurthy, Emmett Witchel, et al. 2020. Learning relaxed belady for content distribution network caching. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). 529--544.Google ScholarDigital Library
- Stefano Traverso, Mohamed Ahmed, Michele Garetto, Paolo Giaccone, Emilio Leonardi, and Saverio Niccolini. 2015. Unravelling the impact of temporal and geographical locality in content caching systems. IEEE Transactions on Multimedia, Vol. 17, 10 (2015), 1839--1854.Google ScholarDigital Library
- Shreshth Tuli and Giuliano Casale. 2022. Optimizing the Performance of Fog Computing Environments Using AI and Co-Simulation. In Companion of the 2022 ACM/SPEC International Conference on Performance Engineering. 25--28.Google Scholar
- Giuseppe Vietri, Liana V Rodriguez, Wendy A Martinez, Steven Lyons, Jason Liu, Raju Rangaswami, Ming Zhao, and Giri Narasimhan. 2018. Driving cache replacement with $$ML-based$$$$LeCaR$$. In 10th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 18).Google Scholar
- Carl A Waldspurger, Nohhyun Park, Alexander Garthwaite, and Irfan Ahmad. 2015. Efficient $$MRC$$ construction with $$SHARDS$$. In 13th USENIX Conference on File and Storage Technologies (FAST 15). 95--110.Google ScholarDigital Library
- Jane-Ling Wang et al. 2005. Smoothing hazard rates. Encyclopedia of biostatistics , Vol. 7 (2005), 4986--4997.Google Scholar
- Xiaofei Wang, Chenyang Wang, Xiuhua Li, Victor CM Leung, and Tarik Taleb. 2020. Federated deep reinforcement learning for internet of things with decentralized cooperative edge caching. IEEE Internet of Things Journal, Vol. 7, 10 (2020), 9441--9455.Google ScholarCross Ref
- Gang Yan, Jian Li, and Don Towsley. 2021. Learning from optimal caching for content delivery. In Proceedings of the 17th International Conference on emerging Networking EXperiments and Technologies. 344--358.Google ScholarDigital Library
- Chen Zhong, M Cenk Gursoy, and Senem Velipasalar. 2018. A deep reinforcement learning-based framework for content caching. In 2018 52nd Annual Conference on Information Sciences and Systems (CISS). IEEE, 1--6.Google ScholarCross Ref
Index Terms
- A Learning-Based Caching Mechanism for Edge Content Delivery
Recommendations
A New Upper Bound on Cache Hit Probability for Non-Anticipative Caching Policies
Caching systems have long been crucial for improving the performance of a wide variety of network and web-based online applications. In such systems, end-to-end application performance heavily depends on the fraction of objects transferred from the cache, ...
Adaptive TTL-Based Caching for Content Delivery
SIGMETRICS '17 Abstracts: Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer SystemsContent Delivery Networks (CDNs) cache and serve a majority of the user-requested content on the Internet, including web pages, videos, and software downloads. We propose two TTL-based caching algorithms that automatically adapt to the heterogeneity, ...
Adaptive TTL-Based Caching for Content Delivery
Performance evaluation reviewContent Delivery Networks (CDNs) cache and serve a majority of the user-requested content on the Internet, including web pages, videos, and software downloads. We propose two TTL-based caching algorithms that automatically adapt to the heterogeneity, ...
Comments