research-article

Open Access

A Learning-Based Caching Mechanism for Edge Content Delivery

Authors:
Hoda Torabi (ORCID: 0009-0003-0150-7321), Hamzeh Khazaei (ORCID: 0000-0001-5439-8024), Marin Litoiu (ORCID: 0000-0003-0383-920X)

Department of Electrical Engineering and Computer Science, York University, Toronto, Canada

ICPE '24: Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering, May 2024, Pages 236–246, https://doi.org/10.1145/3629526.3645037

Published: 07 May 2024

ABSTRACT

With the advent of 5G networks and the rise of the Internet of Things (IoT), Content Delivery Networks (CDNs) are increasingly extending into the network edge. This shift introduces unique challenges, particularly due to the limited cache storage and the diverse request patterns at the edge. These edge environments can host traffic classes characterized by varied object-size distributions and object-access patterns. Such complexity makes it difficult for traditional caching strategies, which often rely on metrics like request frequency or time intervals, to be effective. Despite these complexities, the optimization of edge caching is crucial. Improved byte hit rates at the edge not only alleviate the load on the network backbone but also minimize operational costs and expedite content delivery to end-users. In this paper, we introduce HR-Cache, a comprehensive learning-based caching framework grounded in the principles of Hazard Rate (HR) ordering, a rule originally formulated to compute an upper bound on cache performance. HR-Cache leverages this rule to guide future object eviction decisions. It employs a lightweight machine learning model to learn from caching decisions made based on HR ordering, subsequently predicting the "cache-friendliness" of incoming requests. Objects deemed "cache-averse" are placed into cache as priority candidates for eviction. Through extensive experimentation, we demonstrate that HR-Cache not only consistently enhances byte hit rates compared to existing state-of-the-art methods but also achieves this with minimal prediction overhead. Our experimental results, using three real-world traces and one synthetic trace, indicate that HR-Cache consistently achieves 2.2% to 14.6% greater WAN traffic savings than LRU. It outperforms not only heuristic caching strategies but also the state-of-the-art learning-based algorithm.
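The placement idea in the abstract (objects predicted "cache-averse" enter the cache as priority eviction candidates) and the byte hit rate metric can be illustrated with a small sketch. This is not the authors' implementation: the class `AverseAwareLRU`, the function `byte_hit_rate`, the size-threshold predicate standing in for the learned model, the choice to re-position an object on a hit, and the toy trace are all assumptions made for illustration.

```python
from collections import OrderedDict
from typing import Callable, Iterable, Tuple


class AverseAwareLRU:
    """Byte-capacity LRU that inserts predicted cache-averse objects at the eviction end.

    Hypothetical sketch: `is_cache_friendly` stands in for a learned classifier.
    """

    def __init__(self, capacity_bytes: int, is_cache_friendly: Callable[[str, int], bool]):
        self.capacity = capacity_bytes
        self.is_cache_friendly = is_cache_friendly
        self.items: OrderedDict = OrderedDict()  # key -> size; first entry is evicted next
        self.used = 0

    def request(self, key: str, size: int) -> bool:
        """Serve one request and return True on a cache hit."""
        friendly = self.is_cache_friendly(key, size)
        if key in self.items:
            # Assumption: on a hit, re-position according to the current prediction.
            self.items.move_to_end(key, last=friendly)
            return True
        if size > self.capacity:
            return False  # object can never fit; bypass the cache
        while self.used + size > self.capacity:
            _, evicted_size = self.items.popitem(last=False)
            self.used -= evicted_size
        self.items[key] = size
        self.used += size
        if not friendly:
            # Cache-averse objects become the first eviction candidates.
            self.items.move_to_end(key, last=False)
        return False


def byte_hit_rate(cache: AverseAwareLRU, trace: Iterable[Tuple[str, int]]) -> float:
    """Fraction of requested bytes that were served from the cache."""
    hit_bytes = total_bytes = 0
    for key, size in trace:
        total_bytes += size
        if cache.request(key, size):
            hit_bytes += size
    return hit_bytes / total_bytes if total_bytes else 0.0


# Toy trace: two small, repeatedly requested objects and two large one-off objects.
toy_trace = [("a", 10), ("b", 10), ("x1", 90), ("a", 10),
             ("b", 10), ("x2", 90), ("a", 10), ("b", 10)]

# Hypothetical predictor: treat objects larger than 50 bytes as cache-averse.
averse_aware_bhr = byte_hit_rate(AverseAwareLRU(100, lambda key, size: size <= 50), toy_trace)
# Predicting every object as cache-friendly reduces the policy to plain LRU.
plain_lru_bhr = byte_hit_rate(AverseAwareLRU(100, lambda key, size: True), toy_trace)
```

On this toy trace the averse-aware variant keeps one of the small objects resident across the large one-off requests, so its byte hit rate is higher than that of plain LRU. The size threshold is only a placeholder; the abstract describes a lightweight model trained on decisions derived from HR ordering, which this sketch does not attempt to reproduce.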

Published in
ICPE '24: Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering
May 2024
310 pages
ISBN:9798400704444
DOI:10.1145/3629526
General Chairs: Simonetta Balsamo (Ca' Foscari University of Venice, Italy), William Knottenbelt (Imperial College London, UK)
Program Chairs: Cristina L. Abad (Escuela Superior Politecnica del Litoral, Ecuador), Weiyi Shang (University of Waterloo, Canada)
Copyright © 2024 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Badges
- Artifacts Evaluated & Functional / v1.1
- Artifacts Available / v1.1
Author Tags
caching
content delivery network
hazard rate
hit probability
Acceptance Rates
Overall Acceptance Rate: 252 of 851 submissions, 30%