research-article

ORL-SDN: Online Reinforcement Learning for SDN-Enabled HTTP Adaptive Streaming

Authors:
Abdelhak Bentaleb

National University of Singapore, Singapore

National University of Singapore, Singapore

0000-0002-5382-6530
View Profile

,
Ali C. Begen

Ozyegin University 8 Networked Media, Turkey

Ozyegin University 8 Networked Media, Turkey
View Profile

,
Roger Zimmermann

National University of Singapore, Singapore

National University of Singapore, Singapore

0000-0002-7410-2590
View Profile

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 14 Issue 3Article No.: 71pp 1–28https://doi.org/10.1145/3219752

Published:09 August 2018Publication History

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

In designing an HTTP adaptive streaming (HAS) system, the bitrate adaptation scheme in the player is a key component to ensure a good quality of experience (QoE) for viewers. We propose a new online reinforcement learning optimization framework, called ORL-SDN, targeting HAS players running in a software-defined networking (SDN) environment. We leverage SDN to facilitate the orchestration of the adaptation schemes for a set of HAS players. To reach a good level of QoE fairness in a large population of players, we cluster them based on a perceptual quality index. We formulate the adaptation process as a Partially Observable Markov Decision Process and solve the per-cluster optimization problem using an online Q-learning technique that leverages model predictive control and parallelism via aggregation to avoid a per-cluster suboptimal selection and to accelerate the convergence to an optimum. This framework achieves maximum long-term revenue by selecting the optimal representation for each cluster under time-varying network conditions. The results show that ORL-SDN delivers substantial improvements in viewer QoE, presentation quality stability, fairness, and bandwidth utilization over well-known adaptation schemes.

References

Saamer Akhshabi, Lakshmi Anantakrishnan, Ali C. Begen, and Constantine Dovrolis. 2012. What happens when HTTP adaptive streaming players compete for bandwidth? In Proceedings of the 22Nd International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV'12). ACM, New York, NY, USA, 9--14. Google ScholarDigital Library
Ahsan Arefin, Raoul Rivas, Rehana Tabassum, and Klara Nahrstedt. 2013. OpenSession: SDN-based cross-layer multi-stream management protocol for 3D teleimmersion. In 21st IEEE International Conference on Network Protocols (ICNP'13). 1--10.Google ScholarCross Ref
Abdelhak Bentaleb, Ali C. Begen, and Roger Zimmermann. 2016. SDNDASH: Improving QoE of HTTP adaptive streaming using software defined networking. In Proceedings of the 2016 ACM on Multimedia Conference (MM'16). ACM, New York, NY, USA, 1296--1305. Google ScholarDigital Library
Niels Bouten, Ricardo de O’Schmidt, Jeroen Famaey, Steven Latré, Aiko Pras, and Filip De Turck. 2015. QoE-driven in-network optimization for adaptive video streaming based on packet sampling measurements. Computer Networks 81, C (2015), 96--115. Google ScholarDigital Library
Valentín Carela-Español, Pere Barlet-Ros, Albert Cabellos-Aparicio, and Josep Solé-Pareta. 2011. Analysis of the impact of sampling on NetFlow traffic classification. Computer Networks 55, 5 (2011), 1083--1099. Google ScholarDigital Library
Federico Chiariotti, Stefano D’Aronco, Laura Toni, and Pascal Frossard. 2016. Online learning adaptation strategy for DASH clients. In Proceedings of the 7th International Conference on Multimedia Systems (MMSys'16). ACM, New York, NY, USA, Article 8, 12 pages. Google ScholarDigital Library
Maxim Claeys, Steven Latré, Jeroen Famaey, Tingyao Wu, Werner Van Leekwijck, and Filip De Turck. 2013. Design of a Q-learning-based client quality selection algorithm for HTTP adaptive video streaming. In Proceedings of the Adaptive and Learning Agents Workshop, part of AAMAS2013. 30--37.Google Scholar
Maxim Claeys, Steven Latré, Jeroen Famaey, Tingyao Wu, Werner Van Leekwijck, and Filip De Turck. 2014. Design and optimisation of a (FA)Q-learning-based HTTP adaptive streaming client. Connection Science 26, 1 (2014), 25--43.Google ScholarDigital Library
DASH-IF. 2017. Guidelines for Implementation: DASH-AVC/264 Test cases and Vectors. Retrieved from https://goo.gl/NhJcui (accessed June 5, 2017).Google Scholar
Dash Industry Forum. 2017. DASH-264 JavaScript Reference Client. Retrieved from https://goo.gl/yd8rrt (accessed March 30, 2017).Google Scholar
Johan De Vriendt, Danny De Vleeschauwer, and David Robinson. 2013. Model for estimating QoE of video delivered using HTTP adaptive streaming. In 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM'13). 1288--1293.Google Scholar
Giorgos Dimopoulos, Ilias Leontiadis, Pere Barlet-Ros, and Konstantina Papagiannaki. 2016. Measuring video QoE from encrypted traffic. In Proceedings of the 2016 Internet Measurement Conference (IMC'16). ACM, New York, NY, USA, 513--526. Google ScholarDigital Library
Zhengfang Duanmu, Kai Zeng, Kede Ma, Abdul Rehman, and Zhou Wang. 2017. A quality-of-experience index for streaming video. IEEE Journal of Selected Topics in Signal Processing 11, 1 (2017), 154--166.Google ScholarCross Ref
Marcus Eckert and Thomas Martin Knoll. 2013. QoE management framework for internet services in SDN enabled mobile networks. In Meeting of the European Network of Universities and Companies in Information and Communication Engineering. Springer, 112--123.Google ScholarCross Ref
Zhengzhu Feng and E. Hansen. 2004. An approach to state aggregation for POMDPs. In AAAI-04 Workshop on Learning and Planning in Markov Processes--Advances and Challenges. 7--12.Google Scholar
Markus Fiedler, Tobias Hossfeld, and Phuoc Tran-Gia. 2010. A quantitative relationship between quality of experience and quality of service. IEEE Network 24, 2 (2010), 36--41. Google ScholarDigital Library
Aditya Ganjam, Faisal Siddiqui, Jibin Zhan, Xi Liu, Ion Stoica, Junchen Jiang, Vyas Sekar, and Hui Zhang. 2015. C3: Internet-scale control plane for video quality optimization. In Proceedings of the 12th USENIX Conference on Networked Systems Design and Implementation (NSDI'15). USENIX Association, Berkeley, CA, USA, 131--144. http://dl.acm.org/citation.cfm?id=2789770.2789780 Google ScholarDigital Library
Panagiotis Georgopoulos, Yehia Elkhatib, Matthew Broadbent, Mu Mu, and Nicholas Race. 2013. Towards network-wide QoE fairness using openflow-assisted adaptive video streaming. In Proceedings of the 2013 ACM SIGCOMMWorkshop on Future Human-centric Multimedia Networking (FhMN'13). ACM, New York, NY, USA, 15--20. Google ScholarDigital Library
Simon Haykin. 1998. Neural Networks: A Comprehensive Foundation (2nd ed.). Prentice Hall PTR, Upper Saddle River, NJ, USA. Google ScholarDigital Library
Victor Heorhiadi, Michael K. Reiter, and Vyas Sekar. 2016. Simplifying software-defined network optimization using SOL. In Proceedings of the 13th Usenix Conference on Networked Systems Design and Implementation (NSDI'16). USENIX Association, Berkeley, CA, USA, 223--237. http://dl.acm.org/citation.cfm?id=2930611.2930627 Google ScholarDigital Library
Te-Yuan Huang, Nikhil Handigol, Brandon Heller, Nick McKeown, and Ramesh Johari. 2012. Confused, timid, and unstable: Picking a video streaming rate is hard. In Proceedings of the 2012 Internet Measurement Conference (IMC'12). ACM, New York, NY, USA, 225--238. Google ScholarDigital Library
Te-Yuan Huang, Ramesh Johari, Nick McKeown, Matthew Trunnell, and Mark Watson. 2015. A buffer-based approach to rate adaptation: Evidence from a large video streaming service. In Proceedings of the 2014 ACM Conference on SIGCOMM (SIGCOMM'14). ACM, New York, NY, USA, 187--198. Google ScholarDigital Library
Milosz Marian Hulboj and Ryszard Erazm Jurga. 2007. Packet sampling and network monitoring. Retrieved from https://bit.ly/2mTg88B (accessed June 15, 2016).Google Scholar
InMon. 2004. sFlow. Retrieved from http://www.sflow.org/ (accessed December 25, 2016).Google Scholar
Raj Jain, Dah-Ming Chiu, and William R. Hawe. 1984. A Quantitative Measure of Fairness and Discrimination for Resource Allocation in Shared Computer System. Vol. 38. Eastern Research Laboratory, Digital Equipment Corporation, Hudson, MA.Google Scholar
Junchen Jiang, Vyas Sekar, Henry Milner, Davis Shepherd, Ion Stoica, and Hui Zhang. 2016. CFA: A practical prediction system for video QoE optimization. In Proceedings of the 13th Usenix Conference on Networked Systems Design and Implementation (NSDI'16). USENIX Association, Berkeley, CA, USA, 137--150. http://dl.acm.org/citation.cfm?id=2930611.2930621 Google ScholarDigital Library
Junchen Jiang, Vyas Sekar, and Hui Zhang. 2012. Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with FESTIVE. In Proceedings of the 8th International Conference on Emerging Networking Experiments and Technologies (CoNEXT'12). ACM, New York, NY, USA, 97--108. Google ScholarDigital Library
Junchen Jiang, Shijie Sun, Vyas Sekar, and Hui Zhang. 2017. Pytheas: Enabling data-driven quality of experience optimization using group-based exploration-exploitation. In Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation (NSDI'17). USENIX Association, Berkeley, CA, USA, 393--406. http://dl.acm.org/citation.cfm?id=3154630.3154662 Google ScholarDigital Library
Parikshit Juluri, Venkatesh Tamarapalli, and Deep Medhi. 2015. SARA: Segment aware rate adaptation algorithm for dynamic adaptive streaming over HTTP. In IEEE International Conference on Communication Workshop (ICCW'15). 1765--1770.Google ScholarCross Ref
Jan Willem Kleinrouweler, Sergio Cabrero, and Pablo Cesar. 2016. Delivering stable high-quality video: An SDN architecture with DASH assisting network elements. In Proceedings of the 7th International Conference on Multimedia Systems (MMSys'16). ACM, New York, NY, USA, Article 4, 10 pages. Google ScholarDigital Library
Diego Kreutz, Fernando M. V. Ramos, P. Esteves Verissimo, C. Esteve Rothenberg, Siamak Azodolmolky, and Steve Uhlig. 2015. Software-defined networking: A comprehensive survey. Proceedings of the IEEE 103, 1 (2015), 14--76.Google ScholarCross Ref
Bob Lantz and Brian O’Connor. 2015. Mininet. Retrieved from http://mininet.org/ (accessed January 20, 2017).Google Scholar
Stefan Lederer, Christopher Müller, and Christian Timmerer. 2012. Dynamic adaptive streaming over HTTP dataset. In Proceedings of the 3rd Multimedia Systems Conference (MMSys'12). ACM, New York, NY, USA, 89--94. Google ScholarDigital Library
Zhi Li, Xiaoqing Zhu, Joshua Gahm, Rong Pan, Hao Hu, Ali C. Begen, and David Oran. 2014. Probe and adapt: Rate adaptation for HTTP video streaming at scale. IEEE Journal on Selected Areas in Communications 32, 4 (2014), 719--733.Google ScholarCross Ref
Xi Liu, Florin Dobrian, Henry Milner, Junchen Jiang, Vyas Sekar, Ion Stoica, and Hui Zhang. 2012. A case for a coordinated internet video control plane. In Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM'12). ACM, New York, NY, USA, 359--370. Google ScholarDigital Library
Xiaomei Liu and Li Xiao. 2007. A survey of multihoming technology in stub networks: Current research and open issues. Network 21, 3 (2007), 32--40. Google ScholarDigital Library
Nicholas Mastronarde and Mihaela van der Schaar. 2011. Fast reinforcement learning for energy-efficient wireless communication. IEEE Transactions on Signal Processing 59, 12 (2011), 6262--6266. Google ScholarDigital Library
Nick McKeown, Tom Anderson, Hari Balakrishnan, Guru Parulkar, Larry Peterson, Jennifer Rexford, Scott Shenker, and Jonathan Turner. 2008. OpenFlow: Enabling innovation in campus networks. In SIGCOMM Comput. Commun. Rev. 38, 2 (2008), 69--74. Google ScholarDigital Library
Ricky K. P. Mok, Xiapu Luo, Edmond W. W. Chan, and Rocky K. C. Chang. 2012. QDASH: a QoE-aware DASH system. In Proceedings of the 3rd Multimedia Systems Conference (MMSys'12). ACM, New York, NY, USA, 11--22. Google ScholarDigital Library
Mu Mu, Matthew Broadbent, Arsham Farshad, Nicholas Hart, David Hutchison, Qiang Ni, and Nicholas Race. 2016. A scalable user fairness model for adaptive video streaming over SDN-assisted future networks. IEEE Journal on Selected Areas in Communications 34, 8 (2016), 2168--2184. Google ScholarDigital Library
Matthew K. Mukerjee, David Naylor, Junchen Jiang, Dongsu Han, Srinivasan Seshan, and Hui Zhang. 2015. Practical, real-time centralized control for CDN-based live video delivery. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication (SIGCOMM'15). ACM, New York, NY, USA, 311--324. Google ScholarDigital Library
Yustus Eko Oktian, SangGon Lee, HoonJae Lee, and JunHuy Lam. 2017. Distributed SDN controller system: A survey on design choice. Computer Networks 121 (2017), 100--111. Google ScholarDigital Library
Athanasios Papoulis and S. Unnikrishna Pillai. 2002. Probability, Random Variables, and Stochastic Processes. Tata McGraw-Hill Education.Google Scholar
Stefano Petrangeli, Maxim Claeys, Steven Latré, Jeroen Famaey, and Filip De Turck. 2014. A multi-agent Q-learning-based framework for achieving fairness in HTTP adaptive streaming. In IEEE Network Operations and Management Symposium (NOMS'14). 1--9.Google ScholarCross Ref
Stefano Petrangeli, Jeroen Famaey, Maxim Claeys, Steven Latré, and Filip De Turck. 2016. QoE-driven rate adaptation heuristic for fair adaptive video streaming. ACM Transactions on Multimedia Computing, Communications, and Applications 12, 2 (2016), 28. Google ScholarDigital Library
Stefano Petrangeli, Tim Wauters, Rafael Huysegems, Tom Bostoen, and Filip De Turck. 2016. Software-defined network-based prioritization to avoid video freezes in HTTP adaptive streaming. Netw. 26, 4 (2016), 248--268. Google ScholarDigital Library
Warren B. Powell. 2009. What you should know about approximate dynamic programming. NRL 56, 3 (2009), 239--249.Google ScholarCross Ref
PyPI. 2017. The Python Package Index. Retrieved from https://goo.gl/635J3x (accessed August 25, 2017).Google Scholar
Abdul Rehman, Kai Zeng, and Zhou Wang. 2015. Display device-adapted video quality-of-experience assessment. In SPIE/IS&T Electronic Imaging. Int. Society for Optics and Photonics, 939406--939406.Google Scholar
Martin Riedmiller. 2005. Neural fitted Q iteration--first experiences with a data efficient neural reinforcement learning method. In Proceedings of the 16th European Conference on Machine Learning (ECM'05). Springer-Verlag, Berlin, Heidelberg, 317--328. Google ScholarDigital Library
RYU SDN Community. 2015. RYU SDN Framework. Retrieved from https://osrg.github.io/ryu/ (accessed December 25, 2016).Google Scholar
Sandvine. 2015. Deep Packet Inspection (DPI). Retrieved from https://goo.gl/2Ms8bH (accessed April 10, 2017).Google Scholar
Sandvine. 2016. Video Quality of Experience: Requirements and Considerations for Meaningful Insight. White Paper.Google Scholar
Justine Sherry, Chang Lan, Raluca Ada Popa, and Sylvia Ratnasamy. 2015. Blindbox: Deep Packet Inspection over Encrypted Traffic. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication (SIGCOMM'15). ACM, New York, NY, USA, 213--226. Google ScholarDigital Library
SSIMWave. 2015. SSIMWave’s Video QoE Monitor. Retrieved from https://goo.gl/u7pG45 (accessed June 11, 2016).Google Scholar
Thomas Stockhammer. 2011. Dynamic adaptive streaming over HTTP: Standards and design principal. In Proceedings of the Second Annual ACM Conference on Multimedia Systems (MMSys'11). ACM, New York, NY, USA, 133--144. Google ScholarDigital Library
Yi Sun, Xiaoqi Yin, Junchen Jiang, Vyas Sekar, Fuyuan Lin, Nanshu Wang, Tao Liu, and Bruno Sinopoli. 2016. Cs2p: Improving video bitrate selection and adaptation with data-driven throughput prediction. In Proceedings of the 2016 ACM SIGCOMM Conference (SIGCOMM'16). ACM, New York, NY, USA, 272--285. Google ScholarDigital Library
Richard S. Sutton and Andrew G. Barto. 1998. Reinforcement Learning: An Introduction. Vol. 1. MIT Press, Cambridge. Google ScholarDigital Library
Emmanuel Thomas, M. O. van Deventer, Thomas Stockhammer, Ali C. Begen, and Jeroen Famaey. 2017. Enhancing MPEG DASH Performance via Server and Network Assistance. SMPTE Motion Imaging Journal 126, 1 (2017), 22--27.Google ScholarCross Ref
Michel Tokic and Günther Palm. 2011. Value-difference based exploration: Adaptive control between Epsilon-Greedy and Softmax. In KI. Springer, 335--346. Google ScholarDigital Library
Jeroen van der Hooft, Stefano Petrangeli, Maxim Claeys, Jeroen Famaey, and Filip De Turck. 2015. A learning-based algorithm for improved bandwidth-awareness of adaptive streaming clients. In IFIP/IEEE International Symposium on Integrated Network Management (IM'15). 131--138.Google ScholarCross Ref
Yang Wang and Stephen Boyd. 2010. Fast Model Predictive Control using Online Optimization. IEEE Transactions on Control Systems Technology 18, 2 (2010), 267--278.Google ScholarCross Ref
Dapeng Wu, Yiwei Thomas Hou, Wenwu Zhu, Ya-Qin Zhang, and Jon M. Peha. 2001. Streaming video over the Internet: Approaches and directions. IEEE Transactions on Circuits and Systems for Video Technology 11, 3 (2001), 282--300. Google ScholarDigital Library
Xiaoqi Yin, Abhishek Jindal, Vyas Sekar, and Bruno Sinopoli. 2015. A control-theoretic approach for dynamic adaptive video streaming over HTTP. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication (SIGCOMM'15). ACM, New York, NY, USA, 325--338. Google ScholarDigital Library
Chao Zhou, Chia-Wen Lin, and Zongming Guo. 2016. mDASH: A Markov decision-based rate adaptation approach for dynamic HTTP streaming. IEEE Transactions on Multimedia 18, 4 (2016), 738--751.Google ScholarDigital Library
Wei Zhou, Li Li, Min Luo, and Wu Chou. 2014. REST API design patterns for SDN northbound API. In 2014 28th International Conference on Advanced Information Networking and Applications Workshops. 358--365. Google ScholarDigital Library

Index Terms

ORL-SDN: Online Reinforcement Learning for SDN-Enabled HTTP Adaptive Streaming
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia streaming

Recommendations

CDN and SDN Support and Player Interaction for HTTP Adaptive Video Streaming
MMSys '21: Proceedings of the 12th ACM Multimedia Systems Conference

Video streaming has become one of the most prevailing, bandwidth-hungry, and latency-sensitive Internet applications. HTTP Adaptive Streaming (HAS) has become the dominant video delivery mechanism over the Internet. Lack of coordination among the ...
Read More
Want to play DASH?: a game theoretic approach for adaptive streaming over HTTP
MMSys '18: Proceedings of the 9th ACM Multimedia Systems Conference

In streaming media, it is imperative to deliver a good viewer experience to preserve customer loyalty. Prior research has shown that this is rather difficult when shared Internet resources struggle to meet the demand from streaming clients that are ...
Read More
Survey on Reinforcement Learning based Efficient Routing in SDN
SMA 2020: The 9th International Conference on Smart Media and Applications

Traffic on the Internet is growing exponentially around the globe, and SDN (Software Defined Network) becomes very attractive to network research communities. The trend switching from a hardware-centered network to a software-centric network allows that ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Multimedia Computing, Communications, and Applications Volume 14, Issue 3
August 2018
249 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/3241977
Editor:
Alberto Del Bimbo
University of Firenze, Italy
Issue’s Table of Contents
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 August 2018
- Accepted: 1 April 2018
- Revised: 1 February 2018
- Received: 1 September 2017
Published in tomm Volume 14, Issue 3

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
HAS
HAS scalability issues
POMDP
QoE optimization
SDN
fastMPC
reinforcement learning
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 12
  Total Citations
  View Citations
- 481
  Total Downloads
- Downloads (Last 12 months)22
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

ORL-SDN: Online Reinforcement Learning for SDN-Enabled HTTP Adaptive Streaming

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

CDN and SDN Support and Player Interaction for HTTP Adaptive Video Streaming

Want to play DASH?: a game theoretic approach for adaptive streaming over HTTP

Survey on Reinforcement Learning based Efficient Routing in SDN