ABSTRACT
In this paper, we study the live streaming workload from a large content delivery network. Our data, collected over a 3 month period, contains over 70 million requests for 5,000 distinct URLs from clients in over 200 countries. To our knowledge, this is the most extensive data of live streaming on the Internet that has been studied to date. Our contributions are two-fold. First, we present a macroscopic analysis of the workload, characterizing popularity, arrival process, session duration, and transport protocol use. Our results show that popularity follows a 2-mode Zipf distribution, session interarrivals within small time-windows are exponential, session durations are heavy-tailed, and that UDP is far from having universal reach on the Internet. Second, we cover two additional characteristics that are more specific to the nature of live streaming applications: the diversity of clients in comparison to traditional broadcast media like radio and TV, and the phenomena that many clients regularly join recurring events. We find that Internet streaming does reach a wide audience, often spanning hundreds of AS domains and tens of countries. More interesting is that small streams also have a diverse audience. We also find that recurring users often have lifetimes of at least as long as one-third of the days in the event.
- J. M. Almeida, J. Krueger, D. L. Eager, and M. K. Vernon. Analysis of Educational Media Server Workloads. In Proceedings of NOSSDAV, June 2001. Google ScholarDigital Library
- V. Almeida, A. Bestavros, M. Crovella, and A. de Oliveira. Characterizing Reference Locality in the WWW. In Proceedings of 1996 International Conference on Parallel and Distributed Information Systems (PDIS '96), 1996. Google ScholarDigital Library
- K. C. Almeroth and M. H. Ammar. Collecting and Modeling the Join/Leave Behavior of Multicast Group Members in the MBone. In Proceedings of International Symposium on High Performance Distributed Computing (HPDC), August 1996. Google ScholarDigital Library
- K. Andreev, B. M. Maggs, A. Meyerson, and R. Sitaraman. Designing Overlay Multicast Networks for Streaming. In Proceedings of the Fifteenth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), 2003. Google ScholarDigital Library
- L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web Caching and Zipf-like Distributions: Evidence and Implications. In Proceedings of the IEEE INFOCOMM '99, March 1999.Google ScholarCross Ref
- M. Chesire, A. Wolman, G. Voelker, and H. Levy. Measurement and Analysis of a Streaming-Media Workload. In Proceedings of Usenix Symposium on Internet Technologies and Systems (USITS), March 2001. Google ScholarDigital Library
- M. Crovella and M. Taqqu. Estimating the Heavy Tail Index from Scaling Properties. In Methodology and Computing in Applied Probability, vol. 1, no. 1, 1999. Google ScholarDigital Library
- J. Cui, M. Faloutsos, D. Maggiorini, M. Gerla, and K. Boussetta. Measuring and Modelling the Group Membership in the Internet. In Proceedings of Interne Measurement Conference (IMC), October 2003. Google ScholarDigital Library
- C. Cunha, A. Bestavros, and M. Covella. Characteristics of WWWClient Based Traces. Technical Report BU-CS-95-010, Computer Science Department, Boston University, 1995. Google ScholarDigital Library
- S. Floyd, M. Handley, J. Padhye, and J. Widmer. Multicast Routing in Internetworks and Extended LANs. In Proceedings of the ACM SIGCOMM, August 2000.Google Scholar
- S. Glassman. A Caching Relay for the World Wide Web. In Proceedings of the First International Conference on the World-Wide Web, 1994. Google ScholarDigital Library
- K. Gummadi, R. Dunn, S. Saroiu, S. Gribble, H. Levy, and J. Zahorjan. Measurement, Modeling, and Analysis of a Peer-to-Peer File-Sharing Workload. In Proceedings of ACM SOSP, October 2003. Google ScholarDigital Library
- IxMapper. http://www.ixiacom. com/products.Google Scholar
- B. Krishnamurthy and J. Wang. On Network-Aware Clustering of Web Clients. In Proceedings of ACM SIGCOMM, September 2000. Google ScholarDigital Library
- T. M. Kroeger, J. C. Mogul, and C. Maltzahn. Digital's web proxy traces. Available at ftp://ftp.digital. com/pub/DEC/traces/proxy/webtraces.html, August 1996.Google Scholar
- A. Lakhina, J. W. Byers, M. Crovella, and I. Matta. On the Geographic Location of Internet Resources. In IEEE Journaln Selected Areas in Communications Vol. 21 No. 6, Aug 2003. Google ScholarDigital Library
- MaxMind GeoIP Free Country Database. http://www.maxmind.com/app/geoipcountry.Google Scholar
- Microsoft Windows Media Services 9 Series and Windows Media Services Version 4. 1 - Performance Comparison. http://www. veritest.com/clients/reports/microsoft/ms media services. pdf.Google Scholar
- V. N. Padmanabhan, H. J. Wang, P. A. Chou, and K. Sripanidkulchai. Distributing Streaming Media Content Using Cooperative Networking. In Proceedings of NOSSDAV, May 2002. Google ScholarDigital Library
- Route Views Project. http://www.routeviews.org/.Google Scholar
- K. Sripanidkulchai. A Measurement-Driven Approach to Designing Peer-to-Peer Systems. Ph. D. Thesis, Carnegie Mellon University, 2004. Google ScholarDigital Library
- K. Sripanidkulchai, A. Ganjam, B. Maggs, and H. Zhang. The Feasibility of Supporting Large-Scale Live Streaming Applications with Dynamic Application End-Points. In Proceedings of ACM SIGCOMM, 2004. Google ScholarDigital Library
- E. Veloso, V. Almeida, W. Meira, A. Bestavros, and S. Jin. A Hierarchical Characterization of a Live Streaming Media Workload. In Proceedings of Internet Measurement Workshop (IMW), November 2002. Google ScholarDigital Library
- Y. Wang, M. Claypool, and Z. Zuo. An Empirical Study of RealVideo Performance Across the Internet. In Proceedings of Internet Measurement Workshop (IMW), November 2001. Google ScholarDigital Library
Index Terms
- An analysis of live streaming workloads on the internet
Recommendations
Optimizing CDN Infrastructure for Live Streaming with Constrained Server Chaining
ISPA '11: Proceedings of the 2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with ApplicationsContent Delivery Networks (CDNs) are increasingly being used to deliver live streaming on today's Internet. The new application type of live streaming exposes unique characteristics and challenges that require more advanced design of CDN infrastructure. ...
Design and deployment of a hybrid CDN-P2P system for live video streaming: experiences with LiveSky
MM '09: Proceedings of the 17th ACM international conference on MultimediaWe present our design and deployment experiences with LiveSky, a commercially deployed hybrid CDN-P2P live streaming system. CDNs and P2P systems are the common techniques used for live streaming, each having its own set of advantages and disadvantages. ...
HTTP/2-Based Methods to Improve the Live Experience of Adaptive Streaming
MM '15: Proceedings of the 23rd ACM international conference on MultimediaHTTP Adaptive Streaming (HAS) is today the number one video technology for over-the-top video distribution. In HAS, video content is temporally divided into multiple segments and encoded at different quality levels. A client selects and retrieves per ...
Comments