Skip to main content
Log in

Measurement Based Analysis of One-Click File Hosting Services

  • Published:
Journal of Network and Systems Management Aims and scope Submit manuscript

Abstract

It is commonly believed that file sharing traffic on the Internet is mostly generated by peer-to-peer applications. However, we show that HTTP based file sharing services are also extremely popular. We analyzed the traffic of a large research and education network for three months, and observed that a large fraction of the inbound HTTP traffic corresponds to file download services, which indicates that an important portion of file sharing traffic is in the form of HTTP data. In particular, we found that two popular one-click file hosting services are among the top Internet domains in terms of served traffic volume. In this paper, we present an exhaustive study of the traffic generated by such services, the behavior of their users, the downloaded content, and their server infrastructure.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. We avoid disclosing the name of the organization for privacy concerns.

  2. Throughout this work, we use the terms direct download and one-click file hosting to refer to HTTP-based file sharing services interchangeably.

  3. According to our data, RapidShare enhanced their login procedure on June 8th. As of this writing, sessions cannot be tracked with this simple approach.

References

  1. Fielding, R., Gettys, J., Mogul, J., Frystyk, H., Masinter, L., Leach, P., Berners-Lee, T.: Hypertext Transfer Protocol—HTTP/1.1. RFC 2616 (Draft Standard) (1999)

  2. Anderson, P.: What is Web 2.0? Ideas, technologies and implications for education. In: JISC Technology and Standards Watch, pp. 2–64 (2007)

  3. Garrett, J.: Ajax: a new approach to web applications. Adaptive path (2005). http://www.adaptivepath.com/ideas/e000385

  4. Adobe Flash: http://www.adobe.com/

  5. Schneider, F., Agarwal, S., Alpcan, T., Feldmann, A.: The new web: characterizing ajax traffic. In: Proceedings of the 9th International Conference on Passive and Active Network Measurement (2008)

  6. Cha, M., Kwak, H., Rodriguez, P., Ahn, Y.Y., Moon, S.: I tube, you tube, everybody tubes: analyzing the world’s largest user generated content video system. In: Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement (2007)

  7. Li, W., Moore, A., Canini, M.: Classifying HTTP traffic in the new age. In: ACM SIGCOMM, Poster Session (2008)

  8. RapidShare AG: http://www.rapidshare.com/

  9. Megaupload: http://www.megaupload.com/

  10. Cuevas, R., Kryczka, M., Cuevas, A., Kaune, S., Guerrero, C., Rejaie, R.: Is content publishing in bittorrent altruistic or profit-driven? In: Proceedings of ACM CoNext (2010)

  11. Antoniades, D., Markatos, E.P., Dovrolis, C.: One-click hosting services: a file-sharing hideout. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement (2009)

  12. Borgnat, P., Dewaele, G., Fukuda, K., Abry, P., Cho, K.: Seven years and one day: sketching the evolution of internet traffic. In: Proceedings of INFOCOM (2009)

  13. Claffy K., Braun H., Polyzos G. (1994) Tracking long-term growth of the NSFNET. Commun. ACM 37(8):34–45

    Google Scholar 

  14. Schulze, H., Mochalski, K.: P2P survey 2006. http://www.ipoque.com/resources

  15. Schulze, H., Mochalski, K.: Internet study 2007. http://www.ipoque.com/resources

  16. Schulze, H., Mochalski, K.: Internet study 2008–2009. http://www.ipoque.com/resources

  17. Feldmann, A., Rexford, J., Caceres, R.: Efficient policies for carrying Web traffic over flow-switched networks. IEEE/ACM transactions on Networking 6(6), 673–685 (1998)

    Article  Google Scholar 

  18. Catledge, L., Pitkow, J.: Characterizing browsing strategies in the World-Wide Web. Computer Networks and ISDN systems 27(6), 1065–1073 (1995)

    Article  Google Scholar 

  19. Barford, P., Bestavros, A., Bradley, A., Crovella, M.: Changes in web client access patterns: Characteristics and caching implications. World Wide Web 2(1), 15–28 (1999)

    Article  Google Scholar 

  20. Sen, S., Wang, J.: Analyzing peer-to-peer traffic across large networks. In: Proceedings of the 2nd ACM SIGCOMM Workshop on Internet Measurment (2002)

  21. Saroiu, S., Gummadi, P., Gribble, S., et al.: A measurement study of peer-to-peer file sharing systems. In: Proceedings of Multimedia Computing and Networking (2002)

  22. Gummadi, K.P., Dunn, R.J., Saroiu, S., Gribble, S.D., Levy, H.M., Zahorjan, J.: Measurement, modeling, and analysis of a peer-to-peer file-sharing workload. In: Proceedings of ACM SOSP (2003)

  23. Pouwelse, J., Garbacki, P., Epema, D., Sips, H.: The bittorrent p2p file-sharing system: Measurements and analysis. Lect. Notes Comput. Sci. 3640, 205 (2005)

    Article  Google Scholar 

  24. Tutschku, K.: A measurement-based traffic profile of the eDonkey filesharing service. Lect. Notes Comput. Sci., 12–21 (2004)

  25. Guha, S., Daswani, N., Jain, R.: An experimental study of the skype peer-to-peer voip system. In: Proceedings of IPTPS (2006)

  26. Karagiannis, T., Rodriguez, P., Papagiannaki, K.: Should internet service providers fear peer-assisted content distribution? In: Proceedings of the 5th ACM SIGCOMM Conference on Internet Measurement (2005)

  27. Barlet-Ros, P., Iannaccone, G., Sanjuàs-Cuxart, J., Amores-López, D., Solé-Pareta, J.: Load shedding in network monitoring applications. In: Proceedings of USENIX Annual Technical Conference, pp. 59–72. Usenix Association (2007)

  28. JDownloader: http://www.jdownloader.org

  29. Nguyen, T., Armitage, G.: A survey of techniques for internet traffic classification using machine learning. IEEE Commun. Surv. Tutor. 10(4) (2008)

  30. ipoque Protocol and Application Classification Engine: http://www.ipoque.com/products/pace-application-classification

  31. Alexa: http://www.alexa.com

  32. Von Ahn, L., Blum, M., Hopper, N.J., Langford, J.: CAPTCHA: using hard AI problems for security. In: Proceedings of the 22nd International Conference on Theory and Applications of Cryptographic Techniques (2003)

  33. RapidShare news: http://www.rapidshare.com/news.html

  34. Allman, M., Paxson, V., Blanton, E.: TCP congestion control. RFC 5681 (Draft Standard) (2009)

  35. Briscoe B. (2007) Flow rate fairness: dismantling a religion. ACM SIGCOMM Comput. Commun. Rev. 37(2):63–74

    Google Scholar 

  36. WinRAR archiver: http://www.rarlab.com/

  37. Team Cymru: http://www.team-cymru.org

  38. MaxMind GeoLite Country: http://www.maxmind.com/app/geoip_country

Download references

Acknowledgments

The authors thank the anonymous research institution for allowing the collection and analysis of their traffic for research purposes. We are also grateful to Ismael Castell-Uroz for his assistance and feedback. This work includes GeoLite data created by MaxMind, available from http://www.maxmind.com. We acknowledge Ipoque for kindly providing access to their PACE [30] traffic classification engine for this research work. This research has been partially funded by the Comissionat per a Universitats i Recerca del DIUE de la Generalitat de Catalunya (ref. 2009SGR-1140).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Josep Sanjuàs-Cuxart.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sanjuàs-Cuxart, J., Barlet-Ros, P. & Solé-Pareta, J. Measurement Based Analysis of One-Click File Hosting Services. J Netw Syst Manage 20, 276–301 (2012). https://doi.org/10.1007/s10922-011-9202-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10922-011-9202-4

keywords

Navigation