Skip to main content
Log in

Autonomous learning of load and traffic patterns to improve cluster utilization

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Adaptive clustering aims at improving cluster utilization for varying load and traffic patterns. Locality-based least-connection with replication (LBLCR) scheduling that comes with Linux is designed to help improve cluster utilization through adaptive clustering. A key issue with LBLCR, however, is that cluster performance depends much on a single threshold value that is used to determine adaptation. Once set, the threshold remains fixed, regardless of the load and traffic patterns. If a cluster of PCs is to adapt to different traffic patterns for high utilization, a good threshold has to be selected and used dynamically. We present in this paper an adaptive clustering framework that autonomously learns and adapts to different load and traffic patterns at runtime with no administrator intervention. The cluster is configured once and for all. As the patterns change, the cluster automatically expands/contracts to meet the changing demands. At the same time, the patterns are proactively learned so that when similar patterns emerge in the future, the cluster knows what to do to improve utilization. We have implemented this autonomous learning method and compared it with LBLCR using published Web traces. Experimental results indicate that our autonomous learning method produces high performance scalability and adaptability for different patterns. On the other hand LBLCR-based clustering suffers from performance scalability and adaptability for different traffic patterns since it is not designed to obtain good threshold values and use them at runtime.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Advanced Micro Devices: AMD64 virtualization codenamed “Pacifica” technology: secure virtual machine architecture reference manual. Publication No. 33047, Revision 3.01 (2005)

  2. Amdahl, G., Blaauw, G., Brooks, F.: Architecture of the IBM system 360. In: Readings in Computer Architecture, pp. 17–31. Kaufmann, Los Altos (2000)

    Google Scholar 

  3. Bairavasundaram, L., Sivathanu, M., Arpaci-Dusseau, A., Arpaci-Dusseau, R.: X-RAY: a non-invasive exclusive caching mechanism for RAIDs. In: 31st Annual International Symposium on Computer Architecture, pp. 176–187 (2004)

    Chapter  Google Scholar 

  4. Bochs IA-32 Emulator Project. http://bochs.sourceforge.net

  5. Bosque, A., Ibanez, P., Vinals, V., Stenstrom, P., Llabeia, J.: Characterization of Apache web server with Specweb2005. In: Proceedings of the Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, pp. 65–72 (2007)

    Chapter  Google Scholar 

  6. Chandra, R., Bahl, P.: MultiNet: connecting to multiple IEEE 802.11 networks using a single wireless card. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 882–893 (2004)

    Google Scholar 

  7. Chang, Y., Chen, J.: Designing an enhanced PC cluster system for scalable network services. In: 19th International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)

    Google Scholar 

  8. Cherkasova, L., Tang, W.: Sizing the streaming media cluster solution for a given workload. In: IEEE International Symposium on Cluster Computing and Grid, pp. 144–151 (2004)

    Chapter  Google Scholar 

  9. Chiang, M., Wu, C., Liao, Y., Chen, Y.: New content-aware request distribution policies in web clusters providing multiple services. In: Proceedings of the ACM Symposium on Applied Computing, pp. 79–83 (2009)

    Google Scholar 

  10. Christiansen, M., Jeffay, K., Otta, D., Smith, F.: Tuning RED for Web traffic. IEEE/ACM Trans. Netw. 9(3), 249–264 (2001)

    Article  Google Scholar 

  11. Condor Project. http://www.cs.wisc.edu/condor/

  12. Emeneker, W., Stanzione, D.: HPC cluster readiness of Xen and user mode Linux. In: IEEE International Conference on Cluster Computing, pp. 1–8 (2006)

    Chapter  Google Scholar 

  13. Ernemann, C., Hamscher, V., Vahyapour, R.: Benefits of global grid computing for job scheduling. In: 5th IEEE/ACM International Workshop on Grid Computing, pp. 374–379 (2004)

    Chapter  Google Scholar 

  14. Fang, X., Veeraraghavan, M.: Internetworking circuit and connectionless networks. In: 11th International Conference on Advanced Communication Technology, pp. 63–68 (2009)

    Google Scholar 

  15. Ganek, A., Corbi, T.: The dawning of the autonomic computing era. IBM Syst. J. 42(1), 5–18 (2003)

    Article  Google Scholar 

  16. Govindan, S., Choi, J., Nath, A., Das, A., Urgaonkar, B., Sivasubramaniam, A.: Xen and Co.: communication-aware CPU management in consolidated Xen-based hosting platforms. IEEE Trans. Comput. 58(8), 1111–1125 (2009)

    Article  MathSciNet  Google Scholar 

  17. Henderson, T., Katz, R.: Transport protocols for Internet-compatible satellite networks. IEEE J. Sel. Areas Commun. 17(2), 326–344 (1999)

    Article  Google Scholar 

  18. Ho, R., Wang, C., Lau, F.: Lightweight process migration and memory prefetching in openMosix. In: IEEE International Symposium on Parallel and Distributed Processing, pp. 1–12 (2008)

    Chapter  Google Scholar 

  19. Hong, Y., No, J., Han, I.: Evaluation of fault-tolerant distributed Web systems. In: 10th IEEE International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)

  20. Hoskins, M.: User-mode Linux. Linux J. 145 (2006)

  21. HP Utility Computing. http://h20219.www2.hp.com/services/cache/284428-0-0-000-121.html

  22. Intel Virtualization Technology Specification for the IA-32 Intel Architecture. http://dforeman.cs.binghamton.edu/~foreman/552pages/Readings/intel05virtualization.pdf, C97063-002, Intel Corporation, April (2005)

  23. Jamjoom, H., Chou, C., Shin, K.: The impact of concurrency gains on the analysis and control of multi-threaded Internet services. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 827–837 (2004)

    Google Scholar 

  24. Job Scheduling Algorithms in Linux Virtual Server. http://www.linuxvirtualserver.org/docs/scheduling.html

  25. Karlsson, M., Covell, M.: Dynamic black-box performance model estimation for self-tuning regulators. In: 2nd International Conference on Autonomic Computing, pp. 172–182 (2005)

    Chapter  Google Scholar 

  26. Linux Virtual Server Project. http://www.linuxvirtualserver.org/

  27. Lu, D., Qiao, Y., Dinda, P., Bustamante, F.: Modeling and taming parallel TCP on the wide area network. In: Proceedings of 19th IEEE International Parallel and Distributed Processing Symposium (2005)

    Google Scholar 

  28. Nahum, E., Barzilai, T., Kandlur, D.: Performance issues in WWW servers. IEEE/ACM Trans. Netw. 10(1), 2–11 (2002)

    Article  Google Scholar 

  29. Olaru, V., Tichy, W.: Request distribution-aware caching in cluster-based Web servers. In: 3rd IEEE International Symposium on Network Computing and Applications, pp. 311–316 (2004)

    Google Scholar 

  30. OpenMosix. http://openmosix.sourceforge.net

  31. Pacifici, G., Spreitzer, M., Tantawi, A., Youssef, A.: Performance management for cluster-based web services. IEEE J. Sel. Areas Commun. 23(12), 2333–2343 (2005)

    Article  Google Scholar 

  32. Papadimitriou, C.: On selecting a satisfying truth assignment. In: 32nd Annual Symposium on Foundations of Computer Science, pp. 163–169 (1991)

    Chapter  Google Scholar 

  33. Plex86 x86 Virtual Machine Project. http://plex86.sourceforge.net

  34. Smith, F., Campos, F., Jeffay, K., Ott, D.: What TCP/IP protocol headers can tell us about the web. In: ACM SIGMETRICS (2001)

    Google Scholar 

  35. So, J., Cho, D.: Performance analysis of a DS/SSMA unslotted ALOHA system with two user classes. IEEE Trans. Veh. Technol. 51(6), 1628–1639 (2002)

    Article  Google Scholar 

  36. Squid Web Proxy Cache. http://www.squid-cache.org

  37. Sun N1: n Computers Operating as 1. Sun Microsystems. http://www.sun.com/software/learnabout/n1/

  38. Sun Containers: Server Virtualization and Manageability. http://www.sun.com/software/whitepapers/solaris10/grid_containers.pdf. Sun Microsystems (2004)

  39. Virtual Server 2005 R2 Technical Overview. http://www.microsoft.com/windowsserversystem/virtualserver/overview/vs2005tech.mspx, Microsoft Corporation, September 2004

  40. VMware vSphere. http://www.vmware.com/products/vi/esx/, VMware Inc.

  41. Wang, C., Mueller, F., Engelmann, C., Scott, S.: Proactive process-level live migration in HPC environments. In: Proceedings of the ACM/IEEE conference on Supercomputing (2008)

    Google Scholar 

  42. Wang, C., Alqaralleh, B., Zhou, B., Till, M., Zomaya, A.: A BLAST service built on data indexed overlay Network. In: 1st International Conference on e-Science and Grid Computing, pp. 16–23 (2005)

    Google Scholar 

  43. WebStone: The Benchmark for Web Server. http://www.mindcraft.com/webstone, Mindcraft Inc.

  44. Xu, J., Lee, W.: Sustaining availability of Web services under distributed denial of service attacks. IEEE Trans. Comput. 52(2), 195–208 (2003)

    Article  MathSciNet  Google Scholar 

  45. Yang, C., Luo, M.: Building an adaptable, fault tolerant, and highly manageable web server on clusters of non-dedicated workstations. In: IEEE International Conference on Parallel Processing, pp. 413–420 (2000)

    Chapter  Google Scholar 

  46. Zhang, R., Abdelzaher, T., Stankovic, J.: Efficient TCP connection failover in Web server clusters. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 1219–1228 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hukeun Kwak.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kwak, H., Sohn, A. & Chung, K. Autonomous learning of load and traffic patterns to improve cluster utilization. Cluster Comput 14, 397–417 (2011). https://doi.org/10.1007/s10586-011-0168-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-011-0168-5

Keywords

Navigation