Autonomous learning of load and traffic patterns to improve cluster utilization

Kwak, Hukeun; Sohn, Andrew; Chung, Kyusik

doi:10.1007/s10586-011-0168-5

Autonomous learning of load and traffic patterns to improve cluster utilization

Published: 19 August 2011

Volume 14, pages 397–417, (2011)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Hukeun Kwak¹,
Andrew Sohn² &
Kyusik Chung¹

107 Accesses
1 Citation
Explore all metrics

Abstract

Adaptive clustering aims at improving cluster utilization for varying load and traffic patterns. Locality-based least-connection with replication (LBLCR) scheduling that comes with Linux is designed to help improve cluster utilization through adaptive clustering. A key issue with LBLCR, however, is that cluster performance depends much on a single threshold value that is used to determine adaptation. Once set, the threshold remains fixed, regardless of the load and traffic patterns. If a cluster of PCs is to adapt to different traffic patterns for high utilization, a good threshold has to be selected and used dynamically. We present in this paper an adaptive clustering framework that autonomously learns and adapts to different load and traffic patterns at runtime with no administrator intervention. The cluster is configured once and for all. As the patterns change, the cluster automatically expands/contracts to meet the changing demands. At the same time, the patterns are proactively learned so that when similar patterns emerge in the future, the cluster knows what to do to improve utilization. We have implemented this autonomous learning method and compared it with LBLCR using published Web traces. Experimental results indicate that our autonomous learning method produces high performance scalability and adaptability for different patterns. On the other hand LBLCR-based clustering suffers from performance scalability and adaptability for different traffic patterns since it is not designed to obtain good threshold values and use them at runtime.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Advanced Micro Devices: AMD64 virtualization codenamed “Pacifica” technology: secure virtual machine architecture reference manual. Publication No. 33047, Revision 3.01 (2005)
Amdahl, G., Blaauw, G., Brooks, F.: Architecture of the IBM system 360. In: Readings in Computer Architecture, pp. 17–31. Kaufmann, Los Altos (2000)
Google Scholar
Bairavasundaram, L., Sivathanu, M., Arpaci-Dusseau, A., Arpaci-Dusseau, R.: X-RAY: a non-invasive exclusive caching mechanism for RAIDs. In: 31st Annual International Symposium on Computer Architecture, pp. 176–187 (2004)
Chapter Google Scholar
Bochs IA-32 Emulator Project. http://bochs.sourceforge.net
Bosque, A., Ibanez, P., Vinals, V., Stenstrom, P., Llabeia, J.: Characterization of Apache web server with Specweb2005. In: Proceedings of the Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, pp. 65–72 (2007)
Chapter Google Scholar
Chandra, R., Bahl, P.: MultiNet: connecting to multiple IEEE 802.11 networks using a single wireless card. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 882–893 (2004)
Google Scholar
Chang, Y., Chen, J.: Designing an enhanced PC cluster system for scalable network services. In: 19th International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)
Google Scholar
Cherkasova, L., Tang, W.: Sizing the streaming media cluster solution for a given workload. In: IEEE International Symposium on Cluster Computing and Grid, pp. 144–151 (2004)
Chapter Google Scholar
Chiang, M., Wu, C., Liao, Y., Chen, Y.: New content-aware request distribution policies in web clusters providing multiple services. In: Proceedings of the ACM Symposium on Applied Computing, pp. 79–83 (2009)
Google Scholar
Christiansen, M., Jeffay, K., Otta, D., Smith, F.: Tuning RED for Web traffic. IEEE/ACM Trans. Netw. 9(3), 249–264 (2001)
Article Google Scholar
Condor Project. http://www.cs.wisc.edu/condor/
Emeneker, W., Stanzione, D.: HPC cluster readiness of Xen and user mode Linux. In: IEEE International Conference on Cluster Computing, pp. 1–8 (2006)
Chapter Google Scholar
Ernemann, C., Hamscher, V., Vahyapour, R.: Benefits of global grid computing for job scheduling. In: 5th IEEE/ACM International Workshop on Grid Computing, pp. 374–379 (2004)
Chapter Google Scholar
Fang, X., Veeraraghavan, M.: Internetworking circuit and connectionless networks. In: 11th International Conference on Advanced Communication Technology, pp. 63–68 (2009)
Google Scholar
Ganek, A., Corbi, T.: The dawning of the autonomic computing era. IBM Syst. J. 42(1), 5–18 (2003)
Article Google Scholar
Govindan, S., Choi, J., Nath, A., Das, A., Urgaonkar, B., Sivasubramaniam, A.: Xen and Co.: communication-aware CPU management in consolidated Xen-based hosting platforms. IEEE Trans. Comput. 58(8), 1111–1125 (2009)
Article MathSciNet Google Scholar
Henderson, T., Katz, R.: Transport protocols for Internet-compatible satellite networks. IEEE J. Sel. Areas Commun. 17(2), 326–344 (1999)
Article Google Scholar
Ho, R., Wang, C., Lau, F.: Lightweight process migration and memory prefetching in openMosix. In: IEEE International Symposium on Parallel and Distributed Processing, pp. 1–12 (2008)
Chapter Google Scholar
Hong, Y., No, J., Han, I.: Evaluation of fault-tolerant distributed Web systems. In: 10th IEEE International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)
Hoskins, M.: User-mode Linux. Linux J. 145 (2006)
HP Utility Computing. http://h20219.www2.hp.com/services/cache/284428-0-0-000-121.html
Intel Virtualization Technology Specification for the IA-32 Intel Architecture. http://dforeman.cs.binghamton.edu/~foreman/552pages/Readings/intel05virtualization.pdf, C97063-002, Intel Corporation, April (2005)
Jamjoom, H., Chou, C., Shin, K.: The impact of concurrency gains on the analysis and control of multi-threaded Internet services. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 827–837 (2004)
Google Scholar
Job Scheduling Algorithms in Linux Virtual Server. http://www.linuxvirtualserver.org/docs/scheduling.html
Karlsson, M., Covell, M.: Dynamic black-box performance model estimation for self-tuning regulators. In: 2nd International Conference on Autonomic Computing, pp. 172–182 (2005)
Chapter Google Scholar
Linux Virtual Server Project. http://www.linuxvirtualserver.org/
Lu, D., Qiao, Y., Dinda, P., Bustamante, F.: Modeling and taming parallel TCP on the wide area network. In: Proceedings of 19th IEEE International Parallel and Distributed Processing Symposium (2005)
Google Scholar
Nahum, E., Barzilai, T., Kandlur, D.: Performance issues in WWW servers. IEEE/ACM Trans. Netw. 10(1), 2–11 (2002)
Article Google Scholar
Olaru, V., Tichy, W.: Request distribution-aware caching in cluster-based Web servers. In: 3rd IEEE International Symposium on Network Computing and Applications, pp. 311–316 (2004)
Google Scholar
OpenMosix. http://openmosix.sourceforge.net
Pacifici, G., Spreitzer, M., Tantawi, A., Youssef, A.: Performance management for cluster-based web services. IEEE J. Sel. Areas Commun. 23(12), 2333–2343 (2005)
Article Google Scholar
Papadimitriou, C.: On selecting a satisfying truth assignment. In: 32nd Annual Symposium on Foundations of Computer Science, pp. 163–169 (1991)
Chapter Google Scholar
Plex86 x86 Virtual Machine Project. http://plex86.sourceforge.net
Smith, F., Campos, F., Jeffay, K., Ott, D.: What TCP/IP protocol headers can tell us about the web. In: ACM SIGMETRICS (2001)
Google Scholar
So, J., Cho, D.: Performance analysis of a DS/SSMA unslotted ALOHA system with two user classes. IEEE Trans. Veh. Technol. 51(6), 1628–1639 (2002)
Article Google Scholar
Squid Web Proxy Cache. http://www.squid-cache.org
Sun N1: n Computers Operating as 1. Sun Microsystems. http://www.sun.com/software/learnabout/n1/
Sun Containers: Server Virtualization and Manageability. http://www.sun.com/software/whitepapers/solaris10/grid_containers.pdf. Sun Microsystems (2004)
Virtual Server 2005 R2 Technical Overview. http://www.microsoft.com/windowsserversystem/virtualserver/overview/vs2005tech.mspx, Microsoft Corporation, September 2004
VMware vSphere. http://www.vmware.com/products/vi/esx/, VMware Inc.
Wang, C., Mueller, F., Engelmann, C., Scott, S.: Proactive process-level live migration in HPC environments. In: Proceedings of the ACM/IEEE conference on Supercomputing (2008)
Google Scholar
Wang, C., Alqaralleh, B., Zhou, B., Till, M., Zomaya, A.: A BLAST service built on data indexed overlay Network. In: 1st International Conference on e-Science and Grid Computing, pp. 16–23 (2005)
Google Scholar
WebStone: The Benchmark for Web Server. http://www.mindcraft.com/webstone, Mindcraft Inc.
Xu, J., Lee, W.: Sustaining availability of Web services under distributed denial of service attacks. IEEE Trans. Comput. 52(2), 195–208 (2003)
Article MathSciNet Google Scholar
Yang, C., Luo, M.: Building an adaptable, fault tolerant, and highly manageable web server on clusters of non-dedicated workstations. In: IEEE International Conference on Parallel Processing, pp. 413–420 (2000)
Chapter Google Scholar
Zhang, R., Abdelzaher, T., Stankovic, J.: Efficient TCP connection failover in Web server clusters. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 1219–1228 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronic Engineering, Soongsil University, 511 Sangdo-dong, Dongjak-gu, Seoul, 156-743, Republic of Korea
Hukeun Kwak & Kyusik Chung
Computer Science Department, New Jersey Institute of Technology, Newark, NJ, 07102, USA
Andrew Sohn

Authors

Hukeun Kwak
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Sohn
View author publications
You can also search for this author in PubMed Google Scholar
Kyusik Chung
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hukeun Kwak.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kwak, H., Sohn, A. & Chung, K. Autonomous learning of load and traffic patterns to improve cluster utilization. Cluster Comput 14, 397–417 (2011). https://doi.org/10.1007/s10586-011-0168-5

Download citation

Received: 25 July 2009
Accepted: 22 June 2011
Published: 19 August 2011
Issue Date: December 2011
DOI: https://doi.org/10.1007/s10586-011-0168-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Autonomous learning of load and traffic patterns to improve cluster utilization

Abstract

Access this article

Similar content being viewed by others

A Brief Study of Adaptive Clustering for Self-aware Machine Analytics

A predictive framework for load balancing clustered web servers

Robust Service Selection Through Intelligent Clustering in an Uncertain Environment

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Autonomous learning of load and traffic patterns to improve cluster utilization

Abstract

Access this article

Similar content being viewed by others

A Brief Study of Adaptive Clustering for Self-aware Machine Analytics

A predictive framework for load balancing clustered web servers

Robust Service Selection Through Intelligent Clustering in an Uncertain Environment

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation