Abstract
Adaptive clustering aims at improving cluster utilization for varying load and traffic patterns. Locality-based least-connection with replication (LBLCR) scheduling that comes with Linux is designed to help improve cluster utilization through adaptive clustering. A key issue with LBLCR, however, is that cluster performance depends much on a single threshold value that is used to determine adaptation. Once set, the threshold remains fixed, regardless of the load and traffic patterns. If a cluster of PCs is to adapt to different traffic patterns for high utilization, a good threshold has to be selected and used dynamically. We present in this paper an adaptive clustering framework that autonomously learns and adapts to different load and traffic patterns at runtime with no administrator intervention. The cluster is configured once and for all. As the patterns change, the cluster automatically expands/contracts to meet the changing demands. At the same time, the patterns are proactively learned so that when similar patterns emerge in the future, the cluster knows what to do to improve utilization. We have implemented this autonomous learning method and compared it with LBLCR using published Web traces. Experimental results indicate that our autonomous learning method produces high performance scalability and adaptability for different patterns. On the other hand LBLCR-based clustering suffers from performance scalability and adaptability for different traffic patterns since it is not designed to obtain good threshold values and use them at runtime.
Similar content being viewed by others
References
Advanced Micro Devices: AMD64 virtualization codenamed “Pacifica” technology: secure virtual machine architecture reference manual. Publication No. 33047, Revision 3.01 (2005)
Amdahl, G., Blaauw, G., Brooks, F.: Architecture of the IBM system 360. In: Readings in Computer Architecture, pp. 17–31. Kaufmann, Los Altos (2000)
Bairavasundaram, L., Sivathanu, M., Arpaci-Dusseau, A., Arpaci-Dusseau, R.: X-RAY: a non-invasive exclusive caching mechanism for RAIDs. In: 31st Annual International Symposium on Computer Architecture, pp. 176–187 (2004)
Bochs IA-32 Emulator Project. http://bochs.sourceforge.net
Bosque, A., Ibanez, P., Vinals, V., Stenstrom, P., Llabeia, J.: Characterization of Apache web server with Specweb2005. In: Proceedings of the Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, pp. 65–72 (2007)
Chandra, R., Bahl, P.: MultiNet: connecting to multiple IEEE 802.11 networks using a single wireless card. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 882–893 (2004)
Chang, Y., Chen, J.: Designing an enhanced PC cluster system for scalable network services. In: 19th International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)
Cherkasova, L., Tang, W.: Sizing the streaming media cluster solution for a given workload. In: IEEE International Symposium on Cluster Computing and Grid, pp. 144–151 (2004)
Chiang, M., Wu, C., Liao, Y., Chen, Y.: New content-aware request distribution policies in web clusters providing multiple services. In: Proceedings of the ACM Symposium on Applied Computing, pp. 79–83 (2009)
Christiansen, M., Jeffay, K., Otta, D., Smith, F.: Tuning RED for Web traffic. IEEE/ACM Trans. Netw. 9(3), 249–264 (2001)
Condor Project. http://www.cs.wisc.edu/condor/
Emeneker, W., Stanzione, D.: HPC cluster readiness of Xen and user mode Linux. In: IEEE International Conference on Cluster Computing, pp. 1–8 (2006)
Ernemann, C., Hamscher, V., Vahyapour, R.: Benefits of global grid computing for job scheduling. In: 5th IEEE/ACM International Workshop on Grid Computing, pp. 374–379 (2004)
Fang, X., Veeraraghavan, M.: Internetworking circuit and connectionless networks. In: 11th International Conference on Advanced Communication Technology, pp. 63–68 (2009)
Ganek, A., Corbi, T.: The dawning of the autonomic computing era. IBM Syst. J. 42(1), 5–18 (2003)
Govindan, S., Choi, J., Nath, A., Das, A., Urgaonkar, B., Sivasubramaniam, A.: Xen and Co.: communication-aware CPU management in consolidated Xen-based hosting platforms. IEEE Trans. Comput. 58(8), 1111–1125 (2009)
Henderson, T., Katz, R.: Transport protocols for Internet-compatible satellite networks. IEEE J. Sel. Areas Commun. 17(2), 326–344 (1999)
Ho, R., Wang, C., Lau, F.: Lightweight process migration and memory prefetching in openMosix. In: IEEE International Symposium on Parallel and Distributed Processing, pp. 1–12 (2008)
Hong, Y., No, J., Han, I.: Evaluation of fault-tolerant distributed Web systems. In: 10th IEEE International Conference on Advanced Information Networking and Applications, pp. 163–166 (2005)
Hoskins, M.: User-mode Linux. Linux J. 145 (2006)
HP Utility Computing. http://h20219.www2.hp.com/services/cache/284428-0-0-000-121.html
Intel Virtualization Technology Specification for the IA-32 Intel Architecture. http://dforeman.cs.binghamton.edu/~foreman/552pages/Readings/intel05virtualization.pdf, C97063-002, Intel Corporation, April (2005)
Jamjoom, H., Chou, C., Shin, K.: The impact of concurrency gains on the analysis and control of multi-threaded Internet services. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 827–837 (2004)
Job Scheduling Algorithms in Linux Virtual Server. http://www.linuxvirtualserver.org/docs/scheduling.html
Karlsson, M., Covell, M.: Dynamic black-box performance model estimation for self-tuning regulators. In: 2nd International Conference on Autonomic Computing, pp. 172–182 (2005)
Linux Virtual Server Project. http://www.linuxvirtualserver.org/
Lu, D., Qiao, Y., Dinda, P., Bustamante, F.: Modeling and taming parallel TCP on the wide area network. In: Proceedings of 19th IEEE International Parallel and Distributed Processing Symposium (2005)
Nahum, E., Barzilai, T., Kandlur, D.: Performance issues in WWW servers. IEEE/ACM Trans. Netw. 10(1), 2–11 (2002)
Olaru, V., Tichy, W.: Request distribution-aware caching in cluster-based Web servers. In: 3rd IEEE International Symposium on Network Computing and Applications, pp. 311–316 (2004)
OpenMosix. http://openmosix.sourceforge.net
Pacifici, G., Spreitzer, M., Tantawi, A., Youssef, A.: Performance management for cluster-based web services. IEEE J. Sel. Areas Commun. 23(12), 2333–2343 (2005)
Papadimitriou, C.: On selecting a satisfying truth assignment. In: 32nd Annual Symposium on Foundations of Computer Science, pp. 163–169 (1991)
Plex86 x86 Virtual Machine Project. http://plex86.sourceforge.net
Smith, F., Campos, F., Jeffay, K., Ott, D.: What TCP/IP protocol headers can tell us about the web. In: ACM SIGMETRICS (2001)
So, J., Cho, D.: Performance analysis of a DS/SSMA unslotted ALOHA system with two user classes. IEEE Trans. Veh. Technol. 51(6), 1628–1639 (2002)
Squid Web Proxy Cache. http://www.squid-cache.org
Sun N1: n Computers Operating as 1. Sun Microsystems. http://www.sun.com/software/learnabout/n1/
Sun Containers: Server Virtualization and Manageability. http://www.sun.com/software/whitepapers/solaris10/grid_containers.pdf. Sun Microsystems (2004)
Virtual Server 2005 R2 Technical Overview. http://www.microsoft.com/windowsserversystem/virtualserver/overview/vs2005tech.mspx, Microsoft Corporation, September 2004
VMware vSphere. http://www.vmware.com/products/vi/esx/, VMware Inc.
Wang, C., Mueller, F., Engelmann, C., Scott, S.: Proactive process-level live migration in HPC environments. In: Proceedings of the ACM/IEEE conference on Supercomputing (2008)
Wang, C., Alqaralleh, B., Zhou, B., Till, M., Zomaya, A.: A BLAST service built on data indexed overlay Network. In: 1st International Conference on e-Science and Grid Computing, pp. 16–23 (2005)
WebStone: The Benchmark for Web Server. http://www.mindcraft.com/webstone, Mindcraft Inc.
Xu, J., Lee, W.: Sustaining availability of Web services under distributed denial of service attacks. IEEE Trans. Comput. 52(2), 195–208 (2003)
Yang, C., Luo, M.: Building an adaptable, fault tolerant, and highly manageable web server on clusters of non-dedicated workstations. In: IEEE International Conference on Parallel Processing, pp. 413–420 (2000)
Zhang, R., Abdelzaher, T., Stankovic, J.: Efficient TCP connection failover in Web server clusters. In: 23rd Annual Joint Conference of the IEEE Computer and Communications Societies, pp. 1219–1228 (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kwak, H., Sohn, A. & Chung, K. Autonomous learning of load and traffic patterns to improve cluster utilization. Cluster Comput 14, 397–417 (2011). https://doi.org/10.1007/s10586-011-0168-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-011-0168-5