Abstract
It is increasing important to get accurate information from the web. In this paper, the authors used the virtualization technology which is the key in cloud computing to build up a web data mining cloud model. This model is shown as Fig. 309.1. It consists of Storage Cloud and Calculation Cloud. Finally, this paper described a specific instance of Web Date Mining combined with the application of Cloud Computing. This instance compared the proposed method with the traditional method with Figs. 309.4 and 309.5. In addition, the Table 309.2 shows the new model with appropriate nodes can reduce the consumed time definitely.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Liu B (2007) Web data mining. Springer, New York
Roth H, Schiefer J, Obweger H, Rozsnyai S (2010) Event data warehousing for complex event processing. In: Proceedings of the 4th international conference on research challenges in information science, IEEE Press, Washington, pp 22–27
Szabolcs Rozsnyai, Aleksander Slominski, Yurdaer Doganata (2011) Large-scale distributed storage system for business provenance. Cloud Comput 5(02):516–524
Wu KL, Yu PS, Ballman A (2010) A web usage mining and analysis tool. IBM Systems Journal 22(04):321–329
Rozsnyai S, Vecera R, Schiefer J, Schatten A (2007) Event cloud-searching for correlated business events. In: Proceedings of the 9th IEEE international conference on E-commerce technology and the 4th IEEE international conference on enterprise computing, ecommerce and e-services (CEC-EEE 2007). IEEE Computer Society, Washington, vol 9, pp 409–420
Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R (2010) Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium on cloud computing (SoCC’10). ACM, New York, vol 1, pp 143–154
Borthaku D The hadoop distributed file system: architecture and design [EB/OL]. (2011-01-20). http://hadoop.apache.org/common/docs/r0.18.0/hdfs_design.pdf
Amazon, Inc Amazon simple store service (Amazon S3) [EB/OL]. (2011-01-20). http://www.amazon.com/S3
Chang F, Dean J, Chema Wat S (2008) Big table: a distributed storage system for structured data. ACM Trans Comput Syst 26(2):1–26
John Shafer, Rakesh Agrawal, Manish Mehta SPRINT (1996) A scalable parallel classifier for data mining. IBM Almaden Research Center, US, vol 6, pp 544–555
Wang JZ, Wan JG, Liu Z, Wang P (2010) Data mining of mass storage based on cloud computing. Grid cooperative computing (GCC) 10:426–431
Gopalakrishnan Nair TR, Lakshmi Madhuri K (2011) Data mining using hierarchical virtual k-means approach integrating data fragmenting data fragments in cloud computing environment. Cloud Comput Intell Syst (CCIS) 1:230–234
Pieter Noordhuis, Michiel Heijkoop, Alexander Lazovik (2010) Mining twitter in the cloud: a case study. Cloud Comput 1:107–114
Raymond Kosala, Hendrik Blockeel (2000) Web mining research: a survey. In: ACM SIGKDD, July
Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 08(6):866–883
Acknowledgment
Foundation: the National Natural Science Foundation of Shandong Province of China under Grant No.ZR2010FM040; Special Funding Project for Independent Innovation Achievements Transform of Shandong Province under Grant No.2009ZHZX1A0108, No.2010ZHZX1A1001.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media New York
About this paper
Cite this paper
Xue, L., Yuan, D., Jiang, M. (2014). Web Data Mining Based on Cloud Computing. In: Zhong, S. (eds) Proceedings of the 2012 International Conference on Cybernetics and Informatics. Lecture Notes in Electrical Engineering, vol 163. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3872-4_309
Download citation
DOI: https://doi.org/10.1007/978-1-4614-3872-4_309
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-3871-7
Online ISBN: 978-1-4614-3872-4
eBook Packages: EngineeringEngineering (R0)