Skip to main content

Web Data Mining Based on Cloud Computing

  • Conference paper
  • First Online:
  • 307 Accesses

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 163))

Abstract

It is increasing important to get accurate information from the web. In this paper, the authors used the virtualization technology which is the key in cloud computing to build up a web data mining cloud model. This model is shown as Fig. 309.1. It consists of Storage Cloud and Calculation Cloud. Finally, this paper described a specific instance of Web Date Mining combined with the application of Cloud Computing. This instance compared the proposed method with the traditional method with Figs. 309.4 and 309.5. In addition, the Table 309.2 shows the new model with appropriate nodes can reduce the consumed time definitely.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   429.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Liu B (2007) Web data mining. Springer, New York

    MATH  Google Scholar 

  2. Roth H, Schiefer J, Obweger H, Rozsnyai S (2010) Event data warehousing for complex event processing. In: Proceedings of the 4th international conference on research challenges in information science, IEEE Press, Washington, pp 22–27

    Google Scholar 

  3. Szabolcs Rozsnyai, Aleksander Slominski, Yurdaer Doganata (2011) Large-scale distributed storage system for business provenance. Cloud Comput 5(02):516–524

    Google Scholar 

  4. Wu KL, Yu PS, Ballman A (2010) A web usage mining and analysis tool. IBM Systems Journal 22(04):321–329

    Google Scholar 

  5. Rozsnyai S, Vecera R, Schiefer J, Schatten A (2007) Event cloud-searching for correlated business events. In: Proceedings of the 9th IEEE international conference on E-commerce technology and the 4th IEEE international conference on enterprise computing, ecommerce and e-services (CEC-EEE 2007). IEEE Computer Society, Washington, vol 9, pp 409–420

    Google Scholar 

  6. Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R (2010) Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium on cloud computing (SoCC’10). ACM, New York, vol 1, pp 143–154

    Google Scholar 

  7. Borthaku D The hadoop distributed file system: architecture and design [EB/OL]. (2011-01-20). http://hadoop.apache.org/common/docs/r0.18.0/hdfs_design.pdf

  8. Amazon, Inc Amazon simple store service (Amazon S3) [EB/OL]. (2011-01-20). http://www.amazon.com/S3

  9. Chang F, Dean J, Chema Wat S (2008) Big table: a distributed storage system for structured data. ACM Trans Comput Syst 26(2):1–26

    Article  MATH  Google Scholar 

  10. John Shafer, Rakesh Agrawal, Manish Mehta SPRINT (1996) A scalable parallel classifier for data mining. IBM Almaden Research Center, US, vol 6, pp 544–555

    Google Scholar 

  11. Wang JZ, Wan JG, Liu Z, Wang P (2010) Data mining of mass storage based on cloud computing. Grid cooperative computing (GCC) 10:426–431

    Google Scholar 

  12. Gopalakrishnan Nair TR, Lakshmi Madhuri K (2011) Data mining using hierarchical virtual k-means approach integrating data fragmenting data fragments in cloud computing environment. Cloud Comput Intell Syst (CCIS) 1:230–234

    Article  Google Scholar 

  13. Pieter Noordhuis, Michiel Heijkoop, Alexander Lazovik (2010) Mining twitter in the cloud: a case study. Cloud Comput 1:107–114

    Google Scholar 

  14. Raymond Kosala, Hendrik Blockeel (2000) Web mining research: a survey. In: ACM SIGKDD, July

    Google Scholar 

  15. Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 08(6):866–883

    Article  Google Scholar 

Download references

Acknowledgment

Foundation: the National Natural Science Foundation of Shandong Province of China under Grant No.ZR2010FM040; Special Funding Project for Independent Innovation Achievements Transform of Shandong Province under Grant No.2009ZHZX1A0108, No.2010ZHZX1A1001.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Liangfei Xue .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this paper

Cite this paper

Xue, L., Yuan, D., Jiang, M. (2014). Web Data Mining Based on Cloud Computing. In: Zhong, S. (eds) Proceedings of the 2012 International Conference on Cybernetics and Informatics. Lecture Notes in Electrical Engineering, vol 163. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3872-4_309

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-3872-4_309

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-3871-7

  • Online ISBN: 978-1-4614-3872-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics