Skip to main content

Data-Mining Process Overview

  • Chapter
Collaborative Engineering

Abstract

This chapter gives a description of data mining and its methodology. First, the definition of data mining along with the purposes and growing needs for such a technology are presented. A six-step methodology for data mining is then presented and discussed. The goals and methods of this process are then explained, coupled with a presentation of a number of techniques that are making the data-mining process faster and more reliable. These techniques include the use of neural networks and genetic algorithms, which are presented and explained as a way to overcome several complexity problems that the data-mining process possesses. A deep survey of the literature is done to show the various purposes and achievements that these techniques have brought to the study of data mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Adriaans, P. and D. Zantinge, Data Mining, Harlow: Addison-Wesley, 1996.

    Google Scholar 

  2. Andrade, M. and P. Bork, “Automated extraction of information in molecular biology,” FEBS Letters, 476: 12–17, 2000

    Article  Google Scholar 

  3. Delesie, L. and L. Croes, “Operations research and knowledge discovery: A data mining method applied to health care management,” International Transactions in Operational Research, 7: 159–170, 2000

    Article  Google Scholar 

  4. Fayyad, U., D. Madigan, G. Piatetsky-Shapiro, and P. Smyth, “From data mining to knowledge discovery in databases,” AI Magazine, 17: 37–54, 1996

    Google Scholar 

  5. Fayyad, U. and P. Stolorz, “Data mining and KDD: Promise and challenges,” Future Generation Computer Systems, 13: 99–115, 1997

    Article  Google Scholar 

  6. Feelders, A., H. Daniels, and M. Holsheimer, “Methodological and practical aspects of data mining,” Information & Management, 37: 271–281, 2000

    Article  Google Scholar 

  7. Fu, Z., “Dimensionality optimization by heuristic greedy learning vs. genetic algorithms in knowledge discovery and data mining,” Intelligent Data Analysis, 3: 211–225, 1999

    Article  Google Scholar 

  8. Glymour, C., D. Madigan, D. Pregibon, and P. Smyth, “Statistical themes and lessons for data mining,” Data Mining and Knowledge Discovery, 1: 11–28, 1997

    Article  Google Scholar 

  9. Hand, D.J., “Data mining: Statistics and more?” The American Statistician, 52: 112–118, 1998.

    Article  Google Scholar 

  10. Heckerman, D., Bayesian Networks for Knowledge Discovery, Advances in Knowledge Discovery and Data Mining, Menlo Park, CA: AAAI Press, pp. 273–305, 1996

    Google Scholar 

  11. Holmes, J.H., D.R. Durbin, and F.K. Winston, “The learning classifier system: An evolutionary computation approach to knowledge discovery in epidemiologic surveillance,” Artificial Intelligence in Medicine, 19: 53–74, 2000

    Article  Google Scholar 

  12. Koonce, D.A., C. Fang, and S. Tsai, “A data mining tool for learning from manufacturing systems,” Computers & Industrial Engineering, 33: 27–30, 1997

    Article  Google Scholar 

  13. Kusiak, A., Computational Intelligence in Design and Manufacturing. Wiley-Interscience Publications , pp. 498–526, 1999

    Google Scholar 

  14. Lin, X., X. Zhou, and C. Liu, “Efficient computation of a proximity matching in spatial databases,” Data & Knowledge Engineering, 33: 85–102, 2000

    Article  MATH  Google Scholar 

  15. Michalski, R.S., “Learnable evolution model: Evolutionary processes guided by machine learning,” Machine Learning, 38: 9–40, 2000

    Article  MATH  Google Scholar 

  16. Russell, S., P. Norvig, Artificial Intelligence: A Modern Approach. New Jersey: Prentice Hall, 1995.

    MATH  Google Scholar 

  17. Scott, P.D. and E. Wilkins, “Evaluating data mining procedures: Techniques for generation artificial data sets,” Information and Software Technology, 41: 579–587, 1999

    Article  Google Scholar 

  18. Skarmeta, A., A. Bensaid, and N. Tazi, “Data mining for text categorization with semi-supervised agglomerative hierarchical clustering,” International Journal of Intelligent Systems, 15: 633–646, 2000

    Article  MATH  Google Scholar 

  19. Subramanian, A., L.D. Smith, A.C. Nelson, J.F. Campbell, and D.A. Bird, “Strategic planning for data warehousing,” Information and Management, 33: 99–113, 1997

    Article  Google Scholar 

  20. Yevich, R., “Data Mining,” in Data Warehouse: Practical Advice from the Experts, Prentice Hall , pp. 309–321, 1997

    Google Scholar 

  21. Yuanhui, Z., L. Yuchang, and S. Chunyi, “Mining classification rules in multi-strategy learning approach,” Intelligent Data Analysis, 2: 165–185, 1998

    Article  Google Scholar 

  22. Vila, M.A., J.C. Cubero, J.M. Medina, and O. Pons, “Soft computing: A new perspective for some data mining problems,” Vistas in Astronomy, 41: 379–386, 1997.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Kamrani, A.K., Gonzalez, R. (2008). Data-Mining Process Overview. In: Kamrani, A.K., Nasr, E.S.A. (eds) Collaborative Engineering. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-47321-5_5

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-47321-5_5

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-47319-2

  • Online ISBN: 978-0-387-47321-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics