Abstract
This chapter describes data mining and its methodology. It first defines data mining and explains the purposes of, and growing need for, the technology. It then presents and discusses a six-step data-mining methodology, explains the goals and methods of the process, and introduces several techniques that make data mining faster and more reliable. These techniques include neural networks and genetic algorithms, which are presented as ways to overcome several complexity problems inherent in the data-mining process. A survey of the literature shows the purposes these techniques have served and the results they have achieved in the study of data mining.
Copyright information
© 2008 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Kamrani, A.K., Gonzalez, R. (2008). Data-Mining Process Overview. In: Kamrani, A.K., Nasr, E.S.A. (eds) Collaborative Engineering. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-47321-5_5
DOI: https://doi.org/10.1007/978-0-387-47321-5_5
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-47319-2
Online ISBN: 978-0-387-47321-5
eBook Packages: Engineering, Engineering (R0)