Abstract
This chapter introduces the standard formulation for the data input to data mining algorithms that will be assumed throughout this book. It goes on to distinguish between different types of variable and to consider issues relating to the preparation of data prior to use, particularly the presence of missing data values and noise. The UCI Repository of datasets is introduced.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Reference
Dua, D., & Graff, C. (2019). UCI Machine Learning Repository. Irvine: University of California, School of Information and Computer Science. https://archive.ics.uci.edu/ml/.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer-Verlag London Ltd., part of Springer Nature
About this chapter
Cite this chapter
Bramer, M. (2020). Data for Data Mining. In: Principles of Data Mining. Undergraduate Topics in Computer Science. Springer, London. https://doi.org/10.1007/978-1-4471-7493-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4471-7493-6_2
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-7492-9
Online ISBN: 978-1-4471-7493-6
eBook Packages: Computer ScienceComputer Science (R0)