Abstract
Often either a dataset doesn’t contain an obvious “outcome” or we wish to explore the entire data set to see if there is some natural “order” to the data. In such cases, unsupervised machine learning is appropriate. Once again, unsupervised means that there isn’t an outcome to compare the results of a model to. It is tempting to try to “force” a regression or classification model, but often it is quite enlightening to use unsupervised methods to better understand the dataset. If the data are nominal “marketbasket” lists of “transactions” (for example the set of laboratory tests ordered at one time for a particular patient on a particular day; or items purchased at the supermarket), the technique of association analysis is most appropriate. If the data are quantitative, with a metric of proximity available, a clustering technique can be used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goldstein DB. Common genetic variation and human traits. N Engl J Med. 2009;360(17):1696.
Albinali F, Davies N, Friday A, editors. Structural learning of activities from sparse datasets. Fifth annual IEEE international conference on pervasive computing and communications 2007 PerCom’07. IEEE; 2007.
Tan H. Knowledge discovery and data mining. Berlin: Springer; 2012. p. 3–9.
Tan PNSM, Kumar V. Introduction to data mining. Boston: Pearson-Addison Wesley; 2006. p. 22–36.
Hastie T, Friedman JH, Tibshirani R. The elements of statistical learning: data mining, inference, and prediction. 2, corrected 7 printing edth ed. New York: Springer; 2009.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Fabri, P.J. (2016). Unsupervised Machine Learning: Datasets Without Outcomes. In: Measurement and Analysis in Transforming Healthcare Delivery. Springer, Cham. https://doi.org/10.1007/978-3-319-40812-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-40812-5_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40810-1
Online ISBN: 978-3-319-40812-5
eBook Packages: MedicineMedicine (R0)