Unsupervised Machine Learning: Datasets Without Outcomes

Fabri, Peter J.

doi:10.1007/978-3-319-40812-5_8

Peter J. Fabri²

485 Accesses

Abstract

Often either a dataset doesn’t contain an obvious “outcome” or we wish to explore the entire data set to see if there is some natural “order” to the data. In such cases, unsupervised machine learning is appropriate. Once again, unsupervised means that there isn’t an outcome to compare the results of a model to. It is tempting to try to “force” a regression or classification model, but often it is quite enlightening to use unsupervised methods to better understand the dataset. If the data are nominal “marketbasket” lists of “transactions” (for example the set of laboratory tests ordered at one time for a particular patient on a particular day; or items purchased at the supermarket), the technique of association analysis is most appropriate. If the data are quantitative, with a metric of proximity available, a clustering technique can be used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Goldstein DB. Common genetic variation and human traits. N Engl J Med. 2009;360(17):1696.
Article CAS PubMed Google Scholar
Albinali F, Davies N, Friday A, editors. Structural learning of activities from sparse datasets. Fifth annual IEEE international conference on pervasive computing and communications 2007 PerCom’07. IEEE; 2007.
Google Scholar
Tan H. Knowledge discovery and data mining. Berlin: Springer; 2012. p. 3–9.
Google Scholar
Tan PNSM, Kumar V. Introduction to data mining. Boston: Pearson-Addison Wesley; 2006. p. 22–36.
Google Scholar
Hastie T, Friedman JH, Tibshirani R. The elements of statistical learning: data mining, inference, and prediction. 2, corrected 7 printing edth ed. New York: Springer; 2009.
Google Scholar

Download references

Author information

Authors and Affiliations

Colleges of Medicine and Engineering, University of South Florida, Tampa, FL, USA
Peter J. Fabri

Authors

Peter J. Fabri
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Fabri, P.J. (2016). Unsupervised Machine Learning: Datasets Without Outcomes. In: Measurement and Analysis in Transforming Healthcare Delivery. Springer, Cham. https://doi.org/10.1007/978-3-319-40812-5_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-40812-5_8
Published: 21 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40810-1
Online ISBN: 978-3-319-40812-5
eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics