Skip to main content

Unsupervised Machine Learning: Datasets Without Outcomes

  • Chapter
  • First Online:
Measurement and Analysis in Transforming Healthcare Delivery
  • 485 Accesses

Abstract

Often either a dataset doesn’t contain an obvious “outcome” or we wish to explore the entire data set to see if there is some natural “order” to the data. In such cases, unsupervised machine learning is appropriate. Once again, unsupervised means that there isn’t an outcome to compare the results of a model to. It is tempting to try to “force” a regression or classification model, but often it is quite enlightening to use unsupervised methods to better understand the dataset. If the data are nominal “marketbasket” lists of “transactions” (for example the set of laboratory tests ordered at one time for a particular patient on a particular day; or items purchased at the supermarket), the technique of association analysis is most appropriate. If the data are quantitative, with a metric of proximity available, a clustering technique can be used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Goldstein DB. Common genetic variation and human traits. N Engl J Med. 2009;360(17):1696.

    Article  CAS  PubMed  Google Scholar 

  2. Albinali F, Davies N, Friday A, editors. Structural learning of activities from sparse datasets. Fifth annual IEEE international conference on pervasive computing and communications 2007 PerCom’07. IEEE; 2007.

    Google Scholar 

  3. Tan H. Knowledge discovery and data mining. Berlin: Springer; 2012. p. 3–9.

    Google Scholar 

  4. Tan PNSM, Kumar V. Introduction to data mining. Boston: Pearson-Addison Wesley; 2006. p. 22–36.

    Google Scholar 

  5. Hastie T, Friedman JH, Tibshirani R. The elements of statistical learning: data mining, inference, and prediction. 2, corrected 7 printing edth ed. New York: Springer; 2009.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Fabri, P.J. (2016). Unsupervised Machine Learning: Datasets Without Outcomes. In: Measurement and Analysis in Transforming Healthcare Delivery. Springer, Cham. https://doi.org/10.1007/978-3-319-40812-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-40812-5_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40810-1

  • Online ISBN: 978-3-319-40812-5

  • eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics