Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-9pm4c Total loading time: 0 Render date: 2024-04-27T22:22:50.210Z Has data issue: false hasContentIssue false

17 - Application of Bayesian Sparse Factor Analysis Models in Bioinformatics

Published online by Cambridge University Press:  05 June 2013

Haisu Ma
Affiliation:
Yale University
Hongyu Zhao
Affiliation:
Yale University
Kim-Anh Do
Affiliation:
University of Texas, MD Anderson Cancer Center
Zhaohui Steve Qin
Affiliation:
Emory University, Atlanta
Marina Vannucci
Affiliation:
Rice University, Houston
Get access

Summary

Introduction

Factor analysis is a statistical method for explaining the variation of a large number of observed, correlated variables using a smaller number of unobserved, latent variables (factors). It is closely related to principal component analysis (PCA) and represents one of the most well-known latent variable models. Although PCA determines the components using standard geometrical procedures and is mostly exploratory, factor analysis can be either exploratory or confirmatory. Exploratory factor analysis assumes that any latent factor can be associated with any observed variable (described by the factor loadings) and is suitable for revealing the underlying structure of relatively large data sets. Confirmatory factor analysis is more subjective, where researchers usually have strong prior belief on the number of latent factors as well as the potential variables associated with each factor. Bayesian sparse factor modeling was first introduced by Mike West for the analysis of microarray gene expression profiles (West, 2003) and was later applied to a diverse range of bioinformatics studies including the inference of transcriptional regulatory networks (Sabatti et al., 2005), biological pathway analysis (Carvalho et al., 2008), population structure analysis (Engelhardt and Stephens, 2010), and other areas. In this chapter, we first introduce classical factor analysis models and Bayesian sparse factor analysis models and associated inferential methods. We then review several applications of this general approach in computational biology. This chapter ends with discussion on future developments.

Type
Chapter
Information
Advances in Statistical Bioinformatics
Models and Integrative Inference for High-Throughput Data
, pp. 350 - 365
Publisher: Cambridge University Press
Print publication year: 2013

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×