Integrative subcellular proteomic analysis allows accurate prediction of human disease-causing genes

  1. Rui Chen1,2,3
  1. 1Structural and Computational Biology and Molecular Biophysics Graduate Program, Baylor College of Medicine, Houston, Texas 77030, USA;
  2. 2Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA;
  3. 3Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
  4. 4Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA
  1. Corresponding author: ruichen{at}bcm.edu
  1. 5 These authors contributed equally to this work.

Abstract

Proteomic profiling on subcellular fractions provides invaluable information regarding both protein abundance and subcellular localization. When integrated with other data sets, it can greatly enhance our ability to predict gene function genome-wide. In this study, we performed a comprehensive proteomic analysis on the light-sensing compartment of photoreceptors called the outer segment (OS). By comparing with the protein profile obtained from the retina tissue depleted of OS, an enrichment score for each protein is calculated to quantify protein subcellular localization, and 84% accuracy is achieved compared with experimental data. By integrating the protein OS enrichment score, the protein abundance, and the retina transcriptome, the probability of a gene playing an essential function in photoreceptor cells is derived with high specificity and sensitivity. As a result, a list of genes that will likely result in human retinal disease when mutated was identified and validated by previous literature and/or animal model studies. Therefore, this new methodology demonstrates the synergy of combining subcellular fractionation proteomics with other omics data sets and is generally applicable to other tissues and diseases.

Footnotes

  • Received September 13, 2015.
  • Accepted February 19, 2016.

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

| Table of Contents

Preprint Server