Ultra-sensitive, high throughput and quantitative proteomics measurements
Introduction
With recent advances in genome sequencing, the biological research paradigm is rapidly transitioning towards incorporating an understanding of biology that benefits from a global “systems” perspective. As a result, biology is evolving from a largely qualitative descriptive science to a quantitative and ultimately predictive science in which the ability to collect and productively use large amounts of biological data is crucial. Developing a systems-level understanding of how an organism functions benefits greatly from global measurements of proteins because of their primary role in nearly all cellular processes. The interest in proteome-wide measurements ranges e.g. from the analysis of human plasma for the identification of diagnostic biomarkers, to the characterization of biochemical pathways of microorganisms important for environmental bioremediation or the understanding of human host-microbial pathogen interactions.
Proteomics measurements that yield insight into biochemical processes can lead us to new global predictive computational models that provide a more solid basis for understanding environmental and human health. However, to successfully reach this stage of predictive modeling, major advances in the ability to measure these highly complex systems are still required. It is expected that numerous proteomics measurements (e.g., time course, comparative disease states, a range of environmental perturbations, etc.) will be needed to provide sufficient data for extracting understandings of even the simplest of biological systems that can involve many thousands of different gene products. mRNA expression studies using microarrays have similarly shown that hundreds or thousands of measurements are often essential to support even relatively modest scientific objectives. In addition to throughput, data quality is highly important since the more quantitative and reproducible the measurements, the fewer measurements are needed to achieve a given objective.
At present, most global proteomics measurements are qualitative in nature, providing little more than “parts lists” of proteins with uncertain quality and limited information on co- and post-translational modifications. While these more qualitative proteome measurements can be useful, they generally have significant limitations. First, the likelihood that both “false positive” and “false negative” identifications will result from these measurements is substantial; their levels of confidence are often ill defined, and provided only in qualitative terms. These uncertainties are greatest for lower abundance proteins, where measurement quality (e.g., the signal to noise ratio) is lower or where related factors (e.g., the identification of only a single tryptic peptide for a protein) degrade confidence in the identification, but again in a poorly defined manner. Second, because protein detection depends significantly on the sensitivity (and other details) of a specific measurement, relatively small run-to-run variations in detection limits and other aspects of sample handling and instrument performance can result in significant changes in the proteins detected. Important biological processes potentially associated with changes in protein abundances may be obscured by measurement noise, and extensive sets of replicates may be needed to achieve acceptable levels of confidence in such cases.
Section snippets
Proteome analysis technologies
Proteome measurement capabilities with the desired comprehensive quality and quantity clearly require further advances in measurement throughput and data quality. The most mature proteome analysis technology is based on separations using two-dimensional polyacrylamide gel electrophoresis (2D PAGE), in conjunction with protein identification using mass spectrometric (MS) analysis and available protein or genome sequence data [1]. However, proteome coverage with 2D PAGE is problematic for
Accurate mass and time tag approach
To overcome the “too many peaks, too little time” bottleneck we developed a strategy that increases throughput by avoiding routine MS/MS measurements. The technical foundation for this strategy involves advanced separations combined with very accurate mass spectrometric measurements; in particular, ultra-high pressure capillary LC combined with Fourier transform ion cyclotron resonance (FTICR) mass spectrometry, and a supporting data analysis and management infrastructure.
The use of FTICR
Proteomic data analysis pipeline
Proteomic analysis of biological samples—identifying peptides and proteins, and quantifying their abundances—using MS technology generally produces large volumes of data. An analysis often provides thousands of separate MS or MS/MS spectra during the LC separation steps. Data analysis tools are used for performing database searches to identify peptides (e.g. from MS/MS datasets or AMT tags), interpret and extract detected masses from MS datasets, and assign peptide identifications to MS
Quantitation strategies
Proteome measurements often involve comparing protein abundances between two cellular populations that differ as a result of some change or perturbation. For comparative studies that employ stable-isotope labeling, the AMT tag strategy can increase throughput and precision by directly comparing two proteomes in the same analysis, such as comparing perturbed systems to a common “reference proteome”. A stable-isotope labeled (e.g., 15N or 18O labeled) reference proteome provides an effective
MS peak intensity based quantitation
Comparative measurements based on isotopic labeling generally require that both versions of the peptide (labeled and unlabeled) be detected, but often large changes in relative protein abundances between two labeled samples result in detection of only one of the peptides (e.g., when there are large abundance changes or low signal to noise levels for the measurements). Approaches based upon the use of peak intensities are attractive for this purpose, but peptide abundance measurements obtained
Electrospray ionization efficiency
The conditions under which ionization suppression occur are relatively well understood and are related to both analyte concentration and ESI volumetric flow rates [62], [63], [64]. Large, conventional flow rates result in greater compound-to-compound variation due to the increased analyte competition for charge and the proximity to the electrospray droplet surface. Converting the electrospray ionization from a conventional flow rate to a nanoflow regime (see Fig. 12 [62]) produces smaller
Sensitivity and dynamic range for quantitative measurements
The use of smaller i.d. capillary columns considerably improves sensitivity, while also improving the practical dynamic range of measurements when the absolute sample size is constrained. To examine both the sensitivity and the range of relative protein abundances measurable for complex proteomic samples, we examined a tryptic digest of a mixture containing a 106:1 difference in protein abundances for two standards (75 femtomoles cytochrome c, and 75 zeptomoles bovine serum albumin) and 5 ng of
Future directions
The field of proteomics continues to advance, increasingly driven by enhanced MS instrumentation, computational technologies, and quantitative methodologies. As capabilities continue to mature, they will push the limits of dynamic range detection, efficiency, and quantitation, while providing faster analyses with more reproducibility, and result in a parallel increase in data production. Improvements in the data pipeline, e.g., improved storage, processing, and analysis techniques, will be
Acknowledgments
We thank the U.S. Department of Energy (DOE) Office of Biological and Environmental Research and the NIH National Center for Research Resources (Grant RR018522) for supporting portions of this research. We also thank the Environmental Molecular Sciences Laboratory, a national scientific user facility sponsored by DOE and located at Pacific Northwest National Laboratory (PNNL) for use of instrumentation. PNNL is operated by Battelle for the DOE under Contract No. DE-AC06-76RLO 1830.
References (86)
- et al.
J. Curr. Biol.
(1993) - et al.
Biochem. Biophys. Res. Commun.
(1993) - et al.
Anal. Biochem.
(1993) - et al.
Anal. Biochem.
(1986) - et al.
J. Am. Soc. Mass Spectrom.
(2002) - et al.
J. Am. Soc. Mass Spectrom.
(1998) - et al.
J. Am. Soc. Mass Spectrom.
(1994) - et al.
J. Amer. Soc. Mass Spectrom.
(2000) - et al.
Biotechnology
(2000) - et al.
Proteome Res.
(2003)
J. Am. Soc. Mass Spectrom.
J. Am. Soc. Mass. Spectrom.
Int. J. Mass Spectrom. Ion Process.
Am. Soc. Mass Spectrom.
J. Am. Soc. Mass Spectrom.
Mol. Cell. Proteomics
Proc. Natl. Acad. Sci. U.S.A.
Anal. Chem.
Nature
Proc. Natl. Acad. of Sci. U.S.A.
Biol. Mass Spectrom.
Biol. Mass Spectrom.
Anal. Chem.
Rapid Commun. Mass Spectrom.
Science
Anal. Chem.
Anal. Chem.
J. Liq. Chromatogr.
Anal. Chem.
Anal. Chem.
Protein Sci.
Electrophoresis
J. Mass Spectrom.
Nat. Biotechnol.
BioTechniques
Anal. Chem.
Nat. Biotechnol.
Rapid Commun. Mass Spectrom.
Anal. Chem.
Nat. Biotechnol.
R. Nat. Biotechnol.
Anal. Chem.
Cited by (15)
Toward design-based engineering of industrial microbes
2010, Current Opinion in MicrobiologyCitation Excerpt :Nowadays, methods combining mass spectrometry and liquid chromatography, either gel-based or nongel-based protein separation, are used to improve complex proteome identification. Since proteomics methods were developed in the middle of 1990s, interactomics [34••,35] and phosphoproteomics [36] have also become interesting for many research applications, characterizing the state of a protein. Now, various methods are developed for proteomics such as MALDI-TOF for peptide mass finger printing and electrospray ionization (ESI), Fourier transform ion cyclotron resonance (FT-ICR) coupling with tandem mass spectrometry (MS/MS) for peptide identification.
Advances and applications of de novo designed affinity ligands in proteomics
2006, Biotechnology AdvancesThe utility of accurate mass and LC elution time information in the analysis of complex proteomes
2005, Journal of the American Society for Mass SpectrometryManaging genomic and proteomic knowledge
2005, Drug Discovery Today: TechnologiesPlatelet proteome changes associated with diabetes and during platelet storage for transfusion
2009, Journal of Proteome Research