Abstract
This chapter discusses the design issues in clinical proteomics study and provides specific suggestions for addressing these questions when using the standard guidelines for the planning. It provides two methods for the sample size estimation in study design. The first method is used for the planning of a clinical proteomic study at the discovery or verification stage; the second method is proposed for the systematic planning of a multistage study. The second part of the chapter introduces three approaches to analyzing the clinical proteomic study and provides analyses for two case studies of clinical proteomic discoveries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chapman JR (1996) Protein and peptide analysis by mass spectrometry. Humana Press Inc., Totowa, NJ
Palmblad M, Tiss A, Cramer R (2009) Mass spectrometry in clinical proteomics—from the present to the future. Proteomics Clin Appl 3:6–17
Anderson L (2005) Candidate-based proteomics in the search for biomarkers of cardiovascular disease. J Physiol 563:23–60
Shadforth IP, Dunkley TP, Lilley KS et al (2005) i-Tracker: for quantitative proteomics using iTRAQ. BMC Genomics 6:145
Corthals GL, Rose K (2007) Quantitation in proteomics, in proteome research: concepts, technology and application. Springer, Berlin
Mertins P, Yang F, Liu T et al (2014) Ischemia in tumors induces early and sustained phosphorylation changes in stress kinase pathways but does not affect global protein levels. Mol Cell Proteomics 13:1690–1704
Galloa V, Eggerc M, McCormack V et al (2011) STrengthening the Reporting of OBservational studies in Epidemiology-Molecular Epidemiology STROBE-ME: an extension of the STROBE statement. J Clin Epidemiol 64:1350–1363
Kleinbaum DG (2003) ActivEpi. Springer, New York
Lagiou P, Adami H-O, Trichopoulos D (2005) Causality in cancer epidemiology. Eur J Epidemiol 20:565–574
Katan M (1986) Apolipoprotein E isoforms, serum cholesterol, and cancer. Lancet 1:507–508
Qi L (2009) Mendelian randomization in nutritional epidemiology. Nutr Rev 67:439–450
Verhoeven KJF, Simonsen KL, McIntyre LM (2005) Implementing false discovery rate control: increasing your power. Oikos 108:643–647
Benjamini Y, Yekutieli D (2001) The control of the false discovery rate in multiple testing under dependency. Ann Stat 29:1165–1188
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc 57:289–300
Zeng ISL, Lumley T, Ruggiero K et al (2013) Two optimization strategies of multi-stage design in clinical proteomic studies. Stat Appl Genet Mol Biol 12:263–283
von Elm E, Altman D, Egger M et al (2007) The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. PLoS Med 4, e297
Tzoulaki I, Ebbels TMD, Valdes A et al (2014) Design and analysis of metabolomics studies in epidemiologic research: a primer on -omic technologies. Am J Epidemiol 180:129–139
McShane LM, Altman DG, Sauerbrei W et al (2005) REporting recommendations for tumor MARKer prognostic studies (REMARK). Nat Clin Pract Oncol 2
Kuller LH, Bracken MB, Ogino S et al (2013) The role of epidemiology in the era of molecular epidemiology and genomics: summary of the 2013 AJE-sponsored Society of Epidemiologic Research Symposium. Am J Epidemiol 178:1350–1354
Malats N, Castaño-Vinyals G (2007) Cancer epidemiology: study designs and data analysis. Clin Transl Oncol 9:290–297
Zeng ISL, Browning S, Gladding P et al (2009) A multi-feature reproducibility assessment of mass spectral data in clinical proteomic studies. Clin Proteomics 5:170–177
Bossuyt PM, Reitsma JB, Bruns DE et al (2003) Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. BMJ 326:41–44
Bonassi S, Au WW (2002) Biomarkers in molecular epidemiology studies for health risk prediction. Mutat Res 511:73–86
Thomas D, Conti D (2004) The concept of ‘Mendelian Randomization’. Int J Epidemiol 33:21–25
Lawlor DA, Harbord RM, Sterne JA et al (2008) Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med 27:1133–1163
Applied Biosystems/MDS SCIEX, ProteinPilotâ„¢ Software (2007) Getting started guide
Breitwieser FP, Muller A, Dayon L et al (2001) General Statistical Modeling of Data from Protein Relative Expression Isobaric Tags. J Proteome Res 10:2758–2766
Oberg AL, Mahoney DW (2012) Statistical methods for quantitative mass spectrometry proteomic experiments with labeling. BMC Bioinformatics 13:S7
Zeng ISL (2014) Doctorate thesis: Statistical methods in clinical proteomic studies 2007-2014—a protein concerto. In: Statistics2014, University of Auckland: ResearchSpace@Auckland. p 218
Goldstain H (1999) Multivariate multilevel model. In: Multilevel statistics models 1999. London, Institute of Education, p 5–6
Li X (2005) PROcess: Ciphergen SELDI-TOF processing. R package
Boehm AM, Putz S, Altenhofer D et al (2007) Precise protein quantification based on peptide quantification using iTRAQâ„¢. BMC Bioinformatics 8:214
Clinical proteomic program data portal: OvarianDataset4-3-02.zip. A.2.1. http://home.ccr.cancer.gov/ncifdaproteomics/ppatterns.asp
Hochberg Y (1988) A sharper Bonferroni procedure for multiple tests of significance. Biometrika 75:800–803
CPTAC, TCGA Cancer proteome study of colorectal tissue. https://cptac-data-portal.georgetown.edu/cptac/s/S016
Zhang B, Wang J, Wang X et al (2013) Proteogenomic characterization of human colon and rectal cancer. Nature 513:382–387
Acknowledgements
The colorectal proteomic data used in this publication were generated by the Clinical Proteomic Tumor Analysis Consortium (NCI/NIH). The mass spectral intensity data of ovarian study are provided by the proteomic databank of Centre for Cancer Research (http://home.ccr.canccr.gov/ncifdaprotcomics/ppatterns.asp). Some codes of producing heat map are modified from the blog: http://learnr.wordpress.com/2010/01/26/ggplot2-quick-heatmap-plotting for NBA game. The author’s first learning of Mendelian randomization method in 2010 is attributed to Professor Thomas Lumley.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this protocol
Cite this protocol
Zeng, I.S.L. (2016). Topics in Study Design and Analysis for Multistage Clinical Proteomics Studies. In: Jung, K. (eds) Statistical Analysis in Proteomics. Methods in Molecular Biology, vol 1362. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-3106-4_2
Download citation
DOI: https://doi.org/10.1007/978-1-4939-3106-4_2
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-3105-7
Online ISBN: 978-1-4939-3106-4
eBook Packages: Springer Protocols