Skip to main content
Log in

Principal components analysis of protein structure ensembles calculated using NMR data

  • Published:
Journal of Biomolecular NMR Aims and scope Submit manuscript

Abstract

One important problem when calculating structures of biomolecules from NMR data is distinguishing converged structures from outlier structures. This paper describes how Principal Components Analysis (PCA) has the potential to classify calculated structures automatically, according to correlated structural variation across the population. PCA analysis has the additional advantage that it highlights regions of proteins which are varying across the population. To apply PCA, protein structures have to be reduced in complexity and this paper describes two different representations of protein structures which achieve this. The calculated structures of a 28 amino acid peptide are used to demonstrate the methods. The two different representations of protein structure are shown to give equivalent results, and correct results are obtained even though the ensemble of structures used as an example contains two different protein conformations. The PCA analysis also correctly identifies the structural differences between the two conformations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Abseher, R., Horstink, L., Hilbers, C.W. and Nilges, M. (1998) Proteins Struct. Funct. Genet., 31, 370-382.

    Google Scholar 

  • Amadei, A., Linssen, A.B.M. and Berendsen, H.J.C. (1993) Proteins Struct. Funct. Genet., 17, 412-425.

    Google Scholar 

  • Brünger, A. (1992) X-PLOR Version 3.1. A System for X-ray Crystallography and NMR, Yale University Press, Boston, MA.

    Google Scholar 

  • Brünger, A., Clore, G.M., Gronenborn, A.M., Saffrich, R. and Nilges, M. (1993) Science, 261, 328-331.

    Google Scholar 

  • Chau, P.-L., van Aalten, D.M.F., Bywater, R.P. and Findlay, J.B.C. (1999) J. Comput.-Aided Mol. Design, 13, 11-20.

    Google Scholar 

  • Doreleijers, J.F., Raves, M.L., Rullman, T. and Kaptein, R. (1999) J. Biomol. NMR, 14, 123-132.

    Google Scholar 

  • Egan, W.J. and Morgan, S.L. (1998) Anal. Chem., 70, 2372-2379.

    Google Scholar 

  • Eriksson, L., Johansson, E., Kettaneh-Wold, N. and Wold, S. (1999) Introduction to Multi-and Megavariate Data Analysis using Projection Methods, UMETRICS AB, Umeå , Sweden.

    Google Scholar 

  • Kundrot, C.E. (1996) J. Am. Chem. Soc., 118, 8725-8726.

    Google Scholar 

  • Manly, B. (1986) Multivariate Statistics-A Primer, Chapman & Hall.

  • Mello, V.C., van Aalten, D.M.F. and Findlay, J.B.C. (1998) Biochemistry, 37, 3137-3142.

    Google Scholar 

  • Neuhaus, D. and Williamson, M.P. (2000) The Nuclear Overhauser Effect in Stereochemical and Conformational Analysis, 2nd ed., Wiley, New York, NY.

    Google Scholar 

  • O'Donoghue, S.I., Chang, X., Abseher, R., Nilges, M. and Led, J.J. (2000) J. Biomol. NMR, 16, 93-108.

    Google Scholar 

  • Phillips, D.C. (1970) Biochem. Soc. Symp., 30, 11-28.

    Google Scholar 

  • Schwabe, J.W.R., Chapman, L., Finch, J.T., Rhodes, D. and Neuhaus, D. (1993) Structure, 1, 187-204.

    Google Scholar 

  • van Aalten, D.M.F., Grotewold, E. and Joshua-Tor, L. (1998) Methods, 14, 318-328.

    Google Scholar 

  • Widmer, H., Widmer, A. and Braun, W. (1993) J. Biomol. NMR, 3, 307-324.

    Google Scholar 

  • Wüthrich, K. (1986) NMR of Proteins and Nucleic Acids, Wiley, New York, NY.

    Google Scholar 

Download references

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Howe, P.W. Principal components analysis of protein structure ensembles calculated using NMR data. J Biomol NMR 20, 61–70 (2001). https://doi.org/10.1023/A:1011210009067

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1011210009067

Navigation