Summary
In this chapter, we will focus on the role of bioinformatics to analyze a protein after its protein structure has been determined. First, we present how to validate protein structures for quality assurance. Then, we discuss how to analyze protein–protein interfaces and how to predict the biomolecule which is the biological oligomeric state of the protein. Finally, we discuss how to search for homologs based on the 3-D structure which is an essential step for understanding protein function.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The Quality Control Check program was developed at the Joint Center for Structural Genomics (JCSG). If you use this tool in preparing a structure please acknowledge the JCSG.
References
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N., and Bourne, P. E. (2000) The Protein Data Bank, Nucleic Acids Res 28, 235–242.
Burley, S. K., Joachimiak, A., Montelione, G. T., and Wilson, I. A. (2008) Contributions to the NIH-NIGMS Protein Structure Initiative from the PSI Production Centers, Structure 16, 5–11.
Murshudov, G. N., Vagin, A. A., and Dodson, E. J. (1997) Refinement of macromolecular structures by the maximum-likelihood method, Acta Crystallogr Sect D Biol Crystallogr 53, 240–255.
The CCP4 suite: programs for protein crystallography. (1994) Acta Crystallogr Sect D Biol Crystallogr 50, 760–763.
Davis, I. W., Leaver-Fay, A., Chen, V. B., Block, J. N., Kapral, G. J., Wang, X., Murray, L. W., Arendall, W. B., 3rd, Snoeyink, J., Richardson, J. S., and Richardson, D. C. (2007) MolProbity: all-atom contacts and structure validation for proteins and nucleic acids, Nucleic Acids Res 35, W375–W383.
Tickle, I. J., Laskowski, R. A., and Moss, D. S. (1998) Rfree and the Rfree ratio. I. Derivation of expected values of cross-validation residuals used in macromolecular least-squares refinement, Acta Crystallogr Sect D Biol Crystallogr 54, 547–557.
Tickle, I. J., Laskowski, R. A., and Moss, D. S. (2000) Rfree and the Rfree ratio. II. Calculation of the expected values and variances of cross-validation statistics in macromolecular least-squares refinement, Acta Crystallogr Sect D Biol Crystallogr 56, 442–450.
Kleywegt, G. J. (1997) Validation of protein models from Calpha coordinates alone, J Mol Biol 273, 371–376.
Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T. J., Higgins, D. G., and Thompson, J. D. (2003) Multiple sequence alignment with the Clustal series of programs, Nucleic Acids Res 31, 3497–3500.
Krissinel, E., and Henrick, K. (2004) Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions, Acta Crystallogr Sect D Biol Crystallogr 60, 2256–2268.
Emsley, P., and Cowtan, K. (2004) Coot: model-building tools for molecular graphics, Acta Crystallogr Sect D Biol Crystallogr 60, 2126–2132.
Feng, Z., Westbrook, J., and Berman, H. M. (1998) NUCheck, NDB-407, Rutgers University, New Brunswick, NJ.
Laskowski, R. A., MacArthur, M. W., Moss, D. S., and Thornton, J. M. (1993) PROCHECK: a program to check the stereochemical quality of protein structures, J Appl Crystallogr 26, 283–291.
Vaguine, A. A., Richelle, J., and Wodak, S. J. (1999) SFCHECK: a unified set of procedures for evaluating the quality of macromolecular structure-factor data and their agreement with the atomic model, Acta Crystallogr Sect D Biol Crystallogr 55, 191–205.
Terwilliger, T. C. (2000) Maximum-likelihood density modification, Acta Crystallogr Sect D Biol Crystallogr 56, 965–972.
Krissinel, E., and Henrick, K. (2007) Inference of macromolecular assemblies from crystalline state, J Mol Biol 372, 774–797.
Henrick, K., and Thornton, J. M. (1998) PQS: a protein quaternary structure file server, Trends Biochem Sci 23, 358–361.
Ponstingl, H., Kabir, T., and Thornton, J. M. (2003) Automatic inference of protein quaternary structure from crystals, J Appl Crystallogr 36, 1116–1122.
Camacho, C. J., and Zhang, C. (2005) FastContact: rapid estimate of contact and binding free energies, Bioinformatics 21, 2534–2536.
Negi, S. S., Schein, C. H., Oezguen, N., Power, T. D., and Braun, W. (2007) InterProSurf: a web server for predicting interacting sites on protein surfaces, Bioinformatics 23, 3397–3399.
Tina, K. G., Bhadra, R., and Srinivasan, N. (2007) PIC: Protein Interactions Calculator, Nucleic Acids Res 35, W473–W476.
Jones, S., and Thornton, J. M. (1996) Principles of protein-protein interactions, Proc Natl Acad Sci USA 93, 13–20.
Saha, R. P., Bahadur, R. P., Pal, A., Mandal, S., and Chakrabarti, P. (2006) ProFace: a server for the analysis of the physicochemical features of protein-protein interfaces, BMC Struct Biol 6, 11.
Porollo, A., and Meller, J. (2007) Prediction-based fingerprints of protein-protein interactions, Proteins 66, 630–645.
Han, G. W., Schwarzenbacher, R., Page, R., Jaroszewski, L., Abdubek, P., Ambing, E., Biorac, T., Canaves, J. M., Chiu, H. J., Dai, X., Deacon, A. M., DiDonato, M., Elsliger, M. A., Godzik, A., Grittini, C., Grzechnik, S. K., Hale, J., Hampton, E., Haugen, J., Hornsby, M., Klock, H. E., Koesema, E., Kreusch, A., Kuhn, P., Lesley, S. A., Levin, I., McMullan, D., McPhillips, T. M., Miller, M. D., Morse, A., Moy, K., Nigoghossian, E., Ouyang, J., Paulsen, J., Quijano, K., Reyes, R., Sims, E., Spraggon, G., Stevens, R. C., van den Bedem, H., Velasquez, J., Vincent, J., von Delft, F., Wang, X., West, B., White, A., Wolf, G., Xu, Q., Zagnitko, O., Hodgson, K. O., Wooley, J., and Wilson, I. A. (2005) Crystal structure of an alanine-glyoxylate aminotransferase from Anabaena sp. at 1.70 Å resolution reveals a noncovalently linked PLP cofactor, Proteins 58, 971–975.
Han, G. W., Schwarzenbacher, R., McMullan, D., Abdubek, P., Ambing, E., Axelrod, H., Biorac, T., Canaves, J. M., Chiu, H. J., Dai, X., Deacon, A. M., DiDonato, M., Elsliger, M. A., Godzik, A., Grittini, C., Grzechnik, S. K., Hale, J., Hampton, E., Haugen, J., Hornsby, M., Jaroszewski, L., Klock, H. E., Koesema, E., Kreusch, A., Kuhn, P., Lesley, S. A., McPhillips, T. M., Miller, M. D., Moy, K., Nigoghossian, E., Paulsen, J., Quijano, K., Reyes, R., Spraggon, G., Stevens, R. C., van den Bedem, H., Velasquez, J., Vincent, J., White, A., Wolf, G., Xu, Q., Hodgson, K. O., Wooley, J., and Wilson, I. A. (2005) Crystal structure of an Apo mRNA decapping enzyme (DcpS) from Mouse at 1.83 Å resolution, Proteins 60, 797–802.
Mathews, I. I, McMullan, D., Miller, M. D., Canaves, J. M., Elsliger, M. A., Floyd, R., Grzechnik, S. K., Jaroszewski, L., Klock, H. E., Koesema, E., Kovarik, J. S., Kreusch, A., Kuhn, P., McPhillips, T. M., Morse, A. T., Quijano, K., Rife, C. L., Schwarzenbacher, R., Spraggon, G., Stevens, R. C., van den Bedem, H., Weekes, D., Wolf, G., Hodgson, K. O., Wooley, J., Deacon, A. M., Godzik, A., Lesley, S. A., and Wilson, I. A. (2007) Crystal structure of 2-keto-3-deoxygluconate kinase (TM0067) from Thermotoga maritima at 2.05 Å resolution, Proteins 70, 603–608.
Kim, S. H., Shin, D. H., Choi, I. G., Schulze-Gahmen, U., Chen, S., and Kim, R. (2003) Structure-based functional inference in structural genomics, J Struct Funct Genomics 4, 129–135.
Holm, L., and Sander, C. (1996) Alignment of three-dimensional protein structures: network server for database searching, Methods Enzymol 266, 653–662.
Martin, A. C. (2000) The ups and downs of protein topology; rapid comparison of protein structure, Protein Eng 13, 829–837.
Gibrat, J. F., Madej, T., and Bryant, S. H. (1996) Surprising similarities in structure comparison, Curr Opin Struct Biol 6, 377–385.
Crowley, C., Sawaya, M. R., and Yeates, T. O. Crystal structure of PduU from the ethanolamine microcompartment of Salmonella, In Preparation.
Kerfeld, C. A., Sawaya, M. R., Tanaka, S., Nguyen, C. V., Phillips, M., Beeby, M., and Yeates, T. O. (2005) Protein structures forming the shell of primitive bacterial organelles, Science 309, 936–938.
Kawabata, T. (2003) MATRAS: a program for protein 3D structure comparison, Nucleic Acids Res 31, 3367–3369.
Ye, Y., and Godzik, A. (2004) FATCAT: a web server for flexible structure comparison and structure similarity searching, Nucleic Acids Res 32, W582–W585.
Shindyalov, I. N., and Bourne, P. E. (1998) Protein structure alignment by incremental combinatorial extension (CE) of the optimal path, Protein Eng 11, 739–747.
Gaspari, Z., Vlahovicek, K., and Pongor, S. (2005) Efficient recognition of folds in protein 3D structures by the improved PRIDE algorithm, Bioinformatics 21, 3322–3323.
Shyu, C. R., Chi, P. H., Scott, G., and Xu, D. (2004) ProteinDBS: a real-time retrieval system for protein structure comparison, Nucleic Acids Res 32, W572–W575.
Pearl, F. M., Bennett, C. F., Bray, J. E., Harrison, A. P., Martin, N., Shepherd, A., Sillitoe, I., Thornton, J., and Orengo, C. A. (2003) The CATH database: an extended protein family resource for structural and functional genomics, Nucleic Acids Res 31, 452–455.
Acknowledgment
G.W.H. and C.R. are affiliated with the JCSG, which is supported by the National Institute of General Medical Sciences, Protein Structure Initiative; Grant Numbers P50 GM62411, U54 GM074898. M.R.S. is supported by the Howard Hughes Medical Institute. We thank Ian Wilson, Anand Kolatkar, and Robyn Stanfield for their comments on this chapter. We also thank Lisa Van Veen for technical assistance. This is TSRI manuscript number 19238. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Han, G.W., Rife, C., Sawaya, M.R. (2009). Applications of Bioinformatics to Protein Structures: How Protein Structure and Bioinformatics Overlap. In: Astakhov, V. (eds) Biomedical Informatics. Methods in Molecular Biology™, vol 569. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-59745-524-4_8
Download citation
DOI: https://doi.org/10.1007/978-1-59745-524-4_8
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-934115-63-3
Online ISBN: 978-1-59745-524-4
eBook Packages: Springer Protocols