Abstract
A computer program (ORB) has been developed to predict ¹H, ¹³C and ¹⁵N NMR chemical shifts of previously unassigned proteins. The program makes use of the information contained in a chemical shift database of previously assigned proteins supplemented by a statistically derived averaged chemical shift database in which the shifts are categorized according to their residue, atom and secondary structure type [Wishart et al. (1991) J. Mol. Biol., 222, 311–333]. The prediction process starts with a multiple alignment of all previously assigned proteins with the unassigned query protein. ORB uses the sequence and secondary structure alignment program XALIGN for this task [Wishart et al. (1994) CABIOS, 10, 121–132; 687–688]. The prediction algorithm in ORB is based on a scoring of the known shifts for each sequence. The scores depend on global sequence similarity, local sequence similarity, structural similarity and residue similarity and determine how much weight one particular shift is given in the prediction process. In situations where no applicable previously assigned chemical shifts are available, the shifts derived from the averaged database are used. In addition to supplying the user with predicted chemical shifts, ORB calculates a confidence value for every prediction. These confidence values enable the user to judge which predictions are the most accurate and they are particularly useful when ORB is incorporated into a complete autoassignment package. The usefulness of ORB was tested on three medium-sized proteins: an interleukin-8 analog, a troponin C synthetic peptide heterodimer and cardiac troponin C. Excellent results are obtained if ORB is able to use the chemical shifts of at least one highly homologous sequence. ORB performs well as long as the sequence identity between proteins with known chemical shifts and the new sequence is not less than 30%.
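The score-weighted prediction with a fallback to the averaged database can be illustrated with a minimal Python sketch. It is not ORB's actual scoring scheme, which the abstract does not specify. The record type `CandidateShift`, the functions `candidate_weight` and `predict_shift`, the multiplicative combination of the four similarity scores, and the use of the largest single weight as a confidence value are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class CandidateShift:
    """A known shift from one aligned, previously assigned protein (hypothetical record)."""
    shift_ppm: float
    global_sim: float      # overall sequence similarity to the query, 0..1
    local_sim: float       # similarity in a window around the residue, 0..1
    structural_sim: float  # secondary structure agreement, 0..1
    residue_sim: float     # similarity of the aligned residue types, 0..1


def candidate_weight(c: CandidateShift) -> float:
    # Assumption: the four scores are combined by simple multiplication.
    return c.global_sim * c.local_sim * c.structural_sim * c.residue_sim


def predict_shift(
    candidates: List[CandidateShift],
    averaged_shift_ppm: float,
    min_total_weight: float = 1e-6,
) -> Tuple[float, float]:
    """Return (predicted shift in ppm, confidence in 0..1)."""
    weights = [candidate_weight(c) for c in candidates]
    total = sum(weights)
    if total < min_total_weight:
        # No applicable assigned shift: fall back to the averaged database value.
        return averaged_shift_ppm, 0.0
    predicted = sum(w * c.shift_ppm for w, c in zip(weights, candidates)) / total
    # Assumption: confidence is the weight of the best-matching candidate.
    confidence = max(weights)
    return predicted, confidence


if __name__ == "__main__":
    known = [
        CandidateShift(8.0, 1.0, 1.0, 1.0, 1.0),  # close homolog
        CandidateShift(9.0, 0.5, 1.0, 1.0, 1.0),  # more distant homolog
    ]
    print(predict_shift(known, averaged_shift_ppm=8.2))
    print(predict_shift([], averaged_shift_ppm=8.2))
```

In this sketch a close homolog dominates the weighted mean, and a prediction that rests only on the averaged database is reported with zero confidence, so a downstream assignment step could treat it with more caution.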
References
Anglister, J., Grzesiek, S., Wang, A.C., Ren, H., Klee, C.B. and Bax, A. (1994) Biochemistry, 33, 3540–3547.
Bartels, C., Billeter, M., Güntert, P. and Wüthrich, K. (1996) J. Biomol. NMR, 7, 207–213.
Bax, A. and Grzesiek, S. (1993) Acc. Chem. Res., 26, 131–138.
Chou, P.Y. and Fasman, G.D. (1974) Biochemistry, 13, 222–245.
Chou, P.Y. and Fasman, G.D. (1978) Annu. Rev. Biochem., 47, 251–276.
Clore, G.M., Appella, E., Yamada, M., Matsushima, K. and Gronenborn, A.M. (1989) J. Biol. Chem., 264, 18907–18911.
de Dios, A.C., Pearson, J.G. and Oldfield, E. (1993) Science, 260, 1491–1496.
Eisenberg, D., Weiss, R.M. and Terwilliger, R.C. (1984) Proc. Natl. Acad. Sci. USA, 81, 140–144.
Gagne, S.M., Tsuda, S., Li, M.X., Chandra, M., Smillie, L.B. and Sykes, B.D. (1994) Protein Sci., 3, 1961–1974.
Garnier, J., Osguthorpe, D.J. and Robson, B. (1978) J. Mol. Biol., 120, 97–120.
Garrett, D.S., Powers, R., Gronenborn, A.M. and Clore, M. (1991) J. Magn. Reson., 95, 214–220.
Gibrat, J.F., Garnier, J. and Robson, B. (1987) J. Mol. Biol., 198, 425–443.
Hare, B.J. and Prestegard, J.H. (1994) J. Biomol. NMR, 4, 35–46.
Ikura, M., Kay, L. and Bax, A. (1990) Biochemistry, 29, 4659–4667.
Kim, K.-S., Clark-Lewis, I. and Sykes, B.D. (1994) J. Biol. Chem., 269, 32909–32915.
Kjaer, M., Andersen, K.V. and Poulsen, F.M. (1994) Methods Enzymol., 239, 288–318.
Kleywegt, G.J., Boelens, R., Cox, M., Llinás, M. and Kaptein, R. (1991) J. Biomol. NMR, 1, 23–47.
Larsen, R.J. and Marx, M.L. (1981) An Introduction to Mathematical Statistics and its Applications, Prentice-Hall, Englewood Cliffs, NJ, U.S.A.
Levin, J.M., Robson, B. and Garnier, J. (1986) FEBS Lett., 205, 303–308.
Levin, J.M. and Garnier, J. (1988) Biochim. Biophys. Acta, 955, 283–295.
Mayo, K.H., Roongta, V., Ilyina, E., Milius, R., Barker, S., Quinlan, C., La Rosa, G. and Daly, T.J. (1995) Biochemistry, 34, 11399–11409.
Merutka, G., Dyson, H.J. and Wright, P.E. (1995) J. Biomol. NMR, 5, 14–24.
Morelle, N., Brutscher, B., Simorre, J.-P. and Marion, D. (1995) J. Biomol. NMR, 5, 154–160.
Olson Jr., J.B. and Markley, J.L. (1994) J. Biomol. NMR, 4, 385–410.
Ösapay, K. and Case, D.A. (1994) J. Biomol. NMR, 4, 215–230.
Rajarathnam, K., Clark-Lewis, I. and Sykes, B.D. (1994) Biochemistry, 33, 6623–6630.
Rajarathnam, K., Clark-Lewis, I. and Sykes, B.D. (1995) Biochemistry, 34, 12983–12990.
Rooman, M.J. and Wodak, S.F. (1988) Nature, 335, 45–49.
Rooman, M.J. and Wodak, S.F. (1990) J. Mol. Biol., 213, 337–350.
Rooman, M.J. and Wodak, S.F. (1991) Proteins Struct. Funct. Genet., 9, 68–78.
Seavey, B.R., Farr, E.A., Westler, W.M. and Markley, J.L. (1991) J. Biomol. NMR, 1, 217–236.
Shaw, G.S., Hodges, R.S. and Sykes, B.D. (1990) Science, 249, 280–283.
Shaw, G.S., Findlay, W.A., Semchuk, P.D., Hodges, R.S. and Sykes, B.D. (1992) J. Am. Chem. Soc., 114, 6258–6259.
Slupsky, C.M., Reinach, F.C., Smillie, L.B. and Sykes, B.D. (1995) Protein Sci., 4, 1279–1290.
Slupsky, C.M. and Sykes, B.D. (1995) Biochemistry, 34, 15953–15964.
Williams, R.W., Chang, A., Juretic, D. and Loughram, S. (1987) Biochim. Biophys. Acta, 916, 200–204.
Williamson, M.P., Kikuchi, J. and Asakura, T. (1995) J. Mol. Biol., 247, 541–546.
Wishart, D.S., Sykes, B.D. and Richards, F.M. (1991) J. Mol. Biol., 222, 311–333.
Wishart, D.S., Richards, F.M. and Sykes, B.D. (1992) Biochemistry, 31, 1647–1651.
Wishart, D.S., Boyko, R.F. and Sykes, B.D. (1994) CABIOS, 10, 687–688.
Wishart, D.S., Boyko, R.F., Willard, L., Richards, F.M. and Sykes, B.D. (1994) CABIOS, 10, 121–132.
Wishart, D.S. and Sykes, B.D. (1994) J. Biomol. NMR, 4, 171–180.
Wishart, D.S. and Sykes, B.D. (1994) Methods Enzymol., 239, 363–391.
Wishart, D.S., Bigam, C.G., Holm, A., Hodges, R.S. and Sykes, B.D. (1995) J. Biomol. NMR, 5, 67–81.
Wüthrich, K. (1986) NMR of Proteins and Nucleic Acids, Wiley, New York, NY, U.S.A.
Zimmermann, D., Kulikowski, C., Wang, L., Lyons, B. and Montelione, G.T. (1994) J. Biomol. NMR, 4, 241–256.
Cite this article
Gronwald, W., Boyko, R.F., Sönnichsen, F.D. et al. ORB, a homology-based program for the prediction of protein NMR chemical shifts. J Biomol NMR 10, 165–179 (1997). https://doi.org/10.1023/A:1018389332160
DOI: https://doi.org/10.1023/A:1018389332160