An amino acid map of inter-residue contact energies using metric multi-dimensional scaling

https://doi.org/10.1016/j.jtbi.2007.09.032Get rights and content

Abstract

We present an amino map based on their inter-residue contact energies using the Miyazawa–Jernigan matrix. This work is based on the method of metric multi-dimensional scaling (MMDS). The MMDS map shows, among other things, that the MJ contact energies imply the hydrophobic–hydrophilic nature of the amino acid residues. With the help of the map we are able to compare and draw inferences from uncorrelated data sets such as BLOSUM and PAM with MJ methods. We also use a hierarchical clustering method on our MMDS distance matrix to group the amino acids and arrive at an optimum number of groups for simplifying the amino acid set.

Introduction

In this work, we present a map based on the inter-residue contact energies given by the MiyazawaJernigan (MJ) matrix (Miyazawa and Jernigan, 1996). By presenting the data in a visual form, namely a map based on the metric multi-dimensional scaling (MMDS), we hope to reduce the complexity of finding out the inter-relations among the residues which might not be directly evident from the MJ matrix. Each amino acid is represented as a point on the MMDS map. The distance between two points on the map quantifies the dissimilarity in their contact energies. The larger the distance the larger the dissimilarity. This map brings out hidden relationships among the amino acids that are not easily discerned from the MJ matrix. The MMDS method is frequently used for a visual representation from a set of data representing the relation among a number of objects. Similar work was reported by French and Robson (1983) who had derived a map using MMDS for amino acids from Dayhoff's “relatedness odds matrix” (1972). Luthra et al. (2007), who presented a summary of different techniques to reduce the number of amino acid alphabet, note that French and Robson's (1983) work was one of the first attempts in reducing the amino acid alphabet. French and Robson (1983) were able to conclude from their map that hydrophobicity and molecular volume are two key properties that are conserved in the evolution of proteins. The MMDS map presented in this paper verifies that hydrophobicity is the key feature that characterizes the amino acid residues and that the inter-residue contact energies represent a rough hydrophobicity scale (Cornette et al., 1987; Chan, 1999; Venkatarajan and Braun, 2001). This map is based on the revised MJ matrix reported in 1996 and hence includes extensive structure and sequence information. Additionally, with the help of this map, we compare (the similarities/differences among amino acid residues as represented by) the MJ matrix with BLOSUM62 and PAM250 matrices. A novel feature of our map is that it can be used as a visual method of reducing the amino acid set. We support this by determining the groups using a hierarchical clustering method (Johnson and Wichern, 2006). By using the above method we are also able to arrive at an optimum number of groups for reducing the amino acid set. Recently, Agrafiotis et al. (2001) coupled MDS with nonlinear mapping (NLM) and neural networks and used it for mapping of large combinatorial libraries and ensemble of molecular conformations but not for the classification of amino acids. On the other hand, Venkatarajan and Braun (2001) used principal component analysis (Johnson and Wichern, 2006) for creating amino acid maps using large data sets. They used 237 physical–chemical properties of amino acids to form a vector in a 237-dimensional space for each amino acid and reduced the resulting matrix to a five-dimensional space. This was done by using the first five eigenvalues and eigenvectors. They showed that the principal components correspond to important properties such as hydrophobicity–hydrophilicity and molecular volume. As discussed in this paper, the same conclusion is drawn from the MMDS map presented in this paper in addition to some other results.

The rest of the paper is organized as follows. In the next section, we explain the method of multi-dimensional scaling that has been used in constructing the map. In the following section, we present the results and how the map can be used for finding out features that are not immediately apparent from the MJ matrix. The final section contains the concluding remarks.

Section snippets

Method

Metric multi-dimensional scaling (Mead, 1992) is a multi-variate statistical analysis technique that is used for making a visual representation from a n×n matrix representing the interaction between a set of n objects that one is interested to study. The ijth entry in the matrix represents the interaction between ith and jth objects. If the ijth entry in the matrix represents dissimilarity between the ith and jth objects, then the matrix is called the dissimilarity or distance or proximity

Results and discussion

Fig. 4 shows the map created using MMDS on MJ matrix. We note that the residues lie along an axis that corresponds to an approximate increase in hydrophobicity (Cornette et al., 1987). This axis is shown in Fig. 5. The curved axis in Fig. 5 shows the direction of increase in inter-residue contact energies. We also show the classification of amino acids according to their chemical properties as done by Dayhoff et al. (1972) in this figure.

In Fig. 6, we show the residues that favorably substitute

Conclusions

The metric multi-dimensional map (MMDS) of the amino acid residues gives an informative and revealing representation of the MJ matrix on a two-dimensional plane. It shows which amino acid residues are similar to one another in terms of statistics-based contact energy. We infer from the map that the inter-residue contact energies in the MJ matrix underscore the hydrophobic–hydrophilic character of the amino acid residues. We are able to represent on the same platform, i.e., the MMDS map,

Acknowledgments

The authors thank Prof. Saraswathi Vishveswara (Molecular Biophysics Unit, Indian Institute of Science) for helpful technical discussions. The authors are also grateful to an anonymous reviewer for his/her insightful suggestions, which helped improve this paper. This work was supported in part by the Swarnajayanthi fellowship of the Department of Science and Technology, Government of India, to the second author. This support is gratefully acknowledged.

References (24)

  • J.L. Cornette et al.

    Hydrophobicity scales and computational techniques for detecting amphipatic structures in proteins

    J. Mol. Biol.

    (1987)
  • D.K. Agrafiotis et al.

    Multidimensional scaling and visualization of large molecular similarity tables

    J. Comp. Chem.

    (2001)
  • M.F. Ashby et al.

    Materials and Design: The Art and Science of Material Selection in Product Design

    (2002)
  • N. Cannata et al.

    Simplifying amino acid alphabets by means of a branch and bound algorithm and substitution matrices

    Bioinformatics

    (2002)
  • H.S. Chan

    Folding alphabets

    Nat. Struct. Biol.

    (1999)
  • M. Cieplak et al.

    Amino acid classes and protein folding problem

    J. Chem. Phys.

    (2001)
  • Dayhoff, M.O., Eck, R.V., Park, C.M., 1972. A model of evolutionary change in proteins. In: Dayhoff, M.O. (Ed.), Atlas...
  • S. French et al.

    What is conservative substitution?

    J. Mol. Evol.

    (1983)
  • R.A. Johnson et al.

    Applied Multivariate Statistical Analysis

    (2006)
  • C. Koisol et al.

    A new criteria and method for amino acid classification

    J. Theor. Biol.

    (2004)
  • J.B. Kruskal et al.

    Multidimensional Scaling

    (1978)
  • H. Li et al.

    Nature of driving force for protein folding: a result from analyzing statistical potential

    Phys. Rev. Lett.

    (1997)
  • Cited by (10)

    View all citing articles on Scopus
    View full text