An amino acid map of inter-residue contact energies using metric multi-dimensional scaling
Introduction
In this work, we present a map based on the inter-residue contact energies given by the Miyazawa–Jernigan (MJ) matrix (Miyazawa and Jernigan, 1996). By presenting the data in a visual form, namely a map based on the metric multi-dimensional scaling (MMDS), we hope to reduce the complexity of finding out the inter-relations among the residues which might not be directly evident from the MJ matrix. Each amino acid is represented as a point on the MMDS map. The distance between two points on the map quantifies the dissimilarity in their contact energies. The larger the distance the larger the dissimilarity. This map brings out hidden relationships among the amino acids that are not easily discerned from the MJ matrix. The MMDS method is frequently used for a visual representation from a set of data representing the relation among a number of objects. Similar work was reported by French and Robson (1983) who had derived a map using MMDS for amino acids from Dayhoff's “relatedness odds matrix” (1972). Luthra et al. (2007), who presented a summary of different techniques to reduce the number of amino acid alphabet, note that French and Robson's (1983) work was one of the first attempts in reducing the amino acid alphabet. French and Robson (1983) were able to conclude from their map that hydrophobicity and molecular volume are two key properties that are conserved in the evolution of proteins. The MMDS map presented in this paper verifies that hydrophobicity is the key feature that characterizes the amino acid residues and that the inter-residue contact energies represent a rough hydrophobicity scale (Cornette et al., 1987; Chan, 1999; Venkatarajan and Braun, 2001). This map is based on the revised MJ matrix reported in 1996 and hence includes extensive structure and sequence information. Additionally, with the help of this map, we compare (the similarities/differences among amino acid residues as represented by) the MJ matrix with BLOSUM62 and PAM250 matrices. A novel feature of our map is that it can be used as a visual method of reducing the amino acid set. We support this by determining the groups using a hierarchical clustering method (Johnson and Wichern, 2006). By using the above method we are also able to arrive at an optimum number of groups for reducing the amino acid set. Recently, Agrafiotis et al. (2001) coupled MDS with nonlinear mapping (NLM) and neural networks and used it for mapping of large combinatorial libraries and ensemble of molecular conformations but not for the classification of amino acids. On the other hand, Venkatarajan and Braun (2001) used principal component analysis (Johnson and Wichern, 2006) for creating amino acid maps using large data sets. They used 237 physical–chemical properties of amino acids to form a vector in a 237-dimensional space for each amino acid and reduced the resulting matrix to a five-dimensional space. This was done by using the first five eigenvalues and eigenvectors. They showed that the principal components correspond to important properties such as hydrophobicity–hydrophilicity and molecular volume. As discussed in this paper, the same conclusion is drawn from the MMDS map presented in this paper in addition to some other results.
The rest of the paper is organized as follows. In the next section, we explain the method of multi-dimensional scaling that has been used in constructing the map. In the following section, we present the results and how the map can be used for finding out features that are not immediately apparent from the MJ matrix. The final section contains the concluding remarks.
Section snippets
Method
Metric multi-dimensional scaling (Mead, 1992) is a multi-variate statistical analysis technique that is used for making a visual representation from a n×n matrix representing the interaction between a set of n objects that one is interested to study. The ijth entry in the matrix represents the interaction between ith and jth objects. If the ijth entry in the matrix represents dissimilarity between the ith and jth objects, then the matrix is called the dissimilarity or distance or proximity
Results and discussion
Fig. 4 shows the map created using MMDS on MJ matrix. We note that the residues lie along an axis that corresponds to an approximate increase in hydrophobicity (Cornette et al., 1987). This axis is shown in Fig. 5. The curved axis in Fig. 5 shows the direction of increase in inter-residue contact energies. We also show the classification of amino acids according to their chemical properties as done by Dayhoff et al. (1972) in this figure.
In Fig. 6, we show the residues that favorably substitute
Conclusions
The metric multi-dimensional map (MMDS) of the amino acid residues gives an informative and revealing representation of the MJ matrix on a two-dimensional plane. It shows which amino acid residues are similar to one another in terms of statistics-based contact energy. We infer from the map that the inter-residue contact energies in the MJ matrix underscore the hydrophobic–hydrophilic character of the amino acid residues. We are able to represent on the same platform, i.e., the MMDS map,
Acknowledgments
The authors thank Prof. Saraswathi Vishveswara (Molecular Biophysics Unit, Indian Institute of Science) for helpful technical discussions. The authors are also grateful to an anonymous reviewer for his/her insightful suggestions, which helped improve this paper. This work was supported in part by the Swarnajayanthi fellowship of the Department of Science and Technology, Government of India, to the second author. This support is gratefully acknowledged.
References (24)
- et al.
Hydrophobicity scales and computational techniques for detecting amphipatic structures in proteins
J. Mol. Biol.
(1987) - et al.
Multidimensional scaling and visualization of large molecular similarity tables
J. Comp. Chem.
(2001) - et al.
Materials and Design: The Art and Science of Material Selection in Product Design
(2002) - et al.
Simplifying amino acid alphabets by means of a branch and bound algorithm and substitution matrices
Bioinformatics
(2002) Folding alphabets
Nat. Struct. Biol.
(1999)- et al.
Amino acid classes and protein folding problem
J. Chem. Phys.
(2001) - Dayhoff, M.O., Eck, R.V., Park, C.M., 1972. A model of evolutionary change in proteins. In: Dayhoff, M.O. (Ed.), Atlas...
- et al.
What is conservative substitution?
J. Mol. Evol.
(1983) - et al.
Applied Multivariate Statistical Analysis
(2006) - et al.
A new criteria and method for amino acid classification
J. Theor. Biol.
(2004)
Multidimensional Scaling
Nature of driving force for protein folding: a result from analyzing statistical potential
Phys. Rev. Lett.
Cited by (10)
Multi-Branch-CNN: Classification of ion channel interacting peptides using multi-branch convolutional neural network
2022, Computers in Biology and MedicineAlphabet reduction and distributed vector representation based method for classification of antimicrobial peptides
2020, Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020