Regular Articles
Alignment and Searching for Common Protein Folds Using a Data Bank of Structural Templates

https://doi.org/10.1006/jmbi.1993.1323Get rights and content

Abstract

We introduce an approach to protein comparisons in which tertiary-structure information is exploited in the alignment of a protein sequence of known tertiary structure, or an aligned set of sequences of known homologous structures, with one or more sequences. The local tertiary environments of residues in the one or more three-dimensional structures (defined in terms of residue accessibility to solvent, secondary structure and hydrogen bonding) are used to select position-specific amino acid substitution scores and produce a scoring template suitable for aligning sequences or searching sequence data banks. The amino acid substitution score have been accumulated from 72 families of protein structure in which the observed substitutions have been classified according to features of the local structure. Hence, the value attributed to a particular amino acid interchange in the template is not a constant, but is dependent upon the environmental context in which that substitution has occurred. We have used these structural templates to align proteins, as well as to search an amino acid sequence data bank for proteins having a similar fold, Indeed, a database of templates that corresponds to both unique structures and aligned homologous structures from the Brookhaven Protein Data Bank has been produce. A new sequence can be searched against the database of templates in order to identify a similar tertiary fold even if the sequence is not significantly similar to any proteins of known three-dimensional structure.

References (0)

Cited by (174)

  • Depth dependent amino acid substitution matrices and their use in predicting deleterious mutations

    2017, Progress in Biophysics and Molecular Biology
    Citation Excerpt :

    In the light of secondary structure prediction and 3D structure modeling/evaluation exercises, amino acid environments have to be described accurately. This implies that an amino acid substitution table that considered not just the likelihoods of pairwise substitutions but also the environmental context and/or protein categories would be best suited for the purpose (Abascal et al., 2007; Adachi and Hasegawa, 1996; Arvestad, 2006; Braberg et al., 2012; Dimmic et al., 2002; Goldman et al., 1998; Johnson et al., 1993; Jones et al., 1994; Koshi and Goldstein, 1995; Lartillot and Philippe, 2004; Lüthy et al., 1991; Madhusudhan et al., 2009; Mehta et al., 1995; Overington et al., 1990; Rice and Eisenberg, 1997; Shi et al., 2001; Thorne et al., 1996; Wako and Blundell, 1994). As depth is a concise measure of amino acid environment, we developed depth dependent substitution matrices that can capture the substitution likelihoods in different environments.

  • Keynote review: Structural biology and drug discovery

    2005, Drug Discovery Today
    Citation Excerpt :

    Such approaches can be classified either as profile methods or threading. The profile methods introduce structural information into traditional sequence comparison algorithms, either using structure-dependent propensities [16,17] or substitution matrices and gap penalties [18–20]. By contrast, threading fits a probe sequence onto the backbone of a known structure, evaluating the compatibility between the sequence and the proposed structure by means of a set of empirical potentials that are derived from well-resolved protein structure data [21].

View all citing articles on Scopus
View full text