Sequence Specific DNA Binding of Ets-1 Transcription Factor: Molecular Dynamics Study on the Ets Domain–DNA Complexes

https://doi.org/10.1016/S0022-2836(03)00726-5Get rights and content

Abstract

Molecular dynamics (MD) simulations for Ets-1 ETS domain–DNA complexes were performed to investigate the mechanism of sequence-specific recognition of the GGAA DNA core by the ETS domain. Employing the crystal structure of the Ets-1 ETS domain–DNA complex as a starting structure we carried out MD simulations of: (i) the complex between Ets-1 ETS domain and a 14 base-pair DNA containing GGAA core sequence (ETS–GGAA); (ii) the complex between the ETS domain and a DNA having single base-pair mutation, GGAG sequence (ETS–GGAG); and (iii) the 14 base-pair DNA alone (GGAA). Comparative analyses of the MD structures of ETS–GGAA and ETS–GGAG reveal that the DNA bending angles and the ETS domain–DNA phosphate interactions are similar in these complexes. These results support that the GGAA core sequence is distinguished from the mutated GGAG sequence by a direct readout mechanism in the Ets-1 ETS domain–DNA complex. Further analyses of the direct contacts in the interface between the helix-3 region of Ets-1 and the major groove of the core DNA sequence clearly show that the highly conserved arginine residues, Arg391 and Arg394, play a critical role in binding to the GGAA core sequence. These arginine residues make bidentate contacts with the nucleobases of GG dinucleotides in GGAA core sequence. In ETS–GGAA, the hydroxyl group of Tyr395 is hydrogen bonded to N7 nitrogen of A3 (the third adenosine in the GGAA core), while the hydroxyl group makes a contact with N4 nitrogen of C4′ (the complementary nucleotide of the fourth guanosine G4 in the GGAG sequence) in the ETS–GGAG complex. We have found that this difference in behavior of Tyr395 results in the relatively large motion of helix-3 in the ETS–GGAG complex, causing the collapse of bidentate contacts between Arg391/Arg394 and the GG dinucleotides in the GGAG sequence.

Introduction

The ETS protein family contains more than 45 eukaryotic transcription activators and inhibitors, such as Ets-1, PU.1, Fli-1, GABPα, SAP-1, TEL and Elk-1.1., 2., 3. Members of this family play an important role in normal cell proliferation and differentiation. The DNA rearrangement and/or overexpression of ets gene have been known to lead to tumorigenesis.4 In order to regulate gene expression, the ETS family of proteins bind to a consensus DNA sequence centered on the core sequence 5′-GGA(A/T)-3′ through the highly conserved DNA-binding domain.3 The DNA-binding domain for ETS proteins, termed ETS domain, is about 85 amino acid residues in length and forms a winged helix-turn-helix motif consisting of three α-helices and four β-strands. The recent X-ray5., 6., 7., 8., 9., 10. and NMR11., 12., 13. studies of the ETS domain–DNA complexes have shown that the helix-3 in the winged helix-turn-helix motif binds in the major groove of the consensus DNA sequence.

In the crystal structure of Ets-1 ETS domain–DNA [d(TAGTGCCGGAAATGT)2] complex (PDB code: 1K79), two arginine residues, Arg391 and Arg394, which are in the helix-3 region and conserved among the ETS family, make bidentate interactions with G1 and G2, respectively (Figure 1).5 However, the pattern of these hydrogen bonds is not maintained in the crystal structures of other ETS domain–DNA complexes.7., 9., 10. In addition, the interaction between the arginine residues, Arg391 and Arg394, and the consensus DNA sequence is not observed in the NMR study on Ets-1 ETS domain–DNA complex.12., 13.

On the other hand, a few studies have proposed direct contacts of the amino acid residues in ETS domain with the AA region (+3 and +4 positions) in the GGAA core. For example, an X-ray study of the Ets-1 ETS domain–DNA complex indicated what would be a vital role of hydrophobic interaction between the phenyl ring of Tyr395 and 5-methyl group of T4′ or T5′.5 However, this type of interaction was not observed in other ETS domain–DNA complexes,6 nor is the tyrosine residue conserved in other ETS family proteins such as PU.1 and TEL.9., 10. Thus, the precise molecular mechanism that clearly explains the sequence-specific GGAA recognition by ETS domain is still lacking.

The phosphate groups of DNA adjacent to the core sequence GGAA have contacts to the winged segment and the turn region between helix-2 and helix-3 of the ETS domain. It was reported that the neutralization of anionic phosphate charges on one face of DNA resulted in the DNA bending, probably due to the electrostatic repulsions of the remaining anionic charges.14., 15., 16. In fact, DNA bending was observed in the crystal structures of the ETS domain–DNA complexes.5., 6., 7., 8., 9., 10. It was also supposed that the conformational change of DNA caused by the DNA bending would serve to provide effective GGAA core recognition by the helix-3 of the ETS domain. However, the bending angles of DNA previously reported in the X-ray crystallographic analyses of ETS domain–DNA complexes vary from one system to another.5., 6., 7., 8., 9., 10. Thus, additional investigation is required in order to clarify a common role of the DNA bending in the sequence-specific binding of the ETS domain.

Significant developments have been made in the last few years in the procedures of molecular dynamics (MD). Improvements in AMBER,17 CHARMM18., 19. and GROMOS20 force fields and effective treatment of long-range electrostatic interactions by using particle mesh Ewald (PME) method,21 explicit inclusion of solvent and ions have opened the possibilities for accurate determination of protein and DNA structures.22 Besides availability of supercomputers have enabled to undertake simulations on a nanosecond (ns) time-scale which expanded conformational sampling and eventually to elucidate biomolecular interactions. Further understanding of the molecular mechanism of sequence-specific DNA binding of the ETS domain is likely to provide novel clues for the design of drugs that bind to inhibit the interaction between the ETS domain and DNA. We report here 3.5–3.9 ns MD simulations of two Ets-1 ETS domain–DNA complex systems. The amino acid sequence of Ets-1 ETS domain and the DNA sequences used in this study are shown in Figure 2. The binding activity of the ETS domain is known to be higher than that of the whole ETS-1 protein.23., 24., 25., 26., 27. Therefore, the ETS domain–DNA complex would be a good model system for MD simulation. The first system has the ETS domain (103 amino acid residues) of Ets-1 protein (Figure 2(a)) and the high affinity 14 base-pair DNA containing GGAA core (+1 to +4, Figure 2(b)) sequence, while the second one has a low affinity DNA involving a mutation of a single base-pair, GGAG sequence. In the text, we refer to these complexes as ETS–GGAA and ETS–GGAG, respectively. In addition, results from the MD simulation of the 14 base-pair DNA having the GGAA core sequence (referred as GGAA) are also discussed for comparison.

Section snippets

Results

The root-mean-square deviations (RMSD) of the protein backbone and DNA heavy-atoms with respect to the minimized structures of ETS domain–DNA complexes and DNA (GGAA) are given in Figure 3. During the simulation, the RMSD values of the protein fluctuated around 1.16–2.12 Å in ETS–GGAA and 0.92–1.55 Å in ETS–GGAG (Figure 3(a)), while those of DNA in ETS–GGAA and ETS–GGAG are from 1.06 Å to 2.18 Å and from 1.12 Å to 1.96 Å, respectively (Figure 3(b)). The plots indicate the stability at about 900 ps,

Comparison of the MD structures to the experimental data

MD simulations on ETS–GGAA and ETS–GGAG clearly show the presence of meta-stable states of hydrogen bonding in which the conserved residues, Arg391 and Arg394, are participating (Figure 7). In the low affinity ETS–GGAG complex, both arginine residues change hydrogen bonding partners after 2 ns MD simulation. In contrast, Arg394 in ETS–GGAA changes only its side-chain conformation preference at around 1.7 ns to form more stable bidentate hydrogen bonds with the same partner (Figure 8, Figure 8).

Conclusions

We conducted 3.5–3.9 ns MD simulations of Ets-1 ETS domain–14 base-pair DNA complexes with PME treatment of electrostatic interactions. These MD simulations have provided us a good deal of information on the sequence specific interaction between ETS domain and the consensus DNA, as schematically shown in Figure 10. Two conserved arginine residues, Arg391 and Arg394, play an important role in binding with the GGAA core sequence. Although these arginine residues show certain flexibility in

Modeling of initial structures

The crystal structure (PDB code: 1K79) of Ets-1 ETS domain–15 base-pair DNA complex5 was used for preparation of the starting structure of ETS–GGAA. Although the crystal structure involved two complexes in the asymmetric unit, these two structures are essentially identical. So, the first complex was considered in our study. In the crystal structure, the 15 base-pair DNA has a 5′-overhang structure, so the 5′-terminal nucleotide in each strand was deleted. An unusual hydrogen bond pattern was

Acknowledgements

We thank Sun Hur (UCSB) for helpful discussions. This work was supported by NIH grant 5R37DK0917136. We acknowledge computer time on UCSB's SGI Origin 2000.

References (52)

  • S.F. Chen et al.

    Indirect readout of DNA sequence at the primary-kink site in the CAP–DNA complex: alteration of DNA binding specificity through alteration of DNA kinking

    J. Mol. Biol

    (2001)
  • A.M. Martin et al.

    Structural and energetic origins of indirect readout in site-specific DNA cleavage by a restriction endonuclease

    Nature Struct. Biol

    (1999)
  • B.R. Szymczyna et al.

    DNA Binding specificity studies of four ETS proteins support an indirect read-out mechanism of protein–DNA recognition

    J. Biol. Chem

    (2000)
  • T.E. Ferrin et al.

    The Midas display system

    J. Mol. Graph

    (1988)
  • C.C. Huang et al.

    Conic—a fast renderer for space-filling molecules with shadows

    J. Mol. Graph

    (1991)
  • B. Wasylyk et al.

    The Ets family of transcription factors

    Eur. J. Biochem

    (1993)
  • Y. Mo et al.

    Structure of the Elk-1–DNA complex reveals how DNA-distal residues affect ETS domain recognition of DNA

    Nature Struct. Biol

    (2000)
  • A.H. Batchelor et al.

    The structure of GABP alpha/beta: an ETS domain ankyrin repeat heterodimer bound to DNA

    Science

    (1998)
  • R. Kodandapani et al.

    A new pattern for helix-turn-helix recognition revealed by the PU.1 ETS-domain–DNA complex

    Nature

    (1996)
  • H. Liang et al.

    Solution structure of the Ets domain of Fli-1 when bound to DNA

    Nature Struct. Biol

    (1994)
  • M.H. Werner et al.

    Correction of the NMR structure of the ETS1/DNA complex

    J. Biomol. NMR

    (1997)
  • K.M. Kosikov et al.

    Bending of DNA by asymmetric charge neutralization: all-atom energy simulations

    J. Am. Chem. Soc

    (2002)
  • J.K. Strauss et al.

    DNA bending by asymmetric phosphate neutralization

    Science

    (1994)
  • T.E. Cheatham et al.

    A modified version of the Cornell et al. Force field with improved sugar pucker phases and helical repeat

    J. Biomol. Struct. Dynam

    (1999)
  • N. Foloppe et al.

    All-atom empirical force field for nucleic acids: I. Parameter optimization based on small molecule and condensed phase macromolecular target data

    J. Comput. Chem

    (2000)
  • A.D. MacKerell et al.

    All-atom empirical force field for nucleic acids: II. Application to molecular simulations of DNA and RNA in solution

    J. Comput. Chem

    (2000)
  • Cited by (17)

    • Mechanism of cognate sequence discrimination by the ETS-family transcription factor ETS-1

      2019, Journal of Biological Chemistry
      Citation Excerpt :

      We therefore compared explicit solvent simulations of the ETS domain of murine ETS1 (residues 331–440) in the unbound state and in complex with high-affinity, low-affinity cognate (consensus-bearing), or nonspecific DNA. These simulations represent the longest-duration and broadest-scope all-atom studies of any ETS protein to date (29–33). The results are consistent with literature data and reveal novel mechanisms of sequence discrimination.

    • Sequence discrimination by DNA-binding domain of ETS family transcription factor PU.1 is linked to specific hydration of protein-DNA interface

      2012, Journal of Biological Chemistry
      Citation Excerpt :

      Characterization of these variants is essential, even if they are not highly represented in in vivo binding sites, because this knowledge is ultimately required to complete our understanding of the intrinsic preference of ETS protein for one binding site over another. Unfortunately, comparative studies of core consensus variants 5′-GGAN-3′ (10–13) do not directly answer this question because all four core bases receive base-specific inputs. To address the physical contribution of the flanking sequences to sequence discrimination by the PU.1 ETS domain, we investigated the structure and binding properties of two PU.1 ETS-bound sites differing strictly in the flanking sequences.

    • Correlated motions and interactions at the onset of the DNA-induced partial unfolding of ETS-1

      2009, Biophysical Journal
      Citation Excerpt :

      Helix H3 of this domain binds the GGAA/T core sequence (23) in the major groove of DNA, whereas the wing between sheets 3 and 4 interacts with the 3′ minor groove (24). The recognition of the core sequence involves a mixture of direct and indirect readout mechanisms (25,26). The ETS domain is flanked by the autoinhibitory module, consisting of residues 415–440 of the F-domain and residues 301–330 of the D- or exon VII domain (27–30).

    • Molecular dynamics studies on free and bound targets of the bovine papillomavirus type I E2 protein: The protein binding effect on DNA and the recognition mechanism

      2005, Biophysical Journal
      Citation Excerpt :

      Moreover, to investigate the change in DNA flexibility induced by a protein, both free and bound DNA should obviously be studied in parallel. Some MD studies (19–21) consider the two forms of the relevant DNA, but, again, they do not explicitly deal with the effect of the protein on the DNA motion. Finally, the DNA behavior was examined by means of a 50-bp fragment containing the OR1/OR2 sites, free and complexed with the phage 434 operators (22), and, although the size of the system limited the trajectory duration to 1 ns, it was concluded that the OR1 dynamics were enhanced by the operator whereas the OR2 plasticity slightly decreased.

    • Comparison of positively charged DNG with DNA duplexes: A computational approach

      2005, Bioorganic and Medicinal Chemistry
      Citation Excerpt :

      Helical parameters were determined by use of FREEHELIX,43 evaluating the central seven base steps and eight base pairs of each dodecamer. Bending angles were defined as the angle between the normal vectors of base-pairs 3 and 10 of each duplex,44,45 and were evaluated by use of FREEHELIX. Groove widths were defined as the distance between interstrand phosphorous atoms separated by three to four base pairs, that is, P(i−2)⋯P′(i + 2) across the major groove with four base-pair separation, and P′(i−2)⋯P(i + 2) across the minor groove with three base-pair separation.44

    View all citing articles on Scopus
    View full text