Structure of Bacillus subtilis YXKO—A member of the UPF0031 family and a putative kinase

https://doi.org/10.1016/S1047-8477(02)00532-4Get rights and content

Abstract

We determined the 1.6-Å resolution crystal structure of a conserved hypothetical 29.9-kDa protein from the SIGY–CYDD intergenic region encoded by a Bacillus subtilis open reading frame in the YXKO locus. YXKO homologues are broadly distributed and are by and large described as proteins with unknown function. The YXKO protein has an α/β fold and shows high structural homology to the members of a ribokinase-like superfamily. However, YXKO is the only member of this superfamily known to form tetramers. Putative binding sites for adenosine triphosphate (ATP), a substrate, and Mg2+-binding sites were revealed in the structure of the protein, based on high structural similarity to ATP-dependent members of the superfamily. Two adjacent monomers contribute residues to the active site. The crystal structure provides valuable information about the YXKO protein’s tertiary and quaternary structure, the biochemical function of YXKO and its homologues, and the evolution of its ribokinase-like superfamily.

Introduction

The far-reaching goal of structural genomics programs is to map the entire protein folding space. These pilot projects exploit the available genome sequence information to select targets whose structures can be determined. Among genes coding for proteins that show no significant sequence similarity to proteins with known structure, there is a large group of conserved “hypothetical” open reading frames (ORFs)2 (Vitkup et al., 2001). Many of these ORFs include members from diverse organisms and can be grouped into large families. Their incidence in many branches of life suggests that they are important; however, their functions cannot be revealed through sequence analysis. Selection of such targets for structural genomics studies is important because the 3D structure of these proteins may offer insight not only into their folds but also into their biochemical or biophysical functions (Zarembinski et al., 1998)—provided that a structural homologue is already known (Martinez-Cruz et al., 2002)—and allow this knowledge to be extended to their sequence homologues. It is anticipated that as the number of nonredundant protein structures grows in the Protein Data Bank (PDB), this approach will be more successful in defining the functions of “hypothetical” proteins.

For this work we have selected, from a large set of structural genomics targets of the gram-positive bacterium Bacillus subtilis (http://www.mcsg.anl.gov), a conserved hypothetical 29.9-kDa protein from the SIGY–CYDD intergenic region encoded by the B. subtilis ORF in the YXKO locus (DNA bases 50795–51625) (Yoshida et al., 1996). [The code “YXKO” will be used for this protein (aka the Midwest Center for Structural Genomics APC234) throughout this text.] Sequence similarity searches of the National Center for Biotechnology Information nonredundant database, using BLAST (Altschul et al., 1997) and FASTA (Pearson, 2000), and the SWISS-PROT (Bairoch and Apweiler, 2000) and Protein Information Resource (Wu et al., 2002) databases resulted in 69 protein homologues of YXKO with statistically significant E values (E<10−5). These proteins are broadly distributed and are found in 42 species of bacteria, 14 species of archaea, and 13 eukaryotes, including human, and are described as proteins with unknown function. A few are classified as probable sugar kinases (Fig. 1).

These proteins belong to the Pfam (Bateman et al., 1999) UPF0031 family, which includes 62 uncharacterized bacterial proteins and contains two Prosite sequence patterns [UPF0031_1, (SAV)-(IVW)-(LVA)-(LIV)-G-(PNS)-G-L-(GP)-x-(DENQT), and UPF0031_2,(GA)-G-x-G-D-(TV)-(LT)-(STA)-G-x-(LIVM)]. The structures of all these proteins are unknown. YXKO has distant sequence homology with the ThiK protein (PDB entry 1ekk) deposited in the PDB (discussed later in this paper).

We have determined the 1.6-Å resolution crystal structure of YXKO from B. subtilis. The protein shows structural homology to members of the ribokinase-like carbohydrate kinase superfamily, as defined by SCOP (Murzin et al., 1995) and CATH (Orengo et al., 1997). Putative ATP- and substrate-binding sites in the YXKO structure were assigned and they reveal high structural similarity within ATP-dependent ribokinases (RKs). Further elucidation of YXKO’s detailed biochemical function will require a combination of enzymatic, mutational, and structural studies; however, the crystal structure for YXKO provides important insight into the function of this protein and others in the UPF0031 family.

Section snippets

Cloning, expression, and protein purification

The ORF of a B. subtilis YXKO protein (29.9 kDa) in the SIGY–CYDD intergenic region was amplified from genomic DNA with Pfx DNA polymerase using conditions and reagents provided by the vendor (Invitrogen). The gene was cloned into a pMCSG7 vector (Stols et al., 2002) using a modified ligation-independent cloning protocol (Dieckman et al., 2002). This process generated an expression clone producing a fusion protein with an N-terminal His6 tag and a TEV protease recognition site (ENLYFQ↓S). The

Description of YXKO structure

The protein crystallized in centered tetragonal crystal form in an I422 space group with one subunit in the asymmetric unit. In the crystal the YXKO monomer has an α/β fold and shows structural similarity to the RK-like carbohydrate kinase superfamily (Fig. 2) (Murzin et al., 1995; Orengo et al., 1997). Each subunit of YXKO contains 276 amino acids, with approximately 42% of the amino acids in α helices, 12% in β sheets, and 3% in 310 helices. The remaining portion of the structure is composed

Acknowledgements

We thank all the members of the Structural Biology Center at Argonne National Laboratory for their help in conducting experiments. This work was supported by National Institutes of Health Grant GM62414 and by the U.S. Department of Energy, Office of Biological and Environmental Research. Atomic coordinates have been deposited with the Protein Data Bank as 1KYH.

References (29)

  • A.T. Brunger et al.

    Crystallography & NMR system: a new software suite for macromolecular structure determination

    Acta. Crystallogr D Biol. Crystallogr.

    (1998)
  • N. Campobasso et al.

    Crystal structure of 4-methyl-5-beta-hydroxyethylthiazole kinase from Bacillus subtilis at 1.5 Å resolution

    Biochemistry

    (2000)
  • Dieckman, L., Gu, M., Stols, L., Donnelley, M.I., Collart, F.R., (in press) High throughput methods for gene cloning...
  • J. Gough et al.

    SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments

    Nucleic. Acids. Res.

    (2002)
  • Cited by (20)

    • A pathway for repair of NAD(P)H in plants

      2014, Journal of Biological Chemistry
      Citation Excerpt :

      Interestingly, in the case of the dehydratase NNRD, the estimated size (132,700 ± 664 Da) closely corresponded to a tetramer (expected size of a tetramer, 142,800 Da). NNRD homologs that have been characterized (34, 35) and for which the crystal structure has been determined classify them within the ribokinase superfamily (35). Members of this family vary in quaternary structure, and the implication of the oligomeric state observed here is further elaborated on under “Discussion.”

    • Newly-discovered enzymes that function in metabolite damage-control

      2018, Current Opinion in Chemical Biology
      Citation Excerpt :

      The Downs group began working on this family, which is conserved in all domains of life, in 1998 [45]. In a series of papers they progressively homed in on the function of YjgF as an imine deaminase that prevents inactivation of pyridoxal 5′-phosphate (PLP)-dependent enzymes by the reactive imine products of dehydratases [40,46••]. Although the function of YgjF (now called reactive imine deaminase A, RidA) is now well established, not all members of this paralogous family are involved in pre-empting metabolite damage, and their functions have yet to be elucidated [47,48].

    • Fragment and conquer: From structure to complexes to function

      2012, Structure
      Citation Excerpt :

      Sequence similarities connected YxkO to the ATP-dependent ribokinase-like superfamily. This was confirmed by the X-ray structure of the apo-enzyme, but the nature of the second substrate remained unknown (Zhang et al., 2002). Interestingly, the crystal structure revealed that the YjeF-N domain is a novel version of the Rossmann fold, but its precise function remained unsolved (Jha et al., 2008).

    • Identification of unknown protein function using metabolite cocktail screening

      2012, Structure
      Citation Excerpt :

      Enzymes from this clan catalyze the transfer of a phosphate group from ATP or ADP to a hydroxyl group of various small molecule substrates. Based on high structural similarity among members of the ribokinase clan, ATP/Mg2+ binding in PF01256 has been modeled and a catalytic role was proposed for the invariant aspartate residue (D216YxkO/D431Tm0922) (Figures 1A and 1C) (Zhang et al., 2002). However, the second substrate, and therefore the function of PF01256 proteins, remained unknown.

    • Chapter 3 High-Throughput Protein Purification for X-Ray Crystallography and NMR

      2008, Advances in Protein Chemistry and Structural Biology
    View all citing articles on Scopus

    This article has been created by the University of Chicago as Operator of Argonne National Laboratory under Contract W-31-109-ENG-38 with the US Department of Energy. The US Government’s right to retain a nonexclusive royalty-free license in and to the copyright covering this paper, for governmental purposes, is acknowledged.

    1

    Permanent address: Department of Chemistry, Wrocław, University of Technology, Poland.

    View full text