Crystal Structure and Standardized Geometric Analysis of InlJ, a Listerial Virulence Factor and Leucine-Rich Repeat Protein with a Novel Cysteine Ladder

https://doi.org/10.1016/j.jmb.2008.01.100Get rights and content

Abstract

We report on the crystal structure of the internalin domain of InlJ, a virulence-associated surface protein of Listeria monocytogenes, at 2.7-Å resolution. InlJ is a member of the internalin family of listerial cell surface proteins characterized by a common N-terminal domain. InlJ bears 15 leucine-rich repeats (LRRs), the same number as in InlA, the prototypical internalin family member. The LRRs of InlJ differ from those of other internalins by having 21, rather than 22, residues and by replacing 1 LRR-defining hydrophobic residue with a conserved cysteine. These cysteines stack to form an intramolecular ladder and regular hydrophobic interactions in consecutive repeats. Analyzing the curvature, twist, and lateral bending angles of InlJ and comparing these with several other LRR proteins, we provide a systematic geometric comparison of LRR protein structures (http://bragi2.helmholtz-hzi.de/Angulator/). These indicate that both cysteine and asparagine ladders stabilize the LRR fold, whereas substitutions in some repeat positions are more likely than others to induce changes in LRR geometry.

Introduction

Listeria monocytogenes, the causative agent of listeriosis in humans, is a facultative intracellular pathogen readily transmitted by contaminated food.1 Its ability to induce its own uptake into intestinal cells, proliferate locally, and spread throughout the host has been traced to distinct members of its substantial set of surface-exposed cell wall-associated virulence factors.2, 3

Internalins belong to a multigene family exclusive to Listeria species, distinguished by variable numbers of leucine-rich repeats (LRRs).4 Structurally, the LRR domain forms a superhelix, the overall bending and twisting geometry of which is dictated by the consensus sequence of the particular protein and by the amino acid sequence of the individual repeat.5 The availability of extended concave surfaces combined with a rigid scaffold makes LRR proteins ideal protein–protein interaction partners, a feature shared with other repeat proteins.6, 7, 8 LRR proteins are found in plants, yeast, numerous metazoans, and bacteria.9 A family of 25 LRR-containing genes in L. monocytogenes is unique among known Gram-positive bacterial genomes.4, 10, 11

Two members of the family of listerial internalins, InlA and InlB, have been particularly well studied both functionally12, 13 and structurally.14, 15, 16, 17, 18 Both are major virulence factors of L. monocytogenes, and, in particular, both are invasion proteins that enable bacterial uptake by normally non-phagocytic host cells.2, 12 The crystal structures of the functional domains of InlA,14, 17 InlB,15, 16 InlC,19 InlE,20 and InlH [Protein Data Bank (PDB) entry code 1H6U] have been published, as well as those of InlA in complex with its human receptor E-cadherin14, 17 and InlB in complex with its receptor c-Met.18

The family of 7 listerial proteins containing the typical internalin superdomain has recently been extended by the identification of another 6 members4, including InlI and InlJ, encoded by the genes inlI (lmo0333) and inlJ (lmo2821), respectively.21 All 13 members share the N-terminal “internalin” domain,16 consisting of an α-helical cap, an LRR, and an Ig-like interrepeat domain. A C-terminal LPXTG motif suggests that InlJ undergoes sortase-mediated C-terminal cleavage and covalent anchoring to the cell wall peptidoglycan.22 Between the internalin domain and the C-terminal cell surface anchor, InlJ, bears four mucin binding protein (MucBP) repeats.21

The inlJ gene is conserved in pathogenic Listeria species.21 An in-frame deletion of inlJ significantly reduces bacterial loads in the liver and spleen of wild-type mice after intravenous infection. With the use of a transgenic mouse model for food-borne listeriosis,23 oral infection with the ΔinlJ strain was shown to result in reduced bacterial levels in the intestine, mesenteric lymph nodes, liver, and spleen.21 InlJ is thus a bona fide virulence factor. However, a potential receptor has not been identified as yet.

The LRR consensus sequence of InlJ deviates from that of all other internalin family members in that each repeat comprises 21, instead of the standard 22, residues. In addition, a hydrophobic residue in one of the LRR-defining positions, immediately following the β-strand, is replaced by a cysteine in InlJ. This results in a novel subtype of the LRR motif. InlJ bears a total of 14 cysteine residues in 15 LRRs—unequalled by any other LRR protein structure. We have crystallized the functional domain of InlJ and determined its structure at 2.7-Å resolution. Furthermore, we have systematically analyzed the geometry of the InlJ LRR domain, allowing us to compare its molecular architecture with that of other listerial internalin family proteins, as well as other bacterial and eukaryotic LRR proteins.

Section snippets

Overall structure of InlJ34–508

The internalin domain of InlJ (residues 34–508, hereinafter referred to as InlJ) was produced in Escherichia coli, purified, and crystallized as described in Materials and Methods. Data for crystals with cubic symmetry (space group I23) were collected at beamline X11 (EMBL Outstation, DESY, Hamburg, Germany). The structure was solved by molecular replacement using fragments of InlA and InlB as search models and complemented manually (for details, see Materials and Methods). Difficulties in

Protein expression, purification, and crystallization

The InlJ functional domain (cap, LRR, and interrepeat domains, residues 34–508, lacking an N-terminal signal peptide and the C-terminal MucBP repeat region) was produced as an N-terminal glutathione S-transferase fusion protein using the vector pETM30 (EMBL Protein Expression and Purification Facility) in E. coli BL21 CodonPlus cells (Invitrogen). Protein production was induced by 0.1 mM IPTG in cells at an A600 of 0.8 in LB medium with 30 μg/mL of kanamycin and 34 μg/mL of chloramphenicol and

Acknowledgements

Funding by the Deutsche Forschungsgemeinschaft as part of Priority Program 1150 (SCHU 1560/1-1 and 1-2) to W.D.S is gratefully acknowledged. We thank Karl-Heinz Genther for helpful discussions on the geometry calculations and Hartmut Niemann for helpful comments on the manuscript. We also thank Santosh Panjikar and Matthew Groves (EMBL) for help with data collection and technical support, and we gratefully acknowledge beam time at beamlines X11 and BW7A (EMBL, DESY, Hamburg, Germany), BL1

References (56)

  • H. Ton-That et al.

    Protein sorting to the cell wall envelope of Gram-positive bacteria

    Biochim. Biophys. Acta

    (2004)
  • B. Kobe et al.

    The leucine-rich repeat as a protein recognition motif

    Curr. Opin. Struct. Biol.

    (2001)
  • A.G. Evdokimov et al.

    Unusual molecular architecture of the Yersinia pestis cytotoxin YopM: a leucine-rich repeat protein with the shortest repeating unit

    J. Mol. Biol.

    (2001)
  • B. Kobe et al.

    Proteins with leucine-rich repeats

    Curr. Opin. Struct. Biol.

    (1995)
  • M. Blazquez et al.

    Oxidation of sulfhydryl groups of ribonuclease inhibitor in epithelial cells is sufficient for its intracellular degradation

    J. Biol. Chem.

    (1996)
  • J.M. Fominaya et al.

    Inactivation of ribonuclease inhibitor by thiol–disulfide exchange

    J. Biol. Chem.

    (1992)
  • P.A. McEwan et al.

    Structural correlations in the family of small leucine-rich repeat proteins and proteoglycans

    J. Struct. Biol.

    (2006)
  • A.V. Kajava et al.

    Modeling of the three-dimensional structure of proteins with the typical leucine-rich repeats

    Structure

    (1995)
  • C. Chothia

    Conformation of twisted beta-pleated sheets in proteins

    J. Mol. Biol.

    (1973)
  • S. Iyer et al.

    Molecular recognition of human eosinophil-derived neurotoxin (RNase 2) by placental ribonuclease inhibitor

    J. Mol. Biol.

    (2005)
  • H. Neuvirth et al.

    ProMate: a structure based prediction program to identify the location of protein–protein binding sites

    J. Mol. Biol.

    (2004)
  • Y.H. Tsai et al.

    Listeria monocytogenes internalins are highly diverse and evolved by recombination and positive selection

    Infect. Genet. Evol.

    (2006)
  • J.A. Vazquez-Boland et al.

    Pathogenicity islands and virulence evolution in Listeria

    Microbes Infect.

    (2001)
  • Z. Otwinowski et al.

    Processing of X-ray diffraction data collected in oscillation mode

    Methods Enzymol.

    (1997)
  • P.S. Mead et al.

    Food-related illness and death in the United States

    Emerging Infect. Dis.

    (1999)
  • M. Hamon et al.

    Listeria monocytogenes: a multifaceted model

    Nat. Rev. Microbiol.

    (2006)
  • J. Pizarro-Cerda et al.

    Subversion of cellular functions by Listeria monocytogenes

    J. Pathol.

    (2006)
  • Bierne, H., Sabet, C., Personnic, N., & Cossart, P. (2007). Internalins: a complex family of leucine-rich...
  • Cited by (30)

    • A detailed in silico analysis of the amylolytic family GH126 and its possible relatedness to family GH76

      2020, Carbohydrate Research
      Citation Excerpt :

      It seems thus very probable that members classified in the family GH126 currently do not contain in their sequences any CBMs, including none of the 15 CBM families classified as starch- and/or glycogen-binding domains [27]. Concerning the three sequences of hypothetical GH126 proteins used in the present study that possess some N-terminal extensions, i.e. those from Clostridium butyricum (GenBank accession No. APF21752.1; residues 1–146), Lactobacillus brevis (AYM02277.1; residues 1–947) and Lactobacillus paraplantarum (ALO03904.1; residues 1–236), the observations could be summarised as follows: (i) according to Conserved Domain Database [53], the ~150-residue-long N-terminal segment of C. butyricum protein represents a domain adopting a thioredoxin-like fold present, e.g., in the human NHL repeat-containing protein 2 (PDB code: 6GC1; [54]), the mouse selenocysteine-dependent iodothyronine deiodinase (PDB: 4TR3; [55]) and also an electron transporter Rv2874 from Mycobacterium tuberculosis (PDB: 2HYX; [56]); (ii) within the ~950 residues at the N-terminus of the L. brevis protein, the search in the Conserved Domain Database [53] has indicated several (up to nine) motifs of the so-called leucine-rich repeat structurally best resembling those found in various leucine-rich repeat receptor kinases from Arabidopsis thaliana, such as ZAR1 (PDB: 6J5T; [57]), PEPR1 (PDB: 5GR8; [58]) and GSO1 (PDB: 6S6Q [59]); and (iii) none putative conserved domains have been detected in the Conserved Domain Database [53] for the ~235 residue-long region at the N-terminus of the L. paraplantarum protein, although around one hundred residues from that N-terminal extension could contain a homologue of various Listeria monocytogenes virulence factors called internalins (e.g., PDB codes: 4L3A [60], 3BZ5 [61] and 1XEU [62]). Since in the family GH126, there is still a real lack of relevant information concerning the exact enzyme specificity of its members – the CPF_2247 protein has been only partially characterized as an “amylase” [17] – the attention was therefore focused on structural comparison.

    • Structure-Guided Exploration of SDS22 Interactions with Protein Phosphatase PP1 and the Splicing Factor BCLAF1

      2019, Structure
      Citation Excerpt :

      The twelve β strands form a parallel β sheet that defines the concave surface of the structure, whereas the eleven variable fragments define the convex surface (Figure 1B). The curvature of the β sheet is similar to that of the internalins, a family of LRR proteins from Listeria (Bublitz et al., 2008). The LRR superstructure is stabilized by an extensive hydrophobic core comprising conserved, closely packed (iso)leucines; it is also stabilized by inter-repeat backbone-backbone and backbone-side chain (i.e., the asparagine ladder) hydrogen bonds (Figure 1B).

    • Structure of internalin InlK from the human pathogen listeria monocytogenes

      2013, Journal of Molecular Biology
      Citation Excerpt :

      Instead, the homologous residues located internally on the loops that follow β1, β2 and β3 are hydrophobic (Val91, Ile111, and Val135), and add to the strongly nonpolar character of the pocket. This finding is reminiscent of the results of Bublitz et al., who identified that the LRR could be internally stabilized by a non-Asn residue ladder [15]. In the structures of other internalin family members, the LRR regions are followed by an Ig-like segment that protrudes from the LRR but is stably associated to it and is often referred to as the inter-repeat region [20].

    • Identification of a unique TLR2-interacting peptide motif in a microbial leucine-rich repeat protein

      2012, Biochemical and Biophysical Research Communications
      Citation Excerpt :

      BspA comprises a horse-shoe shaped leucine-rich domain [17], which is formed by the 23 tandem repeats of a leucine-rich repeat (LRR) motif in the N-terminal portion, and four bacterial immunoglobulin-like domains in the C-terminal portion [13,18]. The LRR motif in BspA belongs to a cysteine-containing subtype, which might allow the LRR domain to adapt a cysteine-ladder conformation reported in other cysteine containing LRR proteins [19]. We have previously shown that BspA activates TLR2/1 receptor heterodimer and that the LRR region comprising LRR repeats 1–16 is involved in direct binding to TLR2 [14].

    • Quantitative structural analysis of importin-β flexibility: Paradigm for solenoid protein structures

      2010, Structure
      Citation Excerpt :

      The orientation of each repeat was defined with respect to the preceding repeat using three angles: curvature, twist, and lateral bending. In the LRR study, each repeat was represented by a triangle based on three chosen Cα positions (Bublitz et al., 2008). To increase the general applicability to different repeats and the ability to automate the calculations, we instead defined three principal axes for each repeat based on the Cα positions of all residues in a repeat, and calculated the interrepeat angles based on the principal axes of two adjacent repeats (Figure 7).

    View all citing articles on Scopus
    View full text