Review
Microbial genome evolution: sources of variability

https://doi.org/10.1016/S1369-5274(02)00358-2Get rights and content

Abstract

Comparative genome analyses of close relatives have yielded exciting insight into the sources of microbial genome variability with respect to gene content, gene order and evolution of genes with unknown functions. The genomes of free-living bacteria often carry phages and repetitive sequences that mediate genomic rearrangements in contrast to the small genomes of obligate host-associated bacteria. This suggests that genomic stability correlates with the genomic content of repeated sequences and movable genetic elements, and thereby with bacterial lifestyle. Genes with unknown functions present in a single species tend to be shorter than conserved, functional genes, indicating that the fraction of unique genes in microbial genomes has been overestimated.

Introduction

Differences in growth strategies and life histories of bacteria have left ‘footprints’ in the content and architecture of their genomes 1., 2•.. Over the past few years, an increasing number of closely related microbial genomes has been sequenced and compared 3••., 4••., 5., 6., 7., 8., 9., 10.. They are sufficiently similar to permit complete genome alignments, but divergent enough to allow estimates of the frequencies of mutations influencing their genomic structures 11., 12.. By quantifying these differences, we are now able to address questions related to the content and structure of microbial genomes. Can we identify common features in genomic architecture across bacterial taxa? How do genomes achieve functional flexibility? How does the ecology of a given species influence the stability of its genome?

In this review, we address some of these questions using a comparative genomics approach. First, we examine the stability of microbial genomes. Second, we analyse factors that influence the extent of genome variability, and last, we discuss possible explanations for the surprisingly high fraction of species-specific genes with unknown functions. Given the wide spectrum of taxa and lifestyles of species for which two or more complete genomes are available, we can now begin to relate genomic features to the ecology of the species.

Section snippets

Bacterial genome architecture

Gene position plots (dot plots) between bacterial genome pairs display the arrangement of genes in closely related genomes in a simple graphical format. Visual inspection of gene-position plots for bacteria with different lifestyles reveals a remarkable variation in genomic stability (Fig. 1). This provides a starting point for an analysis of the mutational and selective forces underlying the observed variability.

The most stable microbial genomes identified so far are those of aphid

Symmetric genome rearrangements

Manual inspections of the borders of rearrangements in Chlamydia spp. have shown that these are often symmetrically organised around the origin and terminus of replication [17••]. This pattern, illustrated in Fig. 1 with Chlamydia, Rickettsia and Salmonella, is thought to be the outcome of high recombination frequencies at the open replication forks 17••., 18.. Symmetric translocation and inversion events have also been identified in several other genomes, such as those of Helicobacter and

Single-gene translocations and insertion sequence elements

Insertion sequence (IS) elements may be particularly important as mediators of gene rearrangements by offering multiple similar sequences among which recombination can be initiated [22]. The Sulfolobus genomes have atypically high contents of IS elements. For example, as many as 170 IS elements have been identified in Sulfolobus solfataricus [23]. Thus, it may be no coincidence that single gene translocations have been particularly common in the genomes of Sulfolobus spp., as shown by the

Insertion–deletions and rearrangements

Bacterial genomes can achieve functional flexibility by differential gene expression 28., 29. and by variation in gene content [30], achieved by horizontal transfer [31•] or gene duplication [32•]. It is reasonable to assume that the degree of homology between a related genome pair is reciprocally related to the frequency of gene deletions and insertions in each of the lineages [30]. Surprisingly, the relative frequencies of insertions-deletions versus translocations-inversions are well

The mysterious orphans: are they real genes?

The fraction of genes with no homology to previously sequenced genes in other genera (so-called orphan genes) is normally about 25–30%. Because these orphans may represent genes that underlie the functional and metabolic diversity of a given species or its host specificity [40••], they are the primary targets for postgenomic experimental analyses. It is therefore of particular interest to understand the origin and evolution of the many unknown genes identified in the genome projects.

The genome

Conclusions

Comparative genomics has shown that bacterial genomes are highly variable, both structurally and functionally, and that the degree of genome flexibility is dependent on the content of repeated and mobile sequences such as IS elements, plasmids and phages. On one extreme, obligate intracellular symbionts have the smallest genomes and show the highest degree of stability, with a complete absence of genome rearrangements and gene innovation [3••]. At the other extreme, free-living species undergo

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

References (47)

  • S. Casjens

    The diverse and dynamic structure of bacterial genomes

    Annu Rev Genet

    (1998)
  • B. Snel et al.

    Genomes in flux: the evolution of archaeal and proteobacterial gene content

    Genome Res

    (2002)
  • I. Tamas et al.

    50 million years of genomic stasis in endosymbiotic bacteria

    Science

    (2002)
  • N.T. Perna et al.

    Genome sequence of enterohaemorrhagic Escherichia coli O157:H7

    Nature

    (2001)
  • R.A. Alm et al.

    Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori

    Nature

    (1999)
  • S.T. Cole et al.

    Massive gene decay in the leprosy bacillus

    Nature

    (2001)
  • P. Glaser et al.

    Comparative genomics of Listeria species

    Science

    (2001)
  • D.L. Maeder et al.

    Divergence of the hyperthermophilic archaea Pyrococcus furiosus and P. horikoshii inferred from complete genomic sequences

    Genetics

    (1999)
  • T.D. Read et al.

    Genome sequences of Chlamydia trachomatis MoPn and Chlamydia pneumoniae AR39

    Nucleic Acids Res

    (2000)
  • H. Takami et al.

    Complete genome sequence of the alkaliphilic bacterium Bacillus halodurans and genomic sequence comparison with Bacillus subtilis

    Nucleic Acids Res

    (2000)
  • C.M. Fraser et al.

    Comparative genomics and understanding of microbial biology

    Emerg Infect Dis

    (2000)
  • U. Bergthorsson et al.

    Distribution of chromosome length variation in natural isolates of Escherichia coli

    Mol Biol Evol

    (1998)
  • S.L. Liu et al.

    Highly plastic chromosomal organization in Salmonella typhi

    Proc Natl Acad Sci USA

    (1996)
  • Cited by (91)

    • Functional characterization of prokaryotic dark matter: the road so far and what lies ahead

      2022, Current Research in Microbial Sciences
      Citation Excerpt :

      We speculate that this decrease is a product of the representativeness of the total number of sequences from Bacteria in public databases; and consequently, the improvement and fine-tuning of annotation pipelines aimed particularly at this domain. Prokaryotic genes of unknown function were initially considered to be mere “junk” elements, pseudogenes, or misannotations (Andersson and Andersson, 2001; Mira, 2002; Schmid and Aquadro, 2001) owing to the narrow understanding at the time of the functional sequence space (Lobb et al., 2015). But increasing evidence indicates that yet-unclassified elements do encode for specific functions (Hu et al., 2009).

    • Mutualism meltdown in insects: Bacteria constrain thermal adaptation

      2012, Current Opinion in Microbiology
      Citation Excerpt :

      This reduction in Ne is thought to accelerate the fixation of slightly deleterious mutations [23]. In addition, while host-dependent microbes include species with high recombination rates, obligate mutualists of insects show reduced recombination (even complete asexuality) that may reflect the loss of recombination genes, mobile DNA, and opportunities for gene exchange due to their host-associated lifestyle [24]. This lack of recombination may exacerbate effects of genetic drift under reduced Ne.

    • Insertion sequence elements in Cupriavidus metallidurans CH34: Distribution and role in adaptation

      2011, Plasmid
      Citation Excerpt :

      Furthermore, transposition could lead to altered gene expression, which could be advantageous for the survival or the expression of newly acquired genetic traits under certain conditions (Hubner and Hendrickson, 1997; Lin et al., 2007). IS elements are therefore seen as a significant force in the adaptive and evolutionary response of their host, conferring genome plasticity that allows rapid adaptation to new environments (Mira et al., 2002; Schneider and Lenski, 2004). In this study, all IS elements in C. metallidurans CH34 were characterized and classified.

    View all citing articles on Scopus
    View full text