ReviewMicrobial genome evolution: sources of variability
Introduction
Differences in growth strategies and life histories of bacteria have left ‘footprints’ in the content and architecture of their genomes 1., 2•.. Over the past few years, an increasing number of closely related microbial genomes has been sequenced and compared 3••., 4••., 5., 6., 7., 8., 9., 10.. They are sufficiently similar to permit complete genome alignments, but divergent enough to allow estimates of the frequencies of mutations influencing their genomic structures 11., 12.. By quantifying these differences, we are now able to address questions related to the content and structure of microbial genomes. Can we identify common features in genomic architecture across bacterial taxa? How do genomes achieve functional flexibility? How does the ecology of a given species influence the stability of its genome?
In this review, we address some of these questions using a comparative genomics approach. First, we examine the stability of microbial genomes. Second, we analyse factors that influence the extent of genome variability, and last, we discuss possible explanations for the surprisingly high fraction of species-specific genes with unknown functions. Given the wide spectrum of taxa and lifestyles of species for which two or more complete genomes are available, we can now begin to relate genomic features to the ecology of the species.
Section snippets
Bacterial genome architecture
Gene position plots (dot plots) between bacterial genome pairs display the arrangement of genes in closely related genomes in a simple graphical format. Visual inspection of gene-position plots for bacteria with different lifestyles reveals a remarkable variation in genomic stability (Fig. 1). This provides a starting point for an analysis of the mutational and selective forces underlying the observed variability.
The most stable microbial genomes identified so far are those of aphid
Symmetric genome rearrangements
Manual inspections of the borders of rearrangements in Chlamydia spp. have shown that these are often symmetrically organised around the origin and terminus of replication [17••]. This pattern, illustrated in Fig. 1 with Chlamydia, Rickettsia and Salmonella, is thought to be the outcome of high recombination frequencies at the open replication forks 17••., 18.. Symmetric translocation and inversion events have also been identified in several other genomes, such as those of Helicobacter and
Single-gene translocations and insertion sequence elements
Insertion sequence (IS) elements may be particularly important as mediators of gene rearrangements by offering multiple similar sequences among which recombination can be initiated [22]. The Sulfolobus genomes have atypically high contents of IS elements. For example, as many as 170 IS elements have been identified in Sulfolobus solfataricus [23]. Thus, it may be no coincidence that single gene translocations have been particularly common in the genomes of Sulfolobus spp., as shown by the
Insertion–deletions and rearrangements
Bacterial genomes can achieve functional flexibility by differential gene expression 28., 29. and by variation in gene content [30], achieved by horizontal transfer [31•] or gene duplication [32•]. It is reasonable to assume that the degree of homology between a related genome pair is reciprocally related to the frequency of gene deletions and insertions in each of the lineages [30]. Surprisingly, the relative frequencies of insertions-deletions versus translocations-inversions are well
The mysterious orphans: are they real genes?
The fraction of genes with no homology to previously sequenced genes in other genera (so-called orphan genes) is normally about 25–30%. Because these orphans may represent genes that underlie the functional and metabolic diversity of a given species or its host specificity [40••], they are the primary targets for postgenomic experimental analyses. It is therefore of particular interest to understand the origin and evolution of the many unknown genes identified in the genome projects.
The genome
Conclusions
Comparative genomics has shown that bacterial genomes are highly variable, both structurally and functionally, and that the degree of genome flexibility is dependent on the content of repeated and mobile sequences such as IS elements, plasmids and phages. On one extreme, obligate intracellular symbionts have the smallest genomes and show the highest degree of stability, with a complete absence of genome rearrangements and gene innovation [3••]. At the other extreme, free-living species undergo
References and recommended reading
Papers of particular interest, published within the annual period of review, have been highlighted as:
• of special interest
•• of outstanding interest
References (47)
- et al.
Whole genome plasticity in pathogenic bacteria
Curr Opin Microbiol
(2001) - et al.
Comparative genomics of closely related salmonellae
Trends Microbiol
(2002) - et al.
Evolution of prokaryotic gene order: genome rearrangements in closely related species
Trends Genet
(2001) - et al.
The evolution of mycobacterial pathogenicity: clues from comparative genomics
Trends Microbiol
(2001) - et al.
Mobile elements in archaeal genomes
FEMS Microbiol Lett
(2002) - et al.
Where are the pseudogenes in bacterial genomes?
Trends Microbiol
(2001) - et al.
On the total number of genes and their length distribution in complete microbial genomes
Trends Genet
(2001) Distinguising the ORFs from the ELFs: short bacterial genes and the annotation of genomes
Trends Genet
(2002)- et al.
Small RNA in Rickettsia: are they functional
Trends Genet
(2002) - et al.
Parametric genome rearrangement
Gene
(1996)
The diverse and dynamic structure of bacterial genomes
Annu Rev Genet
Genomes in flux: the evolution of archaeal and proteobacterial gene content
Genome Res
50 million years of genomic stasis in endosymbiotic bacteria
Science
Genome sequence of enterohaemorrhagic Escherichia coli O157:H7
Nature
Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori
Nature
Massive gene decay in the leprosy bacillus
Nature
Comparative genomics of Listeria species
Science
Divergence of the hyperthermophilic archaea Pyrococcus furiosus and P. horikoshii inferred from complete genomic sequences
Genetics
Genome sequences of Chlamydia trachomatis MoPn and Chlamydia pneumoniae AR39
Nucleic Acids Res
Complete genome sequence of the alkaliphilic bacterium Bacillus halodurans and genomic sequence comparison with Bacillus subtilis
Nucleic Acids Res
Comparative genomics and understanding of microbial biology
Emerg Infect Dis
Distribution of chromosome length variation in natural isolates of Escherichia coli
Mol Biol Evol
Highly plastic chromosomal organization in Salmonella typhi
Proc Natl Acad Sci USA
Cited by (91)
Bacterial diseases in fish with relation to pollution and their consequences—A Global Scenario
2022, Bacterial Fish DiseasesFunctional characterization of prokaryotic dark matter: the road so far and what lies ahead
2022, Current Research in Microbial SciencesCitation Excerpt :We speculate that this decrease is a product of the representativeness of the total number of sequences from Bacteria in public databases; and consequently, the improvement and fine-tuning of annotation pipelines aimed particularly at this domain. Prokaryotic genes of unknown function were initially considered to be mere “junk” elements, pseudogenes, or misannotations (Andersson and Andersson, 2001; Mira, 2002; Schmid and Aquadro, 2001) owing to the narrow understanding at the time of the functional sequence space (Lobb et al., 2015). But increasing evidence indicates that yet-unclassified elements do encode for specific functions (Hu et al., 2009).
Ecology and evolution of metabolic cross-feeding interactions in bacteria
2018, Natural Product ReportsMutualism meltdown in insects: Bacteria constrain thermal adaptation
2012, Current Opinion in MicrobiologyCitation Excerpt :This reduction in Ne is thought to accelerate the fixation of slightly deleterious mutations [23]. In addition, while host-dependent microbes include species with high recombination rates, obligate mutualists of insects show reduced recombination (even complete asexuality) that may reflect the loss of recombination genes, mobile DNA, and opportunities for gene exchange due to their host-associated lifestyle [24]. This lack of recombination may exacerbate effects of genetic drift under reduced Ne.
Insertion sequence elements in Cupriavidus metallidurans CH34: Distribution and role in adaptation
2011, PlasmidCitation Excerpt :Furthermore, transposition could lead to altered gene expression, which could be advantageous for the survival or the expression of newly acquired genetic traits under certain conditions (Hubner and Hendrickson, 1997; Lin et al., 2007). IS elements are therefore seen as a significant force in the adaptive and evolutionary response of their host, conferring genome plasticity that allows rapid adaptation to new environments (Mira et al., 2002; Schneider and Lenski, 2004). In this study, all IS elements in C. metallidurans CH34 were characterized and classified.
Strategies of genomic integration within insect-bacterial mutualisms
2012, Biological Bulletin