Abstract
DNA cytosine methylation regulates gene expression in mammals. In bacteria, its role in gene expression and genome architecture is less understood. Here we perform high-throughput sequencing of bisulfite-treated genomic DNA from Escherichia coli K12 to describe, for the first time, the extent of cytosine methylation of bacterial DNA at single-base resolution. Whereas most target sites (CmCWGG) are fully methylated in stationary phase cells, many sites with an extended CCmCWGG motif are only partially methylated in exponentially growing cells. We speculate that these partially methylated sites may be selected, as these are slightly correlated with the risk of spontaneous, non-synonymous conversion of methylated cytosines to thymines. Microarray analysis in a cytosine methylation-deficient mutant of E. coli shows increased expression of the stress response sigma factor RpoS and many of its targets in stationary phase. Thus, DNA cytosine methylation is a regulator of stationary phase gene expression in E. coli.
Similar content being viewed by others
Introduction
DNA methylation at cytosines is an epigenetic mark that regulates gene expression and consequently major processes such as stem-cell development and cancer progression in higher eukaryotes1,2,3. In bacteria, methylation is best studied as part of restriction-modification (R-M) systems that have long been considered as contributing to bacterial immune response against incoming DNA including phages4. However, recent research has suggested that R-M systems may be 'selfish' elements that protect themselves against removal, through post-segregational killing of newly born progeny by pre-existing and stable restriction enzyme molecules5,6,7,8,9,10,11.
In addition to being R-M components, methyltransferases are solitary (or orphan) enzymes that methylate DNA at specific sites, but are not associated with any restriction enzyme12. The best-studied solitary methyltransferase in E. coli is DNA adenine methyltransferase, which targets the GmATC motif13. This methyl mark has important roles in regulating on-off gene expression by influencing the binding properties of transcription factors to DNA sites containing the methylated motif14,15; it also influences replication by regulating the recruitment of the initiator of replication to the origin of replication16,17 and regulates DNA repair by informing the methyl-directed mismatch repair system about the age of the DNA strand18,19,20. Another well-studied adenine methyltransferase is CcrM in Caulobacter crescentus, which methylates the GANTC site and acts as a cell cycle regulator21. A second DNA methyl transferase (Dcm) in E. coli methylates the internal cytosine in the CCWGG motif. Dcm is homologous to the methyltransferase that is part of the plasmid-encoded EcoRII R-M system, which also targets the CCWGG site22,23. Dcm confers resistance against parasitism by the EcoRII R-M system. This is because, in the presence of a functional Dcm, EcoRII-target sites will be kept methylated even when the R-M system is disturbed, thus preventing post-segregational killing24. To our knowledge, this, an association between Dcm and Tn3 transposition25, and lambda DNA recombination26, and a very recently discovered role in regulating ribosomal gene expression in stationary phase27, remain the only phenotypes associated with Dcm.
Despite having been identified in the 1970s28 and cloned in the 1980s29, Dcm remains poorly studied. Low-resolution genome-scale studies have shown that not all target sites of Dcm (and DNA adenine methyltransferase) are methylated in-vivo30, confirming previous work by Bhagwat and colleagues31. However, the low resolution of the above studies precluded careful genome-scale analysis of partially methylated and unmethylated sites. Though regulation of gene expression by cytosine methylation has been described for mammals3, such results in bacteria have been limited largely to adenine methylation with little information on its cytosine counterpart.
Here, we present the first single-base-resolution genome-scale study of cytosine methylation in E. coli. We first perform high-throughput sequencing of bisulfite-treated genomic DNA32 from E. coli K12, thus quantifying the extent of cytosine methylation at single-base resolution across various phases of batch culture. We unveil patterns of methylation that might be consistent with reducing the risk of non-synonymous mC->T transitions. We also perform microarray analysis of gene expression in a Δdcm strain of E. coli, thus identifying a role for methylation in controlling gene expression levels during stationary phase.
Results
Occurrence of the CCWGG motif
The E. coli K12 MG1655 genome contains 24,090 instances of the CCWGG motif (on either strand; 12,045 on each strand), which is targeted by the Dcm DNA cytosine methyltransferase. This is ~27% more than the expected by random chance, given the genome's overall A/T content33. This is difficult to interpret; however, this 27% increase over expectation is less dramatic than in organisms such as Pseudomonas aeruginosa, where the occurrence of CCWGG motif is more than double the expected value (http://tools.neb.com/~vincze/cutfreq/CCWGG.html). Unlike the GATC motif in which the adenine is methylated34,35, the occurrence of CCWGG motif is not strongly constrained or particularly over-represented in genes or regulatory genes: ~93% of these motifs are located within genes, which is consistent with the genomic average considering the relative sizes and difference in A/T content between genes and intergenic regions. The number of CCWGG motifs in a gene correlates with its length (ρ=0.75 Pearson Correlation; Fig. 1a). The frequency of occurrence of the CCWGG motif per kilobase of sequence follows a Poisson distribution, with an average of 2.6 occurrences of the motif on each strand; this is as expected by random chance, given the total number of CCWGG motifs in the entire genome (Fig. 1b,c).
Bisulfite sequencing of E. coli genomic DNA
To identify cytosines that are methylated in E. coli K12 MG1655, we first isolated genomic DNA from cultures grown in batch to mid-exponential, late-exponential, transition-to-stationary and stationary phases. We then treated the DNA with a commercial bisulfite reagent that converts unmethylated cytosines to uracil, but leaves methylated residues unaffected, and produces single-stranded DNA. Following synthesis of the second strand of DNA, we carried out multiplexed DNA sequencing on an Illumina GA IIx system producing several million 36-mer reads (Supplementary Table S1). Our data provide a ~20-fold coverage of the E. coli genome per sample. Using Bismark software36, we mapped fully bisulfite-converted forward and reverse read (C->T and G->A conversions of the forward strand) to similarly converted versions of the reference genome; the best of the four possible alignments was compared with the normal reference sequence to enable calls on the methylation state of all cytosine positions in the read. To our knowledge, this is the first genome-wide base-resolution study of cytosine methylation in any bacterium.
To assess the quality of our methylation calls, we performed similar experiments on an otherwise isogenic Δdcm mutant strain. The bisulfite reagent used has a 99% conversion rate; given this and the assumption that dcm is the only gene encoding a DNA cytosine methyltransferase in E. coli, we should expect ~1% of cytosines in the Δdcm experiment to be called methylated. Across the four Δdcm samples, <1.5% (range: 1.33–1.57%) of cytosines are called as methylated, showing that our data are reliable.
Every cytosine is covered by multiple reads each with its own methylation call. This allows us to calculate, for each cytosine, the fraction of mapped reads determined to be methylated, which we define as the 'extent of methylation' ratio (Rmet). Using our stationary phase sample, where the population is least dynamic in terms of DNA replication, we first identified cytosines where every mapped read is defined as methylated (Rmet=1). We then used the sequences surrounding the methylated cytosine and identified sequence motifs, which recovered the known target site for Dcm (CCWGG, where W stands for A or T; Fig. 2a)12. In stationary phase cells, almost all reads mapping to CCWGG sites were called methylated (mean Rmet=0.98). On the other hand, only sporadic cases of methylated reads mapping to non-CCWGG sites were observed (mean Rmet=0.02; Supplementary Fig. S1). A similar difference is observed for Rmet values at CCWGG motifs between the wildtype and Δdcm (Fig. 2b).
In summary, these results reassert that methylation occurs exclusively at CCWGG; show that during stationary phase methylation of CCWGG sites is nearly complete; and demonstrate that Dcm is the only cytosine methyltransferase in the E. coli strain used here (based on data obtained from the Δdcm strain).
Cytosine methylation and gene expression in E. coli
Cytosine methylation is a key regulator of gene expression in mammals, with major influence on processes such as development and cancer progression. In bacteria, however, DNA adenine methylation has been recognized as a regulator of gene expression, primarily by influencing the activities of transcription factors whose effects are correlated with the target's methylation status14,15. However, until very recently27 when cytosine methylation was shown to regulate ribosomal gene expression in stationary phase, no role for cytosine methylation in gene expression control had been documented for bacteria, at least for enterobacteria.
Despite the widespread and apparently 'random' occurrence of the CCWGG motif, a recent study has shown that where present in intergenic regions, several CCWGG sites are clustered together27, thus leading to immediate speculation on the role of methylation in gene regulation. However, our analysis of the occurrence of the CCWGG motif across 2,558 experimentally verified transcription factor-binding sites in the RegulonDB database37 showed that this motif occurs only 69 times in these sequences, which is not significantly different from the trend in the rest of the genome. Further, only two of these transcription factor-binding sequences contained more than one CCWGG motif. This is unlike results obtained with the GATC motif, in which the adenine is methylated, and which is over-represented in CRP-binding loci34. Thus, analysis of the genome sequence alone provides little statistical evidence that cytosine methylation influences gene expression on a global scale by interfering with transcription factor-DNA interactions.
To experimentally investigate the role of cytosine methylation in gene expression in E. coli on a genomic scale, we performed microarray analysis of gene expression in the wild-type and Δdcm strains of E. coli K12 MG1655 (Fig. 3a), thus identifying genes that are differentially expressed in Δdcm during mid-exponential, late-exponential, transition-to-stationary and stationary phases of batch growth. As there is little cytosine methylation in Δdcm, gene expression changes identified here are directly or indirectly caused by the absence of cytosine methylation.
Across the four conditions tested, 510 genes are differentially expressed in Δdcm when compared with the wildtype; of these, 362 are only upregulated, 137 downregulated and the remaining few are upregulated in one condition and downregulated in another. There is no global statistical association between the occurrence of a CCWGG motif in gene-upstream regions or the gene body and differential expression (with differentially expressed and other genes showing similar densities of CCWGG motifs of 2.7–2.9 motifs per kb of gene sequence); nevertheless, it must be noted that this does not rule out any direct role of CCWGG motifs in controlling proximal gene expression at specific loci. Finally, largely distinct sets of genes are differentially expressed in different time points (Fig. 3b). Here, we describe selected trends observed in our analysis of these microarray data.
Effect of Δdcm on stationary phase gene expression
Of the 362 genes that are upregulated (in Δdcm when compared to the wildtype) in at least one of the four conditions tested, 222 (61%) do so in stationary phase. In contrast to other phases of growth where comparable numbers of genes are up- or downregulated, very few (three) genes are downregulated in stationary phase (Fig. 3b). In agreement with a recent work27, 15 genes involved in translation and ribosomal biogenesis (as defined by the COG database) are upregulated in Δdcm. Further analysis of the wild-type expression levels of genes that are upregulated in stationary phase showed the following: these are expressed at higher levels than other genes during stationary phase in wildtype cells (P<10−6; Wilcoxon test; Fig. 3c); show higher fold change in expression levels between stationary and exponential phases in the wild-type strain (P<10−6; Wilcoxon test; Fig. 3d); and are targets of RpoS (the stationary phase sigma factor) identified by Weber and colleagues38 (37 genes; P=9.1×10−6; Fisher test compared with genes upregulated in other phases of growth). Note that difficulties in normalization of microarray data between exponential and stationary phase (as previously discussed by others39) made us use data obtained from RNA-seq experiments40 to perform the calculations on wild-type gene expression described above; though similar issues may be valid even for RNA-seq data, these should not substantially alter our qualitative conclusions.
In summary, in Δdcm, genes that seem to be already stationary phase-specific in wild-type cells are further upregulated. We observe that several members of the RNA polymerase complex are upregulated in Δdcm during stationary phase; this is particularly true for rpoS, which we were able to validate using RT–PCR and western blots (Table 1; Supplementary Fig. S2). We suspect that much of the general transcriptional upregulation observed in stationary phase Δdcm cells is an indirect consequence of the upregulation of RpoS. However, this does not affect the culture's long-term viability during stationary phase in batch culture (Supplementary Fig. S3).
It is notable that there are five CCWGG motifs present in the rpoS promoter and gene body. The first is ~95 bp upstream of the transcription start site (TSS), the second ~140 bp downstream of the TSS and the third ~47 bp upstream of the ORF; the other two are further downstream in the ORF. Of these, there is a dramatic increase in Rmet value from 0.2 to 1.0 from exponential to stationary phase for one motif, present ~370 bp downstream of the ORF start site. There is a weaker increase, from 0.83 to 1.0 and 0.76 to 1.0, respectively, for the first two sites above that are closest to the TSS. Whether any of these have a direct role in regulating rpoS expression by interfering with transcription factor binding is not known and remains to be investigated; however, none of these sites overlap with known CRP or ArcA-binding sites, with the first site being closest, lying ~25 bp upstream of a CRP-binding site.
In wild-type cells, 256 genes are differentially expressed between mid- and late-exponential phase populations; in Δdcm, there is an ~1.9-fold increase in the number of differentially expressed genes (488) between the two phases. Two-thirds of the 256 genes with variable expression in the wildtype are included in the considerably larger list of those that change in expression in Δdcm. Among the genes differentially expressed only in Δdcm are 45 genes involved in ribosomal biogenesis and translation (P<10−6; Fisher's exact test), most of which are upregulated in late-exponential phase. This, we suggest, precedes the enhanced gene expression of ribosomal genes observable in stationary phase27.
Upregulation of lamB in Δdcm during mid-exponential phase
During mid-exponential phase, 85 genes are upregulated in Δdcm when compared to the wildtype. Among these is lamB, which displays a 3-fold upregulation in our microarray data (>4.5-fold upregulation in RT–PCR validation, ΔΔ Ct=2.2), along with malK, another gene encoded in the same operon (Table 1). LamB is a maltoporin, whose functions include being a receptor for lambda phage41. The presence of DNA cytosine methylation might have a role in minimizing the possibility of phage infection in mid-exponential phase E. coli populations where rapid growth is a trigger for lytic explosions.
Partial methylation of CCWGG sites in mid-exponential phase
To assess the differences between different stages of batch growth in the extent of methylation of cytosines, we used our bisulfite-sequencing data to calculate Rmet for the internal cytosine in each CCWGG motif with at least five mapped reads (~89% of all CCWGG motifs across the four conditions combined). Across the four time-points, an average Rmet of 0.91 (median=1.00) was observed; however, the value increased from 0.80 (median=0.85) for mid-exponential phase sample to 0.98 (median=1.00) for the stationary phase sample in a progressive fashion (Figs 4a and 5a). This shows that there are many Dcm-target sites in the genome, which are methylated only in a fraction of the population of mid-exponential phase E. coli DNA.
To test the reliability of this observation, we also performed methylated-DNA immunoprecipitation (MeDIP)-seq experiments42, in which immunoprecipitated fragments of the chromosome containing methylated cytosines were subjected to high-throughput sequencing (Supplementary Fig. S4 and Supplementary Table S2). We find that regions determined by our bisulfite-sequencing experiment to be partially methylated in mid-exponential phase have weaker MeDIP signals than those that are fully methylated (Fig. 4b; P<10−6, Wilcoxon test comparing the MeDIP signals for sites with Rmet<=0.3 and those with Rmet>=0.7). This is despite the considerably poorer resolution of the MeDIP approach and the fact that signals are dependent on the clustering of methylated cytosines.
A trivial explanation for the reduced level of methylation in the exponential phase sample is the presence of newly synthesized DNA, which is yet to be methylated. In rapidly dividing E. coli cells, multiple rounds of replication get initiated between any pair of adjacent division events. Consequently, such cells will have more copies of the region of the chromosome around the origin of replication than near the terminus. This is observed in our data, which, without considering methylation calls, is equivalent to that from a quantitative genome-sequencing experiment (Supplementary Fig. S5). In the exponential phase sample, more reads map to the region around the origin than around the terminus; this, however, is not the case in the stationary phase sample. However, sites of partial methylation are not substantially concentrated around the origin or regions of the chromosome with new DNA to the extent detectable in non-synchronized mixed populations (Fig. 5a; Supplementary Fig. S6). In fact, partially methylated sites are distributed across the chromosome. This implies the existence of non-trivial explanations for the occurrence of partially methylated CCWGG sites in the genome. This is further substantiated by the observation that two independent biological samples, one processed by bisulfite sequencing and the other by MeDIP-seq, give consistent results.
To identify specific signals associated with partial methylation, we performed a motif search using sequences surrounding CCWGG sites with Rmet≤0.2 in mid-exponential phase43. We observe an extended motif CC(m)CWGG at these sites (Fig. 5b). We then compared the Rmet values for all CCCWGG and [ATG]CCWGG motifs in the genome and found that the extent of methylation is significantly less in the former (mean Rmet=0.72) than in the latter (mean Rmet=0.83; P<10−6, Wilcoxon test; Fig. 5c).
Finally, we analysed the positioning of partially methylated cytosines to speculate on the possibility that these sites may be selected. We based this analysis on the foundation that methylated cytosines are mutation hotspots because of their tendency to get deaminated to form thymines44. Though E. coli encodes a very short-patch repair (VSR) system to correct such mutations45,46,47, it is expressed and active only during stationary phase. We classified all CCWGG cytosines located within genes into those that, if mutated in a methylation-dependent fashion, would lead to synonymous or non-synonymous changes. This classification was based purely on their ability to change the amino acid (including non-sense mutations that introduce a stop codon) encoded by the respective codon, and did not include other subtle effects such as changing mRNA stability or influence on protein–DNA interactions. First, we find that the 'methylatable' cytosine in a significantly higher proportion of CCCWGG motifs (~60%) is located at non-synonymous sites, when compared to [ATG]CCWGG sites (~54%; P<10−6; Fisher exact test). Next, using our bisulfite-sequencing data, we tested the association between the extent of methylation (in mid-exponential phase) and the tendency of a particular cytosine to be positioned in synonymous or non-synonymous locations. We find a statistically significant difference between partially and fully methylated cytosines in their tendency to be located at synonymous or non-synonymous positions. Whereas ~60% of CCWGG cytosines with Rmet≤0.5 (67% for Rmet≤0.2) are located at non-synonymous sites, only ~51% of those with Rmet≥0.9 are so located (P=4×10−6; Fisher exact test). In general, there is a slight, yet statistically significant, difference between the distributions of Rmet values for synonymous and non-synonymous sites (P=2×10−5; Kolmogorov–Smirnov test; Fig. 5d,e; Supplementary Fig. S7). In summary, we speculate that during exponential phase when multiple rounds of replication occur in the absence of the VSR repair system, cytosine methylation by Dcm is probably slowed down, particularly at non-synonymous sites, potentially as a consequence of the presence of the extended CCCWGG motif. This might be consistent with the hypothesis that partial methylation might reduce the risk of mutation; however, the strength of the signals observed here suggests that it is likely to be a relatively minor effector.
Discussion
In this study, we performed high-throughput sequencing of bisulfite-treated genomic DNA from E. coli K12 grown to mid-exponential, late-exponential, transition-to-stationary and stationary phases of growth, to identify its methylation status. Whereas most CCWGG sites are fully methylated in stationary phase cells, there is a progressive increase in the extent of methylation from exponential to stationary phase. Partial methylation in mid-exponential phase cells is associated with the presence of the extended CCCWGG motif, and has a slight tendency to be localized to non-synonymous sites. Methylated cytosine is a mutational hotspot that can spontaneously convert to thymine at a rate of 5.8×10−13 per second48; the rate is three orders of magnitude higher in single-stranded DNA, a state that would occur during replication and transcription. Though E. coli has a VSR system to reduce such mutations, it is not expressed during exponential phase by a post-transcriptional mechanism45,46,47. We speculate here that sites for partial methylation are selected to reduce the slight risk of potentially deleterious non-synonymous mutations; this happens especially in exponential phase when numerous new progeny are created and the VSR system is not expressed. The possible products of methylation-dependent mutations, namely the C[C/T]W[A/G]G sites, are severely under-represented (at ~25% of the expected levels) in the E. coli genome. It has been argued that VSR replacing normal CTWGG sites to CCWGG might have caused depletion of the former and the enrichment of the latter49. An alternative argument is that this might be a mechanism to ensure that the VSR system can efficiently detect and correct mutant sites with minimal confusion arising from 'normal' C[C/T]W[G/A]G sites; there might be an interplay of both forces in determining the final oligonucleotide usage in the E. coli genome. However, several other organisms might, in fact, make use of cytosine methylation to generate genotypic diversity during stationary phase; for example, Vibrio cholerae encodes a Dcm (targeting the external cytosine in RCCGGY motif), but not a VSR repair system50. Analysis of the genome of V. cholerae shows that a striking 97% of 'methylatable' cytosines are at non-synonymous positions, suggesting a strong preference for having methyl-dependent mutations at non-synonymous sites (3,964 of 4,074 sites on either strand of both chromosomes).
Contrary to the situation in mammals, a role for methylation in the control of gene expression in bacteria has been limited to N6-adenine methylation. Using microarrays, we show upregulation of genes in Δdcm—in comparison to the wildtype—during stationary phase. Much of this does not represent any reorganization of the transcriptional state, but instead increased expression of genes that are already stationary-phase specific. This might be caused by the increased expression of RpoS, which we observe in our microarrays and validate using RT–PCR and western blots. Thus, our study includes Dcm as a possible regulator of rpoS expression. This is in addition to the recently discovered role of Dcm in controlling ribosomal gene expression during stationary phase27; we suggest that the expression of these genes begins to increase in late-exponential phase.
Does the increased stationary phase gene expression have any consequence to E. coli fitness under laboratory conditions? No difference can be observed between the wildtype and Δdcm strains in their long-term stationary phase survival during batch culture in Luria–Bertani medium. However, this does not rule out specific fitness defects in Δdcm in complex natural environments, where E. coli might be spending much of its time in stationary phase, with an estimated doubling time of 40 h51.
In addition, we reveal increased expression of lamB, which encodes a maltoporin that is also the receptor for lambda phage, during mid-exponential phase in Δdcm. Thus, Dcm might have a role in reducing susceptibility of E. coli to lytic infections by lambda phage. In fact, it has been previously shown that unmethylated lambda DNA is more recombinogenic than methylated DNA, thus establishing a link, albeit at a level different to that implied by our gene expression analysis26, between cytosine methylation and phage infection.
The mechanism by which Dcm/cytosine methylation affects gene expression in E. coli is not known. It is not known whether there are any DNA-binding proteins in E. coli with transcriptional regulatory roles whose activities are dependent on the methylation state of the substrate DNA. However, cytosine methylation also alters the mechanical properties of DNA52, and this might have profound consequences on transcription.
In summary, we suggest that cytosine methylation might have a role in determining aspects of a bacterium's genome structure and may also be a regulator of transcription, especially during the stationary phase.
Methods
Strains and general growth conditions
The E. coli K12 MG1655 bacterial strains used in this work are the wildtype (F- lambda- ilvG- rfb-50 rph-1) and Δdcm (Fig. 3a). Luria–Bertani (0.5% NaCl) broth and agar (15 g l−1) were used for routine growth. Where needed, ampicillin, kanamycin and chloramphenicol were used at final concentrations of 100, 30 and 30 μg ml−1, respectively.
Construction of E. coli MG1655 knock-outs
Disruption of dcm in the E. coli chromosome was achieved by the λ Red recombination system, previously described by Datsenko and Wanner53. Primers designed for this purpose are P4: 5′-TTAACCTGTCGGCCATCTCAGATGGCCGGTGAAATCTATGATTCCGGGGATCCGTCGACC-3′ and P1: 5′-ATAGGCCTGAGTGTCCGAAACCGGAATACGGAATTTCGCTGTGTAGGCTGGAGCTGCTTC-3′. Sets of additional external primers were used to verify the correct integration of the PCR fragment by homologous recombination (Forward: 5′-GTCGATCACAAGGTGATAG-3′; Reverse: 5′-GCAGCGATATTCATCAACG-3′).
RNA extraction and microarrays procedures
RNA was extracted using Trizol reagent (Invitrogen) and mirVana miRNA isolation kit (Ambion) as reported previously40. Briefly, cells pelleted by centrifugation (10,000g, 10 min, 4 °C) were stored at −80 °C until required. RNA was isolated using Trizol (Invitrogen) following the manufacturer's protocol until the chloroform extraction step. The aqueous phase was placed in columns from the mirVana miRNA isolation kit (Ambion) and washed following the manufacturer's protocol. Finally, the total RNA was eluted in 50 μl of RNase-free water. The concentration was then determined using a Nanodrop ND-1000 machine (NanoDrop Technologies), and RNA quality was tested by visualization on agarose gels and by Agilent 2100 Bioanalyser (Agilent Technologies, Palo Alto, CA, USA). For the generation of fluorescence-labelled cDNA, we used the FairPlay III Microarray Labelling Kit (Stratagene) as described previously40. Briefly, 1 μg of total RNA was subjected to cDNA synthesis in a reverse transcription reaction using random primers. The amino allyl labelled cDNA (at dUTP) thus obtained was coupled to a Cy3 dye (GE Healthcare) containing a NHS-ester leaving group. The labelled (Cy3) cDNA was hybridized to the probe DNA on microarrays by incubating at 65 °C for 16 h. The unhybridized labelled cDNA was removed and the hybridized labelled cDNA was visualized using an Agilent Microarray Scanner.
RT–PCR for validation
To validate the results of the microarray analysis, quantitative reverse transcriptase PCR was carried out using specific primers to the mRNA targets showing up- or downregulation, and control targets not showing differential expression. RNA was extracted as described above from wild-type and Δdcm cells, and 30 ng total RNA was used with the Express One-Step SYBR GreenER kit (Invitrogen) according to the manufacturer's guidelines, using a MJ Mini thermal cycler (Bio-Rad). All primers and results are listed in Supplementary Tables S3 and S4, respectively.
Bisulfite conversion
Wildtype E. coli K12 MG1655 and Δdcm cells were grown to the desired OD and the genomic DNA extracted using a Wizard Genomic DNA purification kit (Promega). The DNA was sheared to ~2 kb using a Bioruptor (Diagenode). Bisulfite conversion was performed using the EpiTect Bisulfite kit (Qiagen), following the manufacturer's guidelines on 'Sodium Bisulfite Conversion of Unmethylated Cytosines in Small Amounts of Fragmented DNA'. Single-stranded DNA resulting from the BS conversion was converted to double-stranded DNA using random primers (Invitrogen). The DNA was then further sheared to approximately 200 bp fragments, before proceeding with the library construction.
Library construction and Solexa sequencing
Prior and post library construction, the concentration of DNA was measured using the Qubit HS DNA kit (Invitrogen). Library construction was done using the ChIP-Seq Sample Prep kit with customized barcoded adapters adopted from Lefrancois et al.54, to allow multiplexing of eight samples. Samples were loaded at a concentration of 10 pM.
Public data sources
The E. coli K12 MG1655 genome and gene coordinate annotations were downloaded from the KEGG database55. A list of 140 genes regulated by RpoS were obtained from Weber et al38. RNA-seq data for stationary and mid-exponential phase40 were generated in our lab as reported previously and RPKM56 values obtained from this were used.
Analysis of genomic data
Barcoded reads obtained from Illumina Genome Analyzer IIx were first assigned to their respective samples. Subsequently, methylation calls were made using Bismark software36, which makes use of Bowtie57 for read-to-genome mapping. Sequence motifs were identified using the MEME software43.
Gene expression analyses were performed on a previously described custom-designed isothermal Agilent microarray platform, and analysed as described earlier40. Briefly, array data were background corrected using normexp58 and normalization performed using VSN59. Differential expression in the deletion strains compared with the wild-type was called at FDR-adjusted P-value of 0.05, and a fold-change of at least two. To reduce the false-positive rates, a second round of analysis was performed, in which each gene was represented by the average expression value across all probes that map to it. The above procedure for processing and calling differential expression was adopted again. Finally, genes called as differentially expressed as per both the above analysis pipelines were used and discussed in this study.
All statistical tests were carried out using R (http://r-project.org).
Additional information
Acccession codes: Bisulfite sequencing and MeDIP-seq data have been deposited in the NCBI Sequence Read Archive under accession codes SRX116370 and SRX118253, respectively. Microarray data have been deposited in Array Express under accession code E-MEXP-3569.
How to cite this article: Kahramanoglou C, et al. Genomics of DNA cytosine methylation in Escherichia coli reveals its role in stationary phase transcription. Nat. Comm. 3:886 doi: 10.1038/ncomms1878 (2012).
Accession codes
References
Suzuki, M. M. & Bird, A. DNA methylation landscapes: provocative insights from epigenomics. Nat. Rev. Genet. 9, 465–476 (2008).
Ehrlich, M. DNA methylation in cancer: too much, but also too little. Oncogene 21, 5400–5413 (2002).
Deaton, A. M. & Bird, A. CpG islands and the regulation of transcription. Genes Dev. 25, 1010–1022 (2011).
Rambach, A. & Tiollais, P. Bacteriophage lambda having EcoRI endonuclease sites only in the nonessential region of the genome. Proc. Natl Acad. Sci. USA 71, 3927–3930 (1974).
Handa, N. & Kobayashi, I. Post-segregational killing by restriction modification gene complexes: observations of individual cell deaths. Biochimie 81, 931–938 (1999).
Kobayashi, I., Nobusato, A., Kobayashi-Takahashi, N. & Uchiyama, I. Shaping the genome—restriction-modification systems as mobile genetic elements. Curr. Opin. Genet. Dev. 9, 649–656 (1999).
Kobayashi, I. Behavior of restriction-modification systems as selfish mobile elements and their impact on genome evolution. Nucleic Acids Res. 29, 3742–3756 (2001).
Naito, T., Kusano, K. & Kobayashi, I. Selfish behavior of restriction-modification systems. Science 267, 897–899 (1995).
Jeltsch, A. & Pingoud, A. Horizontal gene transfer contributes to the wide distribution and evolution of type II restriction-modification systems. J. Mol. Evol. 42, 91–96 (1996).
Kita, K. et al. Evidence of horizontal transfer of the EcoO109I restriction-modification gene to Escherichia coli chromosomal DNA. J. Bacteriol. 181, 6822–6827 (1999).
Lin, L. F., Posfai, J., Roberts, R. J. & Kong, H. Comparative genomics of the restriction-modification systems in Helicobacter pylori. Proc. Natl Acad. Sci. USA 98, 2740–2745 (2001).
Palmer, B. R. & Marinus, M. G. The dam and dcm strains of Escherichia coli—a review. Gene 143, 1–12 (1994).
Wion, D. & Casadesus, J. N6-methyl-adenine: an epigenetic signal for DNA-protein interactions. Nat. Rev. Microbiol. 4, 183–192 (2006).
Waldron, D. E., Owen, P. & Dorman, C. J. Competitive interaction of the OxyR DNA-binding protein and the Dam methylase at the antigen 43 gene regulatory region in Escherichia coli. Mol. Microbiol. 44, 509–520 (2002).
Peterson, S. N. & Reich, N. O. Competitive Lrp and Dam assembly at the pap regulatory region: implications for mechanisms of epigenetic regulation. J. Mol. Biol. 383, 92–105 (2008).
Campbell, J. L. & Kleckner, N. E. coli oriC and the dnaA gene promoter are sequestered from dam methyltransferase following the passage of the chromosomal replication fork. Cell 62, 967–979 (1990).
Boye, E. & Lobner-Olesen, A. The role of dam methyltransferase in the control of DNA replication in E. coli. Cell 62, 981–989 (1990).
Glickman, B., van den Elsen, P. & Radman, M. Induced mutagenesis in dam- mutants of Escherichia coli: a role for 6-methyladenine residues in mutation avoidance. Mol. Gen. Genet. 163, 307–312 (1978).
Glickman, B. W. Spontaneous mutagenesis in Escherichia coli strains lacking 6-methyladenine residues in their DNA: an altered mutational spectrum in dam- mutants. Mutat. Res. 61, 153–162 (1979).
Glickman, B. W. & Radman, M. Escherichia coli mutator mutants deficient in methylation-instructed DNA mismatch correction. Proc. Natl Acad. Sci. USA 77, 1063–1067 (1980).
Reisenauer, A., Kahng, L. S., McCollum, S. & Shapiro, L. Bacterial DNA methylation: a cell cycle regulator? J. Bacteriol. 181, 5135–5139 (1999).
Bigger, C. H., Murray, K. & Murray, N. E. Recognition sequence of a restriction enzyme. Nat. New Biol. 244, 7–10 (1973).
Boyer, H. W., Chow, L. T., Dugaiczyk, A., Hedgpeth, J. & Goodman, H. M. DNA substrate site for the EcoRII restriction endonuclease and modification methylase. Nat. New Biol. 244, 40–43 (1973).
Takahashi, N., Naito, Y., Handa, N. & Kobayashi, I. A DNA methyltransferase can protect the genome from postdisturbance attack by a restriction-modification gene complex. J. Bacteriol. 184, 6100–6108 (2002).
Yang, M. K., Ser, S. C. & Lee, C. H. Involvement of E. coli dcm methylase in Tn3 transposition. Proc. Natl Sci. Counc. Repub. China B 13, 276–283 (1989).
Korba, B. E. & Hays, J. B. Partially deficient methylation of cytosine in DNA at CCATGG sites stimulates genetic recombination of bacteriophage lambda. Cell 28, 531–541 (1982).
Militello, K. et al. Conservation of Dcm-mediated cytosine DNA methylation in Escherichia coli. FEMS Microbiol. Lett. 328, 78–85 (2011).
Marinus, M. G. & Morris, N. R. Isolation of deoxyribonucleic acid methylase mutants of Escherichia coli K-12. J. Bacteriol. 114, 1143–1150 (1973).
Bhagwat, A. S., Sohail, A. & Roberts, R. J. Cloning and characterization of the dcm locus of Escherichia coli K-12. J. Bacteriol. 166, 751–755 (1986).
Ringquist, S. & Smith, C. L. The Escherichia coli chromosome contains specific, unmethylated dam and dcm sites. Proc. Natl Acad. Sci. USA 89, 4539–4543 (1992).
Bhagwat, A. S., Johnson, B., Weule, K. & Roberts, R. J. Primary sequence of the EcoRII endonuclease and properties of its fusions with beta-galactosidase. J. Biol. Chem. 265, 767–773 (1990).
Bibikova, M. & Fan, J. B. Genome-wide DNA methylation profiling. Wiley Interdiscip. Rev. Syst. Biol. Med. 2, 210–223 (2010).
Gomez-Eichelmann, M. C. & Ramirez-Santos, J. Methylated cytosine at Dcm (CCATGG) sites in Escherichia coli: possible function and evolutionary implications. J. Mol. Evol. 37, 11–24 (1993).
Seshasayee, A. S. An assessment of the role of DNA adenine methyltransferase on gene expression regulation in E coli. PLoS One 2, e273 (2007).
Henaut, A., Rouxel, T., Gleizes, A., Moszer, I. & Danchin, A. Uneven distribution of GATC motifs in the Escherichia coli chromosome, its plasmids and its phages. J. Mol. Biol. 257, 574–585 (1996).
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
Gama-Castro, S. et al. RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation. Nucleic Acids Res. 36, D120–D124 (2008).
Weber, H., Polen, T., Heuveling, J., Wendisch, V. F. & Hengge, R. Genome-wide analysis of the general stress response network in Escherichia coli: sigmaS-dependent genes, promoters, and sigma factor selectivity. J. Bacteriol. 187, 1591–1603 (2005).
Radonjic, M. et al. Genome-wide analyses reveal RNA polymerase II located upstream of genes poised for rapid response upon S. cerevisiae stationary phase exit. Mol. Cell. 18, 171–183 (2005).
Kahramanoglou, C. et al. Direct and indirect effects of H-NS and Fis on global gene expression control in Escherichia coli. Nucleic Acids Res. 39, 2073–2091 (2011).
Randall-Hazelbauer, L. & Schwartz, M. Isolation of the bacteriophage lambda receptor from Escherichia coli. J. Bacteriol. 116, 1436–1446 (1973).
Palmke, N., Santacruz, D. & Walter, J. Comprehensive analysis of DNA-methylation in mammalian tissues using MeDIP-chip. Methods 53, 175–184 (2011).
Bailey, T. L. et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 37, W202–W208 (2009).
Lutsenko, E. & Bhagwat, A. S. Principal causes of hot spots for cytosine to thymine mutations at sites of cytosine methylation in growing cells. A model, its experimental support and implications. Mutat. Res. 437, 11–20 (1999).
Lieb, M. Spontaneous mutation at a 5-methylcytosine hotspot is prevented by very short patch (VSP) mismatch repair. Genetics 128, 23–27 (1991).
Bhagwat, A. S. & Lieb, M. Cooperation and competition in mismatch repair: very short-patch repair and methyl-directed mismatch repair in Escherichia coli. Mol. Microbiol. 44, 1421–1428 (2002).
Macintyre, G., Pitsikas, P. & Cupples, C. G. Growth phase-dependent regulation of Vsr endonuclease may contribute to 5-methylcytosine mutational hot spots in Escherichia coli. J. Bacteriol. 181, 4435–4436 (1999).
Shen, J. C., Rideout, W. M. 3rd. & Jones, P. A. The rate of hydrolytic deamination of 5-methylcytosine in double-stranded DNA. Nucleic Acids Res. 22, 972–976 (1994).
Bhagwat, A. S. & McClelland, M. DNA mismatch correction by very short patch repair may have altered the abundance of oligonucleotides in the E. coli genome. Nucleic Acids Res. 20, 1663–1668 (1992).
Banerjee, S. & Chowdhury, R. An orphan DNA (cytosine-5-)-methyltransferase in Vibrio cholerae. Microbiology 152, 1055–1062 (2006).
Savageau, M. Escherichia coli habitats, cell types, and molecular mechanisms of gene control. Univ. Chicago Press Am. Soc. 122, 732–744 (1983).
Severin, P. M., Zou, X., Gaub, H. E. & Schulten, K. Cytosine methylation alters DNA mechanical properties. Nucleic Acids Res. 39, 8740–8751 (2011).
Datsenko, K. A. & Wanner, B. L. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc. Natl Acad. Sci. USA 97, 6640–6645 (2000).
Lefrancois, P. et al. Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing. BMC Genomics 10, 37 (2009).
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M. & Hirakawa, M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. 38, D355–D360 (2010).
Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628 (2008).
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Ritchie, M. E. et al. A comparison of background correction methods for two-colour microarrays. Bioinformatics 23, 2700–2707 (2007).
Huber, W., von Heydebreck, A., Sultmann, H., Poustka, A. & Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 18 (Suppl. 1), S96–S104 (2002).
Acknowledgements
This work was supported by the Ramanujan Fellowship SR/S2/RJN-49/2010, Department of Science and Technology, Government of India (to A.S.N.S.); Spanish Ministry of Science and Innovation (to A.I.P); Biotechnology and Biological Sciences Research Council (BBSRC) grant 'Genomic Analysis of Regulatory Networks for Bacterial Differentiation and Multicellular Behaviour' (to G.M.F. and N.M.L.); Isaac Newton Trust (to G.M.F); European Molecular Biology Laboratory (EMBL; to N.M.L.); Council for Scientific and Industrial Research, India research fellowship 09/860(0122)/2011-EMR-I (to S.K). We thank Vardhman Rakyan for his help with protocols for bisulfite conversion.
Author information
Authors and Affiliations
Contributions
C.K. and A.S.N. conceived the project. C.K., V.B., N.M.L. and A.S.N. designed the experiments. C.K., A.I.P., S.K., B.H. and A.G. performed the experiments. A.S.N. performed the computational experiments and analysed the data. G.M.F., V.B. and N.M.L. provided advice. C.K., N.M.L. and A.S.N. wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Information
Supplementary Figures S1-S7, Supplementary Tables S1-S4 and Supplementary Methods (PDF 372 kb)
Rights and permissions
About this article
Cite this article
Kahramanoglou, C., Prieto, A., Khedkar, S. et al. Genomics of DNA cytosine methylation in Escherichia coli reveals its role in stationary phase transcription. Nat Commun 3, 886 (2012). https://doi.org/10.1038/ncomms1878
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/ncomms1878
This article is cited by
-
The DNA cytosine methylome revealed two methylation motifs in the upstream regions of genes related to morphological and physiological differentiation in Streptomyces coelicolor A(3)2 M145
Scientific Reports (2023)
-
Comparative Genomics of Pesticide-Degrading Enterococcus Symbionts of Spodoptera frugiperda (Lepidoptera: Noctuidae) Leads to the Identification of Two New Species and the Reappraisal of Insect-Associated Enterococcus Species
Microbial Ecology (2023)
-
Non-growth inhibitory doses of dimethyl sulfoxide alter gene expression and epigenetic pattern of bacteria
Applied Microbiology and Biotechnology (2023)
-
Anchor-based bisulfite sequencing determines genome-wide DNA methylation
Communications Biology (2022)
-
Detection of genetic variation and base modifications at base-pair resolution on both DNA and RNA
Communications Biology (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.