Comparative genomics of Australian and international isolates of Salmonella Typhimurium: correlation of core genome evolution with CRISPR and prophage profiles

Fu, Songzhe; Hiley, Lester; Octavia, Sophie; Tanaka, Mark M.; Sintchenko, Vitali; Lan, Ruiting

doi:10.1038/s41598-017-06079-1

Download PDF

Article
Open access
Published: 29 August 2017

Comparative genomics of Australian and international isolates of Salmonella Typhimurium: correlation of core genome evolution with CRISPR and prophage profiles

Songzhe Fu¹,
Lester Hiley ORCID: orcid.org/0000-0003-0955-7713²,
Sophie Octavia¹,
Mark M. Tanaka¹,
Vitali Sintchenko^3,4 &
…
Ruiting Lan¹

Scientific Reports volume 7, Article number: 9733 (2017) Cite this article

2231 Accesses
20 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Salmonella enterica subsp enterica serovar Typhimurium (S. Typhimurium) is a serovar with broad host range. To determine the genomic diversity of S. Typhimurium, we sequenced 39 isolates (37 Australian and 2 UK isolates) representing 14 Repeats Groups (RGs) determined primarily by clustered regularly interspaced short palindromic repeats (CRISPR). Analysis of single nucleotide polymorphisms (SNPs) among the 39 isolates yielded an average of 1,232 SNPs per isolate, ranging from 128 SNPs to 11,339 SNPs relative to the reference strain LT2. Phylogenetic analysis of the 39 isolates together with 66 publicly available genomes divided the 105 isolates into five clades and 19 lineages, with the majority of the isolates belonging to clades I and II. The composition of CRISPR profiles correlated well with the lineages, showing progressive deletion and occasional duplication of spacers. Prophage genes contributed nearly a quarter of the S. Typhimurium accessory genome. Prophage profiles were found to be correlated with lineages and CRISPR profiles. Three new variants of HP2-like P2 prophage, several new variants of P22 prophage and a plasmid-like genomic island StmGI_0323 were found. This study presents evidence of horizontal transfer from other serovars or species and provides a broader understanding of the global genomic diversity of S. Typhimurium.

Comparative genomics of Salmonella enterica serovar Enteritidis ST-11 isolated in Uruguay reveals lineages associated with particular epidemiological traits

Article Open access 27 February 2020

The phylogenomics of CRISPR-Cas system and revelation of its features in Salmonella

Article Open access 03 December 2020

Population structure and ongoing microevolution of the emerging multidrug-resistant Salmonella Typhimurium ST213

Article Open access 08 April 2024

Introduction

Salmonella enterica serovar Typhimurium is the most common Salmonella serovar causing foodborne infections in Australia and many other countries¹. The phenotypic diversity of S. Typhimurium has been traditionally illustrated by the Anderson phage typing scheme with more than 200 phage types defined². Since then, different molecular markers were used to assess its genetic diversity^{3, 4}. S. Typhimurium is divided by multilocus sequence typing into more than 30 sequence types (STs). ST19 is the most prevalent ST internationally and the majority of the STs belong to the ST19 clonal complex⁵. By application of whole-genome-sequencing (WGS) and genomic analysis, our earlier study of six Australian S. Typhimurium strains from different phage types and 7 published genomes revealed three genomic clusters⁶. Hayden et al.⁷ analysed the genomic diversity of 35 US S. Typhimurium isolates together with 21 public genomes⁷ and found that the 56 S. Typhimurium strains could be divided into 3 clades and at least 10 lineages.

Genome sequencing showed that variation among S. Typhimurium strains was mainly due to the accumulation of single nucleotide polymorphisms (SNPs). The S. Typhimurium genome consists of a core genome of around 3,800 genes present in all strains and nearly 1,000 accessory genes which are variably present in one or more strains^{4, 7}. The accessory genome contains mostly genes of prophages and genes of unknown function which have contributed to the genetic diversity of S. Typhimurium^{3, 4}. However, the full depth of genomic diversity of S. Typhimurium remains to be explored.

Clustered regularly interspaced short palindromic repeats (CRISPR) belong to a family of unique repeat sequences and are possibly associated with adaptive resistance against invasive genetic elements such as phages. Salmonella generally possesses two CRISPR loci, which comprise conserved direct repeats separated by unique short sequences typically 32 bp long, called spacers^{8, 9}. Variation in the spacer profiles of CRISPRs has been useful for subtyping Salmonella isolates¹⁰. Shariat et al.¹¹ investigated CRISPR array composition in four major serovars, Enteritidis, Typhimurium, Newport and Heidelberg, and demonstrated serovar specificity of CRISPR array composition¹¹. Other studies have also found that CRISPR variation is associated with serotype and sequence types, providing good phylogenetic signals for inferring strain relationships^{8, 12, 13}. Hiley et al.¹⁴ performed CRISPR and variable number tandem repeat (VNTR) analyses on a diverse collection of 200 Australian S. Typhimurium isolates and 14 reference strains which separated them into 15 Repeats Groups (RGs)¹⁴. This presented an opportunity to apply genomic analysis to selected isolates from these RGs to capture a broad range of genetic diversity of S. Typhimurium.

In this study, we sequenced 39 strains, 37 of which were of Australian origin, representing 14 different RGs and we included in the analysis 66 publicly available S. Typhimurium genomes including 54 of international origins to provide a global overview of S. Typhimurium diversity. We constructed a core genome phylogeny for all 105 strains and examined the correlation of core genome relationships with CRIPSR array composition and prophage profiles.

Results and Discussion

Selection of S. Typhimurium isolates for genome sequencing

Strains were selected to represent 14 of the 15 RGs previously defined by Hiley et al.¹⁴ (Supplementary Figures S1 and Table S1). No RG15 strains were chosen for this study as only 2 RG15 strains were found at the time of study¹⁴. RG4, RG5, RG6, RG9, RG10 and RG12 had been further subtyped into 2 to 4 sub-categories based on one or more spacer differences in CRISPR1 and/or CRISPR2 so isolates from each sub-category were chosen. In all, 37 Australian and two UK S. Typhimurium isolates were selected for genome sequencing to represent the diversity of these 14 RGs (Table 1).

Table 1 List of S. Typhimurium isolates sequenced in this study.

Full size table

An additional 66 strains, including 21 Salmonella reference collection A (SARA) strains⁴ and 45 other publicly available genomes (including 11 from Australia), were also included in the analysis (Supplementary Table S2). The CRISPR compositions of these strains were determined using genome data or by PCR sequencing. We found some strains had CRISPR1 or CRISPR2 composition similar to one RG but the other CRISPR composition was similar to another RG. We called them hybrid RGs and designated the genotype as RG CRISPR1/CRISPR2 to indicate the genotype difference in the two CRISPRs. For instance, strain SARA4 was a RG12D/10A hybrid. Genome sequencing statistics are listed in Table 2. The reads were assembled de novo. The number of contigs produced ranged from 214 to 378, with an average of 294 per genome. SNPs were discovered by mapping to the reference S. Typhimurium strain LT2 (NCBI GenBank Accession No. NC_003197). The number of SNPs ranged from 128 to 11,319 relative to the strain LT2 (Table 2).

Table 2 General features of strains sequenced in this study.

Full size table

Core and accessory genomes of S. Typhimurium

Pan genome analysis showed that there were 8,849 genes, including 3,836 core genes, and 5,013 dispensable genes in S. Typhimurium. The core genome was smaller than in our previous report (3,846 genes)⁴ and the reports of Hayden et al. (3,910 genes)⁷ and Mather et al. (3,890 genes)¹⁵. This finding indicated that the current dataset had a strain coverage broad enough to achieve a stable core genome and adding more strains to the analysis would not substantially reduce the core genome size. Interestingly, prophage genes contributed up to 13.3% (1,175/8,849) of the pan genome and 23.4% (1,175/5,013) of the accessory genome, while plasmid and other mobile elements took up 13.3% (668/5,013) of the accessory genome. For the rest of the genes in the accessory genome, the majority (2,043 genes) encoded hypothetical proteins with currently unknown function.

Genomic relationships and their correlation with Repeats Groups (RGs)

A phylogeny of the 105 strains was constructed based on SNPs derived from the S. Typhimurium core genome⁴. The strains were separated into five clades and 19 lineages (Figs 1 and S1). Clade I contained RGs 10, 11, 12, 13 and 14 while Clade II contained RGs 1, 2, 4, 7, 8 and 9. Clades III, IV and V corresponded to RG3, RG6 and RG5 respectively. Clade V consisting of four strains L1876, L1879, SARA7 and SARA8 was the most distant clade being separated by more than 4,000 SNPs from the other clades as found in a previous study⁴. Clades III and IV were much closer to Clade II. Of the 19 lineages defined, 16 lineages contained more than one isolate and 3 lineages contained only one isolate. The strains falling within each lineage nearly always belonged to a single RG or combination of one RG and a CRISPR hybrid related to that RG and for the most part all the strains belonging to an RG fell into a single lineage. Exceptions were RG12A strains which were split into two lineages (lineages 10 and 8), separated by the RG14 lineage, and lineage 13 which contained strains belonging to RG4 and RG7 even though they are distinctly different by both CRISPR and VNTR profiles¹⁴ (Fig. 1).

Genomic typing resolved the phylogenetic relationships between RGs that were not clearly evident from the CRISPR profiles. Thus RG12D was the likely precursor for both RG10 and RG13 lineages. There was a loss of spacers from the RG12D CRISPR arrays specific to each of the RG10 and RG13 lineages. Three strains SARA10, SARA14 and SARA21 that could not be assigned to any recognised RG formed a separate lineage between RG1 and RG2.

Strains with hybrid RGs were genomically clustered with other strains which had the same or similar CRISPR1 RG or CRISPR2 RG. Five strains (i.e. TN061786, CDC 2011K-0870, SARA16, SARA17 and SARA18) shared similar CRISPR1 profiles with RG1 strains but their CRISPR2 profiles were like those in RG4A (RG1/4A in Fig. 1). They fell into the same lineage as RG1. Four strains, SARA12, SARA13, DT56 and DT99, were RG11A/15 hybrids and clustered with isolates belonging to RG11 genomically. SARA4 and L1852 were RG12D/10A hybrids which clustered with isolates belonging to RG10A. Likewise, strain 798 (NC _017046) which was a RG12B/10A hybrid was also genomically clustered with RG10A. These hybrids suggest that recombination has substituted CRISPR1 or CRISPR2 sequences of the respective strains in their common ancestors.

Our study revealed a higher genomic diversity of S. Typhimurium than that reported by Hayden et al.⁷, in which 10 lineages belonging to 3 clades were found among 35 sequenced US S. Typhimurium isolates and 21 published genomes. Examination of representative isolates in each lineage from Hayden et al.⁷ showed that only RG1, RG2, RG8, RG10, RG11A/15 and RG12 genotypes were included in their study, leaving an under-representation of the genomic diversity. Although genomes belonging to RG1, RG1/4A, RG3, RG5, RG6, RG9A, RG11A/15, RG12, RG13 and RG14, were also included in previous genome-sequencing studies^{4, 6, 16, 17}, ours is the first such comprehensive study to also include RG4, RG7 and RG9B genomes. Further analysis of other published genomes did not uncover more RGs, suggesting that isolates included in this set represented the spectrum of RG diversity available to date (Table S3).

Comparison of Australian isolates and international isolates with published genomes showed that most RGs including RG1, RG2, RG5, RG6A and B, RG8, RG10A and B, RG11A and B, RG12A, B, C and D and RG14, contained isolates from Australia and other parts of the world, suggesting wide distribution of these RGs. Three RGs, RG3, RG9 and RG13, may be unique to Australia. However, more extensive sampling from other countries is required to ascertain whether these three RGs are restricted to Australia. RG8 and RG12A genotypes appear to be infrequently isolated in Australia¹⁴.

Progressive spacer deletion in the CRISPR evolution of S. Typhimurium

CRISPR arrays appear to evolve by the deletion or duplication of spacer-repeat units or by the rare acquisition of new spacers¹⁸. The divergence of the clades was associated with progressive losses of spacers (Fig. 1). For example, Clades I and II have lost sp20 and sp21 of CRISPR2 indicating an earlier divergence of Clade II from Clade III and Clade I has lost both sp25 to sp28 of CRISPR1 and sp31 to sp33 of CRISPR2 when Clades I and II diverged from their most recent common ancestor.

Generally, there were few single spacer deletions. Most deletions involved two or more spacers such as deletion of sp2 to sp32 of CRISPR2 in SARA21 that presumably occurred in one event. Many deletions were across lineages or RGs as single deletion events indicative of common ancestry rather than parallel loss (Fig. 1). There were many RG-specific spacer deletions. Sp31 of CRISPR1 and sp37 of CRISPR2 which were only present in Clade V, and sp30 in CRISPR1 which was only present in Clade IV may be more recent acquisitions.

Duplication of spacers can also now be better understood. Some duplications such as sp5, sp11 and sp21 in CRISPR1 arrays and sp14 in CRISPR2 arrays appeared to be rare events as they were only seen in single isolates¹⁴. Duplication of sp16 in CRISPR1 was only seen in RG9B and duplication of sp28 in CRISPR1 was only seen in some RG2 genotypes. However, duplication of sp15 in CRISPR2 arrays was much more widespread except for RG2, RG6, RG9A, RG12B and RG12C which have only one sp15. Therefore, this duplication may be an ancient event and has only been lost by relatively few genotypes as evolution has progressed.

Prophages found in the 105 S. Typhimurium genomes

Our previous studies have shown that much of the genomic diversity within S. Typhimurium was due to variation in prophage content⁴. This study extends that observation and presents a fuller characterisation of the S. Typhimurium prophages. Prophages in Salmonella are categorised into five groups, P27-like, P2-like, lambdoid, P22-like, and T7-like¹⁹. All except the last group were found in our S. Typhimurium strains. We also identified three new variants of HP2-like P2 prophages, several new variants of P22 prophages, including some in a novel insertion site STM0786 (LT2), a novel OLF-SE9-10012-like prophage and other outlier prophages. The prophage distribution is illustrated in Fig. 2.

Lambdoid prophages

The lambdoid prophages include Gifsy-1, Gifsy-2, Gifsy-3 and Fels-1. Gifsy-1 was previously subtyped into seven variants, Gifsy-1_LT2, Gifsy-1_DT104, Gifsy-1_DT64, Gifsy-1_SL1344, Gifsy-1_DT2, Gifsy-1_CVM23701 and Gifsy-1_DT126 ¹⁴. Gifsy-1_LT2 was present in five of the 13 RG2 strains and in four RG11A/15 strains. A variant of Gifsy-1_LT2 which has three segments different from LT2 occurred in five of the six RG6 strains. Gifsy-1_DT104 was found in RG3, RG4, RG7 and RG8. Gifsy-1_DT126 was exclusive to RG11. Gifsy-1_DT64 was only found in RG14. Gifsy-1_CVM23701 was exclusive to some RG12A strains. Gifsy-1_SL1344 was found in RG10 and one strain from RG2. Gifsy-1_DT2 was distributed widely in many RGs including RG13, RG12, RG1, RG2 and RG1/4A.

Gifsy-2 was found in all strains except SARA8 and is highly conserved with no variants as reported previously²⁰. Gifsy-3 was found in SARA6, L1873, 14028 S and VNP20009 with >99% DNA sequence similarity to each other. Fels-1 was only found in LT2.

P27-like prophages

The P27-like prophage, ST64B has 2 variants, ST64B_DT64 and ST64B_DT104. The ST64B_DT64 variant was found in all Clade I strains and not in any other clades (although a previous study by Hiley et al.¹⁴ showed that it was present in a minority of RG2A isolates in Clade II) while the ST64B_DT104 variant was only found in some strains in Clades II and IV. Comparison of these ST64B_DT104 prophage sequences in RG2, RG7 and RG6B to prototype ST64B_DT104 in DT104 showed that some strains contained ST64B_DT104 with varying degrees of coverage (range from 57.4% to 94.5%) (Supplementary Table S4).

P2 prophages, P2 variants and P4 prophage

Forty eight strains were found to contain one of the P2 prophages, Fels-2, RE-2010, PSP3, SopEφ, SP-004, 186-type, P2-Hawk and other HP2-like or variants of these. Three strains, DT24, SARA10 and T000240, had two P2 prophages in different locations. All RG10 strains had a remnant PSP3 P2 prophage.

Fels-2 was found in nine of the 13 RG2 strains as well as SARA10 (RG unassigned) and L818 (RG11). RE-2010²¹, a variant of Fels-2, was found in two different lineages RG1/4A and RG14. SopEφ, another Fels-2 variant, always in tandem with a P4 prophage, was found in all RG10 strains as well as the three RG10A hybrids. P4 is a satellite phage using P2 as a helper phage²⁰. A SopEφ variant was found in RG5A strains L1876 and SARA7 and was also inserted in the Fels-2 site. The SopEφ variant showed lower similarity to S. Typhimurium SopEφ, (Accession No. AY319521, 85%/97% coverage/identity for L1876) than to the SopEφ in S. Javiana 10721 (AOZA01000057) with 93%/98% (coverage/identity), indicating that it may have come from another serovar. A variant of SP-004 was found only in CVM23701 and DT24 while a variant of 186-type P2 was found in D23580 (BTP5) and DT24.

HP2 phage (NC_003315) was first found in Haemophilus influenzae ²², and a distantly related variant, P2-Hawk, was reported in S. Typhimurium strains in tandem with a P4 prophage¹⁷. In our study, we found 14 HP2-like prophages in various insertion sites. Six of these were identified as P2-Hawk and were all in RG13 strains.They were almost identical, differing by no more than five SNPs from each other and always in tandem with the P4 phage. A comparative analysis showed that a core genome of 10 kb was shared by HP2-like P2 prophages, but only 2.6 kb was shared by all the genomes of HP2-like, P2, Fels-2, RE-2010 and SopEΦ (Table S5). Phylogenetic analysis of the core genome of HP2-like P2 prophages showed that these prophages were divided into three variants all with considerable divergence from HP2 phage (NC_003315) (Figure S2). One variant was found in RG6B strains L1874, DT7 and DT193, inserted at the LT2 gene STM3213 which is the same insertion site as for BTP5, a coliphage 186-type P2 phage in strain D23580. This phage has also been found in strains of serovar Enteritidis and Newport with 99%/99% coverage/identity. The SARA10 HP2-like phage (RG unassigned) was also related to this variant but was closer to that found in serovar Heidelberg SL476. Strain DT97 (RG6B) had a variant which clustered with P2-Hawk but inserts at STM3213 instead of at STM2693. Another variant, which inserted at the LT2 gene STM2665 site, was found in ERR277210, T000240 and SARA19 all in RG2. P4 was present independently of P2 in RG9B, RG12D and more than half of the RG13 isolates.

P22 prophages and P22 variants

Forty six strains were found to contain a P22-like prophage or one of its variants. The P22-like prophages that are located in the STM0323 site have seven publicly available variants including P22 (NC_002371), SE-1 (DQ003260), SPN9CC (JF900176), ST160 (GU573886), ST104 (AAF75053), ST64T (AY052766) and BTP1 (D23580). We first used PHAST to assign the 46 P22-like prophages into the above categories. P22 (NC_002371) or its variant was found in SARA1, SARA17, and SARA21. There were other variants of P22 (NC_002371) in SARA4, L1861, L1867 (RG10), L1853 (RG4A), SARA10 (RG unassigned) and five strains from RG2 (SARA1, DT195, ERR277210, SARA19 and SARA20) but these were located in a novel insertion site, STM0786. SE-1-like prophages were found in SARA12 and L1850 with 100%/99% and 82%/99% coverage/identity, respectively. SPN9CC or one of its variants was found in 16 strains spread across six RGs. ST104 and one of its variants in DT97 were found in another 11 strains. ST160 was found in three strains, of which L1858 and DT99 each had a variant. Additionally, two strains had ST64T and strain D23580 had the BTP1 prophage.

We analysed the genetic diversity within P22 prophages. A phylogenetic tree of core genome sequences showed that the P22-related SPN9CC-like prophages could be divided into three variants (Figure S3). The first variant had only a few SNPs different from SPN9CC and was in RG1 strains L945 and L1866. The second variant contained a 1,213 bp deletion of the nin gene region in the position of 15268–16480 in phage SPN9CC (JF900176) and was in 13 strains from RG12, RG13 and RG10. The third variant had six unique sequences and was present only in L1864 from RG12B.

There was considerable diversity of gene content within the P22-like prophages. To better delineate the gene content variation of various P22-like prophages, we analysed the pan-genome of the 46 P22-like prophages found in this study as well as the genomes of seven publicly available P22 and its variants and obtained a 100 kb pan-genome. The pan-genome consisted of 193 DNA fragments, most of which were only shared by some prophage genomes (Supplementary Table S6). Only 11 fragments of 6.5 kb were shared by all, of which 3 kb belonged to capsid assembly genes and scaffold genes, indicating that the capsid assembly and scaffold genes can be conserved among the different P22-like branches. Based on the presence/absence of genes of the pan-genome (Supplementary Figure S4), SE-1, ST64T, ST160 and their variants were grouped into the same cluster, as were the ST104-like prophages and the SPN9CC-like prophages. P22-like prophages showed high genomic diversity as many variants are not closely related to any other P22 variants. Our pan-genome analysis also showed that some P22-like prophages were incorrectly categorised by PHAST. For example, a P22-like prophage in DT177 was initially identified as ST64T by PHAST but the pan-genome analysis showed that it was closer to SE1 with which 19 genes were shared.

We further explored the evolution of P22-like prophages by comparing their sequences with those from E. coli and other Salmonella serovars (Supplementary Table S6 and Supplementary Figure S4). The composition of these prophage genomes was mosaic with sequences from different sources, indicating frequent exchanges of DNA between subgroups of the P22-like phages as well as from E. coli and other Salmonella serovars. The majority of the fragments (137 out of 193) had high similarity to prophages in multiple serovars, while a few only had high similarity to one or two Salmonella serovars, including serovars closely related to S. Typhimurium such as S. Heidelberg, and more distantly related serovars like S. Newport and S. Paratyphi A. Twenty-four fragments had similarity to E. coli prophages, while 12 fragments were only found in S. Typhimurium prophages. Thus, there has been a considerable exchange of genetic information among diverging P22-like phage lineages, and the exchange appears to be randomly distributed throughout their genomes.

There was considerable sequence diversity for the P22-like prophages located at the STM0786 insertion site. The integrase for these prophages had 68% identity with that from Enterobacteria phage HK106 NC_019768. The P22 variant in SARA10 had 66% coverage/99% identity with P22 (NC_002371) but 99% coverage/99% identity with Paratyphi B ATCC 8759 (AOYE01000028.1). SARA4 which was a RG12D/10A hybrid had another variant with 50% coverage/99% identity with P22 (NC_002371). Another prophage, P22_SARA19, was found in SARA19, DT195, ERR277210 and SARA20 but also with considerable sequence divergence from other P22 variants (Table S7). Another variant, P22_L1867, was found in L1861 and L1867 belonging to RG10B. Strain L1853 had a P22 variant which had greater sequence coverage with BTP1 from D23580 (39% with 99% identity) than with P22 (NC_002371) (34% and 99% identity) and even less coverage with the other STM0786 prophages.

Other prophages

Five outlier prophages, SPN1S, OLF-SE9-10012-like prophage, a SfV-like prophage, φW104²³, and SEN34-like which do not belong to any of the known S. Typhimurium prophage groups, were found. A SPN1S variant was found in SARA7 which was inserted at the same site (STM2510) as the SPN1S in S. Heidelberg strain 12-4374 (CP012924.1). A OLF-SE9-10012-like prophage of 36 kb was found to be inserted at the Fels-1 insertion site in RG9A strains L847, L1855 and L1862. A Shigella SfV-like prophage of 45 kb, which was inserted at STM4243, was found in RG6B strains DT193 and L1874. φW104 (belonging to the family Podoviridae) was detected in all but one RG8 strain and a variant of φW104 was found in SARA10 and SARA14 (unassigned RG) and DT8 (RG14). A 42 kb SEN34-like prophage (belonging to the family Podoviridae) in the STM2067 insertion site was found in strains L1876 and L1879.

Prophage profiles and correlation with core genome and CRISPR evolution

Based on the distribution of ST64B, Gifsy-1, Gifsy-3, Fels-1, P2, P4, P22 and φW104 prophages found in this study, the 105 S. Typhimurium strains were classified into 25 prophage profiles (Table 3). Each profile consisted of a set of shared prophages. Within these profiles other prophages may be variably present as shown in Fig. 2. There was a considerable level of correlation between prophage profile and lineages determined by genomic typing (Fig. 2). Prophage profiles, to a certain extent, also reflected the compositions of CRISPR arrays as the strains in many RGs or subsets within RGs have the same phage profile. Since the CRISPR-cas system is possibly involved in defence against phage invasion we investigated whether there is a link between the loss of spacers and gain of a phage. Loss of one or more spacers within an RG could provide a mechanism for phage invasion. A study in Cronobacter sakazakii showed that the CRISPR-cas system is active in that species where clinical strains carried few CRISPR spacers and had more phages, suggesting that gaining more prophage by clinical strains may offer an advantage to the host in survival and pathogenicity²⁴. However, we did not find any loss of spacers that match the phage genome in corresponding RGs in this study. Shariat et al.¹¹ have reported that the CRISPR-Cas systems in Salmonella are no longer active, arguing against a role in modulating phage invasion in S. Typhimurium in recent evolutionary history. Nevertheless, considering that some prophages carry virulence genes (see below), phages may have acted as a driving force in the evolution of S. Typhimurium, regardless of the mechanisms of phage acquisition25 ^{, 26}.

Table 3 Prophage profiles for various strain clusters.

Full size table

Prophages in S. Typhimurium genomes with similarity to prophages from other serovars and/or other species

The SPN1S variant had some additional genes which shared identity with phage SPC32N or the sequence from Klebsiella pneumoniae (Supplementary Table S8), indicating it was a hybrid prophage. The OLF-SE9-10012-like prophage in RG9A strains showed the highest similarity to a prophage in S. Enteritidis OLF-SE9-10012 (CP009091.1) with 70%/99% coverage/identity (Supplementary Table S9). This prophage was an unclassified member of the Myoviridae not related to P2 and with little similarity to other members of the Myoviridae family. Related prophages were also found in S. Muenchen BAA1674 (AOYT01000011.1), S. bongori N268-08 (CP006608.1), S. Hartford str 2012K-0272 (ARYS01000018) and S. Bovismorbificans 3114 (HF969015.2) as well as in multiple strains of S. Weltevreden and S. Bareilly. Close to the 3’ end of the prophage was the location of a variant form of the sopE gene (AF043239) which had been previously identified as a feature of all RG9A genotypes by PCR¹⁴.

The SfV-like prophage in two RG6B strains showed considerable divergence from SfV (AF339141) (Supplementary Table S10). It had a mosaic composition with some genes encoding hypothetical proteins coming from other species or other Salmonella serovars. These genes were inserted into different positions of the SfV-like prophage genome, indicating that multiple genetic exchanges have occurred (Figure S5). The SfV-like prophage lacked a 6 kb region present in SfV which carries the gtr genes for serotype conversion²⁷. Additionally, the 5′ end region encoding phage packaging and structure, and right-hand side regions encoding replication and regulation, were partially replaced by sequences from other serovars with unknown functions. The SEN34-like prophage shared only 20% of its genome with phage SEN34 (KT630649.1). A SEN34-like prophage was also found in serovar Weltevreden 1655 (CP014996.1) and Paratyphi B SPB7 (NC_010102). L1879 had a 43.5 kb sequence in the same insertion site that had only 14% coverage with SEN34. The remaining sequence was found in a number of S. Typhimurium strains and in other Salmonella serovars.

Distribution of virulence genes carried on prophages

S. Typhimurium prophages may carry genes that enhance the virulence of the bacterial host²⁸ thus the distribution of these genes deserves closer attention (Supplementary Table S11). The four virulence genes carried on Gifsy-1, gipA, gogB, gogD, and gogA, were present in nearly all except seven strains from four different RGs. Interestingly, these genes were mostly absent in the earliest diverged lineage (RG5). The artAB genes on Gifsy-1_DT104 initially found in epidemic DT104 strains²⁹ were found in other strains in six RGs. Similarly, the seven virulence genes carried by Gifsy-2 were present in most strains. However, sopE and sspH1 carried by a P2 and Gifsy-3 respectively were variably present in two different branches, while nanH carried by Fels-1 was only present in one strain (LT2). Some of these genes play a key role in virulence. Specifically, SopE activates RHO GTPases that lead to modification of cytoskeleton of the host cell for invasion and also induce caspase 1 to provoke inflammation³⁰. SopE can also induce the production of nitrate by host cell so that Salmonella can use nitrate respiration in the gut³¹, which enhances the survival of S. Typhimurium inside the host and competition with gut microbiota. SseK, a T3SS effector, encoded on ST64B and shown to play a role in the inhibition of NF-κB activation^{32, 33}, was found in five RGs. Some P22 prophages carry gtrABC genes which encode glucosyltransferases that glycosylate the galactose residues of the somatic O-antigen in S. Typhimurium³⁴. Modification of the O-antigen may help evade host induced immunity. The distribution of the gtrABC carrying P22 prophages was random across the genome tree with only 3 RGs fully carrying these prophages. The variable carriage of these prophages and virulence genes suggest that S. Typhimurium strains can significantly differ in their pathogenic potential.

Genomic islands

A novel genomic island inserted in the STM0323 (thrW tRNA) site was found and named as StmGI_0323. It was present in some strains of RG12B and RG12C including L1848, L1854, L1864 and L1877 as well as L796. In L1864, StmGI_0323 occurred in tandem with a SPN9CC-related P22 prophage. The genomic island was also found in DT8 and L1859 in RG14. StmGI_0323 encodes 14 open reading frames (ORFs) (Table 4) and is clearly of plasmid origin as the majority of the ORFs, four of which encode conjugal transfer proteins, shared high similarity to E. coli plasmid sequences (>99% at protein level). It was noteworthy that another unrelated genomic island GQ478253 was inserted at this site in RG6B strains L1874 and DT193.

Table 4 Detailed identity comparison of each gene in genomic island StmGI_0323.

Full size table

Conclusions

This study compared the genomic diversity of Australian and international strains of S. Typhimurium. The size of core genome in our set was slightly smaller than in previous reports, indicating that we have derived a stable core genome. The 105 strains could be divided into five clades and 19 lineages based on core genome variation. The strains represented 14 different RGs and the RGs primarily derived from CRISPR array composition correlated well with the lineages determined by core genomic typing. Previous studies also found CRISPR composition is correlated with genomic relationship in other Salmonella serovars³⁵, suggesting this is a general phenomenon. The accessory genome of S. Typhimurium contained a fair proportion of prophage genes. Some prophages were widely present in S. Typhimurium while others were sporadic. There was a strong correlation of prophage profiles with lineages and CRISPR profiles. Acquisition of phages may have played an important role in the adaptation and virulence evolution of S. Typhimurium. There was high sequence diversity among related prophages with a considerable level of similarity with prophages from other serovars and/or other species, suggesting extensive horizontal gene transfer. Virulence genes such as sopE carried by prophages were variably present, indicating variation in pathogenicity among S. Typhimurium strains. These findings have extended our understanding of the genomic diversity and core genome evolution of S. Typhimurium, particularly its relationship with CRISPR evolution and prophage variation.

Materials and Methods

Bacterial strains and genomic DNA isolation

Thirty-nine human clinical isolates representing CRISPR diversity¹⁴ collected between 1997 and 2011 were selected for sequencing (Table 2). Thirty-seven isolates in this study had been referred by the laboratory of Queensland Department of Health, Forensic & Scientific Services in Brisbane, Australia. Two other isolates were obtained from the UK. The phenol/chloroform method was used to extract genomic DNA from each strain as described previously³⁶. Sixty-six publicly available genomes were also used as shown in Supplementary Table S2. The plasmid and antibiotic resistance genes among the 105 strains were also analysed (Supplementary text).

CRISPR profiles

The CRISPR sequences of 36 strains (strain No. from L1849 to L1883) were determined in a previous study¹⁴. The CRISPR sequences from 35 S. Typhimurium strains sequenced in our previous studies^{4, 6, 16} were analysed in this study. The CRISPR1 and CRISPR2 sequences in each isolate were amplified using the primer pairs described by Hiley et al.¹⁴. PCR products were sequenced using the Applied Biosystems 3130 sequencer and BigDye Terminator v3.1 Cycle Sequencing Kit. ChromasPro was used to analyse the sequences. For 34 public genomes, the CRISPR finder program (http://crispr.u-psud.fr/) was used to locate the regular repeats and the intervening spacer sequences. Results were represented as filled rectangular blocks for ‘spacer present’ or an X for ‘spacer absent’ in the same order as for S. Typhimurium spacers in Table 6 in Fabre et al.⁸ (Supplementary Table S2).

Genome sequencing, de novo assembly and identification of Single nucleotide polymorphisms (SNPs)

Genomic DNA was sequenced using the Illumina Genome Analyzer (Illumina) with 250 bp paired end sequencing. Contigs were de novo assembled using the Velvet version 1.0.8 and VelvetOptimiser³⁷. Large scaffolds and short contigs generated by Velvet were aligned to the S. Typhimurium LT2 genome (NC_003197) using progressiveMauve version 2.3.1³⁸. RAST was used to annotate the sequences from each NGS genome³⁹. The number of coding sequences in the genomes was predicted based on RAST. For draft genomes, SNP calling was performed by Samtools (version 0.1.19) and followed the previously described criteria⁶. A custom script was used to determine whether a SNP in the genic region is synonymous SNP (sSNP) or non-synonymous SNP (nsSNP). For the complete genome, SNPs were determined by using the NUCmer program in the MUMmer package version 3.0⁴⁰.

Prophage analysis

The presence of prophages from the sequenced strains was screened using PHAST⁴¹. The prophages were confirmed by searching for integrase from annotated genomes. We subtyped the ST64B prophages and Gifsy-1 prophages into two and seven variants, respectively. ST64B prophage has two variants: ST64B_DT104 and ST64B_DT64. The sequence of ST64B_DT64 (AY055382) and the sequence of ST64B_DT104 obtained from DT104 (NC_022569) were used as reference sequences to confirm the variants in our studied strains. Gifsy-1 prophages were subtyped as either Gifsy-1_LT2, Gifsy-1_DT104, Gifsy-1_DT126, Gifsy-1_SL1344, Gifsy-1_DT2, Gifsy-1_CVM23701 or Gifsy-1_DT64 based on the unique sequences among these variants as defined previously¹⁴. The core genome content of P2 prophage and HP2-like group of P2 prophage were obtained by analysing common shared regions of P2 prophages and HP2-like P2 prophages using progressiveMauve version 2.3.1, respectively³⁸.

Phylogenetic analysis

Based on S. Typhimurium core genome SNPs we defined previously⁴ and core genome content of P2 prophage identified in this study, phylogenetic trees were constructed using the Minimum Evolution algorithms in MEGA 5.0 for 105 S. Typhimurium genomes and 15 P2 prophage genomes, respectively⁴². Bootstrap analysis was performed with 1,000 replicates. Based on the presence and absence of DNA segments in the P22 pan-genome, a UPGMA tree was constructed using the web-server DendroUPGMA (http://genomes.urv.cat/UPGMA).

Sequence data accession number

The raw sequencing data were submitted to GenBank (NCBI) under the BioProject No. PRJNA355598.

References

Galanis, E. et al. Web-based surveillance and global Salmonella distribution, 2000–2002. Emerg. Infect. Dis. 12, 381–388, doi:10.3201/eid1205.050854 (2006).
Article PubMed PubMed Central Google Scholar
Anderson, E. S., Ward, L. R., Saxe, M. J. & de Sa, J. D. Bacteriophage-typing designations of Salmonella typhimurium. J. Hyg. 78, 297–300 (1977).
Article CAS PubMed PubMed Central Google Scholar
Pang, S. et al. Genetic relationships of phage types and single nucleotide polymorphism typing of Salmonella enterica Serovar Typhimurium. J. Clin. Microbiol. 50, 727–734, doi:10.1128/JCM.01284-11 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fu, S., Octavia, S., Tanaka, M. M., Sintchenko, V. & Lan, R. Defining the core genome of Salmonella enterica serovar Typhimurium for genomic surveillance and epidemiological typing. J. Clin. Microbiol. 53, 2530–2538, doi:10.1128/JCM.03407-14 (2015).
Article CAS PubMed PubMed Central Google Scholar
Achtman, M. et al. Multilocus sequence typing as a replacement forserotyping in Salmonella enterica. Plos Pathog. 8, doi:ARTN e1002776, doi:10.1371/journal.ppat.1002776 (2012).
Pang, S. et al. Genomic diversity and adaptation of Salmonella enterica serovar Typhimurium from analysis of six genomes of different phage types. BMC Genomics. 14, 718, doi:10.1186/1471-2164-14-718 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hayden, H. S. et al. Genomic analysis of Salmonella enterica serovar Typhimurium characterizes strain diversity for recent U.S. Salmonellosis cases and identifies mutations linked to loss of fitness under nitrosative and oxidative stress. MBio. 7, e00154, doi:10.1128/mBio.00154-16 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fabre, L. et al. CRISPR typing and subtyping for improved laboratory surveillance of Salmonella infections. Plos One. 7, e36995, doi:10.1371/journal.pone.0036995 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pettengill, J. B. et al. The evolutionary history and diagnostic utility of the CRISPR-Cas system within Salmonella enterica ssp. enterica. PeerJ. 2, e340, doi:10.7717/peerj.340 (2014).
Article PubMed PubMed Central Google Scholar
Bhaya, D., Davison, M. & Barrangou, R. CRISPR-Cas systems in bacteria and archaea: versatile small RNAs for adaptive defense and regulation. Annu. Rev. Genet. 45, 273–297, doi:10.1146/annurev-genet-110410-132430 (2011).
Article CAS PubMed Google Scholar
Shariat, N., Timme, R. E., Pettengill, J. B., Barrangou, R. & Dudley, E. G. Characterization and evolution of Salmonella CRISPR-Cas systems. Microbiology 161, 374–386, doi:10.1099/mic.0.000005 (2015).
Article CAS Google Scholar
Liu, F. et al. Novel virulence gene and clustered regularly interspaced short palindromic repeat (CRISPR) multilocus sequence typing scheme for subtyping of the major serovars of Salmonella enterica subsp. enterica. Appl. Environ. Microbiol. 77, 1946–1956, doi:10.1128/AEM.02625-10 (2011).
Article CAS Google Scholar
Liu, F. et al. Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs). Appl. Environ. Microbiol. 77, 4520–4526, doi:10.1128/AEM.00468-11 (2011).
Article CAS Google Scholar
Hiley, L., Fang, N. X., Micalizzi, G. R. & Bates, J. Distribution of Gifsy-3 and of variants of ST64B and Gifsy-1 prophages amongst Salmonella enterica serovar Typhimurium Isolates: evidence that combinations of prophages promote clonality. Plos One. 9, doi:ARTN e86203, doi:10.1371/journal.pone.0086203 (2014).
Mather, A. E. et al. Genomic Analysis of Salmonella enterica Serovar Typhimurium from wild passerines in England and Wales. Appl. Environ. Microbiol. 82, 6728–6735, doi:10.1128/AEM.01660-16 (2016).
Article CAS Google Scholar
Octavia, S. et al. Delineating community outbreaks of Salmonella enterica serovar Typhimurium by use of whole-genome sequencing: insights into genomic variability within an outbreak. J. Clin. Microbiol. 53, 1063–1071, doi:10.1128/JCM.03235-14 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hawkey, J. et al. Evidence of microevolution of Salmonella Typhimurium during a series of egg-associated outbreaks linked to a single chicken farm. BMC Genomics. 14, 800, doi:10.1186/1471-2164-14-800 (2013).
Article CAS PubMed PubMed Central Google Scholar
Barrangou, R. et al. CRISPR provides acquired resistance against viruses in prokaryotes. Science. 315, 1709–1712, doi:10.1126/science.1138140 (2007).
Article ADS CAS PubMed Google Scholar
Switt, A. I. M. et al. Salmonella Phages and Prophages: Genomics, Taxonomy, and Applied Aspects. Methods. Mol. Biol. 1225, 237–287, doi:10.1007/978-1-4939-1625-2_15 (2015).
Article CAS Google Scholar
Canchaya, C., Proux, C., Fournous, G., Bruttin, A. & Brussow, H. Prophage genomics. Microbiol. Mol. Biol. Rev. 67, 238–276, doi:10.1128/Mmbr.67.2.238-276.2003 (2003).
Article CAS Google Scholar
Mohammed, M. & Cormican, M. Whole genome sequencing provides possible explanations for the difference in phage susceptibility among two Salmonella Typhimurium phage types (DT8 and DT30) associated with a single foodborne outbreak. BMC Res. Notes. 8, 728, doi:10.1186/s13104-015-1687-6 (2015).
Article Google Scholar
Williams, B. J. et al. Bacteriophage HP2 of Haemophilus influenzae. J. Bacteriol. 184, 6893–6905, doi:10.1128/Jb.184.24.6893-6905.2002 (2002).
Article CAS PubMed PubMed Central Google Scholar
Balbontin, R., Figueroa-Bossi, N., Casadesus, J. & Bossi, L. Insertion hot spot for horizontally acquired DNA within a bidirectional small-RNA locus in Salmonella enterica. J. Bacteriol. 190, 4075–4078, doi:10.1128/Jb.00220-08 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zeng, H. et al. The driving force of prophages and CRISPR-Cas system in the evolution of Cronobacter sakazakii. Sci. Rep. 7, 40206, doi:10.1038/srep40206 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Wagner, P. L. & Waldor, M. K. Bacteriophage control of bacterial virulence. Infect. Immun. 70, 3985–3993 (2002).
Brussow, H., Canchaya, C. & Hardt, W. D. Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion. Microbiol. Mol.Biol. Rev. 68, 560–602, doi:10.1128/MMBR.68.3.560-602.2004 (2004).
Article PubMed PubMed Central Google Scholar
Allison, G. E., Angeles, D. C., Huan, P. & Verma, N. K. Morphology of temperate bacteriophage SfV and characterisation of the DNA packaging and capsid genes: the structural genes evolved from two different phage families. Virology. 308, 114–127 (2003).
Article CAS PubMed Google Scholar
Figueroa-Bossi, N. & Bossi, L. Inducible prophages contribute to Salmonella virulence in mice. Mol. Microbiol. 33, 167–176 (1999).
Article CAS PubMed Google Scholar
Saitoh, M. et al. The artAB genes encode a putative ADP-ribosyltransferase toxin homologue associated with Salmonella enterica serovar Typhimurium DT104. Microbiology. 151, 3089–3096, doi:10.1099/mic.0.27933-0 (2005).
Article CAS PubMed Google Scholar
Hapfelmeier, S. et al. Role of the Salmonella pathogenicity island 1 effector proteins SipA, SopB, SopE, and SopE2 in Salmonella enterica subspecies 1 serovar Typhimurium colitis in streptomycin-pretreated mice. Infect. Immun. 72, 795–809 (2004).
Article CAS PubMed PubMed Central Google Scholar
Lopez, C. A. et al. Phage-mediated acquisition of a type III secreted effector protein boosts growth of Salmonella by nitrate respiration. MBio. 3, doi:10.1128/mBio.00143-12 (2012).
Brown, N. F. et al. Salmonella phage ST64B encodes a member of the SseK/NleB effector family. Plos One. 6, doi:ARTN e17824, doi:10.1371/journal.pone.0017824 (2011).
Gao, X. F. et al. NleB, a bacterial effector with glycosyltransferase activity, targets GADPH function to inhibit NF-kappa B activation. Cell Host Microbe 13, 87–99, doi:10.1016/j.chom.2012.11.010 (2013).
Article CAS PubMed PubMed Central Google Scholar
Villafane, R., Zayas, M., Gilcrease, E. B., Kropinski, A. M. & Casjens, S. R. Genomic analysis of bacteriophage epsilon 34 of Salmonella enterica serovar Anatum (15+). BMC Microbiol. 8, 227, doi:10.1186/1471-2180-8-227 (2008).
Article PubMed PubMed Central Google Scholar
Deng, X. et al. Comparative analysis of subtyping methods against a whole-genome-sequencing standard for Salmonella enterica serotype Enteritidis. J. Clin. Microbiol. 53, 212–218, doi:10.1128/JCM.02332-14 (2015).
Article PubMed Google Scholar
Octavia, S. & Lan, R. Single nucleotide polymorphism typing of global Salmonella enterica serovar Typhi isolates by use of a hairpin primer real-time PCR assay. J. Clin. Microbiol. 48, 3504–3509, doi:10.1128/JCM.00709-10 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zerbino, D. R. & Birney, E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829, doi:10.1101/gr.074492.107 (2008).
Article CAS PubMed PubMed Central Google Scholar
Darling, A. E., Mau, B. & Perna, N. T. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. Plos One. 5, e11147, doi:10.1371/journal.pone.0011147 (2010).
Article ADS PubMed PubMed Central Google Scholar
Aziz, R. K. et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 9, 75, doi:10.1186/1471-2164-9-75 (2008).
Article PubMed PubMed Central Google Scholar
Delcher, A. L., Salzberg, S. L. & Phillippy, A. M. Using MUMmer to identify similar regions in large sequence sets. Curr. Protoc. Bioinformatics 10.13. 11-10.13. 18 (2003).
Zhou, Y., Liang, Y., Lynch, K. H., Dennis, J. J. & Wishart, D. S. PHAST: a fast phage search tool. Nucleic. Acids. Res. 39, W347–352, doi:10.1093/nar/gkr485 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tamura, K. et al. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739, doi:10.1093/molbev/msr121 (2011).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by a grant from the National Health and Medical Research Council of Australia and a UNSW Early Career Research grant. SF was supported by a UNSW international postgraduate research award. MT was supported by an Australian Research Council Future Fellowship.

Author information

Authors and Affiliations

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Songzhe Fu, Sophie Octavia, Mark M. Tanaka & Ruiting Lan
Public Health Microbiology Laboratory, Forensic and Scientific Services, Queensland Department of Health, Brisbane, Queensland, Australia
Lester Hiley
Marie Bashir Institute for Infectious Diseases and Biosecurity, University of Sydney, Sydney, New South Wales, Australia
Vitali Sintchenko
Centre for Infectious Diseases and Microbiology–Public Health, Institute of Clinical Pathology and Medical Research, Westmead Hospital, Sydney, New South Wales, Australia
Vitali Sintchenko

Authors

Songzhe Fu
View author publications
You can also search for this author in PubMed Google Scholar
Lester Hiley
View author publications
You can also search for this author in PubMed Google Scholar
Sophie Octavia
View author publications
You can also search for this author in PubMed Google Scholar
Mark M. Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Vitali Sintchenko
View author publications
You can also search for this author in PubMed Google Scholar
Ruiting Lan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: S.F., L.H., S.O. and R.L. Performed the experiments: S.F., L.H. and S.O. Data analysis and draft of manuscript were performed by S.F., L.H., S.O., M.T., V.S. and R.L. All authors approved the final version of the manuscript for submission.

Corresponding author

Correspondence to Ruiting Lan.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Tables

supplementary info

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fu, S., Hiley, L., Octavia, S. et al. Comparative genomics of Australian and international isolates of Salmonella Typhimurium: correlation of core genome evolution with CRISPR and prophage profiles. Sci Rep 7, 9733 (2017). https://doi.org/10.1038/s41598-017-06079-1

Download citation

Received: 09 December 2016
Accepted: 07 June 2017
Published: 29 August 2017
DOI: https://doi.org/10.1038/s41598-017-06079-1

This article is cited by

Characterisation of IncI1 plasmids associated with change of phage type in isolates of Salmonella enterica serovar Typhimurium
- Lester Hiley
- Rikki M. A. Graham
- Amy V. Jennison
BMC Microbiology (2021)
Survival of Salmonella Under Heat Stress is Associated with the Presence/Absence of CRISPR Cas Genes and Iron Levels
- Amreeta Sarjit
- Joshua T. Ravensdale
- Gary A. Dykes
Current Microbiology (2021)
Genomic dissection of the most prevalent Listeria monocytogenes clone, sequence type ST87, in China
- Yan Wang
- Lijuan Luo
- Changyun Ye
BMC Genomics (2019)
Comparative genomic analysis unravels the transmission pattern and intra-species divergence of acute hepatopancreatic necrosis disease (AHPND)-causing Vibrio parahaemolyticus strains
- Qian Yang
- Xuan Dong
- Jie Huang
Molecular Genetics and Genomics (2019)
Analysis of direct repeats and spacers of CRISPR/Cas systems type I-F in Brazilian clinical strains of Pseudomonas aeruginosa
- Ana Carolina de Oliveira Luz
- Julia Mariana Assis da Silva
- Tereza Cristina Leal-Balbino
Molecular Genetics and Genomics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

Selection of S. Typhimurium isolates for genome sequencing

Core and accessory genomes of S. Typhimurium

Genomic relationships and their correlation with Repeats Groups (RGs)

Progressive spacer deletion in the CRISPR evolution of S. Typhimurium

Prophages found in the 105 S. Typhimurium genomes

Lambdoid prophages

P27-like prophages

P2 prophages, P2 variants and P4 prophage

P22 prophages and P22 variants

Other prophages

Prophage profiles and correlation with core genome and CRISPR evolution

Prophages in S. Typhimurium genomes with similarity to prophages from other serovars and/or other species

Distribution of virulence genes carried on prophages

Genomic islands

Conclusions

Materials and Methods

Bacterial strains and genomic DNA isolation

CRISPR profiles

Genome sequencing, de novo assembly and identification of Single nucleotide polymorphisms (SNPs)

Prophage analysis

Phylogenetic analysis

Sequence data accession number

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links