Complete genome sequence of the plant-associated Serratia plymuthica strain AS13

Serratia plymuthica AS13 is a plant-associated member of the Gammaproteobacteria, isolated from rapeseed roots. It is of special interest because of its ability to inhibit fungal pathogens of rapeseed and to promote plant growth. The complete genome of S. plymuthica AS13 consists of a 5,442,549 bp circular chromosome that contains 4,951 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced as part of the project entitled “Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens” within the 2010 DOE-JGI Community Sequencing Program (CSP2010).


Introduction
The members of the genus Serratia are widely distributed in nature. They are commonly found in soil, water, plants, insects, and other animals including humans [1]. The genus includes biologically and ecologically diverse species, from those beneficial to economically important plants to pathogenic species that are harmful to humans. The plant-associated species comprise both endophytes and free-living taxa, such as S. proteamaculans, S. plymuthica, S. liquefaciens and S. grimesii. Most of them are of interest because of their ability to promote plant growth and inhibit plant pathogenic fungi [2][3][4][5][6].
There are currently 16 validly named Serratia species. However, there are several unidentified plant-associated Serratia strains that have an impact on agriculture by stimulating plant growth and/or inhibiting soil-borne plant pathogens [3]. S. plymuthica AS13 was isolated from rapeseed roots from Uppsala, Sweden. Our interest in S. plymuthica AS13 is due to its ability to stimulate rapeseed plant growth and to inhibit soil-borne fungal pathogens such as Verticillium dahliae and Rhizoctonia solani [6]. Here we present a description of the complete genome of S. plymuthica AS13 and its annotation.
The phylogenetic relationship of S. plymuthica AS13 is shown in Figure 1 in a 16S rRNA based tree. All Serratia lineages clustered together and were distinct from other enterobacteria (except Obesumbacterium proteus). The tree also shows that AS13 is very closely related to S. plymuthica strains AS9 and AS12. This was confirmed by digital DNA-DNA hybridization values [12] above 70%, computed with the GGDC web server [15], when AS13 was compared with the (unpublished) draft genome sequence of the S. plymuthica type strain Breed K-7T from a culture of DSM 4540 and with the complete genome sequences of S. plymuthica AS9 [13] and S. plymuthica AS12 [14].
Strain AS13 is a rod-shaped bacterium, 1-2 µm long and 0.5-0.7 µm wide (Figure 2 and Table 1). It is Gram-negative, motile, and a member of the family Enterobacteriaceae. The bacterium is a facultative anaerobe and grows within the temperature range 4-40 °C and within a pH range of 4-10. It has chitinolytic, cellulolytic, proteolytic, and phospholytic activity [6] and can easily grow on different carbon sources such as glucose, cellobiose, succinate, mannitol, arabinose and inositol. It forms red to pink colored colonies that are 1-2 mm in diameter on potato dextrose agar at low temperature. The color of the bacterium depends on the growth substrate, temperature and pH of the culture medium [30]. The bacterium is deposited in the Culture Collection, University of Göteborg, Sweden (CCUG) as S. plymuthica AS13 (= CCUG 61398).

Figure 1. [...] [8]. The tree was constructed under the maximum likelihood criterion using MEGA5 software [9] and rooted with Xanthomonas cucurbitae (a member of the family Xanthomonadaceae). The branches are scaled based on the expected number of substitutions per site. The numbers above branches are support values from 1,000 bootstrap replicates if larger than 60% [10]. The lineages shown in blue are the genome sequences of bacterial strains that are registered in GOLD [11].

Standards in Genomic Sciences

Genome sequencing information
S. plymuthica AS13, a bacterial strain isolated from rapeseed roots, was selected for sequencing on the basis of its biocontrol activity against fungal pathogens of rapeseed and its plant growth promoting ability. The genome project is deposited in the Genomes On Line Database [11] (GOLD ID = Gc01776) and the complete genome sequence is deposited in GenBank (INSDC ID = CP002775). Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute (JGI). Table 2 summarizes the project information and its association with MIGS identifiers.

Growth conditions and DNA isolation
S. plymuthica AS13 was grown in Luria Broth (LB) medium at 28 °C until early stationary phase. The DNA was extracted from the cells by using a standard CTAB protocol for bacterial genomic DNA isolation that is available at JGI [32].

Genome sequencing and assembly
The genome of S. plymuthica AS13 was sequenced using a combination of Illumina and 454 sequencing platforms. The details of library construction and sequencing can be found at the JGI [32]. The sequence data from Illumina GAii (1,457.3 Mb) were assembled with Velvet [33] and the consensus sequence was computationally shredded into 1.5 kb overlapping fake reads. The sequencing data from 454 pyrosequencing (79.5 Mb) were assembled with Newbler and the consensus sequences were computationally shredded into 2 kb overlapping fake reads. The initial draft assembly contained 86 contigs in 1 scaffold. The 454 Newbler consensus reads, the Illumina Velvet consensus reads and the read pairs in the 454 paired end library were then assembled, and quality was assessed in the subsequent finishing process, using the phrap software package [34][35][36][37]. Possible mis-assemblies were corrected with gapResolution [32], Dupfinisher [38], or by sequencing cloned bridging PCR fragments with subcloning. The gaps between contigs were closed by editing in the software Consed [37], by PCR and by Bubble PCR primer walks (J.-F. Chang, unpublished). Fifty-one additional reactions were necessary to close gaps and to raise the quality of the finished sequence. The sequence reads from Illumina were used to correct potential base errors and increase consensus quality using the software Polisher developed at JGI [39]. The final assembly is based on 46.8 Mb of 454 draft data, which provides an average 8.7× coverage of the genome, and 1,415.6 Mb of Illumina draft data, which provides an average 262.2× coverage of the genome.
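The shredding of a consensus sequence into overlapping fake reads can be illustrated with a minimal Python sketch. The function name `shred` and the overlap value are assumptions made for illustration: the text gives the fake-read lengths (1.5 kb and 2 kb) but not the overlap, and this is not the JGI implementation.

```python
def shred(consensus: str, read_len: int, overlap: int) -> list[str]:
    """Cut a consensus sequence into overlapping pseudo-reads.

    Hypothetical helper for illustration only; not the JGI pipeline code.
    """
    if read_len <= 0 or not 0 <= overlap < read_len:
        raise ValueError("need read_len > 0 and 0 <= overlap < read_len")
    step = read_len - overlap  # distance between consecutive read starts
    reads = []
    for start in range(0, len(consensus), step):
        reads.append(consensus[start:start + read_len])
        if start + read_len >= len(consensus):
            break  # this read already reaches the end of the consensus
    return reads


# Toy example: 10 bp consensus, 4 bp reads, 2 bp overlap (assumed values).
toy_reads = shred("ACGTACGTAC", read_len=4, overlap=2)
```

For the Velvet consensus described here, `read_len` would be 1500; any overlap passed alongside it would be an assumed value, since the overlap is not reported.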

Genome annotation
The S. plymuthica AS13 genes were identified using Prodigal [40] as part of the genome annotation pipeline at Oak Ridge National Laboratory (ORNL), Oak Ridge, TN, USA, followed by a round of manual curation using the JGI GenePRIMP pipeline [41]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database and the UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG and InterPro databases. Non-coding genes and miscellaneous features were predicted using tRNAscan-SE [42], RNAmmer [43], Rfam [44], TMHMM [45], and SignalP [46]. Additional gene prediction analysis and functional annotation were performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) platform developed by the Joint Genome Institute, Walnut Creek, CA, USA [47].

Genome properties
The genome of S. plymuthica AS13 has a single circular chromosome of 5,442,549 bp with 55.96% GC content (Table 3 and Figure 3). It has 5,139 predicted genes, of which 4,951 were assigned as protein-coding genes. Most of the protein-coding genes (84.41%) were functionally assigned, while the remaining ones were annotated as hypothetical proteins. A total of 112 genes were assigned as RNA genes and 76 as pseudogenes. The distribution of genes into COG functional categories is presented in Table 4.

Table 1 (final row and footnote). Altitude: 24-25 m (evidence code NAS).
a) Evidence codes. IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [29]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.

Table 3 (footnote). a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
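The gene counts given in the Genome properties text can be cross-checked with a few lines of Python. This is only a sanity-check sketch: the constant and function names are hypothetical, and the functionally assigned and hypothetical counts are approximations derived here from the reported percentage, not figures taken from the tables.

```python
# Counts as reported in the Genome properties text.
TOTAL_GENES = 5139
PROTEIN_CODING = 4951
RNA_GENES = 112
PSEUDOGENES = 76
FUNCTIONAL_PERCENT = 84.41  # share of protein-coding genes with a function


def summarize_gene_counts() -> dict:
    """Cross-check the reported totals and derive approximate counts."""
    category_sum = PROTEIN_CODING + RNA_GENES + PSEUDOGENES
    # Approximation derived from the percentage (an assumption, not a
    # number quoted in the text).
    functional = round(PROTEIN_CODING * FUNCTIONAL_PERCENT / 100)
    return {
        "category_sum": category_sum,
        "matches_total": category_sum == TOTAL_GENES,
        "functional_approx": functional,
        "hypothetical_approx": PROTEIN_CODING - functional,
    }


summary = summarize_gene_counts()
```

The three reported categories sum to the reported total of 5,139 predicted genes, and the 84.41% figure corresponds to roughly 4,179 functionally assigned protein-coding genes.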