The data of genomic and phenotypic profiles of the N-acyl homoserine lactone-producing algicidal bacterium Stenotrophomonas rhizophila GA1

Herein, an algicidal strain, Stenotrophomonas rhizophila GA1, was isolated from a marine dinoflagellate and its genome was sequenced using next-generation sequencing technology. The genome size of S. rhizophila GA1 was determined to be 5.92 Mb with a G+C content of 62.39%, comprising eight scaffolds of 67 contigs. A total of 4579 functional proteins were assigned according to COG categories. In silico genome annotation protocols identified multiple putative LuxI-like genes located in the upstream position at contig 4. The thin-layer chromatography analysis showed that three kinds of acyl homoserine lactone (AHL) signals could be produced by S. rhizophila GA1. This work describes an algicidal bacterium capable of generating AHL molecules for its ecological adaptation. The annotated genome sequence of this strain may represent a valuable tool for studying algae-bacterium interactions and developing microbial methods to control harmful algae. The genome scaffolds generated are available in the National Center Biotechnology Information (NCBI) BioProject with accession number PRJNA485554.


Value of the data
It is the first whole genome of algicidal bacterium S. rhizophila GA1. This data allows other researchers to extend the study about the algae-bacterium interactions. The data could be help us developing microbial methods to control harmful algae.

Data
In this study, an algicidal bacterium (Stenotrophomonas rhizophila GA1) was screened from the marine dinoflagellate. The morphology, optimal growth conditions and algicidal ability are shown in Fig. 1A-C. A summary of other data for the isolated strain is listed in Table 1.
The whole genome of S. rhizophila GA1 contained 6,598,546 bases and a G þC content of 62.39% ( Table 1). The analyses of the complete genome identified 4579 open reading frames. Homologous comparison by BLAST found 3395 CDS (coding sequence) involving 25 functional COGs (clusters of orthologous groups) and a part of the CDS involving 34 KEGG (Kyoto encyclopedia of genes and genomes) metabolic pathways ( Fig. 2A).
Based on the functional categories of COGs and KEGG groups, 271 genes were involved in carbohydrate metabolism and 879 genes participated in amino acid transport and energy conversion (Fig. 2B). Several genes encoding putative AHL inducers ( LuxI-like proteins) were found in contig 4 of S. rhizophila GA1. The thin-layer chromatography (TLC) confirmed S. rhizophila GA1 strain has the ability to produce AHL signaling molecules (Fig. 1D). In addition, a chitinase-coding gene (chi) was found downstream of luxR; the protein encoded by this gene is believed to contribute to the ability of S. rhizophila GA1 to lyse its host (algae).

Isolation
The seawater sample was collected from the dinoflagellate (G. aerucyinosum) bloom on the Shenzhen coast of China. An isolated strain GA1 caused significant algal lyse. The 16S rRNA gene sequence analysis revealed that it shared 99.7% similarity to the type strain of Stenotrophomonas rhizophila. We provisionally named this strain S. rhizophila GA1.

DNA extraction, sequencing, and assembly
Genomic DNA of S. rhizophila GA1 was extracted using the genomic DNA extraction kit (MoBio, CA, USA) following the protocols of the manufacturer. Whole-genome sequencing of the normalized DNA was performed using IlluminaHiseq. 2500 (San Diego, USA) instrument, as descripted in Glushchenko et al. [1]. De novo assembly was performed using CLC Genomics Workbench version 5.1 (CLC Bio, Denmark) and trimmed using a minimum Phred quality score of 20, a minimum length of 50 bp, allowing no ambiguous nucleotides and trimming off some low-quality nucleotides [2]. The reads were assembled with SOAPdenovo (V.2.04) [3], and the sequence was annotated using the RAST Error bars indicate the mean value 7 standard deviation of three optical density measurements at 600 nm (OD 600 ). (C) Algicidal activity of S. rhizophila GA1. The control group was sole-cultured G. aerucyinosum, and the initial concentration was 2 Â 10 4 cells/L. The test group was G. aerucyinosum co-cultured with S. rhizophila GA1 (final concentration was 1 Â 10 5 cells/mL). The total experimental cycle was 18 days. Error bars indicate the mean value 7 standard deviation of three measurements of ChII a. (D) Analysis of AHLs from supernatant extracts of GA1 strain. AHLs extracted from cell-free culture supernatants were separated by thin-layer chromatography and detected using an overlay of agar seeded with Agrobacterium tumefaciens 136. Lane 1contains AHL standards (arrows point to C 8 -oxo-, C 10 -, and C 12 -AHL, respectively); lane 2 contains the S. rhizophila GA1 extracts. annotation server [4]. tRNA and rRNA genes were predicted by tRNAscan-SE [5] and RNAmmer [6], respectively. Genes were predicted using Glimmer 3.02 [7] and annotated by searching against the NCBI-nr and KEGG databases.

Thin-layer chromatography (TLC)
To verify the AHL-synthesizing activity of S. rhizophila GA1, reverse-phase thin-layer chromatography (TLC) was performed as described by Shaw et al. [8] and Ma et al. [9].

Data accessibility
The genome sequence data has been deposited in the GenBank database under accession numbers PRJNA485554.