Draft genome sequence data of Paenibacillus polymyxa strain TH2H2, isolated from a tomato flower in Korea

Members of the genus Paenibacillus are known for their production of useful substances, and some species of the genus are recognized as plant growth-promoting rhizobacteria. Paenibacillus polymyxa TH2H2, isolated from a tomato flower, has antifungal activity. Here, the draft genome sequence of Paenibacillus polymyxa TH2H2 is reported. The 5,983,104-bp genome, with a G+C content of 45.31%, comprises 5,221 protein-coding genes, 64 ribosomal RNA genes and 100 transfer RNA genes. Three intact antibiotic biosynthesis gene clusters were identified using antiSMASH. These encode the antifungal agent fusaricidin and two antibacterial agents, tridecaptin and polymyxin. Sequence data have been deposited in the DDBJ/ENA/GenBank database under the accession number RPDG01000000. The version described in this paper is RPDG00000000.1. The BioProject ID in the GenBank database is PRJNA505713.


Specifications table
Genome sequencing with MiSeq paired-end protocol at Chunlab Inc., Republic of Korea
Data format: Raw and analyzed
Parameters for data collection: Bacterial genomic DNA was extracted from a culture of Paenibacillus polymyxa TH2H2
Description of data collection: Genome features (Table 1), genome map (Fig. 1A), EggNOG functional category (Fig. 1B)

Value of the data
• The draft genome sequence of P. polymyxa TH2H2 provides essential information about the strain that can be applied to plant protection research and the biological control of plant pathogens.
• In the genome of P. polymyxa TH2H2, 37 secondary metabolite gene clusters, including antibiotic biosynthesis clusters, were predicted, indicating that the strain could be valuable in investigations of plant-microbe interactions.
• The P. polymyxa TH2H2 genome data provide information about species of the genus Paenibacillus that will be useful to the wider microbial research community.

Data Description
Species of the genus Paenibacillus are rod-shaped, aerobic or facultatively anaerobic, endospore-forming bacteria that stain Gram-positive or Gram-variable. They inhabit various sites including soil, sediment, sewage, caves, compost, water, and plant and animal tissues. Some species of the genus Paenibacillus are reported to be plant growth-promoting rhizobacteria that fix nitrogen, solubilize phosphate and act as antagonistic agents against plant pathogens [1,2]. Members of the genus Paenibacillus produce a range of molecules such as amylases, cellulases, lipases, pectinases, oxygenases, dehydrogenases, lignin-modifying enzymes, exopolysaccharides and antibiotic agents, and these products have applications in the food, paper, biofuel, textile, agriculture, detergent and medicine industries [1]. Paenibacillus polymyxa TH2H2 was previously isolated from a tomato flower [3]. The strain showed antifungal activity against the tomato pathogen Botrytis cinerea, accompanied by high levels of cellulase and proteinase activity [3].
The draft genome of P. polymyxa TH2H2 had a total length of 5,983,104 bp in 113 contigs, with a G+C content of 45.31% (Fig. 1A, Table 1, supplementary data 1). There were 5,221 coding sequences (CDS), 64 ribosomal RNA genes and 100 transfer RNA genes. Cluster of Orthologous Groups (COG) analysis categorized 4,646 genes (supplementary data 1). With the exception of genes with unknown functions, the most frequently observed category was carbohydrate transport and metabolism, followed by transcription, then energy production and conversion (Fig. 1B). Thirty-seven secondary metabolite gene clusters were predicted using antiSMASH (supplementary data 2). Among these, three complete non-ribosomal peptide (NRP) clusters were identified. The first was the fusaricidin biosynthetic gene cluster [4]; fusaricidin is a recognized antifungal agent. The other two were the tridecaptin and polymyxin biosynthetic gene clusters [5,6]; both tridecaptin and polymyxin are antibacterial agents that target Gram-negative bacteria.
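The summary statistics reported above (contig count, total length and G+C content) can be recomputed from the deposited contigs in a few lines. The following is a minimal sketch in Python, not the pipeline used to produce Table 1; the function name `assembly_stats` and the toy sequences are hypothetical and chosen only for illustration. It assumes a plain FASTA input and counts every sequence character, including ambiguous bases such as N, towards the total length.

```python
from io import StringIO


def assembly_stats(fasta_handle):
    """Return (n_contigs, total_bp, gc_percent) for an open FASTA handle.

    Hypothetical helper for illustration; all sequence characters,
    including ambiguous bases such as N, count towards total_bp.
    """
    n_contigs = 0
    total_bp = 0
    gc_bp = 0
    for line in fasta_handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            n_contigs += 1  # each header line starts a new contig
            continue
        seq = line.upper()
        total_bp += len(seq)
        gc_bp += seq.count("G") + seq.count("C")
    gc_percent = 100.0 * gc_bp / total_bp if total_bp else 0.0
    return n_contigs, total_bp, gc_percent


# Toy two-contig input (made-up sequences, not TH2H2 data).
toy = StringIO(">contig_1\nATGCGC\nAT\n>contig_2\nGGCC\n")
n_contigs, total_bp, gc_percent = assembly_stats(toy)
print(n_contigs, total_bp, round(gc_percent, 2))
```

For a real assembly, the same function could be applied to a file opened with `open("contigs.fasta")`, where the file name is a placeholder for the downloaded contig file.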

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.