Draft genome of Paraburkholderia caballeronis TNe-841T, a free-living, nitrogen-fixing, tomato plant-associated bacterium

Paraburkholderiacaballeronis is a plant-associated bacterium. Strain TNe-841T was isolated from the rhizosphere of tomato (Solanum lycopersicum L. var. lycopersicum) growing in Nepantla Mexico State. Initially this bacterium was found to effectively nodulate Phaseolus vulgaris L. However, from an analysis of the genome of strain TNe-841T and from repeat inoculation experiments, we found that this strain did not nodulate bean and also lacked nodulation genes, suggesting that the genes were lost. The genome consists of 7,115,141 bp with a G + C content of 67.01%. The sequence includes 6251 protein-coding genes and 87 RNA genes.


Introduction
Paraburkholderia caballeronis was isolated in the State of Mexico, Mexico from the tomato rhizosphere as a free-living, nitrogen-fixing bacterial species [1]. It was described as B. caballeronis and found to nodulate Phaseolus vulgaris L. [2]. Most nodulating bacteria are isolated from root nodules but this was not the case for B. caballeronis, which was isolated from rhizospheric soil. Given the ability of this bacterium to fix nitrogen under both free-living and symbiotic conditions, this type strain was selected for genome sequencing to study its nitrogen-fixing and other plant-growth promoting activities. However, after analyzing the genome, we found that the genes for fixing nitrogen were present but nodulation genes were not. We carried out several unsuccessful tests to check the ability of this strain to nodulate P. vulgaris, strongly suggesting that the strain had lost the nod genes.
The genome sequence of P. caballeronis TNe-841 T was obtained in cooperation with JGI-DOE. The type species is TNe-841 T (= LMG 26416 T = CIP 110324 T ).

Classification and features
Burkholderia caballeronis TNe-841 T has been proposed to belong to the newly described genus Paraburkholderia. The last years, Burkholderia sensu lato has been subjected to some taxonomical changes, where the genus has been split to Burkholderia, Paraburkholderia, Caballeronia and Robbsia andropogonis [3][4][5]. However, this division has caused some skepticism, which has been expressed by The International Committee on Systematics of Prokaryotes, through the Subcommittee for the Taxonomy of Rhizobium and Agrobacterium discussed during the 12th Nitrogen Fixation Conference held in Budapest, Hungary on 25 August 2016 [6]. The Subcommittee stated: "Research efforts directed towards robust characterization and taxonomy of Burkholderia sensu lato species can help in realizing this agricultural potential. Clearly, large-scale phylogenomic study is required for resolving these taxa". In order to analyze this issue and to provide generic limits in Burkholderia sensu lato, a large phylogenomic analysis was carried out using the amino acid and nucleotide sequence of 106 conserved proteins from 92 species [7]. The analysis performed with maximum likelihood unambiguously supported five different lineages: Burkholderia sensu stricto, Paraburkholderia, Caballeronia, Robbsia andropogonis and B. rhizoxinica.
To check the position of P. caballeronis within Paraburkholderia, the 16S rRNA gene sequence (ca. 1500 bp) was amplified and sequenced at Macrogen [8] with the universal primers fD1/rD1 [9]. The nucleotide sequence (accession number EF139186) was compared to other Paraburkholderia species using Muscle 3.57 for alignment [10]. A phylogenetic analysis was performed with ML using the PhyML program [11]. Among-site rate variation was modeled by a gamma distribution with four rate categories [12] with each category being represented by its mean under the GTR + G model. Tree searches were initiated from a BioNJ seed tree retaining the best tree among those found with NNI (Nearest Neighbor Interchange). The robustness of the ML topologies was evaluated using a Shimodaira-Hasegawa (SH)-like test [13]. The ML tree was obtained with the program MEGA version 5 [14]. The position of P. caballeronis in the ML tree shows that it is close to P. kururiensis (Fig. 1). The colony morphology on BSE medium was uniform, 1 mm diameter, with entire margins that were convex, whitish, and translucent transparent. The cells are strictly aerobic Gramnegative, non-spore forming rod (0.49-0.69 μm × 1.2-2.7 μm) and have flagella (Fig. 2). Other phenotypic traits for this strain have been published before [2]. The strain has the following enzymes: arginine dihydrolase, urease catalase, and nitrogenase and associated proteins. It is also able to assimilate D-glucose, DLarabinose, D-mannose, D-mannitol, N-acetyl glucosamine, gluconate, capric acid, malate acetate, D-ribose, D-xylose, D-adonitol, D-galactose, D-fructose, L-rhamnose, inositol, D-sorbitol, D-cellobiose, D-turanose, D-xylose, D-fucose, D-arabitol, potassium 2-ketogluconate, and potassium 5ketogluconate (Table 1). Oxidase activity was weak. The strain grew on MacConkey agar plates at 29°C and 37°C, but weakly at 42°C. P. caballeronis TNe-841 T grew on LB and BSE agar plates at 15, 29, 37, and 42°C and on LB plates at 29°C with up to 5.0% NaCl.

Genome sequencing information
Genome project history P. caballeronis TNe-841 T was sequenced at the JGI-DOE as a part of the project "Root nodule microbial The strain was grown on LB medium and a loop-full of cells was gently suspended in 1 mL distilled water. A drop of the suspension was placed on a formvar-coated copper grid and air-dried for 20 min to allow the cells to adhere. The grid was then covered for 20 s with a solution of 0.5% uranyl acetate, the excess liquid was removed with a filter paper, and then air-dried. A JEOL JEM-1010 transmission electron microscope, operated at 60 kV, was used to observe and photograph negatively stained preparations. F, stands for flagella communities of legume samples collected from USA, Mexico and Botswana" directed by Dr. Ann M. Hirsch. The goal of this project was to identify the microbial community housed within nodules of native legumes living in three arid or semi-arid, nutrientpoor environments in Mexico, Botswana, and the United States. Both Paraburkholderia and Rhizobium bacteria had been previously isolated from Mexico. P. caballeronis TNe-841 T was chosen as the reference strain for a study of bacteria associated with native legume soils and nodules.
The complete sequence was finished on May 2015 and some features are presented in Table 2 and Fig. 3.
Growth conditions and genomic DNA preparation P. caballeronis TNe-841 T cells were grown in 5 ml of LB minus NaCl at 30°C for 18 h at 120 rpm. The DNA extraction was done using Invitrogen's Purelink™ Genomic DNA Mini Kit. The purified DNA was monitored for integrity by gel electrophoresis, and then sent to the JGI for sequencing.
Two surface-sterilized and rinsed seeds of Phaseolus vulgaris L. c.v. Negro Chapingo were planted per pot in surface-sterilized black pots (29.5 cm tall; 17 cm diameter) filled with autoclaved vermiculite:perlite (2:1) and watered with autoclaved 1/4 strength Hoagland's -N medium. Two separate experiments were performed. The . not directly observed for the living isolated sample but based on a generally accepted property for the species or anecdotal evidence). These evidence codes are from the Gene Ontology project [33] pots were either left uninoculated (sterilized water or Hoagland's -N medium was added), inoculated with 10 ml of P. caballeronis TNe-841 T diluted to OD 600 = 0.2 or with B. tuberum DUS833, which was a positive control. Some pots were also watered with 1/4 strength Hoagland's + N medium as an additional positive control. The appropriate medium was added twice weekly and the plants grown in a Conviron growth chamber under 16 h days/8 h nights at 24°C.

Genome sequencing and assembly
The draft genome of P. caballeronis was generated using the PacBio sequencing technology [15]. A Pacbio SMRTbell™ library was constructed and sequenced on the PacBio RS platform, which generated 194,884 filtered sub-reads totaling 879.3 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found at [16]. The raw reads were assembled using HGAP (version: 2.3.0 p5 protocol version = 2.3.0 method = RS HGAP Assembly.3 smrtpipe.py v1.87.139483) [17]. The final draft assembly contained 3 contigs in 3 scaffolds totaling 7.115 Mbp in size. The input read coverage was 62.2X.

Genome annotation
Genes were identified using Prodigal [18] followed by a round of manual curation using GenePRIMP [19] for finished genomes and draft genomes in fewer than   [20] was used to find tRNA genes whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [19]. Other non-coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [20]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes platform [21] developed by the JGI Walnut Creek CA USA [21]. The genome was also manually annotated at IPN and UCLA using the IMG platform [21].

Genome properties
The final draft assembly of P. caballeronis TNe-841 T contained 3 contigs in 3 scaffolds accumulating 7,115,141 bp in size ( Table 3). The G + C content of the genome was 67.01%, which is very close to the one determined during the description of the species (66.0%) [2]. The genome was predicted to encode 6338 genes including 6251 protein-coding genes and 87 RNA genes (15 rRNAs 60 tRNAs and 12 ncRNA). The number of genes associated with general COG functional categories is shown in Table 4, in addition to other functions such as extracellular structures and mobilome.
Insights from the genome sequence P. caballeronis was originally described as a free-living, nitrogen-fixing bacteria with the ability to form nodules on Phaseolus vulgaris L. roots [2]. Although nitrogen fixation genes are present, nodulation genes were not found in the sequenced genome. Moreover, after the initial experiments, P. vulgaris nodulation was no longer detected in greenhouse bioassays in two different laboratories. This nodulation instability seems to be more frequent than originally assumed because a similar loss of nodulation ability has been reported with other Burkholderia strains isolated from nodules. The strains CCGE1002 and CCGE1003 (Marco Antonio Rogel CCG-UNAM, pers. comm.) also lost the ability to nodulate, but strain CCGE1002, which retains the ability to nodulate, was recovered from a stored sample. Its symbiotic plasmid was subsequently sequenced (NCBI BioSample PRJNA37719). In contrast, nodulation genes were no longer detected in The total is based on the total number of protein coding genes in the genome  [22] and Gastrolobium capitatum [23] in Australia. Strain TNe-841 T also contains genes for degrading a large number of xenobiotics including aminobenzoate, atrazine, benzoate, bisphenol, caprolactam, chloroalkane, chloroalkene, chlorohexane, chlorobenzene, dioxin, ethylbenzene, fluorobenzoate, naphthalene, nitrotoluene, polycyclic aromatic hydrocarbons, styrene, toluene, and xylene.
ANI calculation was used to compare the genome of P. caballeronis TNe-841 T and other Paraburkholderia species ( Table 5). The ANI results showed that strains TNe-851 T correspond to a different species since the highest ANI value was 83.32. The accepted ANI cut-off for species is 95-96%, which corresponds to a DNA-DNA hybridization of 70% [24,25].

Conclusions
P. caballeronis TNe-81 T , is a plant-associated bacteria species with the ability to fix nitrogen, although the ability to nodulate legumes as shown in the original description was apparently lost. This nodulation instability seems to be rather common among nodulating bacteria, particularly Burkholderia/Paraburkholderia. Our interest in studying the genome of P. caballeronis TNe-841 T started when we found that this bacterium, isolated from the tomato rhizosphere, was able to nodulate bean. This led us to find out the identity of the original host for this species. Our work team has recently isolated a P. caballeronis strain from bean nodules used as a trap with soil from an area where Mimosoideae plants are present (unpublished results). We are characterizing additional isolates from Mimosoideae plant nodules to try to establish if this plant might be the host of P. caballeronis TNe-841 T .