Complete Genome Sequence of Lactobacillus crispatus AB70, Isolated from a Vaginal Swab from a Healthy Pregnant Korean Woman

The vaginal bacterial strain AB70, belonging to the species Lactobacillus crispatus, was isolated from a vaginal swab from a healthy pregnant Korean woman. Here, we report the 2.37-Mb complete genome sequence of this strain.

T he vaginal microbiota is known to play a significant role in women's health and pregnancy (1,2). Several studies have shown that the vaginal microbiome of healthy pregnant women is dominated by Lactobacillus spp. and is characterized by less richness and diversity than the microbiome of nonpregnant women (3). A poor dominance of vaginal lactobacilli may cause microbial imbalance in the vagina, often resulting in bacterial vaginosis and high-risk pregnancies (4).
To investigate vaginal microbiota using both culture-dependent and cultureindependent techniques, vaginal swabs from healthy pregnant Korean women were collected under anaerobic conditions, as previously described (5). This study received ethical approval from the Konyang University Hospital Institutional Review Board (IRB) (approval number 2014-06-009). Using anaerobic de Man, Rogosa, and Sharpe (MRS) agar plates, several hundred purified vaginal microbes were isolated to purity using single-colony isolation, and most were assigned within Lactobacillus crispatus based on 16S rRNA sequences (100% 16S rRNA gene sequence similarities to that of L. crispatus strain ATCC 43058 T ). The 16S rRNA gene was amplified by colony PCR with the universal PCR primers 27F and 1492R (6), and purified PCR products were sequenced by the BIOFACT Co., Ltd. (Daejeon, Republic of Korea).
Antibacterial activities of ethyl acetate extracts were evaluated with disk diffusion assays against Escherichia coli KCTC 2441 and Bacillus cereus KCTC 3624. One of the isolates, named strain AB70, displayed strong antibacterial activities and was selected for further investigation, including complete genome analysis.
The genomic DNA of strain AB70 was extracted from 1-liter anaerobic MRS liquid cultures, as previously described (5). Genomic library construction and sequencing were carried out at the Chun Lab (Seoul, Republic of Korea) using the PacBio RS II platform (Pacific Biosciences, Menlo Park, CA, USA) with P6-C4 chemistry. We produced 150,292 sequence reads (ca. Lactobacillus crispatus AB70 has a chromosome (2,351,263 bp, 37.3% GϩC content) that encodes 2,333 protein-coding sequences, 71 tRNAs, and 5 rRNA gene clusters, while the circular plasmid pLcAB70 (16,662 bp, 35.8% GϩC content) encodes only 19 protein-coding sequences. The average nucleotide identity between AB70 and the type strain ATCC 33820 calculated with the OrthoANIu algorithm (10) was 97.5%. Genome coverages, however, were only 50.7% (AB70) and 60.0% (ATCC 33820), in accordance with the considerable genetic diversity between strains (11). The genome most similar to AB70 among all 57 publicly available L. crispatus genomes in the RefSeq database, identified with dRep (12) version 2.0.5, was that of strain UMB0803 (GenBank number GCF_002861765, 99.7% average nucleotide identity [ANI]), one of the nine L. crispatus strains isolated from female urine collected with a transurethral catheter (BioProject number PRJNA316969, 200 strains).
The L. crispatus AB70 genome contains no virulence-related or resistance-related genes, as assessed with IslandViewer 4 (13), which identified 266,727-bp putative genomic islands (11.3% of the chromosome), distributed into 17 regions. We also identified two bacteriocin biosynthetic gene clusters with antiSMASH version 4.1.0 (14), one of which was similar to a gassericin T (15) biosynthetic gene cluster.
The genomic information from strain AB70 will improve our understanding of the genetic diversity of the species L. crispatus, a major member of the vaginal microbiome of healthy pregnant women.
Data availability. The complete genome sequences of L. crispatus AB70 have been deposited in DDBJ/ENA/GenBank under the accession numbers CP026503 (chromosome) and CP026504 (plasmid). The versions described in this paper are versions CP026503.1 and CP026504.1. Raw sequencing reads are available in NCBI BioProject under the accession number PRJNA431864.

ACKNOWLEDGMENTS
This work was supported by the Korea Health Technology R&D Project (HI14C0368) and the KRIBB Research Initiative Program.