Draft genome sequence data of a clinical Enterococcus faecalis isolate SHH039 from a patient with cholecystitis from a tertiary care hospital in Sabah, Malaysia

An Enterococcus faecalis strain SHH039 was isolated from a 68 year old man who was hospitalised with cholecystitis. The genomic sequence of this isolate which had a size of 2,990,081 bp and 2,663 proteins with functional assignments is presented here. Analysis of the genome revealed Enterococcus faecalis with multiple antibiotic resistance genes which may be associated with acute cholecystitis. It may be not clear if the infection symptoms are the consequence of enterococci manifestation. However, this opportunistic organism may play a minor role in the disease.


Specifications
Microbiology Specific subject area Genomic. Type of data Data related to genomic sequence and antimicrobial resistance. How data were acquired Whole genome was sequenced with an Illumina HiSeq 40 0 0 sequencing platform. Data format Raw and analyzed. Parameters for data collection Enterococcus faecalis strain SHH039 was isolated from the gallbladder bile of a 58-year-old male patient with acute cholecystitis. Description of data collection Enterococcus faecalis strain SHH039 was sequenced with Illumina HiSeq 40 0 0 platform and core-SNP based phylogenetic tree was then constructed with other 20 clinical enterococcal isolates extracted from the GenBank public database.

Value of the Data
• The draft genome sequence data analysis revealed the presence of Enterococcus faecalis strain SHH039 which was isolated from a male patient with acute cholecystitis in a local tertiary hospital in Kota Kinabalu, Sabah. • The data is important to understand the presence of Enterococcus faecalis in polymicrobial bacteraemia and may be the aetiology of acute cholecystitis in hospitalized patients as well as if it belongs to the same group as the other human clinical enterococcal infections. • The antibiotic sensitivity and resistance data of Enterococcus faecalis will be useful and potentially help to establish new therapies for variety of enterococcal infections and might reveal insights into these roles.

Data Description
Enterococcus species are commensal inhabitants of the gastrointestinal tract in healthy individuals. Two species under this genera, Enterococcus faecium and Enterococcus faecalis , has a high-level resistance to multiple antibiotics and an active transmission in hospitals and communities [1] . The most common symptoms associated with Enterococcus are endocarditis, urinary tract infections (UTIs), wound infections, sepsis and bacteraemia. However, in the last few years, the emergence of multidrug-resistant enterococci strains has been reported from intra-abdomen or biliary sources indwelling central lines, or soft tissue infections [2 , 3] . It is found to be more prevalent as a component of polymicrobial bacteraemia than other pathogens such as Escherichia coli and other gram-negative bacilli [4][5][6][7] . The draft genomic sequence of Enterococcus faecalis SHH039, which was isolated from the gallbladder bile of a hospitalised male patient with acute cholecystitis in July 2020, is presented here. A phylogenetic tree was generated usingthe core-SNP genomes of these isolates and the genomes of another 20 genomes clinical enterococcal isolates ( Fig. 1 ) obtained from the GenBank public database. The susceptibility tests with the Vitek 2 (bioMérieux, Inc., Durham, NC) system were performed using software version 5.01 and AST-GP2 (enterococci) cards according to the manufacturer's protocol showed that the isolate was ampicillin, gentamycin, nitrofurantoin, vancomycin and linezolid sensitive but resistant to

Bacteria strain isolation, DNA extraction, library preparation and sequencing
The Enterococcus faecalis strain SHH039 was isolated from a 68 year old male patient's gallbladder bile who was hospitalized with acute cholecystitis in July 2020. The isolate was streaked onto blood agar and incubated at 37 °C for 24 h. A single pure culture from the overnight blood agar plate was inoculated on Luria Bertani (LB) broth High Salt (Hazelwood, Mo) and incubated again overnight. The fully automated VITEK2 (BioMérieux, Inc, Hazelwood, Mo) technology was initially utilised to identify the bacterial species and subsequently confirmation was done using 16S rRNA Sequencing. We used the the BigDye Terminator (v3.1) on a ABI3130 DNA Sequencer (Applied Biosystem, USA). The Qiagen DNeasy blood and tissue kit (Qiagen, Valencia, CA) was used to extract the genomic DNA according to the manufacturer's protocol. The Nanodrop 20 0 0c spectrophotometer (ThermoFisher Scientific, USA) and Qubit® 2.0 fluorometer (Invitrogen, Ther-moFisher Scientific, USA) were used to quantify the DNA concentrations. Finally, the NEBnext Ultra kit (New England Biolabs, NEB #E7645) were used to prepare the Whole genome-sequencing libraries and then sequenced with the Illumina HiSeq 40 0 0 platform.

Genome sequencing, assembly, annotation and screening of antimicrobial resistance gene
Using Trimmomatic version 0.39 [8] , all raw reads were pre-processed, adapters were trimmed and reads with fewer than 50 bp were eliminated, based on phred with a quality below Q30. Next the genome was assembled with SPAdes version 3.11.1 [9] and annotated with PGAP (Prokaryotic Genome Annotation Pipeline) [10] . Using the Comprehensive Antibiotic Resistance Database (CARD) version 3.0.3 tools [11] , the antimicrobial resistance gene in Enterococcus faecalis strain SHH039 identified were including dfrE (encoding for dihydrofolate reductase), efrA (encoding for ATP-binding cassette-ABC efflux pump) and tet(M) (associated with tetracycline resistance). The data sequence was deposited in the Sequence Read Archive (SRA) under the bioproject accession number PRJNA680852 (Biosample accession number SAMN16925233).

Phylogenetic tree analysis
A phylogenetic tree was reconstructed based on the comparative sequence analysis of the core-SNP genomes of these isolates with 20 genomes clinical enterococcal isolates extracted from the GenBank public database ( Fig. 1 ). The core-SNP sequences were aligned using kSNP3 tools based on k-mer analysis [12] and phylogenetic inferences were obtained using the maximum-likelihood method in MEGA (Molecular Evolutionary Genetic Analysis) software 8.0 package [13] . Bootstrap analysis of 10 0 0 replicates was used to evaluate the significance of the branching patterns.

Nucleotide Sequence Accession Number
This whole-genome project has been deposited in DDBJ/ENA/GenBank under accession no. JAEFCX0 0 0 0 0 0 0 0 0. The version described in this paper is the first version (JAEFCX0 0 0 0 0 0 0 0 0). The raw sequencing reads have been deposited in the SRA under accession no. SRR13153714.

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.