Whole genome sequencing and in vitro activity data of Escherichia phage NTEC3 against multidrug-resistant Uropathogenic and extensively drug-resistant Uropathogenic E. coli isolates

This data article describes the whole-genome sequencing and in vitro activity data of Escherichia phage NTEC3 isolated from a community sewage sample in Chandigarh, India. The phage NTEC3 was active against multi-drug-resistant (MDR) and extensively drug-resistant (XDR) biofilm-forming Uropathogenic Escherichia coli (UPEC) strains. The genome of this phage was linear, double-stranded, and 44.2 kb long in size. A total of 72 ORFs (open reading frames) were predicted and 30 ORFs were encoded for functional proteins. The phage belonged to the Kagunavirus genus of the Siphoviridae family. Phylogenetic analysis using DNA polymerase was performed to understand the phage evolutionary relationships. Genes encoding for lysogeny, virulence, toxins, antibiotic resistance, and the CRISPR/CRISPR-like system were not found during screening. The annotated genome was deposited in Genbank under the accession number OK539620.

Microbiology Specific subject area Medical microbiology Type of data WGS and in vitro testing data are provided in tables and figures. How the data were acquired WGS data were generated using Illumina Novaseq 60 0 0 platform Data format Analyzed Description of data collection The phage NTEC3 was isolated from a community sewage water in Chandigarh, India. DNA was extracted and WGS has performed on Illumina Novaseq 60 0 0 sequencer with a paired-end library of a read length of 2 × 150 bp [1 , 2] . The genome was assembled using the strategic k-mer extension for scrupulous assemblies (SKESA v2.4.0) assembler with default k-mer sizes.

Value of the Data
• Data provides genomic information on a lytic phage to researchers for sequence comparison and evolutionary relationship studies. • Genome sequencing and in vitro activity data of the phage can be used by the scientific community for screening and identification of novel phage-based antimicrobial strategies. • Data provides information about a potentially safe product for therapeutics against drugresistant UPEC as the phage lacked genes encoding for lysogeny, virulence, toxins, and antibiotic resistance. • Data provides information about a lytic enzyme endolysin whose sequence could be used to design recombinant endolysins to treat biofilm-associated infections.

Data Description
UPEC strains have a variety of virulence-associated factors (VFs) like adhesins, toxins, siderophores, chaperone-usher (CU) fibers, invasins, and serum resistance-associated proteins that help to invade and injure the host [3] .
Many UTIs causing UPEC strains carry different antibiotic resistance genes like Extended-Spectrum-ß-Lactamases (ESBLs) and Metallo-ß-Lactamases (MBLs) on their chromosomes. UTIs caused by MDR and XDR UPECs account for one of the major therapeutic challenges in the health sector [4] . The widespread decline in antibiotic effectiveness has sparked renewed interest in alternative therapeutics like phage therapy. Phage therapy is primarily based on the use of obligately lytic phages to eliminate their bacterial hosts while leaving human cells unharmed.
Phage NTEC3 was isolated from community sewage water in Chandigarh, India using a clinical strain UPEC 590B as a host bacterium. The NEBNext Ultra kit was used to prepare the se- quencing library, and sequencing was performed on Novaseq 60 0 0 sequencer, yielding 6974384 paired-end (150-bp-long) raw reads ( Table 1 ). The complete genome of the phage was 44.2 kb in length, with a gene density of 1.60/kbp, and a GC content of 51% ( Table 1 , Figs. 1 and 2 ). Out of the 72 predicted ORFs, 21 (27.3%) were found to be present in the direct strand, and the rest were found in the complementary strand ( Fig. 1 ). Forty-four ORFs (61.6%) were predicted to encode for hypothetical proteins whereas 30 ORFs (41 %) were predicted to encode for functional proteins. The functional proteins were categorized into the following groups (i) DNA replication/metabolism-related proteins (ii) Host lysis and adhesion-related proteins (iii) DNA packaging proteins (iv) Structural proteins (Table S1). The genome of this phage lacked genes encoded for temperate phage markers, toxins, virulence factors, antibiotic resistance, or the CRISPR/CRISPR-like system. The ORF 22 of phage NTEC3 encoded for an endolysin that has an intrinsic feature to disrupt biofilm mass and bacterial cell lysis by breaking the peptidoglycan layer [5] . Phage NTEC3 formed plaques of 4-5 mm in diameter ( Fig. 3 ). The phage was active against 24.4% of 45 MDR and XDR UPEC strains resistant to third-generation cephalosporins,  fluoroquinolones, aminoglycosides, beta-lactamase inhibitor combinations, cotrimoxazole, nitrofurantoin and imipenem (Table S2). In the phylogenetic tree, phage NTEC3 was placed in an outgroup of two Siphoviridae family phages Escherichia phage VB_EcoS-Golestan (BLASTP identity > 93.18%) and Escherichia phage vB_EcoS-phiEc3 (BLASTP identity > 92.78%) isolated from wastewater samples in USA and Spain, respectively ( Fig. 4 ).

Experimental Design, Materials and Methods
A flow chart was designed using Cmap server to demonstrate the experimental design and methods ( Fig. 5 ) [6 , 7] .

Phage Isolation
The Escherichia phage NTEC3 was isolated from the community sewage treatment plant in Chandigarh. MDR UPEC 590 strain was used as host bacterium to amplify and propagate phage NTEC3. The raw sewage water samples were centrifuged at 1500 RPM and the supernatant was filtered through a 0.45 μm membrane filter. The filtrate was incubated at 37 °C with different bacterial cultures for 18 h. After incubation, the mixture was centrifuged at 40 0 0 RPM and filtered using a 0.22 μm syringe filter. The lytic activity of the filtrate was evaluated using a spot assay against the respective bacterial strains.

Phage Purification
A clear spot on agar was scrapped and incubated with the host bacterial culture for overnight incubation at 37 °C. A single isolated plaque was picked after repeating the plaque assay experiment thrice. Phage plaque size was measured and expressed in millimeters. Phages were concentrated using the overnight polyethylene glycol (PEG 80 0 0) precipitation method followed by ultracentrifugation (50,0 0 0 RPM) and the ultrafiltration method [8] . Final purification was performed with CsCl gradient and dialyzed against SM buffer (pH 7.5) using tubing membrane (MWCO 12,0 0 0). Host range activity of phage NTEC3 was tested against 45 MDR and XDR UPEC strains using the standardized spot assay method.

Genome Sequencing and Analysis
The DNA extraction was performed with a phage DNA isolation kit (Norgen Bioteck, Canada). The whole genome of the phage was sequenced using the Illumina Novaseq 60 0 0 platform with a paired-end library of a read length of 2 × 150 bp [9] . SKESA v2.4.0 assembler was used to execute de novo assembly with default k-mer sizes [10] . GLIMMER v3.02 and GeneMarkS v4.28 were used to predict genes from the assembled sequence and annotated with PHASTER and RAST server and also searched against BLASTP in the UniProt database [11][12][13] . The CGView server was used to make a circular genomic map of the phage genome [14] . The amino acid sequences of the DNA polymerase gene of similar phages with BLASTp identity of > 90% were used for constructing the phylogenetic tree.

Ethics Statements
This study was approved by the Institute Ethical Clearance Committee of the postgraduate institute of Medical Education and Research (Chandigarh, India).

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.