Draft genome sequences data of four Salmonella enterica subsp. enterica serovar Dublin archival strains originating from animals in Poland, 1956 – 1957

Salmonella enterica subsp. enterica serovar Dublin (S. Dublin) is a zoonotic pathogen causing infections in animals, especially in cattle. In this study, we report draft genome sequences of four S. Dublin isolated between 1956 and 1957 from cattle and fox in Poland. Whole genome sequencing was performed on the Illumina platform and the data is available at National Center for Biotechnology Information under the BioProject accession number PRJNA865912. In order to better understand the genetic basis of epidemiology of S. Dublin infection, the obtained sequences were analyzed using the tools which are available at Center of Genomic Epidemiology (https://www.genomicepidemiology.org/) including core genome multilocus sequence typing (cgMLST) and core genome single nucleotide polymorphisms (cgSNPs).

Salmonella enterica subsp. enterica serovar Dublin ( S . Dublin) is a zoonotic pathogen causing infections in animals, especially in cattle. In this study, we report draft genome sequences of four S . Dublin isolated between 1956 and 1957 from cattle and fox in Poland. Whole genome sequencing was performed on the Illumina platform and the data is available at National Center for Biotechnology Information under the BioProject accession number PRJNA865912. In order to better understand the genetic basis of epidemiology of S . Dublin infection, the obtained sequences were analyzed using the tools which are available at Center of Genomic Epidemiology ( https://www. genomicepidemiology.org/ ) including core genome multilocus sequence typing (cgMLST) and core genome single nucleotide polymorphisms (cgSNPs). Table   Subject Biological sciences Specific subject area Microbiology: Bacteriology  Type of data  Genome sequence data, table, figure  How the data were acquired  Whole genome

Value of the Data
• The draft genome data may be useful for estimating the degree of genetic diversity of Salmonella Dublin strains. • The scientists could use these genome data for comparative genome analysis and also identify evolutionary changes among Salmonella Dublin. • The draft genome data allow better understanding of Salmonella epidemiology.

Objective
The incidence of S . Dublin infection in cattle is a major problem from both animal and human health perspective. Whole genome sequencing allows accurate genome analysis of pathogens, invasiveness and pathogenicity mechanisms. This study describes the S . Dublin sequences genome to better understand the evolutionary changes that have occurred over dozens of years.

Data Description
Salmonella enterica serovar Dublin ( S . Dublin) is one of the host-specific serovars adapted to cattle. Infections are detected in both calves and adult animals and can cause substantial losses in livestock production. Symptoms of salmonellosis include diarrhea, fever, loss of appetite, in pregnant cows abortion may occur [1] . S . Dublin was confirmed also in foxes and other furbearing animals [2] and can spread to different species as a result of interspecies transmission   Table 1 . Core genome SNPs were used to create a phylogenetic tree ( Fig. 1 ) of described isolates, the reference S . Dublin (ATCC 39184) and the another 20 genome sequences of cattle S . Dublin [6] available from NCBI database. The comparison of PIW15, PIW16, PIW19 and PIW21 sequences was also presented in the form of a circular map ( Fig. 2 ).

Experimental Design, Materials and Methods
The lyophilized strains were rehydrated by adding 0,5 ml NaCl 0.85% Medium (bioMérieux) to each ampoule, the contents were gently mixed, then 10 μl of each suspensions were streaked onto Xylose Lysine Deoxycholate (XLD) medium and incubated overnight at 37 ±1 °C. Single colonies were passaged overnight at 37 ±1 °C onto nutrition agar and used for further analysis [7] . Strains identity was validated on the basis of biochemical reaction (VITEK System, bioMérieux), matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) using the extraction method following the producer guidelines (Bruker Daltonik GmbH) and serologically according to the White-Kaufmann-Le Minor scheme. Isolation of genomic DNA was executed using Maxwell Rapid Sample Concentrator (RSC) cultured cells DNA Kit (Promega). Sequencing libraries were constructed using the Nextera XT sample preparation kit following the manufacturer's recommendations and evaluated by capillary electrophoresis (Fragment Analyzer). Whole genome sequencing was performed on the MiSeq platform (Illumina) with the MiSeq reagent kit (2 × 300-bp paired-end protocol, to 100 × depth of sequencing). Default parameters were used for all software unless otherwise specified. Raw paired-end reads were quality controlled using FastQC v0.11.5 [8] ( https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ ). Trimmomatic 0.36 [9] was used to trimmed and removaled adapter sequence with the following pa-  To merge the trimmed reads the BBMerge from bbtools software suite was used ( https://jgi. doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbmerge-guide/ ) and assembled using SPAdes v3.9.0 [10] with the "-careful" flag. The genome statistics and annotation of the Salmonella strains were determined using the Quality Assessment Tool for Genome Assemblies (QUAST) [11] and NCBI Prokaryotic Genome Annotation Pipeline (PGAP) [12] . Bioinformatics tools from Center of Genomic Epidemiology (CGE) have been used to determine MLST type (ML ST 2.0) and cgML ST (cgML STFinder 1.2) [ 13 , 14 ]. A phylogenetic tree was created in CSIPhylogeny and visualized in MEGA 6 [15] . The similarity of the sequences is shown on the circular map and generated in Proksee server ( https://proksee.ca/ ).

Ethics Statements
Not required.

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data Availability
Salmonella Dublin isolated from animal source Genome sequencing and assembly (Original data) (National Center for Biotechnology Information) Salmonella enterica subsp. enterica serovar Dublin str. ATCC 39184 chromosome, complete sequence (Reference data) (National Center for Biotechnology Information) GenomeTrakr Project: US Food and Drug Administration (Reference data) (National Center for Biotechnology Information)