Draft genome sequences of the oomycete Pythium insidiosum strain CBS 573.85 from a horse with pythiosis and strain CR02 from the environment

Pythium insidiosum is an aquatic oomycete microorganism that causes the fatal infectious disease, pythiosis, in humans and animals. The organism has been successfully isolated from the environment worldwide. Diagnosis and treatment of pythiosis is difficult and challenging. Genome sequences of P. insidiosum, isolated from humans, are available and accessible in public databases. To further facilitate biology-, pathogenicity-, and evolution-related genomic and genetic studies of P. insidiosum, we report two additional draft genome sequences of the P. insidiosum strain CBS 573.85 (35.6 Mb in size; accession number, BCFO00000000.1) isolated from a horse with pythiosis, and strain CR02 (37.7 Mb in size; accession number, BCFR00000000.1) isolated from the environment.


a b s t r a c t
Pythium insidiosum is an aquatic oomycete microorganism that causes the fatal infectious disease, pythiosis, in humans and animals. The organism has been successfully isolated from the environment worldwide. Diagnosis and treatment of pythiosis is difficult and challenging. Genome sequences of P. insidiosum, isolated from humans, are available and accessible in public databases. To further facilitate biology-, pathogenicity-, and evolution-related genomic and genetic studies of P. insidiosum, we report two additional draft genome sequences of the P. insidiosum strain CBS 573.85 (35. 6

Value of the data
Previously, only genome sequence data of P. insidiosum isolated from humans is available in the public databases.
The first draft genome sequences of P. insidiosum isolated from a non-human animal with pythiosis and from the environment are now made available.
The additional genome data will facilitate biology-, pathogenicity-, and evolution-related studies of P. insidiosum, through comparative genomic studies of Pythium species and related species.

Data
Pythium insidiosum is an aquatic oomycete microorganism that causes the lethal infectious condition, pythiosis, in humans and other animals [1,2]. The organism has been isolated from the environment in Australia, Thailand, Brazil and the United States [3][4][5][6]. Genome sequences of P. insidiosum, isolated from humans, are available and accessible in public databases [7,8]. We report two additional draft genome sequences of the organism isolated from a horse with pythiosis, as well as from the environment.

Genome sequencing and assembly
Genomic DNA samples were extracted from P. insidiosum strain CBS573.85 (from an infected horse in Costa Rica) and strain CR02 (from an agricultural area in Thailand), using the conventional extraction method described by Lohnoo and co-workers [9]. rDNA sequence analysis was performed to confirm the identity of the organism [10][11][12]. The extracted genomic DNA of each of these two strains was subjected to preparation of a paired-end library for genome sequencing, using the Illu-minaHiSeq 2500 (strain CBS573.85) or IlluminaHiSeq 2000 (strain CR02) platform (Yourgene Bioscience, Taiwan). Quality trims of the raw reads were executed by CLC Genomics Workbench (www.clcbio.com) to yield read lengths with at least 35 bases. The adaptor sequences were removed by Cutadapt 1.8.1 [13] to obtain a total of 34,651,034 reads with an average read length of 122 bases (4,238,414,330 total bases) for the strain CBS573.85, and a total of 27,436,541 reads with an average read length of 105 bases (2,888,290,738 total bases) for the strain CR02. All reads were assembled by Velvet 1.2.10 [14] to the total sequence length of 35,561,321 bases (number of contigs, 11,223; average contig length, 3169; N 50 , 12,261; 'N' composition, 1.2%) for the strain CBS573.85, and 37,673,126 bases (number of contigs, 22,560; average contig length, 1670; N 50 , 3553; 'N' composition, 2.7%) for the strain CR02. CEGMA analysis with 248 highly-conserved eukaryotic genes [15,16] reported 87% and 77% completeness of draft genomes of the strains CBS573.85 and CR02, respectively. MAKER2 [17] assigned 14,487 (strain CBS573.85) and 15,231 (strain CR02) open reading frames.