Genome Sequencing of Sub-Arctic Mesomycetozoean Sphaeroforma sirkka Strain B5, Performed with the Oxford Nanopore minION and Illumina HiSeq Systems

The Mesomycetozoea branch near the animal-fungal divergence and are believed to be important to understanding the origins of multicellularity. In 2012, a free-living saprotrophic mesomycetozoean was isolated from the sub-Arctic Bering Sea.

T he branching of the Mesomycetozoea near the animal-fungal divergence makes them significant organisms for unraveling the origins of multicellularity (1,2). Today, only a few Mesomycetozoea genomes have been deposited in public repositories, all of which are animal symbionts. In 2012, a free-living saprotrophic mesomycetozoean strain, Sphaeroforma sirkka B5, was isolated from pollen grains in a nearshore estuarine environment of the sub-Arctic Bering Sea.
We performed a hybrid assembly using long-read Oxford Nanopore minION reads for scaffolding and Illumina HiSeq sequencing for error correction. DNA was extracted using the Mo Bio PowerFood (PF) kit from 48-h-old cells grown on PmTG agar medium with penicillin G and streptomycin (3). DNA was extracted by bead beating twice for 10 min. Lysates were combined on spin filters and subsequently washed twice with PF4 solution. The resulting DNA fragments ranged from 2 kbp to 30 kbp. We purified 5 g of extracted DNA with AMPure magnetic beads with a bead-to-DNA (vol/vol) ratio of 0.4. We immediately prepared a DNA library with the Oxford Nanopore technology 1D ligation kit (SQK-LSK108) using an optimized protocol. Specifically, an input of 3.7 g of DNA was used (the shearing step was omitted), and all incubations at 20°C were performed at room temperature. Magnetic bead clean-ups were performed with freshly prepared 80% ethanol solution, and the DNS control strain (DCS) was replaced by 5 l of nuclease-free water. Sequencing was performed on Oxford Nanopore Mk1 minION system using a R9.4 SpotON FLO-MIN106 flow cell and the 48h_1D_LSK108 script. The device was controlled by minKNOW_1.5.12 implemented on a MacOS system. The library generated 2.19 ϫ 10 6 reads, with an average size of 6 kbp. The Illumina HiSeq sequencing library was prepared using the TruSeq library kit and loaded on one lane of an Illumina HiSeq 2500 rapid run flow cell (v2). Sequencing was performed with HiSeq Rapid SBS reagents (v2) in a 2 ϫ 100-bp paired-end format.
Nanopore reads were basecalled with Albacore 1.2.4, providing 13.04 Gbp of data subsequently filtered with NanoFilt (4), with a minimum Q score of 10. The final data set (1,141,813 reads with an N 50 value of 7,793 bp, for a total of 7,200,406,494 bp) was assembled using the high-noise robust single-molecule sequencing assembler Canu version 1.7 (5), assuming a genome size of 100 Mbp. Raw contigs were corrected with two iterations of the standalone consensus module Racon. For polishing, two rounds of Pilon (6) were performed with the adapter-trimmed (bbduk and bbtols package) HiSeq data set. Homology search with BLAST on the resulting contigs revealed no contamination. The assembled genome of Sphaeroforma sirkka consists of 2,688 contigs (all greater than 500 bp) with N 50 /N 90 values of 76,469/21,989 bp and a total length of 125,635,304 bp. The average contig coverage of the final assembled genome is 180ϫ. Genome completeness was evaluated with BUSCO_V3 (7) using the eukaryote gene set odb9 and Saccharomyces (assessment represented in BUSCO notation), C:84.2% [S: 82.5.3%, D:1.7%], F:5.3%, M:10.5%, n:303. The mitochondrion was assembled into a single contig, for a total length of 194,049 bp.
Data availability. This whole-genome shotgun project has been deposited in GenBank under the accession number LUCW00000000. The version described in this paper is the third version, LUCW03000000.

ACKNOWLEDGMENTS
This work was supported by the University of Alaska's Center for Global Change and Arctic System Research and the National Science Foundation grant DGE-0801720. The funders had no role in the study design, data collection, or the decision to submit this work for publication.