Draft genome sequence of Japanese wood mouse, Apodemus speciosus

The wood mouse (genus Apodemus) is one of the most common rodents in broad-leaf forests in the temperate zone of the Palaearctic region. Molecular studies of wood mice have critically enhanced the understanding of their evolution and ancestral biogeographic events. However, their molecular data are currently only limited to partial mitochondrial sequences and a few genes. Therefore, we sequenced the wood mouse genome to facilitate the acquisition of useful resources for inferring their molecular evolution. We sampled a wild wood mouse at Tsukuba, Japan, and sequenced its whole-genome using the Illumina Hiseq. 2000. To reduce the risk of non-randomness, three paired-end libraries (insert sizes: 150, 300, and 500 bp) and, two mate-pair reads (insert sizes: 8 and 20 kbp) were constructed. In total, we generated approximately 210 Gbp data. From these sequences, we reconstructed 336,124 scaffolds. These data will enhance our understanding of the evolution and ecological factors that affect their genetic constitution. The genome scaffolds generated are available in the National Center Biotechnology Information (NCBI) BioProject with accession number PRJDB5914.


a b s t r a c t
The wood mouse (genus Apodemus) is one of the most common rodents in broad-leaf forests in the temperate zone of the Palaearctic region. Molecular studies of wood mice have critically enhanced the understanding of their evolution and ancestral biogeographic events. However, their molecular data are currently only limited to partial mitochondrial sequences and a few genes. Therefore, we sequenced the wood mouse genome to facilitate the acquisition of useful resources for inferring their molecular evolution. We sampled a wild wood mouse at Tsukuba, Japan, and sequenced its whole-genome using the Illumina Hiseq. 2000. To reduce the risk of non-randomness, three paired-end libraries (insert sizes: 150, 300, and 500 bp) and, two mate-pair reads (insert sizes: 8 and 20 kbp) were constructed. In total, we generated approximately 210 Gbp data. From these sequences, we reconstructed 336,124 scaffolds. These data will enhance our understanding of the evolution and ecological factors that affect their genetic constitution. The genome scaffolds generated are

Experimental features
Freshly collected healthy liver was used for total DNA extraction

Data source location
Tsukuba, Ibaraki, Japan

Data accessibility
The genome scaffolds generated are available in the National Center Biotechnology Information (NCBI) BioProject with accession number PRJDB5914.

Related research article
Not applicable

Value of the Data
The Japanese wood mouse could be a model mammal for investigating the natural histories of organisms inhabiting the Japanese Archipelago.
We sequenced the Japanese wood mouse genome to facilitate the acquisition of useful resources for inferring the molecular evolution of this species.
This data allows other researchers focusing on this species to start genome-wide analysis.

Data
The genus Apodemus, which is a member of the subfamily Murinae, is one of the most species rich taxa of the murine rodents, next to Rattus and Mus, which are the first and second speciose genera, respectively [1]. Species belonging to Apodemus, known as field or wood mice, are distributed in the temperate region of the Eurasian continent including the Japanese Archipelago. According to previous molecular phylogenetic analyses, Apodemus diversified in Asia and Europe, around the early Pliocene and early Pleistocene eras, respectively [2]. Deciphering the genome sequences of this genus would facilitate our understanding of the evolutionary trait of the group adapted to the temperate environment, with the aid of those of Rattus and Mus. The genome of a representative species of Apodemus from Europe, Apodemus sylvaticus, was sequenced by a research group at the Liverpool University according to the JGI GOLD database (https://gold.jgi.doe.gov/; project ID ¼ Gp0143296), but its project status is "permanent draft." In Asian species, no genome sequence data are currently available.
The large Japanese wood mouse, Apodemus speciosus, is widely distributed in Japan from Hokkaido to Kyushu, including the remote peripheral islands. This species inhabits forests and grasslands, eats insects and acorns, and shows seasonal breeding in the wild. A recent phylogeographic analysis revealed that this species experienced a rapid population growth in response to the effect of the Quaternary ice ages [3]. Thus, the species could be a model mammal for investigating the natural histories of organisms inhabiting the Japanese Archipelago [4]. We sequenced the Japanese wood mouse genome to facilitate the acquisition of useful resources for inferring the molecular evolution of this species.
The genome was assembled using the ALLPATHS-LG program using two-step processes of assembling and scaffolding. The paired-end reads were first assembled into contigs, which were then joined to form concatenated scaffolds by using mate-pair reads. We reconstructed 647,983 contigs (N 50 is 3.5 kbp). These contigs were joined in 336,124 scaffolds (Table 2), and the N 50 value of the scaffolds was 47 kbp. The wood mouse genome sequenced is a potentially useful resource for the future molecular evolutionary analysis of rodents.

Sample preparation and sequencing
We sampled the A. speciosus at Tsukuba, Japan (NIES ID; IB14-021, male). All rodent experiments were conducted in accordance with the guidelines for using wild mammals of the Mammal Society of Japan [5] and the rules of the National Institute for Environmental Studies for analysis and experimentation with environmental samples contaminated with radioactive materials. Genomic DNA was extracted from the liver using proteinase K and phenol:chloroform:isoamyl alcohol [6]. The library preparations and sequencings were conducted by the Eurofin Sequencing Service (Eurofin MWG Operon, Germany). Paired-end sequencing of each 100-bp end of the DNA fragments was performed using the Hiseq.