The chromosome-scale genome of the raccoon dog: Insights into its evolutionary characteristics

Summary The raccoon dog (Nyctereutes procyonoides) is an invasive canid species native to East Asia with several distinct characteristics. Here, we report a chromosome-scale genome of the raccoon dog with high contiguity, completeness, and accuracy. The intact taste receptor genes, expanded gene families, and positively selected genes related to digestion, absorption, foraging, and detoxification likely support the omnivory of raccoon dogs. Several positively selected genes and raccoon dog-specific mutations in TDRD6 and ZP3 genes may explain their high reproductivity. Enriched GO terms in energy metabolism and positively selected immune genes were speculated to be closely related to the diverse immune system of raccoon dogs. In addition, we found that several expanded gene families and positively selected genes related to lipid metabolism and insulin resistance may contribute to winter sleep of the raccoon dog. This high-quality genome provides a valuable resource for understanding the evolutionary characteristics of this species.


INTRODUCTION
The raccoon dog (Nyctereutes procyonoides) is the only species in the genus Nyctereutes. It is a mediumsized canid that prefers various habitats such as gardens, marshlands, river valleys, and damp forests with rich undergrowth (Kauhala and Kowalczyk, 2011). The native distribution area of raccoon dogs includes China, Korea, North Vietnam, Japan, Mongolia, and East Siberia (Pitra et al., 2009;Ward and Wurster-Hill, 1990) ( Figure 1A). The range of this species has continuously expanded over the last 100 years, and currently covers most of Eurasia, making the raccoon dog a prominent invasive species in Europe and posing a great risk to local ecosystems and public health.
There are several distinct evolutionary characteristics in raccoon dogs. This species is reported to have a high reproductive rate with a relatively young age of sexual maturity and large average litter size (8-10 individuals) (Helle and Kauhala, 1995;Kauhala, 1996;Kauhala and Kowalczyk, 2011;Kowalczyk et al., 2009). The raccoon dog is a typical omnivore with a diverse diet that includes food ranging from fruits to insects, frogs, birds, and small mammals (Drygala et al., 2013;Kauhala and Kowalczyk, 2011). This helps raccoon dogs shift their diets according to the different habitats and seasons, making the raccoon dog a species with high plasticity. Another unique habit of raccoon dogs among canids is winter sleep under harsh conditions, which can protect raccoon dogs from food deprivation and cold weather (Asikainen et al., 2004). Before winter sleep, raccoon dogs fatten themselves and their body weight will be correspondingly increased (Nieminen et al., 2002). Unlike typical hibernation, the body temperature of the raccoon dog iScience Article genome assembly (Table 1). We first assembled one primary genome with a contig N50 of 23.99 Mb by using error-corrected PacBio subreads. Then, this primary genome was polished by PacBio subreads and WGS short reads. Finally, Hi-C reads were used for concatenating primary contigs into chromosome-scale assembly. The raccoon dog genome contains 27 pairs of chromosomes (2n = 54), including 26 pairs of autosomes and one pair of sex chromosomes (Nie et al., 2003). The size of the final assembled nuclear genome was 2.38 Gb, and the genome consisted of 218 scaffolds after redundancy removal, with 2.32 Gb assigned to 27 chromosomes (Table S1, Figure S1, and Figure 1B). The scaffold N50 of this chromosome-scale assembly was 41.87 Mb and the GC content of this genome was 41.33%, which is very similar to that of the domestic dog (CanFam3.1, GCA_000002285.2) and red fox (VulVul2.2, GCA_003160815.1) genomes (Table 1).
To evaluate the completeness of the genome, we first performed the Benchmarking Universal Single-Copy Orthologs (BUSCO) (Simã o et al., 2015) analysis by using the mammalia_odb10 database. We found that 96.4% of the 9,226 BUSCO genes were complete in the genome, and only 2.8% and 0.8% were identified as missing and fragmented, respectively (Table S2). To compare with the previously published genome (GCA_905146905.1, Rac 1.0 hereafter), we also used the laurasiatheria_odb10 database to perform the BUSCO analysis (Table S2). The BUSCO score of our genome was slightly higher than that of the Rac 1.0 genome. In addition, 96.28% of the transcript data (kidney) and 99.26% of the WGS short reads were mapped onto our final assembly (Table S3). We also downloaded the RNA-seq data of the Rac 1.0 genome and mapped it against its own genome and the genome we assembled. We found that the mapping rate on our genome (96.24%) was higher than that on the Rac 1.0 genome (85.75%). The above assessments showed that our assembly represents a more complete genome with high quality and contiguity.
To further evaluate the accuracy of our assembly at the chromosome level and detect the fusion and fission events, we performed an interspecies synteny analysis between our assembled genome and the domestic dog genome (CanFam3.1, GCA_000002285.2). Overall, we found high collinearity between our assembled genome and the domestic dog genome with clear one-to-one syntenic blocks ( Figure 1C). We also found nine fusion and three fission events in the synteny result. For example, Chr1 is the largest chromosome of the raccoon dog, which was found to be a fusion of Chr13, Chr14, Chr29, and Chr31 of the domestic dog genome. Chr1, Chr13, and Chr19 of the domestic dog genome were split into two chromosomes in the raccoon dog genome. All these fission and fusion events were surprisingly consistent with the findings of a previous karyotypic study (Becker et al., 2011), indicating the accuracy of our assembled genome at the chromosome level.

Genome annotation
We first identified 835.98 Mb of repetitive elements in our assembled genome, representing 35.11% of the total genome size. These repeat elements included LINEs (22.71%), LTRs (12.00%), DNA elements (2.40%), SINEs (1.23%), and other repeats (0.34%) (Tables S4-S6). Our genome's total repeat length was found to be iScience Article somewhat larger than that of the Rac 1.0 genome (34.04%), demonstrating the superiority of our genome's longer contigs in assembling highly repeated regions.
We sequenced 101.70 Gb of RNA-seq data and generated 175,521 full-length transcripts to support the gene annotation of our genome. We finally identified 20,000 protein-coding genes with high confidence by combining high-quality transcript mapping, homology-based protein alignment, and ab initio predictions. The average exon length, intron length, and gene length were 176.40 bp, 4481.96 bp, and 38.22 kb, respectively ( Figure S6). Among our predicted genes, 14,913 (74.57%) were supported by transcript data. The BUSCO score of our predicted gene set was evaluated as 92.7% with the mammalia_odb10 database, showing high completeness (Table S2). Finally, 19,973 (99.87%) genes were functionally annotated in at least one of the five databases we used (Table S7 and Figure S7). In addition, 781 rRNAs, 995 miRNAs, 2,280 snRNAs, and 146,963 tRNAs were predicted in our genome (Table S8).

Identification of sex-linked regions
Considering that the Rac 1.0 genome was assembled using a female individual, we determined both X-and Y-linked scaffolds with multiple lines of evidence. We first ranked the average sequencing coverage of all 218 scaffolds (Table S9) and found that Chr27 and Scaffold30 presented sequencing coverages with nearly half of the whole genome level ( Figure S8). We then regarded these two genome regions as derived from sex-linked chromosomes. A corresponding low level of genetic diversity was also found in these scaffolds, which further supported our identification of sex chromosomes (Table S10). To further separate the Y-and X-linked regions, we performed synteny analysis between our primarily identified sex-linked regions and the X/Y chromosomes of the domestic dog genome (Figures 2A and 2B). We found 797 and 25 protein-coding genes in the domestic dog X and Y chromosomes, respectively, that were successfully mapped onto Chr27 and Scaffold30 (Table S11) in the raccoon dog genome. The assembled X and Y chromosomes of the raccoon dog presented high collinearity with the X and Y chromosomes of the domestic dog. We therefore concluded that Chr27 (127.44 Mb) and Scaffold30 (3.24 Mb) were the X chromosome and Y-linked scaffold, respectively.
To further validate our identification, we mapped the whole genome resequencing data from 38 individuals of known sex (5 females and 33 males) to our assembled genome (Table S12). As we expected, the sequencing depth of Chr27 and Scaffold30 in the male individuals was nearly half that of the autosomes ( Figures 2C-2E). For the female individuals, however, the depths of Chr27 were nearly the same as those of the autosomes ( Figures 2C-2E). These results further supported that our identification of the sex-linked regions was accurate. This is the first time we identified X and Y sex-linked genomic regions in a raccoon dog reference genome, which will be a valuable resource for future studies.

Expansion of gene families in the raccoon dog genome
To explore the genomic adaptations of raccoon dogs, we performed a comprehensive comparative genomic analysis with 17 other Carnivora and mammalian species (Table S13). We focused on biological characteristics such as reproduction, immunity, omnivory, and winter sleep, which are prerequisites for raccoon dogs to spread and adapt to habitats. To better understand the possible genetic basis of the omnivorous diet, we included herbivores, omnivores, and carnivores.
Evolution of genes associated with omnivory, reproduction, immunity, and winter sleep Raccoon dogs are opportunistic omnivores, as they mainly consume easily found food. Their diet comprises small mammals, insects, fruit, plant seeds and leaves, and carrion comprise (Hirasawa et al., 2006;Mulder, 2012). Toxic secondary metabolites usually exist in plants (Xia et al., 2021) and toxic byproducts are also widely distributed in rotting food (Blumstein et al., 2017). In addition, poisonous substances such as plant alkaloids are typically bitter in taste . Therefore, we focused on detoxification and taste-receptor-related gene families. In the expanded gene families, we found some typical detoxification gene families, including glutathione S-transferase (Hayes and Pulford, 1995;Sayyab et al.), protein tyrosine phosphatase (UGT) (Bock, 2016), and aldehyde oxidase (Chang et al., 2010;. Interestingly, we found that the GST gene family was expanded in the raccoon dog genome in all groups, including omnivores, herbivores, and carnivores ( Figures 4A and 4B). Sweet taste, umami taste, and bitter taste are mediated by the taste receptors TAS1R2/TAS1R3, TAS1R1/TAS1R3, and TAS2Rs, respectively. In this study, we found that TAS1R1, TAS1R2, TAS1R3, and several TAS2R genes were intact and functional in   iScience Article the raccoon dog genome (Table S16). Selection analysis showed that three detoxification-related genes, ABCC12, FM O 1, and DAO, were under positive selection. Feeding behavior and digestion-related genes (GOLGB1, SEC16B, and MLN) were also found to be under positive selection (Table S17).
Although we did not find significantly enriched GO or KEGG pathways that were directly related to reproduction in expanded gene families, five reproductive genes, TDRD6, CDKN2C, ZP3, PLCD4, and TSSK4, were found to be positively selected in the raccoon dog genome. These genes are closely related to spermatogenesis and acrosomal reactions. In the TDRD6 and ZP3 genes, we found that each gene contains three raccoon dog-specific amino acid changes ( Figures 4C and 4D). The mutation N793D is located in the Tudor superfamily domain of the TDRD6 gene, and the S144V mutation was found in the ZP domain region of the ZP3 gene. We then predicted the effects of these mutations on protein function by using the Protein Variation Effect Analyzer (Provean) web server (Choi and Chan, 2015). Five of the six mutations in the TDRD6 and ZP3 genes were predicted to be neutral by Provean, and we did not find damaging effects in the amino acid changes in the domain regions (Table S18). We further predicted the three-dimensional structure of the ZP3 protein. Although these amino acid substitutions did not change the main structure of this protein, the physicochemical properties of the amino acids changed, which may contribute to the high reproduction of the raccoon dog ( Figure 4E).
By comparison with the above mentioned 17 species, we screened 206 genes that were under positive selection. Thirty genes, accounting for 14.56% of the total positively selected genes (PSGs), were immune-related, including IL5, IL25, CCL20, CD99L2, HIF1A, and NEDD9 (Table S19). We also performed a comparative genomic analysis on all 30 genes with other canines. Thirty-five (35) amino acid changes harbored in 18 genes were found to be raccoon dog-specific changes, and 21 mutations were found to be located in functional domain regions (Table S20, Figure S9). To test the reliability of these mutations, we checked the read mapping of these mutations and found that all mutations were real and well supported by read mapping. Further inspection found that 32 of these mutations were fixed in the population  (Table S21). We also found that 10 tumor suppressor genes (RERG, BRCA1, FETUB, RNF20, MY O 18B, RBM5, BMP3, LACTB, UNC5A, and PCK1) (Table S22) and 6 energy metabolism-related genes (CDKAL1, TRIM63, GALNT13, PRKAG3, RPUSD4, and MRPL19) were under positive selection (Table S23).
To further explore the possible genetic basis of winter sleep in raccoon dogs, we performed selection analysis by comparing raccoon dogs with other 14 other non-hibernating animals (Table S24). In total, we found that 194 genes were under positive selection (Table S25). As expected, we found 21 immune-related genes (Table S26) and five energy metabolism-related genes (Table S27). Genes with feeding behaviors were also found to be under positive selection (HCRTR1, GOLGB1, and MLN) (Table S28). In particular, we found three genes related to insulin secretion (STXBP5L) and lipid metabolism (SEC16B and BSCL2) (Table S28).

Population history and genetic diversity
Although the genetic diversity of raccoon dogs has been explored by using mtDNA and microsatellite loci (Hong et al., 2018;Slaska and Grzybowska-Szatkowska, 2011;Slaska et al., 2010), this is the first study reporting genetic diversity evaluation at the whole-genome scale. To facilitate comparison with other species, we collected the heterozygosity (H) data of 17 other species, including the Felidae, Canidae, and some other invasive species (Table S29). The genome-wide H of the raccoon dog was identified as 0.28 ( Figure 5A). This H is lower than some other invasive species; however, it is obviously higher than many Felidae and Canidae species listed in this study ( Figure 5A and Table S29).
We reconstructed the history of effective population size (Ne) of the raccoon dog from $1.6 million years ago (Mya) to nearly 1,000 years ago (kya), by combining the PSMC (Li and Durbin, 2011), MSMC2 (Schiffels iScience Article and Durbin, 2014), and SMC++ (Terhorst et al., 2017) methods. The PSMC result showed that the Ne gradually declined from $1.6 Mya to $320 kya and then experienced an expansion from $320 kya to $100 kya. The Ne was relatively stable between $100 kya and 50 kya, and the population declined again to 10 kya ( Figure 5B). The PSMC usually shows poor performance for inferring recent population history (Li and Durbin, 2011). We therefore used MSMC2 to infer a more recent population history, and the results showed that Ne presented a continuous decline from 50 kya to 1.0 kya without any rebounding ( Figure 5C). Since the switch error rate induced by phasing influences the accuracy of MSMC2, we further validated the results from MSMC2 by using SMC++, which is independent of the phasing process. As we expected, the trend in Ne inferred by SMC++ was extremely similar to that inferred by MSMC2 ( Figure 5D).

Improved genome assembly and annotation
A chromosome-scale genome assembly with high completeness, accuracy, and contiguity can provide a solid foundation for genetic research (Rhie et al., 2021). Although a draft genome of the raccoon dog has been recently reported (Chueca et al., 2021), the authors used a female individual for genome assembly, which lacked data for the Y chromosome. The raccoon dog genome assembly needs to be further improved. Here, we report a comprehensively improved chromosome-scale genome by combining PacBio and Hi-C technologies for this species. This genome showed 1.19-fold and 1.55-fold improvements in N50 values of the contig and scaffold over those of the Rac 1.0 genome (Chueca et al., 2021), respectively, indicating substantial improvement in this chromosome-scale genome assembly in terms of contiguity. The karyotypic study of the raccoon dog showed 26 pairs of autosomes, one pair of allosomes, and 2 to 3 B chromosomes (Nie et al., 2003). Here, we identified 27 chromosomes by Hi-C scaffolding, which was consistent with the karyotyping result. Surprisingly, all fission and fusion events between the domestic dog and raccoon dog genomes were highly consistent with those found in a previously published karyotypic study (Nie et al., 2003), indicating the high accuracy of this assembly at the chromosome level. In addition, we identified a 127.44 Mb X chromosome and 3.24 Mb Y-linked region with multiple lines of evidence. This is the first time we identified a genome-wide Y-linked region in the raccoon dog. The genome size estimated in this study was $3.21 Gb ( Figure S10), which is similar to that estimated by flow cytometry (Chueca et al., 2021), indicating the existence of B chromosomes. However, we failed to assemble the B chromosomes probably due to clusters of telomeric sequences along the whole B chromosome (Bugno-Poniewierska et al., 2014). For the gene annotation, although the gene number we identified is much less than that in the Rac 1.0 genome, the number we annotated is more comparable with those of other Canidae species (Kukekova et al., 2018;Peng et al., 2021). Altogether, we presented a much-improved raccoon dog genome, which will be a valuable resource for future studies.

Genetic basis for the omnivorous diet of the raccoon dog
The raccoon dog is a typical omnivorous canid species. We found that genes encoding receptors that mediate sweet, bitter, and umami tastes all exist in the raccoon dog genome and are structurally intact. As in humans, the most typical representatives of omnivores, these functional taste receptor genes provide the basic genetic basis for the omnivorous diet (Gravina et al., 2013). The sweet and umami taste receptors help the raccoon dog to forage fruits and meat, and the bitter taste receptor plays an important role in recognizing poisonous substances such as plant alkaloids that are typically bitter in taste . However, many toxic secondary metabolites in plants (Xia et al., 2021) and toxic byproducts in rotting food (Blumstein et al., 2017) cannot be tasted. GST and CYP450 are reported to be the main enzymes that catalyze the phase I and phase II reactions of many toxic substances in organisms . The expanded gene families in the raccoon dog were enriched in one detoxification-related GO term (drug metabolic process) and two KEGG pathways (drug metabolism-cytochrome P450 and glutathione metabolism). The cytochromes P450 (CYP) superfamily is a large enzymatic protein family in both animals and plants that participates in multiple and critical physiological processes (Danielson, 2002;Xu et al., 2015). One of the most important functions of the CYP450 superfamily is the degradation of toxic substances, which has been reported in many species (Gui et al., 2020;Li et al., 2018). Glutathione (GSH) plays an important role in detoxification by forming GSH-toxin conjugates (either with xenobiotic or endogenous electrophilic compounds) (DeLeve and Kaplowitz, 1991;Ketterer et al., 1983) ( Figure 4A). Although some conjugations can occur spontaneously, conjugations of GSH to a wide variety of toxic substances are catalyzed by GST, a powerful detoxification enzyme (DeLeve and Kaplowitz, 1991). Interestingly, we found that the GST gene family was significantly expanded in the raccoon dog genome compared with omnivorous, herbivorous, and carnivorous animals. GST can affect the activity of many antioxidant enzymes to help ll OPEN ACCESS iScience 25, 105117, October 21, 2022 9 iScience Article remove contaminants and reactive oxygen species (Yao et al., 2020). In addition, we also found that the ABCC12 gene was under positive selection compared with other canids. The ABCC12 gene is a member of the ABC gene family, and the proteins encoded by this gene family play a role in detoxification (Dermauw and Van Leeuwen, 2014). Although the exact function of ABCC12 is unknown, this gene is a member of the ABCC subfamily, which belongs to the group of multidrug resistance protein (MRPs) (Ono et al., 2007;Whitlock and Leslie, 2020), which play important roles in the function of GST pumps (Muller et al., 1996). Taken together, from these results, we inferred that GST, which is involved in glutathione metabolism, plays an important role in physiological processes associated with detoxification, perhaps even being more important than the CYP 450 superfamily in raccoon dogs. We also found two other detoxification-related genes that were under positive selection, FM O 1 and DAO. FMO genes can be found across the whole animal kingdom, in which they play roles in the detoxification of xenobiotics by catalyzing the conversion of heteroatom-containing chemicals to excretable metabolites (Sehlmeyer et al., 2010). The DAO gene encodes diamine oxidase, which also plays an important role in dietary histamine detoxification to reduce the uptake of histamine by enterocytes (Yoshikawa et al., 2019). All these detoxification-related genes, gene families, and pathways are possible candidates for the genetic basis of the raccoon dog's broad diet.

High reproduction rate from a genetic perspective
The raccoon dog has a high reproductive rate with an average litter size of 8-10 individuals (Helle and Kauhala, 1995;Kauhala, 1996;Kauhala and Kowalczyk, 2011;Kowalczyk et al., 2009), which is larger than that of the red fox, its close relative (Kauhala, 1996). Although we did not find significantly enriched GO and KEGG pathways that are directly related to reproduction in the raccoon dog genome, we still found several reproduction-associated genes that were under positive selection. The TDRD6 (Vasileva et al., 2009), INK4C (Zindy et al., 2001), and TSSK4  genes were reported to be involved in spermatogenesis. The development of spermatids from round to elongated morphology will be abrogated in mice with TDRD6 gene knockout (Vasileva et al., 2009). INK4C and INK4D are two essential genes for male fertility and collaborate in regulating spermatogenesis. The deletion of INK4C and INK4D in mice results in male infertility (Zindy et al., 2001). Mutation in the TSSK4 gene was also found to be harmful to spermatogenesis in infertile Chinese men (Su et al., 2008). The ZP3 (Wassarman, 1999) and PLCD4 (Fukami et al., 2001) genes play key roles in the acrosome reaction. The acrosome reaction is the fusion process of sperm and egg cells, and ZP3 is considered the sperm receptor of the oocyte and triggers the acrosome reaction (Jungnickel et al., 2007); PLCD4 is responsible for mediating this zona pellucida-induced acrosome reaction (Fukami et al., 2001). We speculated that these genes may help to maintain high reproduction in raccoon dogs, especially for males. The raccoon dog-specific amino acid changes were also found in the ZP3 and TDRD6 genes ( Figures 4C and 4D), which may also play positive roles in the function of spermatogenesis and acrosome reaction. However, experimental analysis is necessary for further validation. In addition, many of these genes were pleiotropic (Lutful Kabir et al., 2015;Sayyab et al., 2016), and these genes and mutations were only candidates for future validation and studies on the genetic basis of high reproduction of raccoon dogs.

Genomic basis for the diverse immune system
The raccoon dog is a well-known reservoir host for many pathogens (Aoyagi et al., 2000;Guan et al., 2003;Kauhala and Kowalczyk, 2011;Kjaer et al., 2021;Qi et al., 2009;Song et al., 2019), but raccoon dogs with many pathogens are often asymptomatic, without signs of illness Yang et al., 2021b). We expected that the raccoon dog has a diverse immune system. We found several significantly enriched KEGG pathways related to the immune system. The base excision repair pathway is essential for the diversification of antigen receptors that shape the adaptive immune response (Stratigopoulou et al., 2020). Necroptosis can release danger signals to provoke the immune system for the clearance of pathogens, and can affect innate immunity by actively inducing the death of infected cells (Cho, 2020). The spliceosome is reported to have links with immune signaling, and some core spliceosome components present immune functions in cells (Yang et al., 2021a). The proteasome is considered necessary for the important immune functions of activated CD4 + T cells and proteasome inhibition suppresses unwanted and deregulated immune responses mediated by T cells (Berges et al., 2008). Although the primary functions of these enriched KEGG pathways are not immune functions, it is still possible to improve the immune system of raccoon dogs. Interestingly, a large proportion (14.56%) of PSGs were found to be immune-related, which was much large than the proportions for other functions. The functions of these genes are widely related to many innate and adaptive immune system processes (Table S19) iScience Article acid changes were found in 18 of the 30 immune genes and seemed to be retained by positive selection (Tables S20 and S21 and Figure S9). Further population genomic analysis showed that most of these amino acid changes were fixed in the population, even though the genetic diversity of the raccoon dog was high ( Figure 5A), indicating the potential advantages of these mutations to raccoon dog survival. We did not predict the effects of these mutations on the protein function, because it is hard to conclude that a single mutation is harmful or helpful to the function of a certain protein. However, multiple mutations in a single gene may collaborate to aid their immune adaptation to new habitats, but this prediction should still be experimentally validated in future works. Although certain types of cancer have been reported in raccoon dogs, we did not find a comprehensive study on the investigation of the cancer incidence statistics in this species. A series of tumor suppressor genes found in the PSGs indicated the possible strong cancer survival ability of raccoon dogs.
It is energetically very demanding for the immune system to defend against pathogens. The synthesis of a cytokine requires the breakdown of 1,150 glucose molecules to generate 2,300 ATP molecules (Straub et al., 2010). The energy demand of the immune system usually represents 25%-30% of basal metabolism (Hortova-Kohoutkova et al., 2021;Straub et al., 2010). Balancing the energetic trade-off between immunity and other physiological energy-consuming processes is critically important for animal survival (Ganeshan et al., 2019). The cost of the immune response for innate and adaptive immunity is variable. The systemic innate immune response is the most resource-demanding, followed by cell-mediated immunity, and humoral responses cost the least when compared with innate and cell-mediated defenses due to inflammation being rarely induced (Lee and Klasing, 2004). A strategy to favor a less costly adaptive immune system and reduce the high resource-demanding innate immune response should be beneficial for allocating more energy for reproduction and growth (Lee and Klasing, 2004). However, we found that most PSGs (immune-related) were involved in innate immunity and induced an inflammatory response with fever, a loss of appetite and fatigue, which was energetically expensive. The large number of expanded gene families and PSGs related to energy metabolism in the raccoon dog genome may play important roles in supplying energy for their immune response.

Several genetic factors may be jointly involved in the winter sleep of raccoon dogs
The most unique characteristic of raccoon dogs among canids is winter sleep (Asikainen et al., 2004). Raccoon dogs will fatten themselves before winter sleep (Nieminen et al., 2002). First, the omnivorous diet is very conducive to raccoon dogs to foraging and fattening. Except for taste receptor and detoxification-related genes, the two enriched olfactory transduction and salivary secretion pathways were also closely related to foraging behaviors and digestion (Jiahuan et al., 2018;Pedersen et al., 2018). The GOLGB1 gene plays a crucial role in the development of the mammalian palate (Lan et al., 2016). The HCRTR1 gene is the receptor gene of orexin, which is involved in foraging behavior and the intake of energy-dense food (Barson, 2020). The MLN gene encodes a small peptide hormone, motilin, which is secreted by the small intestine and regulates gastrointestinal motility and contractions, and stimulates hunger signaling (Tack et al., 2016). A genome-wide association study also detected a significant variant that was closest to the MLN gene in a study of the seasonal onset of hibernation in the 13-lined ground squirrel (Grabek et al., 2019). In addition, we also found that several lipid metabolism-related genes were under positive selection. The SEC16B gene in mice is required for lipid absorption, and is reported to be closely related to obesity in the human population (Sahibdeen et al., 2018;Shi et al., 2021). The BSCL2 gene is also involved in lipid metabolism, and the deletion of BSCL2 in mature white and brown adipose tissue triggers cAMP/PKA-mediated lipolysis and fatty acid oxidation resulting in adipose tissue loss (Zhou et al., , 2022. These genes may facilitate intensive foraging and fat accumulation before winter sleep. Unlike typical hibernation, the body temperature of raccoon dogs is close to normal during winter sleep (Asikainen et al., 2004). We found that the expanded gene families were enriched in thermogenesis (map04714, p = 8.36E-10) which may help raccoon dogs to maintain a near-normal body temperature. During winter sleep, fuel use shifts from glucose to lipids. Bears exhibit insulin resistance during hibernation and regain insulin sensitivity in the spring (Rigano et al., 2017). The positively selected gene STXBP5L is reported to be a negative regulator of insulin secretion, which may be involved in the usage shift of fuel in raccoon dogs during winter sleep. In addition, a large number of PSGs related to immunity may play an important role in preventing raccoon dog from becoming infected with the pathogen during winter sleep.

OPEN ACCESS
iScience 25, 105117, October 21, 2022 11 iScience Article Historical declining population and currently high genetic diversity Although the raccoon dog is considered a successful invasive species with a strong ability to survive, its effective population size was detected to be continuously declining from $50 kya. The population decline accelerated from $28 kya to 11 kya, which may be caused by the harsh climate during the Last Glacial Maximum (Denton et al., 2010). However, despite the climate becoming warmer at the beginning of the Holocene, the effective size of the raccoon dog population was still declining. Considering the frequent human activity within the most recent 10 kya, we speculate that the population decline of the raccoon dog in its evolutionary history might in part be due to human activities and climate change. In addition, changes in predator and prey abundance, habitat connectivity, and habitat productivity may also influence the fluctuation of the raccoon dog population.
Generally, population size is positively correlated with the level of genetic diversity (Ouborg et al., 2010). Interestingly, the genetic diversity of the raccoon dog is high, which was consistent with previous reports (Lavrov, 1971;Pitra et al., 2009), indicating a current healthy population. The historical long-term population decline of this species seems to indicate that this species may not be as tolerant and plastic as we expected for such a successful invader. The success of invasion of a certain species is not directly related to its population size but is closely related to the ''preadapted'' characteristics and post-invasive adaptation (North et al., 2021). We cannot investigate the post-invasive adaptation of raccoon dogs after invasion due to the lack of global samples. The ''preadapted'' biological characteristics, such as an omnivorous diet, diverse immune system, high reproduction rate, and winter sleep, could contribute to their successful invasiveness, even though raccoon dogs are experiencing a long-term population decline.

Limitations of the study
Although we assembled a high-quality chromosome-scale genome of the raccoon dog, we did not assemble the B chromosomes, which may contribute to important biological functions. In addition, functional analysis should be further performed to validate the candidate genes and raccoon dog-specific mutations we found in this study to be associated with omnivory, reproduction, and immunity.

STAR+METHODS
Detailed methods are provided in the online version of this paper and include the following: