Allele surfing causes maladaptation in a Pacific salmon of conservation concern

Quentin Rougemont; Thibault Leroy; Eric B. Rondeau; Ben Koop; Louis Bernatchez

doi:10.1371/journal.pgen.1010918

Abstract

How various factors, including demography, recombination or genome duplication, may impact the efficacy of natural selection and the burden of deleterious mutations, is a central question in evolutionary biology and genetics. In this study, we show that key evolutionary processes, including variations in i) effective population size (N_e) ii) recombination rates and iii) chromosome inheritance, have influenced the genetic load and efficacy of selection in Coho salmon (Oncorhynchus kisutch), a widely distributed salmonid species on the west coast of North America. Using whole genome resequencing data from 14 populations at different migratory distances from their southern glacial refugium, we found evidence supporting gene surfing, wherein reduced N_e at the postglacial recolonization front, leads to a decrease in the efficacy of selection and a surf of deleterious alleles in the northernmost populations. Furthermore, our results indicate that recombination rates play a prime role in shaping the load along the genome. Additionally, we identified variation in polyploidy as a contributing factor to within-genome variation of the load. Overall, our results align remarkably well with expectations under the nearly neutral theory of molecular evolution. We discuss the fundamental and applied implications of these findings for evolutionary and conservation genomics.

Author summary

Understanding how historical processes, such as past glaciations, may have impacted variations in population size and genetic diversity along the genome is a fundamental question in evolution. In this study, we investigated how recent postglacial demographic expansion has affected the distribution of deleterious genetic variants and the resulting deleterious mutation load in Coho salmon (Oncorhynchus kisutch), throughout its native range in North America. By sequencing the entire genome of 71 Coho salmon, we reveal that postglacial expansion has led to allele surfing, a process where alleles increase in frequency in populations that are expanding or colonizing new environments. Here, allele surfing resulted in an increased deleterious mutation load at the colonization front. Furthermore, we demonstrated that the efficacy of natural selection scales with variation in effective population size among populations. We showed that the specific genomic features of Coho salmon, namely variation in local recombination rate and variation in chromosomal inheritance, strongly impacted the segregation of deleterious mutations.

Citation: Rougemont Q, Leroy T, Rondeau EB, Koop B, Bernatchez L (2023) Allele surfing causes maladaptation in a Pacific salmon of conservation concern. PLoS Genet 19(9): e1010918. https://doi.org/10.1371/journal.pgen.1010918

Editor: Nicolas Bierne, CNRS UMR5554, FRANCE

Received: November 11, 2022; Accepted: August 11, 2023; Published: September 8, 2023

Copyright: © 2023 Rougemont et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: New sequencing data are currently being deposited on NCBI SRA under project PRJNA982773. The coho vcf is available online at 10.5281/zenodo.8027705. All scripts to reproduce our analyses are currently being deposited online at https://github.com/QuentinRougemont/selection_efficacy.

Funding: This research was supported by a Genome Canada strategic grant entitle EPIC4–Enhanced Production in Coho: Culture, Community Catch (grant ID: 229COH, Genome Canada) attributed to L.B. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Demographic events, including population contractions, expansions and secondary contacts have profound impacts on the spatial distribution of genetic diversity within species [1]. Range expansions can be accompanied by multiple founder events when few individuals colonize new areas. At the wave front, this process results in increased genetic drift and a reduced effective population size [2, 3]. This has multiple consequences genome-wide including (i) extended linkage disequilibrium [4], (ii) a loss of genetic diversity and (iii) increased levels of genetic differentiation [2, 3, 5]. At the wavefront, increased genetic drift favors allele surfing, a process that will increase the relative proportion of neutral, deleterious or advantageous mutations that will fix in the population [5–7]. Such allele surfing is expected to have two main negative consequences at the wave front: i) an increase of the deleterious load (called expansion load [8, 9]) and ii) a loss of adaptive potential [10].

First, allele surfing through increased genetic drift can overwhelm the effect of weak purifying selection, resulting in an increase of the deleterious load [11]. Due to the lower efficacy of natural selection in low N_e populations, a greater fraction of slightly deleterious mutations is expected to be present in populations at the wavefront (i.e., elevated ratios of non-synonymous to synonymous nucleotide diversity (π_N/π_S) [8, 9]). Moreover, under the nearly neutral theory of molecular evolution, the reduced efficacy of purifying selection in small populations should increase the fixation rate of slightly deleterious mutations [12]. On the other hand, given that most deleterious mutations are expected to be recessive, increased homozygosity of deleterious mutations should enable their removal more efficiently through genetic purging, leading to a reduction of the recessive and additive load [13]. To take this into account, a possible strategy is to investigate both the additive (i.e,. total) and the recessive (i.e., fixed) deleterious mutation load. [8]. Evidence for an increased recessive genetic load has been reported in humans which has been associated with the Out-of-Africa bottleneck [14, 15]. Evidence has also been gathered in expanding populations in plants [16–20]. For instance in Arabidopsis lyrata, range expansion has been associated with an increased mutational load [16] and increased linkage disequilibrium [21]. Similar results were observed in Escherichia coli [22]. In contrast, a recent study in A. lyrata found no evidence of increased load following range expansion [23].

Second, allele surfing is expected to lead to a loss of adaptive potential due to a reduction in the rate of adaptive substitutions given that the supply of new mutations is proportional to N_e [24]. Unless compensatory mutations (i.e., favourable mutations whose probability of occurrence increases to compensate for the fixation of slightly detrimental mutations–themselves with a higher probability of fixation in small populations–), counteract this effect in populations with a low N_e [25], one expects a lower proportion of substitutions driven by positive selection [26]. Support for a decreased rate of adaptation associated with range expansion was documented, particularly in plants [17, 20]. For instance, such a pattern was reported in the North American populations of A. lyrata[17] [27], but not in the European ones despite a slight increase in their genomic burden [23]. Similarly an increased load is observed in human populations, a result that matches theoretical expectations under the out-of-Africa scenario [28] but with limited evidence of a decrease in the efficacy of natural selection [29]. Beyond application to plants, humans and bacteria, there is a lack of empirical studies documenting the consequences of range expansion on the deleterious mutation load and the efficacy of natural selection.

Geographic variation in the overall mutation load is not the only interesting pattern to investigate; the genomic scale is also important. Both deleterious load and efficacy of selection are expected to vary along the genome depending on the local recombination rate [30, 31]. Non-recombining regions are expected to more freely accumulate deleterious mutations, a process called Muller’s Ratchet [32]. Moreover, Hill-Robertson interference, a process whereby competing alleles interfere with each other to become fixed, increases the fixation rate of deleterious variants linked to positively selected sites. Such a process will be exacerbated in the absence of recombination [30, 33].

In species that have undergone a whole-genome duplication (WGD) with associated residual polyploidy, variation in chromosomal inheritance and genomic architecture may affect the distribution of deleterious mutations along the genome [34]. By doubling the chromosome number, WGD has major consequences. In the short term, WGD alters gene expression and favours polyploid dysfunction (reviewed in [35]). In the middle to long-term, the progressive return to a diploid state will result in regions of residual tetraploidy. Such regions should display an increased effective population size as compared to the rest of the genome (4 N_e instead of 2 N_e) and therefore a higher efficacy of selection and a reduced genetic load. Similarly, the population scale recombination rate, which also depends on N_e, should favour a higher efficacy of selection and a reduced genetic load. We thus predict that these regions should display a lower genetic load compared to diploid regions of the genome [36]. WGD has been associated with a reduced efficacy of purifying selection which may favour the accumulation of transposable elements. Such WGD may in turn have contrasted effects, as observed in plants [37, 38]. For instance, a LTR insertion favouring early flowering time (an adaptive trait in harsh environments), was shown to be present in tetraploid populations, but not in diploid populations of A. arenosa. In contrast, WGS was shown to be associated with small but significant differences in the load and distribution of fitness effect in the same species.

The Coho salmon (Oncorhynchus kisutch) is a Pacific salmon species of major cultural and economical importance, which has severely declined over the last decades (reviewed in [39]). It has undergone further recent declines in multiple parts of its native range [40]. In addition to human-induced perturbations, the species has undergone a series of postglacial expansions from its main glacial refugium (Cascadia/California) with multiple founder events along its route toward Alaska, leading to a pattern of isolation by distance, a gradient of population structure and a decrease in genetic diversity [41–43]. Accordingly, these range expansions may result in an increased expansion load, questioning whether such load may have consequences on the fitness of populations at the expansion front. The Coho salmon offers a unique opportunity to test the role of demography and recombination on the deleterious load and on the efficacy of selection. The Coho salmon belongs to the Salmonidae family, a particularly relevant group of species to investigate the effect of WGD. The common ancestor of all present-day salmonids has undergone a whole-genome duplication event ~80-100MyA [44]. The genome of salmonid species is still on its path to rediploidization. For instance, approximately 8% of the Coho salmon genome displays residual tetraploidy [41]. Residual tetraploidy offers a unique opportunity to investigate the direct effect of the genomic variation of N_e on the efficacy of selection.

To address these questions, we aimed at testing the following general hypotheses: i) whether demographic expansion and bottlenecks have led to an increased load and decreased efficacy of selection, ii) whether heterogeneous recombination levels shape the within-genome variation of the load as a result of Hill-Robertson effects and iii) whether residual tetraploidy results in an increased efficacy of selection and reduced deleterious load through increased recombination and N_e.

Results and discussion

Strong postglacial population expansion revealed through whole genome sequences

We generated 30X coverage whole genome resequencing data for 71 Coho salmon representing 14 populations distributed from California to some of the most upstream populations of the Porcupine River in Yukon, Canada (Fig 1A). We also included several outgroup species namely the Sockeye salmon (O. nerka), Chinook salmon (O. tshawytscha), Pink salmon (O. Gorbuscha), Rainbow trout (O. mykiss) and Atlantic salmon (Salmo salar). These genomes were used to identify the ancestral and derived alleles in Coho and/or testing correlation(s) between genomic load estimates and recombination rate (S1–S3 Tables).

Download:

Fig 1. Sampling design and population size change inferred through whole genome sequences.

A) Sample site along the Pacific coast of North America. Map produced in R using the World data from Maps package with public domain data available at [47]. B) Average Tajima’s D values computed along the genome and summarized for each sample locality. Populations are ordered from distance to the river mouth and from the South to the North (i) California, (ii) Cascadia, (iii) British-Columbia (BC), (iv) Haida-Gwaii, (v) Salmon River (Fraser Basin) and (vi) Alaska. Abbreviation provided in S2 Table). Tajima’s D values vary positively as a function of the distance from the southernmost site (R² = 0.28, p <2e^-16) and as a function of the distance to the river mouth (R² = 0.47, p <2e^-16) reflecting the history of founder events from downstream to upstream sites.

https://doi.org/10.1371/journal.pgen.1010918.g001

To explore the amount of clustering among individuals a PCA was performed (S1A and S1B Fig). The PCA illustrates that populations are spatially structured following both latitudinal and longitudinal gradients. The first axis strongly correlates with both latitude (Pearson r = 0.97, p < 2e^-16) and longitude (Pearson r = -0.82, p < 2e^-16), as expected when space drives population structure [45]. Moreover, admixture analyses (S1B Fig) revealed that Coho populations were generally structured at the river/watershed level. These observations corroborate our previous inferences based on lower resolution genomic dataset [42, 43].

Under a single refugial expansion scenario, we expect a linear decrease in genetic diversity as a function of the distance from the source. We plotted the distribution of observed heterozygosity (H_o) as a function of the distance to the southernmost sample and observed a negative correlation with the distance to the southernmost site (R² = 0.77, p<2e^-16, S2B Fig). Next, we used the β_ST coefficient to identify the most likely ancestral populations from which the expansion could have started. β_ST can account for the non-independence among populations and negative values are indicative of ancestral populations [46]. This metric was also positively correlated with distance to the southernmost site (R² = 0.75, p < 2e^-16, S2A Fig).

The Salmon River population from the Thompson area (dark green point, Figs 1A and S2A) exhibits lower diversity levels, as compared to all non-Alaskan populations. Excluding this outlier river, revealed even stronger correlations, with a correlation with β_ST and H_O respectively raising to R² = 0.88 and R² = 0.90 (p <2e^-16 for both tests), suggesting an overall clear spatial pattern across the populations sampled, with the notable exception of this specific population which had a different demographic trajectory in this population, which seems consistent with a strong bottleneck and inbreeding [41].

Another indicator of changes in population sizes is an increase in Tajima’s D values at the genome-wide level. We tested the hypothesis of increased Tajima’s D values as a function of (i) the traveled distance from the southernmost site (reflecting northward postglacial expansion) and (ii) the distance to the sea only (reflecting upstream directed founder events). Results are consistent with a signal of population expansion along the colonization axis from the south to the north based on Tajima’s D values (R² = 0.28, p< 2e^-16, Fig 1B). Distance to the sea is also highly significant (R² = 0.47, p <2e^-16 for Tajima’s D). This suggests that more genetic drift occurs in upstream populations e.g. Porcupine (POR), Mile Slough (MSL), Thompson (SAL) and Deschutes (DES).

Another hallmark of population bottlenecks and genetic drift associated with founder events is an increase in linkage disequilibrium across the genome [4]. We tested this hypothesis by measuring linkage disequilibrium (r²) following Hill & Robertson [48]. In line with the above observations, we observed extended linkage disequilibrium (LD) along the genome in remote Alaskan/Yukon populations (Porcupine and Mile Slough Rivers) and in the Salmon R. population (Thompson R. Watershed, S2C Fig). Accordingly, the LD decay—here defined as a r² reaching half of its maximum value—is observed at 25, 12 and 6 kbp in these three populations, contrasting with values around 0.5 kbp observed in the other populations(S2C Fig). These results show a large among population variance in effective population sizes, likely associated with the strength of the founder events.

In addition, we constructed a population phylogeny based on a matrix of pairwise F_ST values computed between each pair of populations (Fig 2A and 2B) and on a matrix of variance-covariance in allele frequencies implemented in Treemix [49] (S3 Fig). Both analyses confirmed that populations located in the north underwent more genetic drift, in line with the postglacial expansion from the south to the north and the subsequent founder effects associated with upstream colonisation of the rivers from the west to the east.

Download:

Fig 2. Expansion routes inferred from phylogenetic analyses revealed the extent of the mutation load.

A) population phylogenetic tree inferred from the pairwise F_ST values (see also S3 Fig) superimposed on the map. Map produced with R using the worldHires data [51]. B) Population specific branches are coloured according to the π_N/π_S values, used as a proxy of the genetic load. C) Correlation between tree branch lengths to the root and the π_N/π_S ratio. D) Correlation between the travelled distance to the southernmost site and the π_N/π_S ratio. The blue line represents the slope of the linear model between the two variables. The grey area in panel C and D represents the 95% confidence interval levels around the regression lines.

https://doi.org/10.1371/journal.pgen.1010918.g002

To consolidate the above observations, we investigated historical changes in N_e using the Sequentially Markovian Coalescent in SMC++ [50]. This method takes advantage of both the SMC and of the site frequency spectrum to infer change of effective population size though time. We inferred such changed for each population separately (S2D Fig), testing a mutation rate of 8e^-9 and 1.25e^-8 mutations/bp/generation, which respectively corresponds to the median and the mean mutation rate estimated in Atlantic salmon (salmo salar), a closely related salmonid (J. Wang, personal communication). This analysis revealed: i) an expansion of most populations approximately 12–20 KyA ago, concomitant with postglacial recolonization, ii) a slow and steady demographic decline in the Thompson R. (S2D Fig), and iii) a split time between all pairwise combinations of populations (median = 16,3 KyA, range = 6,7KyA - 61KyA, S4 Fig) compatible with the onset of postglacial population expansion and colonisation of different rivers following glacial retreat [41, 42]. Using the mean mutation rate yielded similar results with a more recent estimates of split times (median = 9,6 KyA, S4 Fig) (min = 5 KyA–max = 18 KyA). Overall, SMC++ results indicate that all populations shared a similar demographic history until they began to diverge following the end of the last glacial cycle (see S1 Note and associated Supplementary Tables).

Range expansion explained variation in the mutation load

We estimated the mutation load of the populations using different metrics: i) π_N/π_S, ii) count of derived homozygous mutations (including both missense and Loss-of-function (LoF) mutations), and iii) total count of derived mutations (including both missense and LoF mutations). We observed a significant positive correlation of the π_N/π_S ratio of each local population and the distance to the southernmost site (see methods), corresponding to the most-likely refugia as discussed in our previous work [42] (R² = 0.73, p<0.0001, Fig 2D). Similarly, using the tree branch length to the root of our tree (Fig 2B) as a proxy of expansion route revealed a significant correlation with the π_N/π_S ratio (R² = 0.66, p<0.0001, Fig 2C). This result was also supported by an analysis considering tree branch lengths inferred with Treemix (S4 Table). Accordingly, there was a significant correlation between π_N/π_S and two different proxies of the effective population size, namely levels of nucleotide diversity at synonymous sites (π_S) and SMC++ N_e,, expected to be reduced at the colonization front and in bottlenecked populations (π_S in S5A Fig and SMC++ N_e in S5B Fig). These results support expectations pertaining to variation in mutation load under a model of expansion load (e.g. [28]).

The interpretation of π_N/π_S in populations that are not at mutation-drift equilibrium (say after a bottleneck) is however difficult, since π_N will reach an equilibrium value faster than π_S because selected alleles will be subject to higher negative selection and undergo a faster turnover [14, 52, 53]. Thus, the π_N/π_S may not be the best predictor of the total burden of deleterious mutations, as it is potentially affected by demography and selection [14]. To circumvent this limitation, we counted and plotted the distribution of non-synonymous mutations classified according to i) their impact and expected consequences on fitness (i.e. missense and LoF mutations) and; ii) segregation patterns (i.e. additive (total) load composed of heterozygous and derived homozygous genotypes or recessive (fixed) load composed of homozygous derived genotype) [54]. Just as the π_N/π_S, we found a significant association between the derived load and the distance to the southernmost sites both for missense mutations (R² = 0.790, p < 0.001, Fig 3A) and for LoF mutations (R² = 0.777, p < 0.001, Fig 3B). Similarly, we observed a significant association between the tree branch length to the root and the derived load of missense and LoF mutations (R² = 0.39, p = 0.0096 and R² = 0.40, p = 0.009, respectively, S4 Table).

Download:

Fig 3. Number of deleterious alleles per riverper individual for all populations sorted from the South to the North.

The plot shows the additive load (left panel) and recessive derived homozygous load (right panel) for A) missense mutations and B) Loss-Of-Function (LoF) mutations. No strong differences are observed in the additive load among populations. In contrast, significant differences were observed for the recessive load in populations at the expansion front both for missense mutations and LoF mutations and this was significantly correlated with the distance to the southernmost site. Each color represents a major regional group. Abbreviation for each site is provided in S1 Table.

https://doi.org/10.1371/journal.pgen.1010918.g003

The most deleterious mutations are efficiently purged across the range

The total number of missense mutations was significantly more abundant in the more southern populations (i.e California, Cascadia, S5 Table for all Tukey-HSD p-values) than in northern ones (Fig 3A), while it was nearly constant from the south to the north regarding LoF mutations (Fig 3B), with no significant differences among populations (S5 Table). The southernmost populations displayed a higher load of mutations in heterozygous state (Fig 3), as expected due to their higher historical effective population size, favouring the segregation of recessive mutations hidden in a heterozygous state [42, 55].

The fixed load increases with population expansion

The fixed load (i.e. count of derived homozygous sites) increased from south to north for both missense and LoF with the most extremes samples, including Mile Slough (MSL), Porcupine (POR) and Salmon (SAL) rivers being the most significantly loaded (S5 Table), as expected due to founder events and allele surfing for Alaskan/Yukon samples and due to bottlenecks for the Salmon River (Fig 3 right panel).

Range expansion have reduced the efficacy of selection at range margin

We predicted that selection efficacy will be reduced at the expansion front and in populations with a lower N_e. To test this, we measured three parameters, namely the rate of non-adaptive and adaptive amino-acid substitutions relative to neutral divergence (ω_NA and ω_A, respectively; with ω_A = d_N/d_S—ω_NA) and the proportion of non-synonymous amino-acid substitution that result from positive selection (α = ω_A/(d_N/d_S)) using the software Grapes [56] using maximum likelihood estimation under a population genetic model. A general expectation is that populations at the expansion margin will exhibit a higher rate of non-adaptive substitutions (ω_NA), a lower rate of adaptive substitutions (ω_A) and a lower proportion of amino-acid substitutions (α).

We observed a reduced efficacy of purifying selection likely associated with the multiple founder events and bottlenecks which resulted in a decrease rate of adaptive substitutions (ω_A, R² = 0.37, p = 0.013) and increased ω_NA as a function of the distance to the South (R² = 0.57, p = 0.0011, Fig 4A). We also found a significant positive correlation between ω_NA and the distance to the ocean (R² = 0.52, p = 0.002, S6A Fig) and conversely a negative correlation between ω_A and the distance to the ocean (R² = 0.40, p = 0.00895, S6B Fig), suggesting that upstream populations displays a lower adaptive potential. This observation thus suggests that the population expansion had impacts on both the adaptive and non-adaptive substitution rates.

Download:

Fig 4. Efficacy of selection decreases at the expansion front and as a function of long-term effective population size.

A) increased rate of nonsynonymous non-adaptive substitutions (ω_NA) in northernmost populations of Coho salmon (Alaska) and in the bottlenecked Thompson River. B) Correlation between α and historical variation in the coalescent effective population size estimated from SMC++. See also S7 and S8 Figs for correlations based on tree branch lengths. Note that the distance from the southernmost site was computed considering a Californian sample, from a previous study [42], for which no WGS data were available.

https://doi.org/10.1371/journal.pgen.1010918.g004

Similarly strong correlations were observed when considering the tree branch length to the root (details in S7 and S8 Figs and S4 Table). The Salmon R. population (Thompson R. watershed), which suffered a recent decline in abundance, supposedly due to the sustained release of hatchery fish derived from few individuals and the Ryman-Laikre effect [57] (see discussion below), also displayed a high ω_NA.

We also predicted a higher α value in populations with higher N_e and, conversely, a decreased value of α in populations with lower N_e. We observed a strong positive correlation between α and the synonymous nucleotide diversity (π_S) of each local population, which represents a good predictor of the long term (coalescent) N_e (R² = 0.63, p = 0.0004, S9 Fig). To more directly test the association with N_e, we correlated SMC++ based estimates of N_e (averaged over a 200KyA window) with α and recovered significant correlations (R² = 0.49, p = 0.004, Fig 4B). It is noteworthy that a good correlation between proxies of N_e obtained from SMC++ and π_S was also observed (R² = 0.68, p = 0.00017, results not shown). Altogether, the above results provide empirical support pertaining to the evolutionary consequences of allele surfing at expanding range margins, in particular regarding the loss of adaptive potential and the mutation burden.

Recombination rates shape the deleterious mutational landscape

In addition to the spatial structure associated with the postglacial recolonization, we investigated the genome-wide variation in mutation load. Such variation could be associated with the occurrence of structural variants (e.g. chromosomal inversions) which may incur a significant load because deleterious recessive mutations may freely accumulate in the absence of recombination, as observed for instance in sex chromosome and related supergene-like architecture [58]. To test this hypothesis, we used the GC content at third codon position (GC3 hereafter) as a proxy of the rate of recombination [59, 60] (see S2 Note for an explanation). We observed strong correlations between levels of GC3 and π_N/π_S ratio for all Coho salmon populations (R² range = 0.938–0.955, p<0.001) except for population MSL (R² = 0.252; p = 0.0038) (Fig 5A). An analysis focused on GC-conservative sites, which are not affected by GC-biased Gene Conversion (gBGC), revealed similarly strong patterns across all Coho salmon populations (Fig 5B, R² range = 0.665–0.876, p < 0.01). π_N/π_S ratios estimated based on all sites or GC-conservative-sites only are highly correlated (Pearson r = 0.909, p <0.001). We tested the generality of this relationship using four other closely related Pacific salmon species and found strikingly similar pattern in Chinook salmon (R² GC3 ~ π_N/π_S = 0.9338, p = 1e-11) Sockeye and Kokanee salmon (R² range = 0.81–0.96, p <0.001), as well as Rainbow trout populations (R² range = 0.95–0.96, p <0.001, Fig 5C).

Download:

Fig 5. The deleterious load is determined by variation in GC3 content in multiple salmonids.

) A) Correlations between the deleterious load (π_N/π_S) and GC content at third codon position in multiple Coho salmon populations. Each line represents the values colored by major regional groups (See S10A Fig for detail). B) Correlations between the deleterious load (π_N/π_S) and the GC content at third codon position considering an independent method based on the site frequency spectrum at GC-conservative sites. C) Correlation between the deleterious load (π_N/π_S) and GC content at third codon position in sockeye salmon ecotypes (sockeye and kokanee), chinook, pink salmon and in rainbow trout. Averages are provided for rainbow trout, salmon and kokanee at the species level, see S10B and S10C Fig for detail by population.

https://doi.org/10.1371/journal.pgen.1010918.g005

We then analyzed the difference in the correlations between levels of GC3 and π_N/π_S ratio among all Coho salmon populations. Indeed, the intensity of this correlation highlights the effect of recombination rate variation on the efficiency of purifying selection [61]. The populations at the expansion front (Mile Slough R., Porcupine R., Snake R.) in Alaska exhibit the lowest correlations between GC3 and π_N/π_S. If both demographic factors (i.e. distance to the source here) and genomic factors (i.e. recombination) interact to shape the load, we may predict a positive relationship between these two factors because the northernmost populations should have an even higher load in regions of low recombination. We tested this hypothesis using a linear model between the slope of the regression between π_N/π_S and GC3 and the distance to the ancestral source populations did not reveal such a relationship (p = 0.133, S11 Fig).

Regions of residual tetraploidy revealed drivers of the load in Coho salmon

We tested the hypothesis that regions with residual tetraploidy exhibit a reduced load due to increased efficacy of purifying selection due to higher population sizes (4Ne rather 2Ne). Contrary to this expectation, increased deleterious load was observed in regions of residual tetraploidy (3,700 genes) as compared to diploid regions (Fig 6A, red dotted line mean π_N/π_S > 0.35, diploid region π_N/π_S < 0.30). Given that we also observed lower levels of recombination in regions with residual tetraploidy compared to re-diploidized genomic regions (S12 Fig, p < 0.0001, W Mann-Whitney = 3.04e⁷, see also S13 Fig and S6 Table for differences in recombination among populations), our results suggest that this higher π_N/π_S could be mostly due to lower recombination rates. Another expected genomic consequence of this lower recombination rate is a higher load of transposable elements [62]. When computing the relative length of TE, i.e. the length of TE corrected by the chromosome length (see methods), we found a significant enrichment of TE in the regions of residual tetraploidy as compared to diploid chromosomes (Fig 6B, p <0.0001, WMann-Whitney = 1.36e¹⁰). This tendency was also observed across the different TE categories (S14 Fig). To more directly test the N_e-effect hypothesis, we eliminated the effect of the recombination rate by comparing the load across similar bins of GC in diploid vs. tetraploid regions. The π_N/π_S was systematically higher in diploid regions than in the tetraploid ones after excluding the class with the lowest GC content (Fig 6C and S7 Table) indicating that the load was significantly higher in diploid compared to tetraploid regions, with the notable exception of regions with extremely reduced recombination. Therefore, it is still possible that increased efficacy of selection is at play in recombining regions of residual tetraploidy, following the hypothesis of higher effective size in these regions.

Download:

Fig 6. Increased load in region of residual tetraploidy.

A) Distribution of π_N/π_S ratio when considering all genes in regions of residual tetraploidy (red lines) for each population compared to a set of 200 randomly generated samples of 4,000 genes in the rest of the genome (gray histogram). B) Violin plot showing that regions of residual tetraploidy displayed significantly longer transposable elements when compared to diploid regions. Orange = diploid chromosome. Gray = chromosome with residual tetraploidy. Red point = mean +/- 1*sd. C) Difference in the distribution of πN/πS ratio in different genomic regions of the Coho salmon. Each dot represents the value observed for a given population in either diploid or tetraploid region with the “lowest” GC3 value (corresponding to low recombination) compared to the rest of GC3 values (labelle “other GC3”, corresponding to intermediate to high recombination). The GC values were computed in 4mb windows separately for diploid and tetraploid regions.

https://doi.org/10.1371/journal.pgen.1010918.g006

Conclusion

The role of genetic drift, recombination, selection and variation in inheritance in affecting the deleterious load and efficacy of natural selection is a central question with fundamental and applied consequences for biodiversity.

Using population genomics analyses, we investigated the evolutionary consequences of recent demographic events in Coho salmon. Our results supported gene surfing across North America [42]. Results also supported lower N_e at the northern range expansion front, which induced two main evolutionary consequences: a surf of slightly deleterious mutations and a putative reduction of the adaptive potential, as expected under the nearly neutral hypothesis of molecular evolution [12].

We further demonstrated that one population, the Salmon River (Thompson R. watershed) displayed an increased fixed load and decreased selection efficacy compared to southern populations. Previous studies showed that this population is genetically isolated from others [42, 43, 63] and displayed genomic footprints of a bottlenecked population such as long runs of homozygosity [41]. This population has been subjected to extensive hatchery enhancement (from a single population) to circumvent the decline of Coho salmon in the Thompson drainage [64]. From a theoretical standpoint, if a few captive individuals were used as parents for subsequent releases, then an increase in inbreeding and a reduction in effective population size are expected, a phenomenon called the Ryman-Laikre effect [57] [65]. Consequently it is possible that the increased π_N/π_S, ω_NA, and fixed deleterious load could be explained by both long-term evolution at low N_e and by the Ryman-Laikre effect associated with hatchery enhancements. Regardless, this suggests that careful enhancement needs to be performed with a diversity of parents to maximize genetic diversity of supplemented populations. This result has implications for fisheries enhancement, as well as genetic rescue programs aiming at reducing the inbreeding load of declining populations and restoring the fitness of these populations.

It is noteworthy that the southernmost population (Klamath R, California) was not characterized by the highest α value. Several non-exclusive hypotheses may explain this pattern: first, some Californian populations are known to have undergone strong recent (human-driven) decrease in abundance due to both habitat degradation and climate change, which may leave a detectable footprint in the genome [66]. The recent demography could have an impact on the accuracy of the fit of the synonymous and nonsynonymous SFS under Grapes. Second, the Klamath R. population is located in the far upstream part of that river, and may have undergone a founder event when reaching this upper part, a hypothesis that is supported by the correlations we observed between several metrics and distance to the river mouth. Finally, we did not include the southernmost populations from California, which are more likely to be more ancestral, as we previously reported [42].

Since the use of π_N/π_S ratio can be criticized [14, 67, 68], especially when used to quantify the load within species, we computed additional metrics [56] to more directly estimate mutation load, as advocated by others [14, 69]. In this case, we found small differences in the additive load among Coho salmon populations, but a linear increase in the recessive load as a function of the distance to the southernmost sites and as a function of the tree branch length, both are used as proxies of the expansion route.

We demonstrated that the π_N/π_S ratio was negatively correlated with the GC content at third codon position, which represents a good proxy of the local recombination rate (see S2 Note). The negative correlation between π_N/π_S and GC3 was repeatedly observed across different salmonid species using both a sequence based estimate of π_N/π_S and an estimate based on GC-conservative site (non-affected by gBGC) [60]. These results indicated that recombination plays a key role in explaining the variation in the mutation load along the genome in salmonids. Our study empirically supports theoretical work about the accumulation of slightly deleterious mutations in non-recombining regions [30, 32]. Similarly, increased prevalence of deleterious mutations in low recombining regions have been reported in plants [70] and in human populations [71–73]. In particular, recent work in human populations have shown that both variation in demographic history (i.e. change in effective population size) and recombination rates are affecting allele-specific expression of harmful mutations [71]. The authors showed that allele specific expression causes underexpression of harmful mutations more efficiently in normally and highly recombining regions compared to low recombining regions. They further documented variation of this process among populations with varying demographic histories.

In line with the key role of recombination rate, we observed that only regions of residual tetraploidy with extremely low recombination rate (lower GC content) displayed an increased load in Coho salmon, accumulating more transposable elements, suggesting efficient selection in more “normally” recombining region of residual tetraploidy. In particular, our results indicate that i) the regions of residual tetraploidy (4N_e) with a normal recombination rate do not display an increased load, which may be expected under the nearly neutral theory if higher N_e is associated with the efficacy of selection; ii) only region of extremely low recombination rate (lower GC content) displays the highest load, highlighting once again the primary role of recombination in shaping the load, with a higher contribution than N_e. A third factor is the variation in dominance of deleterious mutation. Indeed, a recent simulation study found that in the case of hard sweeps, dominance of recessive mutation was a central aspect determining the signal of sweeps in polyploid genomes [39]. A detailed investigation of this aspect in regions of residual tetraploidy was beyond the scope of our study but would be worthy of further investigation.

The detailed consequences of ongoing rediploidization have been extensively studied at the regulatory levels elsewhere, with support for an increased load (higher d_N/d_S and TE load) in duplicated genes undergoing lower expression [74]. Recent studies in polyploid plant species have also documented various evolutionary consequences of such duplication on local variation in the load and efficacy of selection [35]. For instance in A. arenosa, a reduced efficacy of purifying selection was suggested in tetraploids (4X) genome compared to diploids (2X), and the authors suggest that this is because deleterious alleles are better masked in autotetraploids [38]. Similarly, a recent work in the allopolyploid cotton (Gossypium) demonstrated that this species accumulates more deleterious mutations than the diploid species [75]. This observation supports a theory proposed by Haldane that recessive deleterious mutations accumulate faster in allopolyploids because of the masking effect of duplicated genes [76]. To sum up, our results concur with predictions from the nearly neutral theory of molecular evolution [12], in which slightly deleterious mutations are effectively neutral and purged effectively in regions of higher recombination, except perhaps in populations at the extreme of the expansion front. Interestingly, these results indicate that regions of low-recombination re-diploidize later than other genomic regions. We are not aware of any paper having documented such patterns so far. Further studies of this process across more species would be welcome to validate the generality of this observation.

Finally, our results have implications for conservation practices. We showed that the additive load is approximately constant, indicative of efficient purging across populations for both missense and LoF, similar to some recent studies on several plant and animal models [77–80]. However, results indicate that population at higher latitude from the Yukon watershed (Mile Slough R., Porcupine R.) or from the bottlenecked Salmon R. have not entirely purged the most deleterious mutations, including missense and LoF mutations, which may impose a fitness cost to these populations. Further empirical evidence for a causal link between the putative fitness cost of LoF mutations and adaptive phenotypic variation will be necessary to validate our observations. With this caveat in mind, our results for the Salmon R could guide practices in supplementation programs. For instance, choosing a diversity of parents from moderately differentiated populations of modest size may help increase the levels of heterozygosity and mask the expression of recessive deleterious mutations [55]. This strategy could reduce the occurrence of deleterious alleles and counteract the Ryman-Laikre effect described above. Moreover, populations (e.g. most upstream Alaskan/Yukon populations) for which our results suggest a reduced adaptive potential are also the most strongly exposed to rapid climate change, as the rate of temperature increase is most rapid at higher latitudes [81]. In all cases, maximizing the connectivity among populations and limiting habitat degradation appears as fundamental strategies to maintain high effective population size and increase the adaptive potential of Coho salmon to the multiple ongoing anthropogenic pressures [82]. More generally, how best to manage declining populations and guide conservation policies in these conditions to minimize the load and/or maximize genetic diversity is another debated issue [55, 83–85]. In the meantime, while conservation genomics undoubtedly has a major role for the short-term preservation of endangered species, this should not override the crucial need for reducing human impacts on natural ecosystems to preserve biodiversity over long time scales [86].

Methods

Sampling design for Coho salmon

We sampled and sequenced 71 individuals representing 14 populations distributed from California to Alaska (S1 Table). A set of 55 individuals was sequenced on an Illumina HiSeq2500 platform [38] and the other 16 individuals were sequenced on a NovaSeq6000 S4 platform using paired-end 150 bp reads. Reads were processed using fastp for trimming [87], and mapped to the most recent Coho reference genome (https://www.ncbi.nlm.nih.gov/assembly/GCF_002021735.2/) using bwa mem v2 [88]. Reads with a minimum quality below 20 were discarded with samtools v1.7. Duplicates were removed with picard (http://broadinstitute.github.io/picard/). SNP calling was performed using GATK v4.2 [89] using our pipeline available at github.com/quentinrougemont/gatk_haplotype/. We generated a Haplotype gVCF for each sample individually, combined all gVCF and then performed a joint genotyping. We checked the variants quality score of our data and filtered our genotypes according to their quality following GATK best practices and based on quantiles distributions of quality metrics. We excluded all sites that did not match the following criterion: MQ < 30, QD < 2, FS > 60, MQRankSum < -20, ReadPosRankSum < 10, ReadPosRankSum > 10. We also excluded multiallelic SNPs, as well as indels. Genotypes with a depth lower than 6 or higher than 100 reads were also excluded to remove low confidence genotypes potentially associated with paralogs. Finally, we also generated a separate vcf file using the—all-sites option to obtain a file with invariant position to reconstruct sequence data (see the Genetic load estimation in Coho salmon and related species section).

A total of 14,701,439 SNPs were identified without missing data. Population structure was evaluated using a principal component analysis (PCA) performed using Ade4 [90] on a set of LD-pruned SNPs without missing data (1,739,037 SNPs) identified stringently with plink1.9 [91] (command indep-pairwise 100 50 0.1).

Outgroup dataset.

In order to test the generality of the relationship observed between π_N/π_S and GC3 we took advantage of the newly assembled reference genomes and resequencing data from other closely related Pacific salmonid species with similar demographic histories [92–94]. Sockeye salmon published by [92] were retrieved from NCBI PRJNA530256. Samples with a high number of individuals per ecotype (Sockeye and Kokanee) were chosen from the NCBI table. We retained a total of 5 Kokanee populations and 5 Sockeye populations (from Fraser & Columbia watershed described in S1 and S3 Tables). Three Chinook salmon samples were provided by B. Koop (also available at NCBI PRJNA694998). Additionally, 11 samples of “even” and 10 samples of “odd” pink salmon (O. gorbuscha) were downloaded from PRJNA 556728 and included in the analysis. Here “even” and “odd” refers to Salmon returning to their natal rivers in “even” and “odd” years to spawn, leading to a temporal isolation of these ecotypes [93]. These were indeed clearly separated based on a PCA. Finally, a number of rainbow trout available from NCBI PRJNA386519 were used (n = 19 from 3 random populations showing genomic differentiation based on a PCA).

Each sample was downloaded and mapped onto its species’ reference genome downloaded from NCBI and using the exact same procedure as described above relying on fastp, bwa-mem2, picard and GATK 4.2.5.0 to generate a final vcf filter based on usual GATK quality criteria and variance in sequencing depth. For each species, we then quantified the load using the π_N/π_s ratio with the procedure described below for Coho salmon. For the Sockeye/Kokanee ecotypes, the Sockeye is a fully migratory ecotype, whereas the Kokanee is a resident (non-migratory) ecotype that typically comprises more isolated populations. The two alternative ecotypes are sometimes found in similar locations, with three rivers from our sampling design including both ecotypes (S3 Table). For all other species we tested the relationship between π_N/π_S and GC3 (see below).

Ancestral and derived alleles identification

To accurately recover the ancestral allelic states, we used three outgroup species; including the chinook salmon and rainbow trout sample (see above, n = 5 for rainbow trout) plus data from Atlantic salmon (n = 5, SRP059652). Each individual was aligned against the Coho salmon V2 genome (GCF_002021745.2) using GATK with the same procedure as above and calling every SNP using the–all-site mode to obtain invariant positions. We then determined the ancestral state of each SNP if 1) the SNP was homozygous in at least 90% of the individuals from our three outgroups, and 2) matched one of the two alleles identified in Coho salmon. Otherwise, the site was inferred as missing and was not used in subsequent analyses of the load. In addition, we reconstructed a consensus ancestral fasta sequence using the doFasta option from angsd [95]. This was used for demographic reconstruction detailed in S1 Note.

Demographic reconstruction

We first tested our prediction that genetic diversity (observed heterozygosity H_o) decreases towards the North following our «out of Cascadia» model previously inferred [42, 43]. Conversely, we verified that the β_ST coefficient, a measure of both genetic differentiation and ancestrality [46] increases northwards the North, as expected due to isolation by distance. β_ST and observed heterozygosity (H_o) were measured using the hierfstat R package [96] Details about β_ST computation are provided in Weir & Goudet [46] and H_o was computed following Nei (1987)[97] as 1 - ∑_k∑_iPkii/np with Pkii the proportion of homozygotes i in sample k and np the number of samples. Oceanic coastal distances were computed using the marmap package [98] and waterway distance was computed using ArcGIS. Then we summed the 2 distances to obtain the distance to the most likely refugia that we identified in our previous studies [42, 43]. A shapefile of the rivers used here is available at https://github.com/QuentinRougemont/selection_efficacy/tree/main/00.raw_data.

Under postglacial expansion from a single refugium, the general hypothesis would be that all sampled populations should follow a common temporal trajectory of a population decline (bottleneck due to founder events by few individuals) followed by a (strong) increase in N_e corresponding to the expansion phase. To test this hypothesis, we inferred temporal changes in N_e using SMC++ [50]. SMC++ works similarly to the PSMC model but takes into account the Site Frequency Spectrum (SFS) and is better suited to large sample sizes. Estimates of changes in population size were performed for all populations. To validate the fact that expansions are indeed postglacial, splitting time was estimated between all pairs of samples from different geographic areas based on the joint SFS (n = 75 pairwise comparisons). A generation time of 3.5 years, well documented for Coho, and a mutation rate of 8e^-9 mutation/bp/generation were applied (corresponding to the median substitution rate inferred in Atlantic salmon, J. Wang, Personal communication). We also compared the results to the mean substitution rate of 1.25e^-8 mutation/bp/generation also inferred by Wang.

In addition, pairwise linkage disequilibrium provides valuable information regarding population size and inbreeding. We computed the squared correlation coefficient between genotypes in each sample and all populations separately using popLDdecay [99]. We used a MAF of 5% in each population, keeping between 3.7 and 6.4 million SNPs with populations having undergone stronger bottleneck/founding events displaying the lowest amount of variation. We estimated LD decay by plotting LD (R²) against physical distance measured in base pairs in ggplot2 [100] package in R.

We observed slight discrepancies between our SMC++ estimates of divergence time and our previous work based on the site frequency spectrum [42]. To investigate this, we performed a new set of inference based on the unfolded joint site frequency spectrum (jSFS) using ∂a∂i [101] and a new set of refined models as detailed in supp. S1 Note and S8 and S9 Tables.

To reconstruct the population expansion routes we computed Weir and Cockerham [102] F_ST values among populations using Vcftools [103] and constructed a phylogenetic population tree using the R package ape [104]. We projected the tree on the map using the functions contMap and phylo.to.map from the phytools R package [105]. We then computed the tree branch lengths to the root using the function distroot from adephylo [106]. We also explored broad relationships among populations with treemix [49]. We fitted a model with an increasing number of migration edges. We chose a model with K = 3 migration edges as all edges displayed significant p-value and captured a high proportion of explained variance. Fitting more edges decreased the p-value without really improving the fit to our data. Similarly, we computed the tree branch length to the root using the function adephylo and compared our results to those from F_ST-based values. The resulting tree branch length values were used as a proxy for the expansion routes, and we tested their correlation using linear models in R, with all our metrics of deleterious load (π_N/π_S, total number of putative homozygous derived deleterious alleles (recessive load) and total number of deleterious alleles (both in homozygous and heterozygous states, additive load) for derived missense and LoF mutations) and metrics of selection efficacy (ω_NA, ω_A, α detailed below). Similar inference was performed, but using the reconstructed distances to the southernmost site included in our previous studies [42, 43]. This distance was computed by summing the oceanographic distance and river distances described above.

Population-scaled recombination rate

Statistical phasing of the Coho whole genome sequences was performed using the Shapeit software [107], considering all individuals at once. We then estimated effective recombination rates (ρ = 4.N_e.r where r represents the recombination rate per generation and N_e is the effective population size) along the genome using LDHat [108]. Phased genotypes were converted into LDHat format using vcftools after excluding SNPs with MAF < 10% since rare variants are not informative for such inferences. Following the guidelines of the LDHat manual, the genome was split into fragments of 2,000 SNPs with overlapping windows of 500 SNPs. We measured recombination rates independently for each population. Differences in the distribution of population-scaled recombination was visualized using violin plot (S13 Fig) and statistically tested using Tuckey HSD tests in R (S5 Table).

Genetic load estimation

π_N/π_S estimates.

The approach developed in [59] was used to reconstruct fasta sequences for each individual. For each species (Coho and the other salmonids), coding sequences (CDS) from each reconstructed sequence were extracted using the gff files available with the reference genome to estimate the nucleotide diversity (π). We also concatenated the CDS sequences into different classes according to their length and computed _N and π_S over 4-Mb concatenated gene space windows. Such large windows reduce the stochasticity due to the low π_S values in Coho salmon.

Identifying potential deleterious non-synonymous alleles

We tested the difference in count of non-synonymous mutations in each local population of Coho salmon across non-synonymous missense mutations (putatively deleterious) and Loss of Function (LoF) mutations (likely to be strongly deleterious) identified with SNPeff. We analyzed data in two ways: first, we counted the total number of putative homozygous derived deleterious alleles (recessive load) per individual as well as the total number of deleterious alleles (both in homozygous and heterozygous states, additive load) using: Ntotal = 2 Χ N_homo + N_hetero [28]. These individual values were then averaged per population. We tested for differences in the distribution of these counts among populations using an ANOVA followed by a TukeyHSD test. The p-values were corrected using a Bonferroni correction. We also computed mean derived allele frequencies (DAF) in all sampling locations and across mutation categories (synonymous, non-synonymous missense and LoF). We also applied the commonly used Provean [109] software based on a random set of non-synonymous mutations (but see S3 Note for a brief discussion regarding the limitations).These results were then compared with results from non-synonymous mutations (S15 Fig).

Correlation between π_N/π_S and recombination

We computed the GC content at third-codon positions of protein coding genes (GC3) which has been shown to be an accurate proxy of local recombination rate in other species and these positions are generally silent [85, 86]. To compute π_N/π_S values we sorted genes by ascending GC3 values, which enabled us to obtain a ratio based on genes with similar GC3 values. Moreover, we also used the site frequency-based approach proposed by Rousselle et al. [60] to estimate the π_N/π_S ratios. This approach enabled us to compute SFS separately for GC conservative sites (A<->T and C<->G mutations), that is, not affected by GC-biased Gene Conversion (gBGC).

Finally, we measured the correlation between GC3 and π_N/π_S using linear models. We replicated these analyses considering only genes (n = 3,500) and SNPs in regions of residual tetraploidy (8% of the genome).

DFE estimation and rate of adaptation in Coho salmon

We estimated the rate of non-adaptive and adaptive synonymous substitutions (ω_NA and ω_A, respectively; with ω_A = d_N/d_S—ω_NA) and the proportion of amino-acid substitution potentially resulting from positive selection (α = ω_A/(d_N/d_S), with d_N being the rate of non-synonymous substitutions, d_S being the rate of synonymous substitutions). To do so, we used the method implemented in Grapes v1.0 which builds upon the approach of [110]. Grapes models the effect of favorable mutations on the non-synonymous site frequency spectrum (SFS) while accounting for various confounding factors distorting the SFS (demographic change, linked selection, genotyping errors, SNP misorientation).

We used the following parameters in Grapes v1.0: we assumed a negative Gamma distribution to the synonymous and non-synoymous SFS (parameters GammaExpo in Grapes and parameter “unfolded” site frequency spectrum). To obtain suitable data, we converted the quality filtered whole genome vcf file into a fasta file for each population containing sequence information for each individual in the population (using vcf2fasta available at https://github.com/QuentinRougemont/selection_efficacy/tree/main/08.piNpiS/). This file was then converted into a site-frequency spectrum using bppml [111]. We required at least 10 sites without gap to keep a given site (-gapN_site), with a maximum proportion of gap of 0.5 (-gapN_seq), a minimum number of complete codon of 6 (-min_nb_codon), we did not allow frameshift (remove_frameshift parameter), we also required a sample size of 8 (-sample_size), the number of initial/terminal position in which Stop codons or FrameShift codons are tolerated was set to 20 (-tolerance_zone), we did not allow Stop or FrameShift codons between the two tolerance zones (-allow_internal_bc parameter). The kappa value (i.e Ts/Tv ratio) was set to 1.6 and we used an unfolded SFS using the three species as an outgroup. Fitted parameters of the DFE were used to compute the expected d_N/d_S under near neutrality, which was compared to the observed d_N/d_S to estimate α, ω_NA, ωa.

Differences in load for region of residual tetraploidy in Coho salmon

π_N/π_S comparison.

In the Coho reference genome, Chromosomes 1 to 30 are considered to represent generally diploid chromosomes, whereas chromosomes 31 to 38 represent those with a clear signal of residual tetraploid [41]. We took advantage of this specificity regarding chromosome evolution to contrast the load for the 3700 genes in regions of residual tetraploidy (averaged across all genes for each population). For diploid regions, we generated 200 datasets of 4,000 genes randomly sampled and then estimated the load for each of these datasets.

TE annotations

We used the TEs annotation file from repeatmasker [112] (made available on NCBI for the reference genome [41]) and tested for difference in the length of TEs between diploid and tetraploid regions, after correcting for the difference in chromosome length.

Supporting information

S1 Table. Sampling strategy, including various outgroup species.

https://doi.org/10.1371/journal.pgen.1010918.s001

(XLS)

S2 Table. Summary Statistics across coho populations, including geographic coordinates, tree branch length, genetic diversity and load statistics.

Fig 1 to Fig 4 can be reconstructed from these.

https://doi.org/10.1371/journal.pgen.1010918.s002

(XLS)

S3 Table. Outgroup summary statistics by species, with a focus on the load.

https://doi.org/10.1371/journal.pgen.1010918.s003

(XLS)

S4 Table. Results of linear models.

These provide tests of the relationship between branch length of the population phylogenies based on (a) pairwise Fst value and b) treemix tree inferred with 3 migrations event. c) treemix results obtained after removing the “salmon” river with aberrant branch length.

https://doi.org/10.1371/journal.pgen.1010918.s004

(XLS)

S5 Table. Results of Mann-Whitney test for difference in load among population based on missense mutation and Loss of Function mutation.

Mutation effect identified with SNPeff.

https://doi.org/10.1371/journal.pgen.1010918.s005

(XLS)

S6 Table. Results of Tukey-HSD test for difference in recombination rate among populations based on Ldhat estimates.

“Pair” is the compared population pair with the initial described in Table S01. “Diff” is the difference in recombination between the pair Lower and Upper CI are the 95% confidence level

https://doi.org/10.1371/journal.pgen.1010918.s006

(XLS)

S7 Table. Difference in load among diploid region, compared to region of residual tetraploidy with load computed in 500kb windows for each kind of regions.

https://doi.org/10.1371/journal.pgen.1010918.s007

(XLS)

S8 Table. AIC of each model for each pairwise comparison.

AM = Ancient Migration, IM = Isolation With Migration, SI = Strict Isolation, SC = Secondary Contact, G = prefix indicating Growth in the descending population, 2N = prefix indicating linked selection, 2m = prefix indicating reduced effective migration along the genome, the A prefix after each model (e.g. AMA) indicated a Growth in the ancestral population.

https://doi.org/10.1371/journal.pgen.1010918.s008

(XLS)

S9 Table. Parameter estimates for each of the major regional group where ∂a∂i was fitted.

Na = effective size of the ancestral population, Ne pop1 = effective size of the first population, Nepop2 = effective size of the second population, m12 = migration rate from 2 into 1, m21 being the reverse, me12 = effective migration rate in barriers regions, me21 being the same in the reverse, Tsplit = Split time, Tam = time of migration stop, P = proportion of the genome being neutrally exchanged, Q = proportion the genome undergoing linked selection, O = proportion of the genome correctly oriented, hrf = Hill-Roberston factor, indicating the extent of reduction in Ne in region affected by linked selection, b1, b2 = extend of population growth in current population1 and 2 respectively.

https://doi.org/10.1371/journal.pgen.1010918.s009

(XLS)

S1 Note. Reconstruction of demographic history from RADseq data.

https://doi.org/10.1371/journal.pgen.1010918.s010

(ODT)

S2 Note. Relationship between GC3 and recombination.

https://doi.org/10.1371/journal.pgen.1010918.s011

(ODT)

S3 Note. Comparison of Provean and non-synonymous results.

https://doi.org/10.1371/journal.pgen.1010918.s012

(ODT)

S1 Fig. PCA and admixture plots among population.

A. Result of a principal component analysis obtained from a set of high quality biallelic SNPs without missing data showing both a clusterization along latitude and longitude as well as discrete clusters corresponding broadly to each river. Each label represents a given individual from a given river (labelled following table S01) and is coloured according to its region of sampling. B. LEA results for admixture inference for K = 14. Each bar represents an individual and is coloured according to its membership probability. Each name corresponds to a river. The TsooYes appear as a mixture of different individuals. Results must be interpreted with caution given the small sample sizes.

https://doi.org/10.1371/journal.pgen.1010918.s013

(TIF)

S2 Fig. Summary statistics revealed the demography of Coho salmon.

A. Positive correlation between the β_ST and distance to the southernmost site showing that differentiation increased linearly from the south to the north. In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. The most negative values display likely ancestral samples. The Thompson sample displays high inbreeding and is bottlenecked. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R. B. Negative correlation between genetic diversity (observed heterozygosity) and distance to the south.The Thompson sample displays high inbreeding and is bottlenecked. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R. C. Rates of LD decay as a function of distance along the genome. The higher LD indicates a history of inbreeding or bottleneck. D. SMC++ inference of population size change with whole genome sequences for each local population of Coho salmon. Recent times should be interpreted carefully.

https://doi.org/10.1371/journal.pgen.1010918.s014

(TIF)

S3 Fig. Inference of population split and mixture from Treemix.

A) Tree with three migration arrows. Each name describes a river sample site. Each river is color coded following the color scheme provided elsewhere (e.g. Fig 1). 3 significant migration arrows are displayed. Each migration arrow is colored according to the weight it received (from yellow to red) in Treemix. The weights are related to the fraction of alleles in the descendant population that originated in each donor population. Each node was highly supported based on 500 bootstrap. B) proportion of variance in the covariance of allele frequency explained as a function of the number of migration edges. C) Same tree colored according to the values of π_N/π_S

https://doi.org/10.1371/journal.pgen.1010918.s015

(TIF)

S4 Fig. SMC++ split time.

Estimates of population split time from SMC++ under a model without gene flow among populations. Shown are estimates obtained when comparing split time between pairs of samples from different major regional groups. Two different mutation rates were used the: mean and median values based on Salmo salar orthologues mapped on the pike Esox lucius genome (Wang J. personal communication).

https://doi.org/10.1371/journal.pgen.1010918.s016

(TIF)

S5 Fig. Correlation between π_N/π_S and π_S and π_N/π_S and N_e from smc++.

A) Distribution of π_N/π_S as a function of π_S in each coho salmon populations from the study. B) Distribution of π_N/π_S as a function of N_e from SMC++ for each coho salmon populations from the study. Results of linear models are displayed. In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R.+

https://doi.org/10.1371/journal.pgen.1010918.s017

(TIF)

S6 Fig.

Correlation between distance to the ocean of each sample location (i.e. corresponding to the spawning migration) and the inferred rate of A) non-adaptive substitution (ω_NA) and B) adaptive substitution (ω_A). In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R.

https://doi.org/10.1371/journal.pgen.1010918.s018

(TIF)

S7 Fig. Results of linear models testing the effect of tree branch length to the root extracted from a F_ST-based population phylogeny on different metrics of selection efficacy.

A: relationship between ω_NA and tree branch length; B: relationship between ω_A and tree branch length; C: relationship between α and tree branch length. See text for a definition of each metrics. Sample sites are coloured by region. The blue line represents the value of the regression line. In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² along with its p-value.

https://doi.org/10.1371/journal.pgen.1010918.s019

(TIF)

S8 Fig.

Results of linear models testing the effect of tree branch length to the root extracted from a treemix population phylogeny on the load (π_N/π_S, panel A) and different metrics of selection efficacy B: relationship between ω_NA and tree branch length; C: relationship between ω_A and tree branch length; D: relationship between α and tree branch length. See text for a definition of each metrics). Sample sites are coloured by region. The blue line represents the value of the regression line. In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² along with its p-value.

https://doi.org/10.1371/journal.pgen.1010918.s020

(TIF)

S9 Fig. Correlation between the proportion of amino-acid substitution that results from positive selection (α) and the synonymous diversity π_S used as a proxy of effective population size.

Each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R.

https://doi.org/10.1371/journal.pgen.1010918.s021

(TIF)

S10 Fig. Relationship between GC3 and pN/pS for all populations and all outgroups.

A) Correlation for each population of coho salmon. Each point represents a sampling site and is coloured according to the region in which it was sampled; B) correlation within populations of rainbow trout. Each point represent a population as infered using a PCA and corresponds to different rivers of sampling. C) correlation for each population of Sockeye and Kokanee ecotype. Each point corresponds to differents rivers. All correlations are significant. The x-axis displays the median GC3 and y-axis the π_N/π_S ratio. Abbreviation for each site is available in Table S01. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R.

https://doi.org/10.1371/journal.pgen.1010918.s022

(TIFF)

S11 Fig. Relationship between recombination (GC3) and demographic factors (distance to the southernmost site).

A) relationship between the slope of the linear model between GC3 ~ and π_N/π_S and the distance to the southernmost site. B) Correlation between the lowest recombining GC3 classes (expected to display the highest load) and the distance to the southernmost sites. In all panels each point represents a sampling site and is coloured according to the region in which it was sampled. Displayed is the adjusted R² of a linear model along with its p-value. The grey area represents the 95% confidence interval levels around the regression lines obtained with the predict function in R.

https://doi.org/10.1371/journal.pgen.1010918.s023

(TIFF)

S12 Fig. Region of residual tetraploidy vs diploid region of the genome display different recombination landscapes.

Combined Violin plot and boxplot showing the distribution of population scale recombination (⍴ = 4*N_e*r) inferred from LDhat in Diploid chromosomes (orange) vs the 8 Regions of residual tetraploidy (gray).

https://doi.org/10.1371/journal.pgen.1010918.s024

(TIF)

S13 Fig. Distribution of recombination rate among all populations.

Each point represents the observed value of population scale recombination rate (ρ = 4N_e*μ) computed in 1 mb windows over the whole genome in each population. The harmonic mean is plotted by a red dot along with its value. For each population a violin plot embedded within a boxplot is shown.

https://doi.org/10.1371/journal.pgen.1010918.s025

(TIF)

S14 Fig. TE length differs between diploid chromosome versus chromosome displaying residual tetraploidy.

Violin plot displaying the difference in TEs relative length (i.e. length corrected by the total length of each chromosome) for each major TE category and each type of chromosome. Red point = mean +/- 1*sd.

https://doi.org/10.1371/journal.pgen.1010918.s026

(TIFF)

S15 Fig. Provean analysis of potentially deleterious mutation and the resulting load.

Boxplot showing the number of deleterious alleles per river (sorted from the south to to north). left panel = additive load, right panel = recessive load. top = missense deleterious mutations according to Provean predictions, bottom = missense tolerated mutations according to Provean predictions. No strong differences are observed in the additive load among populations. Significant differences were observed for the recessive load in populations at the expansion front which is qualitatively similar to our inferences from missense and LoF mutations. Each color represents a major regional group. Results were obtained for a random subset of mutations only given the strong computational burden of Provean.

https://doi.org/10.1371/journal.pgen.1010918.s027

(TIF)

Acknowledgments

We are grateful to Alysse Perreault-Payette and Bérénice Bougas for help in DNA extraction of additional individuals for whole genome sequencing. We thank Benoit Nabholz for the initial development of the π_N/π_S C++ app we used here. We are grateful to the genomic platform at Compute Canada (Canada), IBIS (Canada), and MBB (Montpellier, France) for providing computational capacity as well as Bird (Nantes, France) for data storage. This research was carried out in conjunction with EPIC4 (Enhanced Production in Coho: Culture, Community, Catch), a project supported in part by University Laval, Ressources Aquatiques Québec (RAQ), and the Government of Canada through Genome Canada, Genome British Columbia, and Genome Québec.

References

1. Hewitt GM. Genetic consequences of climatic oscillations in the Quaternary. Philos Trans R Soc Lond, B, Biol Sci. 2004;359: 183–195; discussion 195. pmid:15101575
- View Article
- PubMed/NCBI
- Google Scholar
2. Excoffier L, Foll M, Petit RJ. Genetic Consequences of Range Expansions. Annual Review of Ecology, Evolution, and Systematics. 2009;40: 481.
- View Article
- Google Scholar
3. Slatkin M, Excoffier L. Serial Founder Effects During Range Expansion: A Spatial Analog of Genetic Drift. Genetics. 2012;191: 171–181. pmid:22367031
- View Article
- PubMed/NCBI
- Google Scholar
4. Ohta T. Linkage disequilibrium due to random genetic drift in finite subdivided populations. Proc Natl Acad Sci U S A. 1982;79: 1940–1944. pmid:16593171
- View Article
- PubMed/NCBI
- Google Scholar
5. Hallatschek O, Hersen P, Ramanathan S, Nelson DR. Genetic drift at expanding frontiers promotes gene segregation. Proc Natl Acad Sci U S A. 2007;104: 19926–19930. pmid:18056799
- View Article
- PubMed/NCBI
- Google Scholar
6. Klopfstein S, Currat M, Excoffier L. The Fate of Mutations Surfing on the Wave of a Range Expansion. Molecular Biology and Evolution. 2006;23: 482–490. pmid:16280540
- View Article
- PubMed/NCBI
- Google Scholar
7. Travis JMJ, Münkemüller T, Burton OJ, Best A, Dytham C, Johst K. Deleterious Mutations Can Surf to High Densities on the Wave Front of an Expanding Population. Molecular Biology and Evolution. 2007;24: 2334–2343. pmid:17703053
- View Article
- PubMed/NCBI
- Google Scholar
8. Peischl S, Dupanloup I, Kirkpatrick M, Excoffier L. On the accumulation of deleterious mutations during range expansions. Mol Ecol. 2013;22: 5972–5982. pmid:24102784
- View Article
- PubMed/NCBI
- Google Scholar
9. Peischl S, Excoffier L. Expansion load: recessive mutations and the role of standing genetic variation. Molecular Ecology. 2015;24: 2084–2094. pmid:25786336
- View Article
- PubMed/NCBI
- Google Scholar
10. Peischl S, Dupanloup I, Foucal A, Jomphe M, Bruat V, Grenier J-C, et al. Relaxed Selection During a Recent Human Expansion. Genetics. 2018;208: 763–777. pmid:29187508
- View Article
- PubMed/NCBI
- Google Scholar
11. Haag CR, Roze D. Genetic Load in Sexual and Asexual Diploids: Segregation, Dominance and Genetic Drift. Genetics. 2007;176: 1663–1678. pmid:17483409
- View Article
- PubMed/NCBI
- Google Scholar
12. Ohta T. Slightly Deleterious Mutant Substitutions in Evolution. Nature. 1973;246: 96–98. pmid:4585855
- View Article
- PubMed/NCBI
- Google Scholar
13. Hedrick PW, Garcia-Dorado A. Understanding Inbreeding Depression, Purging, and Genetic Rescue. Trends in Ecology & Evolution. 2016;31: 940–952. pmid:27743611
- View Article
- PubMed/NCBI
- Google Scholar
14. Simons YB, Sella G. The impact of recent population history on the deleterious mutation load in humans and close evolutionary relatives. Curr Opin Genet Dev. 2016;41: 150–158. pmid:27744216
- View Article
- PubMed/NCBI
- Google Scholar
15. Lohmueller KE. The Impact of Population Demography and Selection on the Genetic Architecture of Complex Traits. Williams SM, editor. PLoS Genet. 2014;10: e1004379. pmid:24875776
- View Article
- PubMed/NCBI
- Google Scholar
16. Willi Y, Fracassetti M, Zoller S, Van Buskirk J. Accumulation of Mutational Load at the Edges of a Species Range. Mol Biol Evol. 2018;35: 781–791. pmid:29346601
- View Article
- PubMed/NCBI
- Google Scholar
17. Willi Y, Fracassetti M, Bachmann O, Van Buskirk J. Demographic Processes Linked to Genetic Diversity and Positive Selection across a Species’ Range. Plant Communications. 2020;1: 100111. pmid:33367266
- View Article
- PubMed/NCBI
- Google Scholar
18. Perrier A, Sánchez-Castro D, Willi Y. Expressed mutational load increases toward the edge of a species’ geographic range. Evolution. 2020;74: 1711–1723. pmid:32538471
- View Article
- PubMed/NCBI
- Google Scholar
19. de Pedro M, Riba M, González-Martínez SC, Seoane P, Bautista R, Claros MG, et al. Demography, genetic diversity and expansion load in the colonizing species Leontodon longirostris (Asteraceae) throughout its native range. Molecular Ecology. 2021;30: 1190–1205. pmid:33452714
- View Article
- PubMed/NCBI
- Google Scholar
20. González-Martínez SC, Ridout K, Pannell JR. Range Expansion Compromises Adaptive Evolution in an Outcrossing Plant. Current Biology. 2017;27: 2544–2551.e4. pmid:28803874
- View Article
- PubMed/NCBI
- Google Scholar
21. Lucek K, Willi Y. Drivers of linkage disequilibrium across a species’ geographic range. PLOS Genetics. 2021;17: e1009477. pmid:33770075
- View Article
- PubMed/NCBI
- Google Scholar
22. Bosshard L, Dupanloup I, Tenaillon O, Bruggmann R, Ackermann M, Peischl S, et al. Accumulation of Deleterious Mutations During Bacterial Range Expansions. Genetics. 2017;207: 669–684. pmid:28821588
- View Article
- PubMed/NCBI
- Google Scholar
23. Takou M, Hämälä T, Koch EM, Steige KA, Dittberner H, Yant L, et al. Maintenance of Adaptive Dynamics and No Detectable Load in a Range-Edge Outcrossing Plant Population. Molecular Biology and Evolution. 2021;38: 1820–1836. pmid:33480994
- View Article
- PubMed/NCBI
- Google Scholar
24. Rousselle M, Simion P, Tilak M-K, Figuet E, Nabholz B, Galtier N. Is adaptation limited by mutation? A timescale-dependent effect of genetic diversity on the adaptive substitution rate in animals. PLoS Genet. 2020;16: e1008668. pmid:32251427
- View Article
- PubMed/NCBI
- Google Scholar
25. Lanfear R, Kokko H, Eyre-Walker A. Population size and the rate of evolution. Trends Ecol Evol. 2014;29: 33–41. pmid:24148292
- View Article
- PubMed/NCBI
- Google Scholar
26. McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351: 652–654. pmid:1904993
- View Article
- PubMed/NCBI
- Google Scholar
27. Laenen B, Tedder A, Nowak MD, Toräng P, Wunder J, Wötzel S, et al. Demography and mating system shape the genome-wide impact of purifying selection in Arabis alpina. PNAS. 2018;115: 816–821. pmid:29301967
- View Article
- PubMed/NCBI
- Google Scholar
28. Henn BM, Botigué LR, Peischl S, Dupanloup I, Lipatov M, Maples BK, et al. Distance from sub-Saharan Africa predicts mutational load in diverse human genomes. PNAS. 2016;113: E440–E449. pmid:26712023
- View Article
- PubMed/NCBI
- Google Scholar
29. Do R, Balick D, Li H, Adzhubei I, Sunyaev S, Reich D. No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans. Nat Genet. 2015;47: 126–131. pmid:25581429
- View Article
- PubMed/NCBI
- Google Scholar
30. Felsenstein J. The evolutionary advantage of recombination. Genetics. 1974;78: 737–756. pmid:4448362
- View Article
- PubMed/NCBI
- Google Scholar
31. Nordborg M, Charlesworth B, Charlesworth D. The effect of recombination on background selection. Genet Res. 1996;67: 159–174. pmid:8801188
- View Article
- PubMed/NCBI
- Google Scholar
32. Muller HJ. The relation of recombination to mutational advance. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 1964;1: 2–9. pmid:14195748
- View Article
- PubMed/NCBI
- Google Scholar
33. Hill WG, Robertson A. The effect of linkage on limits to artificial selection. Genetics Research. 1966;8: 269–294. pmid:5980116
- View Article
- PubMed/NCBI
- Google Scholar
34. Hill RR. Selection in autotetraploids. Theor Appl Genet. 1970;41: 181–186. pmid:24430193
- View Article
- PubMed/NCBI
- Google Scholar
35. Baduel P, Bray S, Vallejo-Marin M, Kolář F, Yant L. The “Polyploid Hop”: Shifting Challenges and Opportunities Over the Evolutionary Lifespan of Genome Duplications. Frontiers in Ecology and Evolution. 2018;6. Available: https://www.frontiersin.org/articles/10.3389/fevo.2018.00117.
- View Article
- Google Scholar
36. Monnahan P, Brandvain Y. The effect of autopolyploidy on population genetic signals of hard sweeps. Biology Letters. 2020;16: 20190796. pmid:32097595
- View Article
- PubMed/NCBI
- Google Scholar
37. Baduel P, Quadrana L, Hunter B, Bomblies K, Colot V. Relaxed purifying selection in autopolyploids drives transposable element over-accumulation which provides variants for local adaptation. Nat Commun. 2019;10: 5818. pmid:31862875
- View Article
- PubMed/NCBI
- Google Scholar
38. Monnahan P, Kolář F, Baduel P, Sailer C, Koch J, Horvath R, et al. Pervasive population genomic consequences of genome duplication in Arabidopsis arenosa. Nat Ecol Evol. 2019;3: 457–468. pmid:30804518
- View Article
- PubMed/NCBI
- Google Scholar
39. Sergeant CJ, Sexton EK, Moore JW, Westwood AR, Nagorski SA, Ebersole JL, et al. Risks of mining to salmonid-bearing watersheds. Science Advances. 2022;8: eabn0929. pmid:35776798
- View Article
- PubMed/NCBI
- Google Scholar
40. Gustafson RG, Waples RS, Myers JM, Weitkamp LA, Bryant GJ, Johnson OW, et al. Pacific Salmon Extinctions: Quantifying Lost and Remaining Diversity. Conservation Biology. 2007;21: 1009–1020. pmid:17650251
- View Article
- PubMed/NCBI
- Google Scholar
41. Rondeau EB, Christensen KA, Minkley DR, Leong JS, Chan MTT, Despins CA, et al. Population-size history inferences from the coho salmon (Oncorhynchus kisutch) genome. G3 Genes|Genomes|Genetics. 2023;13: jkad033. pmid:36759939
- View Article
- PubMed/NCBI
- Google Scholar
42. Rougemont Q, Moore J-S, Leroy T, Normandeau E, Rondeau EB, Withler RE, et al. Demographic history shaped geographical patterns of deleterious mutation load in a broadly distributed Pacific Salmon. PLOS Genetics. 2020;16: e1008348. pmid:32845885
- View Article
- PubMed/NCBI
- Google Scholar
43. Rougemont Q, Xuereb A, Dallaire X, Moore J-S, Normandeau E, Perreault-Payette A, et al. Long-distance migration is a major factor driving local adaptation at continental scale in Coho salmon. Molecular Ecology. 2023;32: 542–559. pmid:35000273
- View Article
- PubMed/NCBI
- Google Scholar
44. Macqueen DJ, Johnston IA. A well-constrained estimate for the timing of the salmonid whole genome duplication reveals major decoupling from species diversification. Proc Biol Sci. 2014;281. pmid:24452024
- View Article
- PubMed/NCBI
- Google Scholar
45. Novembre J, Stephens M. Interpreting principal component analyses of spatial population genetic variation. Nat Genet. 2008;40: 646–649. pmid:18425127
- View Article
- PubMed/NCBI
- Google Scholar
46. Weir BS, Goudet J. A Unified Characterization of Population Structure and Relatedness. Genetics. 2017;206: 2085–2103. pmid:28550018
- View Article
- PubMed/NCBI
- Google Scholar
47. natural earth data. https://www.naturalearthdata.com/http//www.naturalearthdata.com/download/110m/physical/110m_physical.zip.
48. Hill WG, Robertson A. Linkage disequilibrium in finite populations. Theor Appl Genet. 1968;38: 226–231. pmid:24442307
- View Article
- PubMed/NCBI
- Google Scholar
49. Pickrell JK, Pritchard JK. Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data. PLOS Genetics. 2012;8: e1002967. pmid:23166502
- View Article
- PubMed/NCBI
- Google Scholar
50. Terhorst J, Kamm JA, Song YS. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nature Genetics. 2017;49: 303–309. pmid:28024154
- View Article
- PubMed/NCBI
- Google Scholar
51. world Hire data. worldHires data public domain https://www.evl.uic.edu/pape/data/WDB/namer.tar.gz).
52. Gravel S. When Is Selection Effective? Genetics. 2016;203: 451–462. pmid:27010021
- View Article
- PubMed/NCBI
- Google Scholar
53. Brandvain Y, Wright SI. The Limits of Natural Selection in a Nonequilibrium World. Trends in Genetics. 2016;32: 201–210. pmid:26874998
- View Article
- PubMed/NCBI
- Google Scholar
54. Bertorelle G, Raffini F, Bosse M, Bortoluzzi C, Iannucci A, Trucchi E, et al. Genetic load: genomic estimates and applications in non-model animals. Nat Rev Genet. 2022;23: 492–503. pmid:35136196
- View Article
- PubMed/NCBI
- Google Scholar
55. Kyriazis C, Wayne RK, Lohmueller KE. High genetic diversity can contribute to extinction in small populations | bioRxiv. 10 Mar 2020 [cited 10 Mar 2020]. Available: https://www.biorxiv.org/content/10.1101/678524v1.
- View Article
- Google Scholar
56. Galtier N. Adaptive Protein Evolution in Animals and the Effective Population Size Hypothesis. PLOS Genetics. 2016;12: e1005774. pmid:26752180
- View Article
- PubMed/NCBI
- Google Scholar
57. Ryman N, Laikre L. Effects of Supportive Breeding on the Genetically Effective Population Size. Conservation Biology. 1991;5: 325–329.
- View Article
- Google Scholar
58. Berdan EL, Blanckaert A, Butlin RK, Bank C. Deleterious mutation accumulation and the long-term fate of chromosomal inversions. PLOS Genetics. 2021;17: e1009411. pmid:33661924
- View Article
- PubMed/NCBI
- Google Scholar
59. Leroy T, Rousselle M, Tilak M-K, Caizergues AE, Scornavacca C, Recuerda M, et al. Island songbirds as windows into evolution in small populations. Current Biology. 2021 [cited 3 Mar 2021]. pmid:33476557
- View Article
- PubMed/NCBI
- Google Scholar
60. Rousselle M, Laverré A, Figuet E, Nabholz B, Galtier N. Influence of Recombination and GC-biased Gene Conversion on the Adaptive and Nonadaptive Substitution Rate in Mammals versus Birds. Mol Biol Evol. 2019;36: 458–471. pmid:30590692
- View Article
- PubMed/NCBI
- Google Scholar
61. MacPherson B, Scott R, Gras R. Sex and recombination purge the genome of deleterious alleles: An Individual Based Modeling Approach. Ecological Complexity. 2021;45: 100910.
- View Article
- Google Scholar
62. Kent TV, Uzunović J, Wright SI. Coevolution between transposable elements and recombination. Philosophical Transactions of the Royal Society B: Biological Sciences. 2017;372: 20160458. pmid:29109221
- View Article
- PubMed/NCBI
- Google Scholar
63. Xuereb A, Rougemont Q, Dallaire X, Moore J-S, Normandeau E, Bougas B, et al. Re-evaluating Coho salmon (Oncorhynchus kisutch) conservation units in Canada using genomic data. Evolutionary Applications. 2022;15: 1925–1944. pmid:36426130
- View Article
- PubMed/NCBI
- Google Scholar
64. Government of Canada PS and PC. Stock assessment of Thompson River/upper Fraser River coho salmon / J.R. Irvine … [et al.].: Fs70-1/1999-28E-PDF—Government of Canada Publications—Canada.ca. 1 Jul 2002 [cited 11 Aug 2022]. Available: https://publications.gc.ca/site/eng/9.804890/publication.html.
65. Waples RS, Hindar K, Karlsson S, Hard JJ. Evaluating the Ryman–Laikre effect for marine stock enhancement and aquaculture. Curr Zool. 2016;62: 617–627. pmid:29491949
- View Article
- PubMed/NCBI
- Google Scholar
66. Gilbert-Horvath EA, Pipal KA, Spence BC, Williams TH, Garza JC. Hierarchical Phylogeographic Structure of Coho Salmon in California. Transactions of the American Fisheries Society. 2016;145: 1122–1138.
- View Article
- Google Scholar
67. Kratochvíl L, Rovatsos M. Ratios can be misleading for detecting selection. Curr Biol. 2022;32: R28–R30. pmid:35015989
- View Article
- PubMed/NCBI
- Google Scholar
68. Leroy T, Nabholz B. Response to Kratochvíl and Rovatsos. Curr Biol. 2022;32: R30–R31. pmid:35015990
- View Article
- PubMed/NCBI
- Google Scholar
69. Simons YB, Turchin MC, Pritchard JK, Sella G. The deleterious mutation load is insensitive to recent population history. Nature Genetics. 2014;46: 220–224. pmid:24509481
- View Article
- PubMed/NCBI
- Google Scholar
70. Ramu P, Esuma W, Kawuki R, Rabbi IY, Egesi C, Bredeson JV, et al. Cassava haplotype map highlights fixation of deleterious mutations during clonal propagation. Nat Genet. 2017;49: 959–963. pmid:28416819
- View Article
- PubMed/NCBI
- Google Scholar
71. Harwood MP, Alves I, Edgington H, Agbessi M, Bruat V, Soave D, et al. Recombination affects allele-specific expression of deleterious variants in human populations. Sci Adv. 8: eabl3819. pmid:35559670
- View Article
- PubMed/NCBI
- Google Scholar
72. Hussin JG, Hodgkinson A, Idaghdour Y, Grenier J-C, Goulet J-P, Gbeha E, et al. Recombination affects accumulation of damaging and disease-associated mutations in human populations. Nat Genet. 2015;47: 400–404. pmid:25685891
- View Article
- PubMed/NCBI
- Google Scholar
73. Alves I, Houle AA, Hussin JG, Awadalla P. The impact of recombination on human mutation load and disease. Philos Trans R Soc Lond B Biol Sci. 2017;372: 20160465. pmid:29109227
- View Article
- PubMed/NCBI
- Google Scholar
74. Gillard GB, Grønvold L, Røsæg LL, Holen MM, Monsen Ø, Koop BF, et al. Comparative regulomics supports pervasive selection on gene dosage following whole genome duplication. Genome Biology. 2021;22: 103. pmid:33849620
- View Article
- PubMed/NCBI
- Google Scholar
75. Conover JL, Wendel JF. Deleterious Mutations Accumulate Faster in Allopolyploid Than Diploid Cotton (Gossypium) and Unequally between Subgenomes. Molecular Biology and Evolution. 2022;39: msac024. pmid:35099532
- View Article
- PubMed/NCBI
- Google Scholar
76. Haldane J. The Causes of Evolution. 1932 [cited 9 Jun 2023]. Available: https://jbshaldane.org/books/1932-Causes-of-Evolution/index.html.
77. Khan A, Patel K, Shukla H, Viswanathan A, van der Valk T, Borthakur U, et al. Genomic evidence for inbreeding depression and purging of deleterious genetic variation in Indian tigers. Proceedings of the National Academy of Sciences. 2021;118: e2023018118. pmid:34848534
- View Article
- PubMed/NCBI
- Google Scholar
78. Robinson JA, Räikkönen J, Vucetich LM, Vucetich JA, Peterson RO, Lohmueller KE, et al. Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Sci Adv. 2019;5: eaau0757. pmid:31149628
- View Article
- PubMed/NCBI
- Google Scholar
79. Grossen C, Guillaume F, Keller LF, Croll D. Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nat Commun. 2020;11: 1001. pmid:32081890
- View Article
- PubMed/NCBI
- Google Scholar
80. Kleinman-Ruiz D, Lucena-Perez M, Villanueva B, Fernández J, Saveljev AP, Ratkiewicz M, et al. Purging of deleterious burden in the endangered Iberian lynx. Proceedings of the National Academy of Sciences. 2022;119: e2110614119. pmid:35238662
- View Article
- PubMed/NCBI
- Google Scholar
81. Parmesan C, Yohe G. A globally coherent fingerprint of climate change impacts across natural systems. Nature. 2003;421: 37–42. pmid:12511946
- View Article
- PubMed/NCBI
- Google Scholar
82. Irvine JR, Fukuwaka M. Pacific salmon abundance trends and climate change. ICES J Mar Sci. 2011;68: 1122–1130.
- View Article
- Google Scholar
83. Teixeira JC, Huber CD. The inflated significance of neutral genetic diversity in conservation genetics. PNAS. 2021;118. pmid:33608481
- View Article
- PubMed/NCBI
- Google Scholar
84. García-Dorado A, Caballero A. Neutral genetic diversity as a useful tool for conservation biology. Conserv Genet. 2021;22: 541–545.
- View Article
- Google Scholar
85. Kardos M, Armstrong EE, Fitzpatrick SW, Hauser S, Hedrick PW, Miller JM, et al. The crucial role of genome-wide genetic variation in conservation. Proceedings of the National Academy of Sciences. 2021;118: e2104642118. pmid:34772759
- View Article
- PubMed/NCBI
- Google Scholar
86. Ripple WJ, Wolf C, Newsome TM, Gregg JW, Lenton TM, Palomo I, et al. World Scientists’ Warning of a Climate Emergency 2021. BioScience. 2021;71: 894–898.
- View Article
- Google Scholar
87. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34: i884–i890. pmid:30423086
- View Article
- PubMed/NCBI
- Google Scholar
88. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:13033997 [q-bio]. 2013 [cited 3 Jul 2019]. Available: http://arxiv.org/abs/1303.3997.
- View Article
- Google Scholar
89. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. 2011;43: 491. pmid:21478889
- View Article
- PubMed/NCBI
- Google Scholar
90. Dray S, Dufour A-B. The ade4 Package: Implementing the Duality Diagram for Ecologists. Journal of Statistical Software. 2007;22: 1–20.
- View Article
- Google Scholar
91. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genet. 2007;81: 559–575. pmid:17701901
- View Article
- PubMed/NCBI
- Google Scholar
92. Christensen KA, Rondeau EB, Minkley DR, Sakhrani D, Biagi CA, Flores A-M, et al. The sockeye salmon genome, transcriptome, and analyses identifying population defining regions of the genome. PLOS ONE. 2020;15: e0240935. pmid:33119641
- View Article
- PubMed/NCBI
- Google Scholar
93. Christensen KA, Rondeau EB, Sakhrani D, Biagi CA, Johnson H, Joshi J, et al. The pink salmon genome: Uncovering the genomic consequences of a two-year life cycle. PLoS One. 2021;16: e0255752. pmid:34919547
- View Article
- PubMed/NCBI
- Google Scholar
94. Christensen KA, Leong JS, Sakhrani D, Biagi CA, Minkley DR, Withler RE, et al. Chinook salmon (Oncorhynchus tshawytscha) genome and transcriptome. PLOS ONE. 2018;13: e0195461. pmid:29621340
- View Article
- PubMed/NCBI
- Google Scholar
95. Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014;15: 356. pmid:25420514
- View Article
- PubMed/NCBI
- Google Scholar
96. Goudet J. hierfstat, a package for r to compute and test hierarchical F-statistics. Molecular Ecology Notes. 2005;5: 184–186.
- View Article
- Google Scholar
97. Nei M. Molecular Evolutionary Genetics. Molecular Evolutionary Genetics. Columbia University Press; 2019.
- View Article
- Google Scholar
98. Pante E, Simon-Bouhet B. marmap: A Package for Importing, Plotting and Analyzing Bathymetric and Topographic Data in R. PLOS ONE. 2013;8: e73051. pmid:24019892
- View Article
- PubMed/NCBI
- Google Scholar
99. Zhang C, Dong S-S, Xu J-Y, He W-M, Yang T-L. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35: 1786–1788. pmid:30321304
- View Article
- PubMed/NCBI
- Google Scholar
100. Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag; 2009. Available: https://www.springer.com/gp/book/9780387981413.
101. Gutenkunst RN, Hernandez RD, Williamson SH, Bustamante CD. Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data. PLOS Genetics. 2009;5: e1000695. pmid:19851460
- View Article
- PubMed/NCBI
- Google Scholar
102. Weir BS, Cockerham CC. Estimating F-Statistics for the Analysis of Population Structure. Evolution. 1984;38: 1358–1370. pmid:28563791
- View Article
- PubMed/NCBI
- Google Scholar
103. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27: 2156–2158. pmid:21653522
- View Article
- PubMed/NCBI
- Google Scholar
104. Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35: 526–528. pmid:30016406
- View Article
- PubMed/NCBI
- Google Scholar
105. Revell L. phytools: An R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution. 2012; 3: 217–223.
- View Article
- Google Scholar
106. Jombart T, Balloux F, Dray S. adephylo: new tools for investigating the phylogenetic signal in biological traits. Bioinformatics. 2010;26: 1907–1909. pmid:20525823
- View Article
- PubMed/NCBI
- Google Scholar
107. Delaneau O, Zagury J-F, Robinson MR, Marchini JL, Dermitzakis ET. Accurate, scalable and integrative haplotype estimation. Nat Commun. 2019;10: 5436. pmid:31780650
- View Article
- PubMed/NCBI
- Google Scholar
108. McVean G, Awadalla P, Fearnhead P. A Coalescent-Based Method for Detecting and Estimating Recombination From Gene Sequences. Genetics. 2002;160: 1231–1241. pmid:11901136
- View Article
- PubMed/NCBI
- Google Scholar
109. Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS ONE. 2012;7: e46688. pmid:23056405
- View Article
- PubMed/NCBI
- Google Scholar
110. Eyre-Walker A, Keightley PD. Estimating the Rate of Adaptive Molecular Evolution in the Presence of Slightly Deleterious Mutations and Population Size Change. Molecular Biology and Evolution. 2009;26: 2097–2108. pmid:19535738
- View Article
- PubMed/NCBI
- Google Scholar
111. Guéguen L, Gaillard S, Boussau B, Gouy M, Groussin M, Rochette NC, et al. Bio++: Efficient Extensible Libraries and Tools for Computational Molecular Evolution. Molecular Biology and Evolution. 2013;30: 1745–1750. pmid:23699471
- View Article
- PubMed/NCBI
- Google Scholar
112. Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics. 2009;Chapter 4: Unit 4.10. pmid:19274634
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Hewitt GM. Genetic consequences of climatic oscillations in the Quaternary. Philos Trans R Soc Lond, B, Biol Sci. 2004;359: 183–195; discussion 195. pmid:15101575
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Excoffier L, Foll M, Petit RJ. Genetic Consequences of Range Expansions. Annual Review of Ecology, Evolution, and Systematics. 2009;40: 481.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Slatkin M, Excoffier L. Serial Founder Effects During Range Expansion: A Spatial Analog of Genetic Drift. Genetics. 2012;191: 171–181. pmid:22367031
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Ohta T. Linkage disequilibrium due to random genetic drift in finite subdivided populations. Proc Natl Acad Sci U S A. 1982;79: 1940–1944. pmid:16593171
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Hallatschek O, Hersen P, Ramanathan S, Nelson DR. Genetic drift at expanding frontiers promotes gene segregation. Proc Natl Acad Sci U S A. 2007;104: 19926–19930. pmid:18056799
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Klopfstein S, Currat M, Excoffier L. The Fate of Mutations Surfing on the Wave of a Range Expansion. Molecular Biology and Evolution. 2006;23: 482–490. pmid:16280540
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Travis JMJ, Münkemüller T, Burton OJ, Best A, Dytham C, Johst K. Deleterious Mutations Can Surf to High Densities on the Wave Front of an Expanding Population. Molecular Biology and Evolution. 2007;24: 2334–2343. pmid:17703053
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Peischl S, Dupanloup I, Kirkpatrick M, Excoffier L. On the accumulation of deleterious mutations during range expansions. Mol Ecol. 2013;22: 5972–5982. pmid:24102784
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Peischl S, Excoffier L. Expansion load: recessive mutations and the role of standing genetic variation. Molecular Ecology. 2015;24: 2084–2094. pmid:25786336
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Peischl S, Dupanloup I, Foucal A, Jomphe M, Bruat V, Grenier J-C, et al. Relaxed Selection During a Recent Human Expansion. Genetics. 2018;208: 763–777. pmid:29187508
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref11] 11. Haag CR, Roze D. Genetic Load in Sexual and Asexual Diploids: Segregation, Dominance and Genetic Drift. Genetics. 2007;176: 1663–1678. pmid:17483409
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref12] 12. Ohta T. Slightly Deleterious Mutant Substitutions in Evolution. Nature. 1973;246: 96–98. pmid:4585855
View Article
PubMed/NCBI
Google Scholar

[45] View Article

[46] PubMed/NCBI

[47] Google Scholar

[ref13] 13. Hedrick PW, Garcia-Dorado A. Understanding Inbreeding Depression, Purging, and Genetic Rescue. Trends in Ecology & Evolution. 2016;31: 940–952. pmid:27743611
View Article
PubMed/NCBI
Google Scholar

[49] View Article

[50] PubMed/NCBI

[51] Google Scholar

[ref14] 14. Simons YB, Sella G. The impact of recent population history on the deleterious mutation load in humans and close evolutionary relatives. Curr Opin Genet Dev. 2016;41: 150–158. pmid:27744216
View Article
PubMed/NCBI
Google Scholar

[53] View Article

[54] PubMed/NCBI

[55] Google Scholar

[ref15] 15. Lohmueller KE. The Impact of Population Demography and Selection on the Genetic Architecture of Complex Traits. Williams SM, editor. PLoS Genet. 2014;10: e1004379. pmid:24875776
View Article
PubMed/NCBI
Google Scholar

[57] View Article

[58] PubMed/NCBI

[59] Google Scholar

[ref16] 16. Willi Y, Fracassetti M, Zoller S, Van Buskirk J. Accumulation of Mutational Load at the Edges of a Species Range. Mol Biol Evol. 2018;35: 781–791. pmid:29346601
View Article
PubMed/NCBI
Google Scholar

[61] View Article

[62] PubMed/NCBI

[63] Google Scholar

[ref17] 17. Willi Y, Fracassetti M, Bachmann O, Van Buskirk J. Demographic Processes Linked to Genetic Diversity and Positive Selection across a Species’ Range. Plant Communications. 2020;1: 100111. pmid:33367266
View Article
PubMed/NCBI
Google Scholar

[65] View Article

[66] PubMed/NCBI

[67] Google Scholar

[ref18] 18. Perrier A, Sánchez-Castro D, Willi Y. Expressed mutational load increases toward the edge of a species’ geographic range. Evolution. 2020;74: 1711–1723. pmid:32538471
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref19] 19. de Pedro M, Riba M, González-Martínez SC, Seoane P, Bautista R, Claros MG, et al. Demography, genetic diversity and expansion load in the colonizing species Leontodon longirostris (Asteraceae) throughout its native range. Molecular Ecology. 2021;30: 1190–1205. pmid:33452714
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref20] 20. González-Martínez SC, Ridout K, Pannell JR. Range Expansion Compromises Adaptive Evolution in an Outcrossing Plant. Current Biology. 2017;27: 2544–2551.e4. pmid:28803874
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref21] 21. Lucek K, Willi Y. Drivers of linkage disequilibrium across a species’ geographic range. PLOS Genetics. 2021;17: e1009477. pmid:33770075
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref22] 22. Bosshard L, Dupanloup I, Tenaillon O, Bruggmann R, Ackermann M, Peischl S, et al. Accumulation of Deleterious Mutations During Bacterial Range Expansions. Genetics. 2017;207: 669–684. pmid:28821588
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref23] 23. Takou M, Hämälä T, Koch EM, Steige KA, Dittberner H, Yant L, et al. Maintenance of Adaptive Dynamics and No Detectable Load in a Range-Edge Outcrossing Plant Population. Molecular Biology and Evolution. 2021;38: 1820–1836. pmid:33480994
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref24] 24. Rousselle M, Simion P, Tilak M-K, Figuet E, Nabholz B, Galtier N. Is adaptation limited by mutation? A timescale-dependent effect of genetic diversity on the adaptive substitution rate in animals. PLoS Genet. 2020;16: e1008668. pmid:32251427
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref25] 25. Lanfear R, Kokko H, Eyre-Walker A. Population size and the rate of evolution. Trends Ecol Evol. 2014;29: 33–41. pmid:24148292
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref26] 26. McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351: 652–654. pmid:1904993
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref27] 27. Laenen B, Tedder A, Nowak MD, Toräng P, Wunder J, Wötzel S, et al. Demography and mating system shape the genome-wide impact of purifying selection in Arabis alpina. PNAS. 2018;115: 816–821. pmid:29301967
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref28] 28. Henn BM, Botigué LR, Peischl S, Dupanloup I, Lipatov M, Maples BK, et al. Distance from sub-Saharan Africa predicts mutational load in diverse human genomes. PNAS. 2016;113: E440–E449. pmid:26712023
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref29] 29. Do R, Balick D, Li H, Adzhubei I, Sunyaev S, Reich D. No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans. Nat Genet. 2015;47: 126–131. pmid:25581429
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref30] 30. Felsenstein J. The evolutionary advantage of recombination. Genetics. 1974;78: 737–756. pmid:4448362
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref31] 31. Nordborg M, Charlesworth B, Charlesworth D. The effect of recombination on background selection. Genet Res. 1996;67: 159–174. pmid:8801188
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref32] 32. Muller HJ. The relation of recombination to mutational advance. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 1964;1: 2–9. pmid:14195748
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref33] 33. Hill WG, Robertson A. The effect of linkage on limits to artificial selection. Genetics Research. 1966;8: 269–294. pmid:5980116
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref34] 34. Hill RR. Selection in autotetraploids. Theor Appl Genet. 1970;41: 181–186. pmid:24430193
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref35] 35. Baduel P, Bray S, Vallejo-Marin M, Kolář F, Yant L. The “Polyploid Hop”: Shifting Challenges and Opportunities Over the Evolutionary Lifespan of Genome Duplications. Frontiers in Ecology and Evolution. 2018;6. Available: https://www.frontiersin.org/articles/10.3389/fevo.2018.00117.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref36] 36. Monnahan P, Brandvain Y. The effect of autopolyploidy on population genetic signals of hard sweeps. Biology Letters. 2020;16: 20190796. pmid:32097595
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref37] 37. Baduel P, Quadrana L, Hunter B, Bomblies K, Colot V. Relaxed purifying selection in autopolyploids drives transposable element over-accumulation which provides variants for local adaptation. Nat Commun. 2019;10: 5818. pmid:31862875
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref38] 38. Monnahan P, Kolář F, Baduel P, Sailer C, Koch J, Horvath R, et al. Pervasive population genomic consequences of genome duplication in Arabidopsis arenosa. Nat Ecol Evol. 2019;3: 457–468. pmid:30804518
View Article
PubMed/NCBI
Google Scholar

[148] View Article

[149] PubMed/NCBI

[150] Google Scholar

[ref39] 39. Sergeant CJ, Sexton EK, Moore JW, Westwood AR, Nagorski SA, Ebersole JL, et al. Risks of mining to salmonid-bearing watersheds. Science Advances. 2022;8: eabn0929. pmid:35776798
View Article
PubMed/NCBI
Google Scholar

[152] View Article

[153] PubMed/NCBI

[154] Google Scholar

[ref40] 40. Gustafson RG, Waples RS, Myers JM, Weitkamp LA, Bryant GJ, Johnson OW, et al. Pacific Salmon Extinctions: Quantifying Lost and Remaining Diversity. Conservation Biology. 2007;21: 1009–1020. pmid:17650251
View Article
PubMed/NCBI
Google Scholar

[156] View Article

[157] PubMed/NCBI

[158] Google Scholar

[ref41] 41. Rondeau EB, Christensen KA, Minkley DR, Leong JS, Chan MTT, Despins CA, et al. Population-size history inferences from the coho salmon (Oncorhynchus kisutch) genome. G3 Genes|Genomes|Genetics. 2023;13: jkad033. pmid:36759939
View Article
PubMed/NCBI
Google Scholar

[160] View Article

[161] PubMed/NCBI

[162] Google Scholar

[ref42] 42. Rougemont Q, Moore J-S, Leroy T, Normandeau E, Rondeau EB, Withler RE, et al. Demographic history shaped geographical patterns of deleterious mutation load in a broadly distributed Pacific Salmon. PLOS Genetics. 2020;16: e1008348. pmid:32845885
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

[ref43] 43. Rougemont Q, Xuereb A, Dallaire X, Moore J-S, Normandeau E, Perreault-Payette A, et al. Long-distance migration is a major factor driving local adaptation at continental scale in Coho salmon. Molecular Ecology. 2023;32: 542–559. pmid:35000273
View Article
PubMed/NCBI
Google Scholar

[168] View Article

[169] PubMed/NCBI

[170] Google Scholar

[ref44] 44. Macqueen DJ, Johnston IA. A well-constrained estimate for the timing of the salmonid whole genome duplication reveals major decoupling from species diversification. Proc Biol Sci. 2014;281. pmid:24452024
View Article
PubMed/NCBI
Google Scholar

[172] View Article

[173] PubMed/NCBI

[174] Google Scholar

[ref45] 45. Novembre J, Stephens M. Interpreting principal component analyses of spatial population genetic variation. Nat Genet. 2008;40: 646–649. pmid:18425127
View Article
PubMed/NCBI
Google Scholar

[176] View Article

[177] PubMed/NCBI

[178] Google Scholar

[ref46] 46. Weir BS, Goudet J. A Unified Characterization of Population Structure and Relatedness. Genetics. 2017;206: 2085–2103. pmid:28550018
View Article
PubMed/NCBI
Google Scholar

[180] View Article

[181] PubMed/NCBI

[182] Google Scholar

[ref47] 47. natural earth data. https://www.naturalearthdata.com/http//www.naturalearthdata.com/download/110m/physical/110m_physical.zip.

[ref48] 48. Hill WG, Robertson A. Linkage disequilibrium in finite populations. Theor Appl Genet. 1968;38: 226–231. pmid:24442307
View Article
PubMed/NCBI
Google Scholar

[185] View Article

[186] PubMed/NCBI

[187] Google Scholar

[ref49] 49. Pickrell JK, Pritchard JK. Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data. PLOS Genetics. 2012;8: e1002967. pmid:23166502
View Article
PubMed/NCBI
Google Scholar

[189] View Article

[190] PubMed/NCBI

[191] Google Scholar

[ref50] 50. Terhorst J, Kamm JA, Song YS. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nature Genetics. 2017;49: 303–309. pmid:28024154
View Article
PubMed/NCBI
Google Scholar

[193] View Article

[194] PubMed/NCBI

[195] Google Scholar

[ref51] 51. world Hire data. worldHires data public domain https://www.evl.uic.edu/pape/data/WDB/namer.tar.gz).

[ref52] 52. Gravel S. When Is Selection Effective? Genetics. 2016;203: 451–462. pmid:27010021
View Article
PubMed/NCBI
Google Scholar

[198] View Article

[199] PubMed/NCBI

[200] Google Scholar

[ref53] 53. Brandvain Y, Wright SI. The Limits of Natural Selection in a Nonequilibrium World. Trends in Genetics. 2016;32: 201–210. pmid:26874998
View Article
PubMed/NCBI
Google Scholar

[202] View Article

[203] PubMed/NCBI

[204] Google Scholar

[ref54] 54. Bertorelle G, Raffini F, Bosse M, Bortoluzzi C, Iannucci A, Trucchi E, et al. Genetic load: genomic estimates and applications in non-model animals. Nat Rev Genet. 2022;23: 492–503. pmid:35136196
View Article
PubMed/NCBI
Google Scholar

[206] View Article

[207] PubMed/NCBI

[208] Google Scholar

[ref55] 55. Kyriazis C, Wayne RK, Lohmueller KE. High genetic diversity can contribute to extinction in small populations | bioRxiv. 10 Mar 2020 [cited 10 Mar 2020]. Available: https://www.biorxiv.org/content/10.1101/678524v1.
View Article
Google Scholar

[210] View Article

[211] Google Scholar

[ref56] 56. Galtier N. Adaptive Protein Evolution in Animals and the Effective Population Size Hypothesis. PLOS Genetics. 2016;12: e1005774. pmid:26752180
View Article
PubMed/NCBI
Google Scholar

[213] View Article

[214] PubMed/NCBI

[215] Google Scholar

[ref57] 57. Ryman N, Laikre L. Effects of Supportive Breeding on the Genetically Effective Population Size. Conservation Biology. 1991;5: 325–329.
View Article
Google Scholar

[217] View Article

[218] Google Scholar

[ref58] 58. Berdan EL, Blanckaert A, Butlin RK, Bank C. Deleterious mutation accumulation and the long-term fate of chromosomal inversions. PLOS Genetics. 2021;17: e1009411. pmid:33661924
View Article
PubMed/NCBI
Google Scholar

[220] View Article

[221] PubMed/NCBI

[222] Google Scholar

[ref59] 59. Leroy T, Rousselle M, Tilak M-K, Caizergues AE, Scornavacca C, Recuerda M, et al. Island songbirds as windows into evolution in small populations. Current Biology. 2021 [cited 3 Mar 2021]. pmid:33476557
View Article
PubMed/NCBI
Google Scholar

[224] View Article

[225] PubMed/NCBI

[226] Google Scholar

[ref60] 60. Rousselle M, Laverré A, Figuet E, Nabholz B, Galtier N. Influence of Recombination and GC-biased Gene Conversion on the Adaptive and Nonadaptive Substitution Rate in Mammals versus Birds. Mol Biol Evol. 2019;36: 458–471. pmid:30590692
View Article
PubMed/NCBI
Google Scholar

[228] View Article

[229] PubMed/NCBI

[230] Google Scholar

[ref61] 61. MacPherson B, Scott R, Gras R. Sex and recombination purge the genome of deleterious alleles: An Individual Based Modeling Approach. Ecological Complexity. 2021;45: 100910.
View Article
Google Scholar

[232] View Article

[233] Google Scholar

[ref62] 62. Kent TV, Uzunović J, Wright SI. Coevolution between transposable elements and recombination. Philosophical Transactions of the Royal Society B: Biological Sciences. 2017;372: 20160458. pmid:29109221
View Article
PubMed/NCBI
Google Scholar

[235] View Article

[236] PubMed/NCBI

[237] Google Scholar

[ref63] 63. Xuereb A, Rougemont Q, Dallaire X, Moore J-S, Normandeau E, Bougas B, et al. Re-evaluating Coho salmon (Oncorhynchus kisutch) conservation units in Canada using genomic data. Evolutionary Applications. 2022;15: 1925–1944. pmid:36426130
View Article
PubMed/NCBI
Google Scholar

[239] View Article

[240] PubMed/NCBI

[241] Google Scholar

[ref64] 64. Government of Canada PS and PC. Stock assessment of Thompson River/upper Fraser River coho salmon / J.R. Irvine … [et al.].: Fs70-1/1999-28E-PDF—Government of Canada Publications—Canada.ca. 1 Jul 2002 [cited 11 Aug 2022]. Available: https://publications.gc.ca/site/eng/9.804890/publication.html.

[ref65] 65. Waples RS, Hindar K, Karlsson S, Hard JJ. Evaluating the Ryman–Laikre effect for marine stock enhancement and aquaculture. Curr Zool. 2016;62: 617–627. pmid:29491949
View Article
PubMed/NCBI
Google Scholar

[244] View Article

[245] PubMed/NCBI

[246] Google Scholar

[ref66] 66. Gilbert-Horvath EA, Pipal KA, Spence BC, Williams TH, Garza JC. Hierarchical Phylogeographic Structure of Coho Salmon in California. Transactions of the American Fisheries Society. 2016;145: 1122–1138.
View Article
Google Scholar

[248] View Article

[249] Google Scholar

[ref67] 67. Kratochvíl L, Rovatsos M. Ratios can be misleading for detecting selection. Curr Biol. 2022;32: R28–R30. pmid:35015989
View Article
PubMed/NCBI
Google Scholar

[251] View Article

[252] PubMed/NCBI

[253] Google Scholar

[ref68] 68. Leroy T, Nabholz B. Response to Kratochvíl and Rovatsos. Curr Biol. 2022;32: R30–R31. pmid:35015990
View Article
PubMed/NCBI
Google Scholar

[255] View Article

[256] PubMed/NCBI

[257] Google Scholar

[ref69] 69. Simons YB, Turchin MC, Pritchard JK, Sella G. The deleterious mutation load is insensitive to recent population history. Nature Genetics. 2014;46: 220–224. pmid:24509481
View Article
PubMed/NCBI
Google Scholar

[259] View Article

[260] PubMed/NCBI

[261] Google Scholar

[ref70] 70. Ramu P, Esuma W, Kawuki R, Rabbi IY, Egesi C, Bredeson JV, et al. Cassava haplotype map highlights fixation of deleterious mutations during clonal propagation. Nat Genet. 2017;49: 959–963. pmid:28416819
View Article
PubMed/NCBI
Google Scholar

[263] View Article

[264] PubMed/NCBI

[265] Google Scholar

[ref71] 71. Harwood MP, Alves I, Edgington H, Agbessi M, Bruat V, Soave D, et al. Recombination affects allele-specific expression of deleterious variants in human populations. Sci Adv. 8: eabl3819. pmid:35559670
View Article
PubMed/NCBI
Google Scholar

[267] View Article

[268] PubMed/NCBI

[269] Google Scholar

[ref72] 72. Hussin JG, Hodgkinson A, Idaghdour Y, Grenier J-C, Goulet J-P, Gbeha E, et al. Recombination affects accumulation of damaging and disease-associated mutations in human populations. Nat Genet. 2015;47: 400–404. pmid:25685891
View Article
PubMed/NCBI
Google Scholar

[271] View Article

[272] PubMed/NCBI

[273] Google Scholar

[ref73] 73. Alves I, Houle AA, Hussin JG, Awadalla P. The impact of recombination on human mutation load and disease. Philos Trans R Soc Lond B Biol Sci. 2017;372: 20160465. pmid:29109227
View Article
PubMed/NCBI
Google Scholar

[275] View Article

[276] PubMed/NCBI

[277] Google Scholar

[ref74] 74. Gillard GB, Grønvold L, Røsæg LL, Holen MM, Monsen Ø, Koop BF, et al. Comparative regulomics supports pervasive selection on gene dosage following whole genome duplication. Genome Biology. 2021;22: 103. pmid:33849620
View Article
PubMed/NCBI
Google Scholar

[279] View Article

[280] PubMed/NCBI

[281] Google Scholar

[ref75] 75. Conover JL, Wendel JF. Deleterious Mutations Accumulate Faster in Allopolyploid Than Diploid Cotton (Gossypium) and Unequally between Subgenomes. Molecular Biology and Evolution. 2022;39: msac024. pmid:35099532
View Article
PubMed/NCBI
Google Scholar

[283] View Article

[284] PubMed/NCBI

[285] Google Scholar

[ref76] 76. Haldane J. The Causes of Evolution. 1932 [cited 9 Jun 2023]. Available: https://jbshaldane.org/books/1932-Causes-of-Evolution/index.html.

[ref77] 77. Khan A, Patel K, Shukla H, Viswanathan A, van der Valk T, Borthakur U, et al. Genomic evidence for inbreeding depression and purging of deleterious genetic variation in Indian tigers. Proceedings of the National Academy of Sciences. 2021;118: e2023018118. pmid:34848534
View Article
PubMed/NCBI
Google Scholar

[288] View Article

[289] PubMed/NCBI

[290] Google Scholar

[ref78] 78. Robinson JA, Räikkönen J, Vucetich LM, Vucetich JA, Peterson RO, Lohmueller KE, et al. Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Sci Adv. 2019;5: eaau0757. pmid:31149628
View Article
PubMed/NCBI
Google Scholar

[292] View Article

[293] PubMed/NCBI

[294] Google Scholar

[ref79] 79. Grossen C, Guillaume F, Keller LF, Croll D. Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nat Commun. 2020;11: 1001. pmid:32081890
View Article
PubMed/NCBI
Google Scholar

[296] View Article

[297] PubMed/NCBI

[298] Google Scholar

[ref80] 80. Kleinman-Ruiz D, Lucena-Perez M, Villanueva B, Fernández J, Saveljev AP, Ratkiewicz M, et al. Purging of deleterious burden in the endangered Iberian lynx. Proceedings of the National Academy of Sciences. 2022;119: e2110614119. pmid:35238662
View Article
PubMed/NCBI
Google Scholar

[300] View Article

[301] PubMed/NCBI

[302] Google Scholar

[ref81] 81. Parmesan C, Yohe G. A globally coherent fingerprint of climate change impacts across natural systems. Nature. 2003;421: 37–42. pmid:12511946
View Article
PubMed/NCBI
Google Scholar

[304] View Article

[305] PubMed/NCBI

[306] Google Scholar

[ref82] 82. Irvine JR, Fukuwaka M. Pacific salmon abundance trends and climate change. ICES J Mar Sci. 2011;68: 1122–1130.
View Article
Google Scholar

[308] View Article

[309] Google Scholar

[ref83] 83. Teixeira JC, Huber CD. The inflated significance of neutral genetic diversity in conservation genetics. PNAS. 2021;118. pmid:33608481
View Article
PubMed/NCBI
Google Scholar

[311] View Article

[312] PubMed/NCBI

[313] Google Scholar

[ref84] 84. García-Dorado A, Caballero A. Neutral genetic diversity as a useful tool for conservation biology. Conserv Genet. 2021;22: 541–545.
View Article
Google Scholar

[315] View Article

[316] Google Scholar

[ref85] 85. Kardos M, Armstrong EE, Fitzpatrick SW, Hauser S, Hedrick PW, Miller JM, et al. The crucial role of genome-wide genetic variation in conservation. Proceedings of the National Academy of Sciences. 2021;118: e2104642118. pmid:34772759
View Article
PubMed/NCBI
Google Scholar

[318] View Article

[319] PubMed/NCBI

[320] Google Scholar

[ref86] 86. Ripple WJ, Wolf C, Newsome TM, Gregg JW, Lenton TM, Palomo I, et al. World Scientists’ Warning of a Climate Emergency 2021. BioScience. 2021;71: 894–898.
View Article
Google Scholar

[322] View Article

[323] Google Scholar

[ref87] 87. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34: i884–i890. pmid:30423086
View Article
PubMed/NCBI
Google Scholar

[325] View Article

[326] PubMed/NCBI

[327] Google Scholar

[ref88] 88. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:13033997 [q-bio]. 2013 [cited 3 Jul 2019]. Available: http://arxiv.org/abs/1303.3997.
View Article
Google Scholar

[329] View Article

[330] Google Scholar

[ref89] 89. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. 2011;43: 491. pmid:21478889
View Article
PubMed/NCBI
Google Scholar

[332] View Article

[333] PubMed/NCBI

[334] Google Scholar

[ref90] 90. Dray S, Dufour A-B. The ade4 Package: Implementing the Duality Diagram for Ecologists. Journal of Statistical Software. 2007;22: 1–20.
View Article
Google Scholar

[336] View Article

[337] Google Scholar

[ref91] 91. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genet. 2007;81: 559–575. pmid:17701901
View Article
PubMed/NCBI
Google Scholar

[339] View Article

[340] PubMed/NCBI

[341] Google Scholar

[ref92] 92. Christensen KA, Rondeau EB, Minkley DR, Sakhrani D, Biagi CA, Flores A-M, et al. The sockeye salmon genome, transcriptome, and analyses identifying population defining regions of the genome. PLOS ONE. 2020;15: e0240935. pmid:33119641
View Article
PubMed/NCBI
Google Scholar

[343] View Article

[344] PubMed/NCBI

[345] Google Scholar

[ref93] 93. Christensen KA, Rondeau EB, Sakhrani D, Biagi CA, Johnson H, Joshi J, et al. The pink salmon genome: Uncovering the genomic consequences of a two-year life cycle. PLoS One. 2021;16: e0255752. pmid:34919547
View Article
PubMed/NCBI
Google Scholar

[347] View Article

[348] PubMed/NCBI

[349] Google Scholar

[ref94] 94. Christensen KA, Leong JS, Sakhrani D, Biagi CA, Minkley DR, Withler RE, et al. Chinook salmon (Oncorhynchus tshawytscha) genome and transcriptome. PLOS ONE. 2018;13: e0195461. pmid:29621340
View Article
PubMed/NCBI
Google Scholar

[351] View Article

[352] PubMed/NCBI

[353] Google Scholar

[ref95] 95. Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014;15: 356. pmid:25420514
View Article
PubMed/NCBI
Google Scholar

[355] View Article

[356] PubMed/NCBI

[357] Google Scholar

[ref96] 96. Goudet J. hierfstat, a package for r to compute and test hierarchical F-statistics. Molecular Ecology Notes. 2005;5: 184–186.
View Article
Google Scholar

[359] View Article

[360] Google Scholar

[ref97] 97. Nei M. Molecular Evolutionary Genetics. Molecular Evolutionary Genetics. Columbia University Press; 2019.
View Article
Google Scholar

[362] View Article

[363] Google Scholar

[ref98] 98. Pante E, Simon-Bouhet B. marmap: A Package for Importing, Plotting and Analyzing Bathymetric and Topographic Data in R. PLOS ONE. 2013;8: e73051. pmid:24019892
View Article
PubMed/NCBI
Google Scholar

[365] View Article

[366] PubMed/NCBI

[367] Google Scholar

[ref99] 99. Zhang C, Dong S-S, Xu J-Y, He W-M, Yang T-L. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35: 1786–1788. pmid:30321304
View Article
PubMed/NCBI
Google Scholar

[369] View Article

[370] PubMed/NCBI

[371] Google Scholar

[ref100] 100. Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag; 2009. Available: https://www.springer.com/gp/book/9780387981413.

[ref101] 101. Gutenkunst RN, Hernandez RD, Williamson SH, Bustamante CD. Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data. PLOS Genetics. 2009;5: e1000695. pmid:19851460
View Article
PubMed/NCBI
Google Scholar

[374] View Article

[375] PubMed/NCBI

[376] Google Scholar

[ref102] 102. Weir BS, Cockerham CC. Estimating F-Statistics for the Analysis of Population Structure. Evolution. 1984;38: 1358–1370. pmid:28563791
View Article
PubMed/NCBI
Google Scholar

[378] View Article

[379] PubMed/NCBI

[380] Google Scholar

[ref103] 103. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27: 2156–2158. pmid:21653522
View Article
PubMed/NCBI
Google Scholar

[382] View Article

[383] PubMed/NCBI

[384] Google Scholar

[ref104] 104. Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35: 526–528. pmid:30016406
View Article
PubMed/NCBI
Google Scholar

[386] View Article

[387] PubMed/NCBI

[388] Google Scholar

[ref105] 105. Revell L. phytools: An R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution. 2012; 3: 217–223.
View Article
Google Scholar

[390] View Article

[391] Google Scholar

[ref106] 106. Jombart T, Balloux F, Dray S. adephylo: new tools for investigating the phylogenetic signal in biological traits. Bioinformatics. 2010;26: 1907–1909. pmid:20525823
View Article
PubMed/NCBI
Google Scholar

[393] View Article

[394] PubMed/NCBI

[395] Google Scholar

[ref107] 107. Delaneau O, Zagury J-F, Robinson MR, Marchini JL, Dermitzakis ET. Accurate, scalable and integrative haplotype estimation. Nat Commun. 2019;10: 5436. pmid:31780650
View Article
PubMed/NCBI
Google Scholar

[397] View Article

[398] PubMed/NCBI

[399] Google Scholar

[ref108] 108. McVean G, Awadalla P, Fearnhead P. A Coalescent-Based Method for Detecting and Estimating Recombination From Gene Sequences. Genetics. 2002;160: 1231–1241. pmid:11901136
View Article
PubMed/NCBI
Google Scholar

[401] View Article

[402] PubMed/NCBI

[403] Google Scholar

[ref109] 109. Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS ONE. 2012;7: e46688. pmid:23056405
View Article
PubMed/NCBI
Google Scholar

[405] View Article

[406] PubMed/NCBI

[407] Google Scholar

[ref110] 110. Eyre-Walker A, Keightley PD. Estimating the Rate of Adaptive Molecular Evolution in the Presence of Slightly Deleterious Mutations and Population Size Change. Molecular Biology and Evolution. 2009;26: 2097–2108. pmid:19535738
View Article
PubMed/NCBI
Google Scholar

[409] View Article

[410] PubMed/NCBI

[411] Google Scholar

[ref111] 111. Guéguen L, Gaillard S, Boussau B, Gouy M, Groussin M, Rochette NC, et al. Bio++: Efficient Extensible Libraries and Tools for Computational Molecular Evolution. Molecular Biology and Evolution. 2013;30: 1745–1750. pmid:23699471
View Article
PubMed/NCBI
Google Scholar

[413] View Article

[414] PubMed/NCBI

[415] Google Scholar

[ref112] 112. Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics. 2009;Chapter 4: Unit 4.10. pmid:19274634
View Article
PubMed/NCBI
Google Scholar

[417] View Article

[418] PubMed/NCBI

[419] Google Scholar

Figures

Abstract

Author summary

Introduction

Results and discussion

Strong postglacial population expansion revealed through whole genome sequences

Range expansion explained variation in the mutation load

The most deleterious mutations are efficiently purged across the range

The fixed load increases with population expansion

Range expansion have reduced the efficacy of selection at range margin

Recombination rates shape the deleterious mutational landscape

Regions of residual tetraploidy revealed drivers of the load in Coho salmon

Conclusion

Methods

Sampling design for Coho salmon

Outgroup dataset.

Ancestral and derived alleles identification

Demographic reconstruction

Population-scaled recombination rate

Genetic load estimation

πN/πS estimates.

Identifying potential deleterious non-synonymous alleles

Correlation between πN/πS and recombination

DFE estimation and rate of adaptation in Coho salmon

Differences in load for region of residual tetraploidy in Coho salmon

πN/πS comparison.

TE annotations

Supporting information

S1 Table. Sampling strategy, including various outgroup species.

S2 Table. Summary Statistics across coho populations, including geographic coordinates, tree branch length, genetic diversity and load statistics.

S3 Table. Outgroup summary statistics by species, with a focus on the load.

S4 Table. Results of linear models.

S5 Table. Results of Mann-Whitney test for difference in load among population based on missense mutation and Loss of Function mutation.

S6 Table. Results of Tukey-HSD test for difference in recombination rate among populations based on Ldhat estimates.

S7 Table. Difference in load among diploid region, compared to region of residual tetraploidy with load computed in 500kb windows for each kind of regions.

S8 Table. AIC of each model for each pairwise comparison.

S9 Table. Parameter estimates for each of the major regional group where ∂a∂i was fitted.

S1 Note. Reconstruction of demographic history from RADseq data.

S2 Note. Relationship between GC3 and recombination.

S3 Note. Comparison of Provean and non-synonymous results.

S1 Fig. PCA and admixture plots among population.

S2 Fig. Summary statistics revealed the demography of Coho salmon.

S3 Fig. Inference of population split and mixture from Treemix.

S4 Fig. SMC++ split time.

S5 Fig. Correlation between πN/πS and πS and πN/πS and Ne from smc++.

S6 Fig.

S7 Fig. Results of linear models testing the effect of tree branch length to the root extracted from a FST-based population phylogeny on different metrics of selection efficacy.

S8 Fig.

S9 Fig. Correlation between the proportion of amino-acid substitution that results from positive selection (α) and the synonymous diversity πS used as a proxy of effective population size.

S10 Fig. Relationship between GC3 and pN/pS for all populations and all outgroups.

S11 Fig. Relationship between recombination (GC3) and demographic factors (distance to the southernmost site).

S12 Fig. Region of residual tetraploidy vs diploid region of the genome display different recombination landscapes.

S13 Fig. Distribution of recombination rate among all populations.

S14 Fig. TE length differs between diploid chromosome versus chromosome displaying residual tetraploidy.

S15 Fig. Provean analysis of potentially deleterious mutation and the resulting load.

Acknowledgments

References

π_N/π_S estimates.

Correlation between π_N/π_S and recombination

π_N/π_S comparison.

S5 Fig. Correlation between π_N/π_S and π_S and π_N/π_S and N_e from smc++.

S7 Fig. Results of linear models testing the effect of tree branch length to the root extracted from a F_ST-based population phylogeny on different metrics of selection efficacy.

S9 Fig. Correlation between the proportion of amino-acid substitution that results from positive selection (α) and the synonymous diversity π_S used as a proxy of effective population size.