Main

There can now be little doubt that natural selection plays an important role in the evolution of postzygotic reproductive isolation. Over the last decade or so, considerable work has shown that positive natural selection plays a part in the evolution of both extrinsic (ecological inviability or behavioral sterility) and intrinsic postzygotic (hybrid inviability or sterility) isolation (reviewed by Coyne and Orr, 2004). Although the former role is perhaps not surprising—it seems intuitively clear that selection shapes divergence in characters of ecological significance—the latter role was less expected. It was not obvious that substitution at the genes causing intrinsic hybrid sterility or inviability is driven by positive natural selection. Nonetheless, molecular population genetic analysis of several genes known to cause hybrid sterility or inviability, for example, OdsH, Nup96, Lhr and Hmr, shows strong evidence of positive Darwinian selection (reviewed by Coyne and Orr, 2004; Orr et al., 2007). It seems clear, then, that many of the substitutions underlying genic (Dobzhansky–Muller) incompatibilities are driven by selection. (This selection was not, of course, selection for postzygotic isolation but for some other character; this evolution pleiotropically gives rise to lowered fitness in hybrids.)

Natural selection might drive the evolution of Dobzhansky–Muller incompatibilities (DMIs) in two ways. First, allopatric populations might adapt to their respective—and different—environments and hybrid sterility or inviability might arise as a pleiotropic side effect of this divergent selection. One population, for example, might adapt to a dry environment and another to a wet. The genes underlying these ecological adaptations might later cause incompatibilities in hybrids formed between the populations. Second, allopatric populations might adapt to the same environment but in different ways genetically. As populations can respond to identical selection pressures in different ways, two populations might still arrive at different genotypes. If so, their hybrids might still suffer incompatible gene combinations. This scenario was first described by Muller (1942).

Here, we point out a potential problem with this identical environment scenario. The problem is that it is harder than it might first seem for two populations to adapt to an identical environment in different ways, thereby allowing the evolution of postzygotic isolation. The reason is that natural selection, although not deterministic, can often cause the substitution of the same mutations in two populations. This can lower dramatically the chance of forming a hybrid incompatibility. This problem does not arise when incompatibilities result from adaptation to different environments.

In this note, we calculate the probability of evolving a DMI when natural selection drives adaptation to an identical environment. We consider the simplest scenario in which allopatric populations evolve by substitution of recurrent new mutations.

The model

We begin by considering two allopatric populations that experience no gene flow. These populations begin with identical genotypes (both are fixed for wild-type alleles at all relevant loci) and experience identical environments. At some point in time this shared environment changes and the populations begin to adapt to the new, but still shared, environment. (Environments need not be strictly identical; formally, our calculations assume only that those aspects of the environment that affect selection coefficients at the loci we follow are the same.) Mutation is recurrent and evolution involves the substitution of new mutations. We assume that all mutations have independent effects on fitness. Following Gillespie (1984, 1991), we consider the strong selection (Ns1) and weak mutation (1) domain. Results involving weak selection and/or strong mutation (for example, 1) might well differ.

Following Nei (1976), we can simplify our analysis by considering haploids. This captures the essence of our problem without requiring us to deal with unnecessary details about the dominance of incompatibilities. In the case of a two-locus DMI, both populations initially have genotype A0B0. In the new environment, mutations A1 and B1 are beneficial. But, when brought together, A1 and B1 cause an in compatibility. Relative fitnesses among genotypes are: where sA, sB, and t are all positive.

Note that, once A1 is fixed in a population, B1 cannot be fixed as this would yield the unfit A1B1 genotype; similarly, once B1 is fixed, A1 cannot be fixed. We assume that rates of mutation to A1 and B1 are equal; back-mutation to A0 and B0 is not allowed.

A DMI arises if one population evolves the A1B0 genotype and the other evolves the A0B1 genotype, allowing formation of unfit A1B1 hybrids if the populations were to come into contact. A DMI cannot occur if both populations evolve the same genotype, either A1B0 or A0B1. In this case, hybrids between the populations would retain a pure-population genotype and would be perfectly fit.

We calculate the ultimate probability that two loci form a DMI, PDM, that is, the probability that populations arrive at incompatible genotypes after an extended (essentially infinite) period of time; in effect, then, we assume that taxa remain allopatric for very long periods of time. Note also that the loci we follow are capable of causing a hybrid incompatibility if they evolve in the ‘right’ way in our populations. PDM represents, therefore, a conditional probability: given that the loci we follow are capable of causing a hybrid incompatibility, what is the probability that they do so?

The neutral case

Though mathematically trivial, it is worth first considering the case in which the evolution is neutral: sA=sB=0. We will see that this scenario represents a limit in the case of evolving a hybrid incompatibility by natural selection. We still assume that t>0, that is, A1B1 is unfit. Populations will ultimately either fix A1 or B1. Under neutrality, there is a one-fourth chance that both populations fix A1 (arriving at A1B0) and a one-fourth chance that both fix B1 (arriving at A0B1). The probability of an incompatibility is one minus the sum of these probabilities, PDM=½.

This logic is extended trivially to more than two populations, each of which evolves independently. The probability of incompatibility, that is, that at least one pair of a allopatric populations arrives at incompatible genotypes, is one minus the probability that all a populations arrive at the same genotype: PDM=1−1/2a−1.

Selection: two loci

Now consider the evolution by natural selection. In the special case in which sA=sB, matters are again simple. Because A1 and A2 are equally likely to fix first in a population, there is again a one-fourth chance both populations evolve to be A1B0 and a one-fourth chance both evolve to be A0B1; the probability that two populations arrive at different genotypes, causing postzygotic isolation, is thus again PDM=½, as noted by Gavrilets (2004). Finding the probability of incompatibility with arbitrary sA and sB is not so straightforward, however.

To find this probability, we take advantage of several results obtained by Gillespie (1984, 1991). Gillespie considered a large population in which beneficial mutations are sufficiently rare (Nu1) to have independent fates (that is, clonal interference does not occur) and in which beneficial mutations have definite fitness advantages (Ns1). Standard population genetic theory shows that a new beneficial mutation has a probability of fixation of Π≈2s, where the approximation assumes that s is fairly small (Haldane, 1927). Because most new mutations are lost accidentally, it takes time before recurrent mutation produces a copy of a beneficial allele that is destined to sweep to fixation. Gillespie showed that the waiting time to the appearance of such a mutation is geometrically distributed with a mean of 1/(2Nus) generations (in diploids, 1/(4Nus)). For a A0B0 population, the mean waiting time to fixation of A1 is 1/(2NusA) generations and the mean waiting time to fixation of B1 is 1/(2NusB) generations. As each of these times is approximately exponentially distributed, the probability that A1 fixes first in a population equals the probability that an exponential random variable with a mean of 1/(2NusA) is smaller than an exponential random variable with a mean of 1/(2NusB). Gillespie shows that this probability is ΠA=sA/(sA+sB). Conversely, the probability that B1 fixes before A1 is ΠB=sB/(sA+sB). Note that these values are independent of population size (so long as it is large).

As ΠA and ΠB are identical in our a=2 populations, the probability that one population becomes A1B0 and the other A0B1, yielding a DMI, is

This probability is maximized when sA=sB and PDM=1/2. This maximum probability of incompatibility is identical to the probability of incompatibility under neutrality.

Equation (1) makes good sense. When sA and sB are equal, there is a good chance that one population will substitute A1 and the other B1, allowing the formation of a hybrid incompatibility. But when sA and sB are very different, both populations are likely to substitute the same allele—the one with the greatest advantage—arriving at the same genotype. This precludes formation of a hybrid incompatibility. When sA=0.05 and sB=0.001, for instance, the probability of incompatibility decreases to only 3.8%. The above logic is extended trivially to the case of a allopatric populations, yielding

Although we do not know the actual values of s for beneficial mutations in nature, a reasonable argument can be made that s is approximately exponentially distributed. Gillespie (1984, 1991) and Orr (2003) pointed out that, given a variety of distributions of fitnesses (for example, normal, gamma, exponential, logistic and so on), extreme value theory shows that the right-hand tail of these distributions declines exponentially. Because wild-type alleles are typically highly fit, this tail corresponds approximately to the distribution of selection coefficients among beneficial mutations (see Gillespie and Orr for details). Although this argument is far from decisive (see Joyce et al., 2008), especially when considering mutations at different loci, an exponential distribution of s at least represents a reasonable guess about biological reality.

We can thus ask: What is the average probability of incompatibility between a pair of populations when sA and sB are drawn from a common exponential distribution? The answer is

where 1/λ is the mean s among beneficial mutations. In words, Equation (2) shows that pairs of populations evolve in different directions—yielding a DMI in hybrids—one-third of the time. Surprisingly, this result is independent of mean selection coefficient, population size, and all other biological parameters. The finding is closely connected to that in Orr (2005) for parallel adaptation, who did not consider reproductive isolation and who took a different approach mathematically.

A similar calculation shows that, given a independently evolving populations, the expected probability of incompatibility is

When a=2, we recover Equation (2). As expected, postzygotic isolation is more likely with more allopatric populations.

Selection: complex incompatibilities

A DMI need not involve only two loci. Instead, genetic analyses of hybrid sterility and inviability have shown that complex incompatibilities, involving three or more loci, are common (Cabot et al., 1994; Orr, 1995). We can extend our calculations to these cases.

It is again worth considering the neutral case. Consider two allopatric populations that experience recurrent mutation at n loci with no back-mutation. Though each mutation is neutral, any individual who carries all n alleles suffers a genic incompatibility. Given enough time, each population will ultimately fix n−1 of the alleles; the last allele cannot fix as this would yield the unfit hybrid genotype. A hybrid incompatibility will thus form unless two populations fix the same genotypes, which occurs with probability 1/n. The probability of incompatibility is therefore PDM=1−1/n. If n=3 loci, PDM=2/3. Given a allopatric populations, we get PDM=1−1/na−1.

When substitutions are driven by natural selection, the n-locus case is messy mathematically. We thus present results for the three-locus (n=3) case only, which captures the biological point we wish to make. Alleles A1, B1 and C1 are beneficial when alone or in pairs; but any individual that carries all three alleles suffers a DMI and has fitness 1−t. We assume that fitnesses are otherwise independent, that is, multiplicative: A1B0C0 has fitness 1+sA, whereas A1B1C0 has fitness (1+sA)(1+sB), and so on.

Populations will ultimately fix two of the three mutations. Which two depends on the magnitudes of sA, sB and sC. For example, the probability, P(A1,B1), that a population ultimately fixes alleles A1 and B1 is the sum of two events: that A1 is fixed first and then B1 or that B1 is fixed first and then A1. Using Gillespie's arguments to calculate the probabilities of these events, straightforward calculations show that

where Πs is the product of the three selection coefficients and Σs is their sum. P(A1,C1) and P(B1,C1) have the same form, with the obvious change of subscripts.

As postzygotic isolation evolves unless both populations evolve the same genotype, the probability of incompatibility is

Table 1 shows that, when selection coefficients among the three alleles are similar, evolution of a hybrid incompatibility is likely. In particular, when sA=sB=sC, the chance of isolation is maximized at PDM=2/3. Again, then, the maximum probability of incompatibility under natural selection equals that under neutrality. But when the selection coefficients are very different, postzygotic isolation becomes much less likely. If, for instance, sA=0.001, sB=0.03 and sC=0.05, PDM decreases to 7.7%.

Table 1 Probabilities of postzygotic isolation as a function of selection coefficients among beneficial mutations

Finally, we can ask about the mean probability of incompatibility when selection coefficients involved in three-locus incompatibilities are drawn from an exponential distribution. Although the integration required appears intractable, numerical work shows that E[PDM]≈3/7. The mean probability of incompatibility thus increases somewhat when DMIs are complex.

Conclusions

Although the mathematics involved in our analysis is simple, the biological implications that emerge seem non-trivial. We have obtained two main results. First, when allopatric populations adapt to identical environments and the loci we follow can potentially form a DMI, the probability that they do so is constrained in a way that does not occur when populations adapt to different environments. The reason is that the evolution of a DMI requires that populations evolve different genotypes. But adaptation to an identical environment is somewhat repeatable: populations will often substitute the same alleles, arriving at the same genotype. (See Wood et al. (2005), who reviewed the evidence for parallel genetic evolution in the quantitative genetic and experimental evolution literature). When populations independently arrive at the same genotype, a DMI cannot arise. Of course, if selection is spatially heterogeneous, that is, populations adapt to different environments, incompatibilities can arise more easily (see also Navarro and Barton, 2003; Kondrashov, 2003; Gavrilets, 2004). Our analysis further shows that, as the number of loci involved in the incompatibility increases, the probability of evolving an incompatibility also increases.

Our second finding is that the probability of postzygotic isolation decreases as selection coefficients among beneficial mutations become less similar. In fact, the maximum probability of isolation occurs when selection coefficients at each locus are equal; in this case, the probability of incompatibility equals that under neutrality. Under both the equal-s and neutral scenarios, evolution randomly chooses alleles at the relevant loci for substitution. Populations thus often arrive at different genotypes, allowing postzygotic isolation. It should be noted, however, that the time-scales for the evolution of postzygotic isolation under the equal-s and neutral scenarios are very different (Gavrilets, 2004): neutral evolution is slow.

Most of our mathematical results have not been noted before and the reason seems clear. Many previous analyses of the Dobzhansky–Muller model (including our own, for example, Orr, 1995) typically focused on the long-term accumulation of alleles causing hybrid incompatibilities and were agnostic about whether substitutions were driven by natural selection or genetic drift. Selection coefficients among beneficial alleles did not, therefore, generally appear in previous theory (for important exceptions, see Nei (1976); Gavrilets (2003, 2004); Gavrilets, in particular, produced theory that considered the role of selection in the buildup of incompatibilities in spatially-structured populations).

In closing, it is important to guard against a possible misinterpretation of our results. We do not suggest that the evolution of postzygotic isolation in identical ecological environments is unlikely. Indeed, a common cause of reproductive isolation by DMIs could be genetic conflict, in which substitutions are driven by arm races among different sites in the genome. Such genetic-conflict-based reproductive isolation can obviously evolve in identical ecological environments. The reason our results do not militate against this form of speciation is subtle. Our model assumes that selection coefficients among beneficial mutations are independent. Thus, if one mutation fixes, selection coefficients at all other mutations remain unchanged. This need not, of course, be true and it not generally true under genetic conflict. With conflict, a mutation might become beneficial only after another has fixed. (For example, an allele that suppresses meiotic drive is favored only after a driver mutation has appeared.). Put differently, genetic conflict does, in fact, involve a kind of adaptation to different environments, but the relevant ‘environments’ are genetic, that is, involve genotypes at other loci.