A computational framework for resolving the microbiome diversity conundrum

Daybog, Itay; Kolodny, Oren

doi:10.1038/s41467-023-42768-4

Article
Open access
Published: 02 December 2023

Nature Communications volume 14, Article number: 7977 (2023)

Abstract

Recent empirical studies offer conflicting findings regarding the relation between host fitness and the composition of its microbiome, a conflict which we term ‘the microbial β-diversity conundrum’. The microbiome is crucial for host wellbeing and survival. Surprisingly, different healthy individuals’ microbiome compositions, even in the same population, often differ dramatically, contrary to the notion that a vital trait should be highly conserved. Moreover, gnotobiotic individuals exhibit highly deleterious phenotypes, supporting the view that the microbiome is paramount to host fitness. However, the introduction of almost arbitrarily selected microbiota into the system often achieves a significant rescue effect of the deleterious phenotypes. This is true even for microbiota from soil or phylogenetically distant host species, highlighting an apparent paradox. We suggest several solutions to the paradox using a computational framework, simulating the population dynamics of hosts and their microbiomes over multiple generations. The answers invoke factors such as host population size, the specific mode of microbial contribution to host fitness, and typical microbiome richness, offering solutions to the conundrum by highlighting scenarios where even when a host’s fitness is determined in full by its microbiome composition, this composition has little effect on the natural selection dynamics of the population.

Introduction

The microbiome, the diverse community of microbial symbionts associated with a host, can immensely influence its host’s wellbeing in numerous direct ways, including providing access to nutrients^1,2, protecting against pathogens^3,4, and inducing resistance to extreme conditions^5,6. Thus, a variety of traits and diseases have been linked to alterations in microbiome composition and its interaction with the host^7,8, also prompting the issue of its part in ecological and evolutionary processes^9,10. These have inspired extensive research, particularly in humans, to uncover the mechanisms that govern the host-microbiome interaction. Such studies have been able to link the composition of an individual’s microbiome to numerous factors, ranging from the host’s health and physiology^11,12,13, through its behavior^14,15, to its aging dynamics^13,16.

At the same time, studies have also found that the compositions of different individuals’ microbiomes, even when considering only healthy individuals in the same population, often differ dramatically^17,18,19. Such observations may be baffling and seem to contrast with the mass of evidence linking a host’s fitness to the composition of its microbiome. The apparent contradiction stems from the fact that a trait that greatly benefits an individual is expected to spread and fix in the population through selective dynamics, and to be highly conserved^20,21,22.

This expectation, to find reduced diversity in traits of importance that is driven by selection, is supported by observations across multiple systems and biological modalities: it is widely observed in protein structure²³ and is used to infer aspects of functional importance^24,25; in genetics, the extent to which a genetic sequence is conserved is commonly used as a measure of its importance for its bearer’s fitness^26,27,28. Reduced trait diversity in traits of functional importance is also seen outside of biology, as in²⁹, where the diversity of functional and symbolic design features of Polynesian canoes was quantified. The existence of standing variation in fitness-influencing traits thus calls for explanation; for example, standing polymorphism in functional traits is often explained as a result of balancing selection³⁰, frequency-dependent selection³¹, character displacement³², niche heterogeneity³³, or adaptive trade-offs³⁴.

Driven by the observed variation in the microbiome composition of individuals within the same population, across populations, and among host species, studies have examined the specificity of the relationship between microbes and their hosts. Several such studies focused on gnotobiotic mice and zebrafish, which present highly deleterious phenotypes. It has been shown that introduction of arbitrarily selected microbiota, even from soil or from phylogenetically distant host species, often leads to the successful establishment of microbiota and to significant rescue effects of host phenotypes^35,36,37,38. These findings deepen the conundrum, as they showcase the importance of the microbiome, yet at the same time present the magnitudes of variation it can withstand while still bestowing a fit phenotype. These seemingly conflicting findings pose what we term the microbial $\beta$-diversity conundrum.

Although the depth of the conundrum is not frequently appreciated, the surprising diversity of microbiome compositions among healthy individuals in the same population has not gone unnoticed. The most commonly invoked explanation to this puzzle is the possibility that thanks to functional redundancy among microbial taxa, even microbiomes that are taxonomically divergent from one another can have similar functional profiles^17,39,40. Thus, for example, different microbes may be able to break down a particular complex carbohydrate, and different host individuals can receive this “service” from different microbial species. Although plausible, the empirical support for this claim has been heavily criticized^41,42 and extensive, previously unappreciated, functional diversity among individuals’ microbiomes has been reported^41,42,43,44.

The paradox can usefully be considered from an eco-evolutionary perspective: on the one hand, the microbiome is able to influence fitness significantly, suggesting that natural selection would shape its composition and that the microbiome would play a role in selection on its host, while on the other hand, it shows a great deal of variation within species, perhaps testifying against its relevance in selection-dictating dynamics and evolutionary shaping of host populations.

Several computational models have been proposed to investigate the evolution of hosts and their microbiome. Most have focused on short-term dynamics, very specific selection-inducing scenarios, or on the population dynamics of the microbiome itself^45,46,47. For example, one computational approach has suggested the possibility that selection against toxic stress can drive the co-evolution of adaptive capabilities of the host microbiome⁴⁸. Another model-based study has suggested that to preserve the existence of symbionts that benefit their host at a cost to themselves, very strict conditions must be met⁴⁹. The proposed conditions include fast host reproduction and strong vertical transmission of the microbiome, assertions that have recently been challenged⁵⁰.

Despite the rapid developments in the study of the microbiome and the understanding of its importance, research has focused primarily on its influence over short periods on hosts’ wellbeing or the evolutionary dynamics of the microbiome alone. Understanding of the dynamics between the microbiome and its associated hosts is still incomplete, and an intuitive and broad theoretical framework that will allow explicit hypothesis testing regarding these dynamics over many generations is still lacking. We attempt to bridge this gap by proposing a framework that implements considerations from the field of microbiome research, alongside perspectives and approaches typically found in studies of population dynamics and evolutionary biology. The framework is designed in a modular and general way, to allow exploration of a broad range of questions on different timescales. Furthermore, this framework can serve as a null model, as it assumes neutral dynamics at the microbial level, in the ecological sense of neutral models^51,52. In particular, it assumes no niche specialization among the modeled microbes, aiming to adopt the most parsimonious approach and to offer explanations to observed phenomena that rely on as few assumptions as possible.

In this study we use our framework to propose several possible solutions to what we dubbed the microbial $\beta$-diversity conundrum, highlighting scenarios in which despite a major influence of the microbiome on each individual’s fitness, the role of the microbiome composition on the hosts’ population dynamics may vary from great to none. These, in turn, offer testable predictions regarding the conditions in which the microbiome composition is expected to be conserved or divergent among individuals.

Results

The model

Our agent-based framework tracks a population of host individuals and their corresponding microbiomes over time. It shares many commonalities with the frameworks that have been put forth by Zeng et al.^45,46; several differences are discussed in the Methods section. The basic mechanisms of the simulations follow a Wright-Fisher model using discrete non-overlapping generations^53,54. For simplicity, it is assumed that all hosts reproduce asexually, avoiding the need to explicitly simulate the pairing of individuals for mating. The model focuses on utilizing ecological and evolutionary principles of microbiome transmission, assembly, and contribution to host fitness while setting aside other factors that can be further explored in the future.

We simulate the host as a passive microbe recipient, receiving microbes randomly based on their abundance in the microbial pool available to it. Furthermore, the microbiome assembly is dictated by the arrival order of different microbial taxa, a logistic growth function, and is limited only by the carrying capacity, eliminating effects of any other intra-microbial dynamics, i.e. it is a neutral model in an ecological sense^52,55. Lastly, after the assembly is complete, there are no further changes in the microbiome configuration of an individual host. Thus, our framework is highly simplified and general, not bound by the explicit mechanisms of any specific host, microbe, or eco-system. This allows it to serve as a tool for understanding underlying patterns that may be difficult to notice or tease apart from other processes in more complex systems, and as a null framework for fundamental host-microbiome dynamics. The framework was designed in a modular way, such that future explorations that apply it may introduce more specific assumptions along any of these dimensions.

The framework can simulate a wide range of eco-evolutionary scenarios. However, the main forward-in-time process remains similar. The host population size $N$ is constant and generations are non-overlapping. To assemble a new generation, $N$ new agents are defined, and a parent is selected from the preceding generation for each one. Next, this individual’s microbiome composition is assembled using a microbial pool constructed according to its parental microbiome and the population-wide microbiome. Finally, the hosts’ fitnesses are calculated based on the composition of their newly acquired microbiome, thus allowing the generation cycle to continue.

The selection of a parent for each offspring is done by randomly choosing one host from the previous generation, where high-fitness hosts have a proportionally larger probability to be chosen. Each choice is independent, enabling some hosts to produce multiple offspring while maintaining a constant population size (Fig. 1). In a neutral selection scenario, where all host fitnesses are constantly identical, all hosts have an equal probability to be chosen as parents. Thus, in a population of size $N$, random drift will drive a coalescent process of host lineages such that on average all host lineages will coalesce to a single common ancestor after $2N$ generations, following a neutral Wright-Fisher process⁵³.
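The parent-selection step described above can be sketched as fitness-proportional sampling with replacement. This is a minimal illustration, not the authors' implementation; the function name `choose_parents` and the example fitness values are assumptions made here.

```python
import random

def choose_parents(fitnesses, rng=random):
    """Draw one parent index per offspring, with probability proportional to fitness.

    Draws are independent and with replacement, so a host may be chosen
    several times (or never) while the population size stays constant.
    """
    n = len(fitnesses)
    return rng.choices(range(n), weights=fitnesses, k=n)

# Hypothetical example: host 2 is three times as fit as the others.
rng = random.Random(0)
fitnesses = [1.0, 1.0, 3.0, 1.0]
parents = choose_parents(fitnesses, rng)

# Neutral scenario: identical fitnesses give every host the same chance.
neutral_parents = choose_parents([1.0] * 5, random.Random(1))
```

Passing identical weights reduces the draw to uniform sampling, which corresponds to the neutral Wright-Fisher case mentioned in the text.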

**Fig. 1: An illustration showing the inter-generational reproduction scheme.**

A host acquires microbes from two main sources. The first is the parental microbiome, contributing to the total microbe pool available to the offspring according to a ‘vertical transmission coefficient’ $T_v$. The microbiome composition of the parent is normalized to represent the relative abundance of the different microbial species, and then multiplied by $T_v$ to represent its relative contribution to the pool from which the offspring will sample microbes. Thus, if the abundance of a specific microbe species in the parent is $x$, it will contribute $T_v \cdot \frac{x}{\text{all microbes in the parent host}}$ of that species to the microbiome sampling pool available to its offspring. The second source is the population-wide microbiome. This oblique microbiome transmission from non-parental individuals is denoted by the transmission coefficient $T_h$⁵⁶. For simplicity, we refer to this type of transmission as horizontal transmission, in contrast to the parental vertical transmission. Similarly, if the abundance of a microbe taxon within the entire parent population is $y$, the environment will contribute $T_h \cdot \frac{y}{\text{all microbes in the parent population}}$ of that specific taxon to the pool. The ratio between $T_v$ and $T_h$ dictates the transmission scenario being simulated (Fig. 2). For simplicity and increased tractability, to uncover patterns in the microbiome’s effect on its host’s selection dynamics, we mainly focus in this manuscript on extreme transmission scenarios, where either $T_h=0$ or $T_v=0$, corresponding to purely vertical or purely horizontal transmission, respectively. Other transmission schemes are feasible: when $\frac{T_v}{T_h} > 1$, the transmission is mostly vertical, allowing higher conservation of microbiome-related traits between a parent and its offspring, whereas when $\frac{T_v}{T_h} < 1$ the transmission is mostly horizontal, which lowers the correlation between parent and offspring microbiome compositions.
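As a rough sketch of how the two sources could be combined, the snippet below weights the parent's relative abundances by the vertical coefficient and the population's relative abundances by the horizontal coefficient. The dictionary layout, the function names, and the example counts are illustrative assumptions, not taken from the authors' code.

```python
def relative_abundance(counts):
    """Normalize a taxon -> count mapping so that the values sum to 1."""
    total = sum(counts.values())
    if total <= 0:
        return {}
    return {taxon: count / total for taxon, count in counts.items()}

def build_pool(parent_counts, population_counts, t_v, t_h):
    """Return taxon -> weight in the offspring's microbial sampling pool.

    t_v scales the parent's relative abundances (vertical transmission);
    t_h scales the population-wide relative abundances (horizontal transmission).
    """
    pool = {}
    for taxon, share in relative_abundance(parent_counts).items():
        pool[taxon] = pool.get(taxon, 0.0) + t_v * share
    for taxon, share in relative_abundance(population_counts).items():
        pool[taxon] = pool.get(taxon, 0.0) + t_h * share
    return pool

# Hypothetical counts for one parent and for the whole parent population.
parent = {"A": 30, "B": 10}
population = {"A": 100, "B": 100, "C": 200}

vertical_pool = build_pool(parent, population, t_v=1.0, t_h=0.0)
midway_pool = build_pool(parent, population, t_v=0.5, t_h=0.5)
```

With `t_h=0.0`, taxon `"C"`, which the parent lacks, receives zero weight and cannot reach the offspring; under the midway setting it becomes available through the population-wide source.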

**Fig. 2: A high-level display of offspring microbiome acquisition from the two available sources.**

The microbiome assembly process is performed as the available microbe species start inhabiting the host, where more abundant species in the pool of candidate microbes are more likely to be the first to establish within the host (Fig. 2). Between establishment events, all previously acquired microbe populations grow logistically, limited by a predefined maximal microbial species’ population size. The host’s microbiome takes shape as more species join and grow in number until the overall carrying capacity of microbes in the host is reached.
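One way to picture this assembly process is the loop below: a weighted draw decides which taxon arrives next, and every resident population then takes one discrete logistic growth step. The growth rate, the founding size of one cell, the step-based timing, and the `max_steps` guard are all assumptions introduced for illustration; the text does not specify them.

```python
import random

def assemble_microbiome(pool, carrying_capacity, species_cap,
                        growth_rate=0.5, rng=random, max_steps=10_000):
    """Sketch of neutral assembly: weighted arrivals plus logistic growth.

    pool: taxon -> sampling weight (for example, from the transmission step).
    carrying_capacity: total number of microbes the host can hold.
    species_cap: maximal population size of any single species.
    """
    taxa = [taxon for taxon, weight in pool.items() if weight > 0]
    if not taxa:
        return {}
    weights = [pool[taxon] for taxon in taxa]
    microbiome = {}
    for _ in range(max_steps):  # guard against pools that can never fill the host
        if sum(microbiome.values()) >= carrying_capacity:
            break
        # Establishment event: abundant taxa in the pool are more likely to arrive.
        arrival = rng.choices(taxa, weights=weights, k=1)[0]
        microbiome.setdefault(arrival, 1.0)
        # Between establishment events, each resident population grows logistically.
        microbiome = {
            taxon: n + growth_rate * n * (1.0 - n / species_cap)
            for taxon, n in microbiome.items()
        }
    return microbiome

example = assemble_microbiome({"A": 0.6, "B": 0.3, "C": 0.1},
                              carrying_capacity=100, species_cap=60,
                              rng=random.Random(1))
```

Because early arrivals have more growth steps before the host fills up, arrival order shapes the final abundances even though no taxon has an intrinsic advantage, which is the sense in which the sketch is neutral.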

A host’s fitness score is calculated by summing the individual contribution of each microbe taxon it possesses. For simplicity, the calculation in the simulations we carried out was done based on the presence/absence of each microbial species, regardless of its abundance. This may occur, for example, if the helper microbiome supplies a vital nutrient, otherwise inaccessible to the host, required in small amounts⁵⁷. Each microbial species contributes a certain value to a host’s fitness score. At the start of each simulation, each microbial species is randomly assigned a fitness value that this species’ presence will contribute to the host, drawn from a distribution (Fig. 3). Several contribution distributions are plausible, including a step distribution, where some taxa contribute much while others contribute little (Fig. S1a), a low-variance distribution, determining that the contributions of different taxa are similar, and an almost uniform distribution, leading to a broad range of different contributions by different taxa (Fig. S1b).
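The presence/absence fitness rule can be written in a few lines. The taxon names and contribution values below are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def host_fitness(microbiome, contributions):
    """Sum the contributions of all taxa present in the host, ignoring abundance."""
    return sum(contributions[taxon]
               for taxon, abundance in microbiome.items()
               if abundance > 0)

# Hypothetical per-taxon contributions, fixed once at the start of a simulation.
contributions = {"A": 0.5, "B": 0.25, "C": 0.125}

# A is abundant, B is rare, C is absent: A and B count equally as "present".
host = {"A": 1200, "B": 3, "C": 0}
fitness = host_fitness(host, contributions)  # 0.5 + 0.25
```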

**Fig. 3: An example of the fitness calculation of a single host.**

The simulation continues until the number of common ancestors of the hosts in the population, denoted as $\mathrm{AC}$ for ‘ancestral coalescence’, reaches a predetermined value. For example, to follow the simulation until all the hosts in the population share the same common ancestor, the simulation is run until $\mathrm{AC}=1$.
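The stopping rule can be illustrated with a neutral run in which each host carries the label of its generation-0 ancestor and $\mathrm{AC}$ is read off as the number of distinct labels still present. This bookkeeping scheme and the function name are assumptions for illustration; the authors' implementation may track ancestry differently.

```python
import random

def generations_to_coalescence(n_hosts, target_ac=1, rng=random):
    """Neutral Wright-Fisher run; return generations until AC <= target_ac."""
    ancestors = list(range(n_hosts))  # each founder starts as its own lineage
    generation = 0
    while len(set(ancestors)) > target_ac:
        # Neutral case: every host is equally likely to be chosen as a parent,
        # and each offspring inherits its parent's founder label.
        ancestors = [ancestors[rng.randrange(n_hosts)] for _ in range(n_hosts)]
        generation += 1
    return generation

rng = random.Random(42)
times = [generations_to_coalescence(20, rng=rng) for _ in range(200)]
mean_time = sum(times) / len(times)
```

For a population of 20 hosts, the mean over many repeats should fall in the vicinity of the $2N$ neutral expectation quoted earlier. Replacing the uniform draw with fitness-weighted sampling would give the selective case against which the neutral baseline is compared.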

We used the framework to execute simulations under various combinations of parameters. These include different microbial assembly and transmission factors, varying host population sizes, and a few other ecological components of the model. With these we were able to detect and explore scenarios that may hold the solution to the conundrum highlighted above, highlighting situations in which the microbiome determines host fitness while remaining relatively non-conserved.

Running and interpreting simulations

The $\beta$-diversity conundrum arises from empirically supported and seemingly contradicting observations: on the one hand the microbiome is crucial for host fitness, while on the other hand it can differ vastly even among healthy individuals in the same population. The contradiction stems from the latter being an unexpected characteristic for a fitness-determining trait, as such traits are typically highly conserved. We seek solutions to the conundrum in the form of scenarios in which both observations are true and their co-existence is interpretable. For this, we consider the most conservative scenarios with respect to the first of the two conditions we are after. Firstly, in our simulations the host fitness is solely a function of microbiome composition, epitomizing the dependence of the fitness of hosts on their microbial symbionts. Secondly, microbiome diversity among individuals would arise when selection cannot effectively act to favor one composition over another via selection on hosts. An obvious setting in which this would occur is when microbiome composition is not heritable. This trivial case of purely horizontal transmission of microbes is explored in the supplementary material. We focus instead on the setting at the other end of the spectrum: the case in which transmission is purely vertical. Solving the conundrum is thus reduced to detecting in our framework, under these most conservative conditions, fundamentally different scenarios in which selection on microbiome composition is ineffective.

Measures of microbiome influence on natural selection

We used two measures to evaluate the microbiome’s effect on selection dynamics in the host population. The first is the observed difference between the fitness scores of hosts with different microbiome compositions, expressed in the distribution of host fitnesses in a single generation. This acts as a direct intragenerational approximation of how the microbiome influences the differential of selection among individuals and is not directly influenced by different microbial transmission schemes. The second measure is the number of generations that pass until $\mathrm{AC}$ reaches a predefined value. Taken from the field of population genetics, this measure quantifies the long-term effects of the microbiome on the selection dynamics in the host population over many generations.

Applying the two measures, we characterized conditions under which the microbiome can substantially influence host selection, alongside key scenarios where different microbiome compositions have little influence or none, even though the microbiome in our framework is the sole contributor to the fitness of its host.

Species-rich microbial configurations facilitate high diversity among hosts of microbiome compositions that all lead to similar fitness values

We first examined the possible effects of the microbiome’s $\alpha$-diversity, microbiome diversity within individuals, on the selection-related dynamics of the host population, by simulating two microbiome structures. The first structure represents a species-rich microbiome composition, both in the number of species and their abundance, corresponding to the one that is characteristic of many vertebrate host species: each individual carries a microbiome composed of between 200 and 300 microbial species, ranging in relative abundance from highly prevalent to rare, with few particularly common taxa accounting for the majority of the overall microbial biomass of that individual (see, e.g.^17,58,59,60) (Fig. S2). The second structure is the complement of the latter, exhibiting a species-poor composition with a lower number of species and a lower carrying capacity. This structure simulates the microbial composition found for example in many insects, where typically one or two microbial species dominate and only a few others are sparsely present^61,62 (Fig. S2b). In the supplementary we explore several additional compositions (Fig. S3 and respective results in Figs. S8–S9).

We ran 100 repeats of the simulation with each of the two host population types—the species-rich and species-poor microbiome configurations under vertical microbial transmission, where each microbe species contributes differently to its host fitness. The results display the empirical distribution of host fitness scores observed in the first generation (Fig. 4a). The fitness scores of the hosts in the species-poor population are widely distributed and show great variance (Fig. 4b), underlining that some hosts have gained a very high relative fitness score by acquiring the most beneficial microbes, whereas others did not.

In contrast, we find that the host fitness scores in the species-rich population present lower variance (p < 0.001) (Fig. 4b). This results in the least fit host being almost equivalent to the fittest one, stripping the microbiome of the ability to influence host selection. Notably, both in the species-rich and in the species-poor microbiome configuration scenarios the $\beta$-diversity within the population at the beginning of each simulation is high, with mean pairwise Jaccard distances between individuals of $0.82$ and $1$ respectively. In these simulations, $\beta$-diversity changes over time; only in the species-poor scenario do fitness differences among hosts drive selection, leading the lineage with the beneficial microbiome to spread and to a correspondingly rapid decrease in the population’s mean $\beta$-diversity (see also supplementary figures S12-S13). Combining these findings, we conclude that a species-rich microbiome configuration may act as the first key to the microbial $\beta$-diversity conundrum. It presents a state where the microbiome affects fitness and is diverse between hosts, yet it does not create a noticeable fitness difference, giving rise to neutral population dynamics.
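For readers unfamiliar with the $\beta$-diversity measure quoted above, the mean pairwise Jaccard distance can be computed as in the sketch below. It assumes the standard presence/absence definition of the Jaccard distance; the host compositions are made-up examples.

```python
from itertools import combinations

def jaccard_distance(a, b):
    """Return 1 - |A ∩ B| / |A ∪ B| for two sets of taxa (0 if both are empty)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)

def mean_pairwise_jaccard(hosts):
    """Average the Jaccard distance over all unordered pairs of hosts."""
    pairs = list(combinations(hosts, 2))
    if not pairs:
        raise ValueError("need at least two hosts")
    return sum(jaccard_distance(a, b) for a, b in pairs) / len(pairs)

# Hypothetical taxon sets for three hosts.
hosts = [{"A", "B"}, {"B", "C"}, {"D"}]
beta_diversity = mean_pairwise_jaccard(hosts)
```

A value of 1 means that no two hosts share any taxon, and a value of 0 means that all hosts carry identical taxon sets.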

**Fig. 4: The influence of the microbiome’s species-richness on first-generation fitness scores across 100 repetitions.**

These results can be attributed to the law of large numbers, which formulates the tendency of large sample sizes to approximate well the mean of a hidden distribution⁶³. We simulate the microbial species’ different contributions to host fitness such that they can be viewed as discretely drawn random variables from some background distribution. Thus, when summing the fitness contributions of species in a microbiome configuration, we expect to get an approximation of the mean of the background distribution multiplied by the number of species in that microbiome. According to the law of large numbers, when the microbiome is species-rich the approximation of the multiplied mean in each host will be much better than when the microbiome is species-poor. This leads to the low variance in fitness scores observed between hosts in populations with species-rich microbiome configurations, creating a situation in which the microbiome is unlikely to generate a large enough fitness difference between hosts to significantly influence their selection dynamics.
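The argument can be made explicit under an idealization that is not stated in this form in the text: suppose a host carries $n$ species whose contributions $c_1,\dots,c_n$ are independent draws from a background distribution with mean $\mu > 0$ and variance $\sigma^2$. The host fitness $F$ then satisfies

$$
F=\sum_{i=1}^{n} c_i,\qquad \mathbb{E}[F]=n\mu,\qquad \mathrm{Var}(F)=n\sigma^{2},\qquad \frac{\sqrt{\mathrm{Var}(F)}}{\mathbb{E}[F]}=\frac{\sigma}{\mu\sqrt{n}} .
$$

The absolute spread of $F$ grows like $\sqrt{n}$, but the spread relative to the mean shrinks like $1/\sqrt{n}$. Since parents are chosen with probability proportional to fitness, it is the relative spread that matters for selection. Under these assumptions, a 64-fold increase in the number of species reduces the relative fitness spread among hosts by a factor of 8.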

This scenario, in which very different microbiomes lead to similar overall contribution to host fitness, shares features with the commonly invoked solution to the $\beta$-diversity conundrum, which was mentioned earlier: that different microbiome compositions, thanks to functional redundancy among different species, share functional similarities that allow them to provide the same “services” to the host^17,40,64,65,66. In ecological terms, this explanation relies on a niche-based perspective, assuming that functional niches exist in the gut and may be filled by a range of microbial species. The solution proposed here based on our simulations does not contradict this niche-based explanation, and in fact aligns well with it. However, it relies on fewer assumptions. As our framework is neutral (in the ecological sense, with respect to microbial dynamics and functions), it is more parsimonious and sidesteps the criticism that has been levelled at the functional redundancy hypothesis (e.g.,^41,42).

To validate our hypothesis on a longer time scale, we address our second measure, the number of generations to ancestral coalescence, under three microbial transmission schemes: pure vertical, pure horizontal, and an equal combination of the two, dubbed ‘midway’ transmission. We compare the number of generations until coalescence in each population to this measure in a neutral scenario where the coalescent dynamics are only the product of random drift, and the microbiome is irrelevant to the hosts’ fitness. Under vertical transmission, the simulation results show that the time it takes for the populations with the species-rich microbiome to coalesce is almost identical to the neutral scenario (p = 0.99), while the populations with the species-poor microbiome composition coalesce to a single lineage in half that time on average (p < 0.001) (Fig. 5a). This indeed matches our results using the first measure, further implying that the microbiome is not affecting the hosts’ natural selection when it is highly $\alpha$-diverse, and vice versa.

**Fig. 5: Influence of the microbiomes’ species richness on the number of generations it took for all existing hosts in the population to share a common ancestor.**

When addressing pure horizontal transmission dynamics, the coalescence times of both the species-rich and species-poor microbiome configurations behave similarly to that of the neutral dynamics (Fig. 5c). This is the expected outcome, as when the microbial configuration is not at all linked to the ancestry of the individual host, it will not be able to benefit specific lineages, leaving the assembly of the microbial configurations in each generation to neutral processes. Under the midway microbial transmission scheme (Fig. 5b), the species-rich configuration remains indistinguishable from neutral dynamics (p = 0.81), but the coalescence times of the species-poor populations are faster relative to it and to the neutral dynamics (p < 0.05), yet not as fast as in purely vertical transmission. This is consistent with the nature of the “midway” transmission as a mixture of both vertical and horizontal transmission: the vertical transmission enables host lineages to acquire unique microbial compositions that bestow different fitness scores, thus driving the system to faster coalescence, while the horizontal transmission somewhat breaks this exclusivity, delaying coalescence (Fig. 5c).

The results in the species-rich populations showcase a feasible scenario where both sides of the conundrum are true simultaneously: the microbiome is the sole contributor to the fitness of its host, yet it is not able to influence the selection dynamics in the host population. This observation, under the most extreme case of purely vertical microbial transmission, highlights the richness of the microbiome compositions as a possible solution to the microbial $\beta$-diversity conundrum.

Large differences in the contributions of microbial species to fitness lead to effective selection among hosts

We saw that species-rich microbiomes are less likely to drive selection. Is microbiome composition therefore irrelevant to selection in species that typically have high $\alpha$-diversity microbiomes? We searched for conditions that would invalidate the assumptions of the law of large numbers. This would require that although many species are present in the microbiome, the sum of their contributions still would not approximate the multiplied mean of the background contributions distribution well. Thus, we tested the influence of altering the distribution itself, i.e., the contributions of microbe species to the fitness of their hosts. We examined three background distributions of contributions-to-fitness among the different microbe species. The first is a step distribution, where each microbe species contributes either the maximal or minimal value possible, a trait set only once at the beginning of the simulation by random sampling with the probabilities $0.025$ and $0.975$ respectively (Fig. S4a). This effectively leads to 2.5% of the microbial taxa contributing greatly to host fitness, while the rest contribute very little. The second is an almost uniform distribution where each contribution value between the minimal and the maximal values is equally represented, as was used in the previous section (Fig. S4b). The third is a midpoint between the two previously described schemes, in which most species contribute little while a few species contribute greatly (Fig. S4c).
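The step and near-uniform schemes could be sampled as in the sketch below. The minimal and maximal values (`low`, `high`) and the function name are placeholders chosen here. The third, intermediate scheme is left out because its functional form is not given in this section.

```python
import random

def draw_contributions(n_taxa, scheme, low=0.0, high=1.0, rng=random):
    """Assign each taxon a fixed fitness contribution, once per simulation."""
    if scheme == "step":
        # Each taxon gets the maximal value with probability 0.025,
        # and the minimal value otherwise.
        return [high if rng.random() < 0.025 else low for _ in range(n_taxa)]
    if scheme == "uniform":
        # Every value between the minimum and maximum is equally likely.
        return [rng.uniform(low, high) for _ in range(n_taxa)]
    raise ValueError(f"unknown scheme: {scheme}")

rng = random.Random(7)
step_values = draw_contributions(10_000, "step", rng=rng)
uniform_values = draw_contributions(10_000, "uniform", rng=rng)
high_fraction = sum(v == 1.0 for v in step_values) / len(step_values)
```

In the step scheme nearly all of the fitness signal sits in a small minority of taxa, so whether a host happens to carry a few of them dominates its score.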

We ran the simulation under the three scenarios on populations of hosts with the species-rich microbiome configuration. As before, we first look at the distribution of fitness scores in the first generation, to understand the microbiome’s predisposition to influence host selection under each scenario. Indeed, we see that altering the scheme by which the microbiome contributes to its host’s fitness has an impact on the distribution of fitness scores (Fig. 6a). When comparing the three scenarios, the observed trend is of an increase in fitness scores’ variance as the microbiome contribution distribution is less uniform (p < 0.001) (Fig. 6b).

**Fig. 6: Influence of different distributions of microbes’ contributions on host fitness.**

These findings correspond to the law of large numbers: when the distribution’s variance is larger, a greater sample size is required to approximate its mean^63,67. Thus, when the contribution of each microbial species to its host’s fitness is drawn from a high-variance distribution, even in species-rich microbiomes the number of species may still be small enough such that different microbiome compositions lead to significantly different fitness scores.
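A back-of-the-envelope calculation illustrates the point for the step distribution. Assume, purely for illustration, a host carrying $n = 250$ species (a value inside the 200 to 300 range of the species-rich configuration), each of which is independently a high contributor with probability $p = 0.025$. The number $K$ of high-contributing species in that host is then binomial:

$$
K \sim \mathrm{Binomial}(n, p),\qquad \mathbb{E}[K] = np = 6.25,\qquad \mathrm{SD}(K) = \sqrt{np(1-p)} = \sqrt{6.09375} \approx 2.47 .
$$

The standard deviation is about 40% of the mean, so two hosts can easily differ by several high-contributing species out of roughly six. Even a species-rich microbiome therefore behaves like a small sample with respect to the taxa that matter. The independence assumption is an idealization, since hosts in the simulations draw from a finite pool of taxa with fixed contributions.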

The long-term effect of the different microbial contribution distributions, as seen in the time until coalescence under the vertical transmission scenario, also supports this hypothesis. We indeed see that under the uniform-distribution scenario the times to coalescence are quite similar to the ones under neutral dynamics (p = 0.99), meaning the microbiome did not have a significant influence on population dynamics of the hosts (Fig. 7). In contrast, when the variance in contributions is large, the time to coalescence shortens by half (p < 0.001), indicating that the microbiome did take part in driving the host selection processes by allowing the more fit host lineages to take over the population within a smaller number of generations.

**Fig. 7: Influence of different distributions of microbes’ contributions to host fitness.**

Under purely horizontal transmission dynamics, we see that the time to lineage coalescence remains similar to that of the neutral dynamics under all three distributions of microbial contribution (p > 0.3) (Fig. S6a). This is reasonable, as neutral selection dynamics are expected when the fitness of hosts is not strongly linked to their ancestry and when the fitness is determined by many components, which reduces the effect of small fluctuations in the microbiome’s composition. The results are similar under a midway microbial transmission scheme (Fig. S6b).

The results arising from these simulations, especially under purely vertical transmission, show that in host populations with species-rich microbial compositions a possible solution to the conundrum could lie in the particular fashion in which the microbes contribute to the fitness of their hosts. For example, if the distribution of the microbial contributions to the fitness of the hosts is uniform, then the microbiome does not affect the selection dynamics of its hosts despite being the sole determiner of their fitness.

Microbiome is more likely to drive selection in large host populations

Another factor known, both theoretically and empirically, to play a prominent role in a population’s dynamics and its selection dynamics is the population’s size^{68,69,70,71,72}. We thus set out to test whether the size of the host population can play a part in the microbiome’s ability to drive natural selection dynamics among its hosts. To do so, we simulated host populations varying only in the order of magnitude of the number of hosts that comprise them: 20, 200, and 2000 hosts. The hosts’ microbiomes were simulated using the species-rich configuration, and the distribution of microbes’ contribution to their host’s fitness followed the almost uniform scenario (see Fig. S10 for a complementary exploration of the population size effect in hosts with species-poor microbiomes).

We begin by looking at our short-term indicator for the microbiome’s ability to influence host selection, the distribution of fitness scores in the first generation (Fig. 8a). We see that unlike the factors that we tested previously, the population’s size does not lead to large differences in the distribution or variance of the hosts’ fitness scores (Fig. 8b). In other words, the differential of selection among host lineages is small in this case and is not affected by population size.

**Fig. 8: Influence of host population size on first-generation fitness scores.**

Perhaps surprisingly, however, when we examine the multigenerational effects of the population size on the microbiome’s influence on selection, we find that population size alters the extent to which the microbiome influences population dynamics. To compare coalescence times in populations of different sizes under purely vertical transmission dynamics, we normalize the generation of coalescence by the population size, N (Fig. 9). We see that the larger the population, the significantly shorter the relative number of generations needed to reach a state where all hosts share a common ancestor (p < 0.001). This suggests that although the single-generation indicator does not show a different differential-of-selection among lineages for different N values, the microbiome is more capable of driving selection in larger host populations. This is due to the increased efficacy of selection in large populations, and the increased likelihood that in a large population even small fitness differences will be realized, as is well known in models of genetic evolution^{69,70,71,73,74}. This contrasts with the dynamics in small populations, in which random drift is a relatively more prominent force^{69,70,71,73}, and in which effective selection due to microbiome-mediated fitness effects seems less likely. As expected, under horizontal and midway microbial transmission schemes, the size of the host population does not significantly affect the time to lineage coalescence (Fig. S7). We thus find that, in line with classic population genetics theory, in smaller host populations the microbiome is limited in the extent to which it can influence the selection dynamics of its hosts. This highlights the population size as another solution to the seemingly conflicting aspects of the microbial $\beta$-diversity conundrum.

**Fig. 9: Influence of host population size on the number of generations it took for all existing hosts in the population to share a common ancestor, divided by the population size.**

Discussion

In this paper we highlight an overlooked conflict in empirical findings from microbiome research, ‘the microbial $\beta$-diversity conundrum’, and attempt to reconcile it by introducing a simple and modular framework capable of simulating the evolutionary and ecological dynamics of a host population and their associated microbiomes. We propose different answers to the paradox by simulating various scenarios, including different assembly and contribution dynamics of the microbiome, different microbial transmission schemes, and different host population-related parameters. Our method of suggesting solutions to the puzzle was to demonstrate probable scenarios where the microbiome alone determines its host’s fitness while also displaying high $\beta$-diversity and an inability to drive selection between the hosts in the population. In these scenarios, we also aim to pinpoint the parameters that facilitate this duality. In this article we present three such scenarios: a species-rich microbiome configuration, a relatively uniform distribution of contributions to host fitness between microbial species, and a small host population size.

Resolving the conundrum means settling the conflict between the empirical findings which brought the microbial $\beta$-diversity conundrum to light: ones that underline the microbiome’s importance to its host’s fitness on one hand^{1,2,3,4,5,6,7,8,11,13,14,15,16,75}, but that do not lead to the conservation of a particular microbiome structure in a population on the other^{17,18}. We find that one solution may lie in the composite nature of the microbiome. Being composed of many different species with different contributions, as opposed to the traditional traits that are usually considered when discussing trait conservation, the overall influence of the microbiome on its host is subject to the law of large numbers. As such, we expect that species-poor microbiome configurations are more likely to drive selection among hosts, and thus to be correspondingly more conserved. This hypothesis is supported by empirical observations in insects, which are characterized by microbiomes composed of relatively few species, where greater uniformity in microbiome compositions has been reported^{76,77,78,79,80}. Such a difference in the compositionality of the microbiome may also occur between different body sites’ microbiomes in the same host; in humans, for example, the vaginal microbiome is relatively species-poor and, in line with the hypothesis above, is characterized by relatively low within-population $\beta$-diversity^{81,82}.

Our findings may direct further research to empirically validate whether the microbial diversity conundrum can truly be explained by the solutions we have suggested here. Novel research can focus on the real-world causes and implications of the conundrum, and test whether more microbiome $\beta$-diversity or conservation is found in populations that are characterized by one or more of the factors that were discussed, per the results of our simulations.

Notably, the three factors that we highlight as providing possible solutions to the conundrum do so in qualitatively different ways, with rather different characteristics. All three factors (microbiome diversity, contribution distribution of the microbiome, and host population size) can influence the number of generations needed for the host lineages to coalesce, yet only the first two impact the fitness scores’ distributions in the first generation. This difference underscores the multiplicity of types of factors that take part in the correlation between microbiome and host selection. The solutions regarding the microbiome richness and distribution of contributions are driven by the tendency of selection to be subject to random sampling and regression towards a mean, with results that are mediated by the law of large numbers and are thus evident even in a single generation. In contrast, the solution related to the host population size influences selection dynamics over multiple generations, creating (or not creating) slight selection biases very gradually, which are not necessarily noticeable within a single generation. Interestingly, these factors require different perspectives for their consideration and are traditionally treated within different sub-disciplines of academic study; they are thus rarely considered within the same framework.

This also highlights a significant difference between our two measures of the microbiome’s influence on host natural selection. The first, the distribution of fitness scores in the first generation, corresponds to what is typically monitored in empirical studies of the microbiome and its effect on its host’s wellbeing. Discussing the variance of fitness scores in our simulation parallels examining the phenotypic diversity among hosts with varying microbiome compositions, which is what initially sparked the notion of the microbiome’s importance. In contrast, the second measure, the time to ancestral coalescence, parallels the microbiome to a hereditary trait. By summarizing its impact on selection processes through multiple generations, this measure may be more appropriate for inference or exploration of long-term evolutionary dynamics of the host population, closer in spirit to frameworks of population genetics and molecular evolution⁹.

To keep our framework general and modular, most of the ‘real-world’ processes regarding the population and ecological dynamics of the microbes and the hosts were not implemented. Hosts reproduce asexually and are passive in the microbiome acquisition process, the microbes do not interact among themselves in ways other than being subjected to the host’s carrying capacity, and after the initial assembly process ends, the microbiome remains constant throughout the host’s life span. The framework was designed in a modular way, allowing future incorporation of these processes, alongside other dynamics that were not implemented in the model. By using a simplified model, we were able to identify underlying factors mediating the microbiome’s ability to affect host selection even under simplified and selectively neutral intra-microbiome dynamics. The determinants of the microbiome’s $\beta$-diversity in reality may be many; furthermore, even within the simple version of our framework that was used in this study, a broad range of values of $\beta$-diversity can occur, depending on the parameter values used for the microbiomes’ assembly; here, we focused on highlighting scenarios in which high $\beta$-diversity would be maintained over evolutionary time-scales despite a significant role of the microbiome in determination of fitness. In the future, our framework may also be utilized to explore the expected $\beta$-diversity in more specific scenarios, such as for a certain species in which the real parameter values are known. We have also focused here on the hardest-to-explain scenario with respect to $\beta$-diversity, namely the case in which it fully determines the host’s fitness and yet remains high. Our framework can also be used to explore scenarios in which $\beta$-diversity is low in the first place due to the assembly process or other constraints, or to study cases in which fitness is only partially dependent on the microbiome’s composition.

The framework’s modularity enables it to simulate a broad range of scenarios, and may be used in future research of questions that are unrelated to the microbial $\beta$-diversity conundrum. For example, the simulations can be used to predict the environmental rescue effect of microbial species within the microbiomes of a host population. In other words, it can be used to analyze the probability, and the factors controlling it, of species of microbes that were extinct in a population’s microbiome to spread once again through the population if re-introduced at some rate from the environment. This can contribute to a current ongoing discussion about the possibility and benefits of human microbiome rewilding—the act of reintroducing lost microbe species to human microbiomes to regain health benefits our hunter-gatherer ancestors possessed^83,84.

In conclusion, we have focused on a surprising paradox that has gone largely unnoticed thus far: the microbial $\beta$-diversity conundrum, an apparent conflict between two commonly discussed findings regarding the microbiome. We have attempted to understand how the composition of the microbiome can be crucial for host fitness, while also being highly divergent among healthy individuals. Using a series of simulations, our research presents several probable factors that could enable this duality: a species-rich microbiome composition, a uniform distribution of microbial contributions to host fitness, or a small population size of hosts. Not only can these solutions resolve the paradox, but they can also direct further research regarding the microbiome’s diversity and the intricate relationship between hosts and their associated microbes. Furthermore, the presented framework is modular and can be used to explore a range of additional topics in microbiome research.

Methods

The framework implements an agent-based simulation of a host population consisting of a fixed number of individuals, $N=50$, with $B=2000$ microbe taxa available in the environment unless stated otherwise. Each executed simulation was run with $\mathrm{AC}=2$, meaning until the population was comprised of hosts sharing no more than two common ancestors.

Measuring the microbiome’s influence on host selection

The first measure we used to study selection dynamics, the distribution of host fitness scores in a single generation, is directly derived from the individuals’ microbiome compositions and correlates with each individual’s expected mean number of offspring, thus acting as a relevant indicator for the microbiome’s effect on host population dynamics. We approximated the magnitude of the difference using the variance of the fitness score distribution, comparing it for the first generation under each simulated scenario. Although it is a direct measure, it is also a short-term one, applicable only to individual generations.

The second measure of selection dynamics is the relative time it takes for all the hosts in the population to share the same ancestor. This evaluation is taken from the field of population genetics and is an adaptation of the time it takes an advantageous allele to fixate in a population, paralleling the alleles with the microbiome⁷¹. Naturally, it is most relevant when microbiome transmission is mostly vertical since an influence on the coalescence generation is expected only to a limited extent in scenarios where the microbiome is not strongly correlated to specific lineages, as is the case in horizontal transmission.
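The second measure can be illustrated with a minimal sketch of lineage tracking under purely vertical transmission. It makes strong simplifying assumptions that are not taken from the framework's code: each founder lineage carries a fixed fitness that is inherited unchanged, parents are drawn in proportion to fitness, and coalescence is counted when no more than `ac` founder lineages remain. The function name and the toy fitness values are hypothetical.

```python
import random

def generations_to_coalescence(fitness, ac=1, rng=None, max_generations=100000):
    """Generations until at most `ac` founder lineages remain in the population.

    fitness[f] is the fixed fitness of founder lineage f (an assumption that
    stands in for a perfectly inherited microbiome).
    """
    rng = rng or random.Random()
    n = len(fitness)
    lineage = list(range(n))  # founder index of each current host
    generation = 0
    while len(set(lineage)) > ac and generation < max_generations:
        weights = [fitness[f] for f in lineage]
        # Each offspring picks its parent with probability proportional to fitness.
        lineage = rng.choices(lineage, weights=weights, k=n)
        generation += 1
    return generation

rng = random.Random(1)
n_hosts = 50
neutral = [1.0] * n_hosts
skewed = [5.0] + [1.0] * (n_hosts - 1)  # one founder lineage is much fitter
t_neutral = [generations_to_coalescence(neutral, rng=rng) for _ in range(200)]
t_skewed = [generations_to_coalescence(skewed, rng=rng) for _ in range(200)]
mean_neutral = sum(t_neutral) / len(t_neutral)
mean_skewed = sum(t_skewed) / len(t_skewed)
print(mean_neutral / n_hosts, mean_skewed / n_hosts)  # times relative to N
```

Dividing the result by the population size gives a relative time that can be compared across populations of different sizes, in the spirit of the normalization used for the coalescence results.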

Generation of microbiome templates

Instead of real-time calculation of each host’s exact microbiome structure, “empty” microbiome templates denoting only the number of species in the microbiome and their abundances were pre-generated, only to be assigned specific taxa during the simulation itself. At the moment of creation, a host is assigned such a template: empty slots varying in size that represent abundance, each to be allocated later to a different microbe taxon out of a total of $B$ existing taxa. The microbiome templates are generated by simulating microbial establishment events, where each event represents one slot in the final template. As these consecutive events act as a Poisson process, each waiting time $t_i$, between establishment events $e_{i-1}$ and $e_i$, is drawn from an exponential distribution with a tunable rate parameter $\lambda_1$:

$$\forall i \in \mathbb{N}: \quad t_i \sim \mathrm{Exp}\left(\lambda_1\right)$$

(1)

Afterward, each waiting time is divided by an establishment probability coefficient $s_i$, whose value reflects the probability of a successful establishment event as a function of its chronological order. In our framework, three such coefficient vectors can be applied, each describing a different microbiome acquisition scenario. (i) A null scenario where all establishment probabilities are 1, thus $t_i$ remains as is (Fig. S5a). (ii) The earlier the establishment event, the more likely it is, simulating the growing struggle for space and resources when more and more taxa inhabit the same niche⁸⁵. The establishment probabilities follow a scaled exponential decay with a scaling factor $E_s$ and a rate $\lambda_2$ (Fig. S5b). (iii) The highest establishment probabilities are received after several pioneer taxa have already established within the host, colonizing it and making it more habitable⁸⁶, followed by a decrease in the probability, as in the previous case. This creates a “hump”-shaped probability vector, with parameters $a,b,c$ defining the hump parabola $ax^2+bx+c$, alongside $p$ controlling the index of the event with the highest probability and a scaling factor $H_s$. The vector is then normalized to the range $[0, 1]$, and the minimal probability is set to $H_m$ (Fig. S5c). The explicit notation of the three establishment probability vectors is:

$$\left\{1 \mid i \in \mathbb{N}\right\}$$

(2)

$$\left\{\frac{E_s+e^{-\lambda_2 i}}{1+E_s} \;\middle|\; i \in \mathbb{N}\right\}$$

(3)

$$\text{Let } P=a\left(i-p\right)^2+b\left(i-p\right)+c: \quad \left\{\max\left(\frac{1}{1+H_s}\cdot\left(H_s+\frac{P-\min\left(P\right)}{\max\left(P\right)-\min\left(P\right)}\right),\, H_m\right) \;\middle|\; i \in \mathbb{N}\right\}$$

(4)
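The three establishment probability vectors of Eqs. (2) to (4) can be sketched as follows. This is an illustrative reading, not the framework's implementation: the vectors are truncated to a finite number of events `n_events` (needed for the minimum and maximum of the parabola to exist), the event index is assumed to start at 1, and the function names and example parameter values are hypothetical.

```python
import math

def null_vector(n_events):
    """Eq. (2): every establishment probability equals 1."""
    return [1.0] * n_events

def decay_vector(n_events, e_s, lam2):
    """Eq. (3): scaled exponential decay with scaling factor e_s and rate lam2."""
    return [(e_s + math.exp(-lam2 * i)) / (1.0 + e_s)
            for i in range(1, n_events + 1)]

def hump_vector(n_events, a, b, c, p, h_s, h_m):
    """Eq. (4): min-max normalized parabola, scaled by h_s and floored at h_m."""
    parabola = [a * (i - p) ** 2 + b * (i - p) + c
                for i in range(1, n_events + 1)]
    lowest, highest = min(parabola), max(parabola)
    span = highest - lowest
    vector = []
    for value in parabola:
        # A flat parabola has no spread; treat it as fully normalized (assumption).
        normalized = (value - lowest) / span if span > 0 else 1.0
        vector.append(max((h_s + normalized) / (1.0 + h_s), h_m))
    return vector

decay = decay_vector(5, e_s=0.5, lam2=1.0)
hump = hump_vector(10, a=-1.0, b=0.0, c=0.0, p=5, h_s=0.1, h_m=0.2)
print(decay)
print(hump)
```

In this example the hump peaks at the fifth event, where the normalized parabola equals 1, and the floor `h_m` takes over at the tail, where the scaled value would otherwise fall below it.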

The drawn waiting times are scaled by the scaling vector, relatively shortening the waiting time of a high probability event and elongating that of a low probability one. Thus, the final waiting times, ${{t}_{f}}_{i}$, are:

$$\forall i \in \mathbb{N}: \quad {t_f}_i=\frac{t_i}{s_i}$$

(5)

During these waiting times, the already established taxa grow in number in each timestep according to a classic logistic growth function, where ${C}_{s},k,m$ represent the maximal size of a single taxon within the host, the growth steepness, and the sigmoid midpoint respectively. The population size of a single microbe taxon after ${t}_{i}$ time would be:

$$\frac{{C}_{s}}{1+{e}^{-k\left({t}_{i}-m\right)}}$$

(6)

The microbiome template’s computation is finished when the sum of the abundances of all microbe taxa within the host has reached a predefined global capacity, ${C}_{g}$.
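The template generation described above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the framework's code: the global capacity is checked only at establishment events rather than at every timestep, each taxon's abundance is the logistic function of the time elapsed since its own establishment, and the function names and parameter values are hypothetical.

```python
import math
import random

def logistic_size(age, c_s, k, m):
    """Eq. (6): abundance of a taxon that has been established for `age` time units."""
    return c_s / (1.0 + math.exp(-k * (age - m)))

def generate_template(establishment_prob, lam1, c_s, k, m, c_g, rng):
    """Return slot abundances, one per established taxon, earliest arrival first."""
    arrival_times = []
    sizes = []
    now = 0.0
    for s_i in establishment_prob:
        # Eq. (1) waiting time, scaled as in Eq. (5): t_f = t / s.
        now += rng.expovariate(lam1) / s_i
        arrival_times.append(now)
        sizes = [logistic_size(now - t0, c_s, k, m) for t0 in arrival_times]
        if sum(sizes) >= c_g:  # global capacity C_g reached
            break
    return sizes

rng = random.Random(0)
template = generate_template([1.0] * 500, lam1=1.0, c_s=100.0, k=1.0, m=5.0,
                             c_g=1000.0, rng=rng)
print(len(template), round(sum(template), 1))
```

Lower establishment probabilities lengthen the scaled waiting times, so earlier taxa have more time to grow before the next arrival; under this sketch that tends to produce fewer and more uneven slots.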

Different parameters were used to pre-generate 10,000 different microbiome templates for both the species-rich and species-poor microbiome configurations, to be chosen from randomly during the initialization of each host in the simulations (Tables S1a, S1b). To reach species-rich microbiome templates, waiting times were scaled according to scenario (iii), and to reach the species-poor templates, waiting times were scaled according to scenario (ii). Supplementary figure S3 depicts microbiome templates that were constructed with slightly different parameters, used in simulations that are described in the supplementary material as well.

Microbiome acquisition

After a microbiome template is assigned to the host, the microbiome acquisition process amounts to assigning each empty slot a unique microbe taxon, with the slot denoting its abundance within the host. The first generation of hosts in each simulation is randomly seeded with different microbes out of the possible $B$ taxa. Each host randomly selects microbe species according to the number of slots in its microbiome template, and randomly assigns each taxon to a different slot, thus creating a diverse initial host population.

In the next generations, the host acquires the microbiome by randomly sampling available microbes from a distribution dictated by its parent and the entire previous population. The parental source, $P$, is simply the microbiome of the host’s parent normalized to represent the relative abundance of its microbes, and the population-wide microbiome, $E$, is the per-taxon summation of abundances in the previous population’s microbiomes, also normalized. The final available microbe distribution is a summation of the two sources, weighted by the vertical and horizontal transmission coefficients $T_v, T_h \in \mathbb{R}^{+}$, representing the relative contribution of the parental source and the population-wide source respectively:

$$\text{abundance of taxon } i \text{ in the sampling distribution}=T_v\cdot P_i+T_h\cdot E_i$$

(7)

The microbes from the available pool are randomly assigned to the slots, weighted by their abundance in the pool. Thus, taxa that are more abundant in the joint contribution of the sources are more likely to establish first and inhabit larger slots in the host microbiome’s template.
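The acquisition step can be sketched as a weighted draw without replacement from the mixed pool of Eq. (7). This is a minimal illustration, assuming microbiomes are stored as dictionaries mapping taxon to abundance and that slots are filled from the largest to the smallest; the function names and the toy data are hypothetical.

```python
import random

def normalize(abundances):
    """Convert absolute abundances to relative abundances."""
    total = sum(abundances.values())
    return {taxon: value / total for taxon, value in abundances.items()}

def sampling_distribution(parent, population, t_v, t_h):
    """Eq. (7): T_v * P_i + T_h * E_i for every taxon i."""
    p = normalize(parent)
    summed = {}
    for microbiome in population:
        for taxon, abundance in microbiome.items():
            summed[taxon] = summed.get(taxon, 0.0) + abundance
    e = normalize(summed)
    return {taxon: t_v * p.get(taxon, 0.0) + t_h * e.get(taxon, 0.0)
            for taxon in sorted(set(p) | set(e))}

def fill_template(template, pool, rng):
    """Assign a unique taxon to each slot, drawing with weights from the pool."""
    remaining = {taxon: w for taxon, w in pool.items() if w > 0}
    microbiome = {}
    for slot_size in sorted(template, reverse=True):  # largest slot first
        if not remaining:
            break  # fewer available taxa than slots
        taxa = sorted(remaining)
        weights = [remaining[taxon] for taxon in taxa]
        chosen = rng.choices(taxa, weights=weights, k=1)[0]
        microbiome[chosen] = slot_size
        del remaining[chosen]  # each taxon occupies at most one slot
    return microbiome

parent = {"a": 3.0, "b": 1.0}
previous_population = [{"a": 1.0, "c": 1.0}, {"b": 2.0}]
vertical = sampling_distribution(parent, previous_population, t_v=1.0, t_h=0.0)
mixed = sampling_distribution(parent, previous_population, t_v=0.5, t_h=0.5)
child = fill_template([5, 10], vertical, random.Random(0))
print(mixed, child)
```

Because taxa are drawn sequentially and the largest slot is filled first, taxa that are abundant in the pool are more likely to end up in the larger slots.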

Microbiome contribution

The specific contribution of each microbe species to the host’s fitness is generated at the start of the simulation and remains constant throughout. For each taxon, a contribution value is randomly selected within the range of $C_{\min}$ and $C_{\max}$, representing the minimal and maximal possible contributions respectively. The random sampling is altered by various parameters following the simulated scenario and is factored by the contribution rate parameter $\lambda_3$ (Table S2). (i) A “step” contribution scenario, where some taxa contribute $C_{\min}$, others $C_{\max}$, and $\lambda_3$ represents the number of taxa that contribute $C_{\max}$ (Fig. S1a). (ii) An “exponential decay” contribution scenario, where the sampling is weighted according to an exponential distribution $\mathrm{Exp}(\lambda_3)$. Large $\lambda_3$ values denote a steep density distribution, eventually resulting in a small number of microbes that contribute a lot while the rest barely do, whereas small $\lambda_3$ values denote an almost uniform contribution distribution (Fig. S1b).
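The two contribution scenarios can be sketched as below. The text does not specify how the exponential weighting is implemented, so the second function is only one possible reading: it draws from an exponential distribution truncated to [0, 1] by inverse-transform sampling and rescales the result to the contribution range. The function names and parameter values are hypothetical.

```python
import math
import random

def step_contributions(n_taxa, n_high, c_min, c_max, rng):
    """Step scenario: exactly n_high randomly chosen taxa contribute c_max."""
    high = set(rng.sample(range(n_taxa), n_high))
    return [c_max if j in high else c_min for j in range(n_taxa)]

def exp_decay_contributions(n_taxa, lam3, c_min, c_max, rng):
    """Exponential-decay scenario (assumed form): truncated Exp(lam3) on [0, 1]."""
    scale = 1.0 - math.exp(-lam3)  # probability mass of Exp(lam3) below 1
    values = []
    for _ in range(n_taxa):
        u = rng.random()
        x = -math.log(1.0 - u * scale) / lam3  # inverse CDF of the truncated law
        values.append(c_min + x * (c_max - c_min))
    return values

rng = random.Random(0)
step = step_contributions(2000, n_high=50, c_min=0.0, c_max=1.0, rng=rng)
steep = exp_decay_contributions(2000, lam3=20.0, c_min=0.0, c_max=1.0, rng=rng)
flat = exp_decay_contributions(2000, lam3=0.01, c_min=0.0, c_max=1.0, rng=rng)
print(sum(steep) / len(steep), sum(flat) / len(flat))
```

Under this reading a large rate concentrates most taxa near the minimal contribution, while a rate close to zero approaches a uniform distribution over the range.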

Fitness calculation

The fitness of a host is a linear summation of all the contributions of microbe taxa that dwell in its microbiome, counting a species’ contribution only once if it is present in the microbiome, regardless of its abundance. That is, for host $i$, the fitness $f_i$ is calculated as follows:

$$f_i=\sum\limits_{j=1}^{B}\mathbf{1}_{\{\text{taxon } j \text{ in host } i\text{'s microbiome}\}}\cdot c_j$$

(8)

After the fitness of the entire population is calculated, it is divided by the maximal value in order to adhere to the traditional $[0,1]$ fitness range.
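The presence-based fitness of Eq. (8) and the normalization by the population maximum can be sketched as follows, assuming microbiomes are stored as dictionaries mapping taxon to abundance; the function names and the toy values are hypothetical.

```python
def raw_fitness(microbiome, contribution):
    """Eq. (8): sum the contribution of each taxon present, ignoring abundance."""
    return sum(contribution[taxon] for taxon in set(microbiome))

def normalized_fitness(population, contribution):
    """Divide every raw fitness by the population maximum to land in [0, 1]."""
    raw = [raw_fitness(microbiome, contribution) for microbiome in population]
    top = max(raw)
    if top == 0:
        return [0.0 for _ in raw]  # degenerate case: no taxon contributes anything
    return [value / top for value in raw]

contribution = {"a": 0.5, "b": 0.25, "c": 1.0}
population = [{"a": 100, "b": 1}, {"c": 3}, {"a": 7}]
scores = normalized_fitness(population, contribution)
print(scores)
```

Note that the first host scores higher than the third only because it carries an additional taxon; the hundredfold abundance of taxon "a" plays no role.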

Jaccard distance calculation

The Jaccard distance between the microbiome configurations of two hosts $h_A$ and $h_B$, with microbiomes $A$ and $B$, was calculated in the standard way:

$$\mathrm{Jaccard\; distance}\left(h_A,h_B\right)=1-\frac{\left|A\cap B\right|}{\left|A\cup B\right|}$$

(9)

The representative Jaccard distance of a population $P$ was calculated as the mean of the Jaccard distances between every pair of hosts in the population:

$$\mathrm{Jaccard}\left(P\right)=\frac{\sum\limits_{i,j\in \left[1\ldots N\right],\, i<j}\mathrm{Jaccard\; distance}\left(h_i,h_j\right)}{\binom{N}{2}}$$

(10)
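The two Jaccard computations can be sketched with sets of taxa, averaging over each unordered pair of hosts once, which is what dividing by the number of pairs implies. The function names, the handling of two empty microbiomes, and the toy data are assumptions made for illustration.

```python
from itertools import combinations

def jaccard_distance(a, b):
    """Eq. (9): one minus the shared fraction of the taxa found in either host."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0  # two empty microbiomes are treated as identical (assumption)
    return 1.0 - len(a & b) / len(union)

def population_jaccard(microbiomes):
    """Eq. (10): mean distance over all unordered pairs of hosts."""
    pairs = list(combinations(microbiomes, 2))
    return sum(jaccard_distance(a, b) for a, b in pairs) / len(pairs)

hosts = [{"a", "b"}, {"b", "c"}, {"a", "b"}]
print(population_jaccard(hosts))
```

Here the first and third hosts are identical (distance 0) and each differs from the second by 2/3, so the population value is the mean of those three pairwise distances.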

Statistical significance calculation

We used standard two-sided t-tests to determine whether observed differences between different groups were significant. Wherever first generation fitness scores were compared, the compared groups were composed of 100 repetitions $\times N$ hosts. Where $N$ was different across comparisons the calculations were done on the minimal $N$. Each set of compared times to coalescence was comprised of measurements from 100 stochastic simulation repetitions.

Boxplot information

All boxplots in this manuscript are presented with the sample median and the box representing the 25th to 75th percentiles. Whiskers portray the sample minima and maxima.

Relation to other generative frameworks for modeling host-microbiome dynamics

Several computational frameworks have been constructed so far for the modeling of host-microbiome dynamics in population contexts of both hosts and microbes^{45,46,47,49,50,87,88}. Among these, the framework we propose here is most similar to the two models proposed by Zeng et al.^45,46. Our implementation is independent of theirs, and a detailed comparison among them in the future may be productive, as it may highlight qualitative differences that arise from seemingly arbitrary modeling choices and implementation practices. Two particularly notable differences between Zeng et al.’s (2015, 2017) frameworks and our framework have to do with the dynamics of construction of a host’s microbiome, and the way in which the microbiome may influence host fitness.

(1) A host individual’s microbiome in the Zeng et al. framework (2015) is composed of microbes that occupy a pre-determined number of slots (n = 1000, or n = 100,000, for example); these slots are filled via multinomial sampling from several possible sources according to simulation parameters. Our framework’s scheme of populating the microbiome is slightly more ecologically realistic: we distinguish between transmission of microbes (a rare event, of one individual microbe) and their establishment (which may depend on the number of previously established species, for example, even in a niche-neutral model, as described above) and multiplication within the host, from a single individual to 10³−10⁸ individuals, depending on how early in the colonization process the microbe arrived. Depending on model parameters these processes can lead to a range of microbiome structures as discussed above, and the model can thus be used to explore the ecological determinants of microbiome assembly and their interaction with dynamics on evolutionary timescales (of multiple host generations), including incorporation of a range of ecological considerations and their influence on patterns of microbial diversity. This includes, for example, direct comparison between scenarios in which the dominant force is transmission limitation and scenarios in which the main force shaping the microbiome is selection that the host environment imposes.
(2) The Zeng et al. (2017) framework includes microbial influence on host fitness which is slightly different from this influence in our framework: their approach focuses on a functional perspective, and each microbe can contribute to one or more of certain functions that benefit or harm the host. Our framework is structured in a modular way that allows such exploration in future studies, but in the version implemented here we explore, as described above, several schemes of microbial contribution to the host that make very minimal assumptions about the functional profile of the microbiome. We compare different fitness schemes in which each microbe has an additive contribution to its host’s fitness, each sampled from a certain distribution, showing that different distributions would lead to qualitatively different selection dynamics.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Representative samples of the raw data, as generated during runs of the simulations described in this paper, are available at github.com/itaydaybog/MicrobiomeFramework. The full code is provided, allowing the reproduction of all data and figures used in this study. All produced data can be received from the authors upon request.

Code availability

The code of the framework described in this paper, alongside code for the results presentation and analysis, is provided in github.com/itaydaybog/MicrobiomeFramework⁸⁹. The model was implemented in Python 3.7.

References

Bäckhed, F., Ley, R. E., Sonnenburg, J. L., Peterson, D. A. & Gordon, J. I. Host-bacterial mutualism in the human intestine. Science 307, 1915–1920 (2005).
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012).
Fukuda, S. et al. Bifidobacteria can protect from enteropathogenic infection through production of acetate. Nature 469, 543–547 (2011).
Olszak, T. et al. Microbial exposure during early life has persistent effects on natural killer T cell function. Science 336, 489–493 (2012).
Marasco, R. et al. A drought resistance-promoting microbiome is selected by root system under desert farming. PLoS ONE 7, e48479 (2012).
Ziegler, M., Seneca, F. O., Yum, L. K., Palumbi, S. R. & Voolstra, C. R. Bacterial community dynamics are linked to patterns of coral heat tolerance. Nat. Commun. 8, 1–8 (2017).
Blaser, M., Bork, P., Fraser, C., Knight, R. & Wang, J. The microbiome explored: recent insights and future challenges. Nat. Rev. Microbiol. 11, 213 (2013).
Ley, R. E., Turnbaugh, P. J., Klein, S. & Gordon, J. I. Human gut microbes associated with obesity. Nature 444, 1022–1023 (2006).
Kolodny, O., Callahan, B. J. & Douglas, A. E. The role of the microbiome in host evolution. Philos. Trans. R. Soc. B 375, 20190588 (2020).
Kolodny, O. & Schulenburg, H. Microbiome-mediated plasticity directs host evolution along several distinct time scales. Philos. Trans. R. Soc. B 375, 20190589 (2020).
Gould, A. L. et al. Microbiome interactions shape host fitness. Proc. Natl Acad. Sci. USA 115, E11951–E11960 (2018).
Sommer, F. & Bäckhed, F. The gut microbiota—masters of host development and physiology. Nat. Rev. Microbiol. 11, 227–238 (2013).
Heintz, C. & Mair, W. You are what you host: microbiome modulation of the aging process. Cell 156, 408–411 (2014).
Archie, E. A. & Tung, J. Social behavior and the microbiome. Curr. Opin. Behav. Sci. 6, 28–34 (2015).
Cryan, J. F. & Dinan, T. G. Mind-altering microorganisms: the impact of the gut microbiota on brain and behaviour. Nat. Rev. Neurosci. 13, 701–712 (2012).
Smith, P. et al. Regulation of life span by the gut microbiota in the short-lived African turquoise killifish. Elife 6, e27014 (2017).
Huttenhower, C. et al. Structure, function and diversity of the healthy human microbiome. Nature 486, 207 (2012).
Falony, G. et al. Population-level analysis of gut microbiome variation. Science 352, 560–564 (2016).
Davenport, E. R. et al. The human microbiome in evolution. BMC Biol. 15, 1–12 (2017).
Fisher, R. A. The Genetical Theory of Natural Selection (Рипол Классик, 1958).
Jordan, I. K., Rogozin, I. B., Wolf, Y. I. & Koonin, E. V. Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res. 12, 962–968 (2002).
Zhang, L. & Li, W.-H. Mammalian housekeeping genes evolve more slowly than tissue-specific genes. Mol. Biol. Evol. 21, 236–239 (2004).
Guharoy, M. & Chakrabarti, P. Conservation and relative importance of residues across protein-protein interfaces. Proc. Natl Acad. Sci. USA 102, 15447–15452 (2005).
Valdar, W. S. J. & Thornton, J. M. Conservation helps to identify biologically relevant crystal contacts. J. Mol. Biol. 313, 399–416 (2001).
Landau, M. et al. ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures. Nucleic Acids Res. 33, W299–W302 (2005).
Cooper, G. M. & Brown, C. D. Qualifying the relationship between sequence conservation and molecular function. Genome Res. 18, 201–205 (2008).
Nguyen Ba, A. N. et al. Proteome-wide discovery of evolutionary conserved sequences in disordered regions. Sci. Signal. 5, rs1–rs1 (2012).
Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005).
Rogers, D. S. & Ehrlich, P. R. Natural selection and cultural rates of change. Proc. Natl Acad. Sci. USA 105, 3416–3420 (2008).
Charlesworth, D. Balancing selection and its effects on sequences in nearby genome regions. PLoS Genet. 2, e64 (2006).
Fitzpatrick, M. J., Feder, E., Rowe, L. & Sokolowski, M. B. Maintaining a behaviour polymorphism by frequency-dependent selection on a single gene. Nature 447, 210–212 (2007).
Brown, W. L. & Wilson, E. O. Character displacement. Syst. Zool. 5, 49–64 (1956).
Hedrick, P. W. Genetic polymorphism in heterogeneous environments: a decade later. Annu. Rev. Ecol. Syst. 17, 535–566 (1986).
Christie, M. R., McNickle, G. G., French, R. A. & Blouin, M. S. Life history variation is maintained by fitness trade-offs and negative frequency-dependent selection. Proc. Natl Acad. Sci. USA 115, 4441–4446 (2018).
Seedorf, H. et al. Bacteria from diverse habitats colonize and compete in the mouse gut. Cell 159, 253–266 (2014).
Rawls, J. F., Mahowald, M. A., Ley, R. E. & Gordon, J. I. Reciprocal gut microbiota transplants from zebrafish and mice to germ-free recipients reveal host habitat selection. Cell 127, 423–433 (2006).
Ottman, N. et al. Soil exposure modifies the gut microbiota and supports immune tolerance in a mouse model. J. Allergy Clin. Immunol. 143, 1198–1206 (2019).
Liddicoat, C. et al. Naturally-diverse airborne environmental microbial exposures modulate the gut microbiome and may provide anxiolytic benefits in mice. Sci. Total Environ. 701, 134684 (2020).
Burke, C., Steinberg, P., Rusch, D., Kjelleberg, S. & Thomas, T. Bacterial community assembly based on functional genes rather than species. Proc. Natl Acad. Sci. USA 108, 14288–14293 (2011).
Morgan, X. C., Segata, N. & Huttenhower, C. Biodiversity and functional genomics in the human microbiome. Trends Genet. 29, 51–58 (2013).
Heintz-Buschart, A. & Wilmes, P. Human gut microbiome: function matters. Trends Microbiol. 26, 563–574 (2018).
Manor, O. & Borenstein, E. Revised computational metagenomic processing uncovers hidden and biologically meaningful functional variation in the human microbiome. Microbiome 5, 1–11 (2017).
Heintz-Buschart, A. et al. Integrated multi-omics of the human gut microbiome in a case study of familial type 1 diabetes. Nat. Microbiol. 2, 1–13 (2016).
Franzosa, E. A. et al. Relating the metatranscriptome and metagenome of the human gut. Proc. Natl Acad. Sci. USA 111, E2329–E2338 (2014).
Zeng, Q., Wu, S., Sukumaran, J. & Rodrigo, A. Models of microbiome evolution incorporating host and microbial selection. Microbiome 5, 1–16 (2017).
Zeng, Q., Sukumaran, J., Wu, S. & Rodrigo, A. Neutral models of microbiome evolution. PLoS Comput. Biol. 11, e1004365 (2015).
Roughgarden, J., Gilbert, S. F., Rosenberg, E., Zilber-Rosenberg, I. & Lloyd, E. A. Holobionts as units of selection and a model of their population dynamics and evolution. Biol. Theory 13, 44–65 (2018).
Osmanovic, D., Kessler, D. A., Rabin, Y. & Soen, Y. Darwinian selection of host and bacteria supports emergence of Lamarckian-like adaptation of the system as a whole. Biol. Direct 13, 24 (2018).
Van Vliet, S. & Doebeli, M. The role of multilevel selection in host microbiome evolution. Proc. Natl Acad. Sci. USA 116, 20591–20597 (2019).
Daybog, I. & Kolodny, O. Simplified model assumptions artificially constrain the parameter range in which selection at the holobiont level can occur. Proc. Natl Acad. Sci. USA 117, 11862–11863 (2020).
Rosindell, J., Hubbell, S. P. & Etienne, R. S. The unified neutral theory of biodiversity and biogeography at age ten. Trends Ecol. Evol. 26, 340–348 (2011).
Hubbell, S. P. The Unified Neutral Theory of Biodiversity and Biogeography (MPB-32). vol. 32 (Princeton University Press, 2001).
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
Fisher, R. A. XXI.—On the dominance ratio. Proc. R. Soc. Edinb. 42, 321–341 (1923).
Alonso, D., Etienne, R. S. & McKane, A. J. The merits of neutral theory. Trends Ecol. Evol. 21, 451–457 (2006).
Ram, Y., Liberman, U. & Feldman, M. W. Evolution of vertical and oblique transmission under fluctuating selection. Proc. Natl Acad. Sci. USA 115, E1174–E1183 (2018).
Das, P., Babaei, P. & Nielsen, J. Metagenomic analysis of microbe-mediated vitamin metabolism in the human gut microbiome. BMC Genomics 20, 208 (2019).
Youngblut, N. D. et al. Host diet and evolutionary history explain different aspects of gut microbiome diversity among vertebrate clades. Nat. Commun. 10, 1–15 (2019).
Kolodny, O. et al. Coordinated change at the colony level in fruit bat fur microbiomes through time. Nat. Ecol. Evol. 3, 116–124 (2019).
Nishida, A. H. & Ochman, H. A great-ape view of the gut microbiome. Nat. Rev. Genet. 20, 195–206 (2019).
Kolasa, M. et al. How hosts taxonomy, trophy, and endosymbionts shape microbiome diversity in beetles. Microb. Ecol. 78, 995–1013 (2019).
Fredensborg, B. L. et al. Parasites modulate the gut-microbiome in insects: a proof-of-concept study. PLoS ONE 15, e0227561 (2020).
Judd, K. L. The law of large numbers with a continuum of iid random variables. J. Econ. Theory 35, 19–25 (1985).
Yachi, S. & Loreau, M. Biodiversity and ecosystem productivity in a fluctuating environment: the insurance hypothesis. Proc. Natl Acad. Sci. USA 96, 1463–1468 (1999).
Moya, A. & Ferrer, M. Functional redundancy-induced stability of gut microbiota subjected to disturbance. Trends Microbiol. 24, 402–413 (2016).
Vieira-Silva, S. et al. Species-function relationships shape ecological properties of the human gut microbiome. Nat. Microbiol. 1, 16088 (2016).
Hsu, P.-L. & Robbins, H. Complete convergence and the law of large numbers. Proc. Natl Acad. Sci. USA 33, 25 (1947).
Gillespie, J. H. Population Genetics: A Concise Guide (JHU Press, 2004).
Lohmueller, K. E. The distribution of deleterious genetic variation in human populations. Curr. Opin. Genet. Dev. 29, 139–146 (2014).
Ohta, T. Slightly deleterious mutant substitutions in evolution. Nature 246, 96–98 (1973).
Kimura, M. & Ohta, T. The average number of generations until fixation of a mutant gene in a finite population. Genetics 61, 763 (1969).
Ohta, T. The nearly neutral theory of molecular evolution. Annu. Rev. Ecol. Syst. 23, 263–286 (1992).
Kimura, M. & Ohta, T. Theoretical Aspects of Population Genetics. vol. 4 (Princeton University Press, 1971).
Rouzine, I. M., Rodrigo, A. & Coffin, J. M. Transition between stochastic evolution and deterministic evolution in the presence of selection: general theory and application to virology. Microbiol. Mol. Biol. Rev. 65, 151–185 (2001).
Beard, A. S. & Blaser, M. J. The ecology of height: the effect of microbial transmission on human height. Perspect. Biol. Med. 45, 475–498 (2002).
Pinto-Tomás, A. A. et al. Comparison of midgut bacterial diversity in tropical caterpillars (Lepidoptera: Saturniidae) fed on different diets. Environ. Entomol. 40, 1111–1122 (2011).
Paniagua Voirol, L. R., Frago, E., Kaltenpoth, M., Hilker, M. & Fatouros, N. E. Bacterial symbionts in Lepidoptera: their diversity, transmission, and impact on the host. Front. Microbiol. 9, 556 (2018).
Robinson, C. J., Schloss, P., Ramos, Y., Raffa, K. & Handelsman, J. Robustness of the bacterial community in the cabbage white butterfly larval midgut. Microb. Ecol. 59, 199–211 (2010).
Engel, P. & Moran, N. A. The gut microbiota of insects–diversity in structure and function. FEMS Microbiol. Rev. 37, 699–735 (2013).
Douglas, A. E. Lessons from studying insect symbioses. Cell Host Microbe 10, 359–367 (2011).
Callahan, B. J. et al. Replication and refinement of a vaginal microbial signature of preterm birth in two racially distinct cohorts of US women. Proc. Natl Acad. Sci. USA 114, 9966–9971 (2017).
DiGiulio, D. B. et al. Temporal and spatial variation of the human microbiota during pregnancy. Proc. Natl Acad. Sci. USA 112, 11060–11065 (2015).
Lorimer, J. Probiotic environmentalities: Rewilding with wolves and worms. Theory, Cult. Soc. 34, 27–48 (2017).
Mills, J. G. et al. Urban habitat restoration provides a human health benefit through microbiome rewilding: the Microbiome Rewilding Hypothesis. Restor. Ecol. 25, 866–872 (2017).
Hibbing, M. E., Fuqua, C., Parsek, M. R. & Peterson, S. B. Bacterial competition: surviving and thriving in the microbial jungle. Nat. Rev. Microbiol. 8, 15–25 (2010).
Huang, R., Li, M. & Gregory, R. L. Bacterial interactions in dental biofilm. Virulence 2, 435–444 (2011).
Roughgarden, J. Holobiont evolution: Population theory for the hologenome. Am. Nat. 201, 763–778 (2023).
Roughgarden, J. Holobiont evolution: mathematical model with vertical vs. horizontal microbiome transmission. Philosophy, Theory, and Practice in Biology 12, 002 https://doi.org/10.3998/ptpbio.16039257.0012.002 (2020).
Daybog, I. & Kolodny, O. Solutions to the microbiome diversity conundrum wherein the microbiome determines host fitness but differs among individuals. Zenodo https://doi.org/10.5281/zenodo.8373183 (2023).

Acknowledgements

We thank Tommy Kaplan, Amir Bar, Marcus Feldman, and members of the Kolodny lab for insightful comments and discussions. O.K. and I.D. were funded by the Israel Science Foundation (ISF; 1826/20), the United States – Israel Binational Science Foundation (BSF), and the Gordon and Betty Moore Foundation.

Author information

Authors and Affiliations

Department of Ecology, Evolution and Behavior, The A. Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
Itay Daybog & Oren Kolodny

Authors

Itay Daybog
Oren Kolodny

Contributions

O.K. conceived the framework in this project. I.D. and O.K. designed the study. I.D. implemented the model, derived the results, and analyzed them. I.D. and O.K. wrote the paper.

Corresponding authors

Correspondence to Itay Daybog or Oren Kolodny.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Allen Rodrigo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Daybog, I., Kolodny, O. A computational framework for resolving the microbiome diversity conundrum. Nat Commun 14, 7977 (2023). https://doi.org/10.1038/s41467-023-42768-4

Received: 24 November 2022
Accepted: 20 October 2023
Published: 02 December 2023
DOI: https://doi.org/10.1038/s41467-023-42768-4
