Nested whole-genome duplications coincide with diversification and high morphological disparity in Brassicaceae

Walden, Nora; German, Dmitry A.; Wolf, Eva M.; Kiefer, Markus; Rigault, Philippe; Huang, Xiao-Chen; Kiefer, Christiane; Schmickl, Roswitha; Franzke, Andreas; Neuffer, Barbara; Mummenhoff, Klaus; Koch, Marcus A.

doi:10.1038/s41467-020-17605-7

Download PDF

Article
Open access
Published: 30 July 2020

Nested whole-genome duplications coincide with diversification and high morphological disparity in Brassicaceae

Nature Communications volume 11, Article number: 3795 (2020) Cite this article

7572 Accesses
58 Citations
128 Altmetric
Metrics details

Subjects

Abstract

Angiosperms have become the dominant terrestrial plant group by diversifying for ~145 million years into a broad range of environments. During the course of evolution, numerous morphological innovations arose, often preceded by whole genome duplications (WGD). The mustard family (Brassicaceae), a successful angiosperm clade with ~4000 species, has been diversifying into many evolutionary lineages for more than 30 million years. Here we develop a species inventory, analyze morphological variation, and present a maternal, plastome-based genus-level phylogeny. We show that increased morphological disparity, despite an apparent absence of clade-specific morphological innovations, is found in tribes with WGDs or diversification rate shifts. Both are important processes in Brassicaceae, resulting in an overall high net diversification rate. Character states show frequent and independent gain and loss, and form varying combinations. Therefore, Brassicaceae pave the way to concepts of phylogenetic genome-wide association studies to analyze the evolution of morphological form and function.

Gene duplications and phylogenomic conflict underlie major pulses of phenotypic evolution in gymnosperms

Article 19 July 2021

Buxus and Tetracentron genomes help resolve eudicot genome history

Article Open access 02 February 2022

A genome assembly for Orinus kokonorica provides insights into the origin, adaptive evolution and further diversification of two closely related grass genera

Article Open access 02 December 2023

Introduction

The diversity of morphological traits and characters is dazzling, and its non-uniform realization during the evolution of plant life¹ allowed land plants to diversify into almost all terrestrial habitats. Morphological diversity is considered the major product of macroevolution^1,2. It is often expressed as disparity, which by its simplest definition—that we will follow here—is the amount of morphological variation present in a given taxon relative to the total morphological variation in the set of taxa under investigation². Morphological disparity and taxonomic richness are not independent, since morphological differences are the basis for taxonomic descriptions, which are then translated into different species and genera. Nevertheless, the two factors are not necessarily correlated, as high disparity can be maintained in clades of low taxonomic richness, and plant diversification can occur without expanding morphological variation (or the morphospace) within taxa^2,3. In animal clades, early high disparity with subsequent decrease is the predominant pattern in deep evolutionary histories⁴, whereas it is intriguing to notice that for the past ~100 my (million years) large clades of land plants, such as conifers, ferns, and angiosperms, have maintained high levels of disparity². However, in most cases the intrinsic or extrinsic factors driving disparity are unknown^2,4.

Whole-genome duplications (WGD) are among the factors that have the potential to increase disparity⁵, and they have played a fundamental role during land plant evolution⁶. WGD arises either via auto- or allopolyploidization, and the mode of polyploidization is often unknown for paleopolyploids; herein we cannot distinguish between the two. WGDs may not only provide the opportunity to develop new character states, key characters, or even evolutionary innovations, but also generally increase the potential to implement increased character and trait variation in a clade and thereby its disparity. Subsequently, increased morphological disparity may provide the basis to further diversify and eventually even radiate in a changing spatiotemporal context, while only a minority of characters and their states contribute to key traits and innovations and thereby drive clade diversification and radiation.

Polyploidy is common among land plants, with up to 24% of species being recent polyploids⁷, and all flowering plants have at least one if not numerous ancient polyploidy events in their history⁸. While the evolutionary potential of polyploids has been debated in recent years⁹, the importance of this phenomenon for angiosperm diversification has been corroborated by numerous studies. All angiosperms share at least two paleopolyploidization events¹⁰, and since the first identification of such major events, the so-called At-α, At-β, and At-γ events, in the genome of the model plant Arabidopsis thaliana¹¹, a multitude of WGDs have been identified in many angiosperm lineages¹². Many have been dated to times of major environmental change and transitions between geological epochs, like the Cretaceous-Paleogene boundary¹³ or Pleistocene glaciation-deglaciation cycles¹⁴. An adaptive advantage of polyploids is thought to be caused by the availability of duplicate genes for sub- and neofunctionalization and thus the emergence of new traits¹⁵. Clades having undergone WGDs often also have higher diversification rates¹⁶, although the timings of WGD and diversification do not necessarily coincide. Rather, a diversification leading to a species-rich clade occurs sometime after the split from another lineage, which remains relatively species poor. The so-called WGD radiation lag-time model¹⁷ attempts to explain this observation that can be detected both among angiosperms¹⁷ and in the Brassicaceae family^17,18. Since polyploidization per se is causing major disadvantages, e.g., during mitosis and meiosis, lineages subjected to WGD usually experience a subsequent phase of diploidization¹⁹ often accompanied by drastic genome reorganization and genome size reduction²⁰. These processes might be key to subsequent evolutionary success of respective clades evolving new traits and innovations. Some examples for key traits presumably originating from WGDs exist, such as synthesis of glucosinolates, secondary compounds involved in insect defense mechanisms in the order Brassicales, following At-β²¹ or evolution of the pentamerous flower in Pentapetalae triggering plant-pollinator coevolution following At-γ²². However, there are few studies systematically linking morphological diversity and its evolution with WGDs⁵.

The Brassicaceae are among the 15 largest angiosperm families, comprising almost 4000 species in 351 genera²³, including the model species Arabidopsis thaliana, Arabidopsis lyrata and Arabis alpina, as well as important crops, such as cabbage and rapeseed. A taxonomic system of 51 monophyletic tribes has been proposed to subdivide the family, and they can be organized into several major evolutionary lineages: The tribe Aethionemeae diverged from the rest of the family around 32 million years ago²⁴ (mya). The other 50 tribes are grouped into three²⁵, four²⁶, or five lineages^27,28, which started diversifying ~23 mya²⁴. The phylogenetic positions of these tribes and lineages are still unresolved, in part due to conflicting signals from nuclear and plastid data. Furthermore, potential undersampling of the ingroup may have contributed to difficulties in generating a fully resolved phylogeny-two recent studies included 55 species from 45 genera in 29 tribes²⁷, and 79 species from 72 genera in 50 tribes²⁸, often only sampling one representative per tribe and few species from other families of the order Capparales. Selection of genes for phylogenetic reconstructions following different strategies, unresolved reticulate evolutionary scenarios, and taxonomic inaccuracy, such as erroneous circumscription of tribe Stevenieae²⁸, may also have contributed to these inconsistencies.

The evolutionary history of Brassicaceae has been shaped by repeated cycles of WGDs, most notably At-α preceding the divergence of the family and the earlier At-β event within the Brassicales^11,21. In addition to these paleopolyploidizations, so-called mesopolyploidization events²⁹ have been discovered and confirmed in eleven out of the 51 Brassicaceae tribes^30,31, among them the agronomically important tribe Brassiceae, and explicitly shown to be absent from various other tribes^31,32 (Supplementary Table 1). Post-mesopolyploidization genomes are characterized by extensive diploidization through chromosomal rearrangements, genome size reduction and fractionation; however, duplicated genomic regions are still detectable^29,33. Interestingly, the occurrence of WGDs seems largely uncoupled from shifts in diversification rate that have been observed in nine tribes³⁴. Neopolyploidy is common in Brassicaceae as well, with polyploidy detected in 43% of species, and the percentage of neopolyploids per tribe is weakly correlated with species richness²⁴, indicating polyploidization is a continuously running process over evolutionary time scales in Brassicaceae.

Brassicaceae are known for parallel evolution of most morphological characters in most lineages. Therefore, character state reconstruction analyses have often failed to provide any evolutionary signature on the tribal or higher lineage level²⁷. This led to the idea that morphological characters in Brassicaceae vary strongly and are of little use for phylogenetic reconstruction. However, this has never been analyzed rigorously on the familial level using the entire morphospace needed to determine, identify, and distinguish genera taxonomically. Several studies have revealed extensive homoplasy and convergent evolution for nearly all morphological characters, such as fruit shape²⁷, flower characters³⁵, shape of leaf base²⁷, and leaf complexity^31,36, and even for complex traits such as life history¹⁸. There is no obvious morphological key character or innovation that evolved after mesopolyploidization and triggered subsequent diversification. This raises the question if and how WGDs act on disparity and diversification in a successful and constantly diversifying angiosperm family like Brassicaceae³⁴.

Here, we present a plastome phylogeny combined with a comprehensive morphological dataset of the entire Brassicaceae as well as a complete species checklist comprising all currently described 3977 species and 351 genera. This allows us to establish a phylogenetically and temporally resolved maternal evolutionary framework in which the herein presented, as well as future results, can be interpreted. We analyze if mean morphological disparity has increased after tribal and lineage specific WGDs, and we study whether early diverging disparity among main evolutionary lineages coincides with shifts of net diversification rates in the early evolutionary history of the family. We propose that WGDs contribute to morphological disparity, but do not necessarily trigger immediate clade diversification and radiation. Our study thus highlights the evolution of morphological diversity as a so far underexplored aspect of the general importance of WGDs for the evolutionary success of a lineage over more than 30 million years.

Results

The taxonomic and phylogenetic backbone concept

Past phylogenetic reconstructions of Brassicaceae have revealed incongruences between biparentally inherited nuclear and predominantly maternally inherited plastome marker based phylogenies, most likely due to rapid diversification, intertribal hybridization³⁷ and nuclear introgression³⁸. A recently published nuclear phylogeny provided substantial progress and comprises many of the tribes described in Brassicaceae³⁰. However, there is still no comprehensive phylogeny available on the tribal level that also considers all deep nodes within tribes. Furthermore, the lack of consistently estimated divergence times over the entire family restricts its potential use for phylogenetic comparative methods. Here, we rely on a plastome phylogeny, allowing us to estimate divergence times, extract age estimates, and conduct phylogenetically corrected downstream analyses. Furthermore, the use of the plastome, assessed via a genome skimming approach, enabled us to include samples from rare herbarium specimens. Our family-wide taxon sampling for the phylogenetic analysis covers all tribes of the Brassicaceae and aims at representing the deepest known phylogenetic split within each tribe. The presented maternal phylogeny includes 153 out of the 351 genera currently recognized in the Brassicaceae. The tree topology based on coding gene space of the plastome (Supplementary Fig. 1) is fully congruent with earlier plastome analyses encompassing a fraction of taxa (5–24% of Brassicaceae genera) compared to the taxon set included herein^24,28,39. Furthermore, our maternal phylogeny (Supplementary Fig. 1) is highly consistent with published phylogenetic-systematic studies in respect to taxonomic conclusions on the tribal and genus level (Supplementary Note 1). Two recent tribal level phylogenies of Brassicaceae based on genome-wide nuclear sequence data^27,28 are somewhat inconsistent with each other and also with our plastome phylogeny, mostly in respect to the deeper splits within the family. Consequently, we did not analyze our data using phylogenetic information and tree structure on genus level, but restricted conclusions on the level of higher order evolutionary lineages and comparing tribes directly with each other. Our phylogenetic analysis resulted in a well-supported maternal topology, which is summarized on the tribal level in Fig. 1 (detailed tree in Supplementary Fig. 1/2; a phylogenetic tree based on the nuclear encoded rDNA cistron, in contrast, is largely unresolved and shown in Supplementary Fig. 3, details are given in Supplementary Note 1). Topologies from maximum likelihood (ML) reconstruction and divergence time estimations are congruent, and robustness is demonstrated by high bootstrap support for almost all nodes in our RAxML analysis (Supplementary Fig. 1). As in previous studies, Brassicaceae can be grouped into an early branching lineage represented only by tribe Aethionemeae (Fig. 1), with a split time of ~30 my (Brassicaceae crown group age), followed by diversification of the major evolutionary lineages I, II, and III (following lineage assignment by Koch & Al-Shehbaz²⁵) 24–25 mya. This is consistent with time estimates based on data from both plastid and nuclear genomes^24,34,39,40. Additional subclades of evolutionary lineages were introduced based on nuclear genomic data²⁸, and we also analyzed our data using these phylogenetic concepts since we cannot resolve the topological conflict among phylogenetic trees.

**Fig. 1: Brassicaceae maternal timeline.**

Previously detected significant shifts in diversification rates³⁴ and mesopolyploidization events^30,31 are restricted to lineages I and II and are indicated as green triangles and yellow stars, respectively, in Fig. 1. Our sampling strategy allowed us to extract stem group ages for all 51 tribes and crown group ages of 24 tribes directly from the dated phylogeny. Correlation of these crown group ages with those obtained recently from tribal analyses based on nuclear encoded rDNA³⁴ was very high and significant (Spearman’s rank correlation coefficient 0.89, P-value < 0.001, see Supplementary Fig. 4). Thus, we used time estimates from that study to fill our missing data points and calculate the lag-phase between clade divergence and diversification for all tribes (Supplementary Table 2).

We compiled a comprehensive Brassicaceae species checklist including every accepted genus and species. This checklist comprises 3977 species from 351 genera, most of which could be assigned to one of the 51 tribes, with only eleven orphan genera, for which a reliable phylogenetic placement on the familial level was provided (Fig. 1, Supplementary Fig. 1/2). The detailed descriptions and taxonomic discussions are provided in Supplementary Note 2, and a species checklist providing accepted names is also accessible via BrassiBase (version 1.3; https://brassibase.cos.uni-heidelberg.de/). Our species and genus checklist, which provides accepted names for species and subspecies, incorporates changes to 860 previously accepted species names, including changes for more than 3500 synonyms (often wrongly assigned in the past), and resulted in a list with 15,365 updated data entries.

Morphological characters are homoplastic in Brassicaceae

Evaluation of morphological variation on the genus level and across the entire Brassicaceae family (Supplementary Note 3) resulted in an interactive key enabling researchers to determine all genera, available at BrassiBase (https://brassibase.cos.uni-heidelberg.de/?action=key). The underlying matrix of the key (38 characters, 166 character states) was optimized for statistical analyses (37 characters, 111 character states); we also provide a matrix compiled on tribal level. For clarity, characters were arbitrarily grouped into six categories (A–F) referring to life form, hairs, stem, leaves, flowers and inflorescences, and fruits and seeds, respectively (Fig. 2, Supplementary Methods).

**Fig. 2: Morphological characters and their disparity across tribes and lineages.**

To visualize differences in presence or absence of character states between main evolutionary lineages, we assessed pairwise differences in character state frequency. High-pairwise differences (Fig. 2a, Supplementary Table 3) denote character states predominantly found in one lineage, such as multicellular glandular hairs (B03), which are restricted to four of the seven tribes of lineage III. Other character states with high differences between lineage III and the rest of the family are decurrent and connivent stigma lobing in fruits (F30) or seed wings (F34). Characters with low differences, such as ‘sepal orientation’ (E18) or ‘hairs’ (B04), may represent a set for which the evolutionary potential of character state diversification either evolved between the split of the major lineages and the onset of their diversification or were lost. However, only few character states showed pairwise differences higher than 0.5, and none reached a pairwise difference of 1 (Fig. 2a, Supplementary Table 3).

Morphological disparity is not phylogenetically constrained

To assess the lineage- and tribe-specific potential for morphological diversity (despite the high degree of homoplasy) we analyzed morphological disparity. Rather than looking at individual characters or character states, disparity elaborates on the presence of possible character states within a defined taxonomical unit (in our case genera and tribes). Note that from here on, we will use the term disparity for the specific measure of disparity (observed character states/total character states). We did not detect significant differences in mean disparity among evolutionary lineages I, II, and III (Fig. 2c, Supplementary Fig. 5/6, Supplementary Tables 4/5), or using alternative lineage assignments^26,28 (Supplementary Fig. 6, Supplementary Tables 4, 6, and 7). Furthermore, mean tribal disparity was randomly distributed across the phylogeny; we detected no phylogenetic signal (Moran’s I). This can be interpreted as an absence of phylogenetic constraints on phenotypic variation and the increasing number of realized character states. The character with the highest disparity was duration (life cycle), where both character states (annual and perennial) were observed in most tribes, followed by basal leaves rosette forming, fruit orientation, fruit type, and seed wing. Lowest disparity was observed for characters typically fixed across the Brassicaceae with few exceptions, such as stamen number (Supplementary Table 3).

In contrast to overall disparity, we detected phylogenetic signal in disparity for six of the 37 characters analyzed (multicellular glandular hairs, hairs, sepal orientation, flower symmetry, stigma lobing in fruit, and seed wing; indicated in Fig. 2a, b; see Supplementary Fig. 7 and Supplementary Table 3), i.e., in these cases morphological disparity was phylogenetically clustered. All six characters had overall medium disparity values (0.54–0.79), as can be expected for characters displaying differences between clades. While detection of phylogenetic signal is usually used for characters such as morphological traits, where phylogenetic signal indicates non-random distribution of the respective character along the phylogeny, here phylogenetic signal indicates constraints to evolve or reduce morphological diversity in some clades. Five of these six characters also showed high pairwise differences in character state frequency, suggesting that those character states contribute to differences in disparity among evolutionary lineages (Supplementary Table 3).

As we detected differences in tribal level disparity in some characters, but no differences between mean tribal level disparity, we conducted a Discriminant Analysis of Principal Components (DAPC) of disparity with group priors according to the major lineages to assess the characters’ contribution to distinguish between lineages. The three lineages I, II and III were well-separated in the resulting scatterplot (Fig. 2d); however, splitting lineage II further following alternative lineage assignments^26,28 resulted in overlapping groups (Supplementary Fig. 8). The characters contributing most to separating the lineages mostly coincided with highest pairwise character state difference, phylogenetic signal, or both (Supplementary Table 3), and had medium levels of mean disparity.

Disparity is a product of a combination of processes

The direct result of diversification processes can be expressed as species or genus richness, and these two numbers may be considered as a good predictor for morphological disparity and realization of the phenotypic space. Accordingly, disparity of tribes (directly calculated from the morphomatrix) was correlated with both species and genus richness (Supplementary Table 8). Tribal crown group age, i.e. the time of diversification, as well as length of the lag-phase between stem and crown group age were also correlated with disparity; however, they were also correlated with species and genus richness, respectively, thus a combination of factors may contribute to disparity.

It has been debated whether polyploidization is associated with increased diversification rates⁴¹, and thus may coincide with higher morphological disparity. However, a direct association of polyploidization and increased net diversification has neither been shown for angiosperms⁴¹ nor for Brassicaceae³⁴ or within Brassicaceae (tribe Microlepidieae)⁴². Using phylogenetic ANOVA, we detected significantly higher disparity (p = 0.01) in tribes having undergone mesopolyploidization events predating their diversification^30,31 compared to other tribes (Fig. 3a). These tribes also had a higher number of genera (Supplementary Fig. 9, Supplementary Table 9). Interestingly, while shifts in diversification rates on the species level are not necessarily associated with mesopolyploidization in Brassicaceae³⁴, but rather with crown group age and number of species (Supplementary Fig. 10, Supplementary Table 9), disparity is also significantly elevated (p = 0.018) in tribes with rate shifts (Fig. 3a).

**Fig. 3: Disparity and diversification.**

To bridge previous studies investigating rate shifts within tribes³⁴ or in the context of all angiosperms¹⁶, we conducted a Bayesian Analysis of Macroevolutionary Mixtures (BAMM) on the genus level. We detected two shifts in genus-level diversification rate in the best shift set, both associated with increased net diversification (Fig. 3b). The first shift was located at the base of core Brassicaceae, just before lineage diversification ~25 mya. The second shift predated the split of tribe Brassiceae and the Thelypodieae/Sisymbrieae clade in lineage II 12.6 mya. Shifts at similar branches were detected in the other credible shift sets (Supplementary Fig. 11). The corresponding rate-through-time plot (Supplementary Fig. 12) revealed an increase in net diversification rate until ~16 mya, when many of the tribes originated. Stem and crown group ages of all the tribes are 15.21 ± 4.71 my (mean ± standard deviation) and 9.99 ± 4.77 my, respectively. Correlation analysis furthermore indicated that none of the shifts in speciation rate were significantly associated with genus-level disparity values of any of the 37 morphological characters or mean disparity (Supplementary Table 10).

Diversification rates have been estimated on the tribal level before³⁴. We tested whether speciation, extinction and net diversification were significantly different between Brassicaceae tribes having undergone WGDs, rate shifts, a combination of both or neither. As could be expected, net diversification rate was significantly higher in tribes with rate shifts compared to tribes with WGDs or without shifts and WGDs (Supplementary Fig. 13). This could be attributed to speciation rate (Supplementary Fig. 13), where the same significant differences were found. In contrast, extinction rates were not significantly different when corrected for phylogenetic signal, although tribes with WGDs had considerably lower extinction rates compared to those with rate shifts and without WGDs or shifts (0.21 ± 0.19 compared to 0.34 ± 0.38 and 0.32 ± 0.18).

Discussion

Our analysis of morphological characters and their disparity in Brassicaceae revealed mostly consistent patterns across lineages and traits. Characters with a high mean disparity occur in multiple states across the phylogeny, generally have a low contribution to lineage differentiation and low pairwise differences between lineages, and are devoid of phylogenetic signal. This agrees with previous molecular-phylogenetic studies showing that morphological characters are homoplastic. Characters with low disparity, where one or more of the character states are rare, show a similar pattern of low contribution to lineage differentiation, low pairwise differences and lack of phylogenetic signal. In contrast, those characters showing phylogenetic signal in disparity, high pairwise differences between character state frequencies and high contribution to lineage differentiation have medium levels of disparity. Key innovations after WGDs, such as the emergence of glucosinolates in Brassicales²¹ or the pentamerous flower in Pentapetalae²², have been hypothesized to play a major role for subsequent diversification of lineages affected by WGD⁴³. Within our extensive set of morphological characters in Brassicaceae, we find no evidence for any morphological character state being a key innovation after mesopolyploidizations, but rather recurrent occurrences of almost all character states and high levels of homoplasy throughout the entire family.

Consistent with the high levels of homoplasy in leaf shape (character D10), we did not detect phylogenetic signal or lineage differentiation in this character. The REDUCED COMPLEXITY (RCO) gene is responsible for the appearance of complex leaves in Brassicaceae³¹, and it has been shown that leaf shape can contribute to fitness in Brassicaceae⁴⁴. Interestingly, this gene originated through gene duplication after divergence from Aethionemeae³⁶, providing an example that also local gene duplication can provide the genomic substrate to evolve new traits. This gene has been repeatedly lost across the family³¹, leading to extensive homoplasy. Most trichome types likely evolved multiple times in Brassicaceae^45,46. The corresponding character in our study is B04 (hairs), in particular character states B04-3 and B04-4. Trichome disparity was not phylogenetically constrained but showed differentiation between lineage I and II. A single origin of malpighiaceous and stellate trichomes is also not supported by our and others’ data²⁷, which could be attributed to the complex genetic control of trichome development⁴⁷. Trichome types have not been studied in the context of family-wide diversification rates and species richness, and thougha selective advantage has been discussed⁴⁸, there may also be fitness costs depending on environmental conditions⁴⁹. In contrast, another trichome character (B03, multicellular glandular hairs) was indeed phylogenetically constrained, both regarding disparity and character occurrence. Only four of the seven tribes of lineage III exhibit multicellular glands, making this the only character state restricted to a single lineage. This pattern has been reported earlier²⁷, but interestingly our extensive sampling shows that both character states are consistently present at the tribal level and in part even at the genus level, indicating that instead of replacing one character state with another, the potential to exhibit the alternative character state evolved. However, the genetic basis of this trait has yet to be found.

In tribe Brassiceae, taxa with indehiscent fruits have higher net diversification and speciation rates, as well as larger seeds and diverse pericarp features, such as corkiness, hooks, barbs, and wings⁵⁰. Fruit opening in Brassiceae corresponds to our character F27, and we did not find a correlation between its disparity and speciation rates on the genus level (Supplementary Table 10). It is also not significantly phylogenetically constrained (although presence of indehiscent fruit shows a high difference between lineage I and II), but disparity is indeed high in tribe Brassiceae. This is one example that preceding WGD (triplication in Brassiceae⁵¹) might have increased disparity and net diversification rates. The additional rate shift in Brassiceae+Thelypodieae+Sisymbrieae may have further provided an additive effect and led to an increased speciation rate and high species richness.

Diversification rate upshifts have been intensively studied across angiosperms^12,16, and some correlation between WGDs and diversification rate upshifts was found, even with a cumulative effect of increased speciation rates with increased number of WGDs in a given lineage¹². Mean speciation rate in Brassicaceae is about 0.55 species/my³⁴, compared to a mean of 0.49 species/my across rosids¹². However, net diversification rates are highest in the clade containing Brassicaceae, Cleomaceae and Capparaceae compared to all other analyzed angiosperms¹⁶. An increase in speciation rate after WGDs, as had been observed in other lineages¹², was not detected within Brassicaceae. In our study, the speciation rate was significantly higher in tribes that underwent rate shifts than in tribes without shifts (Supplementary Fig. 13, Supplementary Tables 11–14), as could be expected, and this is also reflected in elevated net diversification rates. However, this pattern was not observed in tribes with mesopolyploidization events. Net diversification rates were lowest in tribes with WGDs, associated with low extinction rates after WGDs. A decrease in the extinction fraction was also shown in the Brassicaceae+Cleomaceae+Capparaceae clade following At-β in Brassicales¹². Nevertheless, there are no decreases in net diversification anywhere in Brassicaceae³⁴. The robustness of speciation and extinction rates is under debate surrounding the reliability of different widely used methods without further biological information and well-justified constraints⁵² and, therefore, more detailed conclusions are not drawn herein.

Angiosperms underwent on average 3–4 WGDs (on the level of orders and families) over the evolutionary history of land plants, spanning more than 440 my. The Rosids, which Brassicaceae belong to, have a mean number of 4 WGDs^8,12. Brassicaceae underwent not only the At-α WGD in their early evolution⁵³ ~35 mya, but also accumulated eleven additional tribe-specific WGDs (mesopolyploidizations) during their Miocene/Pliocene evolutionary history^30,31. In total 130 genera are potentially affected by one of the additional eleven mesopolyploidizations, resulting in a mean number of 1.45 WGDs per genus in 35 my. Brassicaceae therefore have a high rate of 0.41 WGD/10 my compared to the land plant average of 0.09 WGD/10 my. If we further consider that nearly 43% of Brassicaceae species are neopolyploids²⁴, which are not included in the calculation above, and assume an age of under one million years for the more recent neopolyploids, then a similarly high rate of among recurrent WGDs can also be inferred.

WGDs in angiosperms have been associated with shifts in diversification rate, although mostly after a considerable lag time, leading to the conclusion that WGDs promote, but are not sufficient to cause increased diversification. The generally high and increasing diversification rates have been attributed to the cumulative effect of these ancient WGDs¹⁶. We also do not observe a strict association of rate shifts with WGDs in Brassicaceae. The lag-time (time span between WGD and associated increased diversification¹⁷) ranges from 0.6 my to 49.2 my in angiosperms¹⁶. For the diversification after the At-β WGD event (85 mya²¹ including Brassicaceae with Cleomaceae and Capparaceae) a lag time of at least 36 my can be assumed. At-α was not analyzed, but our data provide a lag time of about 10 my (At-α WGD ~35 mya and core Brassicaceae rate shift ~25 mya). Within Brassicaceae, lag time is in a similar range for the two tribes with both events (12 my and 17 my between an assumed WGD at the base of the tribe and later diversification in Physarieae and Cochlearieae, respectively). This is considerably longer than the mean lag-phase (time span between tribal stem age and first lineage divergence) of about 5 my (Supplementary Figs. 9 and 10, Supplementary Table 2) and highlights the long-lasting impact of WGD on any other factor increasing diversification such as environmental change, migration or coevolution. Our study does not provide further data to explore combinatorial effects of WGD with other factors driving morphological disparity, and we can only summarize earlier findings that lineage diversification and differentiation in Brassicaceae is often associated with major environmental change^18,34,39.

The herein elaborated phylogenetic framework based on plastome data and the entire rDNA operon has been explored to denote species within tribes and identify those genera, which remain orphan taxa and are not assigned to any tribe. Main evolutionary lineages were defined accordingly. However, phylogenetic relationships among tribes should be regarded as a work in progress and may be further resolved in the future with comprehensive nuclear genome datasets^28,31 that allow for to the reconstruction of the early evolutionary history of any given tribe. Consequently, our conclusions refer to evolutionary patterns on the level of major evolutionary lineages. However, alternative concepts based on phylogenies derived from the nuclear genome, with additional lineages, were analyzed as well, and resulted in congruent patterns. Furthermore, our conclusions are not based on statistical tests that rely on a resolved genus-level phylogeny (disparity, lineage differences, DAPC). Those tests that use phylogenetic relationships among tribes (phylogenetic ANOVA, test for phylogenetic signal) may be affected if a future comprehensive nuclear genome phylogeny is used and may provide additional significant results. We were not able to reconstruct changes of morphological disparity over time and along lineages since this requires a fully resolved evolutionary history of Brassicaceae incorporating phylogenetic evidence from both plastid and nuclear genomes. This is also a prerequisite for future research studying past environmental parameters driving evolutionary patterns and processes.

We may hypothesize that our findings of high disparity throughout evolutionary times and constantly high net diversification also correspond to constantly evolving genomic blocks as demonstrated extensively in Brassicaceae⁵⁴ both by genomic and cytogenetic analyses^30,55. The most recent ancestral Brassicaceae genome model consisted of 22 genomic blocks^54,56. Fractionated blocks are interpreted as a consequence of the diploidization phenomenon in extant paleopolyploids⁵⁷, and evolutionary mobility of genomic blocks is set into a context of frequent polyploidy, diploidization, and block shuffling⁵⁵. This may facilitate future studies to link disparity and trait evolution utilizing entire genomic information⁵⁷, and thereby unravel the phenomenon of consistently high capacity for morphological diversification throughout evolutionary times resulting in a permanent gain and loss of character states. This will also pave the way for new approaches linking phenotypes with genomic sequence and genome context data to detect the underlying genetic causes, as has recently been shown for the genetic basis of leaf shape³¹. Based on our results we propose a model of nested WGDs and diversification shifts triggered by other factors, such as environmental change, which in concert increase morphological disparity.

Methods

A matrix of the family-wide morphospace

We generated a morphological character matrix intended to discriminate on genus level. This matrix was generated in two steps. First, the morphospace (characters and character states) used in species and genus descriptions and classical text book determination keys was extracted and incorporated into an ‘online interactive key’ to the genera of Brassicaceae (BrassiBase https://brassibase.cos.uni-heidelberg.de/; toolkit Interactive Key²³). Data can be fully explored in BrassiBase, and a visualization tool permits a summary on tribal level (BrassiBase; toolkit Morphology Tool²³). This key was further improved for downstream statistical analysis by adjusting character definition and character states (Supplementary Note 3). Finally, 37 characters with 111 character states were scored for all 351 genera. A detailed definition and description of characters and character states is provided in Supplementary Methods. The morphomatrix integrates data compiled on genus level and does not include individual species’ morphological diversity. Consequently, the morphomatrix represents multistate, unordered characters and character states, and genus (and above) level morphological variation. For downstream analysis, we also scored presence/absence of characters on the tribal level (tribal level morphomatrix).

To assess whether any character states showed differences in their presence/absence across lineages, we calculated the fraction of presence of character states for each lineage from the tribal level morphomatrix. High pairwise differences between lineages indicate character states, which are realized differentially and may be targeted in future studies.

DNA preparation and sequencing

We selected 178 samples from 176 Brassicaceae species for phylogenetic reconstruction with the aim of (i) covering all tribes, (ii) recovering the deepest split within each tribe, (iii) placing formerly unplaced genera, and (iv) including taxa of interest to the scientific community. Our sampling was further expanded with published data from Brassicaceae taxa and extended to the other families of Brassicales, where we also generated sequences for five new species, thereby covering almost all families. The entire taxon sampling including additional outgroup sequences from the Rosids is provided in Supplementary Data 1.

The Invisorb Spin Plant Mini Kit (STRATEC Biomedical) was used for DNA extractions from dried leaf material of all samples. Preparation and sequencing of total genomic DNA libraries were performed at the CellNetworks Deep Sequencing Core Facility (Heidelberg) with library insert sizes of 200–400 bp either on an Illumina HiSeq 2500 instrument in 100-bp paired-end mode or on a HiSeq 4000 instrument in 150-bp paired-end mode.

Plastid de novo assembly and analyses

Raw Illumina reads were filtered into high-quality (HQ) sequences using NUCLEAR 3.2.4 (GYDLE Inc., Québec, Canada) by trimming adapters and retaining segments of 50 or more consecutive bases with quality scores 20 or higher. Plastid assemblies were initiated by mapping HQ sequences to the A. thaliana plastid genome (ENA/GenBank accession NC_000932.1; [https://www.ncbi.nlm.nih.gov/nuccore/NC_000932.1]), using NUCLEAR parameters: -l 40 -s 16 -m 3 --min-score-cov 80, thus selecting fragments aligned over at least 80% of one of their sequence reads in High-scoring Segment Pairs (gapless local alignments) of 40 bases or more, with 16 consecutive identities and at most three mismatches every 40 bases (92.5% local similarity). The assemblies were then produced using VISION 2.6.12 (GYDLE Inc., Québec, Canada) by iterative resolution steps combining local fragment realignment, consensus resolution, gap filling, segmental reorganization, recruiting and remapping of HQ sequences, interactive edition and visualization. The assembler in VISION is always aware of each fragment’s mapping scores, therefore regions consistently connected and covered by perfectly aligned reads are not subject to misassembly influenced by additional divergent reads, such as those carrying sequencing errors or representing contaminants or insertions into the nuclear genome. The orientation of the small single copy (SSC) region relative to the large single copy (LSC) region was kept consistent between all samples, and the finishing process ensured the proper assembly of identical inverted repeats and their exact junctions to the LSC and SSC regions, with all assemblies starting at the first base of the LSC.

Plastid annotation was performed using a curated set of plastid gene features extracted from the A. thaliana plastid genome GenBank record. These features were aligned to the assemblies in their nucleotide form with NUCLEAR, CDS (coding sequences) were also aligned in their amino-acid form with BLASTX. Protein and nucleotide alignments were processed in order to adjust start and stop codons and detect pseudogenized genes due to missing exons or internal stop codons.

Alignment and phylogenetic analysis

Geneious 7.1.7 (Biomatters) was used for the extraction of plastid coding regions (excluding introns), which were subsequently aligned using the FFTNS-ix1000 algorithm in MAFFT 7.017⁵⁸ as implemented in Geneious, with the 200 PAM/k = 2 scoring matrix, a gap open penalty of 1.53, and an offset value of 0.123. Starting from the set of plastid genes used for phylogenetic reconstruction of Brassicaceae earlier²⁴, we selected genes based on available annotation in our extended taxon set. In general, only genes present in all taxa were considered for phylogenetic data analysis, yet in order to maximize available sequence information within Brassicaceae, two exceptions were made for genes rrn16 (alignment position 25,102–26,517, 1416 bp) and trnF-GAA (alignment position 28,456–28,519, 64 bp) as these genes were missing only in one outgroup sample respectively, namely in Tovaria pendula (rrn16) and Setchellanthus caeruleus (trnF-GAA) from the Capparales. Here, missing sequences were replaced by Ns. A total of 60 genes remained, including 43 protein-coding genes, 14 tRNAs and three rRNAs. Start and stop codons were removed from alignments of protein-coding genes and all alignments were realigned (settings as above). After manual inspection of all sequence alignments, Gblocks 0.91b⁵⁹ was used to exclude indels using a minimum block length of 2 bp and also saving non conserved blocks given the high sequence divergence in the dataset. The remaining data blocks were concatenated in Geneious resulting in a total alignment length of 29,126 bp with 12,187 variable sites (9326 thereof parsimony informative).

PartitionFinder 2.1.1⁶⁰ was used to determine the best-fit partitioning scheme in order to account for putative rate heterogeneity among the selected genes. Models implemented in BEAST were tested using the Bayesian Information Criterion (BIC) for model selection in a greedy search and branch lengths were allowed to be unlinked. A single partition was determined and GTR + G + I + X was identified as the best substitution model. The analysis was performed twice to confirm the resulting partitioning. Both analyses revealed the same best-fitting partitioning scheme, which was therefore used in the following analysis and is in congruence with earlier analyses³⁹.

RAxML 8.1.16⁶¹ was used to conduct an unpartitioned maximum likelihood (ML) phylogenetic tree reconstruction with a rapid bootstrap analysis (1000 replicates) and search for the best-scoring ML tree under the GTR + G + I + X model. Vitis vinifera was set as outgroup. The resulting phylogenetic tree was compared to a phylogeny based on the entire nuclear encoded rDNA operon (see Supplementary Methods for details).

Divergence time estimation and fossil calibration

In order to use the ML tree as a starting tree for divergence time estimation, the R package ape 3.1–4⁶² was used to make the ML tree ultrametric with node ages corresponding to time constraints using the ‘chronos’ function. Divergence time estimation was performed in BEAST 1.8.4⁶³ under an uncorrelated lognormal relaxed clock approach with estimated rates and Vitis vinifera set as outgroup. The BEAST analysis was performed following Hohmann et al.²⁴ with four fossil calibration points (Supplementary Table 16) using uniform distributions and a maximum age of 125 mya for all fossil constraints. The root age was bounded between 92 and 125 mya with a uniform distribution.

Twenty-eight independent MCMC runs were performed with 100,000,000 generations each and sampling parameters every 50,000 generations until the effective sampling size (ESS) of each parameter in the combined logfile exceeded 200 excluding the first 20% of generations as burn-in. We used LogCombiner 2.4.7⁶⁴ after removing burn-in to combine trees from the different runs. A maximum clade credibility tree was constructed from 21,038 trees in TreeAnnotator 2.4.7⁶⁴ and visualized using FigTree 1.4.1⁶³.

Morphological disparity analysis

Since the morphological variation (characters and character states) was defined and scored reflecting the entire discriminative morphospace of Brassicaceae, the same dataset was used to study morphological disparity in more detail. Morphological disparity, the ‘range or variance of morphological form across species or other taxa’² is here defined as the fraction of character states per character, which are realized on a given taxonomic level. Consequently, the disparity value for each character ranges from one divided by the number of character states (only a single character state realized) to one (all character states realized).

For downstream analyses, both genus and tribal level disparity was used, and we provide two different values for tribal level disparity. (i) Disparity was calculated directly from the tribal level morphomatrix (‘disparity direct’). (ii) Mean tribal disparity was calculated from genus-level disparity values (‘disparity from genera’). Direct disparity values are never lower than disparity from genera. Correlation of the two matrices was estimated using Spearman’s rank correlation coefficient in R. Additionally, we searched for phylogenetic autocorrelation using Moran’s I⁶⁵ within disparity for all 37 characters and mean disparity on tribal level (direct calculation) with the R package phylosignal 1.2.1⁶⁶ using a pruned tribal tree based on our BEAST results.

Analysis of diversification patterns

Patterns of diversification were assessed using Bayesian Analysis of Macroevolutionary Mixtures 2.5.0 (BAMM⁶⁷) on the genus level. We first pruned our dated phylogeny to include only one species per (monophyletic) genus within Brassicales. Multiple tips were retained in genera displaying paraphyly in the plastome BEAST phylogeny. Genera missing from the phylogeny were accounted for by using clade-specific sampling probabilities on a tribal level within Brassicaceae and, because the tribal system is specific to the more species-rich Brassicaceae, on a family level in all other Brassicales families. Number of genera for Brassicales families were extracted from the Angiosperm Phylogeny Website⁶⁸. Rate parameters were detected using the ‘setBAMMpriors’ function in the R package BAMMtools 2.1.6⁶⁹ from the input tree and total number of genera (425 including outgroups), and the prior for the expected number of shifts was set to 1.0. BAMM was then run for 10,000,000 MCMC generations with default MCMC settings, and convergence of runs was detected using the log-likelihood trace as well as determining effective sample size (ESS; > 200) of the log-likelihood and number of shifts after discarding the first 1,000,000 samples as burn-in using the R package coda 0.19–2⁷⁰. The credible shift set and best tree configuration were determined using the respective functions in BAMMtools.

Finally, we correlated disparity values (calculated on the genus level and for each of the 37 characters separately as well as for mean disparity) and speciation rates with the function ‘traitDependentBAMM’ implemented in BAMMtools using the spearman correlation coefficient and a two-tailed test. We subset the BAMM results using BAMMtools ‘subtreeBAMM’ to exclude outgroups, where no disparity values were available.

Disparity pattern across phylogenetic groups

In order to further explore the morphological disparity, a Discriminant Analysis of Principal Components (DAPC⁷¹) was carried out using the ‘dapc’ function implemented in the R package adegenet 2.0.1⁷². By maximizing variation between groups while minimizing variation within groups, the DAPC, although initially developed and applied for the analysis of genetic clusters, was utilized in order to reveal—if present—clustering patterns within the disparity data and to identify the morphological characters contributing most significantly to these patterns. The analysis was based on the direct estimate of tribal level disparity. Since DAPC is based on an a priori grouping of the dataset, we used the ‘find.clusters’ function to identify putative clusters within the disparity data, yet no optimal grouping solution could be revealed from the resulting BIC values. Therefore, two DAPCs were performed with the major lineages set as group priors, both with basal tribe Aethionemeae and without. For both DAPCs, cross-validation was performed using the ‘xvalDapc’ function (with 1000 replicates for each level of PC retention) in order to identify the optimal number of principal components (PCs) to retain based on the lowest root mean squared error associated with the predictive success.

Calculation of stem and crown group ages and lag time

The comprehensive plastome dataset included taxa from all tribes. Our initial taxon sampling aimed to cover the deepest split within every tribe to allow for comparison of this respective node age (crown group age) with the onset of the tribal lineage (stem group age). Since we sampled all tribes, all stem group ages could be extracted from plastome-based divergence time estimates (Supplementary Table 2, Supplementary Fig. 2). The plastome dataset furthermore allowed us to extract tribal crown group ages for 39 tribes, because (i) we included taxa representing the deepest node in the respective tribes and (ii) there is phylogenetic congruence with published data (see Supplementary Note 1). For estimation of the crown group age of tribe Camelineae we excluded Pseudoarabidopsis (see Supplementary Note 2). For tribes with one single genus only, such as Shehbazieae and Aphragmeae, there is no crown group age.

In order to generate an almost complete dataset for tribal crown group ages we extracted time estimates from an ITS-based analysis, which considered more than 2000 Brassicaceae species for crown group age estimation:³⁴ This study had calibrated individual ITS-based tribal trees (including a consistent set of outgroups among trees) with multiple plastome-based divergence time estimates²⁴. These divergence time estimates used for secondary calibration³⁴ are fully consistent with those we present here. We first compared our herein estimated crown group ages based on plastome sequences with those from Huang et al.³⁴. This direct comparison of crown group ages of tribes was possible in 24 cases and was used to calculate a linear regression and calibration line comparing plastome and ITS derived datasets (Supplementary Table 2, Supplementary Fig. 4). This regression line was used to extrapolate for missing crown group ages in our study to maximize (1) consistency among datasets and (2) work with a complete stem/crown group age dataset for further statistical tests. Since the regression between both studies was nearly perfect, we added missing crown group ages for 12 tribes from Huang et al.³⁴ to our dataset (Supplementary Table 2).

A group of tribes closely related to Cremolobeae (eight species) (namely Asteae, Scoliaxoneae, Eudemeae-with only 31 species in total) was not fully resolved according to their tribal assignment, neither in BEAST nor in ML analysis, indicating a polytomy (Supplementary Fig. 1/2). Therefore, we combined all four of them into one group and extracted one single crown group age for all of them. This close relationship has been discussed earlier⁷³.

Finally, we calculated three values related to the estimated divergence time for all tribes. (1) Mean stem group age, which is solely based on herein presented plastome data, (2) Mean crown group age, which is mostly based on herein presented plastome data and supplemented by estimates from Huang et al.³⁴ as described above, (3) ‘Lag-phase’, representing the time between respective stem and crown group ages and indicating a time span of evolutionary stasis (here with zero net diversification).

Association between tribal level data

Finally, we assessed the correlation of tribal level disparity, genome size, species/genus richness, and stem/crown group age as well as lag-phase with each other and their association with the presence of mesopolyploidization events^30,31, significant shifts in diversification rate detected in the ‘best shift configuration’³⁴ and with assignment to the major evolutionary lineages^25,26,28. Mesopolyploidization events were found at or near the origin of eleven tribes (excluding the WGD detected in Leavenworthia, Cardamineae, as it is restricted to this genus and not shared with the rest of the tribe)^30,31 and shown to be absent in at least twenty tribes³¹, most notably in all analyzed tribes of lineage III³². For a number of other tribes, low base chromosome numbers and genome sizes²⁴ are consistent with a lack of mesopolyploidization, while for few, neither data nor material for cytogenetic or sequence-based analyses is available (Supplementary Table 1). We also assessed differences in tribal speciation, extinction and net diversification rates³⁴ (shown in Supplementary Table 16) depending on occurrence of mesopolyploidization, rate shift, both, or neither on tribal level. Rate estimates calculated under a Bayesian framework using BAMM^67,69 were obtained from a recent Brassicaceae-wide study³⁴. Mean tribal genome size values were recalculated with up to date taxonomic information and tribal assignments based on previously published data²⁶, and genome size variation was calculated as the standard deviation of genome size estimates for a given tribe normalized by its mean while restricting to tribes with genome size data available for at least three taxa. For correlation of tribal level data, we used Phylogenetically Independent Contrasts⁷⁴ in R (‘pic’ from the package ape) to account for phylogenetic dependencies. To test whether tribal level data was significantly different between those tribes having undergone mesopolyploidization events or rate shifts, we used phylogenetic ANOVAs⁷⁵ with 1000 simulations and posthoc comparisons and Bonferroni correction using ‘phylANOVA’ from the R package phytools 0.6–60⁷⁶. The same analysis was conducted to test for differences of diversification rates; the detected rate shift was in Brassiceae+Thelypodieae+Sisymbrieae was considered here. Finally, to test for significant differences between the lineages, where data are per se highly phylogenetically dependent, we used Kruskal–Wallis rank-sum tests followed by pairwise Wilcoxon rank-sum tests with Bonferroni correction when a significant difference was detected.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data supporting the findings of this work are available within the paper and its Supplementary Information files. A reporting summary for this Article is available as a Supplementary Information file. The datasets generated and analyzed during the current study are available from the corresponding author upon request. The plastome sequences generated during the current study are available at ENA/GenBank under accession codes MK637648 -MK637830 (see Supplementary Data 1), and raw sequencing data under ENA/GenBank Bioproject PRJEB38700. The taxonomic and morphological datasets as well as alignments and the dated phylogeny are available at Dryad [https://doi.org/10.5061/dryad.fttdz08pt]. Source data are provided with this paper.

References

Foote, M. The evolution of morphological diversity. Annu. Rev. Ecol. Syst. 28, 129–152 (1997).
Article Google Scholar
Oyston, J. W., Hughes, M., Gerber, S. & Wills, M. A. Why should we investigate the morphological disparity of plant clades? Ann. Bot. 117, 859–879 (2016).
Article PubMed Google Scholar
Minelli, A. Species diversity vs. morphological disparity in the light of evolutionary developmental biology. Ann. Bot. 117, 781–794 (2016).
Article PubMed Google Scholar
Hughes, M., Gerber, S. & Wills, M. A. Clades reach highest morphological disparity early in their evolution. Proc. Natl Acad. Sci. USA 110, 13875–13879 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Clark, J. W. & Donoghue, P. C. J. Whole-genome duplication and plant macroevolution. Trends Plant Sci. 23, 933–945 (2018).
Article CAS PubMed Google Scholar
Li, F.-W. et al. Fern genomes elucidate land plant evolution and cyanobacterial symbioses. Nat. Plants 4, 460–472 (2018).
Article CAS PubMed PubMed Central Google Scholar
Barker, M. S., Arrigo, N., Baniaga, A. E., Li, Z. & Levin, D. A. On the relative abundance of autopolyploids and allopolyploids. N. Phytol. 210, 391–398 (2016).
Article Google Scholar
One Thousand Plant Transcriptomes Initiative. One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679–685 (2019).
Article CAS Google Scholar
Mayrose, I. et al. Recently formed polyploid plants diversify at lower rates. Science 333, 1257–1257 (2011).
Article ADS CAS PubMed Google Scholar
Amborella Genome Project. The Amborella genome and the evolution of flowering plants. Science 342, 1241089–1241089 (2013).
Article CAS Google Scholar
Bowers, J. E., Chapman, B. A., Rong, J. & Paterson, A. H. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422, 433–438 (2003).
Article ADS CAS PubMed Google Scholar
Landis, J. B. et al. Impact of whole-genome duplication events on diversification rates in angiosperms. Am. J. Bot. 105, 348–363 (2018).
Article PubMed Google Scholar
Lohaus, R. & Van de Peer, Y. Of dups and dinos: evolution at the K/Pg boundary. Curr. Opin. Plant Biol. 30, 62–69 (2016).
Article CAS PubMed Google Scholar
Novikova, P. Y., Hohmann, N. & Van de Peer, Y. Polyploid Arabidopsis species originated around recent glaciation maxima. Curr. Opin. Plant Biol. 42, 8–15 (2018).
Article PubMed Google Scholar
Hoffmeier, A. et al. A dead gene walking: convergent degeneration of a clade of MADS-box genes in crucifers. Mol. Biol. Evol. 35, 2618–2638 (2018).
CAS PubMed Google Scholar
Tank, D. C. et al. Nested radiations and the pulse of angiosperm diversification: increased diversification rates often follow whole genome duplications. N. Phytol. 207, 454–467 (2015).
Article Google Scholar
Schranz, M. E., Mohammadin, S. & Edger, P. P. Ancient whole genome duplications, novelty and diversification: the WGD Radiation Lag-Time Model. Curr. Opin. Plant Biol. 15, 147–153 (2012).
Article PubMed Google Scholar
Karl, R. & Koch, M. A. A world-wide perspective on crucifer speciation and evolution: phylogenetics, biogeography and trait evolution in tribe Arabideae. Ann. Bot. 112, 983–1001 (2013).
Article PubMed PubMed Central Google Scholar
Baduel, P., Bray, S., Vallejo-Marin, M., Kolář, F. & Yant, L. The “Polyploid Hop”: shifting challenges and opportunities over the evolutionary lifespan of genome duplications. Front. Ecol. Evol. 6, 117 (2018).
Article Google Scholar
Lysak, M. A., Koch, M. A., Beaulieu, J. M., Meister, A. & Leitch, I. J. The dynamic ups and downs of genome size evolution in Brassicaceae. Mol. Biol. Evol. 26, 85–98 (2009).
Article CAS PubMed Google Scholar
Edger, P. P. et al. The butterfly plant arms-race escalated by gene and genome duplications. Proc. Natl Acad. Sci. USA 112, 8362–8366 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Chanderbali, A. S., Berger, B. A., Howarth, D. G., Soltis, D. E. & Soltis, P. S. Evolution of floral diversity: genomics, genes and gamma. Philos. Trans. R. Soc. B 372, 20150509 (2017).
Article Google Scholar
Koch, M. A., German, D. A., Kiefer, M. & Franzke, A. Database taxonomics as key to modern plant biology. Trends Plant Sci. 23, 4–6 (2018).
Article CAS PubMed Google Scholar
Hohmann, N., Wolf, E. M., Lysak, M. A. & Koch, M. A. A time-calibrated road map of Brassicaceae species radiation and evolutionary history. Plant Cell 27, 2770–2784 (2015).
CAS PubMed PubMed Central Google Scholar
Koch, M. A. & Al-Shehbaz, I. A. in Biology and Breeding of Crucifers (ed. Gupta, S. K.) 1–19 (CRC Press, 2009).
Franzke, A., Lysak, M. A., Al-Shehbaz, I. A., Koch, M. A. & Mummenhoff, K. Cabbage family affairs: the evolutionary history of Brassicaceae. Trends Plant Sci. 16, 108–116 (2011).
Article CAS PubMed Google Scholar
Huang, C.-H. et al. Resolution of Brassicaceae phylogeny using nuclear genes uncovers nested radiations and supports convergent morphological evolution. Mol. Biol. Evol. 33, 394–412 (2016).
Article CAS PubMed Google Scholar
Nikolov, L. A. et al. Resolving the backbone of the Brassicaceae phylogeny for investigating trait diversity. N. Phytol. 222, 1638–1651 (2019).
Article Google Scholar
Mandáková, T., Joly, S., Krzywinski, M., Mummenhoff, K. & Lysak, M. A. Fast diploidization in close mesopolyploid relatives of Arabidopsis. Plant Cell 22, 2277–2290 (2010).
Article PubMed PubMed Central CAS Google Scholar
Mandáková, T., Li, Z., Barker, M. S. & Lysak, M. A. Diverse genome organization following 13 independent mesopolyploid events in Brassicaceae contrasts with convergent patterns of gene retention. Plant J. 91, 3–21 (2017).
Article PubMed CAS Google Scholar
Kiefer, C. et al. Interspecies association mapping links reduced CG to TG substitution rates to the loss of gene-body methylation. Nat. Plants 5, 846–855 (2019).
Article CAS PubMed Google Scholar
Mandáková, T., Hloušková, P., German, D. A. & Lysak, M. A. Monophyletic origin and evolution of the largest crucifer genomes. Plant Physiol. 174, 2062–2071 (2017).
Article PubMed PubMed Central CAS Google Scholar
Mandáková, T. & Lysak, M. A. Post-polyploid diploidization and diversification through dysploid changes. Curr. Opin. Plant Biol. 42, 55–65 (2018).
Article PubMed Google Scholar
Huang, X.-C., German, D. A. & Koch, M. A. Temporal patterns of diversification in Brassicaceae demonstrate decoupling of rate shifts and mesopolyploidization events. Ann. Bot. 125, 29–47 (2019).
Article CAS PubMed Central Google Scholar
Nikolov, L. A. Brassicaceae flowers: diversity amid uniformity. J. Exp. Bot. 70, 2623–2635 (2019).
Article CAS PubMed Google Scholar
Vlad, D. et al. Leaf shape evolution through duplication, regulatory diversification, and loss of a homeobox gene. Science 343, 780–783 (2014).
Article ADS CAS PubMed Google Scholar
Joly, S., Heenan, P. B. & Lockhart, P. J. A Pleistocene inter-tribal allopolyploidization event precedes the species radiation of Pachycladon (Brassicaceae) in New Zealand. Mol. Phylogenetics Evol. 51, 365–372 (2009).
Article CAS Google Scholar
Forsythe, E. S., Nelson, A. D. L. & Beilstein, M. A. Biased gene retention in the face of massive nuclear introgression obscures species relationships. Preprint at https://www.biorxiv.org/content/10.1101/197087v2 (2017).
Guo, X. et al. Plastome phylogeny and early diversification of Brassicaceae. BMC Genomics 18, e176 (2017).
Article CAS Google Scholar
Kagale, S. et al. Polyploid evolution of the Brassicaceae during the Cenozoic era. Plant Cell 26, 2777–2791 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wood, T. E. et al. The frequency of polyploid speciation in vascular plants. Proc. Natl Acad. Sci. USA 106, 13875–13879 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Mandáková, T. et al. Multispeed genome diploidization and diversification after an ancient allopolyploidization. Mol. Ecol. 26, 6445–6462 (2017).
Article PubMed CAS Google Scholar
Soltis, P. S. & Soltis, D. E. Ancient WGD events as drivers of key innovations in angiosperms. Curr. Opin. Plant Biol. 30, 159–165 (2016).
Article PubMed Google Scholar
Zhang, J. et al. Genome of plant Maca (Lepidium meyenii) illuminates genomic basis for high-altitude adaptation in the Central Andes. Mol. Plant 9, 1066–1077 (2016).
Article CAS PubMed Google Scholar
Beilstein, M. A., Al-Shehbaz, I. A. & Kellogg, E. A. Brassicaceae phylogeny and trichome evolution. Am. J. Bot. 93, 607–619 (2006).
Article CAS PubMed Google Scholar
Beilstein, M. A., Al-Shehbaz, I. A., Mathews, S. & Kellogg, E. A. Brassicaceae phylogeny inferred from phytochrome A and ndhF sequence data: tribes and trichomes revisited. Am. J. Bot. 95, 1307–1327 (2008).
Article CAS PubMed Google Scholar
Doroshkov, A. V., Konstantinov, D. K., Afonnikov, D. A. & Gunbin, K. V. The evolution of gene regulatory networks controlling Arabidopsis thaliana L. trichome development. BMC Plant Biol. 19, 53 (2019).
Article PubMed PubMed Central Google Scholar
Sato, Y., Shimizu-Inatsugi, R., Yamazaki, M., Shimizu, K. K. & Nagano, A. J. Plant trichomes and a single gene GLABRA1 contribute to insect community composition on field-grown Arabidopsis thaliana. BMC Plant Biol. 19, 163 (2019).
Article PubMed PubMed Central Google Scholar
Sletvold, N., Huttunen, P., Handley, R., Kärkkäinen, K. & Ågren, J. Cost of trichome production and resistance to a specialist insect herbivore in Arabidopsis lyrata. Evol. Ecol. 24, 1307–1319 (2010).
Article Google Scholar
Willis, C. G., Hall, J. C., Rubio de Casas, R., Wang, T. Y. & Donohue, K. Diversification and the evolution of dispersal ability in the tribe Brassiceae (Brassicaceae). Ann. Bot. 114, 1675–1686 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lysak, M. A. Chromosome triplication found across the tribe Brassiceae. Genome Res. 15, 516–525 (2005).
Article CAS PubMed PubMed Central Google Scholar
Louca, S. & Pennell, M. W. Extant timetrees are consistent with a myriad of diversification histories. Nature 580, 502–505 (2020).
Article ADS CAS PubMed Google Scholar
Haudry, A. et al. An atlas of over 90,000 conserved noncoding sequences provides insight into crucifer regulatory regions. Nat. Genet. 45, 891–898 (2013).
Article CAS PubMed Google Scholar
Schranz, M., Lysak, M. & Mitchellolds, T. The ABC’s of comparative genomics in the Brassicaceae: building blocks of crucifer genomes. Trends Plant Sci. 11, 535–542 (2006).
Article CAS PubMed Google Scholar
Lysak, M. A., Mandáková, T. & Schranz, M. E. Comparative paleogenomics of crucifers: ancestral genomic blocks revisited. Curr. Opin. Plant Biol. 30, 108–115 (2016).
Article PubMed Google Scholar
Hu, T. T. et al. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat. Genet. 43, 476–481 (2011).
Article PubMed PubMed Central CAS Google Scholar
Murat, F. et al. Understanding Brassicaceae evolution through ancestral genome reconstruction. Genome Biol. 16, 262 (2015).
Article PubMed PubMed Central CAS Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000).
Article CAS PubMed Google Scholar
Lanfear, R., Calcott, B., Kainer, D., Mayer, C. & Stamatakis, A. Selecting optimal partitioning schemes for phylogenomic datasets. BMC Evol. Biol. 14, 82 (2014).
Article PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290 (2004).
Article CAS PubMed Google Scholar
Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bouckaert, R. et al. BEAST 2: a software platform for bayesian evolutionary analysis. PLoS Comput. Biol. 10, e1003537 (2014).
Article PubMed PubMed Central CAS Google Scholar
Gittleman, J. L. & Kot, M. Adaptation: statistics and a null model for estimating phylogenetic effects. Syst. Biol. 39, 227–241 (1990).
Google Scholar
Keck, F., Rimet, F., Bouchez, A. & Franc, A. phylosignal: an R package to measure, test, and explore the phylogenetic signal. Ecol. Evol. 6, 2774–2780 (2016).
Article PubMed PubMed Central Google Scholar
Rabosky, D. L. Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees. PLoS ONE 9, e89543 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Stevens, P. F. Angiosperm Phylogeny Website. Angiosperm Phylogeny Website. Version 14, July 2017. http://www.mobot.org/MOBOT/Research/APweb/ (2019).
Rabosky, D. L. et al. BAMMtools: an R package for the analysis of evolutionary dynamics on phylogenetic trees. Methods Ecol. Evol. 5, 701–707 (2014).
Article Google Scholar
Plummer, M., Best, N., Cowles, K. & Vines, K. CODA: convergence diagnosis and output analysis for MCMC. R. News. 6, 7–11 (2006).
Google Scholar
Jombart, T., Devillard, S. & Balloux, F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 11, 94 (2010).
Article PubMed PubMed Central Google Scholar
Jombart, T. & Ahmed, I. adegenet 1.3-1: new tools for the analysis of genome-wide SNP data. Bioinformatics 27, 3070–3071 (2011).
Article CAS PubMed PubMed Central Google Scholar
Salariato, D. L., Zuloaga, F. O., Franzke, A., Mummenhoff, K. & Al-Shehbaz, I. A. Diversification patterns in the CES clade (Brassicaceae tribes Cremolobeae, Eudemeae, Schizopetaleae) in Andean South America. Botanical J. Linn. Soc. 181, 543–566 (2016).
Article Google Scholar
Felsenstein, J. Phylogenies and the comparative method. Am. Naturalist 125, 1–15 (1985).
Article Google Scholar
Garland, T., Janis, C. M. & Jones, J. A. Phylogenetic analysis of covariance by computer simulation. Syst. Biol. 42, 28 (1993).
Article Google Scholar
Revell, L. J. phytools: an R package for phylogenetic comparative biology (and other things): phytools: R package. Methods Ecol. Evol. 3, 217–223 (2012).
Article Google Scholar

Download references

Acknowledgements

We thank our gardeners Thorsten Jakob, Bärbel Schwarz and Frank Korn for excellent plant curation. Peter Sack, Lisa Kretz and Lua Lopez are acknowledged for lab assistance as is David Ibberson at Heidelberg Deep Sequencing Core facility. We are grateful to Korbinian Schneeberger and Eric Schranz for helpful comments on the manuscript, and we are in particular grateful to Ihsan Al-Shehbaz sharing unpublished earlier work with us. This work has been supported by German Research foundation (DFG) KO2302/13 and KO2302/23 (M.A.K.).

Author information

Dmitry A. German
Present address: South-Siberian Botanical Garden, Altai State University, Lenina Ave. 61, 656049, Barnaul, Russia
Xiao-Chen Huang
Present address: School of Life Sciences, Nanchang University, 330031, Nanchang, China
These authors contributed equally: Nora Walden, Dmitry A. German, Eva M. Wolf.

Authors and Affiliations

Centre for Organismal Studies, University of Heidelberg, Im Neuenheimer Feld 345, 69120, Heidelberg, Germany
Nora Walden, Dmitry A. German, Eva M. Wolf, Markus Kiefer, Philippe Rigault, Xiao-Chen Huang, Christiane Kiefer, Andreas Franzke & Marcus A. Koch
GYDLE, 1135 Grande Allée Ouest, Québec, QC, G1S 1E7, Canada
Philippe Rigault
Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
Roswitha Schmickl
Department of Biology, Systematic Botany, University of Osnabrück, Barbarastraße 11, 49076, Osnabrück, Germany
Barbara Neuffer & Klaus Mummenhoff

Authors

Nora Walden
View author publications
You can also search for this author in PubMed Google Scholar
Dmitry A. German
View author publications
You can also search for this author in PubMed Google Scholar
Eva M. Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Markus Kiefer
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Rigault
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Chen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Kiefer
View author publications
You can also search for this author in PubMed Google Scholar
Roswitha Schmickl
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Franzke
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Neuffer
View author publications
You can also search for this author in PubMed Google Scholar
Klaus Mummenhoff
View author publications
You can also search for this author in PubMed Google Scholar
Marcus A. Koch
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.A.K. conceived and designed the study. D.A.G., E.M.W., and M.A.K. performed experiments and collected data. N.W., M.A.K., E.M.W., P.R., M.K., X.H., R.S., A.F., and C.K. analyzed data. B.N. and K.M. contributed to designing of taxon selection and provided plant material. M.A.K. and N.W. wrote the manuscript. All authors contributed to the final version of the manuscript.

Corresponding author

Correspondence to Marcus A. Koch.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Alessandro Minelli, Christian Parisod and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Walden, N., German, D.A., Wolf, E.M. et al. Nested whole-genome duplications coincide with diversification and high morphological disparity in Brassicaceae. Nat Commun 11, 3795 (2020). https://doi.org/10.1038/s41467-020-17605-7

Download citation

Received: 02 December 2019
Accepted: 09 July 2020
Published: 30 July 2020
DOI: https://doi.org/10.1038/s41467-020-17605-7

This article is cited by

Intergrative metabolomic and transcriptomic analyses reveal the potential regulatory mechanism of unique dihydroxy fatty acid biosynthesis in the seeds of an industrial oilseed crop Orychophragmus violaceus
- Changfu Jia
- Qiang Lai
- Jing Wang
BMC Genomics (2024)
The evolution of ephemeral flora in Xinjiang, China: insights from plastid phylogenomic analyses of Brassicaceae
- Tian-Wen Xiao
- Feng Song
- Xue-Jun Ge
BMC Plant Biology (2024)
The uneven distribution of refugial endemics across the European Alps suggests a threefold role of climate in speciation of refugial populations
- Joachim W. Kadereit
Alpine Botany (2024)
Remarkable mitochondrial genome heterogeneity in Meniocus linifolius (Brassicaceae)
- Jie Liu
- Jin-Yong Hu
- De-Zhu Li
Plant Cell Reports (2024)
Evolution of phenotypic disparity in the plant kingdom
- James W. Clark
- Alexander J. Hetherington
- Philip C. J. Donoghue
Nature Plants (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.