Short Communication
The phylogeny of the BEP clade in grasses revisited: Evidence from the whole-genome sequences of chloroplasts

https://doi.org/10.1016/j.ympev.2011.10.019Get rights and content

Abstract

Despite the considerable efforts to reconstruct the phylogeny of grasses, the relationships among the subfamilies Bambusoideae, Pooideae and Ehrhartoideae in the BEP clade remain unresolved. Here we completely sequenced three chloroplast genomes of representative species from Bambusoideae and Ehrhartoideae and obtained 19 additional chloroplast genome sequences of other grasses from GenBank. Using sequences of 76 chloroplast protein-coding genes from the 22 grass species, we fully resolved the phylogeny of the BEP clade. Our results strongly supported the (B,P)E hypothesis, i.e., Bambusoideae and Pooideae are more closely related than Ehrhartoideae. This result was not biased by systematic or sampling errors and was impervious to phylogenetic methods or model specification. The divergence time estimate suggests that the initial diversification of the BEP clade into three subfamilies happened within a short time period (∼4 MY). The presence of these short internal branches may explain the inability of previous studies to achieve a confident resolution of the BEP clade. The combination of the sequences of the entire chloroplast genomes provided sufficient phylogenetic information to resolve the BEP phylogeny fully. These results provide a valuable evolutionary framework for comparative and functional genomic studies using the grass family as a model system.

Highlights

► A fully resolved phylogeny of the major lineages of grasses was obtained. ► The (B,P)E hypotheses was strongly supported. ► The diversification of the BEP clade into three subfamilies happened within ∼4 MY. ► The short internal branches in BEP clade caused incongruent phylogenies in studies.

Introduction

The grass family (Poaceae) is one of the most diverse angiosperm families and consists of approximately 700 genera and more than 10,000 species (Clayton and Renvoize, 1986, GPWG (Grass Phylogeny Working Group), 2001). The grasses not only include many economically important crops, such as rice (Oryza sativa), corn (Zea mays), wheat (Triticum aestivum) and sorghum (Sorghum bicolor), but also dominate various natural and agricultural landscapes of the world (Clayton and Renvoize, 1986). With the accomplished and ongoing genome sequencing of many cereal crops, grass species have become a model system for functional and comparative genomic research.

In the past decades, the phylogenetic relationships of Poaceae have been the focus of extensive studies. An evolutionary framework has been thoroughly established at the family level (Clark et al., 1995, GPWG (Grass Phylogeny Working Group), 2001, Duvall et al., 2007, Bouchenak-Khelladi et al., 2008, Vicentini et al., 2008, Kellogg, 2009). To date, two major clades within the family have been identified, with one comprising the three subfamilies Bambusoideae, Ehrhartoideae, and Pooideae (BEP clade) and the other consisting of the seven subfamilies Panicoideae, Arundinoideae, Chloridoideae, Centothecoideae, Micrairoideae, Aristidoideae, and Danthonioideae (PACCMAD clade) (GPWG (Grass Phylogeny Working Group), 2001, Duvall et al., 2007, Bouchenak-Khelladi et al., 2008, Kellogg, 2009). However, the phylogenetic relationships within these clades have not been fully resolved (GPWG (Grass Phylogeny Working Group), 2001, Bouchenak-Khelladi et al., 2008, Kellogg, 2009). Using the plastid ndhF sequences, Clark et al. (1995) first recovered a tree with two major groups (the PACC and BOP clades) and found that bambusoid (i.e., Bambusoideae) and oryzoid (i.e., Ehrhartoideae) clustered together and were sister to pooid (i.e., Pooideae). The (B,E)P relationship, with Bambusoideae and Ehrhartoideae being more closely related than Pooideae, was confirmed by subsequent studies (GPWG (Grass Phylogeny Working Group), 2001, Vicentini et al., 2008, Sungkaew et al., 2009). Nevertheless, many other studies have revealed either a sister relationship of Bambusoideae and Pooideae, (B,P)E (Zhang, 2000, Bouchenak-Khelladi et al., 2008, Leseberg and Duvall, 2009, Peng et al., 2010) or a sister relationship of Ehrhartoideae and Pooideae, (E,P)B (Mason-Gamer et al., 1998, Mathews et al., 2000). The problem of the relationship among the BEP lineages is similar to the well-known “trichotomy problem” involving chimpanzee, gorilla, and human (Satta et al., 2000) in which all three possible relationships have been recovered in various studies.

Apart from the inconsistent phylogenies within the BEP clade, it is noteworthy that the different topologies were generated if different sampling strategies (gene and taxon) were employed and if different methods of analysis were used. For instance, based on sequences of 43 orthologous loci selected from 10,608 putative full-length cDNA sequences of bamboos, Peng et al. (2010) obtained different phylogenetic relationships within the BEP clade using different phylogenetic methods. Ehrhartoideae was the basal lineage on the maximum likelihood and Bayesian trees, i.e., (B,P)E. In contrast, Pooideae was the basal lineage on the Neighbor-Joining tree, i.e., (B,E)P. Therefore, despite considerable efforts to reconstruct the phylogeny of the grasses, the BEP relationship remains unclear.

The cpDNA sequences are increasingly used for resolving the deep phylogeny of plants because of their low rates of nucleotide substitutions and structural changes (Soltis et al., 2004, Jansen et al., 2007, Moore et al., 2010). Concatenating sequences from many genes may overcome the problem of multiple substitutions that results in a loss of phylogenetic information between chloroplast lineages and can reduce sampling errors due to substitutional noise (Delsuc et al., 2005). Here, we sequenced the complete chloroplast genomes of three grass species and obtained 19 additional chloroplast genomes of grasses from GenBank. Using sequences of 76 plastid protein-coding genes (70,700 bp in total) that are derived from the complete plastid genomes of the 22 grass species, we were able to resolve the phylogenetic relationships of the BEP clade fully. Based on the divergence time estimates, we found that two rapid speciation events within the BEP clade occurred in a small time interval (roughly 4 million years). These events might be responsible for the inconsistent phylogenetic reconstructions that have been reported in previous studies.

Section snippets

Complete plastid genome sequence of three grass species

Using the Fosmid library and PCR amplification methods, we completely sequenced three plastomes, including two species (Leersia tisserantii and Rhynchoryza subulata) from the subfamily Ehrhartoideae and one species (Phyllostachys propinqua) from Bambusoideae. Fresh leaves of the three species were collected from plants grown in the greenhouse or garden of the Institute of Botany of the Chinese Academy of Sciences in Beijing. The seeds of L. tisserantii and R. subulata were requested from the

Results

We have sequenced the chloroplast genomes of two species in Ehrhartoideae (L. tisserantii and R. subulata) and one species in Bambusoideae (P. propinqua). The genome sizes were 136,550 bp, 136,303 bp and 139,704 bp, respectively (Table 1). The genomes include a pair of inverted repeats (IRs) with a length of 21,329 bp (L. tisserantii), 20,840 bp (R. subulata) and 21,800 bp (P. propinqua), separated by a small single-copy region of 12,027 bp (L. tisserantii), 12,594 bp (R. subulata) and 12,877 bp (P.

Discussion

Despite extensive investigations of the phylogeny of the grass family, the phylogenetic relationships of the BEP clade remain unresolved. All three possible relationships among the three subfamilies have been suggested by a variety of previous studies. The inconsistent phylogenies might be attributed to a relatively small number of phylogenetic markers used in these studies. Because different genes/fragments might have evolved at different rates or possess different histories (Delsuc et al.,

Acknowledgments

We thank Dr. Wen-Li Chen for providing samples and taxonomic identification. We are grateful to Fu-Min Zhang, Hai-Fei Zhou, and Hai-Qin Sun for technical assistance and discussions and to Tao Sang for reading and improving the manuscript. We are also grateful to the International Rice Research Institute (Los Banos, Philippines) for providing seed samples. This work was supported by the National Natural Science Foundation of China (30990240).

References (35)

  • R.C. Edgar

    MUSCLE, multiple sequence alignment with high accuracy and high throughput

    Nucl. Acids Res.

    (2004)
  • GPWG (Grass Phylogeny Working Group)

    Phylogeny and subfamilial classification of the grasses (Poaceae)

    Ann. Mol. Bot. Gard.

    (2001)
  • S. Guindon et al.

    A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood

    Syst. Biol.

    (2003)
  • R.K. Jansen et al.

    Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns

    Proc. Natl. Acad. Sci. USA

    (2007)
  • E. Kellogg

    The evolutionary history of ehrhartoideae, Oryzeae, and Oryza

    Rice

    (2009)
  • C.H. Leseberg et al.

    The complete chloroplast genome of Coix lachyrma-jobi and a comparative molecular evolutionary analysis of plastomes in cereals

    J. Mol. Evol.

    (2009)
  • R.J. Mason-Gamer et al.

    Granule-Bound starch synthase: structure, function, and phylogenetic utility

    Mol. Biol. Evol.

    (1998)
  • Cited by (0)

    View full text