Comparison of Four Complete Chloroplast Genomes of Medicinal and Ornamental Meconopsis Species: Genome Organization and Species Discrimination

Li, Xiaoxue; Tan, Wei; Sun, Jiqi; Du, Junhua; Zheng, Chenguang; Tian, Xiaoxuan; Zheng, Min; Xiang, Beibei; Wang, Yong

doi:10.1038/s41598-019-47008-8

Download PDF

Article
Open access
Published: 22 July 2019

Comparison of Four Complete Chloroplast Genomes of Medicinal and Ornamental Meconopsis Species: Genome Organization and Species Discrimination

Xiaoxue Li¹^na1,
Wei Tan²^na1,
Jiqi Sun¹,
Junhua Du³,
Chenguang Zheng¹,
Xiaoxuan Tian²,
Min Zheng¹,
Beibei Xiang⁴ &
…
Yong Wang¹

Scientific Reports volume 9, Article number: 10567 (2019) Cite this article

3860 Accesses
31 Citations
Metrics details

Subjects

An Author Correction to this article was published on 17 October 2019

This article has been updated

Abstract

High-throughput sequencing of chloroplast genomes has been used to gain insight into the evolutionary relationships of plant species. In this study, we sequenced the complete chloroplast genomes of four species in the Meconopsis genus: M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. These plants grow in the wild and are recognized as having important medicinal and ornamental applications. The sequencing results showed that the size of the Meconopsis chloroplast genome ranges from 151864 to 153816 bp. A total of 127 genes comprising 90 protein-coding genes, 37 tRNA genes and 8 rRNA genes were observed in all four chloroplast genomes. Comparative analysis of the four chloroplast genomes revealed five hotspot regions (matK, rpoC2, petA, ndhF, and ycf1), which could potentially be used as unique molecular markers for species identification. In addition, the ycf1 gene may also be used as an effective molecular marker to distinguish Papaveraceae and determine the evolutionary relationships among plant species in the Papaveraceae family. Futhermore, these four genomes can provide valuable genetic information for other related studies.

The complete chloroplast genome of Gleditsia sinensis and Gleditsia japonica: genome organization, comparative analysis, and development of taxon specific DNA mini-barcodes

Article Open access 01 October 2020

Comparative and phylogenetic analysis of the complete chloroplast genome sequences of Allium mongolicum

Article Open access 15 December 2022

Evolutionary and phylogenetic aspects of the chloroplast genome of Chaenomeles species

Article Open access 10 July 2020

Introduction

The genus Meconopsis belongs to the Papaveraceae family of herb angiosperms and comprises approximately 49 species, 38 of which are found in China¹. These plants are mainly distributed in the Himalayan foothills at an elevation of 2500–5500 m and are widely used in Tibetan folk medicine in China². Detailed records of the medicinal usage of these plants have been written in the famous classic works on traditional Tibetan medicine, such as Jingzhu Materia Medica, Yue Wang Yao Zhen, and Four Medical Codes³. Recently, many kinds of isoquinoline alkaloids have been isolated from plants of the Meconopsis genus, and some have shown bioactivity, such as anti-inflammatory and analgesic activities⁴. Plants in this genus are also well known for their ornamental flowers and are widely used in horticultural gardening, with names such as fairy grass and Himalayan poppy. These plants are iconic in Tibet and Yunnan and play a significant role in the local Tibetan economy, as they are among the top ten ornamental flowering plants in the region². Howere, overexploitation and anthropogenic habitat destruction are increasingly threatening the survival of many wild Meconopsis species. Meconopsis punicea has been listed as an endangered species on the China Species Red List⁵.

To understand the evolutionary relationships of plant species in the Meconopsis genus and in the Papaveraceae family, it is important to obtain genetic information or molecular markers of individual species. This “barcode” can also aid in medicinal usage, for which the accurate identification of species is required, as the regions and sources of species are often complex or unknown^6,7,8 and can affect the efficacy of the final medicinal product.

Recent chloroplast genomic research has provided large quantities of data that are useful for selecting pertinent markers to resolve obscure phylogenetic relationships in seed plants⁹. At present, nearly 3000 complete chloroplast genomes are available in the NCBI database (https://www.ncbi.nlm.nih.gov/genomes/GenomesGroup.cgi?taxid=2759&opt=plastid)¹⁰. However, there is only one sequence from the chloroplast DNA of Meconopsis species in GenBank¹¹.

In this study, we sequenced and assembled the chloroplast genomes of four Meconopsis species using a next-generation sequencing platform. We report the assembly, annotation and analysis of the chloroplast genomes of Meconopsis racemosa, Meconopsis integrifolia (Maxim.) Franch, Meconopsis horridula and Meconopsis punicea. We also constructed phylogenetic trees to perform comparisons among chloroplast genomes published for other plant species in related families. This study expands our understanding of the diversity of chloroplast genomes of Meconopsis species and their evolutionary relationships and provides fundamental data for the genetic engineering of Meconopsis chloroplasts.

Results and Discussion

Chloroplast genome sequencing, assembly and validation

Using the Illumina HiSeq 2000 system, we sequenced the complete chloroplast genomes of four Meconopsis species, M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. Raw data were generated with an average read length of 150 bp. The complete sequences of the four chloroplast genomes were assembled by both de novo and reference-based assembly. Gaps were validated using PCR-based sequencing with one primer pair (Supplementary Table 1). The final high-quality chloroplast genome sequences were submitted to GenBank (Accession Numbers: M. racemosa, MK533649; M. integrifolia (Maxim.) Franch, MK533647; M. horridula, MK533646; M. punicea, MK533648), and the corresponding genome maps are shown in Fig. 1.

Chloroplast genome structural features and gene content

It was previously reported that the chloroplast genomes of angiosperms are conserved in their genomic structure in terms of gene number and order, although IR expansion or contraction occur frequently^12,13. The Meconopsis chloroplast genomes are in accordance with this observation, and their genome structures are similar to those of other Papaveraceae species¹⁴. All of the Meconopsis chloroplast genomes display the typical quadripartite structure of angiosperm cpDNA, which consists of a pair of IR regions (51306–51988 bp) separated by an LSC region (82809–83982 bp) and an SSC region (17729–17898 bp). These four chloroplast genomes are highly conserved in gene content, gene order, and intron number. The Meconopsis chloroplast genomes harbor 127 genes, 90 coding proteins, 37 coding tRNAs and 8 coding rRNAs. Some genes are duplicated in the IR region, among which ten are protein-coding genes (rpl2, rpl12, rps12, rps15, rps16, rps19, ndhB, ycf1, ycf15 and ycf2), four are ribosomal RNA genes (rrn4.5, rrn5, rrn16, rrn23) and six are transfer RNA genes (trnL-CAA, trnN-GUU, trnR-ACG, trnA-UGC, trnI-GAU and trnV-GAC) (Table 1). Fifteen protein-coding genes (petB, petD, ndhA, ndhB, atpF, rps12, rps15, rps16, rps19, rpl2, rpl12, rpl16, rpoC1, clpP, and ycf3) contain one or more introns. The A content ranged from 30.4 to 30.5%, the C content ranged from 19.7 to 19.8%, the G content ranged from 18.8 to 19%, the T content ranged from 30.8 to 31%, and the GC content ranged from 38.5 to 38.8%, indicating nearly identical levels among the four Meconopsis chloroplast genomes (Table 2).

Table 1 Summary of assembly data for the Meconopsis chloroplast genome.

Full size table

Table 2 Chloroplast genome gene content and functional classification in M.

Full size table

Amino acid abundance and codon usage

Codon usage plays an important role in shaping chloroplast genome evolution. Mutational bias has been reported to have an essential role in this process¹⁵. As shown in Supplementary Tables 2–5, the 90 protein-coding genes are encoded by 26338, 26365, 26342 and 26337 codons in the chloroplast genomes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea, respectively. Leucine (11.1–9.5%) was the most abundant amino acid among the proteins encoded by the chloroplast genes. Cysteine (1.2–1.7%) was the least abundant amino acid in the proteins encoded by chloroplast genes in the M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea chloroplast genomes. Leucine and isoleucine are the most commonly observed amino acids in the proteins of chloroplast genomes of angioperms¹⁶.

We calculated and summarized the codon usage of the chloroplast genomes in these four plants (Fig. 2). The codon UUA, for leucine, occurred at the highest proportion in all four species (27.1–30.3%). There were a total of 711 codons encoding tRNA genes in the M. racemosa, M. integrifolia (Maxim.) Franch and M. horridula chloroplast genomes, but only 704 codons in the tRNA-encoding genes in M. punicea (Supplementary Tables 2–5), indicating that codons ending in U and A were common; perhaps the variation in the tRNA-encoding genes is related to species evolution.

We also calculated the relative synonymous codon usage (RSCU) in the chloroplast genomes of the four species. Usage of the start codon methionine AUG and tryptophan UGG had no bias (RSCU = 1). All preferred relative synonymous codons (RSCU >1) ended with an A or a U, except for UUG (all 4 species), UCC (M. integrifolia (Maxim.) Franch, M. horridula and M. punicea) and UAG (M. integrifolia (Maxim.) Franch and M. punicea) (Supplementary Tables 2–5).

Plastid RNA editing prediction

RNA editing is a generic term comprising a variety of processes that alter the DNA-encoded sequence of a transcribed RNA by inserting, deleting or modifying nucleotides in a transcript¹⁷. Chloroplast RNA editing was first discovered in 1991. Nearly 30 years after the discovery of C-to-U editing in plant chloroplasts, the field has recently expanded tremendously in several research directions¹⁸. RNA editing provides a way to create transcript and protein diversity¹⁹. In higher plants, some chloroplast RNA editing sites are conserved²⁰.

To gain insight into the RNA editing sites in Meconopsis plants, we predicted 92, 78, 84 and 94 RNA editing sites out of 27, 26, 28 and 28 plastid genes in the chloroplast genomes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea, respectively, with PREP (Supplementary Tables 6–9). In these four species, the amino acid conversion from S to L was the most frequent type of conversion. As previously reported, with increased amino acids, the conversion from S to L becomes more frequent²¹. This finding indicated that the evolutionary conservation of RNA editing is essential^22,23.

Simple sequence repeats and repetitive sequence analysis

Tandem repeat sequences consisting of 1–6 nucleotide repeat units are known as simple sequence repeats (SSRs), or microsatellites²⁴. SSRs are valuable molecular markers with a high degree of variation within species and have been used in many population genetics and polymorphism investigations. Using the MISA software tool, we analyzed the occurrences and types of SSRs in the four Meconopsis chloroplast genomes. These genomes all have SSRs, and the majority of which are mono- and dinucleotide repeats, which were identified 88 and 29 times, respectively. The mononucleotide repeats were A/T repeats, and 82.8% of the dinucleotide repeats were AT/AT repeats (Table 3). Although the AT richness in the SSRs of the four chloroplast genomes of Meconopsis species was similar to that identified in previous studies, which suggested that SSRs found in the chloroplast genome are generally composed of polythymine (T) or polyadenine (A) repeats²⁵, the number of SSRs differs among the different species (40 in M. racemosa, 33 in M. integrifolia (Maxim.) Franch, 38 in M. horridula and 34 in M. punicea; Table 3). These findings indicate that SSRs can be used as molecular markers to identify these plant species.

Table 3 Types and numbers of SSRs in the chloroplast genomes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea.

Full size table

More complex and longer repeat sequences may play an important roles in sequence divergence and genomes²⁶. In these four Meconopsis chloroplast genomes, we found that the length of repeated sequences ranged mainly from 30 to 90 bp, similar to the lengths reported in other angiosperm plants^25,27,28. The numbers of repeats with at least 30 base pairs (bp) per repeat unit in the M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula, and M. punicea chloroplast genomes are 35, 49, 34 and 29, respectively. The M. racemosa chloroplast genome contains 27 repeats of 30–50 bp, 5 repeats of 51–70 bp, and 3 repeats longer than 90 bp. The M. integrifolia (Maxim.) Franch chloroplast genome contains 16 repeats of 30–50 bp, 12 repeats of 51–70 bp, 2 repeats of 71–90 bp and 19 repeats longer than 90 bp. The M. horridula chloroplast genome contains 25 repeats of 30–50 bp, 6 repeats of 51–70 bp, 1 repeat of 71–90 bp and 2 repeats longer than 90 bp. The M. punicea chloroplast genome contains 26 repeats of 30–50 bp, 1 repeat of 51–70 bp, and 2 repeats longer than 90 bp (Fig. 3).

Divergent hotspots in the Meconopsis chloroplast genome

Molecular markers with nucleotide diversity over 1.5% have been reported as highly variable regions that can be used for phylogenetic analysis and species identification in seed plants^29,30. Currently, there are few molecular biology-based studies of Meconopsis plants, and there is no uniform molecular marker for species identification^{31,32,33,34,35}.

A SNP (single nucleotide polymorphism) marker is a single base change in a DNA sequence, typically with two possible nucleotide alternatives at a given position³⁶. A total of 176, 2459, 36, 2982 SNPs were found in M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea, respectively. To reveal the sequence divergence levels, the nucleotide variability values within 800 bp in all four chloroplast genomes were calculated with DnaSP 6.10.03 software. The values ranged from 0 to 0.07, revealing slight differences among the genomes. For example, the p-distance between M. racemosa and each of M. integrifolia (Maxim.) Franch, M. horridula and M. punicea is 0.016, 0.001 and 0.018, respectively. These divergence hotspot regions can provide information for marker development for phylogenetic analyses of Meconopsis species. Overall, the results reveal higher divergence in noncoding regions than in coding regions. Using whole chloroplast genomes, we found that some regions differ among the four species, such as rps16, trnC-GCA, trnD-GCU, trnT-GGU, rps15, accD-PsaI and petA (Fig. 4a). The coding regions with marked differences include the matK, rpoC2, petA, ndhF and ycf genes (Fig. 4b). These genes could be utilized as potential phylogenetic markers to reconstruct the phylogeny in this genus. Qu Yan et al. reported that the ndhF gene could not be used to distinguish M. racemosa from M. horridula³⁷. However, our present study shows that the sequence of the ndhF gene in the chloroplast genome differs between these two species is distinct.

Divergent hotspots of chloroplast genomes have been used to identify species in other plants of the Papaveraceae family. Jianguo Zhou et al. used ycf1, rpoB-trnC, trnD-trnT, petA-psbJ, psbE-petL and ccsA-ndhD sequences in the chloroplast genome to distinguish Papaver orientale and Papaver rhoeas¹⁴. Zhe Zhang et al.³⁸ analyzed the phylogeny of 15 species from the Papaveraceae family based on the nuclear gene ITS sequence, the chloroplast gene rbcL sequence, and the combined sequences of these genes.

Comparisons of the chloroplast genomes among nine species in the Papaveraceae family

We compared the 9 known chloroplast genome sequences of species in the Papaveraceae family (M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula, M. punicea, Macleaya microcarpa (MH394383.1), Coreanomecon hylomeconoides (KT274030.1), Papaver somniferum (KU204905.1), Papaver rhoeas (MF943221.1) and Papaver orientale (MF943222.1)). The results indicated that species with the largest chloroplast genome is the M. microcarpa (161118 bp) and that with the smallest is M. integrifolia (Maxim.) Franch genome (151864 bp) (Table 1). The M. microcarpa (161118 bp) genome was used as the reference genome.

Next, we used the online program mVISTA to analyze gene order and content in the chloroplast genome. We found that the gene order and contents of the Meconopsis plants are similar to those of other members of the Papaveraceae family (Fig. 5). Similar to other plant species, all Meconopsis species have conserved chloroplast genomes, their coding regions are more conserved than their noncoding regions, and their IR regions are more conserved than their LSC and SSC regions^16,39,40.

Altitude and plant distribution

Altitude influences ecological factors such as water and temperature, which affects plant genetic variation and population differentiation⁴¹. In this study, the plant materials of M. racemosa and M. integrifolia (Maxim.) Franch were mainly collected from the Bayan Har mountains, Qinghai Province. This region has a cold continental climate with an average altitude of over 5000 m. The plant materials of M. horridula were collected from Matuo Country, Guoluo Tibetan Autonomous Prefecture, Qinghai Province. This region has an alpine grassland climate with an average annual temperature of −4 °C and an average altitude of over 4000 m. The plant materials of M. punicea were mainly collected in Chindu Country, Qinghai Province. This region has an average altitude of over 4000 m. Studies have shown that the evolutionary relationships of plants are affected by altitude^42,43. The plant materials used in this study were collected in the same area but at different altitudes: M. racemosa 4232 m; M. integrifolia (Maxim.) Franch, 4695 m; M. horridula, 4289 m; and M. punicea, 4639 m. According to traditional plant morphology taxonomy, M. racemosa is more closely related to M. horridula than to other Meconopsis species and is more distantly related to M. integrifolia (Maxim.) Franch and M. punicea⁴⁴, which is consistent with both the phylogenetic results of this study and the altitudes of their distributions. Although they are distributed in the same region, there is evident genetic isolation among them. We speculate that altitude may be an important ecological factor that affects the evolution of Meconopsis plants.

Phylogenetic analysis

With improvements and advancements in techniques, increasing numbers of chloroplast genome sequences have been used to reconstruct plant phylogenies⁴⁵. To identify the phylogenetic positions of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea within the Meconopsis genus, Bayesian inference (BI) and maximum likelihood (ML) methods of phylogenetic analysis were performed based on 90 protein-coding gene datasets from 40 plant taxa, with Sabia yunnanensis and Nelumbo nucifera used as outgroups. Both the BI and ML trees have similar phylogenetic topologies, and most nodal support values were high (Fig. 6). Using this reconstructions, M. racemosa, M. racemosa (MH394401)¹¹ and M. horridula were grouped together, as were M. integrifolia (Maxim.) Franch and M. punicea. These species are closely related to the Papaver genus within the Papaveraceae family.

In addition, we found that M. racemosa, M. horridula and M. racemosa (MH394401)¹¹ were grouped together. For several years, the delimitation of M. racemosa and M. horridula in the genus has been highly controversial⁴⁶. Fedd, Kingdon-Ward and Prain et al. considered M. racemosa and M. horridula to be the same species⁴⁶. However, in Tibetan Flora, M. racemosa is described as a variant of M. horridula. M. racemosa and M. racemosa (MH394401)¹¹ were distributed on different branches but are the same species. Incomplete lineage sorting, insufficient informative characters, hybridization or plastid capture could be responsible for the incongruent phylogenetic positions of this species^47,48.

We used the five gene markers (matK, rpoC2, petA, ndhF and ycf1 genes), screened by divergent hotspots in the Meconopsis chloroplast genomes, to construct five phylogenetic trees of these four Meconopsis plants and five other plants from the Papaveraceae family (P. somniferum, P. rhoeas, P. orientale, Macleaya microcarpa and Coreanomecon hylomeconoides) using Decaisnea insignis, Euptelea pleiosperma and Nuphar advena as outgroups (Fig. 7 and Supplementary Figs 1–4). The results showed that M. racemosa, M. racemosa (MH394401)¹¹ and M. horridula are grouped together and that M. integrifolia (Maxim.) Franch and M. punicea are grouped together. Among the five genes, the rpoC2 gene is not a suitable for potential DNA barcoding of Meconopsis plants, and the ycf1 gene has the highest node support value in the phylogenetic tree, which is consistent with previous reports that have used ycf1 to distinguish unknown Papaveraceae plants^14,49. In Tibetan Flora, M. racemosa is described as a variant of M. horridula on account of the similar morphological characterization of these taxa and the consistent ITS sequence. However, Dou et al.³⁵, using the ITS2 sequence, and Ni et al.³⁴, using the psbA-trnH sequence, constructed an evolutionary trees and found that these taxa clustered in different branches.

The chloroplast genome usually contains uniparentally inherited DNA, which is well suited for studying the evolutionary history of plants, such as dating a common ancestor⁵⁰. Yuan et al. used the chloroplast genome sequence of trnL-trnF and found that M. punicea is the mother of the hybrid species Meconopsis × cookei (Papaveraceae) and that M. quintuplinervia is the father³³.

Conclusions

In this study, we used the Illumina HiSeq 2000 system to sequence the complete chloroplast genomes of four Meconopsis species: M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. We demonstrate that these four Meconopsis species are divided into two groups, with M. racemosa and M. horridula in one group and M. integrifolia (Maxim.) Franch and M. punicea in the other. By comparing the chloroplast genome sequences, we were able to retrieve all genetic resources, including SNPs, SSRs, repetitive sequence, codon usage, RNA editing prediction, ‘hotspot’ regions and phylogenomic analysis. These resources will provide chloroplast genome molecular markers for the identification of these Meconopsis species. We also used four hotspot genes (matK, petA, ndhF and ycf1) to construct phylogenetic trees and clearly distinguish these species.

With the development of plant science, plastid transformation is becoming an important tool. The limited availability of complete chloroplast genomic information is one of the major factors preventing the extension of this technology to valuable plants. The Meconopsis chloroplast genome data obtained in this study could be applied in biotechnology and provide useful information for designing transformation vectors in the future.

Materials and Methods

Plant material and DNA extraction

The plant materials used in this study were seeds collected from M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea in Qinghai Province. All samples were identified by Professor Junhua Du, who is affiliated with Qinghai Normal University. Total genomic DNA was isolated from seeds using the Mag-MK Plant Genomic DNA extraction kit (Sangon Biotech, Shanghai, China), and DNA quality was assessed based on spectrophotometry and electrophoresis in a 1% (w/v) agarose gel. Total DNA samples were chosen for Illumina 2000 sequencing.

Chloroplast genome assemblage and annotation

For these four species, the high-throughput sequencing data were qualitatively assessed and assembled using NOVOPlasty 2.6.3. Gaps in the cpDNA sequences were filled by PCR amplification and Sanger sequencing. The annotations of the chloroplast genomes were performed with Geneious 8.0.4, DOGMA⁵¹, CPGAVAS⁵² and CPGAVAS2⁵³ followed by manual correction. The tRNAs were verified by the online tRNAscan-SE 1.21 search server. All the annotations were manually checked against the references (NC_029434.1 and NC_031446.1). The genome maps were drawn by OGDRAW. The entire chloroplast genome sequences of M.racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea, along with the gene annotations, were submitted to GenBank (Accession Numbers: M. racemosa, MK533649; M. integrifolia (Maxim.) Franch, MK533647; M. horridula, MK533646; M. punicea, MK533648).

Codon usage

Codon usage was determined for all protein-coding genes. The relative synonymous codon usage (RSCU) values and codon usage were determined with MEGA7, which was used to reveal the characteristics of the variation in synonymous codon usage⁵⁴.

Simple sequence repeats and repetitive sequence analysis

Chloroplast microsatellites were identified in a high-quality sequence of clusterbean by using the MISA Perl script⁵⁵. The minimum numbers for the SSR motifs were 10, 5, 4, 3, 3 and 3 for mono-,di-,tri-,tetra-,penta-,and hexanucleotide repeats, respectively. REPuter was used to identify forward repeats, reserve sequences, complementary and palindromic sequences, with a minimum repeat size of 30 bp and 90% sequence identity⁵⁶.

Prediction of RNA editing sites

Twenty-eight protein-coding genes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea were used to predict potential RNA editing sites using the Predictive RNA Editor for Plants (PERP) suite (http://prep.unl.edu) with a cutoff value of 0.8.

Genome comparison

MAFFT was used to align the chloroplast genomes⁵⁷. The complete chloroplast genomes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea were compared using mVISTA⁵⁸.

Divergent hotspots identification

The M. racemosa, M. integrifolia(Maxim.) Franch, M. horridula and M.punicea chloroplast genome sequences were aligned using MAFFT and were manually adjusted using Geneious 8.0.4. To analyze nucleotide diversity, we conducted a sliding window analysis using DnaSP version 6.10.03. software⁵⁹. The window length was set to 800 bp, and the step size was 200 bp.

Phylogenetic analysis

The chloroplast genome sequences of M. racemosa, M. integrifolia(Maxim.) Franch, M. horridula, M. punicea and those of 38 other species were collected from NCBI (Supplementary Table 10) were used for phylogenetic analysis. All of the coding sequences from the 42 species were aligned with the MAFFT method based on codons by Geneious 8.0.4. The best nucleotide substitution model (GTR + G + I) was tested, and a maximum likelihood (ML) tree (1000 bootstrap replicates) was constructed with RAxML software⁶⁰. BI analyses were conducted using GPU MrBayes. The GTR + I + G substitution model was used for BI. In the BI analyses, two simultaneous runs of 10000000 generations were conducted for the matrix. Each set was sampled every 1000 generations with a burn-in of 25%. The matK, rpoC2, petA, ndhF and ycf1 gene sequences of M. racemosa, M. integrifolia(Maxim.) Franch, M. horridula, M. punicea and 9 other species were collected from NCBI. Maximum likelihood (ML) analyses were conducted using RAxML software with the GTR model⁶¹.

Change history

17 October 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Wang, B., Song, X. H. & Cheng, C. M. Advance in Ethnobotanical Investigation on Meconopsis. Chinese Academic Medical Magazine of Organisms 01, 39–45 (2003).
Google Scholar
Guo, Z. et al. Chemical constituents from a Tibetan medicine Meconopsis horridula. China Journal of Chinese Materia Medica 39, 1152–1156 (2014).
CAS PubMed Google Scholar
Zhao, Z. et al. Peng. Advances in studies on the classification, chemical composition and pharmacological action of Meconopsis as Tibetan medicines. China. Pharmacy 27, 4391–4394 (2016).
Google Scholar
Chang, Y., Wang, X. L., Tang, X. Y., Yuan, L. Y. & Chen, L. H. A New Alkaloid from Meconopsis horridula. Natural Product Research and Development 29, 731–734 (2017).
Google Scholar
Qu, Y. & Ou, Z. The research advancement on the genus Meconpsis. Northern Horticulture 191–194 (2012).
Wang, B., Song, X. H., Cheng, C. M. & Yang, J. S. Studies on species of Meconopsis as Tibetan medicines. Chinese Wild Plant Resources 22, 45–48 (2003).
Google Scholar
Fan, Y. et al. Effect of Meconopsis racemosa alcohol extract on proliferation of K562 cells and its mechanism. Journal of Chinese Medicinal Materials 36, 1143–1146 (2013).
Google Scholar
Guo, Z. Q. Study on the anti-myocardial ischemic effect and chemical composition of Meconopsis horridula as Tibetan medicine, Beijing University Of Chinese Medicine (2014).
Luo, J. et al. Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications. Plos One 9, e99016 (2014).
Article ADS PubMed PubMed Central Google Scholar
Ni, L. H., Zhao, Z. L., Xu, H. X., Chen, S. L. & Dorje, G. Chloroplast genome structures in Gentiana (Gentianaceae), based on three medicinal alpine plants used in Tibetan herbal medicine. Current Genetics 63, 1–12 (2016).
Google Scholar
Zeng, C. X. et al. Genome skimming herbarium specimens for DNA barcoding and phylogenomics. Plant Methods 14, 43, https://doi.org/10.1186/s13007-018-0300-0 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. M. et al. Sequencing and analysis of Strobilanthes cusia (Nees) Kuntze chloroplast Genome revealed the rare simultaneous contraction and expansion of the inverted repeat region in Angiosperm. Frontiers in Plant Science 9, 324 (2018).
Article PubMed PubMed Central Google Scholar
Chang, C. C. et al. The chloroplast genome of Phalaenopsis aphrodite (Orchidaceae): Comparative analysis of evolutionary rate with that of grasses and its phylogenetic implications. Molecular Biology Evolution 23, 279 (2006).
Article CAS PubMed Google Scholar
Zhou, J. G. et al. Complete chloroplast Genomes of Papaver rhoeas and Papaver orientale: molecular structures, comparative analysis, and phylogenetic analysis. Molecules 23, 437, https://doi.org/10.3390/molecules23020437 (2018).
Article CAS PubMed Central Google Scholar
Li, B., Lin, F. R., Huang, P., Guo, W. Y. & Zheng, Y. Q. Complete chloroplast Genome sequence of Decaisnea insignis: Genome organization, Genomic resources and comparative analysis. Sci Rep 7, 10073, https://doi.org/10.1038/s41598-017-10409-8 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, X., Li, Y., Yang, H. & Zhou, B. Chloroplast Genome of the folk medicine and vegetable plant Talinum paniculatum (Jacq.) Gaertn.: gene organization, comparative and phylogenetic analysis. Molecules 23, 857, https://doi.org/10.3390/molecules23040857 (2018).
Article CAS PubMed Central Google Scholar
Mower, J. P. The PREP suite: predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res 37, W253–259, https://doi.org/10.1093/nar/gkp337 (2009).
Article CAS PubMed PubMed Central Google Scholar
Lenz, H., Hein, A. & Knoop, V. Plant organelle RNA editing and its specificity factors: enhancements of analyses and new database features in PREPACT 3.0. BMC Bioinformatics 19, 255, https://doi.org/10.1186/s12859-018-2244-9 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bundschuh, R., Altmuller, J., Becker, C., Nurnberg, P. & Gott, J. M. Complete characterization of the edited transcriptome of the mitochondrion of Physarum polycephalum using deep sequencing of RNA. Nucleic Acids Res 39, 6044–6055, https://doi.org/10.1093/nar/gkr180 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zeng, W. H., Liao, S. C. & Chang, C. C. Identification of RNA editing sites in chloroplast transcripts of Phalaenopsis aphrodite and comparative analysis with those of other seed plants. Plant Cell Physiol 48, 362–368, https://doi.org/10.1093/pcp/pcl058 (2007).
Article CAS PubMed Google Scholar
Luo, J. et al. Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications. PLoS One 9, e99016, https://doi.org/10.1371/journal.pone.0099016 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Magdalena, G. N., Ewa, F. & Wojciech, P. Cucumber, melon, pumpkin, and squash: Are rules of editing in flowering plants chloroplast genes so well known indeed? Gene 434, 0–8 (2009).
Google Scholar
Huang, Y. Y., Antonius, J. M. M. & Matzke, M. Complete sequence and comparative analysis of the chloroplast Genome of Coconut Palm (Cocos nucifera). Plos One 8, e74736 (2013).
Kaila, T. et al. Chloroplast Genome sequence of Clusterbean (Cyamopsis tetragonoloba L.): Genome structure and comparative analysis. Genes 8, 212, https://doi.org/10.3390/genes8090212 (2017).
Article CAS PubMed Central Google Scholar
Li, Y. G., Xu, W. Q., Zou, W. T., Jiang, D. Y. & Liu, X. H. Complete chloroplast genome sequences of two endangered Phoebe (Lauraceae) species. Bot Stud 58, 37, https://doi.org/10.1186/s40529-017-0192-8 (2017).
Article CAS PubMed PubMed Central Google Scholar
Smith, T. C. Chloroplast evolution:secondary dispatch symbiogenesis and multiple losses. Current Biology 12, 0–0 (2002).
Google Scholar
Greiner, S. et al. The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. sequence evaluation and plastome evolution. Nucleic Acids Res 36, 2366–2378, https://doi.org/10.1093/nar/gkn081 (2008).
Article CAS PubMed PubMed Central Google Scholar
Song, Y. et al. Chloroplast Genomic Resource of Paris for Species Discrimination. Sci Rep 7, 3427, https://doi.org/10.1038/s41598-017-02083-7 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Sarkinen, T. & George, M. Predicting plastid marker variation: can complete plastid genomes from closely related species help? PLoS One 8, e82266, https://doi.org/10.1371/journal.pone.0082266 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Korotkova, N., Nauheimer, L., Hasmik, T. V., Allgaier, M. & Borsch, T. Variability among the most rapidly evolving plastid genomic regions is lineage-specific: implications of pairwise genome comparisons in Pyrus (Rosaceae) and other angiosperms for marker choice. Plos One 9, e112998 (2014).
Article ADS PubMed PubMed Central Google Scholar
Kim, K. et al. Complete chloroplast and ribosomal sequences for 30 accessions elucidate evolution of Oryza AA genome species. Sci Rep 5, 15655, https://doi.org/10.1038/srep15655 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Yuan, C. C., Li, P. X., Wang, Y. F. & Shi, S. H. The confirmation of putative natural hybrid species Meconopsis × cooei G. Taylor (Papaveraceae) based on nuclear ribosomal DNA ITS region sequence. Acta Agrestia Sinica 31, 901–907 (2004).
CAS Google Scholar
Yuan, C. C., He, X. B., Yuan, Q. M. & Shi, S. H. Genetic relationship between a natural hybrid Meconopsis × cookei (Papaveraceae) and its parents based on cpDNA trnL-trnF region sequence. Acta Botanica Yunnanica 29, 103–108 (2007).
CAS Google Scholar
Ni, L. H., Zhao, Zl, Meng, Q. W. & GAAWE, D. & MI, M. Identification of Tibetan medicinal plants of Meconopsis Vig. using ITS and psbA-trnH sequence. Chinese Traditional and Herbal. Drugs 45, 541–545 (2014).
CAS Google Scholar
Dou, R. K. et al. Identification and analysis of Corydalis boweri, Meconopsis horridula and their close related species of the same genus by using ITS2 DNA barcode. China. Journal of Chinese Materia Medica 40, 1453 (2015).
CAS Google Scholar
Vignal, A., Milan, D., SanCristobal, M. & Eggen, A. A review on SNP and other types of molecular markers and their use in animal genetics. Genet Sel Evol 34, 275–305, https://doi.org/10.1051/gse:2002009 (2002).
Article CAS PubMed PubMed Central Google Scholar
Qu, Y., Zhao, W. Y., Ou, Z., Leng, Q. S. & Xiong, J. Analysis of chloroplast gene ndhF and rbcL sequences of Tibetan medicine plants of Meconopsis. Journal of Central South University of Forestry Technology 38, 90–95 (2018).
Google Scholar
Zhang, Z., Kong, Y., Li, Y., Wang, X. Y. & Liu, B. Phylogeny of some Papaveraceae plants in Xinjiang based on DNA barcoding technology. Arid Zone Research 31, 322–328 (2014).
Google Scholar
Olga, K. & Ralph, B. Elimination of deleterious mutations in plastid genomes by gene conversion. The Plant Journal 46, 85–94 (2006).
Article Google Scholar
Ni, L., Zhao, Z., Xu, H., Chen, S. & Dorje, G. The complete chloroplast genome of Gentiana straminea (Gentianaceae), an endemic species to the Sino-Himalayan subregion. Gene 577, 281–288 (2016).
Article CAS PubMed Google Scholar
Zhao, C. The plasticity of altitudes to the morphological characteristics of salicornia. Acta Agrestia Sinica 23, 897–904 (2015).
Google Scholar
Winkworth, R. C., Wagstaff, S. J., Glenny, D., Lockhart, P. J. J. O. D. & Evolution. Evolution of the New Zealand mountain flora: Origins, diversification and dispersal. 5, 237–247 (2005).
Wei, L. & Wei, C. Effects of phytogenetic structure and environmental factors on plant community in changbai mountain. Journal of Arid Land Resources and Environment 27, 63–68 (2013).
Google Scholar
yi, W. Z. & Zhuang, X. Study on the classification system of Meconopsis. Plant Diversity 2, 371–381 (1980).
Google Scholar
Li, B. & Zheng, Yq Dynamic evolution and phylogenomic analysis of the chloroplast genome in Schisandraceae. Scientific Reports 8, 9285 (2018).
Article ADS PubMed PubMed Central Google Scholar
Xie, S. j., Yang, J. w., Xu, W. y. & Yuan, C. c. In 2006 Chinese symposium on physiological ecology and molecular biology of plant stress.
Maddison, W. P. & Knowles, L. L. Inferring phylogeny despite incomplete lineage sorting. Syst Biol 55, 21–30, https://doi.org/10.1080/10635150500354928 (2006).
Article PubMed Google Scholar
Yang, H. M., Zhang, Y. X., Yang, J. B. & Li, D. Z. The monophyly of Chimonocalamus and conflicting gene trees in Arundinarieae (Poaceae: Bambusoideae) inferred from four plastid and two nuclear markers. Mol Phylogenet Evol 68, 340–356, https://doi.org/10.1016/j.ympev.2013.04.002 (2013).
Article PubMed Google Scholar
Jeon, J. H. & Kim, S. C. Comparative analysis of the complete Chloroplast Genome sequences of three closely related East-Asian wild roses (Rosa sect. Synstylae; Rosaceae). Genes 10 (2019).
Jheng, C. F. et al. The comparative chloroplast genomic analysis of photosynthetic orchids and developing DNA markers to distinguish Phalaenopsis orchids. Plant Science 190, 62–73 (2012).
Article CAS PubMed Google Scholar
Wyman, S. K., Jansen, R. K. & Boore, J. L. Automatic annotation of organellar genomes with DOGMA. Bioinformatics 20, 3252–3255, https://doi.org/10.1093/bioinformatics/bth352 (2004).
Article CAS PubMed Google Scholar
Liu, C. et al. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. Bmc Genomics 13, 715, https://doi.org/10.1186/1471-2164-13-715 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shi, L. C. et al. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res 1, 1–9, https://doi.org/10.1093/nar/gkz345/5486746 (2019).
Article Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular Biology Evolution 33, 1870 (2016).
Article CAS PubMed PubMed Central Google Scholar
Thiel, T., Michalek, W., Varshney, R. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theoretical Applied Genetics 106, 411–422 (2003).
Article CAS PubMed Google Scholar
Kurtz, S. et al. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Research 29, 4633–4642 (2001).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Kazutaka, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology Evolution 30, 772–780 (2013).
Article Google Scholar
Dubchak, I. & Ryaboy, D. V. VISTA Family of Computational Tools for Comparative Analysis of DNA Sequences and Whole Genomes. Methods in Molecular Biology 338, 69–89 (2006).
CAS PubMed Google Scholar
Rozas, J. et al. DnaSP 6: DNA sequence polymorphism analysis of large datasets. Molecular Biology Evolution 34 (2017).
Alexandros, S. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article Google Scholar
Yang, H., Li, T., Dang, K. & Bu, W. Compositional and mutational rate heterogeneity in mitochondrial genomes and its effect on the phylogenetic inferences of Cimicomorpha (Hemiptera: Heteroptera). Bmc Genomics 19, 264 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by grants from the Natural Science Foundation of Tianjin (No. 18JCQNJC14000), the Tianjin City High School Science & Technology Fund Planning Project (No. 20130203), Qinghai Science and Technology Project (No. 2014-HZ-815) and the Ph.D. Candidate Research Innovation Fund of Nankai University. We thank the Guangzhou Gene Denovo Biotechnology Company for assisting with the sequencing analysis.

Author information

Xiaoxue Li and Wei Tan contributed equally.

Authors and Affiliations

College of Life Science, Nankai University, Weijin Road 94, 300071, Tianjin, China
Xiaoxue Li, Jiqi Sun, Chenguang Zheng, Min Zheng & Yong Wang
Tianjin State Key Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Poyang Lake Road 10, 301617, Tianjin, China
Wei Tan & Xiaoxuan Tian
College of Life and Geographic Sciences, Qinghai Normal University, 36 Wusixi Street, 810008, Qinghai, China
Junhua Du
School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Poyang Lake Road 10, 301617, Tianjin, China
Beibei Xiang

Authors

Xiaoxue Li
View author publications
You can also search for this author in PubMed Google Scholar
Wei Tan
View author publications
You can also search for this author in PubMed Google Scholar
Jiqi Sun
View author publications
You can also search for this author in PubMed Google Scholar
Junhua Du
View author publications
You can also search for this author in PubMed Google Scholar
Chenguang Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoxuan Tian
View author publications
You can also search for this author in PubMed Google Scholar
Min Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Beibei Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Yong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.-X.L., B.-B.X. and Y.W. designed the experiment and drafted and revised the manuscript. W.T., C.-G.Z. and X.-X.T. analyzed the data. J.-Q.S., J.-H.D. and M.Z. prepared the plant materials and collected the samples. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Beibei Xiang or Yong Wang.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Tan, W., Sun, J. et al. Comparison of Four Complete Chloroplast Genomes of Medicinal and Ornamental Meconopsis Species: Genome Organization and Species Discrimination. Sci Rep 9, 10567 (2019). https://doi.org/10.1038/s41598-019-47008-8

Download citation

Received: 11 April 2019
Accepted: 08 July 2019
Published: 22 July 2019
DOI: https://doi.org/10.1038/s41598-019-47008-8

This article is cited by

The complete chloroplast genome of Onobrychis gaubae (Fabaceae-Papilionoideae): comparative analysis with related IR-lacking clade species
- Mahtab Moghaddam
- Atsushi Ohta
- Shahrokh Kazempour-Osaloo
BMC Plant Biology (2022)
Characterization of the Dicranostigma leptopodum chloroplast genome and comparative analysis within subfamily Papaveroideae
- Lei Wang
- Fuxing Li
- Jiahui Sun
BMC Genomics (2022)
Comparative plastomes and phylogenetic analysis of seven Korean endemic Saussurea (Asteraceae)
- Seona Yun
- Seung-Chul Kim
BMC Plant Biology (2022)
Complete chloroplast genome sequence of Lens ervoides and comparison to Lens culinaris
- Nurbanu Tayşi
- Yasin Kaymaz
- M. Bahattin Tanyolaç
Scientific Reports (2022)
Comparative genomic study on the complete plastomes of four officinal Ardisia species in China
- Chunzhu Xie
- Wenli An
- Xiasheng Zheng
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

Chloroplast genome sequencing, assembly and validation

Chloroplast genome structural features and gene content

Amino acid abundance and codon usage

Plastid RNA editing prediction

Simple sequence repeats and repetitive sequence analysis

Divergent hotspots in the Meconopsis chloroplast genome

Comparisons of the chloroplast genomes among nine species in the Papaveraceae family

Altitude and plant distribution

Phylogenetic analysis

Conclusions

Materials and Methods

Plant material and DNA extraction

Chloroplast genome assemblage and annotation

Codon usage

Simple sequence repeats and repetitive sequence analysis

Prediction of RNA editing sites

Genome comparison

Divergent hotspots identification

Phylogenetic analysis

Change history

17 October 2019

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links