Unravelling molecular mechanisms from floral initiation to lipid biosynthesis in a promising biofuel tree species, Pongamia pinnata using transcriptome analysis

Sreeharsha, Rachapudi V.; Mudalkar, Shalini; Singha, Kambam T.; Reddy, Attipalli R.

doi:10.1038/srep34315

Article
Open access
Published: 28 September 2016

Scientific Reports volume 6, Article number: 34315 (2016)

Abstract

Pongamia pinnata (L.) (Fabaceae) is a promising biofuel tree species that remains underexploited in both fundamental and applied research because transcriptome and genomic data are lacking. To investigate the possible metabolic pathways, we performed whole transcriptome analysis of Pongamia on the Illumina NextSeq platform and generated 2.8 GB of paired end sequence reads. The de novo assembly of raw reads generated about 40,000 contigs and 35,000 transcripts, representing leaf, flower and seed unigenes. Spatial and temporal expression profiles of photoperiod and floral homeotic genes in Pongamia identified GIGANTEA (GI) - CONSTANS (CO) - FLOWERING LOCUS T (FT) as the active signal cascade for floral initiation. Four prominent stages of seed development were selected in a high yielding Pongamia accession (TOIL 1) to follow the temporal expression patterns of key fatty acid biosynthetic genes involved in lipid biosynthesis and accumulation. Our results provide insights into an array of molecular events from flowering to seed maturity in Pongamia and offer a substantial basis for modulating fatty acid composition and enhancing oil yields for use as a potential feedstock for biofuel production.

Introduction

Decreasing fossil fuel consumption and addressing worsening global environmental conditions are fundamental concerns of society in this industrial era. The development and use of alternative fuels, including bioethanol and biodiesel, are predicted to significantly alleviate the problems caused by the use of fossil fuels. An emerging biofuel tree, Pongamia pinnata (L.) (Family: Fabaceae), is a native species of India which can be grown in diverse tropical and subtropical marginal lands of the world. It is a drought and salinity tolerant, semi-deciduous, nitrogen fixing tree which grows up to 15–20 meters in height with a large canopy¹. The oil content of Pongamia seeds ranges from 35 to 40% of seed dry weight, and 55% of the oil is oleic acid, which is the ideal fatty acid for good quality biodiesel production. Upon trans-esterification, Pongamia oil blended with diesel can be used in automobiles without any further modification of engines. Pongamia has a long life cycle: it usually starts flowering 4–5 years after planting and takes 9–11 months to form a mature pod after anthesis. Many improvements could potentially be achieved by understanding the genetics and genomics of Pongamia. For instance, the oil content of Pongamia, which is around 35%, could be increased to about 50% with higher oleic acid content, and the fruit maturity time could be reduced. However, limited genetic resources and long production cycles have constrained molecular breeding programs aimed at better oil quality and seed yields in this potential biofuel tree species.

Flowering time, fertilization and seed development are inter-related and determine the yield of certain promising biofuel tree species, including Pongamia. Understanding the molecular mechanisms that control the onset of reproductive events and development of the seeds is crucial for improving the biofuel feedstock. Photoperiod and vernalization pathways have been reported as major control mechanisms to synchronize environmental cues with the internal rhythm^2,3,4. The former promotes flowering in response to increasing day length and the latter enables induction of flowering following a prolonged exposure to cold. To trigger flower initiation at the precise time and in the right conditions, circadian clocks perceive and integrate both environmental and endogenous signals, which in turn activate mobile florigen and other related proteins to induce flowering. Besides the floral promotion pathways, mutation studies in Arabidopsis have also revealed the existence of genes that repress the floral transition⁵. Exploiting these genes through genetic engineering, by combining knowledge obtained from model plants like Arabidopsis with sequence information obtained from the transcriptome, could have a substantial impact on floral transitions in Pongamia. However, the lack of publicly available sequence information hinders efforts to improve seed oil quality and the floral transition period in non-model and non-edible biofuel crops like Pongamia.

In many seeds, triacylglycerols (TAGs) that accumulate during the maturation phase of the embryo and/or endosperm act as major storage reserves of carbon, and during germination they support the establishment of the seedling⁶. Fatty acid (FA) metabolism and the enzymes involved in it play an important role in plant morphology, growth, seed development and stress responses. Earlier reports have shown that a FATB mutant of Arabidopsis had reduced concentrations of palmitate and stearate, which affected both plant growth and seed development⁷. Similarly, enoyl-CoA reductase and stearoyl-ACP desaturase had essential roles in endocytic membrane trafficking and defence responses respectively by regulating the oleic acid contents^8,9. Also, owing to the commercial importance of plant lipids, several attempts have been made to alter seed TAG composition through genetic engineering. For instance, Arabidopsis and Camelina transgenic plants were engineered to produce oil with a high omega-3/omega-6 ratio and high DHA¹⁰. Engineering of key FA biosynthetic enzymes, including fatty acid desaturase 2 (FAD2) and fatty acid elongase 1 (FAE1), has resulted in high oleic acid containing lines of Camelina and soybean^11,12. Oil accumulation depends on seed development patterns, which show great diversity in the duration of maturity and oil formation stages. Knowledge about temporal expression patterns of oil biosynthetic enzymes is crucial for understanding reprogramming strategies of oil biosynthesis as well as modification of fatty acid composition.

High throughput deep sequencing of the transcriptome is a promising and powerful tool to identify the key genes associated with species-specific exotic FA biosynthetic enzymes and for molecular marker development. Of late, several non-model organisms lacking a reference genome sequence were sequenced and annotated using platforms such as Roche/454, AB SOLiD and Illumina^13,14,15,16,17. Recently, salt responsive genes were identified upon transcriptome sequencing of leaf and root tissues of Pongamia pinnata using the Illumina platform¹⁸. In the same year, another group reported the chloroplast and mitochondrial genomes of Pongamia obtained through second generation DNA sequencing¹⁹. Further, Pavithra et al.²⁰ reported the FA profiles of seeds at different developmental stages of Pongamia. Very recently, parallel to the current study, Huang et al.²¹ reported the seed transcriptome of Pongamia, which provided valuable information about lipid biosynthetic genes and SSR markers. However, the whole transcriptome, together with gene expression patterns of lipid biosynthetic enzymes during seed development and of flowering pathway genes, is far from being characterised in Pongamia. In the present study, we constructed a paired end cDNA library from pooled RNA isolated from leaf, flower, pod and seed tissues of a mature Pongamia tree and sequenced it using the Illumina TruSeq protocol on the Illumina NextSeq 500 platform. We also mined the data for circadian clock genes and lipid biosynthetic genes in Pongamia and report their homology with other model organisms. Our data provide comprehensive information on the Pongamia transcriptome and on the temporal expression of candidate genes involved in lipid accumulation and flowering, which can be applied to molecular breeding programs for improving the seed oil content in Pongamia.

Results

Sequencing and de novo assembly of Pongamia transcriptome

We generated a total of 24,004,632 paired end sequence reads, each 76 bp in length, encompassing about 2.8 GB of sequence data in FASTQ format. After stringent filtering of sequence data for low-quality reads and reads containing primer/adaptor sequences, we obtained a total of 22,158,278 high quality sequence reads (with phred quality score of >20). The final data set comprising ~22 million high-quality reads was used for optimization of de novo assembly and analysis of the Pongamia transcriptome. All processed reads were assembled into contigs without any reference (de novo) using Velvet 1.2.10. Assembly was tried with various hash lengths (k-mers), and 41 was selected as the best hash length. The best k-mer was decided on several parameters: number of contigs, total number of reads used, total contig length and number of non-ATGC characters. A total of 42,724 contigs were generated, with maximum and minimum lengths of 11,665 and 200 bp respectively and an average contig length of 639.7 bp. Contigs were then processed into transcripts (spliced isoforms) using Oases 0.2.8. The majority of high quality reads (81.77%) were assembled to generate a total of 36,047 transcripts that ranged from 200 to 30,656 bp in length, with an average length of 937.586 bp. The number of transcripts decreased as the length increased, with the maximum number of transcripts falling in the range of 100–500 bp followed by 500–1,000 bp. The N50, N90 and RPKM values of the Pongamia transcriptome, along with other related parameters, are presented in Supplementary Table S1. The average GC content of Pongamia transcripts was 48%, with the highest proportion of transcripts in the range of 40–45%, followed by 45–50%, resulting in a much broader GC content range (Supplementary Fig. S1).
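The assembly statistics quoted above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, not the pipeline used in the study: the helper names (`n50`, `phred_error_probability`, `retained_fraction`) and the toy contig lengths are hypothetical, and only the two read counts are taken from the text.

```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together contain at least half of all assembled bases."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("lengths must be non-empty")
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length


def phred_error_probability(q):
    """Convert a phred quality score Q into a base-call error probability,
    using the standard definition Q = -10 * log10(P)."""
    return 10 ** (-q / 10)


def retained_fraction(raw_reads, kept_reads):
    """Fraction of raw reads that survive quality filtering."""
    return kept_reads / raw_reads


# Read counts reported in the text.
RAW_READS = 24_004_632
HIGH_QUALITY_READS = 22_158_278

# Hypothetical contig lengths, for illustration only.
toy_lengths = [200, 300, 500, 800, 1200]

print(f"retained: {retained_fraction(RAW_READS, HIGH_QUALITY_READS):.1%}")
print(f"error probability at Q20: {phred_error_probability(20)}")
print(f"toy N50: {n50(toy_lengths)} bp")
```

With the reported counts, the retained fraction is about 92%, and a phred score of 20 corresponds to a 1% chance of an incorrect base call.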

Functional annotation and characterisation of Pongamia transcripts

Genes were predicted from the assembled transcripts using GeneMark software and annotated using NCBI-BLAST 2.2.29. A total of 47,461 genes/proteins were predicted, of which 25,112 proteins were annotated with either the Swiss-Prot (108) or TrEMBL (25,004) databases. Most of the predicted proteins in Pongamia showed homology in UniProt with G. max (44%) and G. soja (22%), followed by P. vulgaris (21%) and other Papilionoideae members (Supplementary Data 1). The transcripts that showed significant homology to genes in the UniProt database were selected for GO annotation. A total of 16,146 (45%) transcripts were assigned at least one GO term, of which 9508 (26.37%) were assigned to the biological process category, 4420 (12.26%) to the cellular component category and 2218 (6.15%) to the molecular function category (Fig. 1a). Among the various biological processes, ignoring unknown and other biological process categories, ATP binding (2205) and protein serine/threonine kinase activity (1316) were highly represented. The genes involved in other biological processes such as zinc ion binding, DNA binding, oxidoreductase activity, hydrolase activity and metal ion binding, and those having catalytic activity, were also identified through GO annotations (Fig. 1a). Similarly, genes involved in transcription and transcription regulation were the most represented in the molecular function category, followed by carbohydrate metabolism, protein metabolism and transport (Fig. 1a). Integral membrane components and nucleus were the most represented among the cellular components, followed by membrane and cytoplasm.

We also annotated 13,764 Pongamia genes against the KOG (Eukaryotic Orthologous Groups) database, which aids in identification and phyletic classification of the orthologous proteins coded in the whole genomes of almost 21 organisms including bacteria, algae and eukaryotes²². The resulting KOG annotation grouped the transcripts into three functional categories: cellular processing and signalling (3214; 31%), metabolism (2999; 30%) and information storage and processing (1945; 19%); the rest of the genes resulted in poorly characterized annotations (Fig. 1b) (Supplementary Data 2).

We identified a total of 4148 SSRs, in which mono-nucleotide SSRs represented the largest fraction (36.4%), followed by tri-nucleotide (31.3%) and di-nucleotide (28.8%) SSRs (Supplementary Fig. S2). Pongamia transcripts also contained tetra- (79), penta- (31) and hexa-nucleotide (28) SSRs, though their representation in the total SSR pool is small.
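SSR mining of the kind summarised above amounts to scanning each transcript for short motifs repeated in tandem. The sketch below illustrates the idea with regular expressions; it is not the tool used in the study, and the minimum repeat counts in `MIN_REPEATS`, the function name `find_ssrs` and the example sequence are all assumptions made for illustration.

```python
import re

# Assumed minimum number of tandem repeats per motif length (1 = mono, ..., 6 = hexa).
# These thresholds are illustrative and are not taken from the paper.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least (min_rep - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (for example "AA" is a mono-nucleotide run, not a di-nucleotide SSR).
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Hypothetical sequence: an (AT)7 di-nucleotide SSR and an (A)12 mono-nucleotide SSR.
example = "GC" + "AT" * 7 + "GC" + "A" * 12 + "C"
for start, motif, count in find_ssrs(example):
    print(start, motif, count)
```

Counting the hits by motif length across all transcripts would give class fractions comparable to those reported, although the exact numbers depend on the thresholds chosen.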

Sequence similarity of Pongamia transcripts with other plants

The transcripts of Pongamia were analysed for similarity against the unigene datasets of legume crops, biofuel plants and other oil bearing plants belonging to different families using a TBLASTX search. An E-value cut-off threshold of 1E-05 was considered to define a significant hit. The largest number of Pongamia transcripts showed significant similarity with soybean putative mRNA sequences (53%), followed by M. truncatula (35%), whereas Pongamia showed little conservation with oil bearing trees of other families (Fig. 2a). We also analysed the sequence conservation of translated Pongamia transcripts with proteomes of selected plant species. Putative Pongamia proteins showed maximum homology with the biofuel plant J. curcas (59.2%), followed by G. max (53%) and C. sativa (52%) (Fig. 2a). Our analysis also showed that 53% of Pongamia transcripts have homology with legumes, which may indicate that these genes are legume specific (Supplementary Data 3).
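Applying an E-value cut-off to similarity search results can be sketched as follows. This is an illustrative example only: it assumes the hits are available in the standard 12-column tabular BLAST format (E-value in the 11th column), treats the 1E-05 threshold as inclusive, and uses made-up hit lines and a hypothetical function name.

```python
def significant_queries(blast_lines, evalue_cutoff=1e-5):
    """Map each query to its best (subject, evalue) hit at or below the cut-off.

    Expects tab-separated lines with the columns
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore.
    """
    best = {}
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue = float(fields[10])
        if evalue > evalue_cutoff:
            continue
        # Keep only the hit with the lowest E-value for each query.
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return best


# Hypothetical hits for three transcripts (t1, t2, t3).
toy_hits = [
    "t1\tGm01\t85.0\t200\t30\t0\t1\t200\t1\t200\t1e-50\t180",
    "t1\tMt07\t80.0\t190\t38\t0\t1\t190\t5\t194\t1e-40\t150",
    "t2\tGm02\t60.0\t50\t20\t0\t1\t50\t1\t50\t0.002\t35",
    "t3\tJc05\t70.0\t120\t36\t0\t1\t120\t1\t120\t1e-05\t60",
]

hits = significant_queries(toy_hits)
total_queries = 3
print(f"{len(hits)} of {total_queries} transcripts have a significant hit")
```

Dividing the number of queries with a significant hit by the total number of transcripts gives the kind of percentage reported for each reference species.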

Identification of transcription factor families

We identified the transcription factor encoding transcripts by sequence comparison to known transcription factor gene families in PlantTFDB (Plant Transcription Factor Database). In total, 1332 putative transcription factor genes distributed in at least 18 families were identified, representing 3.7% of Pongamia transcripts (Supplementary Data 4). Genes encoding the C3H, HLH, MYB, bZIP and HB transcription factor families were abundantly represented, while the fewest transcripts were observed for the NAM, ARF, FHA, MADS and WRKY families (Fig. 2b). Further, we annotated and analysed transcription factors exclusive to the lipid biosynthetic pathway, which resulted in identification of genes belonging to the MYB, PLATZ, GRAS, MYB-related, bHLH8, CCAAT, G2-like and PHD transcription factor families.

Pathway mapping of transcripts by KEGG

Ortholog assignment and mapping of the contigs to biological pathways were performed using the KEGG Automatic Annotation Server (KAAS). All the contigs were compared against the KEGG database using BLASTX with a threshold bit-score value of 60 (default). KAAS assigned Enzyme Commission numbers to 2784 contigs, which were mapped to their respective pathways (Supplementary Data 5). Among the mapped contigs, 1709 were identified as genes involved in metabolic pathways of major biomolecules such as carbohydrates (306, 11%), amino acids (252, 9%), lipids (177, 6.3%), nucleotides (150, 5.3%), cofactors and vitamins (130, 4.6%), glycans (71, 2.5%) and terpenoids (74, 2.5%). The KEGG pathway analysis also showed that 186 and 69 contigs represent energy metabolism and secondary metabolite metabolism respectively. A total of 247 transcripts that represent enzymes involved in carbon metabolism, fatty acid metabolism, degradation of aromatic compounds, biosynthesis of amino acids and 2-oxocarboxylic acid metabolism were also identified. Further, the mapped contigs also represented genes involved in genetic information processing, including translation (12.3%), folding, sorting and degradation (9.5%), transcription (5.7%) and replication and repair (4.5%). Cellular processes (transport and catabolism, cell motility, cell growth and death, cell communication) and environmental information processing (membrane transport, signal transduction, signalling molecules and interaction) are other minor groups represented in the KEGG annotation of Pongamia. The KAAS analysis also represented genes involved in biosynthesis of karanjin, ansamycin and siderophore, thus substantiating the insecticidal, medicinal and anti-bacterial properties of Pongamia respectively (Supplementary Data 5). Further, we studied genes involved in the following major metabolic events, which have a prominent role in improving yield and oil quality related traits in Pongamia.

Genes involved in circadian rhythms

Our transcriptome data represented 25 key genes, including TFs, that are involved in the regulation of floral meristem identity, photoperiod and vernalization pathways (Table 1). The majority of the genes had sequence homology with G. max, followed by M. truncatula, which shows the evolutionarily conserved relationship between legumes. The phylogenetic relationship of Pongamia flowering genes with other related organisms was deduced (Fig. 3a). ORF sequence analysis revealed the complete protein coding sequence of all the genes, and the polypeptide information is presented in Table 1. Genes like Flowering locus T (FT), GIGANTEA (GI), Chalcone synthase (CS), PRR1/TOC1, PRR5 and PRR7 represented in this study are known to promote flowering through photoperiod regulation under long day conditions. The Pseudo-receiver (PR) domain at the N-terminal region and the CCT domain at the C-terminal region, which are characteristic conserved domains of the PRR protein family, were identified in Pongamia by comparison with other related organisms (Fig. 3b). Other genes like PTL, SPT, APT, CAU and AGA, which encode TFs, play a key role in defining floral meristem identity. The transcriptome also represented COP1, SPA1, PHYA, PHYB and CK2A genes, which inhibit the flowering process by repressing the CONSTANS (CO) gene during prolonged cold periods through vernalization responses.

Table 1 Flowering and circadian rhythms related genes.

Spatial and temporal profiling revealed the diurnal nature of clock genes

To understand the diurnal behaviour of circadian clock genes and expression trends of floral homeotic genes, we quantified the expression of key flowering pathway genes in leaf tissues collected at four different time points in a day (6, 12, 18 and 24 h) as well as in inflorescences at four different stages (10, 20, 30 and 40 days after flowering, DAF; Stage 1, 2, 3 and 4 respectively) in a field grown Pongamia plant (Fig. 4a). We selected photoperiod pathway genes, vernalisation genes and certain crucial transcription factors which act as floral homeotic proteins, and a heatmap was constructed based on their expression profiles (Fig. 4b). The genes were divided into three clusters based on their expression profiles. (i) Circadian clock genes that function in initiation of flowering through the photoperiod pathway: PRR1, PRR5 and PRR7 showed time dependent regulation in leaves, with significant up regulation in the morning and decreased expression during dusk (Fig. 4b). In contrast, ELF3, which is an evening gene, showed peak expression during 18–24 h. GI, which acts upstream of FT, is a major component of leaf generated mobile florigen and showed coordinated expression with FT, with peak expression observed at 6 and 12 h of the day. At the same time, both genes showed minimal expression in inflorescence at all stages. (ii) Circadian clock genes that repress the flowering process: the COP1-SPA1 complex, which operates in dark conditions to degrade CO protein by polyubiquitination, showed downregulation as daylight progressed. PHYA, PHYB and CK2A were expressed only during midday in leaves but showed significant up regulation in earlier stages of inflorescence (Fig. 4b). (iii) TFs which operate in the process of flower development and photoperiod: APT, PTL, AGA, CAU and SPT showed significant and constitutive expression in all stages of inflorescence. The expression of PTL, SPT and APT peaked significantly at stage 2 of inflorescence, whereas CAU and AGA showed higher expression at stage 1. MYB75 TF expression was significantly high in flowers and roots compared to leaf tissue. LHY, which is a MYB-related TF that binds to the promoter region of PRR1/TOC1, thereby repressing its expression to postpone the flowering process, showed a basal level of expression in leaves and significant up regulation in flowers (Fig. 4b).

Genes involved in membrane and storage lipid metabolism

A total of 203 transcripts corresponding to 136 unigenes were identified and grouped into 14 categories of various lipid metabolic pathways (Supplementary Table S3). The KEGG annotation sorted out 264 unigenes representing all essential enzymes involved in fatty acid biosynthesis, elongation and degradation as well as glycerolipid and phospholipid metabolism. From our data, 41 genes mapped to transcripts at two different loci, suggesting the diploid nature of Pongamia, and 20 genes had duplicate contigs at the same locus, suggesting gene duplication and the presence of paralogs. The overall lipid metabolism is an interplay between carbohydrate and fatty acid stoichiometric profiles and can be viewed as three major events: (i) pyruvate to acetyl CoA synthesis, (ii) FA synthesis from acetyl CoA and (iii) TAG assembly and degradation. Our transcriptome data represented all the major rate limiting enzymes involved in these three categories (Table 2). We also analysed the homology of Pongamia transcripts with the respective organisms having maximum sequence coverage. ORF analysis indicated that the putative Pongamia transcripts had full length sequences, and the polypeptide information (mass and pI) of translated putative transcripts suggested that the proteins involved in the oil biosynthetic pathway are functional mostly at basic pH (Table 2). We further searched for orthologs of PpFAD8, PpFAD2, PpFAD6 and PpSAD to establish the phylogenetic relationship of Pongamia with other legumes, oil bearing trees and crop plants. Our data showed that PpFAD6 and PpSAD are part of small clades consisting of the respective genes from Glycine max and Glycine soja (Fig. 5a). Intriguingly, PpFAD8 grouped with neither legumes nor biofuel trees and formed a separate clade, indicating its distinct evolutionary conservation (Fig. 5a). The phylogeny of certain key flowering related genes of Pongamia was also established.

A considerable number of transcripts were also found for arachidonic acid (4%) and linoleic acid (2%) metabolism in our transcriptome data. Besides lipids for oil biosynthesis, enzymes involved in membrane lipid biosynthesis, including glycerolipid metabolism, glycerophospholipid metabolism, sphingolipid metabolism, steroid biosynthesis, ether lipid metabolism, and cutin, suberin and wax synthesis, were also identified in our data, which corroborates that the transcriptome has covered all the genes involved in lipid biosynthesis and signifies the depth of the sequencing (Supplementary Table S3).

Table 2 Genes involved in lipid biosynthesis.

Profiling of lipid biosynthetic genes revealed stage specific FA synthesis during seed development

Our data on seed oil accumulation showed that Pongamia accumulates oil actively after 210 DAF and that accumulation progresses with pod development until maturity (300 DAF) (Fig. 5b). Oil red staining of total lipids in the seed sections also showed a gradual increase in the lipid content of seeds during the four stages (Fig. 5c). Further, mRNA expression levels of genes coding for oil biosynthetic enzymes were studied during four stages of seed development: mature green pod stages (210 DAF, 240 DAF and 270 DAF; stage 1, 2 and 3 respectively) and the late dark brown pod stage (300 DAF; stage 4). Expression levels were normalised to the immature green pod stage (150 DAF) (Fig. 5d). Expression profiles of all the genes are shown in a schematic representation of lipid metabolism (Fig. 6). Based on their expression levels at the four stages, the lipid biosynthetic genes were categorised into four groups: (a) those expressed in a bell shaped manner, which include MAT, KASII, KASIII, KAR, HAD, EAR, LPAT and PAP; these genes showed peak expression during the 2nd and 3rd stages and decreased thereafter; (b) those which showed gradually decreasing expression as development progressed: FATB and PDAT; (c) those which showed increasing expression as the pod developed: Thiolase, HDH, ECH and ACD, which are mainly involved in β-oxidation; and (d) those expressed constantly throughout pod development, like ACC, LACS and DGAT (Fig. 6).
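The four expression groups described above can be approximated by a crude rule based on where the expression peak falls across the four stages. The sketch below is only an illustration of that grouping logic: the function name, the 25% tolerance used to call a profile constant and all numerical profiles are hypothetical and are not values from the study.

```python
def classify_profile(values, tolerance=0.25):
    """Assign a four-stage relative expression profile to one of four groups.

    values: relative expression at stages 1-4 (normalised to a reference stage).
    tolerance: assumed maximum relative spread for a profile to count as constant.
    """
    if len(values) != 4:
        raise ValueError("expected one value per stage (4 stages)")
    low, high = min(values), max(values)
    # Nearly flat profile: the spread is small relative to the maximum.
    if high - low <= tolerance * high:
        return "constant"
    peak_stage = values.index(high)  # 0-based index of the first maximum
    if peak_stage == 0:
        return "decreasing"
    if peak_stage == 3:
        return "increasing"
    return "bell-shaped"  # peak at stage 2 or 3


# Hypothetical profiles, one per group.
toy_profiles = {
    "geneA": [1.0, 3.0, 3.2, 1.1],
    "geneB": [4.0, 3.0, 2.0, 1.0],
    "geneC": [1.0, 1.5, 2.5, 4.0],
    "geneD": [2.0, 2.1, 1.9, 2.0],
}
for gene, profile in toy_profiles.items():
    print(gene, classify_profile(profile))
```

A peak-position rule like this ignores measurement error, so in practice group membership would be checked against replicate variation before drawing conclusions.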

Discussion

Next generation sequencing technologies and bioinformatics tools enable assembly and annotation of short reads into expressed sequence data, particularly for non-model organisms without a known reference. In this study, using the Illumina NextSeq platform, we characterized the whole transcriptome of a non-model legume biofuel tree, Pongamia, for which sequence data in the public databases have so far been limited. Genes related to lipid biosynthesis, the flowering cycle and flavonoid biosynthesis were emphasised, and the transcript information was further used to understand the temporal expression patterns of oil biosynthetic and flowering related genes in Pongamia. The total RNA from the four tissues was pooled and normalized cDNA was synthesised, which markedly reduces the frequency of abundant transcripts and increases the rate of recovery of unique transcripts²³. Upon sequencing and de novo assembly, we retained 92% of the sequenced raw reads after stringent filtering for annotation into functional transcripts belonging to crucial metabolic pathways of leaf, flower and seed. The average length of Pongamia unigenes (937.58 bp) was greater than those reported in other related species like chickpea (523 bp), peanut (619 bp) and alfalfa (803 bp), as well as in a recent report on Pongamia (787 bp), but shorter than that of Camelina (1198 bp)^15,24,25,26. The GC content (ratio of guanine and cytosine), which ranges from 20 to 72% among different organisms, is an important criterion for establishing the phylogenetic and evolutionary relationships among various species. Our analysis revealed that the average GC content of Pongamia transcripts (48%) was slightly lower than that of C. sativa (49%) and higher than that of J. curcas (43%) and of the other Pongamia report (44.77%), which reflects the complexity and diversity of transcriptome sequencing^15,21. Parameters such as the mean length of unigenes, GC content and N50 value of the present data show the increased coverage and depth of the sequencing.

Annotation of transcripts to UniProt resulted in identification of putative proteins which corresponded to various metabolic pathways. In the current study, we outlined SSR markers identified in Pongamia, which act as an important resource in gene mapping and marker assisted molecular breeding. More emphasis on the development, characterization and validation of Pongamia EST-SSR markers was given in a recent study²¹. Certain transcription factors identified in the current study were reported to play an important role in regulation of gene expression in various metabolic and signalling pathways like fatty acid biosynthesis (MYB, PLATZ), elongation (MYB-related, bHLH), palmitoleate biosynthesis (MYB), oleate biosynthesis (bHLH, GRAS), stearate biosynthesis (G2-like) and fatty acid degradation (PHD, CCAAT)¹⁵. Interestingly, the MYB and MYB-related transcription factors identified in this study are involved in regulation of circadian rhythms and flowering.

Flowering, which is regulated by circadian rhythms, determines production of seeds and yield of the plant. Circadian clock genes integrate the environmental signals required for flowering and also help in adaptation of plants to different geographical locations⁴. It is of great significance to know the sequence information of genes involved in circadian rhythms and floral transitions in order to understand and alter the flowering cycle in Pongamia. Recently, Winarto et al.⁴ reported four circadian clock genes (ELF4, LCL1, PRR7 and TOC1) in Pongamia which are key regulators of the central oscillator and showed that they were under diurnal regulation. Here, we report 25 other crucial genes, including TFs, whose sequence information is not available for Pongamia in the public databases. The clock gene ELF3 is known to form an evening complex (EC) with ELF4 and LUX, thus generating circadian rhythms and regulating output pathways such as flowering^27,28,29,30. The peak expression of ELF3 during dusk in leaves of Pongamia is in accordance with the ELF4 expression observed in a previous study⁴. PRR1, PRR3, PRR5, PRR7 and PRR9 are members of the PRR gene family and have important roles in the central oscillator³¹. The presence of highly conserved PR and CCT domains in the putative Pongamia PRR proteins implies a role similar to that in G. soja and Arabidopsis, repressing LCL1 expression in the central oscillator^32,33. LHY and MYB75 are MYB-like transcription factors that play pivotal roles in the morning loop of the central oscillator. These transcription factors belong to the REVEILLE (RVE) family, which consists of 11 proteins with a conserved MYB-like domain³⁴. Since the MYB domain is known for DNA-binding, these transcription factors could play an important role in the DNA-binding activity of Pongamia LCL1. In Pongamia, MYB was actively expressed in all stages of inflorescence development, which could be attributed to the anthocyanin metabolism that gives the characteristic colour to the flowers.

The GI-CO-FT-APT model of the signal cascade for floral initiation under long day conditions is well established in Arabidopsis³⁵. GI protein represses the CYCLING DOF FACTORs (CDFs), thereby allowing the expression of CO protein during late day, which eventually activates FT expression⁵. The peak expression of Pongamia GI and FT in the evening, as observed in Arabidopsis, corroborates the view that the photoperiod pathway is an ancient and conserved pathway for controlling flowering. Involvement of the CO-FT module in the control of photoperiodic flowering has also been described in garden pea, sugar beet and woody species such as poplar, where this regulatory module has been proposed to mediate other photoperiodic responses such as growth cessation and bud set^36,37,38.

Positional cloning and mutation studies on clock genes have provided substantial evidence for the role of transcripts showing circadian rhythms in regulating grain yield, grain weight, number of grains per panicle and flowering time in many cereals^39,40,41. In legumes too, a significant number of transcripts, including genes involved in protein synthesis, fatty acid synthesis, lipid metabolism and photosynthesis, show circadian rhythms, suggesting potential roles of the circadian clock in flower opening, nectar secretion, seed composition and development^42,43. Pongamia, which is an outcrossing species pollinated by insects, starts flowering after 3 to 4 years, and seed maturation takes about 10 months after flowering. The information provided in this study about circadian clock genes will provide a substantial basis for studies on modulating flowering time to obtain a shorter vegetative period and a prolonged reproductive stage, leading to an extended period of seed production.

Pongamia is believed to contribute to biodiesel production through its ability to biosynthesize and accumulate considerable amounts of unsaturated triacylglycerols (TAGs) in seeds. In this study, the transcripts involved in lipid metabolism were annotated and further analysed to understand oil accumulation and degradation in the seeds of Pongamia, which are of great interest for biofuel production. Pongamia takes 9–10 months to form a mature pod after fertilization of the flower. Initial pod and seed development proceeds at a slow pace, with negligible oil content and poor development of the cotyledons. From 175 DAF, cotyledon development and oil biosynthesis proceed at a constant pace until maturity (300 DAF). Many other Pongamia accessions from different geographical locations have shown similar patterns of oil accumulation during seed development^20,44. During FA biosynthesis, plastidial acetyl-CoA and malonyl-CoA are converted into long-chain acyl-ACP by a series of reactions involving enzymes that use ACP as a cofactor. Carboxylation of acetyl-CoA to malonyl-CoA is the first committed step in FA synthesis; it is catalysed by a multi-subunit acetyl-CoA carboxylase (ACCase) complex and in turn limits oil accumulation in the seeds. Our data represented three subunits of ACCase: alpha carboxyltransferase (CTA), beta carboxyltransferase (CTB) and biotin carboxylase (BC), as well as a homomeric isoform. The transcript for the homomeric isoform was absent in previous reports on Pongamia, Jatropha and peanut. qPCR analysis showed that these three genes exhibited a coordinated and stable expression pattern throughout seed development, which is consistent with previous reports on Arabidopsis, B. napus and R. communis^45,46. The subsequent formation of plastidial malonyl-ACP from malonyl-CoA is catalysed by malonyl-ACP-transferase (MAT), which showed maximum expression at stages 2 and 3 and decreased thereafter towards the end of seed development.
The activity of ketoacyl-ACP reductase (KAR), a component of the fatty acid synthase (FAS) multiprotein complex, is essential for FA biosynthesis and catalyses the NADPH-dependent reduction of 3-ketoacyl-ACP to the corresponding 3-hydroxyacyl-ACP. Another key enzyme, enoyl-ACP-reductase (EAR), plays a determinant role in establishing the rate of FA biosynthesis⁴⁷. KAR and EAR, together with HAD and KAS-II, showed a coordinated expression pattern in which the genes were upregulated at all stages of seed development but showed a downward trend during maturation of the seed (Fig. 6). A similar bell-shaped expression pattern of FAS genes was also observed in Jatropha seeds⁴⁸. The enzymes SAD, FAD6 and FAD8 biosynthesise oleic acid, linoleic acid and linolenic acid, respectively, and are crucial for an ideal biofuel feedstock. PpFAD8 is the most abundant transcript in our transcriptome data, followed by SAD, when compared to other lipid biosynthetic enzymes, suggesting the potential of Pongamia for unsaturated FA synthesis. However, the expression levels of SAD during seed development were higher than those of any other enzyme involved in FA synthesis, which could be attributed to the low catalytic efficiency of SAD associated with the high oleic acid content in Pongamia⁴⁵. Further studies are needed to understand gene regulation at the promoter level and to functionally characterize the PpFAD8 protein, which would provide important clues about oil accumulation patterns in Pongamia seeds. In addition to PpFAD8, other enzymes involved in the biosynthesis of unsaturated fatty acids, including PpFAD2, PpFAD6 and SAD, could be potential targets for genetic engineering to improve oil quality and quantity in Pongamia, and their sequence information can be deduced from our transcriptome data.

The transcripts encoding the two acyl-ACP thioesterases that terminate plastid FA synthesis, FATA (responsible for unsaturated FA production) and FATB (for saturated FA production), showed varied expression patterns during the four stages (Fig. 6). The expression of FATA increased significantly from stage 2 to stage 4, while that of FATB decreased after stage 2. This is in agreement with greater plastid production of unsaturated than saturated FAs in Pongamia seeds. However, in Jatropha seeds, FATA expression was at its peak during late developmental stages. Palmitic acid and stearic acid, which are major constituents of cell membranes, also play an important role in the development of cotyledons, which are usually active during 120–210 DAF in Pongamia. Our data clearly indicated that the expression of biosynthetic genes for saturated FAs followed a typical bell-shaped pattern: it increased during stage 1, was stable during the second and third stages and decreased slightly at stage 4 of seed development. Towards maturity, more unsaturated FAs were synthesized, as evidenced by our data on FATB and FATA expression levels and supported by previous findings on oil content in Pongamia at various seed developmental stages²⁰. The free FAs generated by thioesterases in the plastid are esterified to CoA by long-chain acyl-CoA synthetases (LACS) at the plastid envelope. PpLACS showed a consistently high level of expression during all four stages when compared to 150 DAF, suggesting that oil accumulation was accelerated at 150 DAF. After FA synthesis, a series of membrane-associated reactions assembles the acyl chains into TAG. Glycerol-3-phosphate acyltransferases (GPAT) catalyse sn-1 acylation of glycerol-3-phosphate to yield lysophosphatidic acid (LPA). The second acylation in de novo TAG assembly is catalysed by LPA acyltransferase (LPAT).
In Pongamia, the genes involved in TAG assembly, including GPAT, LPAT and PAP, were mostly expressed during stages 2–4, in which the maximum oil accumulation was recorded in our study. However, considerable expression was also noticed during stage 1, which should account for membrane lipid biosynthesis during cotyledon development. The final step in TAG biosynthesis is the acylation of diacylglycerol (DAG) to form TAG. Depending on the acyl donor to DAG, two classes of enzymes, diacylglycerol acyltransferases (DGAT) and phospholipid:diacylglycerol acyltransferases (PDAT), can catalyse this crucial step of TAG synthesis⁴⁹. Our results on DGAT and PDAT expression patterns also demonstrate the active involvement of DGAT in Pongamia TAG assembly. Further, the expression patterns of genes involved in β-oxidation revealed that fatty acid degradation in Pongamia seeds was active during the early stages of seed development, which could presumably be responsible for cotyledon development (Fig. 6).

In conclusion, the transcriptome of Pongamia pinnata seed, along with leaf, pod and flower tissues, was sequenced and assembled to maximize the representation of genes associated with flowering and lipid biosynthesis. Our data have led to the identification of transcripts and transcription factors involved in various physiological processes and metabolic pathways, which will add ample information to the database on Pongamia and also aid functional and comparative genomic studies to improve oil- and seed-yield-related traits. The GI-CO-FT signalling cascade was found to be active in regulating photoperiodic control of flowering in Pongamia. The expression patterns of lipid biosynthetic genes at different developmental stages revealed ACCase, SAD and FAD8 as candidate genes during seed maturity and clearly showed that 270–300 DAF was the optimum time for seed harvesting. In summary, our results provide an insight into the complex metabolic pathways and regulatory networks active in different tissues of Pongamia.

Methods

Plant material

A P. pinnata plantation was established in the experimental farm of Tree Oils India Limited (TOIL), Zaheerabad, Medak district, Andhra Pradesh (latitude 17°36′N; longitude 77°31′E; 622 m MSL). High-quality, disease-free seeds of nearly 600 accessions were collected from various regions of India and planted in the farm. After the plants had attained the reproductive phase, those that did not flower for two years were removed from the farm and the remaining plants were assessed for their yield potential for three consecutive years. The highest-yielding variety (TOIL 1) was selected as the experimental plant for the current study. Leaves, flowers, pods and seeds at 210 DAF were collected and snap frozen under aseptic conditions for transcriptome sequencing.

For gene quantification studies, five-year-old plants of the Pongamia accession TOIL 1 that were actively flowering were selected. Leaf samples at four time points of the day (00:00, 06:00, 12:00 and 18:00 h) and flowers at four developmental phases of the inflorescence were collected and stored at −80 °C until further use. Seeds at four developmental stages were collected from the same accession of Pongamia to quantify lipid biosynthetic genes.

Transcriptome sequencing

Total RNA was isolated from leaves, flowers, pods and seeds of Pongamia using the Agilent plant RNA isolation kit (Agilent Technologies, USA). The concentration, integrity and purity of RNA were checked with an Agilent 2100 Bioanalyzer (Agilent Technologies, USA). Samples with an RNA integrity number (RIN) greater than 8 were used for library preparation. Paired-end cDNA libraries were prepared according to the Illumina TruSeq RNA library protocol outlined in the “TruSeq RNA Sample Preparation Guide”. Briefly, 1 μg of total RNA was subjected to poly(A) purification of mRNA. Purified mRNA was fragmented for 4 minutes at 94 °C in the presence of divalent cations and reverse transcribed with SuperScript III reverse transcriptase by priming with random hexamers (Invitrogen, USA). Second-strand cDNA was synthesized in the presence of DNA Polymerase I and RNase H. The cDNA was cleaned up using Agencourt AMPure XP SPRI beads (Beckman Coulter, USA). Illumina adapters were ligated to the cDNA molecules after end repair and addition of an A base. SPRI clean-up was performed after ligation. The library was amplified using 8 cycles of PCR to enrich adapter-ligated fragments. The prepared library was quantified using a NanoDrop and validated for quality by running an aliquot on a High Sensitivity Bioanalyzer Chip (Agilent Technologies, USA). The constructed cDNA library was sequenced on an Illumina NextSeq 500 sequencer. RNA-Seq data were generated in FastQ format.

Transcriptome assembly, annotation and analysis

Sequencing generated 76-nucleotide raw reads with attached adapter sequences. These raw reads were filtered through the standard Illumina pipeline. The filtered reads were then subjected to quality control using NGS QC Toolkit v2.3.1 to remove adapters, B-blocks and low-quality bases towards the 3′ ends⁵⁰. The high-quality filtered reads were assembled de novo with Velvet 1.2.10, and Oases 0.2.08 was used for transcript generation^51,52. Genes/proteins were predicted from the assembled transcripts using GeneMark software⁵³.
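The 3′ quality-trimming step described above can be illustrated with a minimal Python sketch. This is not the NGS QC Toolkit algorithm; it only shows the general idea of removing low-quality bases from the 3′ end of a read. The function name `trim_3prime`, the Phred cut-off of 20 and the Phred+33 encoding are assumptions made for illustration and are not stated in the study.

```python
from typing import Tuple


def trim_3prime(seq: str, qual: str, min_q: int = 20, offset: int = 33) -> Tuple[str, str]:
    """Trim bases from the 3' end while their Phred score is below min_q.

    min_q and offset (Phred+33) are illustrative assumptions.
    """
    end = len(seq)
    # Walk backwards from the 3' end until a base meets the quality cut-off.
    while end > 0 and ord(qual[end - 1]) - offset < min_q:
        end -= 1
    return seq[:end], qual[:end]


# Hypothetical read: 'I' encodes Q40, '#' encodes Q2 and '!' encodes Q0.
trimmed_seq, trimmed_qual = trim_3prime("ACGTACGTAC", "IIIIIIII#!")
print(trimmed_seq, trimmed_qual)
```

In this sketch only a trailing run of low-quality bases is removed; low-quality bases inside the read are left untouched.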

MEGA7 was used to construct phylogenetic trees, with Clustal W alignment and neighbour-joining analysis, from the known amino acid sequences of all targeted genes and the deduced amino acid sequences from Pongamia⁵⁴.

Pathway analysis and identification of transcription factors

After assembly and clustering, transcripts were annotated by BLASTX analysis at an e-value cut-off of 10⁻⁵ against the UniProt-Papilionoideae database⁵⁵. Blast2GO was used to assign GO (Gene Ontology) terms to transcripts on the basis of the best significant match with proteins from members of Papilionoideae, giving a broad overview of their functions, and the terms were categorized into biological process, molecular function and cellular component. In addition, KOG (Eukaryotic Orthologous Groups) was used to identify transcript homologues from other organisms and thus assign a probable function to transcripts. KAAS (KEGG (Kyoto Encyclopedia of Genes and Genomes) Automatic Annotation Server) was used for metabolic pathway analysis, with Arabidopsis thaliana, Arabidopsis lyrata and Glycine max as reference organisms, to identify the enriched metabolic pathways in various gene sets⁵⁶. The transcripts were categorized into various transcription factor (TF) families using the Plant Transcription Factor Database (PlantTFDB)⁵⁷.
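As an illustration of the e-value filtering and best-match selection described above, the following Python sketch parses BLAST tabular output and keeps, for each transcript, the highest-scoring hit at or below the 10⁻⁵ cut-off. It assumes the standard 12-column tabular format (`-outfmt 6`); the output format actually used in the study is not stated. The function name `best_hits` and the transcript and protein identifiers are hypothetical.

```python
import csv
import io
from typing import Dict, Tuple


def best_hits(tabular_text: str, max_evalue: float = 1e-5) -> Dict[str, Tuple[str, float, float]]:
    """Return {query: (subject, evalue, bitscore)} for the best hit per query.

    Assumes 12-column BLAST tabular output: column 11 is the e-value and
    column 12 is the bit score.
    """
    best: Dict[str, Tuple[str, float, float]] = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query, subject = row[0], row[1]
        evalue, bitscore = float(row[10]), float(row[11])
        if evalue > max_evalue:
            continue  # hit does not pass the e-value cut-off
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, evalue, bitscore)
    return best


# Hypothetical hits: tx1 has two significant hits, tx2 has none.
example = "\n".join([
    "\t".join(["tx1", "protA", "90.0", "100", "10", "0", "1", "300", "1", "100", "1e-30", "200"]),
    "\t".join(["tx1", "protB", "80.0", "100", "20", "0", "1", "300", "1", "100", "1e-10", "90"]),
    "\t".join(["tx2", "protC", "40.0", "50", "30", "0", "1", "150", "1", "50", "0.5", "25"]),
])
print(best_hits(example))
```

Ranking by bit score is one common way to define the "best" match; ranking by e-value would be an equally reasonable choice.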

SSR marker identification

The percentage compositions of the nucleotides A, T, G and C were calculated for each sequence and across the entire distribution of transcripts. Simple Sequence Repeats (SSRs) were detected using the MIcroSAtellite (MISA) tool, considering 100 bp of flanking sequence upstream and downstream of each SSR.
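The two calculations in this section can be sketched in Python as follows. This is a simplified regular-expression search, not the MIcroSAtellite tool itself. The minimum repeat counts in `MIN_REPEATS` are assumptions, since the thresholds used in the study are not given, and the function names are hypothetical. The optional `flank` argument keeps only SSRs with at least that many bases on both sides, mirroring the 100 bp flanking requirement.

```python
import re
from collections import Counter
from typing import Dict, List, Tuple

# Assumed minimum repeat counts per motif length (mono- to hexanucleotide).
MIN_REPEATS: Dict[int, int] = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def base_composition(seq: str) -> Dict[str, float]:
    """Percentage of A, T, G and C among the unambiguous bases of seq."""
    counts = Counter(seq.upper())
    total = sum(counts[b] for b in "ATGC")
    return {b: 100.0 * counts[b] / total for b in "ATGC"} if total else {}


def find_ssrs(seq: str, flank: int = 0) -> List[Tuple[str, int, int, int]]:
    """Return (motif, repeat_count, start, end) with 1-based inclusive coordinates."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(MIN_REPEATS.items()):
        # A motif followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(2)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT").
            if any(motif == motif[:k] * (motif_len // k)
                   for k in range(1, motif_len) if motif_len % k == 0):
                continue
            # Require the requested amount of flanking sequence on both sides.
            if m.start() < flank or len(seq) - m.end() < flank:
                continue
            hits.append((motif, len(m.group(1)) // motif_len, m.start() + 1, m.end()))
    return hits


# Hypothetical transcript fragment with an (AT)7 and an (AAG)5 repeat.
fragment = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "TT"
print(base_composition("AATG"))
print(find_ssrs(fragment))
```

Calling `find_ssrs(sequence, flank=100)` would restrict the output to SSRs that leave room for primer design on both sides, which is the usual reason for requiring flanking sequence.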

Real-time PCR analysis

Seeds, leaves, roots and flower tissues were collected in triplicate and total RNA was isolated using the Spectrum Plant Total RNA isolation kit (Sigma, USA). 1 μg of RNA was used for cDNA synthesis with the RevertAid first strand cDNA synthesis kit (Thermo-Fisher Scientific, USA). qRT-PCR was performed on an Eppendorf thermal cycler using SYBR FAST qPCR universal master mix (2X) (KAPA Biosystems, USA). Each reaction contained 1 μl of the first-strand cDNA as template in a total reaction volume of 10 μl. The list of genes, primer sequences and melting temperatures used in this study is given in Supplementary Tables S4 and S5. The amplification program consisted of 95 °C for 30 s followed by 35 cycles of 95 °C for 5 s and 55 °C for 30 s. The relative expression was calculated using the formula 2^−ΔΔCt, with actin as the housekeeping gene for normalisation of data⁵⁸. The fold change values were log transformed with base 2, so that a 1.5-fold change, which corresponds to 0.58, was used as the threshold to identify differentially expressed genes.
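The relative expression calculation can be written out as a short Python sketch. The function names and Ct values are hypothetical, and treating the 0.58 threshold as symmetric (applying to both up- and down-regulation) is an assumption that is not stated explicitly in the text.

```python
import math


def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Fold change by the 2^-ddCt method, normalised to a reference gene (e.g. actin)."""
    d_ct_sample = ct_target_sample - ct_ref_sample      # dCt in the sample of interest
    d_ct_control = ct_target_control - ct_ref_control   # dCt in the calibrator sample
    dd_ct = d_ct_sample - d_ct_control
    return 2.0 ** (-dd_ct)


# log2(1.5) is about 0.585, the 0.58 threshold quoted in the text.
LOG2_THRESHOLD = math.log2(1.5)


def is_differential(fold_change: float) -> bool:
    """Assumed symmetric rule: |log2 fold change| at or above log2(1.5)."""
    return abs(math.log2(fold_change)) >= LOG2_THRESHOLD


# Hypothetical Ct values: target 22 vs actin 18 in the sample,
# target 24 vs actin 18 in the calibrator.
fold = relative_expression(22.0, 18.0, 24.0, 18.0)
print(fold, math.log2(fold), is_differential(fold))
```

With these example values ΔΔCt is −2, so the target is four-fold more abundant in the sample than in the calibrator.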

Quantification of oil

Oil was extracted from Pongamia seeds at four developmental stages by the Soxhlet extraction method using hexane as the solvent, as described in Kumar et al.⁵⁹. Briefly, seeds (5 g) were ground to a powder in a coffee grinder. Oil was extracted with 150 ml of hexane at distillation temperature for 2 to 3 hours in the Soxhlet extractor using a heating mantle. Hexane was removed from the extracted oil using a rotary evaporator (Heidolph 514-01002-06-0, Germany) at 55 °C under reduced pressure for 30 min.

Statistics

For qRT-PCR analysis, three independent biological replicates, each with three technical replicates, were used and the mean ± standard deviation (SD) values were calculated for each sample. The significance of differences was tested by Analysis of Variance (ANOVA) and pairwise comparisons were tested with the Holm-Sidak method; the level of significance was set to 0.05. Microsoft Excel 2013 was used for data processing. Statistical analysis was performed using SigmaPlot 11.0.
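The replicate summary and the Holm-Sidak step-down adjustment can be sketched in Python as below. This is an illustration only, not the SigmaPlot procedure: the ANOVA itself is omitted, the function names and p-values are hypothetical, and computing the SD across the means of the biological replicates (rather than across all nine measurements) is an assumption.

```python
from statistics import mean, stdev
from typing import List, Tuple


def summarise(replicates: List[List[float]]) -> Tuple[float, float]:
    """Mean and SD across biological replicates.

    Each inner list holds the technical replicates of one biological replicate;
    technical replicates are averaged first (an assumed convention).
    """
    bio_means = [mean(tech) for tech in replicates]
    return mean(bio_means), stdev(bio_means)


def holm_sidak(p_values: List[float]) -> List[float]:
    """Holm-Sidak step-down adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The smallest p-value is corrected for m tests, the next for m - 1, and so on.
        adj = 1.0 - (1.0 - p_values[idx]) ** (m - rank)
        running_max = max(running_max, adj)  # enforce monotonicity
        adjusted[idx] = min(1.0, running_max)
    return adjusted


# Hypothetical expression values and raw pairwise p-values.
print(summarise([[1.0, 1.0, 1.0], [2.0, 2.0, 2.0], [3.0, 3.0, 3.0]]))
adjusted = holm_sidak([0.01, 0.04, 0.03])
print([p < 0.05 for p in adjusted])
```

In the example only the first comparison remains significant at the 0.05 level after adjustment, although all three raw p-values are below 0.05.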

Additional Information

Accession Codes: The data has been submitted to NCBI Sequence Read Archive (SRA) and BioSample databases with BioSample accession number SAMN04212410 and BioProject Id PRJNA299718. The SRA project Id is SRP065225.

How to cite this article: Sreeharsha, R. V. et al. Unravelling molecular mechanisms from floral initiation to lipid biosynthesis in a promising biofuel tree species, Pongamia pinnata using transcriptome analysis. Sci. Rep. 6, 34315; doi: 10.1038/srep34315 (2016).

References

Arote, S. & Yeole, P. Pongamia pinnata L: a comprehensive review. Int. J. Pharm. Tech. Res. 2, 2283–2290 (2010).
Abou-Elwafa, S. F. et al. Conservation and divergence of autonomous pathway genes in the flowering regulatory network of Beta vulgaris. J. Exp. Bot. 62, 3359–3374 (2011).
Coupland, G. Regulation of flowering by photoperiod in Arabidopsis. Plant, Cell Environ. 20, 785–789 (1997).
Winarto, H. P. et al. Isolation and Characterization of Circadian Clock Genes in the Biofuel Plant Pongamia (Millettia pinnata). Bioenergy Res. 8, 760–774 (2015).
Jarillo, J. A. & Piñeiro, M. Timing is everything in plant development. The central role of floral repressors. Plant Sci. 181, 364–378 (2011).
Baud, S., Dubreucq, B., Miquel, M., Rochat, C. & Lepiniec, L. Storage reserve accumulation in Arabidopsis: metabolic and developmental control of seed filling. The Arabidopsis Book 6, e0113 (2008).
Bonaventure, G., Salas, J. J., Pollard, M. R. & Ohlrogge, J. B. Disruption of the FATB gene in Arabidopsis demonstrates an essential role of saturated fatty acids in plant growth. Plant Cell 15, 1020–1033 (2003).
Kachroo, A. et al. Oleic acid levels regulated by glycerolipid metabolism modulate defense gene expression in Arabidopsis. Proc. Natl. Acad. Sci. USA 101, 5152–5157 (2004).
Zheng, H., Rowland, O. & Kunst, L. Disruptions of the Arabidopsis enoyl-CoA reductase gene reveal an essential role for very-long-chain fatty acid synthesis in cell expansion during plant morphogenesis. Plant Cell 17, 1467–1481 (2005).
Petrie, J. R. et al. Metabolic engineering plant seeds with fish oil-like levels of DHA. PloS One 7, e49165 (2012).
Kang, J., Snapp, A. R. & Lu, C. Identification of three genes encoding microsomal oleate desaturases (FAD2) from the oilseed crop Camelina sativa. Plant Physiol. Biochem. 49, 223–229 (2011).
Kinney, A. & Clemente, T. Modifying soybean oil for enhanced performance in biodiesel blends. Fuel Process. Technol. 86, 1137–1147 (2005).
Meyer, E., Logan, T. L. & Juenger, T. E. Transcriptome analysis and gene expression atlas for Panicum hallii var. filipes, a diploid model for biofuel research. Plant J. 70, 879–890 (2012).
Fan, Z. et al. Genome-wide transcriptome profiling provides insights into floral bud development of summer-flowering Camellia azalea. Sci Rep 5, 9729 (2015).
Mudalkar, S., Golla, R., Ghatty, S. & Reddy, A. R. De novo transcriptome analysis of an imminent biofuel crop, Camelina sativa L. using Illumina GAIIX sequencing platform and identification of SSR markers. Plant Mol. Biol. 84, 159–171 (2014).
Wang, F. et al. Mining and identification of polyunsaturated fatty acid synthesis genes active during camelina seed development using 454 pyrosequencing. BMC Plant Biol. 15, 147 (2015).
Xu, C. et al. De novo and comparative transcriptome analysis of cultivated and wild spinach. Sci Rep. 5, 17706 (2015).
Huang, J. et al. Transcriptome characterization and sequencing-based identification of salt-responsive genes in Millettia pinnata, a semi-mangrove plant. DNA Res. 19, 195–207 (2012).
Kazakoff, S. H. et al. Capturing the biofuel wellhead and powerhouse: the chloroplast and mitochondrial genomes of the leguminous feedstock tree Pongamia pinnata. PloS One 7, e51687 (2012).
Pavithra, H., Gowda, B., Kumar, K. R., Prasanna, K. & Shivanna, M. Oil, fatty acid profile and karanjin content in developing Pongamia pinnata (L.) Pierre seeds. J. Am. Oil Chem. Soc. 89, 2237–2244 (2012).
Huang, J. et al. De novo sequencing and characterization of seed transcriptome of the tree legume Millettia pinnata for gene discovery and SSR marker development. Mol. Breed. 36, 1–15 (2016).
Tatusov, R. L. et al. The COG database: an updated version includes eukaryotes. BMC bioinformatics 4, 1 (2003).
Natarajan, P. & Parani, M. De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing. BMC genomics 12, 1 (2011).
Garg, R., Patel, R. K., Tyagi, A. K. & Jain, M. De novo assembly of chickpea transcriptome using short reads for gene discovery and marker identification. DNA Res. 18, 53–63 (2011).
Liu, Z. et al. Global transcriptome sequencing using the Illumina platform and the development of EST-SSR markers in autotetraploid alfalfa. PloS One 8, e83549 (2013).
Zhang, J. et al. De novo assembly and Characterisation of the Transcriptome during seed development and generation of genic-SSR markers in Peanut (Arachis hypogaea L.). BMC Genomics 13, 90 (2012).
Doyle, M. R. et al. The ELF4 gene controls circadian rhythms and flowering time in Arabidopsis thaliana. Nature 419, 74–77 (2002).
Khanna, R., Kikis, E. A. & Quail, P. H. EARLY FLOWERING 4 functions in phytochrome B-regulated seedling de-etiolation. Plant Physiol. 133, 1530–1538 (2003).
Nozue, K. et al. Rhythmic growth explained by coincidence between internal and external cues. Nature 448, 358–361 (2007).
Nusinow, D. A. et al. The ELF4-ELF3-LUX complex links the circadian clock to diurnal control of hypocotyl growth. Nature 475, 398–402 (2011).
Matsushika, A., Makino, S., Kojima, M. & Mizuno, T. Circadian waves of expression of the APRR1/TOC1 family of pseudo-response regulators in Arabidopsis thaliana: insight into the plant circadian clock. Plant Cell Physiol. 41, 1002–1012 (2000).
Makino, S. et al. Genes encoding pseudo-response regulators: insight into His-to-Asp phosphorelay and circadian rhythm in Arabidopsis thaliana. Plant Cell Physiol. 41, 791–803 (2000).
Strayer, C. et al. Cloning of the Arabidopsis clock gene TOC1, an autoregulatory response regulator homolog. Science 289, 768–771 (2000).
Carré, I. A. & Kim, J. Y. MYB transcription factors in the Arabidopsis circadian clock. J. Exp. Bot. 53, 1551–1557 (2002).
Roux, F., Touzet, P., Cuguen, J. & Le Corre, V. How to be early flowering: an evolutionary perspective. Trends Plant Sci. 11, 375–381 (2006).
Chia, T., Müller, A., Jung, C. & Mutasa-Göttgens, E. Sugar beet contains a large CONSTANS-LIKE gene family including a CO homologue that is independent of the early-bolting (B) gene locus. J. Exp. Bot. 59, 2735–2748 (2008).
Horvath, D. Common mechanisms regulate flowering and dormancy. Plant sci. 177, 523–531 (2009).
Weller, J. L. et al. Update on the genetic control of flowering in garden pea. J. Exp. Bot. 60, 2493–2499 (2009).
Beales, J., Turner, A., Griffiths, S., Snape, J. W. & Laurie, D. A. A pseudo-response regulator is misexpressed in the photoperiod insensitive Ppd-D1a mutant of wheat (Triticum aestivum L.). Theor. Appl. Genet. 115, 721–733 (2007).
Shaw, L. M., Turner, A. S. & Laurie, D. A. The impact of photoperiod insensitive Ppd‐1a mutations on the photoperiod pathway across the three genomes of hexaploid wheat (Triticum aestivum). Plant J. 71, 71–84 (2012).
Turner, A., Beales, J., Faure, S., Dunford, R. P. & Laurie, D. A. The pseudo-response regulator Ppd-H1 provides adaptation to photoperiod in barley. Science 310, 1031–1034 (2005).
Hudson, K. A. The circadian clock-controlled transcriptome of developing soybean seeds. The Plant Genome 3, 3–13 (2010).
Preuss, S. B. et al. Expression of the Arabidopsis thaliana BBX32 gene in soybean increases grain yield. PloS One 7, e30717 (2012).
Pavithra, H., Sagar, B. C., Prasanna, K., Shivanna, M. & Gowda, B. Localisation of Storage Reserves in Developing Seeds of Pongamia pinnata (L.) Pierre, a Potential Agroforestry Tree. J. Am. Oil Chem. Soc. 90, 1927–1935 (2013).
Troncoso‐Ponce, M. A. et al. Comparative deep transcriptional profiling of four developing oilseeds. Plant J. 68, 1014–1027 (2011).
Baud, S. & Lepiniec, L. Regulation of de novo fatty acid synthesis in maturing oilseeds of Arabidopsis. Plant Physiol. Biochem. 47, 448–455 (2009).
Niu, Y. et al. Global analysis of gene expression profiles in Brassica napus developing seeds reveals a conserved lipid metabolism regulation with Arabidopsis thaliana. Molecular plant 2, 1107–1122 (2009).
Gu, K. et al. Expression of fatty acid and lipid biosynthetic genes in developing endosperm of Jatropha curcas. Biotechnol. Biofuels 5, 47 (2012).
Chaitanya, B. S. K. et al. Stage-Specific Fatty Acid Fluxes Play a Regulatory Role in Glycerolipid Metabolism during Seed Development in Jatropha curcas L. J. Agric. Food Chem. 63, 10811–10821 (2015).
Patel, R. K. & Jain, M. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PloS One 7, e30619 (2012).
Schulz, M. H., Zerbino, D. R., Vingron, M. & Birney, E. Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics 28, 1086–1092 (2012).
Zerbino, D. R. & Birney, E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008).
Lomsadze, A., Burns, P. D. & Borodovsky, M. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm. Nucleic Acids Res. 42, e119–e119 (2014).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol Evol. 33, 1870–1874 (2016).
Camacho, C. et al. BLAST+: architecture and applications. BMC bioinformatics 10, 421 (2009).
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182–W185 (2007).
Jin, J., Zhang, H., Kong, L., Gao, G. & Luo, J. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res. 42, D1182–D1187 (2013).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2^−ΔΔCT method. Methods 25, 402–408 (2001).
Kumar, S., Chaitanya, B. S., Ghatty, S. & Reddy, A. R. Growth, reproductive phenology and yield responses of a potential biofuel plant, Jatropha curcas grown under projected 2050 levels of elevated CO2. Physiol. Plant. 152, 501–519 (2014).


Acknowledgements

The work was funded by DBT grant (BT/PR-12024/BCE/8/1097/2014) from Department of Biotechnology, Government of India. We greatly acknowledge Tree Oils India Limited (TOIL), Hyderabad, India for generously providing plant material for all our experiments. Thanks are due to Sandor Technologies, Hyderabad, India for library construction, sequencing and assembly. RVS and SM are thankful to UGC, New Delhi, India, for fellowship. KTS was supported by BBL fellowship, University of Hyderabad, India.

Author information

Sreeharsha Rachapudi V. and Mudalkar Shalini contributed equally to this work.

Authors and Affiliations

Department of Plant Sciences, University of Hyderabad, Hyderabad, 500046, India
Rachapudi V. Sreeharsha, Shalini Mudalkar, Kambam T. Singha & Attipalli R. Reddy

Authors

Rachapudi V. Sreeharsha
Shalini Mudalkar
Kambam T. Singha
Attipalli R. Reddy

Contributions

R.V.S., S.M. and A.R.R. designed the experiments. R.V.S., S.M. and K.T.S. performed experiments. R.V.S., S.M., K.T.S. and A.R.R. analysed data and discussed results. R.V.S., S.M. and A.R.R. wrote the manuscript. All authors revised the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/


About this article

Cite this article

Sreeharsha, R., Mudalkar, S., Singha, K. et al. Unravelling molecular mechanisms from floral initiation to lipid biosynthesis in a promising biofuel tree species, Pongamia pinnata using transcriptome analysis. Sci Rep 6, 34315 (2016). https://doi.org/10.1038/srep34315


Received: 10 May 2016
Accepted: 12 September 2016
Published: 28 September 2016
DOI: https://doi.org/10.1038/srep34315

