Elsevier

Gene

Volume 799, 5 October 2021, 145840
Gene

Research paper
Genome-wide identification and expression analysis of U-box gene family in wild emmer wheat (Triticum turgidum L. ssp. dicoccoides)

https://doi.org/10.1016/j.gene.2021.145840Get rights and content

Highlights

  • The U-box gene family was firstly identified in wild emmer wheat and other 3 triticum species.

  • Segmental duplication and polyploidization mainly contributed to the expansion of U-box family in wild emmer wheat.

  • Expression patterns and co-expression network of the TdPUBs were comprehensively investigated using RNA-seq samples and the stress-responsive candidates were obtained.

  • Genetic variation analysis of U-box genes based on the public resequencing data showed that significant genetic bottleneck has occurred during evolution process of tetraploid wheat.

Abstract

In this study, 82 U-box genes were identified in wild emmer wheat (TdPUBs) through a genome-search method. Phylogenetic analysis classified them into seven groups and the genes belonging to the same group shared the similar exon–intron structure, motif organization and cis-element compositions. Synteny analysis of the U-box genes between different species revealed that segmental duplication and polyploidization mainly contributed to the expansion of TdPUBs. Furthermore, the genetic variations of U-box were investigated in wild emmer, domesticated emmer and durum wheat. Results showed that significant genetic bottleneck has occurred during domestication process of tetraploid emmer wheat. Meanwhile, 12 TdPUBs were co-located with known domestication related QTLs. Finally, the tissue-specific and stress-responsive TdPUB genes were identified through RNA-seq analysis. Combined with qPCR validation of 19 salt-responsive TdPUBs, the candidates involving in salt response were obtained. It lays the foundation to better understand the regulatory roles of U-box family in emmer wheat and beyond.

Introduction

Ubiquitination is a conserved post-translational modification of proteins to regulate many cellular processes in vertebrates and higher plants, including cell division, cell death, hormone responses, biotic and abiotic stress responses (Zeng et al., 2008). The major effect of ubiquitination is to add the ubiquitin molecule with the size of 76-amino acid (AA) on the lysine residues of the target protein. In eukaryotic, the ubiquitin proteasome system (UPS) degrades the aberrant and short-lived proteins and then controls the protein loads of the cell, which contains in-concert catalytic activities of three types of enzymes, namely a large number of ubiquitin ligases (E3), together with a few ubiquitin-activating enzymes (E1) and ubiquitin-conjugating enzymes (E2) (Yan et al., 2003). In the process of ubiquitination, E1 is the activator to activate the ubiquitin molecule to form as a thioester bond through an ATP-dependent reaction at cysteine residue on E1, and then the ubiquitin is transferred to ubiquitin-conjugating enzyme E2. Furthermore, E2 functions as the transporter to transfer the activated ubiquitin to E3 to form the ubiquitin thioester, which finally mediates the ubiquitination of target proteins (Pickart, 2001, Finley and Chau, 1991). Among them, E3 ubiquitin ligases are the largest family and the different E3 ligases can regulate the system to emerge different functions. According to their structure, E3 ligases can be classified into HECT-type, RING-type and U-box-type (Callis, 2014).

U-box family is one group of E3 ubiquitin ligase proteins with U-box motif comprising of ~70 AA, which has a tertiary structure resembling of RING domain but loses the characteristic zinc-chelating cysteine and histidine residues to chelate the zinc (Ohi et al., 2003). Extensive studies have reported that U-box genes played the vital roles in regulating diverse developmental processes and stress signaling in plants (Azevedo et al., 2001). Up to date U box family has been widely identified in many plant species, including 64 members belonging to 7 groups in Arabidopsis (Wiborg et al., 2008), 77 in rice and 67 barley (Zeng et al., 2008, Ryu et al., 2019) as well as 91, 99 and 101 in banana, Brassica oleracea.L and B. rapa. L, respectively (Hu et al., 2018, Hu et al., 2019, Wang et al., 2015). Meanwhile, some plant U-box genes have been functionally validated. Wang et al found that mutation of AtPUB4 in Arabidopsis could result in male sterility due to sporophytic defect, suggesting AtPUB4 was a vital regulator in sporophyte development (Wang et al., 2013). AtPUB18 and AtPUB19 were found to coordinately function as regulatory components in development and stress response in Arabidopsis (Bergler, 2011). Furthermore; NtACRE276 was confirmed to have the E3 ligase activity and involved in cell death and defense signaling, and its ortholog in Arabidopsis (AtPUB17) and canola (BnARC1) showed similar biological function (Yang et al., 2006). In wheat, overexpression and RNAi-mediated knockdown of TaPUB1 have revealed that TaPUB1 could induce the expression of target genes to improve the antioxidant capacity under stress condition (Wang et al., 2020).

Wheat is one of the most important staple crops all over the world, which occupied around 17% of global cultivated lands and provided the 30% of global calorie consume (Shewry, 2009). However, its yield is dramatically reduced by abiotic stress, such as drought and high salinity, in particular under the challenge of climate change. As the ancestor donor of A and B genome of bread wheat, wild emmer wheat was considered as the indispensable invaluable genetic resource for wheat breeding due to its high adaptability to abiotic stress (Xie and Nevo, 2008, Nevo et al., 1993). It was found that wild emmer wheat had the extreme tolerant genotypes through comparison of the salt tolerance between wild emmer and durum wheat accessions (Feng et al., 2018). Thus, identification and mining of the gene associated with stress tolerance in wild emmer wheat holds the promise for underlying the mechanism underlying stress response and also breeding for elite varieties in emmer and bread wheat. Some studies have been carried out to identify the genes related to stress tolerance in wild emmer wheat. Chen et al. reported that TdCBL6 encoded a calcineurin B-like protein and overexpression of TdCBL6 improved the salt tolerance, which could regulate both Na + and K + uptake/translocation to keep ion homeostasis under salt stress conditions (Chen et al., 2015). An autophagy-related gene, TdAtg8, was found to be a positive regulator in osmotic and drought stress response (Kuzuoglu-Ozturk et al., 2012). However, few ubiquitin pathway related genes have been identified in wild emmer wheat at present.

The completion of its reference genome provides the opportunity to investigate the genomic organization and evolution dynamics of gene family in wild emmer wheat at the genomic level (Avni et al., 2017). At present, the detailed information of U-box gene family, especially in its roles in stress tolerance, has not been well studied in wild emmer wheat. Here, we performed an in-silico genome-wide search to identify U-box family in wild emmer (TdPUBs) using the updated reference genome information. Then, the phylogenetic relationship, chromosome localization, conserved domain of the identified TdPUBs were analyzed. Furthermore, the expression patterns of these TdPUBs in different tissues and under stress conditions were systematically investigated, and then 19 salt-responsive candidates were selected to validate their expression through qRT-PCR analysis. Finally, genetic variations of these TdPUBs were investigated based on resequencing data to reveal the selection effect during emmer wheat evolution. This was the first study to identify U-Box family in wild emmer, which not only provided the important candidate genes for further functional study, but also contribute to the evolution of this family in wheat and beyond.

Section snippets

Identification of U-box family genes in wild emmer wheat

The protein sequences of the wild emmer genome were downloaded from the Ensembl Plants database (ftp://ftp.ensemblgenomes.org/pub/plants/release-50/fasta/triticum_dicoccoides/pep/) as the local proteins data and then two methods were used for identifying U-box family. Firstly, the PFAM profile of U-box domain (PF04564) was downloaded from PFAM database (https://pfam.xfam.org/) and then used as the query to search against the wild emmer local proteins using the HMMER 3.0 tool with the threshold

Genome-wide identification of TdPUBs

Using the methods as descried in Materials and Methods, a total of 82 U-box ligase genes were detected in wild emmer genome. Since there is no standard nomenclature, the predicted U-box genes were then named as TdPUB based on their chromosome locations (Table 1). Sequence characteristic analysis showed that the length of TdPUBs ranged from 729 (TdPUB55) to 14,413 (TdPUB45) bp, and the average size of CDS and amino acid sequences were 1,724 bp and 574 aa, respectively. The PI value ranged from

Characteristics of U-box family members in wild emmer

In this study, we identified 82 U-box genes in the wild emmer genome through a genome-wide search method. Based on the conserved domain structure and phylogenetic tree, we classified these TdPUBs into seven groups. In Arabidopsis, PUB genes mainly contained UFD2, ARM, GKL domains (conserved Glycine (G), Lysine (K)/Arginine (R) residues and leucine-rich residues), serine/threonine kinase domains, WD40 repeats, tetratrico-peptide repeats (TRPs) and MIF4G-type domain and then grouped them

Conclusion

It is the first study to identify U-box family in wild emmer at the genome level. A total of 82 TdPUBs were obtained, which could be classified into 7 groups based on phylogenetic relationship. Then, exon–intron gene structure, conserved motifs as well as cis-element predication supported the classification and the member in the same group showed similar gene structures. Duplication and collinearity events revealed that segmental duplication contributed significantly to the expansion of U-box

Author Contribution

YG collected data and performed analysis. PWQ and LHB performed the RT-PCR experiments. GY and PY completed the visualization of the data. GY contributed to plant material collection. NXJ and CLC drafted the manuscript. WZY revised the manuscript. NXJ and SWN conceived this study and revised the manuscript. All authors read and approved the final manuscript.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its additional files.

Funding

This work was mainly funded by the National Natural Science Foundation of China (Grant No. 31971885 and 31771778), and partially supported by the Key Research and Development Program of Shaanxi Province, China (Grant No. 2019NY-014).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We thank the High-Performance Computing center of Northwest A&F University for providing computational resources in this work.

References (48)

  • J. Callis

    The ubiquitination machinery of the ubiquitin system

    Arabidopsis Book

    (2014)
  • M.D. Ohi et al.

    Structural insights into the U-box, a domain associated with multi-ubiquitination

    Nat. Struct. Biol.

    (2003)
  • J. Wiborg et al.

    Biochemical function of typical and variant Arabidopsis thaliana U-box E3 ubiquitin-protein ligases

    Biochem. J.

    (2008)
  • M.Y. Ryu et al.

    Classification of barley U-box E3 ligases and their expression patterns in response to drought and pathogen stresses

    BMC Genomics

    (2019)
  • H. Hu et al.

    Genome-wide identification and analysis of U-box E3 ubiquitin(-)protein ligase gene family in Banana

    Int. J. Mol. Sci.

    (2018)
  • D. Hu et al.

    Genome-wide distribution, expression and function analysis of the U-Box gene family in Brassica oleracea L

    Genes (Basel)

    (2019)
  • C. Wang et al.

    Genome-wide survey and expression analysis of the PUB family in Chinese cabbage (Brassica rapa ssp. pekinesis)

    Mol. Genet. Genomics

    (2015)
  • H. Wang et al.

    The Arabidopsis U-box/ARM repeat E3 ligase AtPUB4 influences growth and degeneration of tapetal cells, and its mutation leads to conditional male sterility

    Plant J.

    (2013)
  • Bergler J, Hoth S, Plant U-box armadillo repeat proteins AtPUB18 and AtPUB19 are involved in salt inhibition of...
  • C.-W. Yang et al.

    The E3 ubiquitin ligase activity of Arabidopsis PLANT U-BOX17 and its functional tobacco homolog ACRE276 are required for cell death and defense

    Plant Cell

    (2006)
  • W. Wang et al.

    The involvement of wheat U-box E3 ubiquitin ligase TaPUB1 in salt stress tolerance

    J. Integr. Plant Biol.

    (2020)
  • Shewry PRJJoeb: Wheat. 2009,...
  • W. Xie et al.

    Wild emmer: genetic resources, gene mapping and potential for wheat improvement

    Euphytica

    (2008)
  • Nevo, E., Krugman, T., Beiles, A. 1993. Genetic Resources for Salt Tolerance in the Wild Progenitors of Wheat (Triticum...
  • Cited by (7)

    View all citing articles on Scopus
    View full text