Research paperGenome-wide identification and expression analysis of U-box gene family in wild emmer wheat (Triticum turgidum L. ssp. dicoccoides)
Introduction
Ubiquitination is a conserved post-translational modification of proteins to regulate many cellular processes in vertebrates and higher plants, including cell division, cell death, hormone responses, biotic and abiotic stress responses (Zeng et al., 2008). The major effect of ubiquitination is to add the ubiquitin molecule with the size of 76-amino acid (AA) on the lysine residues of the target protein. In eukaryotic, the ubiquitin proteasome system (UPS) degrades the aberrant and short-lived proteins and then controls the protein loads of the cell, which contains in-concert catalytic activities of three types of enzymes, namely a large number of ubiquitin ligases (E3), together with a few ubiquitin-activating enzymes (E1) and ubiquitin-conjugating enzymes (E2) (Yan et al., 2003). In the process of ubiquitination, E1 is the activator to activate the ubiquitin molecule to form as a thioester bond through an ATP-dependent reaction at cysteine residue on E1, and then the ubiquitin is transferred to ubiquitin-conjugating enzyme E2. Furthermore, E2 functions as the transporter to transfer the activated ubiquitin to E3 to form the ubiquitin thioester, which finally mediates the ubiquitination of target proteins (Pickart, 2001, Finley and Chau, 1991). Among them, E3 ubiquitin ligases are the largest family and the different E3 ligases can regulate the system to emerge different functions. According to their structure, E3 ligases can be classified into HECT-type, RING-type and U-box-type (Callis, 2014).
U-box family is one group of E3 ubiquitin ligase proteins with U-box motif comprising of ~70 AA, which has a tertiary structure resembling of RING domain but loses the characteristic zinc-chelating cysteine and histidine residues to chelate the zinc (Ohi et al., 2003). Extensive studies have reported that U-box genes played the vital roles in regulating diverse developmental processes and stress signaling in plants (Azevedo et al., 2001). Up to date U box family has been widely identified in many plant species, including 64 members belonging to 7 groups in Arabidopsis (Wiborg et al., 2008), 77 in rice and 67 barley (Zeng et al., 2008, Ryu et al., 2019) as well as 91, 99 and 101 in banana, Brassica oleracea.L and B. rapa. L, respectively (Hu et al., 2018, Hu et al., 2019, Wang et al., 2015). Meanwhile, some plant U-box genes have been functionally validated. Wang et al found that mutation of AtPUB4 in Arabidopsis could result in male sterility due to sporophytic defect, suggesting AtPUB4 was a vital regulator in sporophyte development (Wang et al., 2013). AtPUB18 and AtPUB19 were found to coordinately function as regulatory components in development and stress response in Arabidopsis (Bergler, 2011). Furthermore; NtACRE276 was confirmed to have the E3 ligase activity and involved in cell death and defense signaling, and its ortholog in Arabidopsis (AtPUB17) and canola (BnARC1) showed similar biological function (Yang et al., 2006). In wheat, overexpression and RNAi-mediated knockdown of TaPUB1 have revealed that TaPUB1 could induce the expression of target genes to improve the antioxidant capacity under stress condition (Wang et al., 2020).
Wheat is one of the most important staple crops all over the world, which occupied around 17% of global cultivated lands and provided the 30% of global calorie consume (Shewry, 2009). However, its yield is dramatically reduced by abiotic stress, such as drought and high salinity, in particular under the challenge of climate change. As the ancestor donor of A and B genome of bread wheat, wild emmer wheat was considered as the indispensable invaluable genetic resource for wheat breeding due to its high adaptability to abiotic stress (Xie and Nevo, 2008, Nevo et al., 1993). It was found that wild emmer wheat had the extreme tolerant genotypes through comparison of the salt tolerance between wild emmer and durum wheat accessions (Feng et al., 2018). Thus, identification and mining of the gene associated with stress tolerance in wild emmer wheat holds the promise for underlying the mechanism underlying stress response and also breeding for elite varieties in emmer and bread wheat. Some studies have been carried out to identify the genes related to stress tolerance in wild emmer wheat. Chen et al. reported that TdCBL6 encoded a calcineurin B-like protein and overexpression of TdCBL6 improved the salt tolerance, which could regulate both Na + and K + uptake/translocation to keep ion homeostasis under salt stress conditions (Chen et al., 2015). An autophagy-related gene, TdAtg8, was found to be a positive regulator in osmotic and drought stress response (Kuzuoglu-Ozturk et al., 2012). However, few ubiquitin pathway related genes have been identified in wild emmer wheat at present.
The completion of its reference genome provides the opportunity to investigate the genomic organization and evolution dynamics of gene family in wild emmer wheat at the genomic level (Avni et al., 2017). At present, the detailed information of U-box gene family, especially in its roles in stress tolerance, has not been well studied in wild emmer wheat. Here, we performed an in-silico genome-wide search to identify U-box family in wild emmer (TdPUBs) using the updated reference genome information. Then, the phylogenetic relationship, chromosome localization, conserved domain of the identified TdPUBs were analyzed. Furthermore, the expression patterns of these TdPUBs in different tissues and under stress conditions were systematically investigated, and then 19 salt-responsive candidates were selected to validate their expression through qRT-PCR analysis. Finally, genetic variations of these TdPUBs were investigated based on resequencing data to reveal the selection effect during emmer wheat evolution. This was the first study to identify U-Box family in wild emmer, which not only provided the important candidate genes for further functional study, but also contribute to the evolution of this family in wheat and beyond.
Section snippets
Identification of U-box family genes in wild emmer wheat
The protein sequences of the wild emmer genome were downloaded from the Ensembl Plants database (ftp://ftp.ensemblgenomes.org/pub/plants/release-50/fasta/triticum_dicoccoides/pep/) as the local proteins data and then two methods were used for identifying U-box family. Firstly, the PFAM profile of U-box domain (PF04564) was downloaded from PFAM database (https://pfam.xfam.org/) and then used as the query to search against the wild emmer local proteins using the HMMER 3.0 tool with the threshold
Genome-wide identification of TdPUBs
Using the methods as descried in Materials and Methods, a total of 82 U-box ligase genes were detected in wild emmer genome. Since there is no standard nomenclature, the predicted U-box genes were then named as TdPUB based on their chromosome locations (Table 1). Sequence characteristic analysis showed that the length of TdPUBs ranged from 729 (TdPUB55) to 14,413 (TdPUB45) bp, and the average size of CDS and amino acid sequences were 1,724 bp and 574 aa, respectively. The PI value ranged from
Characteristics of U-box family members in wild emmer
In this study, we identified 82 U-box genes in the wild emmer genome through a genome-wide search method. Based on the conserved domain structure and phylogenetic tree, we classified these TdPUBs into seven groups. In Arabidopsis, PUB genes mainly contained UFD2, ARM, GKL domains (conserved Glycine (G), Lysine (K)/Arginine (R) residues and leucine-rich residues), serine/threonine kinase domains, WD40 repeats, tetratrico-peptide repeats (TRPs) and MIF4G-type domain and then grouped them
Conclusion
It is the first study to identify U-box family in wild emmer at the genome level. A total of 82 TdPUBs were obtained, which could be classified into 7 groups based on phylogenetic relationship. Then, exon–intron gene structure, conserved motifs as well as cis-element predication supported the classification and the member in the same group showed similar gene structures. Duplication and collinearity events revealed that segmental duplication contributed significantly to the expansion of U-box
Author Contribution
YG collected data and performed analysis. PWQ and LHB performed the RT-PCR experiments. GY and PY completed the visualization of the data. GY contributed to plant material collection. NXJ and CLC drafted the manuscript. WZY revised the manuscript. NXJ and SWN conceived this study and revised the manuscript. All authors read and approved the final manuscript.
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article and its additional files.
Funding
This work was mainly funded by the National Natural Science Foundation of China (Grant No. 31971885 and 31771778), and partially supported by the Key Research and Development Program of Shaanxi Province, China (Grant No. 2019NY-014).
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
We thank the High-Performance Computing center of Northwest A&F University for providing computational resources in this work.
References (48)
- et al.
Classification, expression pattern, and E3 ligase activity assay of rice U-box-containing proteins
Mol. Plant
(2008) - et al.
The U-box protein family in plants
Trends Plant Sci.
(2001) - et al.
TBtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface
Mol. Plant
(2020) - et al.
Systematic analysis of maize class III peroxidase gene family reveals a conserved subfamily involved in abiotic stress response
Gene
(2015) - et al.
StPUB17, a novel potato UND/PUB/ARM repeat type gene, is associated with late blight resistance and NaCl stress
Plant Sci.
(2010) Evolution: Arabidopsis thaliana and its wild relatives: A model system for ecology and evolution
Trends Ecol. Evol.
(2001)- et al.
The molecular genetics of crop domestication
Cell
(2006) - Yan J, Wang J, Li Q, Hwang JR, Patterson C, Zhang H: AtCHIP, a U-box-containing E3 ubiquitin ligase, plays a critical...
Mechanisms underlying ubiquitination
Annu. Rev. Biochem.
(2001)- et al.
Ubiquitination
Annu. Rev. Cell Biol.
(1991)