Genome wide association in Spanish bread wheat landraces identifies six key genomic regions that constitute potential targets for improving grain yield related traits

López-Fernández, Matilde; García-Abadillo, Julián; Uauy, Cristobal; Ruiz, Magdalena; Giraldo, Patricia; Pascual, Laura

doi:10.1007/s00122-023-04492-x

Genome wide association in Spanish bread wheat landraces identifies six key genomic regions that constitute potential targets for improving grain yield related traits

Original Article
Open access
Published: 13 November 2023

Volume 136, article number 244, (2023)
Cite this article

Download PDF

You have full access to this open access article

Theoretical and Applied Genetics Aims and scope Submit manuscript

Genome wide association in Spanish bread wheat landraces identifies six key genomic regions that constitute potential targets for improving grain yield related traits

Download PDF

Matilde López-Fernández¹,
Julián García-Abadillo²,
Cristobal Uauy³,
Magdalena Ruiz⁴,
Patricia Giraldo ORCID: orcid.org/0000-0003-4369-1078¹ &
…
Laura Pascual¹

1679 Accesses
5 Altmetric
Explore all metrics

Abstract

Key message

Association mapping conducted in 189 Spanish bread wheat landraces revealed six key genomic regions that constitute stable QTLs for yield and include 15 candidate genes.

Abstract

Genetically diverse landraces provide an ideal population to conduct association analysis. In this study, association mapping was conducted in a collection of 189 Spanish bread wheat landraces whose genomic diversity had been previously assessed. These genomic data were combined with characterization for yield-related traits, including grain size and shape, and phenological traits screened across five seasons. The association analysis revealed a total of 881 significant marker trait associations, involving 434 markers across the genome, that could be grouped in 366 QTLs based on linkage disequilibrium. After accounting for days to heading, we defined 33 high density QTL genomic regions associated to at least four traits. Considering the importance of detecting stable QTLs, 6 regions associated to several grain traits and thousand kernel weight in at least three environments were selected as the most promising ones to harbour targets for breeding. To dissect the genetic cause of the observed associations, we studied the function and in silico expression of the 413 genes located inside these six regions. This identified 15 candidate genes that provide a starting point for future analysis aimed at the identification and validation of wheat yield related genes.

Genomic basis determining root system architecture in maize

Article 12 April 2024

Genome-wide association study of drought tolerance in wheat (Triticum aestivum L.) identifies SNP markers and candidate genes

Article Open access 02 March 2024

Genetic diversity of grain yield traits and identification of a grain weight gene SiTGW6 in foxtail millet

Article 16 March 2024

Introduction

Bread wheat (Triticum aestivum L.) is one of the major staple crops, providing about 20% of dietary calories and proteins (Shewry and Hey 2015). Thus, identifying new genes or favourable alleles controlling key breeding traits, like yield, is mandatory to develop high-yield varieties and ensure food security. Elucidating the genetic control of key breeding traits has been challenging, since they are mainly quantitative traits controlled by multiple quantitative trait loci (QTLs) and affected by environmental factors (Sehgal et al. 2017; Sukumaran et al. 2018). Advances in high-throughput sequencing technologies coupled with Genome Wide Association Studies (GWAS), based on linkage disequilibrium formed over generations, offer the possibility to map QTLs with high resolution (Zhu et al. 2008). These approaches have allowed the identification of multiple QTLs for agronomic and quality traits, as well as, for stresses responses in a wide range of crops, such as rice (Yano et al. 2016), barley (Alqudah et al. 2014), maize (Li et al. 2013) or soybean (Fang et al. 2017). In species with complex genomes, like wheat, association analysis has also been successful for dissecting the genetic architecture of key traits (see Saini et al. 2021).

In wheat, yield can be dissected into three principal components, including number of spikes per area, grain number per spike and weight grain (normally expressed as thousand kernel weight; TKW) (Liu et al. 2018). From them, TKW is the most stable and heritable parameter, and can be further divided into kernel size and shape traits (grain length, width, and area) (Gegas et al. 2010). In addition to TKW and grain traits, several other traits can affect yield, such as spikelets per spike, spike length or plant height (Wu et al. 2014). Reduced plant height, for example has proved to increase yield since the introduction of semi dwarf varieties in the Green Revolution. Additionally, phenological traits, such as days to heading and maturity, have proved their importance, since wheat must develop biomass and flower at optimal environmental conditions (Trethowan 2014). In the last decade, hundreds of QTLs for yield related traits have been reported in bread wheat. Some studies have even lead to the identification of candidate genes, like TraesCS2D01G331100, an orthologue of the rice D11 gene contributing to grain length and width (Tekeu et al. 2021), or the cloning of genes controlling the studied trait, such as TaGW8, associated with kernel size and weight (Yan et al. 2019).

Lately, several authors have identified stable QTLs based on meta-analysis. Cao et al. (2020) defined 58 QTL-rich clusters related with TKW, kernel number per spike and spike number, located in all the wheat chromosomes except 3B. Liu et al. (2020) identified and validated 76 core Meta-QTL (MQTL) regions, in all wheat chromosomes, related with wheat yield and its component traits. Yang et al. (2021) summarized studies developed for yield related traits in irrigation and drought/heat-stressed environments, and identified 86 MQTL, some of them only in one of the environments. Finally, Ma et al. (2022) integrated their work with previous studies and identified 58 QTLs for kernel size related traits in 11 wheat chromosomes. Although, thousands of QTLs have been already identified, additional studies including non-previously screened variability, have the potential to identify new genes according to Malik et al. (2021).

One of the main requirements for GWAS has been the use of highly diverse populations, such as landraces, in order to capture the available genetic variability for the trait of interest (Kulwal and Singh 2021). Landraces have been adapted specifically to their region of origin through their evolution in local environments characterized by a wide range of biotic and abiotic conditions (Zeven 1998; Lopes et al. 2015). Thus, landraces represent an important source of genetic variability and have provided novel alleles for various agronomic, quality, biotic, and abiotic stress response traits (Azeez et al. 2018; Lopes et al. 2015). Moreover, landraces are traditionally grown with less inputs and have the potential to widen the gene pool of modern cultivars by adding underexploited variability in wheat breeding programmes (Nazco et al. 2012).

Spanish wheat landraces present high diversity due to the wide range of climatic conditions present in the Iberian Peninsula (Ruiz et al. 2018; Chacón et al. 2020). The Spanish National Plant Genetic Resources Centre (Centro de Recursos Fitogenéticos, CRF-INIA, CSIC, Madrid), maintains the national collection of Spanish bread wheat landraces composed of 522 accessions. This collection contains landraces from all Spanish regions where bread wheat was cultivated in the first half of the twentieth century. From this collection, a primary subset of 189 genotypes were selected based on collection site data (altitude, longitude, latitude) and morphological spike traits to represent the available diversity (Pascual et al. 2020a). Pascual et al. (2020b) genotyped this subset, and showed that landraces present higher genetic diversity than modern cultivars sown nowadays in Spain. Thus, these materials may include new variability non-previously screened, as showed in a previous GWAS study with Spanish durum wheat landraces where most of the marker-trait associations identified had not been previously described (Giraldo et al. 2016).

The aim of this study was to identify new genomic regions associated to yield-related traits, including also grain size and shape, and phenological traits in the 189 genotyped Spanish bread wheat landraces. For this purpose, a characterization of eleven yield-related traits in these landraces was performed along five seasons. The subsequent GWAS analysis identified genomic regions controlling these traits across environments. Moreover, we identified putative candidate genes inside associated genomic regions based on in silico expression analysis and functional annotation.

Material and methods

Plant material and phenotyping

In this study, a set of 189 bread wheat Spanish landraces (Triticum aestivum subsp. vulgare (Vill.)), already described in Pascual et al. (2020a, b) and López-Fernández et al. (2021) were analysed. The 189 genotypes were selected based on their collection site data (altitude, longitude, latitude) and morphological spike traits, to include all the agroclimatic (from cold sub-humid areas in the northern parts of Spain to warm semi-arid regimes in the southeast) and morphologic diversity found in a wider collection of 522 Spanish landraces of Triticum aestivum subsp. vulgare (Vill.) (Gadea 1954). This selection was the starting point for the construction of the Spanish bread wheat landraces core collection described in a previous study (Pascual et al. 2020a).

To obtain the phenotypic data, all landraces were sown during five consecutive seasons in an augmented design in plots of four rows per genotype (1 m long). In the 2016–2017 season, the accessions were sown in Alcalá de Henares (40°31′17, 8″ N, 3°17′33″ W, Madrid). In the following seasons (2017–2018, 2018–2019, 2019–2020, and 2020–2021), the accessions were sowed in the same conditions in the experimental fields of the ETSIAAB, Universidad Politécnica de Madrid (40º25’ N, 3º42’ W, Madrid). Daily meteorological data were recorded over the period of study (autumn 2016 to summer 2021) at nearby weather stations.

Phenotyping was conducted for a total of eleven traits, including: (i) grain traits: grain area (Ar), grain perimeter (Perim), grain major ellipse (Majell) and grain minor ellipse (Minell); (ii) yield-related traits: thousand kernel weight (TKW), grain number per spike (GrnSpk), number of spikelets per spike (SplN), spike length (SpkLng) and plant height (PH); and (iii) phenological traits: days to heading (DH) and days to maturity (DM). Some data were available from previous studies (Pascual et al. 2020a; López-Fernández et al. 2021) but phenotyping was completed in this work (see Table S1). DH, DM, PH, SpkLng and SplN were recorded in accordance with the International Board of Plant Genetic Resources (IBPGR 1985). Grain size and shape data (Ar, Perim, Majell, Minell) were obtained scanning at least 300 kernels using GrainScan software (Whan et al. 2014).

Statistical analysis was conducted using R v.4.0.3 (R Core Team 2022). Normality was tested by the Shapiro–Wilk test (p-value < 0.01), and significant traits were log transformed to achieve normality if possible (only GrnSpk was log transformed for the analysis). Mean, standard deviation, maximum and minimum values, and coefficient of variation were calculated for each trait by season. Correlations between years inside each trait and correlations among traits were calculated with Spearman coefficient (p-value < 0.05). Homocedasticity was checked using the Levene test. The effect of season, the genetic structure of the collection, and their interaction were evaluated with the Kruskal–Wallis (p-value < 0.05) and Wilcox tests (p-value < 0.05).

Genetic analysis

High-throughput genotyping data for the set of 189 accessions were available from Pascual et al. (2020b). In this previous work, the accessions were genotyped by DArTseq GBS technology at SAGA (Genetic Analysis Service for Agriculture, Mexico City, Mexico). For this study, from the total 58,660 raw SNPs (Single Nucleotide Polymorphism) markers available, those with the same allelic profile, more than 10% of missing data, or MAF < 0.05 (Minimum Allele frequency) were filtered out. The remaining markers were subjected to BLAST search against the currently available Triticum aestivum genome REFseq v2.0 (Zhu et al. 2021); only markers located in the genome (BLAST E-value < 5e − 10 and sequence identity > 90%) were kept. The genetic structure of the 189 accessions was calculated in Pascual et al. (2020b) based on the DArT (presence/absence) markers. The set of 189 accessions was divided in four genetic subpopulations, from now on named pop1, pop2, pop3 and pop4.

Linkage disequilibrium (LD) among markers was calculated using TASSEL 5.0 (Bradbury et al. 2007). Pair-wise LD was measured using the squared allele frequency correlations r2 and the values were plotted by chromosome against the physical distance to determine how fast the LD decays. A LOESS curve was fitted to the plot. LD decay was estimated according to Remington et al. (2001).

Genome-wide association study

Associations between phenotypic and genotypic data were detected using TASSEL 5.0 (Bradbury et al. 2007). A unique estimation of the phenotypic value was obtained by BLUES (best linear unbiased estimate) for the traits with a correlation between seasons higher than 0.5 in all the analysed seasons. For the remaining traits, associations were conducted independently per season. Associations were detected by a general linear model (GLM) including as a covariate the genetic structure (Q matrix). The obtained p-values for each MTA (Marker Trait Association) test were corrected by Bonferroni. For this purpose, the threshold was calculated dividing the standard p-value = 0.05 by the number of independent tests obtained with Tagger function of Haploview v4.2 software with r2 = 1 threshold (Barrett et al. 2005). LD blocks containing an association with the trait were defined as the chromosomic region containing all the markers in a LD > 0.3 (Alemu et al. 2021) with the associated marker. To do so the allele frequency correlations r2 between a significant marker and the markers located up and downstream were screened, when a marker presented r2 > 0.3 we moved to the next one, the marker that presented an r2 lower than 0.3 was considered as the end of the LD block. MTAs in the same LD block (or with overlapping end-star for their LD blocks) were considered to belong to the same QTL and grouped in Marker Trait Association Quantitative Trait Loci (MTA-QTLs). High-density MTA-QTLs regions were defined as the regions with single or overlapping MTA-QTLs, including more than 4 associated traits.

For high-density MTA-QTLs regions, the effect of days to heading was tested performing a statistic linear model using DH trait as a covariate:

$$y = x_{1} \omega + x_{2} M + \varepsilon ,$$

where $y$ was a vector with phenotypic values, $x_{1}$ was the vector with covariate values, $\omega$ was the estimate of covariate effect, $x_{2}$ was the vector with the genotypic values of the marker (0;1), $M$ was the estimate of the marker effect and $\varepsilon$ was the error.

Identification of candidate genes

Gene annotation for the MTA-QTLs regions was obtained using the gene models for high-confidence genes reported for the wheat genome sequence Triticum aestivum genome REFseq v2.1 (Zhu et al. 2021) available at https://urgi.versailles.inra.fr/download/iwgsc/IWGSC_RefSeq_Annotations/v2.1/. The function of all the genes was obtained from Triticum aestivum genome REFseq v1.0 (IWGSC 2018) available at https://urgi.versailles.inra.fr/download/iwgsc/IWGSC_RefSeq_Annotations/v1.0/.

Expression of the genes coded inside the high-density MTA-QTLs regions was analysed in silico with the gene expression dataset of Azhurnaya spring wheat developmental time course experiment (Ramírez-Gonzalez et al. 2018; Borrill et al. 2016). Genes that did not reach an expression of 0.5 transcripts per million of sequences (TPM) in target stages and tissues (from tillering stage, “shoot apical meristem”; from full boot, “spike”; from spike, “spike 30%” and “spikelets 30%”; from anthesis, “anther” and “stigma ovary”; from milk grain stage, “glumes”, “lemma” and “grain”; from soft dough, hard dough and ripening, "grain"; and from dough, "endosperm") were filtered out.

To check the possible relationship between the traits and candidate genes, KnetMiner software (Hassani-Pak et al. 2021) was used, using as keywords “1000-grain weight" OR "Grain yield" OR "Grain size" OR "Grain width" OR "Grain number" OR "Grain weight" OR "Grain length", and as gene list the candidate gene names.

Results

Uncovering the phenotypic diversity in Spanish bread wheat landraces

To evaluate the phenotypic diversity in the set of 189 bread wheat landraces, this material was characterized for eleven traits (including grain traits, yield-related traits and phenological traits) during five seasons (Table 1). The highest variation, based on the coefficient of variation (CV) among accessions, was observed for SpkLng and TKW, and the smallest for phenological traits (DH and DM). Phenological traits showed a high diversity, with differences ranging up to 48 days in heading (DH) and up to 33 in days to maturity (DM). This diversity reflected the potential of the Spanish landraces for adapting to a high range of environments.

Table 1 Summary of the phenotypic data obtained

Full size table

As this set of landraces was clustered into four subpopulations (Pascual et al. 2020a), the effect of the genetic structure (pop) on the phenotype was evaluated. Significant differences were found for all the studied traits except Minell (Table 1). Besides, the environmental effect was also evaluated based on the different environments (seasons). A significant effect was found for all the studied traits, except Majell. Grain traits, TKW and DH values were higher on season 2017–2018, which was the wettest (Fig. S1). Although PH was not evaluated in that season, the highest PH values were found during the 2019–2020 season which was the second wettest. Moreover, DH, PH and TKW showed the lowest values on season 2016–2017, which registered the driest months during the grain filling period. To quantify this environmental effect, correlation analyses were carried out between seasons for each trait (Fig. 1). Positive to high positive correlations were observed for Ar, Perim, Majell, Minell, DH, DM, SpkLng and SplN. Thus, a unique phenotypic value across seasons was estimated for each of these traits through BLUES (Best Linear Unbiased Estimate). On the other hand, PH, GrnSpk and TKW showed low positive correlation values between seasons, due to the genotype x environment interaction, so each season phenotypes were kept separately for subsequent analysis.

Finally, correlations among traits were evaluated (Fig. 1). Grain traits (Ar, Perim, Majell and Minell) showed positive correlation values between them (except Majell with Minell), and with TKW, indicating the key role of the grain shape in grain weight. However, those traits were weakly and negative correlated with other yield-related traits (GrnSpk and SplN). DH and DM were positively correlated among them, as expected, but negatively correlated with GrnSpk and SplN (Fig. 1).

Linkage disequilibrium along the chromosomes differed between homoeologous

High-throughput genotyping data for the set of 189 accessions had been previously reported at Pascual et al. (2020b). From the 58,660 raw SNP obtained on that study, a total of 4856 high-quality markers that could be located in Chinese Spring reference genome were selected for the analysis. Linkage disequilibrium (LD) among pairs of markers located in the same chromosome was calculated. The average square allele frequency correlation was r2 = 0.06 for the whole genome, ranging from 0.09 for chromosome 4B to 0.03 for chromosome 7D. The percentage of loci pairs showing a significant LD (p < 0.001) ranged from 28.56% for chromosome 1A to 9.41% for chromosome 4D. LD differed between homoeologous genomes with an average of 24.58% significant locus pairs (r2 mean = 0.07) corresponding to the B genome, 23.86% (r2 mean = 0.06) to the A genome and 11.82% (r2 mean = 0.04) to the D genome (Table S2). LD decay showed a similar trend for A and B genomes in all chromosomes, except for homoeologous group 4. For D genome chromosomes, LD decay was slower (Fig. S2A). The genome-wide half LD decay was 0.23 and the intersect of that value with the LD decay curve was at 1.3 Mb (Fig. S2B). Later, according to the HAPLOVIEW tagger function, it was determined that a total of 4476 independent test could be performed with the set of markers.

Numerous marker trait associations were identified by GWAS

With the aim of identifying the genomic regions associated with the evaluated traits, GWAS was performed. The analyses detected a total of 881 significant MTAs, involving 434 markers across the genome, as some markers were associated with more than one trait (Fig. 2, Table S3). The MTAs were distributed equally in the A and B genomes (~ 40%), and less in the D genome (~ 17%), consistent to the distribution of the whole set of SNP markers used for these analyses (Table 2). However, at the whole chromosome level, the distribution of MTAs was variable. Chromosome 5A showed the highest number of MTAs (112; 12.71%), despite not harbouring the highest number of SNP markers, whereas chromosome 4D showed the lowest (8; 0.91%), as expected since it is the smallest chromosome. Focussing on the traits, chromosome 4A, with only 3.52% of the total MTAs, harboured MTAs for the 11 traits studied. The number of MTAs associated with each trait ranged from 2 for SplN to 139 for Perim (Table S3). Finally, the mean percentage of phenotypic variance (PVE) explained per MTA was calculated, being its value similar for all traits, and ranging from 0.10 to 0.13, except for SplN (0.06) (Fig. 3c). Almost 70% of the MTAs showed a PVE lower than 0.12.

Table 2 Distribution of all associations identified along the wheat genome

Full size table

To further determine the number of loci associated along the genome, the 881 MTAs were grouped into 366 Marker Trait Association Quantitative Trait Loci (MTA-QTL) based on the LD between flanking markers (Fig. 3, Table 2, Table S3). As for MTAs, chromosome 5A harboured the highest number (33), followed by chromosome 2B (31), while chromosome 4D harboured the lowest (3) (Fig. 3a and Table 2). Regarding the size of the MTA-QTLs, 165 (45%) included only one MTA, whereas the remaining 201 ranged from 2 (in 89 MTA-QTLs) to 19 MTAs. The average MTA-QTL physical length was 10.1 Mb (median 4.11 Mb), with 77.4% of them shorter than 10 Mb and 1.65% longer than 100 Mb. The smallest MTA-QTL, with only 20 kb, was located on chromosome 2B and the biggest one, with 214.22 Mb on chromosome 4A (Fig. 3d). The number of traits associated per MTA-QTL varied from 1 to 7. As expected, the traits with the highest number of MTAs (grain size traits (Ar, Perim and Majell) and SpkLng) were the ones with higher number of MTA-QTLs and the one with a lower number of MTA-QTLs (only 2) was SplN (Fig. 3b).

MTA-QTLs linked to the same trait when characterized in different environments are especially interesting and can be considered as stable QTLs. Stable QTLs could be target for the traits TKW, GrnSpk and PH analysed by season due to the lack of correlation between seasons. From the total of 89 MTA-QTLs identified for TKW (39 for season 2016–2017, 7 for season 2017–2018, 49 for season 2018–2019 and 29 for season 2019–2020) none of them was stable among all seasons. However, 10 were coincident in three seasons and 15 in two. For GrnSpk a total of 42 MTA-QTLs were identified (30 for season 2017–2018, 1 for season 2018–2019 and 15 for season 2019–2020) being only 4 stable on two seasons. Finally, for PH, 24 MTA-QTLs were detected (2 for season 2016–2017, 3 for season 2018–2019, 12 for season 2019–2020 and 11 for season 2020–2021), and also 4 were stable across 2 seasons. Besides stable QTLs, MTA-QTLs linked to several correlated traits also constitute a target that pinpoints genes with a possible pleiotropic effect. First, all co-localizing MTA-QTLs harbouring associations with grain size related traits were grouped. A total of 30 common MTA-QTLs for Ar, Perim and Majell, that could be considered key QTLs controlling grain size, were identified. For the two phenological traits DH and DM, 52 and 34 MTA-QTLs were detected, 17 common in both traits.

To identify candidate genes controlling the analysed traits, the genes inside the MTA-QTLs were analysed. The associations included a total of 25,373 genes according to IWGSC Wheat Refseq 2.1. The number of genes per MTA-QTL ranged from 0 to 656. The average number of genes per MTA-QTL was 71, with 9% of the MTA-QTLs contained less than 10 genes, and 7% more than 200 (Fig. 3d). The closest gene to the most significant marker for each trait within the MTA-QTLs and its predicted function was analysed (Table S3), and none of them matched known genes controlling the studied traits. However, several detected MTA-QTLs included or were close to key known genes. For example, MTA-QTL_4B.196 was located close to VRN-B2, MTA-QTL_5A.215 included VRN-A1 and MTA-QTL_2D.115 was located close to PPD-D1, being all of them associated with DH and DM in cereal species (Fernández-Calleja et al. 2021; Chen et al. 2010; Yan et al. 2003; Welsh et al. 1973). Regarding to grain traits, MTA-QTL_6A.267 linked to TKW, co-localized with TaGW2. Also, as expected considering the Spanish landraces were collected before the Green Revolution, no MTA-QTLs for PH were located close to RHT genes on chromosomes 4B and 4D.

Targeting high density MTA-QTL regions along the genome

Genomic regions associated to more than one trait could be interesting, specially to target genes that might help breeding for different traits. Thus, high-density MTA-QTL regions (from now on regions) were defined as a genomic interval including associations to four or more traits in only one MTA-QTL or in two or more overlapping MTA-QTLs. In total 46 regions were identified, most of them harbouring associations with grain traits and TKW (Table S4). Fourteen of those key regions were associated with DH. As it has been reported that DH might affect grain and yield related traits, DH effect on the associations identified in each region was tested. After this analysis, 33 regions remained associated to at least four traits (Table 3), including 6 regions where DH had been one of the associated traits, even though this association was no longer significant. In one of them, R5A.3, the size of the region was smaller.

Table 3 Description of the 33 selected genomic regions being associated with at least four traits

Full size table

As TKW and grain traits (Ar, Perim and Majell) represent a cornerstone for breeding, out of the 33 regions described previously, the six that were associate with these traits (TKW in three seasons) were selected as the most promising ones (Fig. 2, Table 4). For them, the effect of the allele carried by each accession, at the most significant MTA according to GWAS, in the average values of the associated traits was explored (Fig. S3). Region R2B.6 included the most significant MTA for TKW (Table S3, Fig. 4a and b), for this marker, the accessions carrying allele G presented an increase of 19.40% for Ar, 10.46% for Perim, 10.88% for Majell, and for TKW an increase up to 41.32% on season 2016–2017, 30.79% on season 2018–2019 and 29.30% on season 2019–2020 (Fig. 4d, Fig. S3).

Table 4 Description of the six selected genomic regions

Full size table

To dissect the genetic cause of the observed associations, the function fo the 413 genes located inside these 6 regions was studied. First, these genes were classified based on their GO terms (Fig. S4). According to Biological Process, 162 and 143 genes were included in “cellular process” and “metabolic process”, followed by biological regulation (52 genes). Regarding Molecular Function, the main categories with 167 and 146 genes were “catalytic activity” and “binding”, followed by “transferase activity” (80 genes). Second, to select putatives candidate genes, the genes were filtered by relevant tissue-specific expression (see "Methods"), obtaining 308 expressed genes. Those genes included at least 38 transcription factors and genes with functions related to grain size and yield according to Gupta et al. (2020). The most promissing candidates taking into account the expression pattern and the predicted function are shown in Table 4.

Discussion

The aim of the present study was to identify in a panel of wheat landraces, new genomic regions associated with key breeding traits, including grain traits, yield-related traits and phenological traits.

Spanish bread wheat landraces present a wide range of phenotypic diversity

The phenotypic diversity of any collection of accessions is the limiting factor that will determine the chance to identify novel MTAs when conducting a GWAS. Thus, a successful study requires a collection as diverse as possible, but that at the same time is adapted to the target environment. For this analysis, a total of 189 Spanish bread wheat landraces, selected from 522 accessions to capture the available diversity regarding to collection site data (altitude, longitude, latitude) and morphological spike traits (Pascual et al. 2020a) have been characterized during five different seasons. Previous studies have pointed out the high degree of genetic diversity harboured by Spanish bread wheat landraces, highlighted, for example, by the high and novel allelic variability for prolamines (Giraldo et al. 2010; Ruiz et al. 2002). Moreover, Pascual et al. (2020b) have determined that the genetic diversity of this collection has not been included in the bread wheats currently cultivated in the country. When this collection was characterized at phenotypic level, this genetic diversity was translated into a wide range of phenotypic variation (i.e. grain traits variation shown on Table 1, Fig. 4d). It should be noted that variation is greater than that found in other landraces collections. For example, TKW presented a range of 25 gr in the season 2017–2018 (the wettest one) (Table 1), which is higher than that found in Asian landraces (according to Lopes et al. 2015). For yield related traits, such as PH, the range of variation was around 60 cm in all seasons (Table 1), similar to that found in a collection of Spanish durum wheat landraces (Giraldo et al. 2016). That is expected, as landraces precede the Green Revolution during which dwarfing genes were fixed, thus present higher variability than modern cultivars. Regarding phenological traits, differences in the latitude of landraces collection sites are typically related with diversity in vernalisation and photoperiod genes (Royo et al. 2020). The collection includes only Spanish accessions, however a range greater than a month was found for DH in all seasons (Table 1). This high phenotypic diversity has been also detected in Spanish durum wheat landraces (Giraldo et al. 2016), and it is probably due to the diverse environmental conditions found in Spain. Indeed, landraces were grown from cold sub-humid areas in the northern parts of Spain to warm semi-arid regimes in the southeast (Gadea 1954), in basic or neutral soils in the Centre and East, and acid soils in the western regions (Reuter et al. 2008).

LD along the genome can be linked to the available genomic diversity

Linkage disequilibrium, the basis for association mapping, is mainly affected by historical recombination, allele frequency and selection in a natural population (Alqudah et al. 2020). In this work, LD and LD decay were evaluated. An average r2 = 0.06 for the whole genome was found, which is similar to the value obtained in other wheat landraces (Hanif et al. 2021). This low linkage disequilibrium is reflecting the lack of identity by descent, as the accessions predate the Green Revolution and thus do not share common parents in their pedigree and guarantees a high level of resolution when performing association analysis.

Comparing the different homoeologous genomes, it was found that, as previously described, the number of paired makers in LD was the lowest for the D genome and LD decay was also slower than for the A and B genomes (Pang et al. 2020; Jung et al. 2021) (Table S2). This might be due to the reduced genetic diversity of the D genome as a consequence of its relatively recent incorporation to bread wheat (IWGSC 2014). When focus was set at chromosome level, LD was the lowest at chromosome 7D and highest at chromosome 4B, however both chromosomes harbour a similar number of polymorphic markers (146 and 145 respectively) (Table S2). In this case, the difference does reflect the lower genetic diversity (Hs) at the centromeric region of chromosome 4B detected by Pascual et al. (2020b).

Finally, HAPLOVIEW software (Barrett et al. 2005) was employed to estimate the number of independent tests that could be performed with the selected molecular markers. The 4856 high quality SNP markers allowed to perform 4476 independent tests, a reduction of 7.8%, clearly lower than in other studies. For example, Rufo et al. (2021) genotyped their landraces with the Illumina Infinium 15K Wheat SNP Array, and from 10,090 high quality SNPs only considered 3696 to by independent. This fact indicates that the selected markers do not provide redundant information.

GWAS in a collection of Spanish landraces uncover novel yield related MTA-QTLs

An association analysis combining the phenotypic data (11 traits in five different environments) from the highly diverse collection of landraces (189 accessions), and the set of high-quality SNP markers, considering the genetic structure (Pascual et al 2020b) identified a total of 881 Marker Trait Associations involving 434 markers across the genome (Fig. 2). Later, the genomic intervals (MTA-QTLs) that should contain the causal polymorphisms responsible of the phenotypic variance explained by the associated marker were defined according to LD. We identified 366 MTA-QTLs (Fig. 3 and Table S3), each of them associated with an average of 1.77 traits (ranging from 1 to 7 traits) and including an average of 1.35 markers, as expected considering the lack of redundancy found for the selected SNPs. MTA-QTLs were detected in all the wheat chromosomes; the A genome had the highest number of associations (152 MTA-QTLs) as previously described (Ain et al. 2015; Godoy et al. 2018; Khan et al. 2022), followed by the B (144) and D (62) genomes. Chromosome 5A, known for harbouring several genes affecting phenology and yield (Kato et al. 2000), included the highest number of MTA-QTLs (35) despite non-being the largest chromosome. In summary, our study revealed a large number of genomic regions implicated in key breeding traits, probably due to the wide agroclimatic diversity found in the Iberian Peninsula (Gadea 1954; Reuter et al. 2008). Moreover, several studies including landraces have previously shown the potential of those locally adapted accessions to reveal new associations, as example, Rahimi et al. (2019) and Rabieyan et al. (2022) analysed a collection including one hundred Iranian modern varieties two hundred Iranian landraces and detected 394 and 257 respectively.

To target which of the detected MTA-QTLs uncover novel associations with yield and yield related traits, the previous identified genes controlling the analysed traits and the most recent Meta-QTLs studies that best summarize the available information (Cao et al. 2020; Liu et al. 2020; Yang et al. 2021; Ma et al. 2022) were compared with the obtained results. Already known associations for grain and yield traits were validated in the present study, such as MTA-QTL 4A.182 (4A from 612.6 to 614 Mb) that includes the cell invertase TaCWI associated with kernel weight and grain number per spike (Jiang et al. 2015), MTA-QTL 5A.200 ( 5A from 49.59 to 136 Mb) that harbours TaSnRK2 a protein kinase controlling yield related traits (Ur Rehman et al. 2019), MTA-QTL 6A.267 (6A from 230.15 to 285.56 Mb) that contains the widely studied TaGW2 controlling grain size (Su et al. 2011) or MTA-QTL 7B.338 (7B from 68.15 to 71.35 Mb) inside which is located TaSUS1 associated with TKW (Hou et al. 2014). Regarding phenological traits, four MTA-QTLs close by or including the well-known genes were detected; PPD-D1 (Welsh 1973) (for MTA-QTL 2D.127 21–32Mb), TaELF3-1DL homolog to Early Flowering from Arabidopsis (Wang et al. 2016a) (for MTA 1D.63 483–486.34Mb), VRN-B2 (Yan et al. 2004) (for MTA-QTL 4B.213, 655–670Mb), and VRN-A1 (Yan et al. 2003) (for MTA-QTL 5A.215, 588–590Mb) for which it is already known the analysed set of landraces presents polymorphism (Pascual et al. 2020b). Then, the MTA-QTLs that to our knowledge are close by or include genes or QTLs previously identified were filtered out. The analysis revealed more than 150 considered novel associations, as were not included in the most recent Meta-QTLs analysis (Cao et al. 2020; Liu et al. 2020; Yang et al. 2021; Ma et al. 2022). New MTA-QTLs were identified for most of the characterized traits (except SpIN), moreover non-previously described associations could be found in all the chromosomes. Those results reflect the unexplored genetic diversity harboured by the bread wheat Spanish landraces (Pascual et al. 2020b), and are in accordance with those of Giraldo et al. (2016), where a GWAS in Spanish durum wheat landraces revealed mainly novel associations. Even though landraces present lower yields compared to modern cultivars under optimal conditions, they usually present more stable yields under harsh environments (Zeven 1998). Thus, those novel associations might include key genes that will enhance breeding programmes considering the actual climate change scenario. To look for putative candidate genes underlying the novel associations, we identified the annotation of gene located closest to the most significant marker inside each novel MTA-QTL. More than ten transcription factors and plant hormone related genes were identified.

Dissection of high-density MTA-QTLs genomic regions identified new putative genes related with wheat yield

Genomic regions harbouring associations to several traits are especially useful for breeding, as they allow selecting for multiple traits. In this work, we identified 33 high density QTLs regions, associated with more than four traits and non-related with days to heading. One fifth of those regions were located on chromosome 5A, which again highlights the key role of this chromosome in adaptability and yield related traits control (Barabaschi et al. 2015). As expected, considering the high number of traits associated with them, most of those regions had been previously described (Cao et al. 2020; Liu et al. 2020; Yang et al. 2021; Ma et al. 2022). However, according to these studies some of the Meta-QTLs include hundreds of Mb. The present study helps to narrow the genomic interval that may include the causal genes, thus facilitates the search of putative candidates. For example, Liu et al. (2020) identified a Meta-QTL for TKW for chromosome 7B (size 65 Mb), which co-localized with R7B.2 whose size is just 2.27 Mb and includes the genes TraesCS7B03G1112900 and TraesCS7B03G1114600 two promising candidates. Moreover, one of those regions R5A.3 (described also by Cao et al. 2020; Yang et al. 2021) originally was linked to DH, DM (Table S4), and included the gene VRN-A1 (589Mb according to Triticum aestivum genome REFseq v2.1). After taking into account the effect of DH, the region was reduced by 2 Mb, to the interval from 584.83 to 588.52 Mb at chromosome 5A, and remained associated to Ar, Perim, Majell and TKW (Table 3). This suggest that the already described link between DH and TKW (Giraldo et al. 2016) might be due to linkage disequilibrium between VRN-A1 and another gene affecting grain weight. An ancestral recombination, that might have taken place during the selection of Spanish bread wheat landraces, may have helped to detect this link and suggests that exists an underexploited gene in this interval. Besides, to our knowledge four of the targeted regions, located on chromosomes 1A, 3A and 3B, have not been previously linked to the studied traits (Table 3). For one of them, R1A.1 the closest gene to the most significant MTA is TraesCS1A03G0122300 a RING/U-box superfamily protein. It is well known that RING/U-box ubiquitin ligases play a role in plants growth and development, as well as in regulating the response to different stresses (Serrano et al. 2018). Actually in wheat and rice several studies have identified U-Box ubiquitin ligases as responsible for the regulation of grain related traits (Song et al. 2007; Wang et al. 2022; Brinton et al. 2018).

Finally, the focus was set on the six genomic regions that were linked to Ar, Perim, Majell and TKW in at least three seasons (stable QTLs) (Fig. 2). The total genes (413) included on these regions were carefully analysed to detect putative candidates, annotation as well as in silico expression analysis allowed the identification of the 15 most promising genes (Table 4).

Inside R1B.2 two putative genes coding for MYB transcription factors (TraesCS1B03G0803000 and TraesCS1B03G0817400) were identified. This family of transcription factors is involved in different physiological and biochemical processes, including control of cell development and cell cycle, hormone synthesis, and signal transduction (Dubos et al. 2010; Feller et al. 2011). Moreover, according to KnetMiner database (Hassani-Pak et al. 2021) those genes regulate grain size related genes. Besides the gene TraesCS1B03G0827400 that codes for a ubiquitin ligase whose link to yield has already been described was also selected inside this region based on its predicted expression (Wang et al. 2016b).

The second region R2B.6 (Fig. 4) located at the end of chromosome 2B (752.7–757.03) was considered the most promising one, as harboured the most significant MTA for TKW in three seasons, which produced also the greatest effects on grain size (Fig. 4d). It included a RNA binding protein (TraesCS2B03G1383200), as well as, the transcription factor TraesCS2B03G1382600 with a high homology to rice ILI1 gene. This rice gene, according to Zhang et al. (2009), acts as a positive regulator of cell elongation and plant development, having a positive role in leaf bending. Moreover, the rice gene ILI6 from same family, plays a key role in determining rice grain length (Heang and Sassa 2012).

In chromosome 3B two regions were highlighted. R3B.1 (12.61–16.82 Mb), in which four candidates were selected, TraesCS3B03G0058000, a putative Cytochrome P450 highly expressed in spikelets, and three kinases TraesCS3B03G0054900, TraesCS3B03G0055300 and TraesCS3B03G0055900. The kinases presented a high homology to Leucine-Rich-Repeat (LRR) receptor kinases SERK2, SERK4 and BAK1 from rice, involved in the regulation of plant growth through the brassinosteroid signalling pathway (Li et al. 2009; Park et al. 2011). The second region R3B.2 (249.38–258.96Mb), harboured an expressed NAC domain protein (TraesCS3B03G0504300) whose role in developmental process in widely known (Olsen et al. 2005), and a WD-repeat protein (TraesCS3B03G0496600) that codifies for a TOPLESS-related protein. The TOPLESS proteins play multiple roles throughout plant development (Causier et al. 2012; Oh et al. 2014).

On chromosome 5A (590.25–596.91Mb), two kinase proteins (TraesCS5A03G0945200 and TraesCS5A03G0956000) were found within the R5A.4 region. The first one is highly similar to BRI1 a Brassinosteroid LRR receptor kinase from rice, which increases the biomass and grain production in this species (Morinaka et al. 2006). The second one, codes for a Sucrose non-fermenting-1-related protein kinase 2.8 (SnRK2), an orthologue of AT3G50500 Arabidopsis protein, involved in the abscisic acid signalling.

The last region R7B.1 (688.21–69.48) harboured TraesCS7B03G1114600 and TraesCS7B03G1112900, an ubiquitin and F-box family protein, respectively, both highly expressed in spike and grains.

In summary, the present study of a collection of Spanish bread wheat landraces highlighted the high phenotypic diversity of this collection and identified more than 350 MTA-QTLs, including at least 150 novel ones. Those MTA-QTLs allowed the targeting of 33 high dense QTL regions in the genome, that remained associated to at least four traits after considering the effect of days to heading. Finally, taking into account the importance of detecting stable QTLs, six regions associated to several grain traits and TKW in at least three environments were selected as the most promising ones to harbour targets for breeding. Moreover, the preliminary screening for candidate genes reported in this study provide a starting point for future analysis aimed at the identification and validation of wheat yield related genes.

Data availability

The datasets generated during and/or analysed during the current study are available as supplementary material (average values), or from the corresponding author on reasonable request (raw values).

References

Ain Q, Rasheed A, Anwar A, Mahmood T, Imtiaz M, Mahmood T, Xia X, He Z, Quraishi UM (2015) Genome-wide association for grain yield under rainfed conditions in historical wheat cultivars from Pakistan. Front Plant Sci 6:743. https://doi.org/10.3389/fpls.2015.00743
Article PubMed PubMed Central Google Scholar
Alemu A, Feyissa T, Maccaferri M, Sciara G, Tuberosa R, Ammar K, Badebo A, Acevedo M, Letta T, Abeyo B (2021) Genome-wide association analysis unveils novel QTLs for seminal root system architecture traits in Ethiopian durum wheat. BMC Genomics 22:20. https://doi.org/10.1186/s12864-020-07320-4
Article PubMed PubMed Central CAS Google Scholar
Alqudah AM, Sharma R, Pasam RK et al (2014) Genetic dissection of photoperiod response based on GWAS of pre-anthesis phase duration in spring barley. PLoS ONE 9:e113120. https://doi.org/10.1371/journal.pone.0113120
Article PubMed PubMed Central CAS Google Scholar
Alqudah AM, Sallam A, Baenziger PS, Börner A (2020) GWAS: fast-forwarding gene identification and characterization in temperate cereals: lessons from barley - a review. J Adv Res 22:119–135. https://doi.org/10.1016/j.jare.2019.10.013
Article PubMed Google Scholar
Azeez MA, Adubi AO, Durodola FA (2018) Landraces and crop genetic improvement. In: Rediscovery of Landraces as a Resource for the Future. In: Adubi AO (ed) Rediscovery of landraces as a resource for the future. IntechOpen, Rijeka. https://doi.org/10.5772/intechopen.75944
Barabaschi D, Magni F, Volante A, Gadaleta A, Šimková H, Scalabrin S, Prazzoli ML, Bagnaresi P, Lacrima K, Michelotti V (2015) Physical mapping of bread wheat chromosome 5A: an integrated approach. Plant Genome 8:1–24. https://doi.org/10.3835/plantgenome2015.03.0011
Article CAS Google Scholar
Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21:263–265. https://doi.org/10.1093/bioinformatics/bth457
Article PubMed CAS Google Scholar
Borrill P, Ramirez-Gonzalez R, Uauy C (2016) expVIP: a customizable RNA-seq data analysis and visualisation platform. Plant Physiol 170:2172–2186. https://doi.org/10.1104/pp.15.01667
Article PubMed PubMed Central CAS Google Scholar
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635. https://doi.org/10.1093/bioinformatics/btm308
Article PubMed CAS Google Scholar
Brinton J, Simmonds J, Uauy C (2018) Ubiquitin-related genes are differentially expressed in isogenic lines contrasting for pericarp cell size and grain weight in hexaploid wheat. BMC Plant Biol 18(1):22. https://doi.org/10.1186/s12870-018-1241-5
Article PubMed PubMed Central CAS Google Scholar
Cao S, Xu D, Hanif M, Xia X, He Z (2020) Genetic architecture underpinning yield component traits in wheat. Theor Appl Genet 133:1811–1823. https://doi.org/10.1007/s00122-020-03562-8
Article PubMed CAS Google Scholar
Causier B, Ashworth M, Guo W, Davies B (2012) The TOPLESS interactome: a framework for gene repression in Arabidopsis. Plant Physiol 158:423–438. https://doi.org/10.1104/pp.111.186999
Article PubMed CAS Google Scholar
Chacón EA, Vázquez FJ, Giraldo P, Carrillo JM, Benavente E, Rodríguez-Quijano M (2020) Allelic variation for prolamins in Spanish durum wheat landraces and its relationship with quality traits. Agronomy 10:136. https://doi.org/10.3390/agronomy10010136
Article CAS Google Scholar
Chen Y, Carver BF, Wang S, Cao S, Yan L (2010) Genetic regulation of developmental phases in winter wheat. Mol Breed 26:573–582. https://doi.org/10.1007/s11032-010-9392-6
Article Google Scholar
Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L (2010) MYB transcription factors in Arabidopsis. Trends Plant Sci 15:573–581. https://doi.org/10.1016/j.tplants.2010.06.005
Article PubMed CAS Google Scholar
Fang C, Ma Y, Wu S, Liu Z, Wang Z, Yang R, Hu G, Zhou Z, Yu H, Zhang M, Pan Y, Zhou G, Ren H, Du W, Yan H, Wang Y, Han D, Shen Y, Liu S, Liu T, Zhang J, Qin H, Yuan J, Yuan X, Kong F, Liu B, Li J, Zhang Z, Wang G, Zhu B, Tian Z (2017) Genome-wide association studies dissect the genetic networks underlying agronomical traits in soybean. Genome Biol 18:16. https://doi.org/10.1186/s13059-017-1289-9
Article CAS Google Scholar
Feller A, Machemer K, Braun EL, Grotewold E (2011) Evolutionary and comparative analysis of MYB and bHLH plant transcription factors. Plant J 66:94–116. https://doi.org/10.1111/j.1365-313X.2010.04459.x
Article PubMed CAS Google Scholar
Fernández-Calleja M, Casas AM, Igartua E (2021) Major flowering time genes of barley: allelic diversity, effects, and comparison with wheat. Theor Appl Genet 134:1867–1897. https://doi.org/10.1007/s00122-021-03824-z
Article PubMed PubMed Central CAS Google Scholar
Gadea M (1954) Trigos españoles. Instituto Nacional de Investigaciones Agronómicas, Madrid
Google Scholar
Gegas VC, Nazari A, Griffiths S, Simmonds J, Fish L, Orford S, Sayers L, Doonan JH, Snape JW (2010) A genetic framework for grain size and shape variation in wheat. Plant Cell 22:1046–1056. https://doi.org/10.1105/tpc.110.074153
Article PubMed PubMed Central CAS Google Scholar
Giraldo P, Rodriguez-Quijano M, Simon C, Vázquez JF, Carrillo JM (2010) Allelic variation in HMW glutenins in Spanish wheat landraces and their relationship with bread quality. Span J Agric Res 8:1012–1023. https://doi.org/10.5424/sjar/2010084-1394
Article Google Scholar
Giraldo P, Royo C, González M, Carrillo JM, Ruiz M (2016) Genetic diversity and association mapping for agromorphological and grain quality traits of a structured collection of durum wheat landraces including subsp. durum, turgidum and diccocon. PloS one 11:e0166577. https://doi.org/10.1371/journal.pone.0166577
Godoy J, Gizaw S, Chao S, Blake N, Carter A, Cuthbert R, Dubcovsky J, Hucl P, Kephart K, Pozniak C (2018) Genome-wide Association Study of Agronomic Traits in a Spring-Planted North American Elite Hard Red Spring Wheat Panel. Crop Sci 58:1838–1852. https://doi.org/10.2135/cropsci2017.07.0423
Article CAS Google Scholar
Gupta PK, Balyan HS, Sharma S, Kumar R (2020) Genetics of yield, abiotic stress tolerance and biofortification in wheat (Triticum aestivum L.). Theor Appl Genet 133:1569–1602. https://doi.org/10.1007/s00122-020-03583-3
Article PubMed Google Scholar
Hanif U, Alipour H, Gul A, Jing L, Darvishzadeh R, Amir R, Munir F, Ilyas MK, Ghafoor A, Siddiqui SU (2021) Characterization of the genetic basis of local adaptation of wheat landraces from Iran and Pakistan using genome-wide association study. Plant Genome 14:e20096. https://doi.org/10.1002/tpg2.20096
Article PubMed CAS Google Scholar
Hassani-Pak K, Singh A, Brandizi M, Hearnshaw J, Parsons JD, Amberkar S, Phillips AL, Doonan JH, Rawlings C (2021) KnetMiner: a comprehensive approach for supporting evidence-based gene discovery and complex trait analysis across species. Plant Biotechnol J 19:1670–1678. https://doi.org/10.1111/pbi.13583
Article PubMed PubMed Central Google Scholar
Heang D, Sassa H (2012) Antagonistic actions of HLH/bHLH proteins are involved in grain length and weight in rice. PLoS ONE 7:e31325. https://doi.org/10.1371/journal.pone.0031325
Article PubMed PubMed Central CAS Google Scholar
Hou J, Jiang Q, Hao C, Wang Y, Zhang H, Zhang X (2014) Global selection on sucrose synthase haplotypes during a century of wheat breeding. Plant Physiol 164:1918–1929. https://doi.org/10.1104/pp.113.232454
Article PubMed PubMed Central CAS Google Scholar
IBPGR (1985) Revised Descriptor List for Wheat (Triticum spp). International Board for Plant Genetic Resources, Rome
Google Scholar
International Wheat Genome Sequencing C (2018) Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 361:7191. https://doi.org/10.1126/science.aar7191
Article CAS Google Scholar
Jiang Y, Jiang Q, Hao C, Hou J, Wang L, Zhang H, Zhang S, Chen X, Zhang X (2015) A yield-associated gene TaCWI, in wheat: its function, selection and evolution in global breeding revealed by haplotype analysis. Theor Appl Genet 128:131–143. https://doi.org/10.1007/s00122-014-2417-5
Article PubMed CAS Google Scholar
Jung WJ, Lee YJ, Kang C, Seo YW (2021) Identification of genetic loci associated with major agronomic traits of wheat (Triticum aestivum L.) based on genome-wide association analysis. BMC Plant Biol 21:1–14. https://doi.org/10.1186/s12870-021-03180-6
Article CAS Google Scholar
Kato K, Miura H, Sawada S (2000) Mapping QTLs controlling grain yield and its components on chromosome 5A of wheat. Theor Appl Genet 101:1114–1121. https://doi.org/10.1007/s001220051587
Article CAS Google Scholar
Khan H, Krishnappa G, Kumar S, Mishra CN, Krishna H, Devate NB, Rathan ND, Parkash O, Yadav SS, Srivastava P (2022) Genome-wide association study for grain yield and component traits in bread wheat (Triticum aestivum L.). Front Genet 13:982589. https://doi.org/10.3389/fgene.2022.982589
Kulwal PL, Singh R (2021) Association Mapping in Plants. In: Tripodi P (eds) Crop Breeding. Humana, New York, pp 105–117. https://doi.org/10.1007/978-1-0716-1201-9_8
Li D, Wang L, Wang M, Xu Y, Luo W, Liu Y, Xu Z, Li J, Chong K (2009) Engineering OsBAK1 gene as a molecular tool to improve rice architecture for high yield. Plant Biotechnol J 7:791–806. https://doi.org/10.1111/j.1467-7652.2009.00444.x
Article PubMed CAS Google Scholar
Li H, Peng Z, Yang X, Wang W, Fu J, Wang J, Han Y, Chai Y, Guo T, Yang N, Liu J, Warburton ML, Cheng Y, Hao X, Zhang P, Zhao J, Liu Y, Wang G, Li J, Yan J (2013) Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat Genet 45:43–50. https://doi.org/10.1038/ng.2484
Article PubMed CAS Google Scholar
Liu K, Sun X, Ning T, Duan X, Wang Q, Liu T, An Y, Guan X, Tian J, Chen J (2018) Genetic dissection of wheat panicle traits using linkage analysis and a genome-wide association study. Theor Appl Genet 131:1073–1090. https://doi.org/10.1007/s00122-018-3059-9
Article PubMed CAS Google Scholar
Liu H, Mullan D, Zhang C, Zhao S, Li X, Zhang A, Lu Z, Wang Y, Yan G (2020) Major genomic regions responsible for wheat yield and its components as revealed by meta-QTL and genotype–phenotype association analyses. Planta 252:65. https://doi.org/10.1007/s00425-020-03466-3
Article PubMed CAS Google Scholar
Lopes MS, El-Basyoni I, Baenziger PS, Singh S, Royo C, Ozbek K, Aktas H, Ozer E, Ozdemir F, Manickavelu A (2015) Exploiting genetic diversity from landraces in wheat breeding for adaptation to climate change. J Exp Bot 66:3477–3486. https://doi.org/10.1093/jxb/erv122
Article PubMed CAS Google Scholar
López-Fernández M, Pascual L, Faci I, Fernández M, Ruiz M, Benavente E, Giraldo P (2021) Exploring the End-Use Quality Potential of a Collection of Spanish Bread Wheat Landraces. Plants 10:620. https://doi.org/10.3390/plants10040620
Article PubMed PubMed Central CAS Google Scholar
Ma J, Liu Y, Zhang P, Chen T, Tian T, Wang P, Che Z, Shahinnia F, Yang D (2022) Identification of quantitative trait loci (QTL) and meta-QTL analysis for kernel size-related traits in wheat (Triticum aestivum L.). BMC Plant Biol 22:1–18. https://doi.org/10.1186/s12870-022-03989-9
Article CAS Google Scholar
Malik P, Kumar J, Singh S, Sharma S, Meher PK, Sharma MK, Roy JK, Sharma PK, Balyan HS, Gupta PK (2021) Single-trait, multi-locus and multi-trait GWAS using four different models for yield traits in bread wheat. Mol Breed 41:1–21. https://doi.org/10.1007/s11032-021-01240-1
Article CAS Google Scholar
Morinaka Y, Sakamoto T, Inukai Y, Agetsuma M, Kitano H, Ashikari M, Matsuoka M (2006) Morphological alteration caused by brassinosteroid insensitivity increases the biomass and grain production of rice. Plant Physiol 141:924–931. https://doi.org/10.1104/pp.106.077081
Article PubMed PubMed Central CAS Google Scholar
Nazco R, Villegas D, Ammar K, Peña RJ, Moragues M, Royo C (2012) Can Mediterranean durum wheat landraces contribute to improved grain quality attributes in modern cultivars? Euphytica 185:1–17. https://doi.org/10.1007/s10681-011-0588-6
Article Google Scholar
Oh E, Zhu J, Ryu H, Hwang I, Wang Z (2014) TOPLESS mediates brassinosteroid-induced transcriptional repression through interaction with BZR1. Nat Commun 5:4140. https://doi.org/10.1038/ncomms5140
Article PubMed CAS Google Scholar
Olsen AN, Ernst HA, Leggio LL, Skriver K (2005) NAC transcription factors: structurally distinct, functionally diverse. Trends Plant Sci 10:79–87. https://doi.org/10.1016/j.tplants.2004.12.010
Article PubMed CAS Google Scholar
Pang Y, Liu C, Wang D, Amand PS, Bernardo A, Li W, He F, Li L, Wang L, Yuan X (2020) High-resolution genome-wide association study identifies genomic regions and candidate genes for important agronomic traits in wheat. Mol Plant 13:1311–1327. https://doi.org/10.1016/j.molp.2020.07.008
Article PubMed CAS Google Scholar
Park HS, Ryu HY, Kim BH, Kim SY, Yoon IS, Nam KH (2011) A subset of OsSERK genes, including OsBAK1, affects normal growth and leaf development of rice. Mol Cells 32:561–569. https://doi.org/10.1007/s10059-011-0178-4
Article PubMed PubMed Central CAS Google Scholar
Pascual L, Fernández M, Aparicio N, López-Fernández M, Fité R, Giraldo P, Ruiz M (2020a) Development of a Multipurpose Core Collection of Bread Wheat Based on High-Throughput Genotyping Data. Agronomy 10:534. https://doi.org/10.3390/agronomy10040534
Article CAS Google Scholar
Pascual L, Ruiz M, López-Fernández M, Pérez-Peña H, Benavente E, Vázquez JF, Sansaloni C, Giraldo P (2020b) Genomic analysis of Spanish wheat landraces reveals their variability and potential for breeding. BMC Genom 21:122. https://doi.org/10.1186/s12864-020-6536-x
Article CAS Google Scholar
R Core Team (2022) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. Accessed 6 June 2023
Rabieyan E, Bihamta MR, Moghaddam ME et al (2022) Genome-wide association mapping for wheat morphometric seed traits in Iranian landraces and cultivars under rain-fed and well-watered conditions. Sci Rep 12:17839. https://doi.org/10.1038/s41598-022-22607-0
Article PubMed PubMed Central CAS Google Scholar
Rahimi Y, Bihamta MR, Taleei A et al (2019) Genome-wide association study of agronomic traits in bread wheat reveals novel putative alleles for future breeding programmes. BMC Plant Biol 19:541. https://doi.org/10.1186/s12870-019-2165-4
Article PubMed PubMed Central CAS Google Scholar
Ramírez-González RH, Borrill P, Lang D et al (2018) The transcriptional landscape of polyploid wheat. Science 361:eaar6089. https://doi.org/10.1126/science.aar6089
Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J, Kresovich S, Goodman MM, Buckler ES (2001) Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA 98:11479–11484. https://doi.org/10.1073/pnas.201394398
Article PubMed PubMed Central CAS Google Scholar
Reuter H, Rodriguez Lado L, Hengl T, Montanarella L (2008) Continental-Scale Digital Soil Mapping Using European Soil Profile Data: Soil PH. In: Böhner J, Blascke T, Montanarella L, editors. Hamburger Beiträge zur Physischen Geographie und Landschaftsökologie. Hamburg (Germany): University of Hamburg. p. 91–102. JRC45667
Royo C, Dreisigacker S, Ammar K, Villegas D (2020) Agronomic performance of durum wheat landraces and modern cultivars and its association with genotypic variation in vernalization response (Vrn-1) and photoperiod sensitivity (Ppd-1) genes. Eur J Agron 120:126129. https://doi.org/10.1016/j.eja.2020.126129
Article CAS Google Scholar
Rufo R, López A, Lopes MS, Bellvert J, Soriano JM (2021) Identification of Quantitative Trait Loci Hotspots Affecting Agronomic Traits and High-Throughput Vegetation Indices in Rainfed Wheat. Front Plant Sci 12:735192. https://doi.org/10.3389/fpls.2021.735192
Article PubMed PubMed Central Google Scholar
Ruiz M, Rodriguez-Quijano M, Metakovsky EV, Vazquez JF, Carrillo JM (2002) Polymorphism, variation and genetic identity of Spanish common wheat germplasm based on gliadin alleles. Field Crops Res 79:185–196. https://doi.org/10.1016/S0378-4290(02)00139-9
Article Google Scholar
Ruiz M, Giraldo P, González JM (2018) Phenotypic variation in root architecture traits and their relationship with eco-geographical and agronomic features in a core collection of tetraploid wheat landraces (Triticum turgidum L.). Euphytica 214:54. https://doi.org/10.1007/s10681-018-2133-3
Saini DK, Chopra Y, Singh J, Sandhu KS, Kumar A, Bazzer S, Srivastava P (2021) Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies. Mol Breed 42:1. https://doi.org/10.1007/s11032-021-01272-7
Article PubMed PubMed Central Google Scholar
Sehgal D, Autrique E, Singh R, Ellis M, Singh S, Dreisigacker S (2017) Identification of genomic regions for grain yield and yield stability and their epistatic interactions. Sci Rep 7:41578. https://doi.org/10.1038/srep41578
Article PubMed PubMed Central CAS Google Scholar
Serrano I, Campos L, Rivas S (2018) Roles of E3 ubiquitin-ligases in nuclear protein homeostasis during plant stress responses. Front Plant Sci 9:139. https://doi.org/10.3389/fpls.2018.00139
Article PubMed PubMed Central Google Scholar
Shewry PR, Hey SJ (2015) The contribution of wheat to human diet and health. Food and Energy Secur 4:178–202. https://doi.org/10.1002/fes3.64
Article Google Scholar
Song X, Huang W, Shi M, Zhu M, Lin H (2007) A QTL for rice grain width and weight encodes a previously unknown RING-type E3 ubiquitin ligase. Nat Genet 39:623–630. https://doi.org/10.1038/ng2014
Article PubMed CAS Google Scholar
Su Z, Hao C, Wang L, Dong Y, Zhang X (2011) Identification and development of a functional marker of TaGW2 associated with grain weight in bread wheat (Triticum aestivum L.). Theor Appl Genet 122:211–223. https://doi.org/10.1007/s00122-010-1437-z
Article PubMed CAS Google Scholar
Sukumaran S, Lopes M, Dreisigacker S, Reynolds M (2018) Genetic analysis of multi-environmental spring wheat trials identifies genomic regions for locus-specific trade-offs for grain weight and grain number. Theor Appl Genet 131:985–998. https://doi.org/10.1007/s00122-017-3037-7
Article PubMed CAS Google Scholar
Tekeu H, Ngonkeu EL, Bélanger S, Djocgoué PF, Abed A, Torkamaneh D, Boyle B, Tsimi PM, Tadesse W, Jean M (2021) GWAS identifies an ortholog of the rice D11 gene as a candidate gene for grain size in an international collection of hexaploid wheat. Sci Rep 11:19483. https://doi.org/10.1038/s41598-021-98626-0
Article PubMed PubMed Central CAS Google Scholar
The International Wheat Genome Sequencing C (2014) A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science 345:1251788. https://doi.org/10.1126/science.1251788
Article CAS Google Scholar
Trethowan RM (2014) Defining a genetic ideotype for crop improvement. In: Fleury D, Whitford R (eds) Crop breeding: methods and protocols. Springer New York, New York, pp 1–20. https://doi.org/10.1007/978-1-4939-0446-4_1
Ur Rehman S, Wang J, Chang X, Zhang X, Mao X, Jing R (2019) A wheat protein kinase gene TaSnRK2. 9–5A associated with yield contributing traits. Theor Appl Genet 132:907–919. https://doi.org/10.1007/s00122-018-3247-7
Article PubMed CAS Google Scholar
Wang J, Wen W, Hanif M, Xia X, Wang H, Liu S, Liu J, Yang L, Cao S, He Z (2016a) TaELF3-1DL, a homolog of ELF3, is associated with heading date in bread wheat. Mol Breed 36:161. https://doi.org/10.1007/s11032-016-0585-5
Article CAS Google Scholar
Wang J, Wu F, Zhu S, Xu Y, Cheng Z, Wang J, Li C, Sheng P, Zhang H, Cai M (2016b) Overexpression of Os MYB 1R1–VP 64 fusion protein increases grain yield in rice by delaying flowering time. FEBS Lett 590:3385–3396. https://doi.org/10.1002/1873-3468.12374
Article PubMed CAS Google Scholar
Wang S, Zhang Z, Fan Y, Huang D, Yang Y, Zhuang J, Zhu Y (2022) Control of Grain Weight and Size in Rice (Oryza sativa L.) by OsPUB3 Encoding a U-Box E3 Ubiquitin Ligase. Rice 15:58. https://doi.org/10.1186/s12284-022-00604-1
Welsh JR, Keim DL, Pirasteh B, Richards RD (1973) Genetic control of photoperiod response in wheat. In: Sears ER, Sears LMS (eds) Proc 4th Int Wheat Genet Symp. University of Missouri, Columbia, pp 879–884
Google Scholar
Whan AP, Smith AB, Cavanagh CR, Ral JF, Shaw LM, Howitt CA, Bischof L (2014) GrainScan: a low cost, fast method for grain size and colour measurements. Plant Methods 10:23. https://doi.org/10.1186/1746-4811-10-23
Article PubMed PubMed Central Google Scholar
Wu X, Cheng R, Xue S, Kong Z, Wan H, Li G, Huang Y, Jia H, Jia J, Zhang L (2014) Precise mapping of a quantitative trait locus interval for spike length and grain weight in bread wheat (Triticum aestivum L.). Mol Breed 33:129–138. https://doi.org/10.1007/s11032-013-9939-4
Article CAS Google Scholar
Yan L, Loukoianov A, Tranquilli G, Helguera M, Fahima T, Dubcovsky J (2003) Positional cloning of the wheat vernalization gene VRN1. Proc Natl Acad Sci U S A 100:6263–6268. https://doi.org/10.1073/pnas.0937399100
Article PubMed PubMed Central CAS Google Scholar
Yan L, Loukoianov A, Blechl A, Tranquilli G, Ramakrishna W, SanMiguel P, Bennetzen JL, Echenique V, Dubcovsky J (2004) The wheat VRN2 gene is a flowering repressor down-regulated by vernalization. Science 303:1640–1644. https://doi.org/10.1126/science.1094305
Article PubMed PubMed Central CAS Google Scholar
Yan X, Zhao L, Ren Y, Dong Z, Cui D, Chen F (2019) Genome-wide association study revealed that the TaGW8 gene was associated with kernel size in Chinese bread wheat. Sci Rep 9:2702. https://doi.org/10.1038/s41598-019-38570-2
Article PubMed PubMed Central CAS Google Scholar
Yang Y, Amo A, Wei D, Chai Y, Zheng J, Qiao P, Cui C, Lu S, Chen L, Hu Y (2021) Large-scale integration of meta-QTL and genome-wide association study discovers the genomic regions and candidate genes for yield and yield-related traits in bread wheat. Theor Appl Genet 134:3083–3109. https://doi.org/10.1007/s00122-021-03881-4
Article PubMed CAS Google Scholar
Yano K, Yamamoto E, Aya K, Takeuchi H, Lo P, Hu L, Yamasaki M, Yoshida S, Kitano H, Hirano K, Matsuoka M (2016) Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice. Nat Genet 48:927–934. https://doi.org/10.1038/ng.3596
Article PubMed CAS Google Scholar
Zeven AC (1998) Landraces: A review of definitions and classifications. Euphytica 104:127–139. https://doi.org/10.1023/A:1018683119237
Article Google Scholar
Zhang L, Bai M, Wu J, Zhu J, Wang H, Zhang Z, Wang W, Sun Y, Zhao J, Sun X (2009) Antagonistic HLH/bHLH transcription factors mediate brassinosteroid regulation of cell elongation and plant development in rice and Arabidopsis. Plant Cell 21:3767–3780. https://doi.org/10.1105/tpc.109.070441
Article PubMed PubMed Central CAS Google Scholar
Zhu C, Gore M, Buckler ES, Yu J (2008) Status and Prospects of Association Mapping in Plants. Plant Genome 1:5–20. https://doi.org/10.3835/plantgenome2008.02.0089
Article CAS Google Scholar
Zhu T, Wang L, Rimbert H, Rodriguez JC, Deal KR, De Oliveira R, Choulet F, Keeble-gagnère G, Tibbits J, Rogers J, Eversole K, Appels R, Gu YQ, Mascher M, Dvorak J, Luo M (2021) Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring Genome Assembly Plant J 107:303–314. https://doi.org/10.1111/tpj.15289
Article PubMed CAS Google Scholar

Download references

Acknowledgements

The authors are grateful to J.F. Vazquez and M. Fernández for his support in plant material management.

Funding

This study was funded by the Spanish Ministry of Science and Innovation (Grants No. AGL2016-77149 and PID2019-109089RB-C32 from MCIN/AEI/https://doi.org/10.13039/501100011033), Universidad Politécnica de Madrid project VJIDOCUPM18LPB, and by Comunidad de Madrid (Spain) and Structural EU Funds 2014–2020 (ERDF and ESF) (Grant No. AGRISOST-CM S2018/BAA-4330). M. López Fernández and Julián García-Abadillo are recipients of a predoctoral fellowship from the Programa Propio of the Universidad Politécnica de Madrid.

Author information

Authors and Affiliations

Department of Biotechnology-Plant Biology, School of Agricultural, Food and Biosystems Engineering (ETSIAAB), Universidad Politécnica de Madrid (UPM), Madrid, Spain
Matilde López-Fernández, Patricia Giraldo & Laura Pascual
Department of Biotechnology and Plant Biology, Centre for Biotechnology and Plant Genomics (CBGP), Universidad Politécnica de Madrid (UPM), Madrid, Spain
Julián García-Abadillo
John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK
Cristobal Uauy
Instituto Nacional de Investigacion y Tecnologia Agraria y Alimentaria (INIA), CSIC, Autovía A2, Km. 36.2. Finca La Canaleja, 28805, Alcalá de Henares, Madrid, Spain
Magdalena Ruiz

Authors

Matilde López-Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Julián García-Abadillo
View author publications
You can also search for this author in PubMed Google Scholar
Cristobal Uauy
View author publications
You can also search for this author in PubMed Google Scholar
Magdalena Ruiz
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Giraldo
View author publications
You can also search for this author in PubMed Google Scholar
Laura Pascual
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

LP and PG are responsible for conceptualization, methodology, investigation, supervision, funding acquisition and writing-original draft. MLF involved in methodology, investigation, formal analysis, and writing-original draft. MR involved in investigation, and writing-review and editing. JGA involved in formal analysis. CU involved in supervision and writing-review and editing. All authors have revised and approved the final manuscript.

Corresponding author

Correspondence to Patricia Giraldo.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Communicated by Peter Langridge.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 57 KB)

Supplementary file2 (DOCX 13 KB)

Supplementary file3 (XLSX 100 KB)

Supplementary file4 (XLSX 13 KB)

Supplementary file5 (PDF 256 KB)

Supplementary file6 (PDF 360 KB)

Supplementary file7 (PDF 241 KB)

Supplementary file8 (PDF 283 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

López-Fernández, M., García-Abadillo, J., Uauy, C. et al. Genome wide association in Spanish bread wheat landraces identifies six key genomic regions that constitute potential targets for improving grain yield related traits. Theor Appl Genet 136, 244 (2023). https://doi.org/10.1007/s00122-023-04492-x

Download citation

Received: 16 June 2023
Accepted: 24 October 2023
Published: 13 November 2023
DOI: https://doi.org/10.1007/s00122-023-04492-x

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Genome wide association in Spanish bread wheat landraces identifies six key genomic regions that constitute potential targets for improving grain yield related traits

Abstract

Key message

Abstract

Similar content being viewed by others

Introduction

Material and methods

Plant material and phenotyping

Genetic analysis

Genome-wide association study

Identification of candidate genes

Results

Uncovering the phenotypic diversity in Spanish bread wheat landraces

Linkage disequilibrium along the chromosomes differed between homoeologous

Numerous marker trait associations were identified by GWAS

Targeting high density MTA-QTL regions along the genome

Discussion

Spanish bread wheat landraces present a wide range of phenotypic diversity

LD along the genome can be linked to the available genomic diversity

GWAS in a collection of Spanish landraces uncover novel yield related MTA-QTLs

Dissection of high-density MTA-QTLs genomic regions identified new putative genes related with wheat yield

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation