Introduction

Wheat (Triticum aestivum L.) is one of the most important food crops worldwide, and it is also the source of major cooking foods in China. With the improvement of people’s living standards, the requirements for wheat colour have also increased significantly. Moreover, flour colour significantly influences the quality of wheat end-use products1,14. Therefore, it is necessary to pay attention to the study of flour colour to improve the quality of wheat products to meet the development needs of the market.

Flour colour and whiteness are important indexes of wheat flour quality that can reflect flour quality and milling precision and are also important indicators of flour grading. Flour colour (whiteness) is a quantitative trait controlled by multiple genes with high heritability2,3,4. Some researchers found that the flour brightness index (L* value), the redness index (a* value), and the yellowness index (b* value) could be used to evaluate flour colour5,6,7. In 2002, the CIE-L*, a*, b* colour system was introduced for the colour determination of Chinese flour and noodles8.

Although there have been many studies on the quantitative trait locus (QTL) mapping of wheat flour quality-related traits9,10, genetic dissection of the flour and dough sheet colours was rarely conducted at the same time. Through QTL mapping, some QTLs that control the stability of noodle colour traits were found on chromosomes 2A and 2D using DH populations11. The major QTLs/genes that affect the brightness of wheat flour were also detected on chromosomes 4D, 4A and 5D, ranging from 10 to 23.4% of PVE, and some QTLs associated with the yellowness of wheat flour were detected on chromosomes 7A and 4D12. However, for flour colour-related traits, the genotype rather than the environment was the main determinant13. In addition, the dough sheet colour is also important for noodle quality. The sheet colour score accounts for 9–45% of the total score when evaluating noodle quality14. With respect to dough sheet colour, the L* value is significantly affected by the protein content, and the a* value is highly correlated with the yellow pigment content. However, with respect to fresh dough sheets, PPO (polyphenol oxidase) activity significantly decreases the L* value, and highly significant positive correlations between the a* and b* values of fresh dough colour and PPO activity were reported from 4 to 24 h under resting conditions15. However, there is little information on the genetic dissection of the colour of dough sheets.

A genome-wide association study (GWAS) is a powerful approach for mapping economically and biologically valuable traits of germplasm collections. This method has great potential in determining the locus responsible for a specific phenotype and for analysing the complex traits of crops16. Recently, GWAS has been widely used to study the complex traits of various plant species, such as Arabidopsis17,18, rice19,20, corn21,22,23 and durum wheat24. Although this method has also been used in wheat25,26,27,28,29, genome-wide association mapping of the colour-related flour and dough sheet traits has rarely been reported.

Therefore, the present study used GWAS to dissect the whiteness and colour-related traits of flour and dough sheets using 24,355 single nucleotide polymorphisms (SNPs) genotyped using the 90K Illumina iSelect array in a population of diverse winter wheat varieties. The objective of this study was to identify markers and candidate genes for loci associated with these traits to improve wheat flour and dough colour quality by breeding.

Results

Phenotypic variation in colour traits of flour and dough sheets

The phenotypic values for flour whiteness, flour colour and dough sheet colour within four environments (TA 2015, DZ 2015, TA 2016, DZ 2016) are shown in Tables 1 and 2. Extensive phenotypic variation for these traits was observed across four environments (i.e., two growing seasons and two locations) among the 205 winter wheat accessions. The whiteness, L*, a* and b* values of the flour and dough sheets were continuously distributed in the population (Fig. S1 and Fig. S2), which like typical quantitative traits, indicating that they were genetically controlled by multiple genes. Analysis of variance showed significant differences in the flour whiteness and L*, a* and b* values of the flour and dough sheets (P < 0.0001) among genotypes and environments, as well as G × E interactions (Table S1). The h2 values of flour whiteness and the b* value were 98% and 99%, respectively (Table 1). The h2 values of the a* and b* values of the fresh dough sheets (FDS) and the b* value of the dry dough sheets (DDS) were 50%, 48% and 46%, respectively (Table 2).

Table 1 Phenotypic analysis of flour whiteness and colour in two locations in 2 years.
Table 2 Phenotypic analysis of fresh and dry dough sheet colour in two locations in 2 years.

Marker–trait associations (MTAs) of colour traits of flour

In total, 24,355 mapped SNPs were used for MTA analysis28,53. Twenty-two MTAs associated with flour whiteness (P < 10–4) were distributed on chromosomes 1A, 1B, 2B, 3A, 5D, 6A, and 7B (Table 3; Fig. 1). There were four SNP loci (Excalibur_c4709_576, TA001505-1171, BS00000020_51, RAC875_c34446_396) detected in two or more environments. The BS00000020_51 locus had the maximum phenotypic variation explained (PVE) with 15.95%. Alleles C and T of marker BS00000020_51 on chromosome 5D were associated with the largest phenotypic difference (2.13) (Table S2). The phenotypic value of flour whiteness associated with BS00000020_51-C on chromosome 5D was significantly higher than that associated with BS00000020_51-T across all four environments, indicating that the contribution of BS00000020_51-C locus to flour whiteness was better than that of BS00000020_51-T locus (Table S2).

Table 3 SNP markers significantly associated with flour whiteness in two locations in 2 years (P < 10–4).
Figure 1
figure 1

Manhattan plot of flour whiteness and flour colour.

In total, 6 SNP loci on chromosomes 1B, 2D, 3A, 5B, 5D and 6A were associated with flour colour brightness (L* value) (P < 10–4) (Table 4; Fig. 1). Three SNP loci (Kukri_c33486_128, BS00000020_51, and GENE-4011_91) were identified in more than one environment. The BS00000020_51 locus located on the 5D chromosome was detected in the four environments, with a maximum PVE of 12.28%. Alleles C and T of marker BS00000020_51 were associated with the largest phenotypic differences (0.57) (Table S2).

Table 4 SNP markers significantly associated with the brightness index (L* value) of flour colour in two locations in 2 years (P < 10–4).

Seventeen MTAs associated with flour colour redness (a* value) (P < 10–4) were distributed on chromosomes 1A, 3A, 6A, 6B, 7A, 7B and 7D. Among them, eleven MTAs were identified in two or more environments (Table 5; Fig. 1). Six SNP loci (Excalibur_c8883_214, Excalibur_rep_c114255_439, Kukri_c65663_642, Excalibur_rep_c92684_578, Excalibur_c5938_1669, Excalibur_c5938_1703) were detected in three environments. Alleles A and G of marker Kukri_c65663_642 were associated with the largest phenotypic differences (0.38) (Table S2).

Table 5 SNP markers significantly associated with the redness index (a* value) of flour colour in two locations in 2 years (P < 10–4).

Eleven MTAs on different chromosomes, i.e., 1A, 3B, 4A, 5B, 5D and 6A, were associated with flour colour yellowness (b* value) (P < 10–4) (Table 6; Fig. 1). Of these, TA003858-0637 (6A) and BS00000020_51 (5D) were found in three and four environments, respectively. Alleles A and G of marker Kukri_c1214_437 on chromosome 5B were associated with the largest phenotypic differences (0.79). The phenotypic value of flour colour yellowness (b* value) associated with Kukri_c1214_437-G was significantly higher than that associated with Kukri_c1214_437-A across all four environments, indicating that Kukri_c1214_437-G was better than Kukri_c1214_437-A for the b* value (Table S2).

Table 6 SNP markers significantly associated with the yellowness index (b* value) of flour colour in two locations in 2 years (P < 10–4).

In general, MTAs consistently identified in more than one environment were considered to be stable. There were four, three, eleven and eleven stable SNP loci for flour whiteness and the L*, a* and b* values, respectively. There was one SNP marker, BS00000020_51, associated with both flour whiteness and flour L* and b* values.

MTAs of the colour traits of dough sheet

Ten MTAs significant at the P ≤ 10–5 level for the L* value of dough sheet colour were detected, similar to the a* and b* values of the FDS and DDS (Table 7; Fig. 2). Of these, only one locus, GENE-1258_171 on chromosome 2A, was found in two environments, with a maximum PVE of 10.51%.

Table 7 Loci significantly associated with dough sheet colour in two locations in 2 years.
Figure 2
figure 2

Manhattan plot of the L* value and the a* value of fresh dough sheet and the b* value of dry dough sheet.

Prediction of candidate genes for colour traits of flour

Markers with high PVE values were selected from loci significantly associated with flour whiteness and flour colour for prediction (Table S3). Some important loci were identified. There were twenty-one candidate genes predicated. Of which, the marker BS00000020_51 on chromosome 5D had two candidate genes, TraesCS5D01G004300 and Gsp-1D, from wheat (Table S3). The functions of these genes are related to grain hardness, that is, puroindoline-b and an arabinogalactan peptide, respectively. The candidate gene of the marker Excalibur_c4709_576 on 2B in wheat and Arabidopsis (mouse-ear cress) also participates in glycerol-3-phosphate O-acyltransferase activity and protein self-association. The marker BS00027770_51 on chromosome 6B had the candidate gene TraesCS6B01G419200, whose functions is related to the flavonoid biosynthetic process and regulation of jasmonic acid mediated signalling pathway. The candidate gene TraesCS5A01G003800 for the marker BS00099534_51 on chromosome 5A is related to the fucose metabolic process and protein phosphorylation. The marker TraesCS6A01G241200 on chromosome 6A had the candidate gene TA005690-1190, whose function is related to the ketone body biosynthetic process, leucine catabolic process and lipid metabolic process. These candidate genes may be related to flour colour, and their functions will be explored in future research.

Prediction of candidate genes for dough sheet colour

Markers with high PVE values were selected from loci significantly associated with dough sheet colour for prediction. Five new genes have been predicated. The marker BS00065510_51 on chromosome 1D had the candidate gene TraesCS1D01G070200 from wheat (Table S4). The functions of this gene are related to ATP binding, RNA binding and RNA helicase activity. The candidate gene TraesCS1D01G070300 of the marker BS00065722_51 on 1D in Oryza barthii also participates in the RNA catabolic process. The marker GENE-1258_171 on chromosome 2A had the candidate gene TraesCS2A01G593500 from Oryza barthii, Aegilops tauschii and wheat. The functions of this gene are related to protein tyrosine kinase activity and protein serine/threonine kinase activity. The gene TraesCS5A01G003800 is predicated on chromosome 5A for maker BS00099534_51, whose function is related to catalytic activity and transferase activity participating in fucose metabolic process. The marker BobWhite_c15802_72 on chromosome 6A had the candidate gene TraesCS6A01G024900 from wheat, soybean (Glycine hispida) and Arabidopsis (mouse-ear cress). The functions of this gene are related to the response to oxidative stress, adenine salvage activity and the carbohydrate metabolic process. These candidate genes may be related to dough sheet colour, and their functions will be explored in future research.

Discussion

Flour whiteness and flour and dough colour-related traits are critical determinants for the end-use product quality of wheat. Therefore, it is important to identify some major and stable loci for these traits and then transfer these favourable alleles into commercial varieties. Although some loci for these traits were found in previous studies in different populations11,30,31,32,33,34 using QTL mapping methods, genome-wide association studies (GWAS) for these traits are rarely conducted using single nucleotide polymorphism markers (SNPs). Therefore, it is still important to find new major and stable loci introgressed into commercial cultivars using GWAS.

Zhai et al.34 constructed a genetic map that included 8227 SNP markers using an RIL population and found fifty-six QTLs. However, in the present study, a total of 24,355 SNP markers were mapped for MTA analysis using a panel of varieties, and more new loci were found than in a previous study. Some chromosomes for flour colour traits were involved in QTL mapping, including mainly homoeologous group 1, 2, 5, and 7 chromosomes, and chromosomes 3B, 4A, 4B and 6B34,35. Some QTLs detected for the a* value were found on the 1B and 3B chromosomes36, and other involved chromosomes were mainly 1B, 1D, 2D, 4A, 4D and 7B. In this study, loci associated with these traits were found on chromosomes 6A, 4A and 3A in addition to the above chromosomes. Zhai et al.37 performed GWAS on 166 bread wheat cultivars using the wheat 90 and 660K SNP arrays and 10 allele-specific markers, and identified 100 MTAs for flour color-related traits. This indicated that GWAS and QTL mapping could be complementary to each other.

PPO activity and the yellow pigment content have been reported to affect flour whiteness and colour38,39,40. Previous studies showed that PPO activity is mainly controlled by the genes on the homoeologous group 2 chromosomes, particularly 2A and 2D41,42. In the present study, significant SNP loci markers on chromosomes 2B, 2D and 2A were associated with flour whiteness, the flour L* value and dough sheet colour, but these SNP loci showed be different from PPO genes loci by comparing their physical positions. Excalibur_c4709_576 and TA001505-1171 on chromosome 2B were significantly associated with flour whiteness in two environments. Their candidate gene prediction indicated that the function was related to glycerol-3-phosphate O-acyltransferase activity, which participates in the fatty acid biosynthetic process. Previous research showed that lipoxygenase affects flour whiteness and colour43. Therefore, these loci may influence flour whiteness by lipoxygenase but not PPO activity. Kukri_c33486_128, on chromosome 2D, which was significantly associated with the flour L* value, was stably identified in three environments, but its candidate gene and function were not predicted in the BLAST search, which indicated that this locus is new and should be further studied in the future. GENE-1258_171 on chromosome 2A was significantly associated with the dough sheet L* value in two environments. Its predicted gene is TraesCS2A01G593500, whose function is related to protein kinase activity, so this locus affects the dough sheet colour, perhaps as a result of the grain protein content.

Previous studies showed that the yellow pigment content is mainly controlled by chromosomes 7A and 7B; moreover, the Psy1 gene was reported to co-segregate with the b* value and yellow pigment content35,44,45 In the present study, significantly associated loci were also found on these two chromosomes. Only one SNP marker associated with flour whiteness was detected on chromosome 7B. Although no loci were found on these two chromosomes for the flour b* value, nine SNP markers were significantly associated with the flour a* value. Moreover, they seemed to be stable in multiple environments. Three SNP markers were found on chromosome 7A, and their function is related to leucine rich repeat family protein expression. The other six SNP markers on chromosome 7B have one candidate gene, TraesCS7B01G482200, which is related to sucrose synthase activity. Therefore, the mechanism of their influence on flour colour is different from that of the Psy gene and yellow pigment content, which needs to be further studied in the future.

In addition, the flour whiteness colour is also affected by milling characteristics32,46. However, the milling characteristics are influenced by grain hardness, so the grain hardness affects the flour whiteness and colour. Zhai et al. identified the QTL QFL.caas-5D-1, which is close to the Pin-b gene, with a distance of 2.1 cM. The QTL for the L* value on chromosome 5DS coincided with the hardness (Ha) locus in previous studies32,46. In the present study, the SNP marker BS00000020_51 on chromosome 5D was significantly associated with flour whiteness and the L* and b* values and was stably detected in multiple environments. Therefore, this locus is important for flour whiteness and colour. Through a comparison of Zhai et al.’s results34, this SNP marker (BS00000020_51) was also found on their genetic map and was close to the closest marker BobWhite_s67669_117 of the QTL, with a distance of 2.1 cM. This indicated that the SNP marker BS00000020_51 controls flour whiteness and colour through grain hardness. We found two candidate genes, TraesCS5D01G004300 and Gsp-1D, by candidate gene prediction. Of these, Gsp-1D is the grain softness protein-1 gene that is linked to grain hardness, which affects flour particle size. Moreover, flour whiteness showed negatively correlated with flour particle size47, so it is possible that Gsp-1D affect the flour whiteness. Therefore, the accuracy of the results is confirmed.

Most interestingly, the SNP markers found on chromosome 6A were significantly associated with flour whiteness and colour. The markers RAC875_c34446_396 and GENE-4011_91 were at the same position, i.e., 49 cM, and were associated with flour whiteness and the flour L* value, respectively. The function of the candidate gene PUP88 was related to hydrolase activity and the hydrolysis of O-glycosyl compounds, participating in the carbohydrate metabolic process. However, at position 79 cM, five SNP markers detected in more than one environment were associated with the flour a* and b* values. The functions of the candidate gene TraesCS6A01G241200 of the SNP marker TA005690-1190 were different from those of other candidate genes. The TraesCS6A01G241200 gene participates in the ketone body biosynthetic process and lipid metabolic processes, which affect flour colour. Other candidate genes were mainly related to proteins. Moreover, the SNP marker BobWhite_c15802_72 on chromosome 6A was associated with dough sheet colour. Its function is related to peroxidase activity in soybean, but in wheat, the biological process is not clear. In addition, one special locus, the BS00027770_51 marker, associated with the flour a* value, was identified on chromosome 6B. Its candidate gene participates in the flavonoid biosynthetic process and regulation of jasmonic acid in Arabidopsis thaliana, but the function in wheat remains unknown. The above loci were not found in previous studies.

Flour whiteness and colour-related traits are inherently correlated. Three important loci on chromosomes 5D and 6A were identified. The SNP locus at genetic position 103 cM of chromosome 5D was involved in both flour whiteness and the b* value; the loci at genetic position 49 cM of chromosome 6A influenced flour whiteness and the flour L* value, and the last locus at genetic position 79 cM of chromosome 6A was associated with the flour a* and b* values. These relationships were reflected by the correlation coefficients (Table S5; Table S6), in agreement with previous studies34. Their physical positions were seen in Table S7. Therefore, genes with pleiotropic effects may explain the genetic basis of trait correlation. Pleiotropic effects were observed for dough sheet colour-related traits.

Conclusions

This study provided the important information about the influence of proteins, lipoxygenase and grain hardness on the flour whiteness and colour in addition to PPO activity and yellow pigment at the molecular level. GWAS is a good method for identifying new, important, stable loci. SNP markers significantly associated with flour whiteness and colour detected in this study provide opportunities for MAS of traits that are difficult to phenotype at the early stages of wheat breeding.

Materials and methods

Ethics statement

All samples analysed in our study adhered to all local, national or international guidelines and legislation, and no ethical approval was required.

Plant material

The association mapping panel of 205 wheat genotypes for GWAS comprised 77 released cultivars, 55 landraces including two lines from Mexico and France, and 73 breeding lines from 10 provinces that represent the major winter wheat production regions in China27,28,48.

Growth conditions

The seeds used for the association mapping panel were planted in the 2015 and 2016 growing seasons in the experimental fields of two locations, that is, the Shandong Agricultural University, Tai’an (TA) location (116°36′ E, 36°57′ N) and the Dezhou Institute of Agricultural Sciences, Dezhou (DZ) location (116°29′ E, 37°45′ N). E1, E2, E3 and E4 represented Tai’an location in 2015 (TA 2015), Dezhou loction in 2015 (DZ 2015), Tai’an location in 2016 (TA 2016), and Dezhou location in 2016 (DZ 2016), respectively. The experimental field was arranged in a completely randomized block design, with two replicates in each environment. All lines were grown in 2 m plots with 3 rows spaced 25 cm apart, and 70 seeds were evenly broadcast in each row. During the growing seasons, all recommended local crop management practices were followed, and damage attributed to lodging, disease, or pests was not observed.

Phenotypic trait evaluation

Flour milling was carried out by using a Bühler experimental mill (Bühler mill, Bühler-Miag Company, Braunschweig, Germany) with a flour extraction yield of approximately 70% in all samples stored for approximately 1 month after being harvested. The samples were tempered to 14%-16% moisture content according to grain texture overnight before milling.

Flour whiteness was determined by a WSB-IV intelligent whiteness determination meter (Dajiguangdian Instruments, Hangzhou, China)36. The working principle is to measure the absolute spectral diffuse reflectance using a photometry integrating sphere. The peak wavelength of the spectral power distribution of Y10 whiteness optical system is 475 nm and the half wave width is 44 nm. The standard of Y10 optical system accords with national standard GB3979 of China.

Dough making was performed according to49 with minor modifications. Flour and water were mixed to achieve 44% absorption by slow mixing at a low speed for 5 min, followed by mixing at a medium speed for 2 min using a Kitchen Aid Professional Mixer (KPM5, St. Joseph, MI, USA). During the resting stage, crumbly dough was placed in a stainless steel bowl for 20 min at room temperature. The crumbly dough was then hand kneaded into a stiff mass and passed through an automatic noodle maker (JMTZ-14, Dongfang Fude Technology Development Center, Beijing, China) three times to form a noodle sheet at a 2.0 mm roll gap-setting. The dough sheet was then folded twice and passed through six different roll gaps (3.5, 3.0, 2.5, 2.0, 1.5, and 1.0 mm). Then, the fresh dough sheet was cut into approximately 6 small sheets (length 10 cm, width 5 cm, thickness 1.0 mm). The fresh sheet was dried in the oven for 24 h at 40 °C.

The colour-related traits of the flour and dough sheet

The flour colour parameters (L*, a* and b*) were measured with a Minolta colorimeter (CR-300, Minolta Camera Co., Ltd., Osaka, Japan) using the commission Internationale de l’éclairage (CIE) L* a* b* colour system50. The L* value indicates the lightness of flour with a range of 0–100 representing darkness to lightness (L* = 0 means black, L* = 100 means white, and the middle value is a grey transition with different brightness). The a* value indicates the red-green direction, that is, it designates redness when positive but greenness when negative. The b* value indicates the degree of the flour yellow-blue colour, that is, the higher b* value denotes a greater amount of yellow51.

The dough sheet colour was also measured by a Minolta CR-300 colour meter. Three points were measured to determine the fresh dough sheet (FDS) (uncooked) at 0 h and the dry dough sheet colour at 24 h per noodle sheet, each at a different location on the same side of the surface of the noodle sheet52.

The colorimeter parameter of each sample was measured three times, and the mean values were used for subsequent statistical analysis.

Statistical analysis

Analysis of variance (ANOVA) and correlations among phenotypic traits were carried out using the statistical software SPSS version 17.0 (SPSS Inc., Chicago, IL, USA). Heritability (h2) was calculated as hB2 = σg2/(σg2 + σge2/r + σe2/re), where σg2, σge2, and σe2 are estimates of genotype, genotype × environment and residual error variances, respectively, r is replicates, and re is the product of replicates and number of environments. The estimates of σg2, σge2, and σe2 were obtained from variance estimates included in the ANOVA, which was performed using the PROC GLM procedure of SAS 8.0 (SAS Institute Inc., Cary, NC, USA).

Genome-wide association analysis

SNP markers, genotyping and the population structure of the samples were reported previously27,28. Based on this information, significant marker–trait associations (MTAs) were identified using a mixed linear model (MLM) in TASSEL3.0. The P-value was used to determine whether a QTL was associated with a marker. The R2 value was used to evaluate the magnitude of the MTA effects. The genome-wide significance threshold (P-value ≤ 0.001) was given. SNPs with P-value ≤ 0.001 were considered to be significantly associated with phenotypic traits. When the MTA locus was detected in two or more environments, it was considered a stable association site48.

Forecasting candidate genes for flour whiteness and dough sheet colour-related traits

To identify the position of important MTA loci on a physical map and possible candidate genes, significant markers detected in this study were used to identify putative candidate genes. A BLAST (Basic Local Alignment Search Tool) search was performed on the International Wheat Genome Sequencing Consortium database (IWGSC; http://www.wheatgenome.org/, 20th November 2020) using the sequence of the significant SNP markers identified by GWAS. When an SNP marker sequence from the IWGSC was 100% identical to any wheat contig, the sequence was extended 2 Mb for each marker using the IWGSC BLAST results. Then, the extended sequence was used to run a BLAST search at the National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov, 20th November 2020) and Ensembl Plants (http://plants.ensembl.org/Triticum_aestivum/Tools/Blast, 20th November 2020) to confirm possible candidate genes and functions.

Ethics approval and consent to participate

Wheat is a common crop extensively cultivated in the world. This study does not contain any research requiring ethical consent or approval.