Identification of a major QTL underlying sugar content in peanut kernels based on the RIL mapping population

High sugar content in peanut seeds is one of the major breeding objectives for peanut flavor improvement. In order to explore the genetic control of sugar accumulation in peanut kernels, we constructed a recombinant inbred line population of 256 F2:6-7 lines derived from the Luhua11 × 06B16 cross. A high-resolution genetic map was constructed with 3692 bin markers through whole genome re-sequencing. The total map distance was 981.65 cM and the average bin marker distance was 0.27cM. A major stable QTL region (qSCB09/qSSCB09) was identified on linkage group (LG) B09 associated with both sucrose content (SC) and soluble sugar content (SSC) explaining 21.51-33.58% phenotypic variations. This major QTL region was consistently detected in three environments and mapped within a physical interval of 1.56 Mb on chromosome B09, and six candidate genes were identified. These results provide valuable information for further map-based cloning of favorable allele for sugar content in peanut.


Introduction
Peanuts (Arachis hypogaea L.) are widely cultivated worldwide as food and oil crops.Peanuts are not only used as a rich source for vegetable oil extraction, and are also rich in protein and micronutrients (Toomer, 2018).In recent years, the increase in both production and consumption of fresh roasted peanuts has resulted in demands for improved sweetness in peanut kernels, which is highly correlated with flavor (Pattee et al., 1998;Guo et al., 2023b).The content of soluble sugars in the kernels will directly determine peanut sweetness (Guo et al., 2023b).The color and flavor of roasted peanut depends on peanut sugar content and its interaction with other components under heating conditions (McDaniel et al., 2012).Soluble sugars in peanut include sucrose, fructose, and glucose, with sucrose being for the highest proportion (McDaniel et al., 2012).
Genetic analysis suggested that sucrose content in peanut kernels is controlled by two additive-dominance-epistasis major genes plus an additive-dominance-epistasis polygene (Qin et al., 2021).However, a lack of favorable alleles or genes has hindered breeding progress, given that the sugar content of seeds is influenced by various factors, including abiotic and biotic stresses, environmental conditions such as temperature, soil moisture, freezing, seed maturity, growth conditions, and genotype (Maughan et al., 2000;Pattee et al., 2000;Knizia et al., 2023;Guo et al., 2023b).Quantitative trait locus (QTL) mapping is a powerful method to identify alleles or genes controlling complex traits.Over the past few decades, increasing number of peanut QTLs associated with sucrose content have been reported by BSA-seq, GWAS, and genetic mapping (Li et al., 2023;Guo et al., 2023aGuo et al., , 2023b;;Zhang et al., 2023b;Huai et al., 2024;Wang et al., 2024).Although these studies made advances in dissecting the genetic mechanism of sucrose content, these is still much to be learned about this trait in peanut.In particular, it is still difficult to identify the most promising QTL for genetic cloning.
In this study, a population of 256 recombinant inbred lines (RILs) was utilized to map the QTLs for sucrose content and soluble sugar content in peanut kernel via high throughput sequencing.The objectives of this study were: (1) to construct a high-density genetic map (HDGM) according to the whole genome re-sequencing protocol; (2) to determine genomic regions that are associated with the sucrose content and soluble sugar content for kernels; (3) to identify the candidate genes for the major stable QTLs.

Plant material and phenotyping
An F 2:6 populations of 256 RIL lines derived from a cross between 'Luhua11' and '06B16' was developed at the experimental station of Shandong Peanut Research Institute, Qingdao, China.'Luhua11' cultivar is large-seeded with low sugar content.The genotype '06B16' is small-seeded with high sugar content.The RIL population and its parental lines were planted in experimental fields in Laixi (N 36.86°,E 120.53°), Shandong Province (planted in May and harvested in September of 2021 and 2022); in Greenhouse of Laixi (N 36.86°,E 120.53°), Shandong Province (planted in May and harvested in September of 2022); and Weihai (N 37.24°, E 122.37°), Shandong Province (planted in May and harvested in September of 2023).The field and greenhouse experiments both followed a randomized block design with three replicates.For each plot, 10 plants from each RIL line were grown 15-cm apart within a row, with an 85-cm gap between RILs.Standard agricultural practices were applied for field management.Each plant was harvested individually at its maturity to prevent loss from overripening.Only eight plants in the middle of each row were used for trait measurement.About 20 seeds from each line were used to calculate seed sugar concentration using near-infrared reflectance (NIR) spectroscopy following the manufacturer's protocol (Spectra Star XL, Unity, USA) (Tang et al., 2018).

Statistical analysis of phenotypic data
The mean value and standard deviation of each trait for the parents and each RIL line were analyzed, and Student's t-tests were conducted using IBM ® SPSS ® statistics 19.The normality of the population data was analyzed using Kolmogorov-Smirnov tests.Using the equation h 2 =s g 2 /(s g 2 +s ge 2 /n+s ϵ 2 /nr), the broad-sense of heritability(h 2 ) for traits was calculated using ANOVA analysis with QTL IciMapping V4.2 (Meng et al., 2015).

DNA extraction, resequencing and SNP calling
At the seedling stage, young leaf tissue from recombinant inbred lines (RIL) lines and both parental plants were collected for DNA isolation.Genomic DNA was sheared to 350 bp fragments using a Covaris ® disruptor.Subsequently, the DNA fragments underwent a series of steps in the library preparation process, including end repair, addition of polyA tails, addition of sequencing adapters, purification, and PCR amplification, culminating in the completion of the entire library preparation process (Illumina Inc., San Diego, CA, USA).High-throughput sequencing was performed on an Illumina Nova6000 platform at Lianchuan Biotechnology Corporation (Hangzhou, China), to generate 150-bp paired-end reads.The sequencing depths for the two parents and the RILs were approximately 20× and 3×, respectively.After filtering the reads to eliminate adapters and lowquality reads, the remaining clean reads were aligned to the reference genome (https://data.legumeinfo.org/Arachis/hypogaea/genomes/Tifrunner.gnm2.J5K5/) (Bertioli et al., 2019) using the BWA-0.7.10.The uniquely mapped reads were used to call SNPs with SAMtools v0.1.19and GATK v3.3.0.Homozygous and polymorphic SNPs between the two parents were selectively retained for analysis within the RIL population.SNPs meeting the criteria of a missing rate and heterozygosity rate both less than 10%, along with a sequencing depth exceeding 3×, were employed for subsequent analyses and the construction of the genetic map.

High density genetic map construction
SNPs were consolidated through binning using the genotype information of the parents.SNPBinner software (Gonda et al., 2019), which employs a Hidden Markov model (HMM) to determine recombination breakpoints and construct co-separation bins, was utilized in this study.To build a high-quality linkage map, bins with over 50% of samples with missing genotypes and severe partial separation were discarded.QTL IciMapping V4.2 software was applied to construct the linkage map, employing the 2-opt algorithm and performing rippling local optimization with a window size of 5 (Meng et al., 2015).The genetic positions of markers on each linkage group (LG) are presented in Supplementary Table 4.In addition, a co-linearity map was generated to evaluate the map quality.The linkage group number corresponds to the chromosome number assigned by the Tifrunner reference genome, indicating a high degree of consistency between genetic and physical maps.

QTL mapping
The R/qtl package (Broman et al., 2003) was used to detect QTL and confirm the relationship between different markers around each QTL with the composite interval mapping method (CIM).The permutation test was conducted 1000 times with probability larger than 1% as the cutoff value.A logarithm of the odds (LOD) threshold value of 2.5 was employed to declare the presence of a QTL at a significance level of 99%.The positive and negative additive effect that represented the favorable alleles were from 'Lu11'and '06B16', respectively.

Candidate gene mining and expression analysis
All the genes in the QTL region were annotated in four databases (NR, Swiss-Prot, GO, KEGG).Transcription data of genes in different tissues were obtained from the PeanutBase and used to graph a heatmap at https://software.broadinstitute.org/morpheus/ (Clevenger et al., 2016).A heatmap indicating the relative expression levels of candidate genes was constructed at https://www.omicstudio.cn/tool.Secondary structure prediction for candidate protein was performed with an online tool (PSIPRED) at http://bioinf.cs.ucl.ac.uk/psipred.

Phenotypic variations of sugar content in parents and RIL individuals
The mature peanut kernels of '06B16' had higher sucrose and soluble sugar content (5.91 ± 0.61% and 7.75 ± 0.32% on average, respectively) than those of 'Lu11' (5.49 ± 1.00% and 7.47 ± 0.95%, respectively) across four environments (Table 1).Phenotypic data of the RIL population demonstrated skewed distribution (Figure 1; Table 1), indicating polygenic inheritance of these two traits.Transgressive segregation was observed in most environments (Figure 1).Sucrose content in the RIL population ranged from 3.33% to 7.55% in the 2021 Laixi, 3.81% to 8.08% in the 2022 Laixi, 2.35% to 6.54% in the 2022 Greenhouse, 3.46% to 7.88% in 2023 Weihai.The soluble sugar content of the RIL population ranged from 3.80% to 8.76% in the 2021 Laixi, 6.08% to 10.98% in the 2022 Laixi, 3.60% to 9.35% in the 2022 Greenhouse, 4.98% to 9.82% in the 2023 Weihai.ANOVA (analysis of variance) results indicated that the effects of genotype (G), environments (E), and interaction of G and E (G ×E) were all significant (Table 2).The sucrose content and soluble sugar content exhibited relatively high broad-sense heritability (h 2 ), ranging from 0.78 to 0.71 (Table 2).

Resequencing and SNP identification
Approximately 2362.75G cleaned data (Q20 > 94%) were generated, including 15.75 billion reads (Supplementary Table 1).About 368.86 and 304.57 million reads were obtained for the female and male parents, respectively, whereas 40.58-200.37 million reads were obtained for the RILs (Supplementary Table 1).More than 99% of the reads were mapped to the reference genome (Supplementary Table 2).A total of 924,882 polymorphic SNPs were identified for the two parents, of which 286,582 were homologous and used to screen for SNPs in the RILs.GATK software was also used to detect variation, the uniformly output SNPs were retained as reliable and used for the following analysis (Supplementary Table 3).

Construction of the high-density genetic map
Redundant markers were removed from the filtered SNP set using SNPBinner (Gonda et al., 2019).In total, 3,875 bin markers (83,642 SNPs) were detected.After filtering the low quality bin markers, 3692 bin markers (82,292 SNPs) were used to construct the high density genetic map (HDGM) and were assigned to 20 linkage groups (LGs) (Figure 2; Supplementary Table 4).The whole map length was 981.65cM, with the genetic distance ranging from 26.18 cM to 82.74 cM for each LG with an average marker distance of 0.27cM (Figure 2; Table 3).LG B03 was the longest group covering a distance of 82.74 cM with 401 loci, while LG A10 was the shortest group spanning 26.18 cM with 91 loci (Table 3).Subsequently, the degree of map uniformity and inter-marker linkage were evaluated by the percentage of 'Gaps ≤ 5 cM', which ranged from 98.80% to 100% with an average value of 99.94%.The largest gap existed on LG B01 which was 12.77 cM.In order to evaluate the precision of the high density genetic map, we performed collinearity analysis by comparing the genetic positions of markers on each LG with their corresponding physical positions.The collinearity between the genetic and genomic positions was remarkably high, with all values surpassing 99.99% (Supplementary Table 5; Supplementary Figure 1), providing strong evidence of a thoroughly organized marker assignment.
QTL mapping identified 11 QTLs for soluble sugar content with 4.42%-30.45%PVE.These were distributed on LG07, B06, and B09 (Figure 3; Table 4).The major QTL region was on LG19 (qSSCB09.1,and qSSCB09.2) was consistently detected at all four environments, with LOD scores of 9.68-20.19(Table 4).The additive effect was between -0.46 and -0.34, implying the high soluble sugar content allele was derived from the male parent '06B16'.The other seven QTLs were detected in single environment.
It is interesting that qSCB09 and qSSCB09 were co-localized to a 4.02 cM interval (37.16-41.18cM, the corresponding physical interval is 154.20-155.76Mb) with flanking markers of C19P154679799-C19P155713851 (Table 4).To determine phenotypic contributions of this QTL region, flanking marker profile was used to selected and group RIL into two homozygous genotypes (Figure 4; Supplementary Table 6).Student t-test revealed significant differences (P < 0.01) between the two genotypic groups (Figure 4).The RILs with the A 1 A 1 genotype (representing the '06B16' allele in qSCB09 and qSSCB09) had a higher sucrose content and soluble sugar content than those with aa genotype (representing the 'Lu11' allele in qSCB09 and qSSCB09) (Figure 4).The elite allele of qSSC/SCA07 and qSSC/SCB06 from the elite '06B16' was defined as the 'B 1 B 1 ' and 'C 1 C 1 ' genotype, respectively, while another allele from 'Lu11' was defined as the 'B 2 B 2 ' and 'C 2 C 2 ' genotype, respectively.We noticed that some RILs exhibited higher sucrose and soluble sugar contents compared to both parents.Therefore, we further examined the distribution of allele from all three QTLs, qSSC/SCB09, qSSC/SCA07 and qSSC/ SCB06.The genotype of three QTLs was from the lines in the RIL population.Obviously, the lines with 'A 1 A 1 B 2 B 2 C 2 C 2 ' genotype by combining three elite alleles of qSSC/ SCB09, qSSC/SCA07 and qSSC/SCB06 showed an over-dominant phenotype with 6.19% of sucrose content, and 7.89% of soluble sugar content, which is significantly higher than that of lines with other genotypes (Figure 5; Supplementary Table 6).These results showed that the combination of three elite alleles from three QTLs, qSSC/SCB09, qSSC/SCA07 and qSSC/SCB06, produced an overdominant phenotype with significantly increased sucrose content and soluble sugar content.
3.5 Potential candidate genes within the major QTL qSCB09/qSSCB09 for sucrose and soluble sugar content The stable major QTL region qSCB09/qSSCB09 was mapped in the cultivated peanut genome between 154,057,090 and 155,797,391 bp on chromosome B09 with 118 annotated genes located in this interval.There were 84 functional genes related to maltose metabolism, signal transduction, response to stress, lipid metabolism, secondary metabolite biosynthesis, oxidationreduction, transcription, sucrose metabolism, and starch biosynthesis (Supplementary Table 7).
To investigate expression pattern of annotated genes in the stable major QTL region qSCB09/qSSCB09, we surveyed the transcript abundance of these genes in leaf, root, and seed at different developmental stages.Interestingly, most of the genes in the confidence interval were highly expressed in seeds (Figure 6).The expression pattern of Arahy.MS9EFZ, Arahy.11CPHY, and Arahy.AY8I6Y were all first increased and then decreased, which is in line with the trend of sucrose accumulation, indicating they may be candidate genes for sucrose content and soluble sugar content in peanut kernels.
Meanwhile, a total of 38 SNP variants were identified in the interval based on the whole genome resequencing data in the parental lines (Supplementary Table 8).Among these variants, seven SNP variants which affected six genes were detected (Supplementary Table 8), including two intron variants in Arahy.JI97RZ and Arahy.KA4332, a premature start codon gain variant in the 5'-UTR of Arahy.MS9EFZ, three downstream gene variants in Arahy.11CPHY,Arahy.LC7X94, and Arahy.AY8I6Y, and a missense variant in Arahy.AY8I6Y.This SNP (G/C) caused an amino acid substitution (Ser toThr) of Arahy.AY8I6Y and might result in secondary structure (strand, helix and coil) variations (Supplementary Figure 2).

Discussion
In this study, we successfully identified a novel major and stable QTL region, along with candidate genes associated with sugar High density genetic map of the RIL population using Bin markers, the markers were indicated by black bars.Wang et al. 10.3389/fpls.2024.1423586Frontiers in Plant Science frontiersin.orgcontent in peanut seeds.The QTL region qSCB09/qSSCB09 is the major and stable locus controlling sugar content, explaining phenotypic variance of 21.51% to 33.58%.The parental lines did not significantly differ for sucrose content and soluble sugar content in this study, but showed transgressive segregation, indicated that much of the broad-sense heritability was contributed by dominant and epistatic gene actions, which can be exploited via hybrid breeding (Ayalew et al., 2022).This phenomenon is often encountered in studies of complex traits in crops, where multiple genetic factors contribute to variations in quantitative traits (Ren et al., 2020;Mackay et al., 2021;Ayalew et al., 2022;Zhang et al., 2023c).It has implications for crop improvement, as it introduces genetic diversity that may lead to enhanced traits.Notably, the absence of extreme phenotypes in the parent lines underscores the dynamic nature of genetic interactions.In this study, the board-sense heritability (h 2 ) of sucrose and soluble sugars was 78% and 71%, respectively (Table 2), The higher broadsense heritability (92% and 86%) of sucrose content in peanut have been reported (Li et al., 2023;Wang et al., 2024), indicating a potential for sucrose and soluble sugars improvement.However, the concentration of sucrose and soluble sugars were also influenced by environmental factors, such as soil, water, climate, etc.The minimum content of sucrose and soluble sugars in the RIL population varied greatly across different environments, the content of SC and SSC were 2.35%-3.81%and 3.60%-6.08%,respectively, and the highest sucrose and soluble sugars concentrations were found in 22LX (Table 1).
Construction of a high density genetic map (HDGM) is a traditional way of mapping QTL for agronomic traits.The whole genome re-sequence (WGRS) strategy has advantages in HDGM and favorable QTL identification (Kumar et al., 2020;Zhao et al., 2020;An et al., 2021;Wu et al., 2023).Currently, several HDGMs based on WGRS have been published in cultivated peanut, and resulted in yield and oil quality related QTLs detection in cultivated peanut (Li et al., 2019;Liu et al., 2020;Wang et al., 2020;Sun et al., 2022;Li et al., 2023;Yang et al., 2023).In our present study, a highdensity genetic map was constructed with 3692 bin markers, the whole length was about 981.65 cM and the average marker distance was 0.27cM (Figure 2; Table 3), the density of which was higher than the previous studies (Li et al., 2023;Wang et al., 2024).Collinearity analysis validated the high quality of this map (Supplementary Table 5).In view of the HDGM and the phenotypic data of multiple environments, the QTLs identified in our study were reliable.
There were increasing numbers of quantitative trait loci (QTLs) associated with sucrose content in peanut have been reported.BSAseq technology was employed to map four QTLs were located on chromosomes A03 and A06.Furthermore, a stable QTL qSUCA06  (Guo et al., 2023b).Two other genomic regions associated with sucrose content, qSUCA08a and qSUCB06a on chromosomes A08 and B06 respectively, were identified by QTL-seq, a major QTL, qSUCA08.2,explained 5.43%-17.84% of phenotypic variation across five environments (Li et al., 2023).
Based on whole genome re-sequencing and construction of a high density genetic map, two major stable QTLs for sucrose content, qSCA06.2and qSCB06.2were mapped on chromosomes A06 (115.0-116.1 Mb) and B06 (147.9-148.6Mb) (Lei et al., 2022;Wang et al., 2024).Furthermore, Guo et al. (2023a) mapped 10 QTLs located on A01, A07, A10, B01, B03 and B10, accounting for 4.56%-12.25% of PVE, and qSUCA07 was detected in three LOD curves of SC and SSC on the whole genome across four environments.The chromosomes of QTLs identified were denoted in red, and the major QTL region in chromosome B09 was showed in details. environments (Guo et al., 2023a).A genome-wide association study (GWAS) with principal component analysis identified seven significant QTLs associated with total sugars and 22 significant QTLs associated with sucrose content (Zhang et al., 2023b).
In our study, a total of five sucrose-associated QTLs were detected in this study on four chromosomes (LGs A07, B06 and B09).Furthermore, a novel major stable QTL-qSCB09 was identified, which explained 21.51-33.58% of the phenotypic variation for sucrose across three environments.We identified a QTL on LG B06, the physical interval was 136.84-140.76Mb, which is different with qSUCB06.1  Mb), qSCB06.2(147.9-148.6Mb) (Li et al., 2023;Wang et al., 2024).qSUCA07 was found across three environments which explained 7.05%-12.25% of the phenotypic variation for sucrose (Guo et al., 2023a), which was different to qSCA07 in our study.Few studies have sought to identify soluble sugar-related QTL in peanut, except a QTL on B06 by BSA-seq which was only a 0.7 Mb (149.47-152.28Mb) physical interval (Zhang et al., 2020), and is close with qSSCB06 in our study (137.15-141.26Mb) (Table 4).The qSSCB06 in our study was stable detected in two environments, explaining 12.69%-15.32%PVE (Table 4).Interestingly, our results showed that qSCB09 and qSSCB09 were co-localized to a 4.02 cM interval (37.16-41.18cM), indicating a potential simultaneous impact on the contents of sucrose and soluble sugar.Sucrose is the main component of soluble sugar in peanut (Li et al., 2023), thus the main point in breeding high sweetness peanut have been focused on increasing sucrose content.The qSSCB06 identified in our study suggested that it is possible to increase other soluble sugars (such as The boxplot of sucrose content and soluble sugar content between two genotypic groups in the RIL population.A 1 A 1 and A 2 A 2 represented the elite allele in qSCB09 and qSSCB09 were from 06B16 and Lu11, respectively.***represent significant difference at P=0.001 level by t-test.fructose and glucose) content in the future.Moreover, this study reveals that the sucrose content and soluble sugar content are controlled by three QTLs or genes.The highest sucrose content and soluble sugar content phenotype can only be achieved when the elite allelic genotype of the three genes is combined.
Several candidate genes related to sucrose content in peanut have been reported.Arahy.I1MI1E, a DEG in qSUCA08.2,encoding receptor-like protein kinase, was more likely candidate gene responsible for sucrose content.It may control sucrose content via regulating these genes related to sucrose metabolism and transport (Li et al., 2023).Arahy.Y2LWD9, which encodes acyl-CoA-binding domain 3 (ACBD), is a candidate gene in qSUCA06 for sucrose content accumulation (Guo et al., 2023b).Arahy.42CAD1 was identified as the most likely candidate gene in qSucA06, being co-expressed with genes involved in vesicle transport and oil body assembly.This indicates that sucrose accumulation may be due to disruptions in TAG transport or storage mechanisms (Huai et al., 2024).
Protein kinases were proven to play key roles in sugar accumulation (Lu et al., 2007;Lecourieux et al., 2010;Huang et al., 2014;Li et al., 2023).The candidate gene Arahy.11CPHYencoded serine/threonine-protein kinase PRP4, the expression of which increased in S2 and S3 stages and decreased in S4 and S5 stages (Figure 6), in accordance with the sucrose accumulation pattern (Li et al., 2021).Notably, Ah.W977IP and Ah.NG0Q3I which have higher expression in early developmental stages of peanut seed (Figure 6), also matched with sucrose accumulation pattern (Li et al., 2021).Ah.W977IP and Ah.NG0Q3I both encoded alpha-xylosidase 1 which is a Glycosyl Hydrolase 31 (GH31) family member and related to starch and sucrose metabolism (Cuebas-Irizarry et al., 2017;Xu et al., 2023).Thus, they may also be the candidate genes for sucrose content and soluble content in peanut kernels.In our study, Arahy.AY8I6Y was most likely the candidate gene for qSC/SSCB09.Arahy.AY8I6Y encodes a Zinc finger protein CONSTANS-LIKE 16 (COL16), which belongs to zinc finger protein transcription factor family.COL genes have been reported to be involved in many molecular regulation processes and plant growth development, including regulation of flowering time, photoperiodic responses and photomorphogenesis, stigma color and key roles in banana (Musa nana) fruit ripening (Hassidim et al., 2009;Chen et al., 2012;Ordoñez-Herrera et al., 2018;Zhang et al., 2023a).COL genes also involved in abiotic stress tolerance response, anthocyanin accumulation and plants carotenoid The combinatorial effects of QTLs on B09, A07, and B06 with sucrose content (A) and soluble sugar content (B).The letters A, B, C represented QTL regions on B09, A07, and B06, respectively.A 1 A 1 and A 2 A 2 represented the homozygous alleles in qSCB09 and qSSCB09 were from 06B16 and Lu11, respectively.Different letters indicate significant (p < 0.05) differences between means, based on a one-way ANOVA test.Wang et al. 10.3389/fpls.2024.1423586Frontiers in Plant Science frontiersin.orgmetabolism regulation (Bai et al., 2019;Zhang et al., 2023a).Besides, a zinc finger protein AdDof3 has been reported to interacted physically with the AdBAM3L promoter, regulating starch degradation in Kiwiruit (Zhang et al., 2018), indicating a potential role in sugar metabolism.Arahy.AY8I6Y was identified as the most likely candidate gene due to a nonsynonymous SNP causing amino acid substitution (Ser to Thr).The predicted structural change associated with missense variation may affect its molecular and biological functions, however, further study is needed to verify the function of Arahy.AY8I6Y and its role of regulating sugar accumulation.
Besides, the correlation analysis of sugar content with protein content and oil content were investigated, the result showed that sucrose content and soluble sugar content were all significantly negatively correlated with oil content, but were no correlation with protein content (Supplementary Table 9), which is consistent with the reports in soybean (Jiang et al., 2018;Huai et al., 2024).So, we can hypothesis that some genes associated with transformation between sugar and oil may also affect sugar content in peanut.

Conclusion
In summary, we identified a new major stable QTL region qSCB09/qSSCB09 for both sucrose content and soluble sugar content of peanut kernels based on whole genome resequencing and QTL mapping.Within this confidence interval, six nonsynonymous mutation genes were identified as candidate genes, and Arahy.AY8I6Y was the most likely candidate gene for sucrose accumulation in peanut kernel.These findings will contribute to an enhanced understanding of sugar accumulation in peanut kernels, and the candidate genes will be useful for breeding of high sugar content peanut varieties.

FIGURE 1
FIGURE 1 Phenotypic distribution of SC and SSC in the RIL population across four environments.The y-axis represents the number of lines and the x-axis represents the values of sugar content (SC) and Soluble sugar content (SSC).2021LX, 2022LX, 2022GH, and 2023WH were four trials in Laixi (LX), Weihai (WH) and Greenhouse (GH) from 2021 to 2023.P1 and P2 denote the parents 'Luhua 11' and '06B16', respectively.

FIGURE 6
FIGURE 6Heatmap analysis of candidate genes of peanut SSC and SC in different developmental stages.S1 to S5 indicated the fruit Pattee 1 to 5 stages.The values of transcript abundances were taken fromClevenger et al. (2016).

TABLE 1
Phenotypic variation of Soluble sugar content and Sucrose content among the RIL population of four environments.Min b Max c Skew Kurt Sig. of K-S test d a SD, standard deviation.b Min, minimum value.c Max, maximum value.d Sig of K-S test, significance for normality test by Kolmogorov-smirnov.**means significant p<0.01.2021LX, 2022LX, 2022DP, and2023WH were four trials in Laixi (LX), Greenhouse (GH) and Weihai (WH) from 2021 to 2023.

TABLE 2
Analysis of the broad-sense of heritability of sucrose content and soluble sugar content.
a DF, degree of freedom.b SS, sum of square.c MS, mean of square.FIGURE 2

TABLE 3
Summary of the high-density genetic map.

TABLE 4
QTL analysis for sucrose content and soluble sugar content.

TABLE 4 Continued
a Env, environment.bLG,linkage group.c Cl, confidence interval.d LOD, logarithm of the odds.e ADD, additive effect.f PVE, phenotypic variation explained.