Genome-Wide Association Analysis for Resistance to Coniothyrium glycines Causing Red Leaf Blotch Disease in Soybean

Soybean is a legume rich in oil and protein, but its production faces several constraints. Globally, several fungi, viruses, nematodes, and bacteria cause significant yield losses in soybean. Coniothyrium glycines (CG), the causal pathogen of red leaf blotch disease, is the least researched of these pathogens and causes severe damage to soybean. The identification of resistant soybean genotypes and the mapping of genomic regions associated with resistance to CG are critical for developing improved cultivars for sustainable soybean production. This study used single nucleotide polymorphism (SNP) markers generated from a Diversity Arrays Technology (DArT) platform to conduct a genome-wide association study (GWAS) of resistance to CG using 279 soybean genotypes grown in three environments. A total of 6395 SNPs were used to perform the GWAS, applying the multilocus model Fixed and random model Circulating Probability Unification (FarmCPU) with correction for population structure and a statistical test p-value threshold of 5%. A total of 19 significant marker–trait associations for resistance to CG were identified on chromosomes 1, 5, 6, 9, 10, 12, 13, 15, 16, 17, 19, and 20. Approximately 113 putative genes associated with significant markers for resistance to red leaf blotch disease were identified across the soybean genome. Positional candidate genes associated with significant SNP loci encode proteins that are involved in plant defense responses and that could be associated with soybean defenses against CG infection. The results of this study provide valuable insight for further dissection of the genetic architecture of resistance to CG in soybean. They also highlight SNP variants and genes useful for genomics-informed selection decisions in the breeding process for improving resistance traits in soybean.


Introduction
Soybean is a cash crop used for animal feed, human consumption, soil fertility improvement, and industrial use for ethanol and biofuel production [1][2][3].

Table 1. Source, quantity, and summary description of the 279 soybean genotypes.

Origin      Quantity    Description
Uganda      192         Variable in height, number of days to maturity, and pubescence colors.
China       38          Variable in seed color, height, days to flowering, and seed size.
USA         40          Variable in leaf size and shape, plant color, and seeds.
Zimbabwe    9           Variable in lodging, height, seeds, and days to maturity.

Experimental Design
The 279 soybean genotypes were planted in a randomized complete block design (RCBD) with 2 replications at the Makerere University Agricultural Research Institute Kabanyoro (MUARIK) for two seasons (2021-2022) and at the Nakabango District farm in Jinja, Uganda, for the 2022 season. The experimental sites are known hotspots for red leaf blotch disease of soybean in Uganda. Twenty seeds were sown per plot in a single 1 m row, with 0.6 m between plots. Weeding was performed by hand twice, at 20 and 55 days after planting.
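The randomization step of an RCBD can be sketched as follows. This is a minimal illustration, not the authors' field plan: the function name `rcbd_layout`, the entry labels, and the seed are assumptions introduced here. Each block (replication) receives every genotype exactly once, in an independently shuffled order.

```python
import random


def rcbd_layout(entries, n_blocks, seed=None):
    """Return one independently shuffled copy of `entries` per block.

    Hypothetical helper: in an RCBD every entry appears once in every block,
    and plot order is randomized separately within each block.
    """
    rng = random.Random(seed)
    layout = []
    for _ in range(n_blocks):
        block = list(entries)   # every genotype appears once per block
        rng.shuffle(block)      # independent randomization within the block
        layout.append(block)
    return layout


# Illustrative labels only; the real genotype names are not used here.
genotypes = [f"G{i}" for i in range(1, 280)]   # 279 entries
layout = rcbd_layout(genotypes, n_blocks=2, seed=1)
```

Fixing the seed makes the plan reproducible, which is convenient when the same layout must be regenerated for field books.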

Scoring of Soybean Red Leaf Blotch
Red leaf blotch disease was recorded seven times in a panel of varying soybean genotypes from 30 days after planting up to reproductive stage 6 (R6). The stages R4 to R6 are recognized as the best fit for disease scoring in soybean production [24,25]. The severity of the red leaf blotch disease was evaluated using a 0-5 scale, which has previously been used to score red leaf blotch disease in soybean [19,26,27]. This scale is based on the observed damage, such as the percentage of the leaf area affected, fragmentation of the leaf, presence of pycnidia on blotches, and the color and size of the blotches (Table 2).

Table 2. Scale to assess the severity of red leaf blotch disease in soybean production.

Category    Description
0           No visible symptoms.
1           Few to many tiny purple-maroon spots (up to 2 mm in diameter), covering 1 to 10% of the leaf area.
2           Mainly large purple-maroon spots (up to 5 mm in diameter), covering 11 to 35% of the leaf area.
3           Purple-maroon or brown blotches (up to 10 mm in diameter) without pycnidia, covering 36 to 65% of the leaf area.
4           Dark brown blotches with pale, bleached centers and pycnidia present, covering 66 to 90% of the leaf, with fragmentation of the leaf starting.
5           91 to 100% of the leaf area affected, extensive blotching and fragmentation of the leaf.
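The leaf-area component of the scale in Table 2 can be expressed as a simple lookup. The sketch below is an illustration only: the function name `severity_from_leaf_area` is hypothetical, and it uses the percentage of leaf area affected alone, ignoring the other criteria in the scale (spot size, pycnidia, fragmentation). Values falling between the stated ranges (for example 10.5%) are assigned to the next category up, which is an assumption.

```python
def severity_from_leaf_area(percent_affected):
    """Map % leaf area affected to the 0-5 category of Table 2 (area criterion only)."""
    if not 0 <= percent_affected <= 100:
        raise ValueError("percent_affected must be between 0 and 100")
    if percent_affected == 0:
        return 0  # no visible symptoms
    # (inclusive upper bound of the leaf-area range, category)
    upper_bounds = [(10, 1), (35, 2), (65, 3), (90, 4), (100, 5)]
    for upper, category in upper_bounds:
        if percent_affected <= upper:
            return category


scores = [severity_from_leaf_area(p) for p in (0, 5, 35, 36, 90, 95)]
```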

Genotyping and Quality Control
Fresh leaves were collected at 15 days after germination and placed in three 96-well plates. The three plates were shipped to the Integrated Genotyping Service and Support (IGSS) of the Biosciences in Eastern and Central Africa-ILRI Hub, Kenya, for genotyping. The DNA was extracted from the leaf tissues using the Nucleomag Plant Genomic DNA extraction kit [28], and DNA quality was checked on a 0.8% agarose gel. Genotyping was performed using Diversity Arrays Technology sequencing (DArTseq). A genomic DNA library was constructed using genomic complexity reduction technology [29]. The library was purified and quantified for cluster generation in an automated clonal amplification system (cBOT Illumina). Thereafter, next-generation sequencing was performed on a HiSeq 2500 sequencer (Illumina, San Diego, CA, USA).

Phenotyping Analysis
Phenotypic data obtained from the three environments were pooled and subjected to a linear mixed model analysis using the lme4 package implemented in R. The best linear unbiased estimates (BLUEs) across the three environments were obtained by considering the genotype effects as fixed and the environment and replication effects as random in the mixed model as follows:

$$Y_{ijk} = \mu + E_i + B_j + G_k + GE_{ik} + \varepsilon_{ijk}$$

where $Y_{ijk}$ = phenotypic observation for a trait, $\mu$ = grand mean, $E$ = environment effect, $B$ = replication effect, $G$ = genotype effect, $GE$ = genotype by environment interaction, and $\varepsilon$ = random residual error.
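As a quick sanity check on mixed-model output, note that for a fully balanced trial (every genotype observed once in every environment and replication, with no missing plots) the BLUE of a genotype mean reduces to the plain arithmetic mean of its observations. The sketch below computes those means. It is not the lme4 analysis described above, the record layout and the name `genotype_means` are assumptions, and the shortcut does not hold for unbalanced data.

```python
from collections import defaultdict
from statistics import mean


def genotype_means(records):
    """Average score per genotype.

    `records` is an iterable of (genotype, environment, replication, score)
    tuples; this layout is hypothetical.
    """
    by_genotype = defaultdict(list)
    for genotype, _env, _rep, score in records:
        by_genotype[genotype].append(score)
    return {g: mean(values) for g, values in by_genotype.items()}


# Made-up scores for two genotypes in two environments x two replications.
records = [
    ("G1", "E1", 1, 1.0), ("G1", "E1", 2, 2.0),
    ("G1", "E2", 1, 0.0), ("G1", "E2", 2, 1.0),
    ("G2", "E1", 1, 3.0), ("G2", "E1", 2, 4.0),
    ("G2", "E2", 1, 3.0), ("G2", "E2", 2, 2.0),
]
means = genotype_means(records)
```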

GWAS Analysis, Genes Annotation, and Linkage Disequilibrium
To perform the GWAS, a multilocus model Fixed and random model Circulating Probability Unification (FarmCPU) with correction of the population structure and a statistical test p-value threshold of 5% was used [30]. The Manhattan and quantile-quantile (QQ) plots were plotted using the R package "rMVP" (a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study) [31].
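The text gives a 5% p-value threshold without stating how it was adjusted for the number of markers tested. One common reading, assumed here and not confirmed by the source, is a Bonferroni-style cutoff that divides the nominal level by the number of SNPs. The function names below are hypothetical.

```python
import math

N_SNPS = 6395   # markers retained after filtering (from the text)
ALPHA = 0.05    # nominal 5% level (from the text)


def bonferroni_threshold(alpha, n_tests):
    """Per-marker p-value cutoff under a Bonferroni adjustment (assumed here)."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


threshold = bonferroni_threshold(ALPHA, N_SNPS)
# Height of the corresponding line on a Manhattan plot, roughly 5.1
neg_log10_threshold = -math.log10(threshold)


def is_significant(p_value, cutoff=threshold):
    """True if a marker's p-value falls below the adjusted cutoff."""
    return p_value < cutoff
```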
The SNP markers significantly associated with resistance to the red leaf blotch disease identified through GWAS were annotated using the Phytozome 13.0 database (https://phytozome-next.jgi.doe.gov/info/Gmax_Wm82_a4_v1, accessed on 10 December 2022), which was used as the source for the candidate gene search. The linkage disequilibrium (LD) was estimated among the significant SNPs using the "LDheatmap" library [32]. An LD decay rate of 90 to 574 kb has been reported in soybean [33], and a ±574 kb region was used to identify positional candidate genes [34].
In selecting candidate genes, the following criteria [34,35] were used: (i) genes of known function in soybean related to the trait under study, (ii) genes with function-known orthologs in Arabidopsis related to the trait under study, and (iii) genes pinpointed by the peak SNPs. The public database InterPro, European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), was used to determine the functions of the genes associated with the different SNPs identified [36].
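The ±574 kb positional search can be sketched as an interval-overlap filter. The gene identifiers and coordinates below are invented for illustration and do not come from Phytozome; only the SNP name Gm06_20112134 and the window size are taken from the text. The function name `genes_in_window` is hypothetical.

```python
WINDOW_BP = 574_000  # +/- 574 kb, as used in the text


def genes_in_window(snp_chrom, snp_pos, genes, window_bp=WINDOW_BP):
    """Return IDs of genes overlapping [snp_pos - window_bp, snp_pos + window_bp]."""
    start = max(0, snp_pos - window_bp)
    end = snp_pos + window_bp
    return [
        g["id"]
        for g in genes
        if g["chrom"] == snp_chrom and g["start"] <= end and g["end"] >= start
    ]


# Hypothetical annotation records (not real soybean genes).
genes = [
    {"id": "geneA", "chrom": "Gm06", "start": 19_600_000, "end": 19_605_000},
    {"id": "geneB", "chrom": "Gm06", "start": 20_700_000, "end": 20_705_000},
    {"id": "geneC", "chrom": "Gm16", "start": 20_100_000, "end": 20_110_000},
    {"id": "geneD", "chrom": "Gm06", "start": 20_680_000, "end": 20_690_000},
]
hits = genes_in_window("Gm06", 20_112_134, genes)
```

Here a gene counts as a positional candidate if any part of it overlaps the window; requiring the whole gene to lie inside the window would be a stricter alternative.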
The SNPs' contributions to the resistance to red leaf blotch disease based on the observed alleles were plotted using the "ggplots", and their confidence statistics were calculated with "rstatix" in R.

Phenotypic Variation
In this study, significant (p < 0.001) differences were observed among the soybean genotypes for their response to the red leaf blotch disease under natural infection. The analysis of variance (ANOVA) revealed that genotypes, environments, and the genotype-by-environment interaction were the main sources of variation (Table 3). Out of the 279 soybean genotypes, approximately 10.75% were evaluated as resistant to red leaf blotch disease (Figure 1). Figure 1 shows the resistant genotypes, represented by the dots at the bottom. The heritability values observed for the studied traits ranged from 0.23 to 0.70, with the lowest heritability identified for R1 (Table 3).

Marker Coverage and Distribution
The number of Diversity Arrays Technology sequencing (DArTseq)-generated SNP markers was 14,082. A large number (7687) were discarded after the filtering and imputation of the raw data, leaving 6395 SNPs, approximately 45.41% of the DArTseq-generated SNP markers. These 6395 SNPs met the data criteria for use in the GWAS and were distributed across the 20 Glycine max chromosomes. Chromosome 12 had the fewest SNPs (201) and chromosome 18 the most (476) (Figure 2, Table 4). The MAF (minor allele frequency) of these SNPs ranged from 0.043 to 0.5, with an average of 0.22, and the PIC (polymorphism information content) ranged from 0.08 to 0.74, with an average of 0.29.
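The marker counts above are internally consistent, and the MAF of a single marker is straightforward to compute from genotype calls. The sketch below assumes biallelic SNPs coded as two-letter calls such as "AG" with `None` for missing data. This coding and the function name `minor_allele_frequency` are assumptions, not the DArTseq output format.

```python
from collections import Counter


def minor_allele_frequency(calls):
    """MAF of one biallelic SNP from two-letter genotype calls (None = missing)."""
    counts = Counter()
    for call in calls:
        if call is None:
            continue
        counts.update(call)          # "AG" adds one A and one G
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no non-missing calls")
    if len(counts) == 1:
        return 0.0                   # monomorphic marker
    return min(counts.values()) / total


maf = minor_allele_frequency(["AA", "AG", "GG", "GG", None])  # 3 A vs 5 G alleles

# Share of raw markers retained after filtering (figures from the text).
raw_snps, kept_snps = 14_082, 6_395
discarded = raw_snps - kept_snps
retained_pct = round(100 * kept_snps / raw_snps, 2)
```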

Association Results
The FarmCPU model was used in this study to reveal the loci associated with the resistance to CG causing the red leaf blotch disease of soybean. In total, 19 significant association signals were found at various reproductive stages of crop growth (Table 5). At R1, four signals were identified on chromosomes 1, 13, 15, and 16 (Table 5, Figure 3A). At R2, four signals were identified on chromosomes 12, 16, 17, and 19 (Table 5, Figure 3B). At R3, two signals were identified on chromosome 6 (Table 5, Figure 3C). At R4, five signals were identified on chromosomes 1, 6, 16, and 20 (Table 5, Figure 3D). At R5, five signals were identified on chromosomes 5, 6, 16, 17, and 19 (Table 5, Figure 3E). At R6, five signals were identified on chromosomes 1, 6, 9, 10, and 19 (Table 5, Figure 3F). The quantile-quantile (QQ) plots, produced by plotting the observed negative logarithms (−log10) of the p-values against their expected values, demonstrated that the GWAS model was reasonable in this research. Differences between the observed and expected values were identified for the traits studied, and they indicate a link between the phenotypes and SNPs, as shown by the QQ plots (Figure 3). Some SNPs were significant at more than one scoring time. This was the case for Gm01_17813710 at R4 and R6; Gm06_20112134 at R3, R4, and R5; Gm16_3302971 at R1 and R4; Gm16_34649045 at R2 and R4; and Gm19_44916522 at R5 and R6 (Table 5). The Manhattan plot shows the GWAS significance level (−log10 of the p-value of each SNP) by chromosome position, with each chromosome in a different color. Significant SNPs in the Manhattan plot are strongly associated with resistance to red leaf blotch disease in soybean (Figure 3).
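The coordinates of a QQ plot can be derived from a list of p-values alone. In the sketch below, the expected quantile for the i-th smallest p-value is taken as (i − 0.5)/n, which is one common plotting convention and not necessarily the one used by rMVP. The function name `qq_points` and the example p-values are illustrative.

```python
import math


def qq_points(p_values):
    """Return (expected, observed) -log10(p) pairs, most significant first.

    Assumes all p-values are strictly positive.
    """
    n = len(p_values)
    observed = sorted((-math.log10(p) for p in p_values), reverse=True)
    # Expected uniform quantile for rank i (1-based) is (i - 0.5) / n.
    expected = [-math.log10((i + 0.5) / n) for i in range(n)]
    return list(zip(expected, observed))


points = qq_points([0.5, 0.01, 0.2, 0.9])
```

Under the null hypothesis the points lie near the diagonal; markers rising above it at the upper end are the candidates for true associations.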

SNPs' Contribution to the Resistance to Red Leaf Blotch Disease in Soybean
Five common significant SNPs were identified for at least two scoring times: Gm01_17813710 at R4 and R6; Gm06_20112134 at R3, R4, and R5; Gm16_3302971 at R1 and R4; Gm16_34649045 at R2 and R4; and Gm19_44916522 at R5 and R6. Further dissection of the five significant SNP loci associated with resistance to red leaf blotch showed that accessions with the homozygous genotype AA and/or the heterozygous genotype AG on chromosome 1 possessed higher resistance than those with the homozygous genotype GG (Figure 4A,B). On chromosome 6, resistance is associated with the homozygous genotype GG (Figure 4C-E). The marker effects on chromosomes 16 and 19 revealed that, respectively, the homozygous genotypes GG and TT (Figure 4F-I) and CC and TT (Figure 4J,K) are linked with resistance in the studied population, while the heterozygous genotype accounted for low resistance. The SNP confidence statistics revealed a significant allelic difference (p < 0.001) at SNPs Gm01_17813710 and Gm06_20112134 (Figure 4A-E). At p < 0.001, the other SNPs did not show a significant allelic difference (Figure 4).
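The comparison behind such allele-effect plots amounts to grouping disease scores by genotype class at a SNP and comparing the group summaries. The sketch below uses invented calls and scores, and the function name `mean_score_by_genotype` is hypothetical. The significance tests reported in the text (computed with "rstatix") are not reproduced here.

```python
from collections import defaultdict
from statistics import mean


def mean_score_by_genotype(calls, scores):
    """Mean disease score for each genotype class at one SNP."""
    groups = defaultdict(list)
    for call, score in zip(calls, scores):
        groups[call].append(score)
    return {call: mean(values) for call, values in groups.items()}


# Made-up data: lower scores mean less disease.
calls = ["AA", "AG", "GG", "AA", "GG"]
scores = [0.5, 1.0, 3.0, 1.5, 2.0]
effects = mean_score_by_genotype(calls, scores)
```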
In general, resistance to red leaf blotch disease in soybean is characterized by the homozygous genotype GG at positions Gm06_20112134 and Gm16_34649045. The position Gm16_3302971 is dominated by the homozygous genotype TT, except for two soybean genotypes (UGSOY143 and UGSOY236) that carry the heterozygous genotype AT. The position Gm19_44916522 is dominated by the homozygous genotypes CC or TT, but one genotype (UGSOY195) carries the heterozygous genotype TC (Table 7, Figure 4).

Table 7. Profile of the alleles at the significant SNPs for the highly resistant (mean score less than 0.5) soybean genotypes to red leaf blotch recorded out of the 279 genotypes.

Genotypes **    Gm01_17813710 *    Gm06_20112134 *    Gm16_3302971 *    Gm16_34649045 *    Gm19_44916522 *
UGSOY143        GG                 GG                 AT                GG                 CC
UGSOY158        AA                 GG                 TT                GG                 TT
UGSOY165        AA                 GG                 TT                GG                 TT
UGSOY170        AG                 GG                 TT                GG                 CC
UGSOY174        GG                 GG                 TT                GG                 CC
UGSOY183        AG                 GG                 TT                GG                 CC
UGSOY186        AA                 GG                 TT                GG                 TT
UGSOY189        AG                 GG                 TT                GG                 TT
UGSOY190        AA                 GG                 TT                GG                 TT
UGSOY194        AA                 GG                 TT                GG                 TT
UGSOY195        AG                 GG                 TT                GG                 TC
UGSOY197        AG                 GG                 TT                GG                 TT
UGSOY203        AG                 GG                 TT                GG                 TT
UGSOY212        AA                 GG                 TT                GG                 TT
UGSOY214        AA                 GG                 TT                GG                 TT
UGSOY217        AA                 GG                 TT                GG                 TT
UGSOY218        AG                 GG                 TT                GG                 TT
UGSOY228        AG                 GG                 TT                GG                 TT
UGSOY229        AG                 GG                 TT                GG                 TT
UGSOY231        AA                 GG                 TT                GG                 CC
UGSOY232        AA                 GG                 TT                GG                 TT
UGSOY233        AA                 GG                 TT                GG                 TT
UGSOY234        AG                 GG                 TT                GG                 TT
UGSOY235        AA                 GG                 TT                GG                 TT
UGSOY236        AG                 GG                 AT                GG                 TT
UGSOY238        AG                 GG                 TT                GG                 TT
UGSOY240        AA                 GG                 TT                GG                 TT
UGSOY241        AA                 GG                 TT                GG                 CC

Haplotype Analysis
Linkage disequilibrium (LD) block heatmaps based on the LD of each identified common SNP locus are shown in Figure 5. The LD analysis of the common loci (two on chromosomes 1 and 19 and three on chromosomes 6 and 16) showed that these markers had moderate to high LD (R² > 0.8), indicating a relatively high correlation.
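A simple way to approximate pairwise LD from unphased genotypes is the squared Pearson correlation between allele dosages (0, 1, or 2 copies of a chosen allele) at two SNPs. This is a stand-in technique for illustration; it is not necessarily the estimator used by "LDheatmap", and the function names and dosage vectors below are made up.

```python
def dosage(call, allele):
    """Number of copies (0, 1, or 2) of `allele` in a two-letter genotype call."""
    return call.count(allele)


def ld_r2(x, y):
    """Squared Pearson correlation between two equal-length dosage vectors."""
    if len(x) != len(y) or not x:
        raise ValueError("dosage vectors must be non-empty and of equal length")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("r2 is undefined for a monomorphic marker")
    return sxy ** 2 / (sxx * syy)


snp1 = [dosage(c, "A") for c in ["AA", "AG", "GG", "AA"]]   # [2, 1, 0, 2]
r2_same = ld_r2(snp1, snp1)                                 # identical markers
r2_indep = ld_r2([0, 0, 2, 2], [0, 2, 0, 2])                # uncorrelated markers
```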

Discussion
Identifying novel sources of resistance in soybean germplasm to key biotic and abiotic stresses is an essential determinant for enhancing productivity [91,92]. Extensive work on the genetic improvement of soybean for resistance to selected diseases and pests [92] and for nutritional value [93] has been conducted across the world, but limited efforts have been made on red leaf blotch disease in soybean [26]. In this study, a GWAS was conducted to map genomic regions associated with CG resistance. A total of 19 significant GWAS signals were reported for resistance to red leaf blotch disease in soybean. In addition, putative candidate genes associated with resistance were identified. The markers identified in this study provide a means to accelerate the development of soybean cultivars with resistance and with other acceptable end-user attributes.
The ability of a GWAS to dissect complex traits has been demonstrated in soybean quality improvement and breeding for seed composition [37] and for resistance to soybean rust [38]. The consumption of soybeans is increasing and, consequently, breeding programs need to be encouraged and optimized with new knowledge. The current GWAS was intended for the discovery of QTLs and potential candidate genes linked to genetic diversity for resistance to red leaf blotch disease in soybean. The knowledge of the population structure and familial relationships (i.e., kinship) in an association panel is important to prevent false associations in a GWAS [39]. Population structure and admixture for this population were reported in a previous study [40]. From our study, 113 putative candidate genes encoding for several proteins (Table 6) were discovered as major contributors to the resistance to Coniothyrium glycines, the causal agent of red leaf blotch disease in soybean production.
Squamosa promoter-binding protein-like (SPL) genes play vital regulatory roles in plant growth, development, and stress responses [41,42] and show potential application in crop improvement by genetic modification for abiotic stress in alfalfa (Medicago sativa L.) [43]. SPL genes are reported to play a role in toxin resistance in plants [44]. Plastid lipid-associated proteins, also termed fibrillins, are known for their role in the response to biotic stress in Solanaceae plants, especially to bacterial infections [45]. Isochorismate synthase 2 contributes to the biosynthesis of salicylic acid [46], which is involved in plant defenses [47]. In soybean, the defense responses to the pathogens Pseudomonas syringae and Phytophthora sojae are conferred by the accumulation of salicylic acid [48]. The proteasome is reported to contribute to the tolerance of heat or oxidative stresses in plants [49,50]. The RING/U-box superfamily protein promotes resistance to biotic stress through ubiquitination and leaf senescence [51]. The U-box protein is known to play a major role in responses to abiotic and biotic stresses in rice [52] and regulates drought tolerance in Arabidopsis thaliana [53]. The action of the RHO guanyl-nucleotide exchange factor 7 was demonstrated in the development, pathogenesis, and stress responses of Colletotrichum higginsianum, which causes anthracnose disease of crucifers [54]. The chaperone DnaJ-domain superfamily protein has been reported in pepper (Capsicum annuum L.) to play a role in plant growth and development and in heat stress [55]. It is also called heat shock protein 40, based on its molecular weight [55]. In wheat, the chaperone DnaJ-domain is reported to regulate resistance to yellow mosaic virus infection [56].
Earlier, in 2013, a critical role of the nuclear-localized DnaJ domain-containing GmHSP40.1 in cell death and disease resistance in soybean was demonstrated through the screening for candidate genes stimulating cell death in soybean, and silencing GmHSP40.1 enhanced the susceptibility of soybean plants to soybean mosaic virus, confirming its positive role in pathogen defense [57]. Another positive role of the DnaJ-domain is that it is involved in the alkaline-salt, salt, and drought tolerance in Arabidopsis and soybean [58]. The chaperone DnaJ-domain plays a critical function in protein folding and regulation of several physiological processes, and it participates in numerous pathological processes [59]. DnaJ-domain superfamily proteins have been recognized for their diverse functions within cells and extensively studied in many species, including humans, drosophila, Arabidopsis, mushrooms, and tomatoes [60]. Furthermore, this information on the role played by the chaperone DnaJ-domain may guide practical actions in soybean breeding for resistance to red leaf blotch disease.
Enolase is involved in the growth and development of various species [61]. In soybean, the response to flooding stress is controlled by enolase together with other proteins [62]. UDP-glucosyl transferase 89B1 plays a vital role in diverse plant functions, and its response to drought, salt, and heat stress in Populus trichocarpa (black cottonwood) has been revealed [63]. The potential role of the EF-hand calcium-binding protein family in resistance to environmental and nutritional stress in soybean has been described [64]. This calcium-dependent protein has a function in the soybean-herbivore insect interaction and in drought adaptation [65]. Glycine-rich protein-like proteins confer tolerance to stress (e.g., some are involved in cold acclimation and may improve growth at low temperatures), and these proteins could play a promising role in agriculture [66].
AP2/B3-like transcriptional factor family proteins were analyzed for their role in stress tolerance in soybean [67], tree plants [68][69][70], and A. thaliana [71]. A member of the AP2/ERF transcription factor family, GmERF3, was isolated from soybean. There was an enhanced resistance against infection by Ralstonia solanacearum, Alternaria alternata, and tobacco mosaic virus, as well as tolerance to high salinity and dehydration stresses in transgenic tobacco plants [72]. The tetratricopeptide repeat (TPR)-containing protein has functions in plant hormone signaling, and the protein TTL1, containing TPR motifs, is required for abscisic acid responses and osmotic stress tolerance in plants [73,74].
The Transducin/WD40 repeat-like superfamily protein is a functional group that has been reported in plant cell wall formation [75]. In soybean, Transducin/WD40 repeat-like proteins were reported to putatively control the total number of flowers and pods [76], while in Arabidopsis, they control seed germination, growth, and biomass accumulation [89]. A gene that encodes Transducin/WD40 repeat-like proteins was identified in wheat and associated with plant tolerance to abiotic stresses [77]. The genes that encode Transducin/WD40 repeat-like superfamily proteins in soybean may play a role in controlling red leaf blotch by rehabilitating the cells damaged by the pathogen as the disease develops. Therefore, the soybean genotypes that carry the QTLs involving Transducin/WD40 repeat-like superfamily protein production could be used as future resources for breeding efforts aimed at improving resistance to red leaf blotch disease.
The leucine-rich repeat (LRR) family protein is well-known for controlling disease resistance in crops, including soybean [78]. In soybean, LRR has been reported to regulate the immune response to Phytophthora root rot [79], coordinate the responses against root-lesion nematode [80], and mediate the response to soybean mosaic virus [81]. This study also revealed the contribution of the LRR domain to the resistance to red leaf blotch disease in soybean production. Phytochrome-associated protein 2 is crucial in photoperiod adaptability and, therefore, influences the flowering time. Liu et al. (2008) showed that phytochromes contribute to the establishment of an adaptive response of soybean to environments, and this study points to their contribution to the resistance to red leaf blotch disease in soybean. The role of malic enzymes in plant growth and response to stress is documented, and these enzymes are mainly found in the cytoplasmic stroma, mitochondria, and chloroplasts. Previous studies have shown that malic enzymes participate in coping with drought, high salt, and high temperature by increasing water use efficiency and improving photosynthesis in plants [83]. Through the improvement of photosynthesis, malic enzymes may contribute to the resistance to red leaf blotch disease.
Various studies have shown that ubiquitination plays a key role in stress response and yield constitution [87]. In the UniProtKB database, 2429 ubiquitin-related proteins are predicted in soybean [85]. Ubiquitin plays a key role in regulating the resistance of soybean to Heterodera glycines, the soybean cyst nematode [84]; to P. sojae, a pathogen that causes stem and root rot [86]; and to heat shock [87]. In soybean and other legumes, the C2H2-type zinc finger protein is reported to enhance legume-rhizobia symbiosis [88], which is a key physiological process that can limit nitrogen in plants, affecting their growth and development [90]. The accumulation of isoflavone in soybean is governed by the C2H2-type zinc finger protein [94,95], and isoflavone contributes to human health and plant stress tolerance [96]. The soybean C2H2-type zinc finger protein with a conserved QALGGH motif negatively regulates drought responses [97], but it was reported to enhance tolerance to cold [98] in transgenic Arabidopsis. Although several genes that encode the C2H2-type zinc finger protein have been reported to play various roles in the life mechanism of soybean [95,97,98], the accumulation of unfolded proteins such as the homolog of mammalian P58IPK in the endoplasmic reticulum has built up a conserved mechanism that regulates the stress response in this cell compartment [99]. The endoplasmic reticulum stress response plays an important role that allows plants to sense and respond to adverse environmental conditions, such as heat stress, salt stress, and pathogen infection [99,100]. These unfolded proteins were revealed, for the first time, as factors that contribute to resistance to red leaf blotch disease in soybean. The role of FASCICLIN-like arabinogalactan-protein in the response to plant pathogens was elucidated by Wu et al. (2020). FASCICLIN-like arabinogalactan-protein controls infection with the turnip mosaic virus and P. syringae pv. tomato strain DC3000 (Pst DC3000) in Nicotiana benthamiana, which is a model plant for studying plant-pathogen interactions [101]. Recently, their contribution to resistance to clubroot disease stress in Brassica napus was characterized [102]. FASCICLIN-like arabinogalactan-protein was reported to be associated with soluble sugar content in vegetable soybean [103]. The thioredoxin superfamily protein with glutaredoxin regulates the response of nodulated soybean plants to water-deficit stress [104], and it is involved in several plant life mechanisms, including adaptation to environmental stresses [105] and signaling in plant immunity [106].
The NRAMP metal ion transporter family protein plays a key role in nodule iron homeostasis to support bacterial nitrogen fixation in soybean production [107] and contributes to avoiding cadmium toxicity [108]. The PLATZ transcription factor family protein increases drought tolerance in soybean hairy roots [109]. The CLAVATA3/ESR-RELATED 9 is a key component that modulates the effect of infection with plant-parasitic nematodes [110]; therefore, CLAVATA3/ESR-RELATED 9 interacts with the nematodes in the process of the establishment of feeding sites on the plant roots. WRKY DNA-binding protein 3 is well known in soybean to promote resistance to cyst nematodes [111]. Soybean WRKY-type transcription factor genes GmWRKY13, GmWRKY21, and GmWRKY54 confer differential tolerance to abiotic stresses in transgenic Arabidopsis plants [82]. Therefore, the potential role of WRKY DNA-binding protein in resistance to red leaf blotch is irrefutable. The cell wall/vacuolar inhibitor of fructosidase 1 was reported to regulate the abscisic acid response and salt tolerance in Arabidopsis [111]. The salinity symptoms in plants are characterized by leaf discoloration and damage. In fact, red leaf blotch disease affects the leaf; therefore, the cell wall/vacuolar inhibitor of fructosidase 1 may play a key role in reducing the expansion of leaves' damage.
Based on the functions in plants of the proteins encoded by the 113 soybean genes revealed in this study, these genes are directly or indirectly responsible for resistance to C. glycines causing red leaf blotch disease in soybean production. These genes jointly govern the physiological activities of the soybean plant to enhance resistance to C. glycines infection.
Information on the markers' effects through segregation patterns is fundamental for conversion of the markers to Kompetitive Allele-Specific PCR (KASP) for genotyping of the polymorphisms at different loci and deployment in breeding programs [112,113] for resistance to red leaf blotch disease in soybean. Some of the markers, especially marker Gm01_17813710 at R4 and R6, have the GG and GA alleles, which significantly reduce disease symptoms compared to the second homozygous state. The same pattern was observed for Gm16_3302971 at R1 and R4, with the homozygous state TT and heterozygous state AT being superior to the homozygous state AA in minimizing the disease severity. These SNPs demonstrate a dominance effect, and both the favorable homozygous and heterozygous alleles can be exploited for KASP markers in breeding for resistance. Marker Gm06_20112134, on the other hand, shows no significant difference in the allelic effect between AA and GG, while both homozygous states are superior to the heterozygous alternative. This SNP allelic effect pattern may encourage use of both homozygous alleles for KASP marker development. However, its effect on disease needs to be further investigated, for example, through direct molecular analyses and intervention, to ascertain its influence on disease severity. Possible reasons for such allelic behavior could be genetic, such as genetic heterogeneity, or statistical, for instance, a low sample size and low statistical power in the detection of the SNP effect. SNP Gm19_44916522 shows no differences in the allelic effect among its three allelic states, which suggests that it could be a false positive or affected by allelic heterogeneity, and this prompts the need to further investigate and validate these alleles.

Conclusions
This study identified 19 significant markers and genomic regions associated with red leaf blotch disease resistance in soybean. These markers, which tag key genomic regions, can be validated and tested in soybean germplasm. This might be done by converting these significant markers to low-cost Kompetitive Allele-Specific PCR (KASP) markers that can be used effectively to transfer alleles into elite soybean genotypes in future marker-based breeding strategies. The findings of this study will contribute to the implementation of a new approach to soybean breeding for maintaining red leaf blotch disease resistance. The genes discovered in this study provide new insight into the genetic foundation of resistance to red leaf blotch disease in the soybean gene pool.
Supplementary Materials: The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14061271/s1, Table S1: Origin and characteristics of soybean genotypes.

Funding: This project was supported by the Makerere Regional Centre for Crops Improvement (MaR-CCI) under the World Bank grant no. 126974 (Africa Higher Education Centres of Excellence Project).

Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.

Data Availability Statement: Data are unavailable because of privacy or ethical restrictions; however, they may be available upon request.