Validation and analysis of the geographical origin of Angelica sinensis (Oliv.) Diels using multi-element and stable isotopes

Background Place of origin is an important factor when determining the quality and authenticity of Angelica sinensis for medicinal use. It is important to trace the origin and confirm the regional characteristics of medicinal products for sustainable industrial development. Effectively tracing and confirming the material’s origin may be accomplished by detecting stable isotopes and mineral elements. Methods We studied 25 A. sinensis samples collected from three main producing areas (Linxia, Gannan, and Dingxi) in southeastern Gansu Province, China, to better identify its origin. We used inductively coupled plasma mass spectrometry (ICP-MS) and stable isotope ratio mass spectrometry (IRMS) to determine eight mineral elements (K, Mg, Ca, Zn, Cu, Mn, Cr, Al) and three stable isotopes (δ13C, δ15N, δ18O). Principal component analysis (PCA), partial least square discriminant analysis (PLS-DA) and linear discriminant analysis (LDA) were used to verify the validity of its geographical origin. Results K, Ca/Al, δ13C, δ15N and δ18O are important elements to distinguish A. sinensis sampled from Linxia, Gannan and Dingxi. We used an unsupervised PCA model to determine the dimensionality reduction of mineral elements and stable isotopes, which could distinguish the A. sinensis from Linxia. However, it could not easily distinguish A. sinensis sampled from Gannan and Dingxi. The supervised PLS-DA and LDA models could effectively distinguish samples taken from all three regions and perform cross-validation. The cross-validation accuracy of PLS-DA using mineral elements and stable isotopes was 84%, which was higher than LDA using mineral elements and stable isotopes. Conclusions The PLS-DA and LDA models provide a theoretical basis for tracing the origin of A. sinensis in three regions (Linxia, Gannan and Dingxi). This is significant for protecting consumers’ health, rights and interests.


INTRODUCTION
. The mineral elements in A. sinensis reflect its contact with soil geochemistry. The regional geological heterogeneity could lead to variations in the mineral-element contents of A. sinensis. Mineral elements are mostly stable in the raw materials of agricultural products compared with other components, which makes this a promising identifier for determining food provenance (Zhao, Zhang & Zhang, 2017;Catarino et al., 2018). Stable isotopes of plants are affected by environmental factors and its temporal and spatial specificity. It has been reported that δ 13 C and δ 18 O are closely related to factors such as precipitation, temperature, altitude, and slope (Hultine & Marshall, 2000;Granath et al., 2018). Among them, δ 13 C values depend on the relative CO 2 value in the external environment and the CO 2 concentration of the intercellular environment (mainly stomatal density and stomatal conductance) (Ferrio, Voltas & Araus, 2003). δ 18 O values in plants mainly depend on atmospheric precipitation and transpiration (Ripullone et al., 2008). Precipitation and radiation show temporal and spatial specificity, which leads to differences in the δ 18 O enrichment in plants. δ 15 N values in plants are mainly affected by nitrogen pools in soil. δ 15 N typically exists in the form of nitrogen compounds. The nitrogen compounds from different soil types lead to variations of δ 15 N across different geographical regions through nitrification, denitrification and ammonia volatilization (Handley & Raven, 2010). Therefore, the spatial distribution of stable isotope ratios in the environment is necessary to determine the A. sinensis' isoscapes. This study provides a theoretical basis for evaluating the geographical origin of A. sinensis based on the calibrated characteristic values of a stable isotope.
In this study, 25 samples of A. sinensis were collected from three regions (Linxia, Gannan, and Dingxi) in the southeastern province of Gansu, China. We measured three stable isotopes and eight mineral elements. Principal component analysis (PCA), partial least square discriminant analysis (PLS-DA) and linear discriminant analysis (LDA) for multivariate data discriminant analysis were conducted. We sought to (1) describe the differences between stable isotopes and mineral elements in three different regions (Linxia, Gannan and Dingxi); (2) screen the landmark mineral elements and stable isotope factors of A. sinensis in the different regions; and (3) compare PCA, PLS-DA and LDA's discrimination and cross-validation ability on A. sinensis.

Study area and sample collection
Samples were taken from the Linxia (LX), Gannan (GN) and Dingxi (DX) regions of southeast Gansu Province, China, between 34 • 24 -35 • 57 N and 103 • 11 -104 • 28 E. Linxia has a semi-arid climate, with an average annual temperature of 5.2-7.0 • C and an average annual precipitation of 350-660 mm. Gannan has a plateau climate, with large regional differences in annual average temperature and extremely uneven geographical distribution of precipitation; the average annual temperature is between 1-13 • C and the average annual precipitation is 518-634 mm. Dingxi is a middle temperate semi-arid area with a continental monsoon climate, with an average annual temperature of 5.5 • C and an average annual precipitation of 635 mm. We collected 25 samples from farmlands during the A. sinensis harvest season from April to May 2019. We contacted local farmers to obtain their permission to collect experimental samples. Global Positioning System (GPS) was used to record the longitude and latitude of each sampling plot, and ArcGIS (Version 10.7) was used to plot the sampling points. As shown in Fig. 1, samples were collected from planting sites (including five in Linxia, seven in Gannan, and 13 in Dingxi). We dug the roots of A. sinensis, gently brushed away the surface soil, and stored the sample in a self-sealing bag. Samples were put into a 4 • C fresh-keeping container for refrigeration. A freshly collected sample was cleaned and rinsed with deionized water three times then dried in a constant temperature oven at 70 • C to a constant weight. Dried roots were crushed by a high-speed pulverizer and were passed through a 100-mesh sieve. These samples were placed in self-sealing bags for storage.

Stable isotope measurement
We weighed 5.0 mg of ground A. sinensis into tin capsules. We used an isotope ratio mass spectrometer (IRMS Delta plus XP; Thermo-Fisher, USA) and an element analyzer (Flash EA) to analyze δ 13 C and δ 15 N. The sample was introduced into the element analyzer through the autosampler and was transferred to the IRMS through the carrier gas (helium) to determine δ 13 C and δ 15 N. Reference materials, including USGS24, IAEA-600, IAEA-N-2 and IAEA-NO-3, were used for calibration. During the analysis, a laboratory standard was interspersed for every 12 samples for calibration. The long-term accuracy of the instrument was 0.2 . We used a known laboratory wheat flour (B2159) standard with δ 13 C (δ 13 C V−PDB =−13.68 ± 0.2 ( )) and δ 15 N (δ 15 N Air =1.58 ± 0.15 ( )) values to check the accuracy of the instrumental condition. The samples were analyzed in triplicate. We calculated the average of the three samples.
In order to perform δ 18 O analysis, we weighed 1 mg of the dried A. sinensis sample and packed it into an isotope silver capsule. Samples were loaded into the elemental analyzer Flash EA through the autosampler, and the analyte was sent to the isotope ratio mass spectrometer (253plus; Thermo-Fisher, USA) for δ 18 O determination. The international reference standards for δ 18 O are IAEA-601and USGS55. The samples were analyzed in triplicate. We calculated the average of the three samples.
The following formula was used to calculate the relative deviation of the measured δ 13 C, δ 15 N and δ 18 O from the ratio of international standard substances:

Mineral elements analysis
All of the materials used for standard solution and sample processing and the Teflon digestion vessel were immersed in 20% HNO 3 solution for 24 h, then rinsed with deionized water and dried before testing the standard solution and samples. We used inductively coupled plasma mass spectrometry (ICP-MS) tuning solution to calibrate ICP-MS. The Rh standard solution was used as an internal standard. We prepared the working standard solution by diluting the stock standard solution of K, Mg, Ca, Zn, Cu, Mn, Cr and Al with 2% HNO 3 . The preparation concentration of K, Mg and Ca was 0-150 mg/L, and the preparation concentration of Zn, Cu, Mn, Cr and Al was 0-500 µg/L. Each working standard solution tested was repeated three times to obtain a standard curve. The standard curve R 2 of all the elements measured was greater than 0.9990.
We weighed 0.3 ± 0.01 g of the A. sinensis powder. These samples were then placed into a microwave digestion tank and 8 mL of concentrated HNO 3 was added. The container was covered and digested. After digestion was complete, the A. sinensis sample was cooled to room temperature in a 25 mL volumetric flask to a constant volume, and this was repeated three times. The digestion process was: (1) 800 W constant microwave power at 100 • C for 10 min; (2) 800 W constant microwave power at 150 • C for 10 min; (3) 800 W constant microwave power at 180 • C for 30 min.

Statistical analysis
We analyzed the significant differences between variables form the three different regions after testing the 11 variables' distribution and the analysis variables' basic statistics. These were statistically analyzed by grouping (P < 0.05), and multiple comparisons between paired groups were performed by using Tukey's honestly significant differences (Tukey HSD) method. We standardized the data and performed dimensionality reduction and discriminant analysis.
The traceability and provenance of food depends on reliable measurement data and using the appropriate chemometric data processing methods (Bertacchini et al., 2013). Single element stoichiometry is greatly limited in terms of traceability, and it is impossible to comprehensively evaluate a large number of variables with it. Thus, a lot of important traceability information is lost. Multivariate data analysis techniques such as PCA, PLS-DA, and LDA can effectively realize the comprehensive analysis of multiple ''fingerprint'' information. PCA analysis is an internal exploration of data, with only a natural grouping of a multivariate data set and no predictive function. PLS-DA is a supervised discrimination method and is different from PCA. It can integrate the basic functions of multiple linear regression analysis, canonical correlation analysis and principal component analysis and can solve the multicollinearity problem in linear regression (Brereton & Lloyd, 2014). PLS-DA assigns groups to the samples before classification. After grouping, the model adds an implicit virtual binary matrix as the response sample category (specifying a group as 1, and all other values are 0). The multivariate data set is the independent variable matrix used to train the model, the prediction sample independent variable data set is assigned through the threshold of the training model, and the sample category is defined. After establishing the A. sinensis PLS-DA model, the classification performance of the model and the model itself were evaluated. R 2 is the regression coefficient used in the PLS-DA model, which was used to evaluate the overall fit of the model. The recommended value of R 2 for a good model is between 0.6-1. Q 2 refers to the predictive ability of the model after modeling, and should be between 0.5 and 1 (Ballabio & Consonni, 2013;Albergamo et al., 2018). The Variable Importance Plot (VIP) was used to measure the variables with high contribution of the projection on the first two axes (Sayago, González-Domínguez & Beltrán, 2018). LDA is also a dimensionality reduction technique of supervised learning. LDA reduces the dimensionality of samples by maximizing the difference between groups and minimizing the difference within the group. It depends on the average within the group during dimensionality reduction. LDA and PLS-DA have the same data verification method. First, the model was verified by modeling its own data set. Due to the limited survey data, we used the leave-one-out method (LOOCV) to verify the predictive ability of the built model. Finally, statistics of real samples and model prediction samples were represented by a confusion matrix, and the model prediction accuracy rate was calculated. PCA, PLS-DA and LDA are traditional statistical discriminant methods, which usually perform well in terms of geographic origin classification when excluding redundant/confounding predictive factors (Gonzalvez, Armenta & De La Guardia, 2009). They have been widely used in the discriminant analysis of food and drug origin traceability.
The significant differences between each variable in the Linxia, Gannan and Dingxi samples and multiple comparisons of Tukey-HSD were determined using the psych package in R software (version 3.6.1). PCA was conducted using the FactoMineR and factoextra packages in R; PLS-DA model modeling, evaluation, and verification were determined by

Multi-element and stable isotope characteristics of A. sinensis
Statistical analysis of the eight mineral elements and three stable isotope ratios of A. sinensis sampled from three regions were determined using ICP-MS and IRMS. Results are presented in Table 1. Among them, the K content (5639.95 ± 311.94 mg/kg) in 25 A. sinensis samples was the highest, and the variation range and variance were the largest, followed by Ca (869.37 ± 34.70 mg/kg), Mg (751.88 ± 34.34 mg/kg), Al (119.13 ± 10.60 mg/kg), Zn (5.32 ± 0.22 mg/kg), Mn (5.27 ± 0.22 mg/kg), Cu (1.74 ± 0.08 mg/kg) and Cr (1.02 ± 0.11 mg/kg). K, Ca, Mg, Zn, Mn and Cu were moderately variable, while Al and Cr were highly variable. The stable isotopes δ 13 C, δ 15 N and δ 18 O were -23.79 ± 0.1 , 2.13 ± 0.39 and 25.07 ± 0.22 , respectively. The highest coefficient of variation of δ 15 N is 0.91, and the coefficient of variation of δ 13 C and δ 18 O were lower than 0.1. The variation of Al, Cr and δ 15 N were highly variable. Therefore, the greater the difference between elements among A. sinensis, the more likely it will be suitable for origin traceability. The mean ± standard error of Linxia, Gannan and Dingxi in Gansu Province were calculated to compare the differences between elements from the three regions (Table 2). Compared with the mineral elements in the three regions, Linxia A. sinensis K was significantly lower than Gannan and Dingxi (P < 0.05), while Al and Ca/Al were significantly higher than Gannan and Dingxi (P < 0.05). Mg, Zn, Cu, Ca, Mn, Zn/Mn and Cu/Cr had no significant difference. Among the stable isotopes, the δ 13 C and δ 15 N from Linxia were significantly (P < 0.05) lower than that of Gannan and Dingxi, and the δ 18 O from Gannan was significantly (P < 0.05) higher than that of Linxia and Dingxi.

PCA of A. sinensis
The original data (K, Mg, Zn/Mn, Cu/Cr, Ca/Al, δ 13 C, δ 15 N, δ 18 O) were normalized and analyzed by factor analysis. The Kaiser-Meyer-Olkin (KMO) test and Bartlett sphere test results were KMO = 0.506 (KMO > 0.5) and P = 0.045 (P < 0.05). The approximate chi-square value of Bartlett's sphere test was 41.83, which satisfies sampling adequacy and Bartlett's sphere test significance. Table 3 shows that in PCA analysis, the first principal component explained 38.42% of the total variance, the second principal component explained 20.31% of the total variance, and the first three principal components explained 73.68% of the total variance. These results can fully reflect the main data. The relationship between the principal component and each element was explored through the factor loading plot and the contribution rate of each element on the first five principal component axes. We examined the significant correlation (P < 0.05) between principal component axes and mineral elements and stable isotopes (Fig. 2). Figures 2A and 2B show that Ca/Al, K, δ 13 C, and δ 15 N had a significant positive correlation (P < 0.05) with the first principal component axis. Zn/Mn and Cu/Cr had a significant positive correlation (P < 0.05) with the second principal component axis, and δ 18 O had a significant negative correlation (P < 0.05) with the second principal component axis. The contribution of mineral elements and stable isotopes on the first five principal component axis was explained. Ca/Al, K, δ 13 C and δ 15 N had a high contribution to the first principal  Fig. 3A. The score diagram shows that the first principal component axis can distinguish A. sinensis sampled from Linxia and other places. Since Ca/Al, K, δ 13 C and δ 15 N contributed more to the first principal component axis, Ca/Al, K, δ 13 C and δ 15 N are important factors to distinguish A. sinensis from Linxia and other places. It also suggests that the Ca/Al, K, δ 13 C, and δ 15 N in Linxia A. sinensis are lower than that of Dingxi. Although there is no clear distinction between Dingxi and Gannan, most of the samples fall into the first and fourth quadrants, respectively, suggesting that A. sinensis  Figure 3B shows A. sinensis scores from Dingxi and Gannan, and the first two axes explain 90.23% of the total variance. The first principal component axis was mainly affected by Zn/Mn and δ 18 O, and the second principal component axis was mainly affected by Cu/Cr. The samples of A. sinensis from Dingxi and Gannan can be clearly classified.

PLS-DA of A. sinensis
We analyzed 25 samples of A. sinensis from Linxia, Gannan and Dingxi based on eight factors (K, Mg, Zn/Mn, Cu/Cr, Ca/Al, δ 13 C, δ 15 N, δ 18 O). Samples were analyzed using PLS-DA, and the first three component axes were extracted with eigenvalue ≥1. Table 4 shows that the first three component axes explained 70% of the total explanatory variables (the first axis explained 38%, the second axis explained 17%, and the third axis explained 15%). Fitting the response variables, the first three axes explained 68% of the total response variables (the first axis explained 29%, the second axis explained 36%, and the third axis explained 3%). In this model, the cumulative Q 2 of the first two axes was 0.52 (suggested Q 2 > 0.5), the negative effect of the third axis on the model's predictive ability was slightly reduced, and the cumulative Q 2 was 0.47.
We reduced the dimension of the explanatory variables and the scores of these variables are shown in Fig. 4A. t1 and t2 are the first and second component axes of the explanatory variables after dimensionality reduction. These explain 55% of the total variable variation. A. sinensis in the Linxia, Gannan and Dingxi regions are separated and clustered into one category, respectively. We were able to determine the importance of the mineral elements  and stable isotopes of A. sinensis in PLS-DA. We also determined the correlation between variables and variables and variables and sampling areas and analyzed the PLS-DA model loading graph (Fig. 4B). K, Cu/Cr, Ca/Al, δ 13 C, and δ 15 N had larger loading values in the first component, and had a negative correlation. The samples of A. sinensis from Linxia were distant from the sampling areas, so these five variables were in Linxia and the samples of A. sinensis from Gannan and Dingxi differed. Mg, Zn/Mn, and δ 18 O had higher loading values in the second component, and the second axis was positively correlated with Mg and Zn/Mn, and negatively correlated with δ 18 O. Since the A. sinensis samples from Dingxi and Gannan were far apart on the second axis, these three variables are important variables for separating Dingxi and Gannan. We analyzed the variable component axis of the projection in the PLS-DA model, and filtered out the important variables of the projection of the entire model (Fig. 5A). In the A. sinensis samples, the VIPs of the five variables (δ 18 O, Zn/Mn, δ 15 N, K and Ca/Al) were all greater than 0.9 (recommended is VIP > 0.5). This indicates that these are important variables for classification in the PLS-DA model and are of great significance to the model interpretation and source traceability of A. sinensis.
In order to evaluate the fit and prediction performance of the cross-validation set of the built A. sinensis PLS-DA model, R 2 VY was used to evaluate the fit of the cross-validation model, and Q 2 VY was used to evaluate the predictive ability of the cross-validation model. In order to prevent the samples from overfitting in the PLS-DA model, 200 permutation tests were performed on the first three component axes of samples from Linxia, Gannan and Dingxi. These tests evaluated the validity of the cross-validation model and the stability of the model fitting. Our results show that the R 2 VY of Linxia, Gannan and Dingxi were 0.62, 0.76, and 0.65, respectively, and the Q 2 VY were 0.41, 0.59, and 0.39, respectively, which reflects the good fit and predictive ability of the cross-validation training set to the PLS-DA model (Fig. 5B). We applied 200 permutation tests on the first three axes of samples taken from Linxia, Gannan and Dingxi, which indicated that the built PLS-DA cross-validation model showed strong validity.
Finally, internal verification was carried out on A. sinensis samples from Linxia, Gannan and Dingxi based on the PLS-DA model. Due to the small data set, the original data verification and leave-one-out cross validation were used to verify the A. sinensis samples ( Table 5). The original data successfully verified the samples of A. sinensis from Linxia, Gannan and Dingxi, and the accuracy rate of all the samples was 100%. We used leave-oneout cross validation to verify the samples, and the total corrected rate of all of the A. sinensis samples was 84%. The verification rate of A. sinensis from Linxia was only 60%, indicating that two samples of Linxia A. sinensis were wrongly classified as being from Dingxi. The verification rate of A. sinensis samples from Gannan was 100%, and all classifications were correct. The correct rate of the A. sinensis discriminant from Dingxi was 84.62%, of which 2 A. sinensis samples were wrongly judged as being from Linxia. This may be related to the geographical scale of Dingxi, and the large variation of variables among the species.

LDA of A. sinensis
Stable isotopes and mineral elements were analyzed using LDA. Our analysis found that the three A. sinensis samples were clustered together. We also found certain differences between the groups. The recognition ability of the three stable isotopes for the two discriminant functions (LD1, LD2) explained 100% of the total variance (discrimination function 1 explained 71.8% of the total variance, and discriminant function 2 explained 28.2% of the total variance; Fig. 6). The function test was performed after the LDA model was established. The Wilks' lambda value of the two discriminant functions was 0.073, and the   difference between the groups was significant (P < 0.001 We used the original data set and the leave-one-out test set to verify the built LDA model. The accuracy of the LDA model in the original data sets of Linxia, Gannan and Dingxi was 92% (Table 6). Among them, one sample of Linxia was wrongly identified as being from Dingxi, and two sample from Dingxi were wrongly identified as being from Gannan. The correct discrimination rate was 80% using leave-one-out cross validation. The misjudgment was the same as the original data set self-validation.

DISCUSSION
A. sinensis was discovered in Gansu and is recognized as high-quality medicinal material.
In 2017, A. sinensis was listed as a protected product in Minxian County, Gansu. Where specimens originate plays an important role in the formation of authentic medicinal materials. Blindly introducing authentic medicinal materials may cause a decline in quality and result in fake or inferior medicinal materials used in the production and sale of medicinal products. This would in turn reduce the safety and efficacy of Chinese medicinal materials. Food and drug quality and safety issues are closely related to health and consumers rights and involve multiple interests in the entire process. Recently, frequently occuring food and drug safety incidents has aroused public attention, which affects mutual trust in trade and threatens the safety of clinical medication. It is important to address the best ways to control and assess the safety of food and drugs. Food and drug safety be monitored at every point in the food supply chain from planting and harvesting to receiving. Meanwhile, the long-term development of geographical products requires strict management, and the application of scientific and technical means for monitoring and inspection. These methods will ensure the protection of producer and consumers interests and a sustainable development of the industry. A. sinensis from Gansu Province requires an effective tracing method. The traceability of the authentic medicinal material will ensure control over their safety, efficacy and provenance. Mineral elements and stable isotopes, as geographical indicators in plants, are closely related to factors such as climate, mineralogy, element mobility, bioavailability and physiological adaptability of the species. The mineral elements and stable isotopes in plants vary in different environments as determined by the plant's adaptation to the environment and the interaction between the environment and plants. The fingerprints of mineral elements and stable isotopes in plants are the imprint of the environment. Wang et al. (2020) traced the origin of 25 mineral elements in maize from three provinces and determined that the accuracy of eight elements using the SLDA model was 92.2%. Pereira et al. (2018) traced the origin of Arapaima spp. fish from four different regions using strontium and carbon isotopes. Their results show that the accuracy rate of identification of wild and aquaculture fish was 58% and that of four different regions was 76%. We collected mineral elements and stable isotopes to trace the geographical origin of A. sinensis and determine differences between the geological and climatic environments in the three A. sinensis-producing areas. Based on the significant differences and the results of our model, K, Ca/Al, δ 13 C, δ 18 O and δ 15 N were determined to be important factors to distinguish A. sinensis in these regions.
We compared the discriminant analysis of A. sinensis using PLS-DA and LDA. The original data set was used to verify the model, and we found that the accuracy of the PLS-DA model was higher than that of the LDA model. The model was verified by the leave-one-out, and the accuracy rate of the PLS-DA model was higher than that of LDA model. The self-verification of the original data set to the model was a verification of the built model itself. The original data sets involved in the modeling process and the information from the verified original data set were included in the verification as the model was built. Leave-one-out cross validation is suitable for small sample analysis. A single prediction sample was removed during model modeling, the model was trained on all remaining samples, and then all samples were predicted. The total variance explained by the first two axes of the LDA model was higher than that of the PLS-DA model. Model verification using the leave-one-out method showed that PLS-DA was more correct than LDA. The PCA model is suitable for exploring the natural classification of variables in A. sinensis from Linxia, Gannan and Dingxi. The PLS-DA model evaluated the importance of variables and classified the samples. The PLS-DA model was also better than the LDA model in terms of classification and discrimination. Our study integrated mineral elements and stable isotopes to trace the origin of A. sinensis from three regions of Gansu Province. We used the PCA model to explore the data naturally, and the PLS-DA model and the LDA model to leave-one-out cross-validation. The comprehensive accuracy rates were 84% and 80%, respectively.

CONCLUSION
Differences between mineral elements and stable isotopes of A. sinensis sampled from Linxia, Gannan and Dingxi, and Gansu Provinces were found and compared. Significant differences were found in K, Cr, Ca/Al, δ 13 C, δ 15 N, and δ 18 O (P < 0.05).
Significant differences were found among the three groups. PCA and PLS-DA models showed that K, Zn/Mn, Ca/Al, δ 13 C, δ 15 N and δ 18 O were important variables for distinguishing the three regions. However, only K, Ca/Al, δ 13 C, δ 18 O, and δ 15 N demonstrate significant differences among the three regions (P < 0.05). K, Ca/Al, δ 13 C, δ 18 O, and δ 15 N play an important role in the discriminant analysis of A. sinensis among the three regions.
The results of mineral elements and stable isotopes from the PLS-DA and LDA show that the discriminant accuracy rate of the PLS-DA model was 84% and the accuracy rate of the LDA model was 80%.