Monitoring Soybean Soil Moisture Content Based on UAV Multispectral and Thermal-Infrared Remote-Sensing Information Fusion

By integrating the thermal characteristics from thermal-infrared remote sensing with the physiological and structural information of vegetation revealed by multispectral remote sensing, a more comprehensive assessment of the crop soil-moisture-status response can be achieved. In this study, multispectral and thermal-infrared remote-sensing data, along with soil-moisture-content (SMC) samples (0~20 cm, 20~40 cm, and 40~60 cm soil layers), were collected during the flowering stage of soybean. Data sources included vegetation indices, texture features, texture indices, and thermal-infrared vegetation indices. Spectral parameters with a significant correlation level (p < 0.01) were selected and input into the model as single- and fuse-input variables. Three machine learning methods, eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Genetic Algorithm-optimized Backpropagation Neural Network (GA-BP), were utilized to construct prediction models for soybean SMC based on the fusion of UAV multispectral and thermal-infrared remote-sensing information. The results indicated that among the single-input variables, the vegetation indices (VIs) derived from multispectral sensors had the optimal accuracy for monitoring SMC in different soil layers under soybean cultivation. The prediction accuracy was the lowest when using single-texture information, while the combination of texture feature values into new texture indices significantly improved the performance of estimating SMC. The fusion of vegetation indices (VIs), texture indices (TIs), and thermal-infrared vegetation indices (TVIs) provided a better prediction of soybean SMC. The optimal prediction model for SMC in different soil layers under soybean cultivation was constructed based on the input combination of VIs + TIs + TVIs, and XGBoost was identified as the preferred method for soybean SMC monitoring and modeling, with its R2 = 0.780, RMSE = 0.437%, and MRE = 1.667% in predicting 0~20 cm SMC. In summary, the fusion of UAV multispectral and thermal-infrared remote-sensing information has good application value in predicting SMC in different soil layers under soybean cultivation. This study can provide technical support for precise management of soybean soil moisture status using the UAV platform.


Introduction
Soybeans, rich in protein and oil, hold an important position in the trade of grain and oil crops in China [1,2].They are not only one of the important grain and oil crops, but also an important industrial raw material with a wide range of industrial uses and a huge consumer market [3,4].Soil is an important natural resource for the survival of living organisms and is closely related to human production, life, and social activities [5,6].Soil moisture is one of the key components of the Earth's ecosystem [7], and accurately Plants 2024, 13, 2417 2 of 20 obtaining the spatiotemporal distribution and variation information of soil moisture is crucial for agricultural production.
Soil moisture in the field is closely related to crop growth, and plays a key role in the growth and development of crops [8,9].Accurate estimation of soil moisture content (SMC) in the field is of great significance for predicting crop yield [10], field water stress [11], and crop growth conditions [12].Insufficient SMC may lead to water stress in crops, resulting in conditions such as wilting, slow growth, or even death [13], while excessive soil moisture can lead to root diseases and hypoxia [14], affecting the normal development of crops.Therefore, monitoring SMC throughout the entire crop growth cycle is crucial, as it can reveal details of the crop's water and nutrient status, potential environmental stresses, and the interaction between crops and the soil environment [15], which may vary in different geographical locations and under different climatic conditions.Soil moisture monitoring can help farmers or growers to detect potential water-stress issues in a timely manner and take timely irrigation or drainage measures [16].Visually, the crop's response to water stress may not be apparent in the early stages, especially in large areas of farmland where subtle changes are difficult to detect with the naked eye.Furthermore, conventional techniques for assessing soil moisture levels, including the gravimetric technique and the Time Domain Reflectometry (TDR) method [17], although accurate, are often time-consuming and require destructive sampling, which limits their scalability and real-time applicability in large areas of farmland [18].Therefore, traditional methods may not be suitable for real-time monitoring and rapid response of soil moisture conditions in precision agriculture.
UAV systems equipped with various portable sensors have become widely used tools in crop monitoring, due to their ability to quickly and efficiently obtain high-resolution remote-sensing images that reflect the growth status of crops [19].This technology can more quickly and comprehensively obtain information about crop health, growth trends, and environmental variables, thereby helping farmers and researchers make wiser decisions to improve crop productivity, resource efficiency, and overall agricultural sustainability [20].In recent years, the application of UAV multispectral sensors in precision agriculture has been continuously expanding, including estimating crop leaf-nitrogen content [21], aboveground biomass [22], chlorophyll content [23], and SMC [24].Vegetation indices extracted from UAV multispectral images have great potential for monitoring SMC, which enhances the ability to reflect crop growth conditions through mathematical methods such as summation, ratio, and normalization of the spectral reflectance of the crop canopy [25], reducing the impact of soil, noise, and other external environmental factors [26], providing an effective method for monitoring SMC.In addition, texture features, because of their ability to reflect the crop canopy and distribution patterns [27], are often combined with vegetation indices to improve the accuracy of predicting crop parameters.Yang et al. (2024) used vegetation indices, texture features, and texture indices to predict the water content of soybean leaves, and found that using the complementary information of vegetation indices and texture can significantly improve the prediction accuracy of the model [28].Thermal-infrared sensors, because of their ability to directly reflect the temperature of the crop canopy, are closely related to crop water stress and the degree of soil drought [29].The strategy of fusing thermal-infrared and multispectral data has been proven to significantly enhance the accuracy of predicting water-stress indicators such as SMC [30], leaf water content [31], and canopy transpiration and soil evaporation [32].By integrating the thermal characteristics of thermal-infrared remote sensing with the physiological and structural information of vegetation revealed by multispectral remote sensing, a more comprehensive understanding of the crop's response to moisture conditions can be achieved [33].Peng et al. (2022) used vegetation indices, coverage, and canopy temperature to establish a model for monitoring the moisture status of grape plants, and the model established with the combination of multispectral and thermal-infrared information as predictive variables was superior to the variable combination using single-sensor information [34].
During the flowering stage of soybeans, we collected multispectral and thermalinfrared remote-sensing information, as well as soil moisture-content sample data.During Plants 2024, 13, 2417 3 of 20 the flowering stage of soybean, the canopy coverage is high, allowing the inference of plant water-stress conditions through analysis of spectral data from the canopy.The spectral response of plants under water stress can be quantified using vegetation indices and texture features.Furthermore, although optical data cannot directly penetrate deep soil layers, the distribution of plant roots in the shallow soil enables the plant's water status to indirectly reflect the moisture conditions of deeper soil layers through its canopy spectral characteristics.This indirect relationship is well-documented in agricultural and vegetation research.This study aims to explore the synergistic effects and monitoring capabilities of multispectral remote-sensing information and thermal-infrared information in predicting soil moisture content in soybeans.The specific purpose of the study is (1) determining the optimal combination of multispectral and thermal-infrared information; and (2) combining machine learning algorithms to construct the optimal prediction model for soybean soilmoisture content using remote-sensing information-fusion technology.By optimizing the combination of multispectral and thermal-infrared information, this study not only improves the accuracy of soil-moisture-content prediction but also provides an efficient monitoring tool for agricultural soil moisture management.This is of significant practical value for guiding precision irrigation, optimizing water resource utilization, and increasing crop yields.

Research Area and Test Design
The experiment was conducted in the loess soil of the experimental field at the Watersaving Irrigation Test Station of the Key Laboratory of Agricultural Soil and Water Engineering in Arid and Semiarid Areas of the Ministry of Education, Northwest A&F University, Yangling, Shaanxi (Figure 1).Situated at the geographic coordinates of 108 • 24 ′ east longitude and 34 • 20 ′ north latitude, the experimental site is perched at an elevation of 524.7 m above sea level.It falls within the warm temperate monsoon climate category, characterized by a semi-humid region where the bulk of the precipitation occurs during the months of July to September.The long-term precipitation average stands at 580 mm, while the average rate of evaporation is notably higher, reaching 1500 mm.The region experiences a mean annual temperature of 12.9 degrees Celsius.The soil's field capacity to retain water within the top 100 cm of the soil profile is between 23% and 25%, with the moisture level at which plants begin to wilt being 8.5% by mass.In the uppermost 20 cm of the soil, the pH level is measured at 8.14, enriched with organic matter at a concentration of 12.0 g/kg.The soil also contains total phosphorus at 0.60 g/kg, with 8.21 mg/kg available for plant uptake.Potassium levels are substantial, with a total content of 14.10 g/kg and a rapidly available fraction of 131.97 mg/kg.The total nitrogen content is recorded at 0.89 g/kg, complemented by an alkaline hydrolyzable nitrogen fraction of 55.30 mg/kg.

Experimental Design
This experiment was designed with four nitrogen application rates: N0 (0 kg N/ha), N1 (60 kg N/ha), N2 (120 kg N/ha), and N3 (180 kg N/ha).Additionally, four types of mulching were implemented: Straw mulching (SM), Straw and film mulching (SFM), Film Mulching (FM), and No mulching (NM), resulting in a total of 16 treatments with three replicates each.The plot area for each was 4 m by 6 m, equating to 24 m 2 , and plots were arranged randomly with a 2 m protective zone around the experimental area.Phosphorus and potassium supplements were uniformly distributed across the experimental plots at a dosage of 30 kg/ha.The nitrogen input was facilitated through urea, which contains 46% nitrogen.The phosphorus source was in the form of superphosphate, comprising 16% phosphorus, while the potassium was supplied by potassium chloride, with a concentration of 62% potassium.The application of these fertilizers took place in trenches positioned 25 cm from the crop line, coinciding with the pre-sowing phase.Concurrently, the straw mulching amount was 9000 kg/ha.The cultivation of soybeans was executed with a density of 300,000 plants per hectare, arranged with a row spacing of 50 cm and an inter-plant distance of 10 cm.The soybean variety is Shanning 17. Soybeans were sown on 14 June 2023, and 4 June 2024, and harvested on 29 September 2023; the 2024 crop had not been harvested at the time of writing.Other field management practices (such as pesticide application and weeding) were consistent with local standards.Remote-sensing and ground data were collected on 4 August 2023, and 31 July 2024, corresponding to the flowering stage of the soybean.

Experimental Design
This experiment was designed with four nitrogen application rates: N0 (0 kg N/ha), N1 (60 kg N/ha), N2 (120 kg N/ha), and N3 (180 kg N/ha).Additionally, four types of mulching were implemented: Straw mulching (SM), Straw and film mulching (SFM), Film Mulching (FM), and No mulching (NM), resulting in a total of 16 treatments with three replicates each.The plot area for each was 4 m by 6 m, equating to 24 m 2 , and plots were arranged randomly with a 2 m protective zone around the experimental area.Phosphorus and potassium supplements were uniformly distributed across the experimental plots at a dosage of 30 kg/ha.The nitrogen input was facilitated through urea, which contains 46% nitrogen.The phosphorus source was in the form of superphosphate, comprising 16% phosphorus, while the potassium was supplied by potassium chloride, with a concentration of 62% potassium.The application of these fertilizers took place in trenches positioned 25 cm from the crop line, coinciding with the pre-sowing phase.Concurrently, the straw mulching amount was 9000 kg/ha.The cultivation of soybeans was executed with a density of 300,000 plants per hectare, arranged with a row spacing of 50 cm and an interplant distance of 10 cm.The soybean variety is Shanning 17. Soybeans were sown on 14 June 2023, and 4 June 2024, and harvested on 29 September 2023; the 2024 crop had not been harvested at the time of writing.Other field management practices (such as pesticide application and weeding) were consistent with local standards.Remote-sensing and ground data were collected on 4 August 2023, and 31 July 2024, corresponding to the flowering stage of the soybean.

Remote-Sensing Data Acquisition
Data acquisition via unmanned aerial vehicle (UAV) remote sensing was facilitated by the Matrice600Pro hexacopter (DJI, Shenzhen, China), a model engineered by DJI.This UAV boasts a substantial maximum payload capacity of 6 kg and can sustain flight operations for a duration ranging from 25 to 35 min.The Matrice600Pro was outfitted with a duo of advanced sensors designed for capturing imagery: a thermal-infrared imaging sensor branded as ZENMUSE XT (DJI, Shenzhen, China) and a multispectral sensor known as Yusense MS600 (Yusense Information Technology and Equipment (Qingdao) Co., Ltd., Qingdao, China).The aerial missions were strategically scheduled on days characterized

Remote-Sensing Data Acquisition
Data acquisition via unmanned aerial vehicle (UAV) remote sensing was facilitated by the Matrice600Pro hexacopter (DJI, Shenzhen, China), a model engineered by DJI.This UAV boasts a substantial maximum payload capacity of 6 kg and can sustain flight operations for a duration ranging from 25 to 35 min.The Matrice600Pro was outfitted with a duo of advanced sensors designed for capturing imagery: a thermal-infrared imaging sensor branded as ZENMUSE XT (DJI, Shenzhen, China) and a multispectral sensor known as Yusense MS600 (Yusense Information Technology and Equipment (Qingdao) Co., Ltd., Qingdao, China).The aerial missions were strategically scheduled on days characterized by the absence of wind and the presence of clear, sunny skies, to ensure optimal conditions for data collection.The UAV flew at a height of 20 m for all flight missions, and the MS600 multispectral camera was configured with six spectral bands for spectral information collection, namely 490 nm, 555 nm, 680 nm, 720 nm, 800 nm, and 900 nm, with a resolution of 1 cm.The thermal-infrared camera had a spectral range of 7.5~13.5 µm and a resolution of 0.05 • C, and images were taken with the camera lens perpendicular to the ground during the flight.To ensure that our estimation of soil moisture content (SMC) is minimally affected by field cover methods, we implemented the following measures: first, we conducted extensive data collection under different cover conditions, including spectral data collection under various cover materials and vegetation coverage levels within the same growth stage.Second, during the data analysis phase, we paid special attention to the impact of cover conditions on spectral and thermal-infrared data, employing advanced data processing techniques, such as spectral feature extraction in ENVI software, to distinguish and extract information related solely to the plant canopy.This allowed us to eliminate the influence of cover factors from spectral data.

Ground Data Collection
Ground data were collected simultaneously with the remote-sensing data, mainly including the collection of soil moisture content (SMC) and canopy temperature in each plot.At the flowering stage of soybean, the range of root zone is 0~60 cm.Soil moisture was measured using the oven-drying method, determining the water content W of the soil in the 0~20 cm, 20~40 cm, and 40~60 cm soil layers under soybean cultivation within each treatment plot.Five points were selected in each plot, using the five-point sampling method, and soil samples were collected at each point with an auger and collected in aluminum boxes, with the average of five sets of data taken as the final value.The wet weight M1 of the soil samples (less than 100 g) was promptly weighed using a balance with a sensitivity of 0.01 g, and then the soil samples were dried in an oven at 105 • C for 6~8 h until a constant weight was reached, with the oven temperature maintained between 100 and 110 • C. The dry weight M2 of the soil samples was then measured, and the SMC was calculated using the following formula: Canopy temperature was measured using a handheld thermal-infrared thermometer with a thermal accuracy of ±1 • C. In each experimental plot, six plants were selected, and the average canopy temperature of the six plants was taken as the canopy temperature of the soybean in that plot.While collecting canopy temperature, the temperature of the water in a bucket placed in advance in the experimental area was also collected.The thermalinfrared images collected by the UAV were input into the FLIR Tools V3.1.1 software, and the average leaf temperature and water temperature were used as reference temperatures to calibrate the pixel temperatures in the thermal-infrared images.The canopy temperature at different times in each plot was extracted, and the measured value of the handheld thermometer was used as the reference value for the canopy temperature of the soybean, to carry out the analysis of the results and accuracy of the thermal-infrared image temperature and the actual measured canopy temperature.Figure 2 shows the correlation between the canopy temperature extracted from the thermal-infrared image and the actual canopy temperature at the flowering stage of the soybean, with an R 2 of 0.892, and RMSE and MRE of 1.004 • C and 2.697%, respectively.The high accuracy and good fit indicate a strong correlation between the thermal-infrared image temperature and the actual measured canopy temperature.

Preprocessing of Multi-Source Remote-Sensing Data
In this study, the Yusense Map 2.2.2 software was used to stitch the UAV multispectral images, with automatic geometric and radiometric correction preprocessing during the stitching process.The ENVI 5.3 software was used to perform mask processing on the shadows and soil background in each plot of the stitched images, based on the threshold method, extracting the spectral reflectance of each plot and each band.Vegetation indices were obtained through the combination of the extracted single-band spectral reflectance.The specific vegetation indices selected are shown in Table 1.
The Gray-Level Co-occurrence Matrix (GLCM) stands as a prevalent technique for texture analysis, currently among the most frequently utilized in the field.Characterized by its robustness to image rotation, capability to capture multi-scale characteristics, and relatively low computational demand, the GLCM has found extensive applications across various domains, including image processing, pattern recognition, and remote-sensing observation.The texture attributes derived from this method encompass a spectrum of statistical measures such as Mean, Variance, Homogeneity, Contrast, Dissimilarity, Entropy, Second Moment, and Correlation, as detailed in Table 2.In ENVI 5.3 (Harris, Bloomfield, CO, USA), GLCM is used to perform a 3 × 3 sliding filter on the grayscale images of RGB images to extract 8 texture features in the directions of 0 • , 45

Preprocessing of Multi-Source Remote-Sensing Data
In this study, the Yusense Map 2.2.2 software was used to stitch the UAV multispectral images, with automatic geometric and radiometric correction preprocessing during the stitching process.The ENVI 5.3 software was used to perform mask processing on the shadows and soil background in each plot of the stitched images, based on the threshold method, extracting the spectral reflectance of each plot and each band.Vegetation indices were obtained through the combination of the extracted single-band spectral reflectance.The specific vegetation indices selected are shown in Table 1.

Vegetation Index Computational Formula References
Modified Triangular Vegetation Index (MTVI) Soil-Adjusted Vegetation Index (SAVI) Optimized Soil-Adjusted Vegetation Index (OSAVI) Enhanced Vegetation Index (EVI) 2.5(RN IR − RR)/(R N IR + 6R R − 7.5R B + 1) [36] Notes: In the table, R B , R G , R R , R RE , R N IR represent the band reflectance of blue, green, red, red edge, and near-infrared bands, respectively, X represents the optimized value to reduce soil background effects, set as 0.16 in this study.The near-infrared bands used in this study were 800 and 900 nm, chosen based on their high correlation with the samples.

Textural Features Formula
Mean, Mea Mea = ∑ G i,j=1 (iP(i, j)) This study also utilized texture features to construct texture indices.Six types of texture indices (TI) were defined as follows: Normalized difference texture index (NDTI), Difference texture index (DTI), Ratio texture index (RTI), Nonlinear texture index (NTI), Reciprocal difference texture index (RDTI), and Reciprocal additive texture index (RATI).All possible combinations of measurements were made by combining six bands (490, 555, 680, 720, 800, and 900 nm) with eight GLCM-based texture-feature values, to explore their estimation capabilities.
In the formulas mentioned, T1 and T2 represent any two different texture features based on the Gray-Level Co-occurrence Matrix (GLCM).
Thermal-infrared technology is widely applied in the study of crop water stress, the monitoring of infectious diseases, frost damage, and yield prediction.In this study, canopy temperature (T C ) was extracted from thermal-infrared images, and four thermalinfrared indices were calculated based on this, namely Canopy-Air Temperature Difference (T C D), Normalized Canopy Temperature (NRCT), Canopy Relative-Temperature Difference (CRTD), and Soil Relative-Temperature Difference (SRTD).The specific formulas are as follows: In this formula, T c represents the air temperature, T ci is the canopy temperature of the i-th pixel in the image, T cmax is the maximum temperature measured across the entire experimental field, T cmin is the minimum temperature measured across the entire experimental field, T smax is the maximum soil temperature within each sampled plot, and T smin is the minimum soil temperature within each sampled plot.

Model Construction and Validation
This study collected multispectral and thermal-infrared remote-sensing data, as well as SMC sample data during the flowering stage of soybean, comprising a total of 96 data sets.Two-thirds of the data samples were used as the model training set, and one-third as the model validation set.The data sources included Vegetation Indices (VIs), Texture Features (TF), Texture Indices (TIs), and Thermal-Infrared Vegetation Indices (TVIs).Spectral parameters with a correlation significant at the p < 0.01 level were selected and input into the model as single-and fused-input variables.In the quest to develop predictive models for soybean soil-moisture content (SMC), we leveraged a trio of sophisticated machine learning techniques, namely eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and a Backpropagation Neural Network (BP) enhanced by a Genetic Algorithm (GA-BP).The XGBoost technique underwent a meticulous parameter tuning process using a grid search approach, culminating in the selection of 100 boosting stages (n_estimators), a learning rate set to 0.03, and a cap on the tree depth at 5, as per the optimized parameters [48].The implementation of the RF model required the construction of a large number of decision trees and the use of variable swapping and alteration to enhance predictive performance; after multiple training sessions and error analyses, the number of decision trees was set to 100 [49].The genetic algorithm optimized the initial weights of the BP neural network, replacing the gradient descent process for adjusting network weights and thresholds.The genetic algorithm started with a population size of 5, set for 50 generations, with a crossover probability of 0.4% and a mutation probability of 0.05%; after optimization, the BP neural network had 5 nodes in the input layer, 5 nodes in the hidden layer, 1 node in the output layer, a maximum of 1000 iterations, and a training target of 1 × 10 −6 [50].All algorithmic modeling in this study was completed in MATLAB R2022a software.
The accuracy and reliability of the model's fit were assessed through key performance indicators such as the Coefficient of Determination (R 2 ), the Root Mean Square Error (RMSE), and the Mean Relative Error (MRE).A high R 2 value, nearing the threshold of 1, denotes a model with exceptional predictive capabilities.Furthermore, minimal RMSE and MRE values suggest a model that exhibits consistent performance and delivers predictions that are closely aligned with one another.The calculation formulas are as follows: In the formula ŷi -Model predictive values: y i -Actual sampling value; y-Actual sampling value; n-number of samples.

Methodology
To explore the synergistic effect and monitoring capability of multispectral remotesensing information and thermal-infrared information in predicting SMC, this study collected multispectral and thermal-infrared remote-sensing information and SMC sample data during the flowering stage of soybean.The data sources included vegetation indices (VIs), texture features (TF), texture indices (TIs), and thermal-infrared vegetation indices (TVIs).Spectral parameters with a significant correlation level (p < 0.01) were selected and Plants 2024, 13, 2417 9 of 20 input into the model as single-and fused-input variables.In this research, a trio of advanced machine learning algorithms was employed to develop predictive models for soybean soil-moisture content (SMC).These included eXtreme Gradient Boosting (XGBoost), the ensemble method known as Random Forest (RF), and a Backpropagation Neural Network (BP) enhanced by a Genetic Algorithm (GA).The models were formulated utilizing the integrated data acquired from unmanned aerial vehicle (UAV)-mounted multispectral and thermal-infrared sensors.A visual representation of the workflow and analytical process is depicted in Figure 3, illustrating the sequence of data handling and the methodologies applied throughout the study.
Plants 2024, 13, x FOR PEER REVIEW 10 of 21 the ensemble method known as Random Forest (RF), and a Backpropagation Neural Network (BP) enhanced by a Genetic Algorithm (GA).The models were formulated utilizing the integrated data acquired from unmanned aerial vehicle (UAV)-mounted multispectral and thermal-infrared sensors.A visual representation of the workflow and analytical process is depicted in Figure 3, illustrating the sequence of data handling and the methodologies applied throughout the study.

Selection of Spectral Indices and Textural Features
To fully exploit the spectral information contained within multispectral data, this study selected 15 commonly used vegetation indices (Table 1) and analyzed their correlation with the soil moisture content (SMC) at different soil layers under soybean cultivation.The vegetation indices with correlations significant at the p < 0.01 level are presented in Table 3. Significant correlations (p < 0.01) with SMC were found for indices across various soil layers under soybean cultivation.Notably, the MSR, DVI, GCVI, and NLI all demonstrated strong correlations, with MSR showing the highest correlation coefficients of −0.661, −0.657, and −0.510 for the 0~20 cm, 20~40 cm, and 40~60 cm soil layers, respectively.Additionally, the study calculated the correlation between texture features and the SMC at different soil layers under soybean cultivation (Table 4).The texture features with the highest correlation coefficients in the 0~20 cm, 20~40 cm, and 40~60 cm soil layers under soybean cultivation were the Second Moment of band 5, and the mean values of bands 2 and 3, with correlation coefficients of 0.585, 0.644, and 0.519, respectively.This comprehensive analysis ensures the selection of the most relevant spectral and textural indicators for accurate estimation of SMC, which is crucial for the development of robust predictive models in agricultural and environmental studies.

Selection of Spectral Indices and Textural Features
To fully exploit the spectral information contained within multispectral data, this study selected 15 commonly used vegetation indices (Table 1) and analyzed their correlation with the soil moisture content (SMC) at different soil layers under soybean cultivation.The vegetation indices with correlations significant at the p < 0.01 level are presented in Table 3. Significant correlations (p < 0.01) with SMC were found for indices across various soil layers under soybean cultivation.Notably, the MSR, DVI, GCVI, and NLI all demonstrated strong correlations, with MSR showing the highest correlation coefficients of −0.661, −0.657, and −0.510 for the 0~20 cm, 20~40 cm, and 40~60 cm soil layers, respectively.Additionally, the study calculated the correlation between texture features and the SMC at different soil layers under soybean cultivation (Table 4).The texture features with the highest correlation coefficients in the 0~20 cm, 20~40 cm, and 40~60 cm soil layers under soybean cultivation were the Second Moment of band 5, and the mean values of bands 2 and 3, with correlation coefficients of 0.585, 0.644, and 0.519, respectively.This comprehensive analysis ensures the selection of the most relevant spectral and textural indicators for accurate estimation of SMC, which is crucial for the development of robust predictive models in agricultural and environmental studies.

Texture-Feature-Index Construction
To enhance the relevance of texture features, we extracted six different texture indices through random combinations of various texture features (Table 5 and Figure 4).After screening, the correlations between the randomly combined texture indices and the SMC at different soil layers under soybean cultivation all reached a highly significant level (p < 0.01).Overall, the Reciprocal Difference Texture Index (RDTI) was the most relevant texture index for the SMC in the 0~20 cm soil layer of soybean, with a correlation coefficient of 0.646, and the texture combination was (Cor3, Ent5).The Nonlinear Texture Index (NTI) was the most relevant texture index for the SMC in the 20~40 cm and 40~60 cm soil layers under soybean cultivation, with correlation coefficients of −0.583 and −0.550, respectively, and the texture combinations were (Sec5, Hom5) and (Mea3, Mea2).

Selection of Thermal-Infrared Vegetation Indices
In this study, canopy temperature (T C ) was extracted from thermal-infrared images, and four thermal-infrared vegetation indices were calculated based on this: Canopy-Air Temperature Difference (T C D), Normalized Canopy Temperature (NRCT), Canopy Relative-Temperature Difference (CRTD), and Soil Relative-Temperature Difference (SRTD).The correlation between these five indices and the SMC at different soil layers under soybean cultivation was analyzed (Figure 5).The results indicated that CRTD and SRTD had a better correlation with the SMC of soybean, with both correlations being at a highly significant level (p < 0.01).The correlation of the thermal-infrared indices constructed at different soil layers under soybean cultivation with SMC showed a trend of 0~20 cm > 20~40 cm > 40~60 cm.

Soybean Soil-Moisture Content (SMC) Prediction Models Based on Multispectral, Thermal-Infrared, and Multispectral Thermal-Infrared Remote-Sensing Information Fusion
This study collected multispectral and thermal-infrared remote-sensing information during the flowering stage of soybean, and extracted four types of model input sources: vegetation indices, texture features, texture indices, and thermal-infrared vegetation indices.The models were assessed with both single-input variables and fused-input variables.When evaluating the performance of the models with single-input variables (Table 6), Vegetation Indices (VIs) were found to be the optimal single-model input, with all four models achieving the highest accuracy and good fit (Figure 6).The XGBoost model performed the best, with an R 2 range of 0.456 to 0.683 in the validation set, an RMSE range of 0.631 to 0.795%, and an MRE range of 2.910 to 3.444%.The correlation between these five indices and the SMC at different soil layers under soybean cultivation was analyzed (Figure 5).The results indicated that CRTD and SRTD had a better correlation with the SMC of soybean, with both correlations being at a highly significant level (p < 0.01).The correlation of the thermal-infrared indices constructed at different soil layers under soybean cultivation with SMC showed a trend of 0~20 cm > 20~40 cm > 40~60 cm.

Soybean Soil-Moisture Content (SMC) Prediction Models Based on Multispectral, Thermal-Infrared, and Multispectral Thermal-Infrared Remote-Sensing Information Fusion
This study collected multispectral and thermal-infrared remote-sensing information during the flowering stage of soybean, and extracted four types of model input sources: vegetation indices, texture features, texture indices, and thermal-infrared vegetation indices.The models were assessed with both single-input variables and fused-input variables.When evaluating the performance of the models with single-input variables (Table 6), Vegetation Indices (VIs) were found to be the optimal single-model input, with all four models achieving the highest accuracy and good fit (Figure 6).The XGBoost model performed the best, with an R 2 range of 0.456 to 0.683 in the validation set, an RMSE range of 0.631 to 0.795%, and an MRE range of 2.910 to 3.444%.The fused-input variables were input into the model, and their model performance was assessed (Table 7).The combination of Vegetation Indices (VIs), Texture Features (TIs), and Thermal-Infrared Vegetation Indices (TVIs) proved to be the optimal fused-  The fused-input variables were input into the model, and their model performance was assessed (Table 7).The combination of Vegetation Indices (VIs), Texture Features (TIs), and Thermal-Infrared Vegetation Indices (TVIs) proved to be the optimal fused-model input, with all four models achieving the highest accuracy and good fit (Figure 7).The XGBoost model also performed the best among them, with an R 2 range of 0.589 to 0.780 in the validation set, an RMSE range of 0.437 to 0.793%, and an MRE range of 1.667 to 3.080%.The performance of different models can be represented as XGBoost > GA-BP > RF.Moreover, compared to the 40~60 cm soil layer, these models demonstrated better predictive performance in the 0~20 cm and 20~40 cm soil layers.

Discussion
Multispectral and thermal-infrared sensors, due to their complementary information, often integrate the information from both types of sensors for estimating crop-growth physiological indicators and assessing water and nutrient status in agricultural remote sensing [51].This study acquired remote-sensing information from different sensors to assess and predict the soil moisture-content (SMC) status of soybean.The results showed that the vegetation indices (VIs) based on multispectral sensors had the best accuracy in monitoring SMC in different soil layers under soybean cultivation, with a determination coefficient ranging from 0.456 to 0.683.Vegetation indices can fully utilize the rich information of multispectral data, reduce the impact of soil information on SMC, eliminate redundant information, and reduce the complexity of the model [52].Previous studies have shown that vegetation indices such as the Normalized Difference Vegetation Index (NDVI), Modified Simple Ratio (MSR), and Triangular Vegetation Index (TVI), built based on multispectral information, can greatly preserve the spectral characteristics of the vegetation canopy and can assess well the water stress of crops like winter wheat [53] and rapeseed [17].The thermal-infrared vegetation indices (TVIs) based on thermal-infrared sensors also show good results in soil moisture monitoring, with a determination coefficient ranging from 0.384 to 0.601.The information from thermal-infrared sensors can directly reflect the crop canopy temperature, which is closely related to the water status of the crop and can characterize well the water stress of the crop [54].Marques et al. (2023) harnessed aerial imagery obtained from Unmanned Aerial Vehicles (UAVs) to forecast critical indicators of plant water stress, which included relative water content (RWC), midday leaf-water potential (Ψ MD ), and stomatal conductance (gs).Their findings underscored the substantial efficacy of thermal vegetation indices in the evaluation of these stress indicators [31].When using single-texture information, the prediction accuracy is the worst, but by combining texture-feature values to obtain new texture indices, the performance of estimating SMC by texture features can be effectively improved.This is because combining texture-feature values through normalization, difference, ratio, and other methods can reduce the impact of soil background, solar angle, and sensor view angle, amplify the subtle differences between object spectral features, and highlight object features [55].
When using two input variables to predict the SMC status of different soil layers under soybean cultivation, the vegetation index combined with texture index (VIs + TIs) based on multispectral sensors achieved the best accuracy.After the vegetation index is combined with texture information, it can not only represent the canopy structure of the crop, but also reduce the impact of "same spectrum different objects" and "same object different spectrum", thereby improving the prediction accuracy of crop growth parameters, to a certain extent [56].When predicting the soil moisture status of different soil layers under soybean cultivation based on the input variables of Vis + TVIs, the accuracy is lower, but when the texture index (TIs) is added, it makes up for this deficiency and greatly enhances the prediction accuracy, with an R 2 range of 0.589 to 0.780.This may be because the texture information extracted from the multispectral sensor includes information about the growth and structure of the crop canopy, which can overcome the inherent saturation problem of spectral information, to a certain extent [57].Among the three models in this study, the fusion of vegetation index, texture index, and thermal features is better than any two combinations of information.These results indicate that spectral information, texture features, and thermal features provide special supplementary information that helps to predict crop soil moisture.The results indicate that water-stress conditions in soybean can be effectively monitored through vegetation indices and texture features.Additionally, surface temperature captured by thermal-infrared data serves as an indicator of plant water stress, providing supplementary information for assessing plant water status.
Among the three models selected in this study, the XGBoost-based model has the highest accuracy, indicating that XGBoost has a significant advantage in predicting crop SMC, which is consistent with previous studies [58].XGBoost (Extreme Gradient Boosting) is an efficient gradient-boosting framework that uses multi-threading and distributed computing, can handle large-scale datasets under limited memory conditions, and can remove content that contributes less to the model during the operation, enhancing the interpretability of the model.Although GA-BP shows excellent performance in tasks such as image and speech recognition, it is not efficient and accurate enough in regression and classification tasks.RF, although providing good model robustness and the ability to handle feature interaction, may encounter performance bottlenecks when dealing with large-scale datasets.Based on the input combination of VIs + TIs + TVIs, this study constructed the optimal prediction model for SMC in different soil layers under soybean cultivation, and XGBoost can be considered as the preferred method for soybean SMC monitoring and modeling.In addition, compared with the 40~60 cm soil layer, these models show better predictive performance in the 0~20 cm and 20~40 cm soil layers.This study can provide real-time and efficient technical services for monitoring the surplus and deficit status of crop soil moisture in practical applications.
Building upon the success of our UAV-based multispectral and thermal-infrared remote-sensing research, the future of our work will innovate in collecting a diverse dataset to enhance model adaptability, explore cutting-edge algorithms to refine predictive accuracy, and develop an ensemble of models for robust forecasting.We will implement a real-time monitoring system to facilitate proactive agricultural management, collaborate with a broad spectrum of disciplines to understand soil moisture's impact on crop health, and work towards commercializing our technology to make precision farming more accessible.We will assess the economic and environmental sustainability of our approach, ensuring it is scalable and adaptable to various farming practices.Additionally, we aim to promote sustainable agricultural practices, all while ensuring our research remains at the forefront of innovation in precision agriculture and sustainable development.

Conclusions
This investigation amassed a rich dataset during the flowering stage of soybean cultivation, integrating both multispectral and thermal-infrared remote-sensing measurements along with soil moisture-content (SMC) samples.The dataset was enriched with a spectrum of indicators, including vegetation indices (VIs), texture feature indices (TFs) derived from textural analysis, and thermal-infrared vegetation indices (TVIs).Spectral parameters that demonstrated significant correlations at the stringent p-value threshold of less than 0.01 were identified and utilized as single and fused inputs for model formulation.The construction of predictive models for soybean SMC was accomplished by engaging three sophisticated machine learning methodologies: eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and a Backpropagation Neural Network (BP) optimized by a Genetic Algorithm (GA-BP).These models were adeptly tailored to harness the synergistic potential of UAV-acquired multispectral and thermal-infrared remote-sensing data.The conclusions are as follows: (1) Among the single-input variables, the vegetation indices (VIs) derived from multispectral sensors demonstrated the highest accuracy in monitoring SMC across different soil layers under soybean cultivation.The predictive accuracy was the poorest when using single-texture information alone, but by combining texture-feature values to create new texture indices, the performance of estimating SMC was effectively enhanced.The fusion of vegetation indices (VIs), texture indices (TIs), and thermal-infrared vegetation indices (TVIs) provided a better prediction of soybean SMC.(2) Based on the input combination of VIs, TIs, and TVIs, this study constructed the optimal prediction model for SMC in different soil layers under soybean cultivation, with XGBoost being the preferred method for soybean SMC monitoring and modeling.Moreover, compared to the 40~60 cm soil layer, these models exhibited superior predictive performance in the 0~20 cm and 20~40 cm soil layers.
In summary, the results of this study demonstrate the effectiveness of multi-spectral and thermal-infrared data in monitoring SMC under specific agricultural management practices.This research provides a new perspective and method for monitoring water stress in soybean, which has significant practical application value for agricultural water management.

Plants 2024 , 21 Figure 1 .
Figure 1.Research area overview and aerial photography of some fields in the research area.

Figure 1 .
Figure 1.Research area overview and aerial photography of some fields in the research area.

21 Figure 2 .
Figure 2. Scatter plot of canopy temperature extracted from thermal-infrared images and actual canopy temperature.

Figure 2 .
Figure 2. Scatter plot of canopy temperature extracted from thermal-infrared images and actual canopy temperature.

Figure 3 .
Figure 3. Flowchart of the proposed method.

Figure 3 .
Figure 3. Flowchart of the proposed method.

Figure 4 .
Figure 4. Correlation matrix between soybean SMC at different soil layers under soybean cultivation and randomly combined texture indices.

Figure 4 .
Figure 4. Correlation matrix between soybean SMC at different soil layers under soybean cultivation and randomly combined texture indices.

Figure 5 .
Figure 5. Correlation between thermal-infrared indices and SMC in different soil layers under soybean cultivation.

Figure 6 .
Figure 6.Optimal prediction models for SMC in different soil layers under soybean cultivation based on vegetation indices (VIs) with XGBoost, GA-BP, and RF ((a-i) represents the soil moisture content of 0-20 cm, 20-40 cm, 40-60 cm soil layers under soybean cultivation using XGBoost, GA-BP and RF models based on vegetation index (VIs) input variables).

Figure 6 .
Figure 6.Optimal prediction models for SMC in different soil layers under soybean cultivation based on vegetation indices (VIs) with XGBoost, GA-BP, and RF ((a-i) represents the soil moisture content of 0-20 cm, 20-40 cm, 40-60 cm soil layers under soybean cultivation using XGBoost, GA-BP and RF models based on vegetation index (VIs) input variables).

Figure 7 .
Figure 7. Optimal prediction models for SMC in different soil layers under soybean cultivation based on the combination of vegetation indices (VIs), texture indices (TIs), and thermal-infrared vegetation indices (TVIs) using XGBoost, GA-BP, and RF ((a-i) represents the soil moisture content

Figure 7 .
Figure 7. Optimal prediction models for SMC in different soil layers under soybean cultivation based on the combination of vegetation indices (VIs), texture indices (TIs), and thermal-infrared vegetation indices (TVIs) using XGBoost, GA-BP, and RF ((a-i) represents the soil moisture content of 0-20 cm, 20-40 cm, 40-60 cm soil layers under soybean cultivation using XGBoost, GA-BP and RF models based on the combination of vegetation indices (VIs), texture indices (TIs), and thermal-infrared vegetation indices (TVIs) input variables).

Table 3 .
Correlation coefficient between vegetation index and SMC in different soil layers under soybean cultivation (** significant at p < 0.01).

Table 3 .
Correlation coefficient between vegetation index and SMC in different soil layers under soybean cultivation (** significant at p < 0.01).

Table 4 .
Correlation coefficient between texture features and SMC in different soil layers under soybean cultivation (* significant at p < 0.05, ** significant at p < 0.01).

Table 5 .
The texture index extracted by random combination and the correlation coefficient with soybean SMC (** significant at p < 0.01).

Table 6 .
Validation statistics of soybean SMC models by using single-input variable type.
Figure 5. Correlation between thermal-infrared indices and SMC in different soil layers under soybean cultivation.

Table 6 .
Validation statistics of soybean SMC models by using single-input variable type.

Table 7 .
Validation statistics of soybean SMC models by using multiple-input variable type.