Improving the Performance of Temperature Index Snowmelt Model of SWAT by Using MODIS Land Surface Temperature Data

Simulation results of the widely used temperature index snowmelt model are greatly influenced by input air temperature data. Spatially sparse air temperature data remain the main factor inducing uncertainties and errors in that model, which limits its applications. Thus, to solve this problem, we created new air temperature data using linear regression relationships that can be formulated based on MODIS land surface temperature data. The Soil Water Assessment Tool model, which includes an improved temperature index snowmelt module, was chosen to test the newly created data. By evaluating simulation performance for daily snowmelt in three test basins of the Amur River, performance of the newly created data was assessed. The coefficient of determination (R 2) and Nash-Sutcliffe efficiency (NSE) were used for evaluation. The results indicate that MODIS land surface temperature data can be used as a new source for air temperature data creation. This will improve snow simulation using the temperature index model in an area with sparse air temperature observations.


Introduction
In most of the middle and high latitude regions, snow accumulation and subsequent snowmelt are considered as the most important hydrological processes, because the stream hydrograph is dominated by spring snowmelt [1]. In addition, nutrient transport from land to sea is significantly influenced by spring flood processes [2,3]. Hence, knowledge of the spring snowmelt process is essential not only for hydrological modeling, but also for further study of nutrient dynamics and transport in middle and high latitude regions. Distributed hydrological models have been proven useful and applicable to investigate streamflow and nutrient transport in snowmeltdominated basins [4,5].
Recently, the physically based energy-balance method has been demonstrated to be accurate and powerful for calculating snowmelt processes [6][7][8]. However, the demand for accurate and variable input data and complex parameterization still limits applicability of the method [9,10]. Conversely, the temperature index (henceforth, -) method has been widely used despite its simplicity for the following reasons [11]: (1) wide availability of air temperature data, (2) relative ease of air temperature interpolation and forecasting, and (3) computational simplicity. Thus, hydrological models such as the Soil Water Assessment Tool [12], Hydrological Simulation Program Fortran (HSPF, [13]), MIKE [14], and Snowmelt Runoff Model (SRM, [15]) have adopted themethod to simulate snow accumulation and the snowmelt process. Because -is based on an assumption that the relationship between ablation and air temperature is usually expressed in the form of positive temperature sums [11], the air temperatures are obviously one of the most important variables for this method.
Air temperature ( ) data can be easily obtained in regions where meteorological observation network is dense, but many remote areas have sparse stations. However, with development of earth observations, MODIS remotely sensed land surface temperature (LST) data have proven powerful for creating data. For example, Kloog et al. [16] successfully applied the spatial smoothing method to evaluate daily air 2 The Scientific World Journal temperature data using MODIS LST data in Massachusetts, United States. Zhu et al. [17] also used MODIS LST to evaluate daily and subdaily maximum and minimum air temperature on the northern Tibetan Plateau. Zakšek and Schroedter-Homscheidt [18] reviewed that there are three different methods commonly applied for estimating the based on the LST data: (1) the statistical methods; (2) the temperature-vegetation index methods (TVX); (3) energybalance methods. They reported that the statistical methods generally perform well, within the spatial and time frame they were derived from, but require large amounts of data to train the algorithms [19]. The TVX method is based on the assumption that, for an infinitely thick canopy, the top-of-canopy temperature is the same as within the canopy [17] and uses the Normalized Difference Vegetation Index (NDVI) as a key input variable. However, the assumption of linear and negative slope between LST and NDVI is not always applicable and is influenced by seasonality, ecosystem type, and soil moisture variability [19,20], and the period of created data is limited by the periods of both LST and NDVI data. In addition, Zhu et al. [17] also found that the results of the statistical method are similar to the TVX method. Although the energy-balance methods are physically based, the major disadvantage of this method is the requirement of large amounts of information often not provided by remote sensing [19]. The linear relationship between MODIS LST and data has been demonstrated in different study regions [21][22][23][24]. Thus, the linear regression method is a common choice for data estimation using MODIS LST data. Colombi et al. [25] used the linear regression method and MODIS LST data to generate average daily temperature in Italian alpine areas, and they proved that the result of the linear regression method was superior to that of the spatial interpolation method. Shen and Leptoukh [26] also used the linear regression relationship between air temperature and MODIS LST data to create new daily air temperature data in northern China and central Russia.
One critical disadvantage of using MODIS LST data is that the period of newly created air temperature data is limited by the operational period of the satellite. However, we frequently need historical data, especially for long-term hydrological simulations. Thus, it is necessary to find an easy and effective way to create spatially dense and temporally long-termair temperature data. Motivated by this unsolved problem, we set our research objectives as follows: (1) development of a simple method to create accurate air temperature data over a longer period; (2) hydrological simulation of the snowmelt process using newly created air temperature data for three test basins in the Amur River basin; and (3) evaluation of the validity of the newly created air temperaturedata by analysis of simulated results.

Study Area and Data
2.1. Test Basins. Three basins were selected for model testing, which are located in the upper, middle, and lower stream of the Amur River basin. Basic geographic characteristics of the basins are shown in Figure 1 and Table 1. The Amur River is the tenth longest in the world and is recognized as an important dissolved iron source for the Sea of Okhotsk [27]. There are four distinct phases in the Amur water regime: spring floods, summer low water, summer and autumn floods, and winter low water. The main water source is rainfall, supplying 70-80% of total water, and snowmelt during spring floods adds 10-20% [28]. In the upper stream (basin A, Gari), the annual temperature is −2.4 ∘ C and the annual precipitation is 494 mm, and in the lower stream (basin B, Apkoroshi), the annual temperature is −0.1 ∘ C and the annual precipitation is 641 mm. In the middle stream (basin C, Malinovka), the mean annual temperature is 1.1 ∘ C and the annual precipitation is 593 mm [29].

Data for Hydrological Simulation.
Detailed structure of the snow melt component of SWAT model is explained in the following sections. To generate input data needed for SWAT, a digital elevation model (DEM), soil data, land use and land cover (LULC) data, and weather data are required [30]. In addition, discharge data are required for calibration of hydrological simulations.
A Shuttle Radar Topographic Mission (SRTM 90 m) DEM was used to delineate subbasins of the test basins. We applied the same flow accumulation threshold (200 km 2 ) in watershed delineation of all three test basins. The subbasins were delineated by the ArcSWAT interface. As shown in Figure 1, there are nine subbasins in basin A and 11 in basin B and basin C. The land use/land cover map was constructed by combined use of vegetation maps of China, Mongolia, and Russia and satellite images [31]. Soil data were taken from the Harmonized World Soil Database (HWSD, [32]), obtained from the International Institute for Applied Systems Analysis (IIASA). The spatial resolution of LULC data is as same as DEM data (90 m), and the resolution of soil data is 1 km.
Temporal resolution of all weather data was daily. Asian Precipitation Highly Resolved Observational Data Integration towards Evaluation of Water Resources (APHRODITE, [33]) was used for daily precipitation data. The most important driving data of distributed hydrological models are of accurate precipitation. It has been shown that APHRODITE can give good performance in this study area [34,35]. Maximum air temperature ( ,max ), minimum air temperature ( ,min ), and wind speed data were obtained from the Global Historical Climatic Network-Daily (GHCN-Daily, [36]) of the National Climate Data Center (NCDC). The SWAT model calculates the distance between geometer center of subbasin and the candidate weather station to attach the nearest station for each subbasin as the unique driving station. In our study, the SWAT model selected eight monitoring air temperature stations ( Figure 1 and Table 2) for SWAT model. Relative humidity and solar radiation data were from the NCEP-DOE reanalysis 2 dataset [37] on the website of the NOAA Earth System Research Laboratory. Because we focused on improving snowmelt simulations in the -method, daily monitoring runoff data during March through May were used for the calibration period to obtain optimized parameter values. The years 1983-1987 were selected for basins B and C, and because of lack of the monitoring, 1983, 1984, 1986, and 1988 were used for basin A. Runoff data were provided by the Russian Federal Service for Hydrometeorology and Environmental Monitoring (Roshydromet).  N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N [23] already proved that the AQUA and TERRA have no significant difference in estimation. In addition, Vancutsem et al. [20] also proved that the AQUA data can give reasonable results even if its observation period is shorter than TERRA. MODIS LST data are derived from thermal infrared bands 31 (10.78-11.28 m) and 32 (11.77-12.27 m). Atmospheric effects are corrected by a generalized split-window algorithm [38]. The latest LST data are version 005; these datasets have error less than 1 ∘ C within the range −10 to 50 ∘ C, assuming that surface emissivity is known [19,39]. In addition, ground-based validation has shown that errors were less than 1 ∘ C at homogeneous surfaces such as water, crop, and grassland [39].

Long-Term Air Temperature Data Creation and Validation.
The simplest method to estimate the air temperature is to create a linear regression equation between at points and ( Figure 2) using observed data. In this case, the linear regression equation can be written as follows: (1) Here, 1 and 1 are coefficients of the linear regression equation, and subscripts and indicate the points.
We call this the -method. If monitoring data at both points and are available, we can use the method. However, if we have no data at a point we need to know, the method is not applicable. Here, we use the monitoring stations and their nearest stations ( Table 2) to evaluate the linear relationships between two different places ( Figure 2).
We used three indices: coefficient of determination ( 2 ), mean absolute error (MAE), and root mean square error 4 The Scientific World Journal Station is the WMO code of stations used for validation of new created data and SWAT model; N Station shows the WMO code of the nearest station for each station in the "Station" column; P code is the station pair name, LULC is the land cover type, and Dis is distance between the two stations. (RMSE) for the evaluation. Equations for 2 , MAE, and RMSE are as follows:

Linear regression for LST
Here, is observed on day ,̂is created air temperature on day by each method, is the average value of observed ,̂is the average value of created air temperature, and is the total number of days. The results of linear correlation analysis for are shown in Table 3 and discussed in Section 4.1.
Further, the study of Sun et al. [24] presented a theoretical derivation of linear regression relationships between and LST and proved that the can be mainly explained by the LST in the linear regression equation, and they also showed that the errors of created based on the linear regression method are limited in a reasonable range, in the North China Plain. In addition, Mostovoy et al. [23] also proposed the similar method to estimate at any point from LST at that point. By constructing linear regression equations between and LST at 161 monitoring station points in the state of Mississippi, they also found a common relationship between LST and , irrespective of location. Furthermore, both of these researches indicate that the first order coefficient of linear equation between and LST is equal to 1. Here, we use the monitoring data and LST data to estimate the air temperature between them; we call this the LST-method. In this case, the linear regression equation can be written as We also validate the linear regression relationships between the and LST in all stations (Table 4); the 2 is over 0.95 for both daily maximum and minimum -LST analysis for all station pairs. In addition, the results also showed that the first order coefficient (slope or " ") of linear regression equations (Table 4) is very close to 1. These results also correspond to the theoretical analysis of previous researches in China North Plain [24] and Mississippi of USA [23].
However, the results of -LST indicate that this method extends the errors compared with the -method. Furthermore, a disadvantage of this method is that because LST data are necessary for this method; it can only be applied to periods after MODIS was launched. As already addressed, we frequently need historical air temperature data to execute hydrological models that include snow accumulation and snowmelt processes. Moreover, many watersheds have very sparse observed air temperature data. Because both theand LST-methods are unsuitable for such common cases, we developed a new method as follows.
In the first step, a linear regression equation of LST between two points is created as follows: Here, 2 and 2 are coefficients of the linear regression equation, and subscripts and indicate the points. LST is the result of the predicted LST value of point B from (4).
We conducted the linear regression analysis of LST data in all station pairs. According to the results (Table 5), the same as the previous two methods, it is clear that high linear correlations are obtained in all station pairs for LST data.
In addition, based on the linear analysis results of -LST of station pairs, and using the -LST method, we can estimate , and , at the same time: Based on the results of linear correlation analysis of LST data (Table 5), we neglect the error between the LST and LST and by substituting LST in (6) by LST , , can be expressed by LST as Combining (7) with (5), we get 6 The Scientific World Journal Table 3: Results of daily maximum and minimum linear regression analysis in each station pair.
Daily maximum - Sta Pairs is the station pair name, DataNum is the amount of data used for the regression analysis, " " is the first order coefficient of linear regression equation, and the " " is the interception of linear regression equation.  Sta Pairs is the station pair name, DataNum is the amount of data used for the regression analysis, " " is the first order coefficient of linear regression equation, and the is the interception of linear regression equation.
The Scientific World Journal 9 Because the -LST relationship in the study basins can only be acquired in the point which monitors both LST and data, const cannot be obtained as is not monitored in place B. In this study, we take the const − 2 × const as an entirety. It is clear that if this entirety can be ignored, the equation will be simplified drastically.
Thus, we calculated the const − 2 × const in all station pairs based on the results listed in Tables 4 and 5. The results ( Table 6) indicate that in station pair 4-4 there is a relative large error compared with other station pairs for both daily maximum and minimum analysis. The error is larger in the daily minimum analysis of 2-2 and 6-6 , and it is also larger in the 7-7 for the daily maximum analysis. In addition, the average value of all station pairs is 0.38 ∘ C for daily maximum analysis and 0.45 ∘ C for daily minimum analysis. However, we also recognize that the effects of data on the snowmelt processes are based not only on the "point" or "situ" scale but also on its accurate distribution in the entire basin. Thus, considering relative small average errors (Table 6), we ignore the const − 2 × const of (8) and the new equation is This means that once we acquire coefficients 2 and 2 from linear regression analysis of LST, we can estimate , using a known , . We call this the LST-LST method. We performed a linear regression analysis for creation of based on both daily maximum and minimum LST data. However, according to the limited monitoring period of MODIS LST data, the linear regression equations are formulated only in the entire period. We compared estimated from the LST-LST method with the -method, using the same station pairs listed in Table 2, and also the errors caused from the approximation are discussed, especially the approximations of (8) which may cause more errors for the estimation. The comparison was for both the entire period and spring snowmelt period. We defined the snowmelt period as March through May. The results are presented in Section 4.1.

Air Temperature Calculation in the SWAT Model.
The SWAT model can consider orographic effects on air temperature by dividing the subbasin into multiple elevation bands [40]. This significantly influences snow cover and snowmelt processes. The equation is Here, band is calculated in each elevation band ( ∘ C), is temperature recorded at a monitoring gage ( ∘ C), band is mean elevation of each elevation band (m), gage is elevation of an existing monitoring gage (m), is temperature lapse rate ( ∘ C/km), and 1000 is a unit conversion factor from meters to kilometers. By multiplying band and areal percentage of a band and summing over all elevation bands, we can obtain average of each subbasin. All test basins were divided into ten elevation bands, based on an equal interval of elevation.

Snowmelt Simulation in the SWAT Model.
Based on the basic concept of the -method, the snowmelt module of the SWAT model is as follows (time interval is daily): Here, SNO mlt is the amount of snowmelt (mm H 2 O), mlt is the melt factor (mm H 2 O/day-∘ C), and SNO cov is the fraction of the hydrological response unit (HRU) area covered by snow. The HRU is divided by using land use and soil types in each subbasin. The HRU is the simulated unit in the SWAT model; the simulation of water budget in SWAT is based on the HRU. snow is snow pack temperature ( ∘ C), mx is maximum ( ∘ C), and mlt is the base temperature above which snowmelt is allowed ( ∘ C). mlt allows seasonal variation with maximum and minimum values occurring on the summer and winter solstices. SNO cov allows nonuniform snow cover caused by factors such as shading, land cover, and topography. Consider Here, snow( ) is snow pack temperature on a given day ( ∘ C), snow( −1) is snow pack temperature on the previous day ( ∘ C), sno is snow temperature lag, and av is mean on a given day. In SWAT model, the av is the arithmetic mean value of daily maximum and minimum .

Model Calibration and Evaluation. To evaluate different input
datasets influences on SWAT model snowmelt simulation, we conducted hydrological simulations under two different scenarios: (1) Elev , which used the original data with elevation band method; (2) New , which used the newly created data based on the LST-LST method without elevation bands. For other settings such as watershed delineations, HRU generations, input data (except ), and initial range of calibration parameters, we keep them the same for all different scenarios in each test basin.
Model parameters were calibrated using the Sequential Uncertainty Fitting Version 2 (SUFI-2) method [30]. Table 7 lists the calibration parameters for each scenario. is only included in the Elev scenario. Two different parameter sets were constructed for testing the performance of each scenario. One is the so-called "the least parameters setting" and the other is "complete parameters setting. " For the least parameters setting, to test the performance of two different datasets, we fixed the values of parameters that might significantly influence snowmelt simulations; that is, 50 = 0.5, = 300, = 1 ∘ C, = 0.5 ∘ C,  Table 7 were all applied to evaluate the influence of all factors on snowmelt simulations with the complete parameters setting.
Objective functions used for parameter optimization in runoff simulations were the Nash-Sutcliffe efficiency (NSE)   and coefficient of determination ( 2 ). The sensitivity of parameters was evaluated by the -test of each parameter in the linear multiple regression, which was created using the parameters as arguments and the objective value (NSE) as the dependent variable. The NSE is calculated as follows: Here, is observed runoff on day , is average observed runoff for the entire period, is simulated runoff on day , and is the total number of days.

Comparison between the LST-LST and -Methods.
The 2 of the entire (year) period was over 0.95 for both maximum and minimum estimation of -method (Figures 3, 4, and 5). The results clearly demonstrate that based on the long monitoring period and abundant monitoring data there are high linear correlations in all station pairs. For the LST-LST method, except the station pairs 2-2 and 7-7 , all other station pairs are over 0.90. Although these 2 were less than 0.95 in spring using both methods, they exceeded 0.9 in the -method. Also, the results of the 2 in spring period for LST-LST method showed the same trend as the entire period. For both the entire year and spring, MAE of the LST-LST method was slightly larger than that of the -method ( Figure 6). Overall, MAE performances were around 1.0−2.0 ∘ C in the entire year for daily maximum prediction, but this increasedto 2.0−3.5 ∘ C in spring. For daily minimum prediction, MAE was 1.5−2.5 ∘ C for the entire year and 1.5−4.0 ∘ C in the spring. The results of RMSE (Figure 7) show the same trend as MAE; RSME of the LST−LST method was larger than that of the -method. The larger errors are obtained especially at the station pairs 2-2 , 4-4 , 6-6 , and 7-7 . These results highly agree with the results listed in Section 3.1 and Table 6, in which the potential errors caused by neglecting the item const − 2 × const of (8) are presented. Benali et al. [19] and Vancutsem et al. [20] demonstrated that the land cover may influence the estimation results from statistical method based on MODIS LST data. Here, in our study, the station pairs 2-2 , 4-4 , 6-6 , and 7-7 all contained the land cover type wetland (2 , 4 , 6, and 7 station), and the land cover type of their corresponding stations is forest for station 2, shrub for stations 4 and 6 , and agriculture land for 7 . Further, Westermann et al. [41] presented that MODIS LST data exhibit the high difference between dry area and wet area during snowmelt season in the tundra of Svalbard, Norway. Hachem et al. [42] also concluded that the stagnant surface water which can modify the heat exchanges between the ground surface and the atmosphere leads to the difference between the air temperature and LST in Alaska. Obviously, the water content and vegetation cover are significantly different between the wetland and other land covers. Thus, in our study, the difference of water content in different land cover type is also the probable reason that induces the relative larger error in the station pairs: 2-2 , 4-4 , 6-6 , and 7-7 .
Furthermore, the statistical analysis method always needs abundant training data to generate stable results [18]. The amount of data listed in the Tables 3, 4, and 5 clearly suggested that the short observation period of the MODIS LST data might be an important factor that induces the larger errors in the LST-LST method. The results also indicate that during the spring, ranges of MAE and RMSE from both -and LST-LST methods are increased compared with the entire year. Obviously, comparing with the entire period, the available data for the linear regression analysis is much less, and therefore this is the main reason that the -method and    Elev scenario is elevation bands combined with the original monitoring air temperature data scenario, and New scenario is the newly created air temperature data. 2 is coefficient of determination, and NSE is Nash-Sutcliffe efficiency.
the LST-LST method tend to include more error during the spring.
Although the errors are extended by the LST-LST method, the MAEs and RMSEs obtained herein were still within a reasonable range compared with earlier research in different regions. For example, based on the multistep linear regression method, Colombi et al. [25] predicted in alpine areas of Italy. Their RMSE = 1.89 ∘ C for daily average temperature and maximum and minimum were 2.47 ∘ C and 3.36 ∘ C, respectively, similar to our results. Shen et al. [26] used a single linear regression method to predict daily maximum and minimum based on LST data, with error 2-3 ∘ C. Generally, although the error is enlarged in LST-LST method compared with the -method, the various indexes still show that errors of the LST-LST method were within a reasonable range, and overall performance of the method demonstrated its capability for predicting daily maximum and minimum temperature.

Simulations with the Least Parameters
Setting. Simulation results for bothscenarioswith the least parameters setting are shown in Figures 8, 9, and 10 and Table 8. Basically, the trend of these results in basin A was the same as in basin B.
In basin A, the coefficient of determination of the Elev and New scenarios was 0.0 and 0.15, respectively, and NSE was −0.88 and 0.15. Snowmelt simulation results in both scenarios were overestimated in the early part of each year, while peak flow during snowmelt season was underestimated. In basin B, the coefficient of determination for Elev and New was 0.0 and 0.19. Although NSE values from both scenarios were negative and coefficients of determination were weak, New gave better results than Elev , especially for NSE. Although all evaluation indices indicated poor performance, the results demonstrated that New was superior to Elev . By contrast, for basin C, both scenarios achieved good results. There, the performance of New ( 2 = 0.75, NSE = 0.73) was again better than Elev ( 2 = 0.67, NSE = 0.63).
These results clearly show that performances of New scenarios were always better than Elev in all test basins. However, it was also suggested that the SWAT model cannot attain an acceptable level of simulation without model calibration. Although the test basins were in the same region, differences of snowmelt processes must still be considered. Because there were no additional observed data, parameter calibration is the only way to improve simulation performance to an acceptable level. Thus, simulation and parameter calibration with the complete parameters setting are discussed in the following section.    Elev scenario is elevation bands combined with the original monitoring air temperature data scenario, and New scenario is the newly created air temperature data.

Simulations with Complete Parameters Setting
(1) Parameter Sensitivity. Parameter sensitivity and optimal calibration values of the complete parameters setting are shown in Table 9. Obviously, parameters SNOCOVMX and SNO50COV that define the snow water content of the entire basin are in the top ranks for all test basins. SMFMX that affects daily snowmelt amount and TIMP, which strongly impacts the snow pack temperature, is also significant. Parameter CN2, which is generally recognized as the most important parameter for SWAT [43], was only significant in basin A and showed less sensitivity than SNOCOVMX or SNO50COV. Because our simulations only focused on the snowmelt period, it is reasonable that CN2, whose main function is to split net precipitation into surface flow and infiltration part, is not significant. One of the most influential parameters in the Elev scenario is TLAPS, which is related to elevation change. The sensitivity of TLAPS was significant in basin B but insignificant in basin A and C. Considering the poor results of the least parameters setting, sensitivity results of the complete parameters setting strongly suggest that the parameters that influenced snowmelt varied between the test basins and that calibration is essential for the model to achieve better results.
(2) Simulation Results. Simulation results for the complete parameters setting are shown in Figures 8, 9, and 10 and  Figure 8) and basin B (1983-1987, Figure 9) in particular, simulation results for the peak flow and early melt hydrograph can match monitoring data well, relative to the least parameters setting. Aside from meteorological influences, land cover is another influence on snowmelt. Wetlands are believed to be a major factor in snowmelt simulation, because of its capacity for water storage [44]. Hydrological properties of these wetlands are very sensitive to variations of , seasonal precipitation, and other climatic factors [45]. To simulate snowmelt processes in wetlands, Fang et al. [46] applied a physically based approach to a wetland-dominated prairie basin in Canada. They found that the ability of wetlands to trap blowing snow in winter and store runoff water is a crucial feature of the hydrology, and this poses a substantial challenge to hydrological modeling. Wang et al. [47] and Yang et al. [48] demonstrated that the SWAT model's hydrological simulation component should be improved for wetland-dominated areas based on detailed wetland measurement data, such as annual water table depth and normal and maximum water storage capacities. Wetland covered over 25% of the entire basin A. However, in a data-sparse area, it is difficult to acquire enough data to conduct physically based snowmelt simulation of wetlands, and this is the likely reason that the NSE and 2 were weak in that basin.
Furthermore, as shown in Table 1, the average slope of basin A is only 2.8 degrees. Obviously, snowmelt simulation in such a flat area is greatly influenced by input meteorological data, more than by topographic factors. The parameter sensitivity analysis results also demonstrate that elevation has less influence on snowmelt in this basin, based on the sensitivity rank of the TLAPS parameter (14th among the 16 parameters). However, the simulations indicate that the results are weak in both the least and calibrated Elev scenarios and are greatly improved in the calibrated New scenario.
SWAT model performance was more reasonable in basins B and C. A likely reason is that their elevation differences are significant; average slope of basin B is 18.2 degrees and that of basin C is 12.1 degrees (Table 1). Earlier studies proved that snowmelt performance of SWAT is reasonable in mountainous areas [40,49,50]. Furthermore, dominant land use types in the two basins are forest and pasture; some studies have demonstrated that SWAT model performance is stable in such basins. For example, Zhang et al. [43] used SWAT for the headwaters of the Yellow River, where land cover is dominated by pasture land. They found that SWAT could simulate snowmelt well, based on comparing the temperature index model with two other physical processbased models. Further, Debele et al. [51] demonstrated that SWAT performance was more stable than a physically based snowmelt model in a forest-dominated, watershedscale basin. Although performance of the Elev scenario was superior in the two basins relative to basin A, the New data achieves better results.
In addition, based on the land cover information listed in the Tables 1 and 2, it is clear that the land cover type of the basin A is same as the monitoring stations which are used for the LST-LST method. In the basin B, the shrub, forest, and wetland are land cover types of monitoring stations. Although the land cover of the basin B (Table 1) showed that the pasture is also one of main land cover types the monitoring station is consistent with the other land cover types (forest, shrub, and wetland; 79%). It is likely that the increment of NSE in the basin A and basin B can be explained by the homogeneity of land cover between monitoring stations and test basins. Contrarily, wetland is the only land cover type of stations and forest is the dominated land cover in the basin C. Consequently, the increment of NSE in this test basin is insignificant (Table 8). Hence, the results of our study indicate that the consistency between the land cover types of test basins and the stations used for creation of new can influence the increment of snowmelt simulation results. However, we should also recognize that the temperature is not the only factor that influences the snowmelt processes; the precipitation, topography, and wind speed also have great influences on the snowmelt. As previously mentioned (Section 3.4), we fixed other factors to make the the only influence factor for the snowmelt simulation in each parameter settings; thus, the simulation results of New scenarios can still prove the advantages of new created data for improving snow simulation in this study area. In addition, these results also suggest that the errors caused from the assumptions and approximations in the Section 3.1 have less effect on the final simulation results. The new created data performed well compared with the original data.
Furthermore, although satisfactory results of runoff simulations are obtained with the Elev scenario in basins B and C, calibrated TLAPS values indicated probable errors for the 18 The Scientific World Journal original data and method. Even though actual TLAPS values cannot be evaluated because of sparse monitoring of data in our study area, its calibrated values (−8.8 ∘ C/km for basin A, −8.5 ∘ C/km for basin B, and −9.5 ∘ C/km for basin C) deviated strongly from the typical temperature lapse rate measured around the world (−6 ∘ C/km or −6.5 ∘ C/km). Our calibration results also clearly show that the lapse rate varied between test basins. This also applies for the basins (B and C) with large sensitivity differences, and the calibration values of the TLAPS also varied sharply.
These results strongly demonstrate that the elevation band method, which depended only on a fixed temperature lapse rate and the original sparse monitoring dataset, had a less effect on daily snowmelt simulation in the study area. The results also clearly prove that the new high spatial density data with no additional parameter or elevation modification can achieve superior results for both steep and flat basins.

Conclusions
The main objective of this study was to evaluate a new data creation method to generate data with high spatial density and accuracy, for improving the performance of snowmelt modeling using the -method. The research used a simple linear regression equation between two locations. The method was tested at eight pairs of air monitoring stations and compared with the regression method based on the monitoring data. Although the approximation of linear regression method might extend errors for estimation, the results still demonstrate that this simple linear regression approach can create data with limited errors range over long periods, and spatial density of the created data is very high. Snowmelt simulations with the newly created data and original data were compared in three test basins with varying slope and land cover types. Both the least parameters and complete parameters settings were tested in all basins. The calibration results evaluated using different indices indicate that the newly created data can obtain better simulation results than the original data and method, in all test basins. The simple linear regression using MODIS LST was generally successful and applicable in our study area. The research showed that using the newly created data to improve the temperature index-based hydrological model (SWAT) is feasible, and the results of this new approach suggest that it could be a powerful means for extending the applicability of the -method to areas with sparse data.