A near-global, high resolution land surface parameter dataset for the variable infiltration capacity model

Schaperow, Jacob R.; Li, Dongyue; Margulis, Steven A.; Lettenmaier, Dennis P.

doi:10.1038/s41597-021-00999-4

Download PDF

Data Descriptor
Open access
Published: 11 August 2021

A near-global, high resolution land surface parameter dataset for the variable infiltration capacity model

Scientific Data volume 8, Article number: 216 (2021) Cite this article

5942 Accesses
4 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Hydrologic models predict the spatial and temporal distribution of water and energy at the land surface. Currently, parameter availability limits global-scale hydrologic modelling to very coarse resolution, hindering researchers from resolving fine-scale variability. With the aim of addressing this problem, we present a set of globally consistent soil and vegetation parameters for the Variable Infiltration Capacity (VIC) model at 1/16° resolution (approximately 6 km at the equator), with spatial coverage from 60°S to 85°N. Soil parameters derived from interpolated soil profiles and vegetation parameters estimated from space-based MODIS measurements have been compiled into input files for both the Classic and Image drivers of the VIC model, version 5. Geographical subsetting codes are provided, as well. Our dataset provides all necessary land surface parameters to run the VIC model at regional to global scale. We evaluate VICGlobal’s ability to simulate the water balance in the Upper Colorado River basin and 12 smaller basins in the CONUS, and their ability to simulate the radiation budget at six SURFRAD stations in the CONUS.

Measurement(s)	vegetation characteristics • soil characteristics • elevation
Technology Type(s)	satellite imaging • soil sampling • digital curation
Factor Type(s)	geographic location
Sample Characteristic - Environment	vegetation layer • soil
Sample Characteristic - Location	global

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.14869500

MOD-LSP, MODIS-based parameters for hydrologic modeling of North American land cover change

Article Open access 09 August 2019

Global soil, landuse, evapotranspiration, historical and future weather databases for SWAT Applications

Article Open access 06 November 2019

SOIL-WATERGRIDS, mapping dynamic changes in soil moisture and depth of water table from 1970 to 2014

Article Open access 06 October 2021

Background & Summary

The Variable Infiltration Capacity (VIC, https://github.com/UW-Hydro/VIC) model is a macroscale, semi-distributed hydrologic model^1,2,3 that calculates land surface states and fluxes by solving the surface water and energy balances. The model has a wide user base — the citation index Web of Science shows the original VIC paper³ has been cited nearly 2000 times, with contributing authors from at least 56 countries. Despite the model′s popularity, there are only a few ready-made soil and vegetation parameter datasets that modelers can use to run VIC outside the continental United States. Previous global input datasets^4,5,6,7,8 have been compiled for VIC at resolutions ranging from 2° to 1/4°. Many studies, including Su et al.⁵, Zhou et al.⁷, and Adam et al.⁸ use parameters based on the 2° soil and vegetation parameters developed by Nijssen et al.⁴ (henceforth N2001). As useful as the N2001 dataset has been over the years, the VIC-modeling community would be well-served by a higher-resolution update. The N2001 dataset and its derivatives are limited by the dataset’s coarse resolution, geographically sparse subset of leaf-area index observations, and assumptions of temporally-invariant albedo and 100 percent canopy coverage for all land cover classes, as noted by Bohn and Vivoni⁹, who developed a new VIC parameter dataset for North America that addresses these issues. Our dataset, VICGlobal, emulates their approach at a global-scale.

VICGlobal’s predecessor, the N2001 soil and vegetation parameters, may be appropriate for continental-scale modelling, but its coarse resolution makes it less useful for parameterizing VIC at smaller scales. Coarse resolution land surface models miss topographic variability, distort river networks, and prevent proper representation of land-atmosphere interactions in coupled land-atmosphere models. At coarse resolution, topographic characteristics such as elevation and vegetation cover are averaged over a large grid cell, so the model will miss key details such as the effect of terrain on the radiation balance and the effect of vegetation on ET-soil moisture partitioning. This is particularly important in mountainous regions, where there are large changes in topography across relatively small areas. While VIC does not represent fluxes from one grid cell to another, it is frequently coupled to a routing model to simulate how runoff flows between grid cells. At very coarse resolutions, the modelled river network loses its resemblance to the true river network, necessitating upscaling algorithms to obtain usable coarse-resolution river networks (e.g. Wu et al.¹⁰). Finally, high resolution land surface modelling could improve our ability to simulate land-atmosphere interactions that occur over relatively small spatial scales¹¹. With 1/16° grid cells and representation of up to 17 land cover classes within each grid cell, VICGlobal is a step toward addressing each of these resolution-related challenges.

Regional-scale VIC inputs at 1/16° resolution already exist but have limited coverage outside North America. Livneh et al.¹² (henceforth L2013) set up the VIC model over the conterminous United States (CONUS) using soil and vegetation data compiled from sources including the Food and Agriculture Organization (FAO)/UNESCO Soil Map of the World and the Advanced Very High Resolution Radiometer (AVHRR). The L2013 VIC parameterization is based on that of Maurer et al.¹³, with calibration for a better match with streamflow data. Bohn and Vivoni⁹ (henceforth BV2019) released an updated 1/16° vegetation parameter dataset for the CONUS, Mexico, and part of Canada, improving on of the limitations of the L2013 dataset, such as its assumption of temporally-invariant albedo. They estimated time-varying albedo, leaf-area index (LAI), and fractional canopy cover using observations from the Moderate Resolution Imaging Spectroradiometer (MODIS).

Drawing on the L2013 and BV2019 VIC parameterizations, we developed VICGlobal, a near-global dataset of soil and vegetation parameters for the VIC model at 1/16° resolution, which VIC users can download and subset to their region of study. We estimated soil parameters based on the 30 arc-second FAO Harmonized World Soil Database¹⁴ (HWSD). Vegetation parameters are based on 500 m resolution MODIS observations. VICGlobal includes all the necessary parameters to run regional- to global-scale VIC simulations. We provide MATLAB® codes to subset the VICGlobal parameters to a particular domain. In addition to parameters, meteorological forcing data are required to run VIC. We do not include meteorological forcing data as part of VICGlobal. Instead, we direct readers to existing forcing datasets with near-global coverage, such as the reanalysis datasets MERRA-2¹⁵ and GLDAS¹⁶, or real-time measurement-based datasets — see e.g. Xiao et al.¹⁷, Livneh et al.^12,18, Bohn et al.¹⁹

Finally, a note on the file format: The upgrade from VIC version 4 (VIC-4) to VIC version 5 (VIC-5) introduced two “drivers” for running the model. The Image driver takes NetCDF files as inputs, while the Classic driver takes ASCII text files. The VICGlobal parameter files are available in two formats: one for VIC-5 Classic, and one for VIC-5 Image.

Methods

This section describes how we used freely-available data to compile Classic driver input files for the VIC model. First, we created parameter files for VIC-5 Classic, then we converted them to NetCDF format for VIC-5 Image. VIC-5 Classic requires three parameter files: a soil parameter file, a vegetation parameter file, and a vegetation library file. An optional elevation band file can be provided to resolve sub-grid variability in elevation, which is important in regions with complex topography. The parameters are arranged as a relational database: each grid cell has a unique identifier, called a grid cell number, in the soil parameter file, that VIC uses to find the corresponding rows of data in the vegetation parameter and elevation band files. The Image driver uses a different setup, with all parameters stored in a single NetCDF file.

Soil parameters

The soil parameter file for VIC-5 Classic is an ASCII text file that includes soil parameters such as hydraulic conductivity and porosity, but also other kinds of static parameters, such as average precipitation and time zone offset from GMT. Each row of the soil parameter file represents one grid cell, and each column represents a different variable. We compiled the soil parameter file using MERIT²⁰ elevation data, soil texture data from the FAO HWSD, pedotransfer tables relating soil texture to other soil properties, and interpolated weather station data (WorldClim²¹). Any remaining parameters were set to suggested values from the VIC model’s documentation². The following sections describe the estimation of each variable in the soil parameter file, summarized in Table 1.

Table 1 VIC model parameters for the soil parameter file.

Full size table

Elevation and land mask

The VICGlobal soil parameter file uses the Multi-Error-Removed Improved-Terrain (MERIT²⁰) digital elevation model (DEM) to define the elevations, latitudes, and longitudes of each land grid cell. The MERIT DEM is an error-corrected and extended version of the SRTM DEM, with 3 arc-second resolution and coverage from 60°S to 85°N and 180°W to 180°E. Specifically, MERIT is a combination of the SRTM, AW3D, and Viewfinder Panoramas’ DEMs, corrected for striping, speckle, absolute bias, and tree height bias. We used bilinear interpolation to aggregate MERIT to 1/16° resolution and derive a 1/16° MERIT-based land mask and DEM (Figure S1).

Soil texture data

Soil texture (percent sand, silt, and clay) and bulk density were obtained from the FAO HWSD, a gridded soil parameter dataset derived from in-situ measurements of the soil column. We used a 0.05° resolution NetCDF dataset converted from the original HWSD Microsoft Access database by Wieder et al.²². We resampled the HWSD soil data from 0.05° to 1/16° resolution using bilinear interpolation with the MATLAB® function griddedInterpolant. While HWSD has near global coverage, there are missing data in some places around the world, notably Greenland and northern Africa. We filled in these missing data using inpainting, a gap-filling method from the field of image processing. We used the MATLAB® function inpaintnans²³, which uses a partial differential equation method to fill in missing data, to fill gaps in the HWSD data over the MERIT land mask. Figure 1 shows the HWSD bulk density data before and after inpainting.

The HWSD data are divided into “topsoil” and “subsoil” parameters. The first 30 cm of the soil column are considered topsoil and the lower 70 cm subsoil. VIC is typically run with three soil layers, so we created a three-layer soil parameter file by breaking up the 30 cm HWSD topsoil layer into two soil layers: one of 10 cm and one of 20 cm, so the final soil parameter file has three layers, with thicknesses of 10 cm, 20 cm and 70 cm, from top to bottom of the soil column. Ten centimeters has been a common choice for the uppermost layer soil depth in VIC modeling applications since its use by Liang et al.²⁴. Soil layer depths are typically used as calibration parameters. VICGlobal values should be considered a starting estimate.

Calculating soil parameter values based on soil textures

Pedotransfer functions (e.g. Cosby et al.²⁵) relate readily available soil properties, such as soil texture, to less easily-observed properties, such as hydraulic conductivity. After resampling the HWSD data from 1/4° to 1/16° resolution, we estimated soil parameters by classifying each grid cell’s USDA soil texture class and assigning physical soil properties based on a lookup table included with the VIC documentation^2,26. The lookup table (Table 2) relates the 12 USDA soil texture classes to bulk density, field capacity, wilting point, porosity, saturated hydraulic conductivity, and slope of the soil water retention curve in Campbell’s equation. We classified soil textures using the USDA soil texture triangle, as implemented by the MATLAB® function soil_classification²⁷. Figure 2 shows the derived USDA soil texture map. We used these along with the lookup table to estimate saturated hydraulic conductivity (K_sat), the exponent in Campbell’s equation for hydraulic conductivity (expt), fractional soil moisture at the critical point (wcr_fract), where the critical point is about 70% of field capacity, fractional soil moisture at the wilting point (wpwp_fract), quartz content, and porosity for each soil layer. The lookup table²⁶ did not include quartz content, so we supplemented it with the soil texture-quartz content lookup table from Peters-Lidard et al.²⁸.

Table 2 USDA soil texture class lookup table.

Full size table

We set the variable infiltration capacity parameter ${b}_{infilt}=0.2$, the maximum baseflow fraction threshold ${d}_{s}=0.001$, and maximum soil moisture threshold ${w}_{s}=0.9$, their suggested values from the VIC documentation. These parameters, along with maximum baseflow velocity (dsmax) and soil depth, are typically calibrated. We set the baseflow curve exponent c = 2, the soil thermal damping depth dp = 4 m, soil density = 2685 kg/m³, surface roughness = 0.001 m, and snow roughness = 0.0005 m, also based on guidance from the VIC documentation. The soil moisture diffusion parameter phi_s is not used in the current version of VIC, so we set it to the no-data value (−999). The final few soil parameters — dsmax, initial soil moisture (initm), and bubbling pressure (bubble)— were calculated using the following equations, based on guidance from the VIC documentation.

$$dsmax=slope\ast {\bar{K}}_{sat}$$

(1)

$$initm=wc{r}_{fract}\ast porosity\ast {t}_{l}$$

(2)

$$bubble=0.32\ast expt+4.3$$

(3)

Equation (1) estimates dsmax for each grid cell as the product of soil-column average K_sat and land surface slope, which was calculated from the elevation data using the MATLAB® function gradientm²⁹g. Equation (2), where t_l is the thickness of soil layer l, assumes that initial soil moisture is equal to the fractional soil moisture content at the critical point. Equation (3) calculates bubbling pressure as a function of expt, based on linear regression of bubbling pressure vs. expt³⁰. Figures S2–S9 in the Supplementary Information show maps of each soil parameter. We assumed residual soil moisture, the amount of soil moisture that cannot be removed from the soil by drainage or evapotranspiration, was zero.

Elevation bands

VIC uses an elevation band file (also called a snow band file) to account for subgrid heterogeneity in grid cell elevations. The assumption of uniform elevation over an entire grid cell can lead to modeling errors in mountainous regions, where higher topography is associated with cooler temperatures and higher precipitation rates. The elevation band file accounts for subgrid variability in topography by dividing each grid cell into a number of elevation bands, each of which is simulated separately. VIC adjusts temperature, pressure, and precipitation depending on the elevation in each band. We prepared an elevation band file with five elevation bands by comparing the 1/16° DEM used for the soil parameter file with a 30 arc-second DEM. Both DEMs were derived by aggregating MERIT data. For simplicity, we assumed precipitation was evenly distributed among elevation bands within a grid cell. The elevation band file is provided with the caveat that using elevation bands requires more computing power; users may wish to turn elevation bands on or off (via the VIC global parameter file) depending on their needs.

Vegetation parameters

VIC-5 Classic uses a vegetation parameter file to define the fractional cover of different vegetation types within each grid cell and some of their physical properties. Other vegetation parameters are stored in the “vegetation library” file. (VIC-5 Image simply stores all parameters in a single “parameter” file.) The VIC-5 Classic vegetation parameter file consists of information about fractional cover of each land cover type in each grid cell, and their corresponding root zone depths and root fractions within each root zone. The vegetation parameter file can optionally include time-varying LAI, fractional canopy cover, and albedo data, but it is simpler to specify these in the vegetation library (at the cost of not representing some spatial heterogeneity).

We used MODIS land cover data from the 0.05° MODIS MCD12C1 Collection 6 data product³¹ to assign fractional land cover values to each grid cell by calculating the average land cover for MCD12C1 observations over the 2017 calendar year. We chose 2017 because it was the most recent year with data in all the MODIS-based datasets used for this study, and there is very low interannual variability of land cover³² in MCD12C1 Collection 6. Figure 3 shows majority land cover types from the 2017 MCD12C1 observations.

Like all global land cover data products, MCD12C1 makes classification errors. Sulla-Menashe et al.³² reported 67% overall IGBP classification accuracy for 2001 land cover. Classification errors are more common in the “mixed” land covers, such as cropland/natural vegetation mosaic, shrublands, grasslands, and savannas. Fortunately for our purposes, the vegetation parameters for commonly-confused land covers tend to be fairly similar themselves, which reduces the impact of misclassification on land surface modelling results. For example, the LAI of open shrubland is not too different from the LAI of closed shrubland.

We calculated root fraction as a function of land cover class following the method of Zeng³³, who defined the following formula (Eq. 4) for use in parameterizing land surface models:

$$Y=1-\frac{1}{2}\left({e}^{-ad}+{e}^{-bd}\right)$$

(4)

where Y = cumulative root fraction, d = depth, and a and b are empirical parameters defined by Zeng³³ for each International Geosphere–Biosphere Programme (IGBP) land cover type, based on a rooting depth database compiled from more than 200 field surveys. We used this formula with depths of 0.1 m, 0.7 m, and dr, corresponding to three root zones. The value of dr, the maximum rooting depth for each IGBP land cover type, was taken from Zeng³³. This method assumes that the depth and distribution of roots depends only on the land cover type; we assume that land cover type is the primary control on root characteristics. Table 3 shows root fractions and root zone depths for each IGBP land cover type.

Table 3 Root zone depths (m) and fraction of roots in each zone for IGBP land cover classes.

Full size table

Like previous large-scale VIC vegetation cover datasets, our vegetation parameter file neglects land cover change over time. However, it does have a few other advantages over past vegetation parameter datasets. The land cover classification used in the N2001 and L2013 VIC parameter sets is referred to as “UMD-NLDAS” because it is a modified version of the AVHRR-based University of Maryland (UMD) land cover product³⁴. The UMD-NLDAS classification was modified for the North American Land Data Assimilation project (NLDAS³⁵) to exclude open water, urban, and snow and ice land cover classes (see BV2019). VICGlobal uses 17 IGBP land cover classes, including urban, barren, perennial snow and ice, and inland water bodies, permitting better description of land cover variability than the 11 UMD-NLDAS classification.

Vegetation library file

The vegetation library maps each land cover type to a set of vegetation parameters (Table 4). We adapted the LDAS vegetation library³⁶ for use with the 17 IGBP land cover classes, taking monthly average LAI, fractional canopy cover (fcanopy), and albedo values obtained from recent MODIS data products. We set architectural resistance (r₀) and minimum stomatal resistance (r_min) to values from literature (described below). The rest of the parameters, which are described in the N2001 paper, were left to their original LDAS vegetation library values. This section describes how we estimated LAI, fcanopy, albedo, r₀, and r_min, and how we transferred the remaining parameters from the 11 UMD-NLDAS land cover classes to the 17 IGBP land cover classes.

Table 4 VIC model parameters for the vegetation library file.

Full size table

We used MODIS observations from the year 2017 to calculate monthly average LAI, fcanopy, and albedo for each IGBP land cover type. We calculated LAI and albedo from the MODIS-based Global LAnd Surface Satellite dataset (GLASS^37,38,39) and fcanopy from NDVI observations (MCD13C1⁴⁰) The expression used for fcanopy follows BV2019:

$$fcanopy={\left(\frac{NDVI-NDV{I}_{min}}{NDV{I}_{max}-NDV{I}_{min}}\right)}^{2}$$

(5)

where NDVI_min and NDVI_max are the minimum and maximum values of NDVI observed for that month. Monthly LAI, fcanopy, and albedo values were calculated by averaging over all grid cells of the same land cover type, counting only cells that were at least 90% homogenous, to avoid noise from grid cells with multiple land covers. Excepting perennial snow and ice land cover, the vegetation parameters in the VIC vegetation library should describe snow-free vegetation. Therefore, before calculating LAI, fcanopy, and albedo for each land cover class, we used fractional snow cover data from MOD10CM⁴¹, a global 0.05 degree monthly snow cover dataset, to exclude grid cells with more than 90% snow cover. Additionally, we set albedo to 0.05 for open water, and we set LAI and fcanopy to 0 for open water and perennial snow and ice.

The resistances r_min and r₀ play a role in determining how much plant transpiration occurs. Higher resistance means less transpiration. Stomatal resistance is resistance to the release of water through the plant stomata, and architectural resistance is the aerodynamic resistance between the leaves and the canopy top⁴². Two sets of resistance parameters have been used in past large-scale VIC implementations. N2001 ran VIC over the entire globe using r_min values adapted from Dorman and Sellers’ global database of r_min values⁴³ computed using the Simple Biosphere Model⁴⁴ (SiB). The Nijssen et al.⁴⁵ r₀ values were taken from Ducoudre et al.’s SECHIBA land surface parameterization⁴². The other set of r_min and r₀ parameters are those used in the LDAS vegetation library and in studies such as Livneh et al.¹². This set of r_min values comes from Mao et al.⁴⁶ and Mao and Cherkauer⁴⁷. We used the r_min values from SiB⁴⁴ and the r₀ values from SECHIBA⁴² for VICGlobal as they appeared to be the better documented values.

For the other parameters in the vegetation library file (displacement height, roughness length, etc.), we assigned values using the existing LDAS vegetation library. Since there are 17 IGBP land cover classes, and only 11 UMD-NLDAS land cover classes in the LDAS vegetation library, we re-assigned some IGBP land cover classes to take the parameters of UMD-NLDAS land cover classes. We remapped barren land, permanent wetlands, snow and ice, urban land, and water bodies to take the parameters of “grasslands” from the LDAS vegetation parameter file. While the characteristics of the barren, snow and ice, urban, and water land cover types clearly differ from those of grasslands, their low LAI and fcanopy values, corresponding to sparse vegetation, essentially “turns off” the other vegetation parameters in the VIC model, as pointed out by BV2019. The other remappings were more straightforward. Croplands and croplands/natural vegetation mosaics inherited values from “croplands,” savannas became “wooded grasslands,” and woody savannas became “woodlands.” We were thus able to assign vegetation parameter values to the each of the 17 IGBP land cover classes.

To calculate global average time series of seasonally-varying vegetation parameters would be of limited interest as the seasonal cycle would average out across the equator. Therefore, we calculated average monthly fcanopy, LAI, and albedo for each vegetation type in each hemisphere, and we developed two separate vegetation library files: one for the northern hemisphere and one for the southern hemisphere. Maps of January and July LAI, fcanopy, and albedo are shown in Fig. 4. For illustrative purposes, the parameter values in this figure have been averaged over the 17 IGBP land cover classes using area-based weighting. Figures S14–S19 show maps of the remaining vegetation parameters. Figures S20–S22 show the cycle of LAI, fractional canopy cover, and albedo for each vegetation type, averaged separately over each hemisphere.

Data Records

Soil and vegetation parameters for the VIC model are available for download at Zenodo⁴⁸ in NetCDF format for version 5 of the VIC model. The files are stored as zip archives. parameters_classic.zip contains ASCII text files with soil parameters, vegetation parameters, elevation bands, and two vegetation library files — one of the northern hemisphere and one for the southern hemisphere — for VIC-5 Classic. parameters_global.zip contains a NetCDF “parameter” file with all the soil and vegetation parameters described above and a NetCDF “domain” file describing the VICGlobal domain (all land mass between 60°S and 85°N) for VIC-5 Image. MATLAB® codes for subsetting either set of parameters from the entire VICGlobal extent to a subregion of interest. Additionally, parameter and domain files pre-subsetted to North America, South America, Africa, Eurasia, and Oceania are available for download.

Technical Validation

Streamflow and snow-water equivalent in the Upper Colorado Basin

Having created input files for the VIC model, we tested the parameters in a large, well-studied river basin. We used the VICGlobal parameters to run VIC in water balance mode over the Upper Colorado River Basin (UCRB), a 293,600 km² basin in the western United States. We ran VIC once using the VICGlobal parameters and once using the L2013 parameters. Both simulations used the meteorological forcing data from L2013, at a six-hourly timestep, in water balance mode, for the 6-year period from Oct. 1, 2005 to Sept. 30, 2011.

We compared estimated streamflow from the VICGlobal and L2013 simulations with naturalized streamflow estimates from the U.S. Bureau of Reclamation⁴⁹ (USBR) at Lees Ferry, Arizona (Fig. 5). Naturalized flow is measured streamflow adjusted for the effects of reservoir storage and management and consumptive uses such as irrigation. We compared our VIC model outputs with naturalized streamflow because our VIC implementation does not simulate consumptive water use or reservoir storage. Due to differences in soil and vegetation parameters between the two sets of input files, there are notable differences in the hydrographs from each simulation. Relative to the L2013 results, the uncalibrated VICGlobal peak flows’ timing is too early and their magnitude is too high. This is expected given that the L2013 parameters have been calibrated to get a good match to gauge data.

To understand the cause of this mismatch, we examine seven commonly calibrated soil parameters, which are difficult or impossible to estimate from measurements: ds, ws, dsmax, b_infilt, and the thicknesses of each soil layer (t₁, t₂, t₃). Taking a closer look at these soil parameters in the UCRB (Fig. 6), we see that the infiltration capacity parameter b_infilt is considerably higher in the VICGlobal parameters than it is in the L2013 parameters, which would tend to cause higher runoff rates. ds is lower for VICGlobal than for L2013, so nonlinear baseflow occurs at a lower fraction of dsmax, tending to make baseflow peaks occur earlier. dsmax is considerably higher for VICGlobal than for L2013 in much of the UCRB, so the maximum baseflow rate is higher for VICGlobal. Finally, the thicker soil layers in L2013 mean that more water can infiltrate into the soil before baseflow occurs. Table 5 describes each of the seven calibration parameters and their influence on VIC model outputs. VIC users seeking more guidance on calibrating soil parameters should consult the VIC model documentation and relevant literature^50,51,52. We calibrated the VICGlobal parameters b_infilt, dsmax, and t₃, the same parameters calibrated by L2013, to get a good match between predicted and observed (USBR naturalized) streamflow. However, decreasing b_infilt on its own was not enough to reduce the high runoff estimates produced by the VICGlobal parameters (VIC’s sensitivity to b_infilt depends on the water-holding capacity of the upper two soil layers). By introducing the thickness of the second soil layer t₂ as a fourth calibration parameter, we were able to reduce runoff and increase transpiration. The final set of calibrated parameters was ${b}_{infilt}=0.038$, dsmax = 0.60 mm⁄day, t₂ = 1.5m, and t₃ = 1.6m.

Table 5 Commonly calibrated soil parameters in the VIC model and their effects on model outputs.

Full size table

For this analysis, we used manual calibration because the model run time made automated methods, which require hundreds to thousands of model runs, impractical. We used a custom MATLAB® application — a graphical user interface for running the VIC model, tuning its parameters, and displaying its outputs — to assist with manual calibration. We assumed the calibrated parameters were uniform over the basin. In addition to calibration by trial and error, VICGlobal users may also wish to explore automated calibration methods such as the Shuffled Complex Evolutionary algorithm⁵³ (SCE-UA) or Dynamically-Dimensioned Search⁵⁴ (DDS) when practical.

The calibrated VICGlobal simulation outperformed the L2013 simulation, with a Kling-Gupta efficiency⁵⁵ (KGE) of 0.24, compared to −0.26 for L2013 and −1.7 for the uncalibrated VICGlobal simulation. We also compared simulated snow-water equivalent (SWE) between the VICGlobal and L2013 simulations. Spatial patterns of simulated SWE were consistent between the two simulations (Fig. 7a–c), as were patterns of snow accumulation and melt (Fig. 7d). VICGlobal SWE was 3 mm higher than L2013 SWE, on average. Snow sublimation, including canopy sublimation, was higher for L2013 than for VICGlobal, which helps explain the slight overestimate of SWE by VICGlobal relative to L2013. Overall, VICGlobal is able reproduce the timing, magnitude, and spatial pattern of L2013-simulated SWE in the UCRB, with no need for parameter calibration.

Water balance in 12 unregulated CONUS basins

Beyond the Upper Colorado Basin, we evaluated the VICGlobal parameters’ potential for modelling the water and energy balance in 12 basins, ranging from 1500–25000 km², chosen for good spatial coverage of the CONUS. Modelled discharge was compared with monthly observations at USGS reference stream gages⁵⁶ at each basin outlet; we used the DDS method to calibrate b_infilt, dsmax, t₂, and t₃. (These basins are small enough for automatic calibration to be practical.) The calibration was performed with L2013 meteorological forcing data, for calendar year 1993, with the VIC model run from 1990 – 1992 as spin-up. After 500 model evaluations (50 for the Clearwater River), the average discharge calibration KGE was 0.47, with a maximum of 0.83 for the Clearwater River in Idaho and a minimum of 0.01 for the White River in Arkansas. Using the calibrated parameters, we performed a validation run from 1994–2011. Table 6 shows goodness of fit between modelled and measured discharge for the 18-year validation run. Figure 8 shows discharge plots for each basin over the validation period. Good matches can be seen for the Clearwater, New, Homochitto, Mattawamkeag, Gasconade, Trinity, and Little Fork rivers, while the White, Sheyenne, Brazos, and San Simon rivers did not respond well to calibration, suggesting that parameters other than the four calibrated here are to blame. See e.g. Demaria et al.⁵⁰ for more insight on VIC calibration.

Table 6 Goodness of fit metrics for the 1994–2011 validation run over 12 CONUS basins.

Full size table

Surface radiation budget validation with SURFRAD

To evaluate how well the (uncalibrated) VICGlobal parameters simulate the surface radiation balance, we ran VIC over six SURFRAD⁵⁷ sites, using soil parameters from the 1/16° grid cell containing the SURFRAD sites. We ran hourly simulations in energy balance mode from 1995–2011, with 1994 as a spin-up year. Meteorological inputs taken from meteorological stations at the sites provided input data for the model, except for precipitation, which we took from L2013.

The first row of Fig. 9 shows modelled and observed net radiation, upwelling longwave, downwelling longwave, upwelling shortwave, and downwelling shortwave radiation averaged over each day from 1995–2011 for six SURFRAD sites in the CONUS. Downwelling shortwave and longwave radiation predictions similar to the observations because the SURFRAD data were used as inputs for the VIC model (but not identical because precipitation inputs were taken from L2013 due to lack of ground measurements at the sites). There is a positive bias for net radiation resulting from a slight low bias for upwelling shortwave radiation. Overall, the bias is small.

The second and third rows of Fig. 9 show scatterplots of predicted vs. observed upwelling shortwave and longwave radiation, with one-to-one lines shown in black. The correlation between predicted and observed upwelling shortwave and longwave radiation is close to 1 (ranges from 0.96–0.99 for all sites). Running VIC with VICGlobal parameters allows simulation of upwelling longwave radiation with an RMSE of 25 W/m² and RMSE of 15 W/m² for upwelling shortwave radiation. In Fig. 9, we have excluded data at times when snow covers the ground to address the scale-issue — the spatial scale of the VIC simulations (a 1/16° grid cell) is much larger than that of the SURFRAD measurement — because of snow’s large role in determining upwelling solar radiation, we excluded times when either the VIC model or SURFRAD measurements had snow on the ground using an albedo threshold of 0.4; none of the VICGlobal albedos for non-snowy land surfaces are this large.

Usage Notes

We have described VICGlobal, a globally-consistent 1/16° VIC parameter dataset with soil and vegetation parameters derived from the latest satellite-based remote sensing datasets (MODIS and MERIT, which is based on SRTM data) and in-situ soil data from the FAO HWSD. In addition to its higher resolution, VICGlobal has an advantage over previous global VIC setups due to its inclusion of seasonally-varying fractional canopy cover, LAI, and albedo, and because it explicitly accounts for barren, wetland, open water, and perennial snow and ice land covers. VICGlobal is provided in geographic coordinates, referenced to the WGS84 ellipsoid and datum.

VICGlobal has a few limitations. Its parameters are uncalibrated, so users must calibrate sensitive yet hard-to-measure parameters such as soil depth and the variable infiltration capacity parameter to get a good match between simulated and observed discharge. Several of the vegetation parameters, such as roughness length and displacement height, are assumed constant in time, even though realistically these parameters change as vegetation blooms and senesces throughout the year. And while we believe our monthly, hemisphere-average fractional canopy cover, LAI, and albedo are a major improvement over past global datasets, the most realistic parameter set would have them vary from grid cell to grid cell, even for the same vegetation type. Despite its limitations, we hope that VICGlobal, with its relatively high spatial resolution, wide coverage, and easy availability will be a valuable resource for VIC users.

Code availability

The scripts used to create the VICGlobal data set can be found on the corresponding author’s Github page (https://github.com/jschap1/vicglobal-prep and https://github.com/jschap1/vegpar). The VICGlobal parameters were subset to the Upper Colorado River Basin using the subsetting codes included with the VICGlobal dataset, archived on Zenodo⁴⁸.

References

Hamman, J. J., Nijssen, B., Bohn, T. J., Gergel, D. R. & Mao, Y. The variable infiltration capacity model version 5 (VIC-5): Infrastructure improvements for new applications and reproducibility. Geosci. Model Dev. 11, 3481–3496 (2018).
Article ADS Google Scholar
Hamman, J. et al. Variable Infiltration Capacity (VIC) Macroscale Hydrologic Model. Zenodo https://doi.org/10.5281/zenodo.1233657 (2018).
Liang, X., Lettenmaier, D. P., Wood, E. F. & Burges, S. J. A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J. Geophys. Res. 99 (1994).
Nijssen, B., Schnur, R. & Lettenmaier, D. P. Global Retrospective Estimation of Soil Moisture Using the Variable Infiltration Capacity Land Surface Model, 1980–93. J. Clim. 14, 1790–1808 (2001).
Article ADS Google Scholar
Su, F., Adam, J. C., Bowling, L. C. & Lettenmaier, D. P. Streamflow simulations of the terrestrial Arctic domain. J. Geophys. Res. D Atmos. 110, 1–25 (2005).
Article Google Scholar
Lin, P. et al. Global reconstruction of naturalized river flows at 2.94 million reaches. Water Resour. Res. 2019WR025287, https://doi.org/10.1029/2019WR025287 (2019).
Zhou, T., Nijssen, B., Gao, H. & Lettenmaier, D. P. The contribution of reservoirs to global land surface water storage variations. J. Hydrometeorol. 17, 309–325 (2016).
Article ADS Google Scholar
Adam, J. C., Hamlet, A. F. & Lettenmaier, D. P. Implications of global climate change for snowmelt hydrology in the twenty-first century. Hydrol. Process. 23, 962–972 (2009).
Article ADS Google Scholar
Bohn, T. J. & Vivoni, E. R. MOD-LSP, MODIS-based parameters for hydrologic modeling of North American land cover change. Sci. Data 6, 144 (2019).
Article Google Scholar
Wu, H. et al. A new global river network database for macroscale hydrologic modeling. Water Resour. Res. 48 (2012).
Wood, E. F. et al. Hyperresolution global land surface modeling: Meeting a grand challenge for monitoring Earth’s terrestrial water. Water Resour. Res. 47, 1–10 (2011).
Article MathSciNet Google Scholar
Livneh, B. et al. A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States: Update and extensions. J. Clim. 26, 9384–9392 (2013).
Article ADS Google Scholar
Maurer, E. P., Wood, A. W., Adam, J. C., Lettenmaier, D. P. & Nijssen, B. A Long-Term Hydrologically-Based Data Set of Land Surface Fluxes and States for the Conterminous {United States}. J. Clim. 15, 3237–3251 (2002).
Article ADS Google Scholar
Nachtergaele, F. et al. Harmonized World Soil Database (version 1.2). FAO, Rome, Italy IIASA, Laxenburg, Austria 1–50 (2012).
Gelaro, R. et al. The modern-era retrospective analysis for research and applications, version 2 (MERRA-2). J. Clim. 30, 5419–5454 (2017).
Article ADS Google Scholar
Rodell, B. Y. M. et al. THE GLOBAL LAND DATA ASSIMILATION SYSTEM This powerful new land surface modeling system integrates data from advanced observing systems to support improved forecast model initialization and hydrometeorological investigations. Bull. Amer. Meteor. Soc. 85, 381–394 (2004).
Article ADS Google Scholar
Xiao, M., Nijssen, B. & Lettenmaier, D. P. Drought in the Pacific Northwest, 1920–2013. J. Hydrometeorol. 17, 2391–2404 (2016).
Article ADS Google Scholar
Livneh, B. et al. A spatially comprehensive, hydrometeorological data set for Mexico, the U.S., and Southern Canada 1950-2013. Sci. Data 2, 1–12 (2015).
Article Google Scholar
Bohn, T. J., Whitney, K. M., Mascaro, G. & Vivoni, E. R. A deterministic approach for approximating the diurnal cycle of precipitation for use in large-scale hydrological modeling. J. Hydrometeorol. 20, 297–317 (2019).
Article ADS Google Scholar
Yamazaki, D. et al. A high-accuracy map of global terrain elevations. Geophys. Res. Lett. 44, 5844–5853 (2017).
Article ADS Google Scholar
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
Article Google Scholar
Wieder, W. R., Boehnert, J., Bonan, G. B. & Langseth, M. Regridded Harmonized World Soil Database v1 .2. ORNL Distributed Active Archive Center https://doi.org/10.3334/ORNLDAAC/1247 (2014).
D’Errico, J. inpaint_nans. (2012).
Liang, X., Wood, E. F. & Lettenmaier, D. P. Surface soil moisture parameterization of the VIC-2L model: Evaluation and modification. Glob. Planet. Change 13, 195–206 (1996).
Article ADS Google Scholar
Cosby, B. J., Hornberger, G. M., Clapp, R. B. & Ginn, T. R. A Statistical Exploration of the Relationships of Soil Moisture Characteristics to the Physical Properties of Soils. Water Resour. Res. 20, 682–690 (1984).
Article ADS Google Scholar
Schaake, J. C. Average hydraulic properties of ARS soil texture classes. https://vic.readthedocs.io/en/master/Documentation/soiltext/ (2000).
Hoffman, H. soil_classification. (2016).
Peters-Lidard, C. D., Blackburn, E., Liang, X. & Wood, E. F. The effect of soil thermal conductivity parameterization on surface energy fluxes and temperatures. J. Atmos. Sci. 55, 1209–1224 (1998).
Article ADS Google Scholar
MathWorks. gradientm. in Mapping Toolbox ^TM Reference R2019a 680–682 (2019).
Rawls, W. J. Infiltration and Soil Water Movement. in Handbook of Hydrology (McGraw-Hill, Inc., 1993).
Friedl, M. & Sulla-Menashe, D. MCD12C1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 0.05Deg CMG. https://doi.org/10.5067/MODIS/MCD12C1.006 (2015).
Sulla-Menashe, D., Gray, J. M., Abercrombie, S. P. & Friedl, M. A. Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product. Remote Sens. Environ. 222, 183–194 (2019).
Article ADS Google Scholar
Zeng, X. Global Vegetation Root Distribution for Land Modeling. J. Hydrometeorol. 2, 525–530 (2001).
Article ADS Google Scholar
Hansen, M. C., Sohlberg, R., Defries, R. S. & Townshend, J. R. G. Global land cover classification at 1 km spatial resolution using a classification tree approach. International Journal of Remote Sensing vol. 21 (2000).
Mitchell, K. E. et al. The multi-institution North American Land Data Assimilation System (NLDAS): Utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. J. Geophys. Res. D Atmos. 109 (2004).
Cherkauer, K. A. GLDAS vegetation parameters. (1999).
Liang, S. et al. A long-term Global LAnd Surface Satellite (GLASS) data-set for environmental studies. Int. J. Digit. Earth 6, 5–33 (2013).
Article ADS Google Scholar
Xiao, Z. et al. Long-Time-Series Global Land Surface Satellite Leaf Area Index Product Derived from MODIS and AVHRR Surface Reflectance. IEEE Trans. Geosci. Remote Sens. 54, 5301–5318 (2016).
Article ADS Google Scholar
Liang, S. et al. Global LAnd Surface Satellite (GLASS) products: Algorithms, validation and analysis. (Springer, 2013).
Didan, K., Munoz, A. B., Solano, R. & Huete, A. MODIS Vegetation Index User’s Guide (MOD13 Series) Version 3.0 Ccollection 6). 38 (2015).
Hall, D. K. & Riggs, G. A. MODIS/Terra Snow Cover Monthly L3 Global 0.05Deg CMG, Version 6. (2015)
Ducoudré, N. I., Laval, K. & Perrier, A. SECHIBA, a New Set of Parameterizations of the Hydrologic Exchanges at the Land-Atmosphere Interface within the LMD Atmospheric General Circulation Model. J. Clim. 6, 248–273 (1993).
Article ADS Google Scholar
Dorman, J. L. & Sellers, P. J. A global climatology of albedo, roughness length and stomatal resistance for atmospheric general circulation models as represented by the Simple Biosphere Model (SiB). J. Appl. Meteor. (1989).
Sellers, P. J., Mintz, Y., Sud, Y. C. & Dalcher, A. A Simple Biosphere Model (SIB) for Use within General Circulation Models. J. Atmos. Sci. 43, 505–531 (1986).
Article ADS Google Scholar
Nijssen, B., Schnur, R. & Lettenmaier, D. P. Global retrospective estimation of soil moisture using the variable infiltration capacity land surface modl, 1980-93. J. Clim. 14, 1790–1808 (2001).
Article ADS Google Scholar
Mao, D., Cherkauer, K. A. & Bowling, L. C. Improved vegetation properties for the estimation of evapotranspiration in the Midwestern United States. in ASABE Annual International Meeting (2007).
Mao, D. & Cherkauer, K. A. Impacts of land-use change on hydrologic responses in the Great Lakes region. J. Hydrol. 374, 71–82 (2009).
Article ADS Google Scholar
Schaperow, J. R. & Li, D. VICGlobal: soil and vegetation parameters for the Variable Infiltration Capacity hydrological model. Zenodo https://doi.org/10.5281/zenodo.5038653 (2021).
Prairie, J. & Callejo, R. Natural Flow and Salt Computation Methods, Calendar Years 1971–1995. Bur. Reclam. 1–112 (2005).
Demaria, E. M., Nijssen, B. & Wagener, T. Monte Carlo sensitivity analysis of land surface parameters using the Variable Infiltration Capacity model. J. Geophys. Res. Atmos. 112, 1–15 (2007).
Article Google Scholar
Shi, X., Wood, A. W. & Lettenmaier, D. P. How essentialis hydrologic model calibration to seasonal stream flow forecasting? J. Hydrometeorol. 9, 1350–1363 (2008).
Article ADS Google Scholar
Gou, J. et al. Sensitivity Analysis-Based Automatic Parameter Calibration of the VIC Model for Streamflow Simulations Over China. Water Resour. Res. 56, 1–19 (2020).
Article Google Scholar
Duan, Q., Sorooshian, S. & Gupta, V. Effective and Efficient Global Optimization for Conceptual Rainfall-Runoff Models. Water Resour. Res. 28, 1015–1031 (1992).
Article ADS Google Scholar
Tolson, B. A. & Shoemaker, C. A. Dynamically dimensioned search algorithm for computationally efficient watershed model calibration. Water Resour. Res. 43, 1–16 (2007).
Article Google Scholar
Gupta, H. V., Kling, H., Yilmaz, K. K. & Martinez, G. F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 377, 80–91 (2009).
Article ADS Google Scholar
Falcone, J. A. GAGES-II: Geospatial Attributes of Gages for Evaluating Streamflow. http://pubs.er.usgs.gov/publication/70046617 (2011).
NOAA. SURFRAD (Surface Radiation) Network. https://www.esrl.noaa.gov/gmd/grad/surfrad/index.html.

Download references

Acknowledgements

We thank Dai Yamazaki for granting us access to the MERIT dataset and Bill Yeh for a helpful discussion about calibration. The first author acknowledges support from the NASA Earth and Space Science Fellowship Program Grant 80NSSC18K1418.

Author information

Authors and Affiliations

Department of Civil and Environmental Engineering, University of California, Los Angeles, 90095, USA
Jacob R. Schaperow, Dongyue Li & Steven A. Margulis
Department of Geography, University of California, Los Angeles, 90095, USA
Dongyue Li & Dennis P. Lettenmaier

Authors

Jacob R. Schaperow
View author publications
You can also search for this author in PubMed Google Scholar
Dongyue Li
View author publications
You can also search for this author in PubMed Google Scholar
Steven A. Margulis
View author publications
You can also search for this author in PubMed Google Scholar
Dennis P. Lettenmaier
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.R. Schaperow compiled the dataset and wrote the manuscript. D. Li converted the Classic driver input files to NetCDF Image driver inputs. All authors participated in planning and revising the manuscript.

Corresponding author

Correspondence to Jacob R. Schaperow.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Schaperow, J.R., Li, D., Margulis, S.A. et al. A near-global, high resolution land surface parameter dataset for the variable infiltration capacity model. Sci Data 8, 216 (2021). https://doi.org/10.1038/s41597-021-00999-4

Download citation

Received: 26 November 2020
Accepted: 29 June 2021
Published: 11 August 2021
DOI: https://doi.org/10.1038/s41597-021-00999-4