The effects of numerical-model complexity and observation type on estimated porosity values

Starn, J. Jeffrey; Bagtzoglou, Amvrossios C.; Green, Christopher T.

doi:10.1007/s10040-015-1289-3

The effects of numerical-model complexity and observation type on estimated porosity values

Les effets de la complexité des modèles numériques et du type d’observation des valeurs estimées de porosité

La complejidad de los efectos del modelo numérico y del tipo de observación sobre los valores de porosidad estimados

数值模型的复杂性和观测类型对估算的孔隙度值的影响

Επίδραση πολυπλοκότητας αριθμητικών μοντέλων και είδους παρατηρήσεων σέ εκτιμώμενες τιμές πορώδους

Os efeitos da complexidade e tipos de observações de modelos numéricos nos valores de porosidade estimada

Paper
Open access
Published: 12 July 2015

Volume 23, pages 1121–1128, (2015)
Cite this article

Download PDF

You have full access to this open access article

Hydrogeology Journal Aims and scope Submit manuscript

The effects of numerical-model complexity and observation type on estimated porosity values

Download PDF

J. Jeffrey Starn^1,2,
Amvrossios C. Bagtzoglou² &
Christopher T. Green³

1822 Accesses
4 Citations
Explore all metrics

Abstract

The relative merits of model complexity and types of observations employed in model calibration are compared. An existing groundwater flow model coupled with an advective transport simulation of the Salt Lake Valley, Utah (USA), is adapted for advective transport, and effective porosity is adjusted until simulated tritium concentrations match concentrations in samples from wells. Two calibration approaches are used: a “complex” highly parameterized porosity field and a “simple” parsimonious model of porosity distribution. The use of an atmospheric tracer (tritium in this case) and apparent ages (from tritium/helium) in model calibration also are discussed. Of the models tested, the complex model (with tritium concentrations and tritium/helium apparent ages) performs best. Although tritium breakthrough curves simulated by complex and simple models are very generally similar, and there is value in the simple model, the complex model is supported by a more realistic porosity distribution and a greater number of estimable parameters. Culling the best quality data did not lead to better calibration, possibly because of processes and aquifer characteristics that are not simulated. Despite many factors that contribute to shortcomings of both the models and the data, useful information is obtained from all the models evaluated. Although any particular prediction of tritium breakthrough may have large errors, overall, the models mimic observed trends.

Résumé

Les mérites relatifs de la complexité des modèles et des types d’observations utilisées dans la calibration des modèles sont comparés. Un modèle existant d’écoulement des eaux souterraines couplé à une simulation du transport advectif de la vallée du Lac Salé, Utah (Etats-Unis d’Amérique), est adapté pour le transport advectif, et la porosité efficace est ajustée jusqu’à ce que les concentrations simulées en tritium coïncident aux concentrations des échantillons provenant des puits. Deux approches de calage sont employées: un champ de porosité “complexe”, fortement paramétré et un modèle parcimonieux “simple” de distribution de la porosité. L’utilisation d’un traceur atmosphérique (tritium dans ce cas-ci) et des âges apparents (à partir du rapport tritium/hélium) dans le calage de modèles sont également discutés. Des deux modèles examinés, le modèle complexe (avec des concentrations en tritium et des âges apparents du rapport tritium/hélium) fournit les meilleurs résultats. Bien que les courbes de percée de tritium simulées par les modèles complexes et simples soient très généralement semblables, et malgré l’intérêt du modèle simple, le modèle complexe est soutenu par une distribution plus réaliste de porosité et un plus grand nombre de paramètres estimables. Collecter les données de meilleure qualité n’a pas mené à améliorer le calage, probablement en raison des processus et des caractéristiques des aquifères qui ne sont pas simulés. En dépit de beaucoup de facteurs qui contribuent aux imperfections des modèles et des données, une information utile est obtenue à partir de tous les modèles évalués. Bien que n’importe quelle prévision particulière de percée de tritium puisse avoir de larges erreurs, finalement, les modèles simulent bien les tendances observées.

Resumen

Se comparan las ventajas relativas de la complejidad del modelo y del tipo de observaciones empleadas en la calibración de un modelo. Se adapta un modelo de flujo de agua subterránea ya existente unido a una simulación de transporte advectivo del Valle de Salt Lake, Utah (EE.UU.), para el transporte advectivo y se ajusta la porosidad efectiva hasta que las concentraciones simuladas de tritio coinciden con concentraciones en las muestras de los pozos. Se utilizan dos métodos de calibración: uno “complejo” de un campo porosidad altamente parametrizable y uno “simple” de un modelo parsimonioso de la distribución de la porosidad. También se discuten el uso de un trazador atmosférico (tritio en este caso) y las edades aparentes (de tritio / helio) en la calibración del modelo. De los modelos probados, el que mejor se comporta es el modelo complejo (con concentraciones de tritio y edades aparentes de tritio / helio). Aunque las curvas de ruptura de tritio simuladas por el modelo complejo y el simple son en general muy similares, y existe un valor para el modelo simple, el modelo complejo apoyado con una distribución de porosidad es más realista y presenta una mayor cantidad de parámetros estimables. El descarte de los mejores datos de calidad no dio lugar a una mejor calibración, posiblemente debido a que los procesos y las características de los acuíferos que no están simulados. A pesar de los muchos factores que contribuyen a las deficiencias tanto de los modelos como de los datos, la información útil se obtiene a partir de la evaluación de todos los modelos. Aunque cualquier predicción particular de la ruptura del tritio puede tener errores grandes, en general, los modelos imitan las tendencias que se observan.

摘要

对模型校准中使用的模型复杂性和观测类型优缺点进行了比较。现有的地下水流模型结合(美国)犹他州盐湖流域平流传输模拟以适应平流传输,并对有效孔隙度进行了调整,直至所模拟的氚浓度与井样品中的浓度相匹配。采用了两个校准方法:“复杂的”高度参数化的孔隙度场及“简单的”质量很一般的孔隙度分布模型。论述了模型校准中大气示踪剂(本实例中用的是氚)的使用和表观年龄(根据氚和氦)。在测试的模型中,复杂模型(氚浓度和氚/氦表观年龄)表现最好。尽管由复杂模型和简单模型模拟的氚突破曲线通常很相似,并且简单模型具有一定的价值,但复杂模型由更现实的孔隙度分布和更多数量的可估算的参数来支撑。选择质量最好的资料并不能得到更好的校准结果,这可能是因为没有模拟的过程和含水层特征造成的。尽管造成两个模型和资料有缺点的因素很多,但从所有评估的模型中获取了有用的信息。虽然氚突破任何特别的预测可能有很大的误差,但总的来说,模型模仿了观测的趋势。

Περίληψη

Τα συγκριτικά πλεονεκτήματα της πολυπλοκότητας μοντέλων και των ειδών παρατηρήσεων που χρησιμοποιούνται σε βαθμονόμηση μοντέλου συγκρίνονται στην εργασία αυτή. Ένα υπάρχον μοντέλο ροής υπογείων υδάτων σε συνδυασμό με προσομοίωση μεταφοράς συναγωγής στη Salt Lake Valley, Utah (ΗΠΑ), προσαρμόζεται και τό φαινόμενο πορώδες ρυθμίζεται ούτως ώστε προσομοιωμένες συγκεντρώσεις Τρίτιου να προσεγγίσουν συγκεντρώσεις σε δείγματα από φρεάτια δειγματοληψίας. Δύο προσεγγίσεις βαθμονόμησης χρησιμοποιούνται: ένα “πολύπλοκο” εξαιρετικά παραμετροποιημένο πεδίο πορώδους και ένα “απλό” ενοποιημένο μοντέλο της χωρικής κατανομής του πορώδους. Η χρήση ατμοσφαιρικών ιχνηλατών (Τρίτιου στην προκειμένη περίπτωση) και φαινόμενης ηλικίας/παλαιότητας υπόγειου νερού (από την αναλογία Τρίτιου/Ηλιου) συζητείται σχετικά με τη βαθμονόμηση του μοντέλου. Από τα μοντέλα που εξετάστηκαν, το πολύπλοκο/σύνθετο μοντέλο (με συγκεντρώσεις Τρίτιου και αναλογία Τρίτιου/Ηλιου) αποδίδει καλύτερα. Αν και οι προσομοιωμένες καμπύλες συγκέντρωσης Τρίτιου με σύνθετα και απλά μοντέλα είναι γενικά παρόμοιες, και σίγουρα το απλό μοντέλο έχει μεγάλη χρησιμότητα, το σύνθετο μοντέλο υποστηρίζει μια πιο ρεαλιστική κατανομή πορώδους και μεγαλύτερο αριθμό εκτιμητέων παραμέτρων. Χρησιμοποίηση των δεδομένων με την καλύτερη ποιότητα δεν οδήγησε σε καλύτερη βαθμονόμηση, πιθανώς λόγω των χαρακτηριστικών του υδροφορέα και φαινομένων που δεν έχουν συμπεριληφθεί στην προσομοίωση. Παρά τους πολλούς παράγοντες που επέφεραν αδυναμίες και στα δύο μοντέλα και τα δεδομένα, χρήσιμες πληροφορίες έχουν ληφθεί από όλα τα μοντέλα που έχουν αξιολογηθεί. Αν και οποιαδήποτε συγγεκριμένη πρόβλεψη συγκέντρωσης Τρίτιου μπορεί να έχει μεγάλη λάθη, από μια συνολική άποψη, τα μοντέλα μιμούνται τις παρατηρούμενες τάσεις.

Resumo

As vantagens relativas da complexidade e tipos de observação empregado na calibração de modelo foram comparadas. Um modelo de fluxo das águas subterrâneas existente associado a uma simulação do transporte advectivo do Vale do Lago Salgado, Utah (EUA), foi adaptado para transportes advectivos, e a porosidade efetiva ajustada até que a simulação da concentração de trítio corresponda as concentrações nas amostras de poços. Duas abordagens foram utilizadas na calibração: um “complexo” campo da porosidade altamente parametrizado e um modelo “simples” parcimonioso de distribuição da porosidade. O uso de um traçador atmosférico (trítio neste caso) e idade aparente (do trítio/hélio) na calibração também foram discutidos. Dos modelos testados, o modelo complexo (com concentração de trítio e idade aparente do trítio/hélio) obteve melhor desempenho. Embora as curvas de distribuição de trítio simuladas por modelos complexos e simples sejam normalmente similares, e há valor no modelo simples, o modelo complexo é sustentado por uma distribuição de porosidade mais realista e um grande número de parâmetros estimáveis. A escolha dos dados de melhor qualidade não levou a melhor calibração, possivelmente por causa de processos e características dos aquíferos que não foram simulados. Apesar de muitos fatores contribuírem para falhas em ambos, modelos e dados, informações importantes foram obtidas por todos os modelos avaliados. Embora qualquer estimativa especifica de distribuição de trítio possa apresentar grandes erros, em geral, os modelos acompanham tendências observadas.

Impact of model complexity and multi-scale data integration on the estimation of hydrogeological parameters in a dual-porosity aquifer

Article 05 March 2018

Geostatistical modeling of porosity and evaluating the local and global distribution

Article Open access 25 September 2021

Vertical hydraulic conductivity and layered heterogeneity: from measurements to models

Article 08 March 2024

Background and motivation

Model uncertainty in transport simulations is affected by how transport parameters are represented in the model and by the number and type of data available for model calibration. Typically, modelers attempt to reduce uncertainty by adding hydrologic and geologic detail to the model and/or by adding additional calibration data. In this paper, model results are compared by using simple and complex models calibrated with either tritium concentrations or tritium concentrations plus interpreted tritium/helium apparent ages. Tritium concentrations above background levels have persisted in the Salt Lake Valley aquifer, Utah (USA) for decades, a time period over which changes in total dissolved solids concentrations also have been observed (Starn et al. 2014); therefore, one would expect that calibration to tritium data should serve to decrease uncertainty in model predictions of changes in water quality.

Ideally, all calibration data are simulated by a single flow and transport model and parameters of the flow model (e.g., hydraulic conductivity and storage coefficient) should be adjusted simultaneously with effective transport parameters; however, this may not be possible because of disparate transport processes in groundwater flow (diffusion) and solute transport (advection; Voss 2011a, b). Perturbations in head propagate evenly throughout a model domain, whereas solute concentrations change discretely and non-uniformly through permeable pathways in an aquifer (Voss 2011a, b). This discrepancy can be more apparent in transient groundwater systems because a given observation may be affected by a head perturbation long before it is affected by a change in solute concentration. Only one parameter, effective transport porosity, is required to convert a groundwater-flow model for advective transport simulation; however, prior estimates of effective porosity vary widely and often are not available for most sites (Konikow 2011). Effective transport porosity values have been determined through calibration to tritium/helium apparent ages (Reilly et al. 1994; Sheets et al. 1998; Murphy et al. 2011), and, less often, directly to tritium concentrations (Herweijer et al. 1985; Engesgaard et al. 1996; Starn et al. 2014).

Large and long-term changes in recharge or discharge relative to groundwater residence times can lead to significant long-term changes in water quality (Reilly and Pollock 1996). Relatively few studies have considered short-term (annual) transient flow in analyzing water-quality trends (Furlong et al. 2011; Starn et al. 2014). In transient flow, age distributions are time dependent (Manning et al. 2012). Time-dependent source areas (Rock and Kupfersberger 2002; Masterson et al. 2004; Starn and Brown 2007; Starn et al. 2014) can contribute different solutes to groundwater, resulting in complex mixtures of water at pumped wells.

Compared to 20 years ago, automated parameter estimation methods are now in common use for groundwater flow and transport models, largely because of public-domain automatic model calibration software such as UCODE (Poeter et al. 2005) and PEST (Doherty 2010, 2014). The use of these methods is discussed at length by Hill and Tiedeman (2007). Each of these software approaches favors different model parameterization strategies. UCODE favors parsimonious models in which the modeler applies geologic knowledge to create piecewise-constant parameter zones. On the other hand, PEST favors highly parameterized models in which parameter fields can vary continuously, thus allowing nuances in the data to possibly be expressed in the solution. In this paper model predictions made by using each of the approaches are explored. Also explored is the effect of having different sets of observations for model calibration with each of these approaches.

In order to improve a model, a modeler can increase the complexity of the model in the hope of capturing additional physical processes that affect model results. Another way to improve a model is to add more or different types of data. In principle, a highly parameterized model should be better able to match the data because it has more parameters, and a model with more and different kinds of data should be able to capture more of the real physical variability in the system being modeled. Although this is not an exhaustive comparison of modeling approaches, the goal in this paper is to contribute to the constructive dialog on the relative merits of complex models and types of observations employed in model calibration. The question posed is the following: “Is added model complexity as useful for predictions as adding different types of observations?”

Approach

The approach used in this study is to convert an existing groundwater flow model to simulate advective transport in the Salt Lake Valley, Utah (Fig. 1), by adjusting effective porosity to match tritium concentrations in samples from wells and, alternatively, interpreted tritium/helium apparent ages (Starn et al. 2014). The modeling procedure includes simulation of groundwater flow (MODFLOW-NWT; Niswonger et al. 2011), simulation of advective transport (MODPATH; Pollock 2012), and a suite of pre- and post-processing steps to perform convolution-based particle tracking (CBPT; Robinson et al. 2010; Green et al. 2010; Srinivasan et al. 2011; Starn et al. 2012, 2013, 2014). To simplify interpretation, parameters of the transient groundwater flow model are held constant and only advective transport is considered. This is justified in part because parameters for groundwater flow may not correspond directly to parameters for transport. Two scenarios of porosity distribution are calibrated using (1) a complex model with highly parameterized porosity distributions and (2) a simple model with parsimoniously zoned porosity distributions. These are described in detail in the next two sections.

Each porosity scenario is calibrated to two data sets for a total of four simulations that are compared. The complex scenario is calibrated to the “full data set,” which consists of 185 samples from 80 wells (Fig. 2) mainly in the area where increased dissolved solids has been observed. Some of the wells were sampled multiple times over a period of decades, yielding 122 observations of tritium concentration. Interpreted groundwater residence times from tritium/helium ratios are available from 52 of the 80 wells (63 observations because some wells were sampled multiple times). The two data sets are (1) tritium concentrations plus apparent ages and (2) only tritium concentrations without apparent ages.

The simple scenario is calibrated to a “reduced data set” that was extracted from the “full data set” by including only wells for which a time series of tritium concentrations (more than one sample over time) and tritium/helium apparent ages were available. A further criterion was that the observation be obtained from wells where the well screens corresponded to at most two model layers (Fig. 3). Application of these criteria winnowed the data to 48 samples from 12 wells consisting of 31 observations of tritium concentration and 17 observations of interpreted groundwater residence times from tritium/helium ratios. The reason that the two scenarios were calibrated to different data sets is that a modeler would typically choose model complexity in part based on the amount of data available. One would be less likely to create the complex model if all one had available was the reduced data set. Also, the reduced data set did not contain sufficient information to calibrate the complex model.

Most of the tritium data are in the USGS National Water Information System (NWIS) database (USGS 2015). Additional data were obtained from Manning (2002) and Thiros (2003), and tritium data from USGS sites reported by University of Utah and Lamont-Doherty Earth Observatory.

Hydrogeologic setting and simulation of groundwater flow

The basin-fill aquifer in the Salt Lake Valley, Utah (Fig. 1), is bounded by mountains and by the Great Salt Lake. Quaternary-age basin fill consists of unconsolidated to semi-consolidated sediments, which average 600 m thick with interbeds of clay, silt, sand, and gravel and lenses of sand and gravel (Stolp 2007). Sediments originated from the adjacent mountains and are generally coarser and less well sorted near the valley wall and finer near the center of the valley. Discontinuous fine-grained layers confine groundwater in the central part of the basin. These layers are not present near the basin edges, where recharge occurs from losing stream reaches, infiltrating precipitation, and flow through fractured consolidated rock in the adjacent mountains.

The basin fill includes Tertiary-age semi-consolidated sediments at its base permeable enough to yield water to wells (Stolp 2007). Groundwater flows from the recharge areas to the Jordan River in the center of the valley (Fig. 1) and to pumping wells. Groundwater from wells supplied about 29 % of the water used for public supply in the basin in 2005 (Thiros and Spangler 2010). While most of the groundwater pumped is used for drinking, some is used for agricultural and industrial purposes (Burden 2009). Some parts of the aquifer contain older water that has been in contact with aquifer minerals for centuries or millennia, resulting in high dissolved solids concentrations (Thiros and Spangler 2010). Carbon-14 and Helium-4 data support very old ages in some wells in Salt Lake Valley (Stephen Hinkle, USGS, unpublished data, 2012).

Population in the valley has steadily increased from the 1930s to more than 1 million people in 2011. Although pumping at individual wells has been variable over time, total withdrawals from the aquifer leveled off after about 1997. During the period of population increase, many wells exhibited long-term increases in dissolved solids concentration. The increasing dissolved solids are related to the location and magnitude of groundwater withdrawals from wells. Increased groundwater withdrawals have induced flow from different source areas, by enhancing both downward flow from areas affected by anthropogenic activity and horizontal flow from areas containing mineralized water from natural processes.

A steady-state model of the Salt Lake Valley was manually calibrated to measured water levels, groundwater flow to the Jordan River, and vertical hydraulic gradients from 1964 to 1968, when groundwater withdrawals and changes in storage were relatively constant (Lambert 1995). Heads computed by the steady-state simulation were used as the initial condition for a transient simulation in which recharge and pumping rates were varied at their estimated annual rates from 1969 to 1991. The transient model was manually calibrated to measured water-level changes and groundwater flow to the Jordan River, and later extended for part of the modeled area to include the time period 1935 to 1964 (Lambert 1995, 1996). Stolp (2007) then updated this model to reflect average recharge and withdrawals during 1997 to 2001.

The model covers 1,152 km² of the Salt Lake Valley. The model grid cells are 563 m on each side in 94 rows and 62 columns. The aquifer is divided into 7 layers; the top two layers represent the shallow unconfined aquifer and the underlying confining unit in the center of the valley. The thickness of each layer is variable. Layers 3–7 represent the principal aquifer and are 46, 46, 46, 61 m, and greater than or equal to 61 m thick, respectively. Layer 7 is a maximum of 460 m thick in the deepest parts of the basin. Boundary conditions include recharge from precipitation, infiltration from losing streams and adjacent mountain blocks, and discharge through pumping wells, gaining stream reaches, and evapotranspiration. The base of the aquifer is a zero-flow boundary.

Simulation and calibration of advective transport

The 2007 groundwater flow model was modified for this study to use MODFLOW-NWT (Niswonger et al. 2011), which necessitated substituting the upstream weighting (UPW) flow package for the block-centered flow (BCF) package (Harbaugh 2005). The multi-node well (MNW2) package for MODFLOW (Konikow et al. 2009) also was used to better simulate pumping from long-screened public-supply wells. Differences in heads and flows between the 2007 model and modified models were very small and were considered negligible.

Although macrodispersion can be an important transport mechanism in groundwater, studies of regional-scale transport have used advective transport simulations to provide first-order estimates of solute concentrations with success (Sanford 2011). Advective transport simulations also have been used successfully to interpret trends of solute concentrations (Kauffman et al. 2001; McMahon et al. 2008) and to estimate groundwater residence times (Wellman et al. 2012) in steady flow.

Advective transport was simulated by using convolution-based particle tracking (CBPT; Starn et al. 2012, 2013, 2014). This method computes either resident solute concentration (for samples from unpumped wells and monitoring wells) or flux-weighted concentration (for pumped wells). The simulated equivalent of apparent age is taken to be the mean particle age. Multi-layer flow rates for flux-weighted concentrations are simulated by the MNW2 package. Flow within a cell containing a pumped well is simulated by using an analytical solution (Zheng 1994) and embedded into a pre-processor by Starn et al. (2012). Particle advection is simulated by using MODPATH and converted into concentration break-through curves by using the CBPT method cited earlier.

The computer code PEST++ (Welter et al. 2012) is used as an inverse simulator. PEST++ is used for both the complex and simple models. In the latter case, it is set up to estimate parameters defined in piece-wise constant zones. The highly parameterized inverse model uses pilot points, regularization, and truncated singular value decomposition as described in guidelines by Doherty and Hunt (2010a) and Welter et al. (2012). Porosity parameters were defined for each of 3 layer groups (layers 1–2, 3–5, and 6–7). A single porosity was estimated for each of the shallowest and deepest layer groups. Porosity in the middle layer group, where most of the pumping occurs, was estimated by using 37 pilot points (Fig. 2). Continuously varying porosity fields for each layer in which there are pilot points are created by interpolating from the pilot points to the center of each finite-difference cell. Homogeneity in the estimated porosity field is favored by regularization that minimizes differences in porosity values between each pilot point and its nearest neighbors Doherty and Hunt (2010a). In this way parameter estimates at pilot points are constrained by a total of 202 regularization equations. Weights on regularization equations are proportional to their separation distance; the constant of proportionality is calculated by PEST++ using constrained optimization (Doherty and Hunt 2010a). Singular value decomposition is a technique whereby parameters are combined into unique linear combinations called super-parameters. Simulation runs do not need to be done for insensitive super-parameters (Doherty and Hunt 2010a), so this technique can reduce model run time.

The simple inverse model was parameterized into three zones, and one effective porosity value was estimated for each zone. The first zone consists of layers 1 and 2; the second zone consists of layers 3–5, and the third zone consists of layers 6 and 7. Pilot points were not used in the simple model; regularization was used, but differently than in the complex model. In the simple model, a preferred regularization value of porosity of 0.10 was assigned to each zone. As in the complex model, the constant of proportionality for these preferred values was determined by using constrained optimization (Doherty and Hunt 2010a).

Weights specified for all observations were calculated with a coefficient of variation of 0.1 and a threshold of either 10 tritium units (TU) for concentration or 10 years for residence time as

$$ W= \min \left\{\begin{array}{c}\hfill {\left(0.1\times O\right)}^{-1}\hfill \\ {}\hfill 10.\hfill \end{array}\right. $$

(1)

where W is the inverse of the standard deviation and O is the observed value. Using the coefficient of variation (standard deviation divided by the mean) allows larger values to have smaller weights, thereby ensuring that small observations have an effect on the inverse simulation (Hill and Tiedeman 2007). The threshold value is used so that very small concentrations, which might have relatively small standard deviations, do not dominate the solution (Hill and Tiedeman 2007). Although weights on tritium concentration and the apparent age that is derived from tritium concentration are correlated, these correlations are not expected to strongly affect the interpretation of these results (Tiedeman and Green 2013).

Results and discussion

Although many improvements such as revisiting the weighting scheme based on an analysis of influential observations, could be made to the calibration, the intent here is to perform the calibrations as consistently as possible and evaluate the relative merits of the four model/scenario combinations. The goodness-of-fit of the calibrations cannot be judged by the objective function value alone because of the different numbers of parameters, different calibration data sets, and because PEST++ adjusts weights among observation groups automatically between iterations. With that caveat in mind, calibration improved the objective functions by small amounts—by factors of 0.81 and 0.41 for the complex model with and without apparent age data, respectively, and by factors of 0.73 and 0.71 for the simple model with and without apparent age data. The pattern of the estimated porosity distribution for the complex model with age data (Fig. 4) is similar to that of previous estimates for the east side of the valley (Freethey et al. 1994), in that porosity is lowest near the margin of the valley, increases toward the center of the basin, and is higher beneath reaches of tributary streams such as Big Cottonwood Creek and Little Cottonwood Creek (Fig. 1). Estimated porosities for the complex model without age data remained close to their initial values throughout calibration, ranged from 0.09 to 0.13, and exhibited a pattern similar to the complex model but with a smaller range. Porosities for the simple model with age data were 0.11, 0.09, and 0.09 for the shallow, medium, and deep layers respectively, and was uniformly 0.11 for all layers in the simple model without age data.

When evaluating the model fit, it is instructive to compare an observation to the entire break-through curve (BTC) rather than to simulated estimates at the one time of the observation. Small changes in porosity shift the simulated curves left or right and, because of the time-dependence of tritium decay, also changes the height of the peak (see Starn et al. 2014 for a graphic example). Wells A, B, and C were selected for display because they are typical fits, are at increasing distance from the basin edge, and have multiple tritium samples and an associated tritium/helium apparent age (Fig. 5). All the models tend to over-predict age, which means that tritium has had more time to decay, and therefore the models also tend to under-predict tritium concentrations. In order to improve the fit, porosities would have to be more than an order of magnitude smaller than those estimated in this study. In general, all the models predicted a temporal trend similar to the trend in the observations.

Well B (Fig. 5) deserves a more detailed discussion. Well B is a pumped well, and during the time tritium samples were being collected (approximately 2000–2010), pumping was halted such that the source of water to the well switched several times between layers 3 and 4. Possible reasons for the low simulated values are (1) the coarse temporal resolution (yearly stress periods) of the groundwater flow model were insufficient to capture flow conditions at the time of sampling; (2) water containing tritium was sequestered in low-permeability lenses within the aquifer, which would essentially be an unsimulated source of tritium when the well was not pumped (multiple transport domains); and (or) (3) the mixture of water from layers 3 and 4 was inaccurately simulated because of unsimulated heterogeneity in those layers. A separate simulation (not discussed in detail here) with a finite-difference transport simulator, which tended to smear temporal fluctuations, did indeed produce a much better fit at well B.

A qualitative way to examine the information content of the four models/scenarios is through the identifiability (IDENT) and relative parameter error reduction statistics (RPER; Doherty and Hunt 2009; comment by Hill 2010; response by Doherty and Hunt 2010b). IDENT and RPER are similar to the composite-scaled sensitivity (CSS) statistic discussed by Hill and Tiedeman (2007) and are appropriate for use in highly parameterized models. Calculation and evaluation of IDENT and RPER is complex and beyond the scope of this article. Simply put, they vary between 0 and 1: a parameter for which IDENT = 0 cannot be estimated with the current model and data set, and a parameter for which IDENT = 1 means that any error associated with the parameter estimate is associated only with measurement noise and not with model inadequacy. Both statistics are sensitive to a value chosen by the modeler (in reference to Doherty and Hunt 2009, it is the number of singular values assigned to model calibration space); however, a poor choice for this number results in values of the statistics less than 0, which did not occur in this case. The number of singular values assigned to the solution space is a function of the number of unique linear orthogonal combinations of parameter values that reduce model error. RPER is similar to IDENT except that it reduces the effect of measurement noise by comparing post-calibration error to pre-calibration error.

For the complex model, which has 39 parameters, 16 parameters have IDENT and RPER greater than 0.80 when using all data (24 singular values assigned to solution space), but when using tritium concentrations, only five parameters have IDENT greater than 0.80 and two with RPER greater than 0.80 (18 singular values assigned to solution space). On this basis, the complex model has significantly more estimable parameters when using all data than with tritium alone. Although not as meaningful because of the small number of parameters, each simple model had one parameter with IDENT and RPER greater than 0.80.

Multiple factors can hinder the ability of a model to reproduce concentrations (Konikow 2011). In this study, these include possible unsimulated processes such as transfer between multiple transport domains (multi-rate mass transfer), unsimulated variations of aquifer properties within flow-model grid cells, and coarse temporal resolution of pumping. With regard to the last factor, in transient flow, groundwater residence time is itself a function of time and varies in relation to stresses on the aquifer. In addition to these factors, there is the difficulty in model calibration of which limb on the tritium BTC to match. Equal fits can be achieved on rising and falling limbs. Apparent ages could help guide the model toward the appropriate limb. On the other hand, tritium/helium apparent ages are based on a piston-flow model and do not account for mixtures of water of different ages, as can be induced in pumping wells and in transient flow regardless of geologic heterogeneity. Tritium/helium apparent age can be significantly less than mean simulated particle residence time; that the amount is less depends on the ratio and age difference of the mixture, which is unknown. Thus, instead of helping choose a limb of the tritium BTC, apparent age could add disinformation. (It is well known that multiple tracers can sort out the ratio of the mixture, but these were not available in this study.)

Conclusions

Two model parameterizations and two data sets were evaluated to inform future modeling effort. Of the models tested, the complex model with tritium concentrations and tritium/helium apparent ages set performed best. Complex models require more time to construct and run, and can be complex to interpret; thus, a simple model might be preferred if it performs adequately. In this study, tritium break-through curves (BTC) simulated by complex and simple models were very generally similar; therefore, there is value in the simple model. The complex model reveals structure in the estimated porosity field that agrees with that found independently by Freethey et al. (1994), and this fact helps support the credibility. The degree to which this structure is expressed increases when apparent age data are included in model calibration. Apparently the potential for disinformation by including apparent age data was less than the information added by guiding the calibration toward the appropriate limb of the BTC. This conclusion is supported by the complex model having more estimable parameters when apparent age data are included than when they are not. Less improvement was noted when apparent age data were added to the simple model.

Two data sets were evaluated. The reduced data set (only used with the simple model) was constructed from the full data set because of the potential problems of matching a point on two-limbed BTCs and of sensitivity to mixing of water from different layers in response to pumping variations. The contention that model calibration might be improved if only measurements that limited those two complications were used was not realized. Even with carefully chosen data, unsimulated features may render that data less than useful, as seen in the BTC to well B (Fig. 5).

Despite many factors that contribute to shortcomings of both the models and the data, there was useful information obtained from all the models evaluated. Although any particular prediction of tritium concentration may have large errors, overall, the models mimic observed trends in tritium breakthrough. Care must be taken to interpret model calibrations, and multiple data types should be used in model calibration.

References

Burden CB (2009) Ground-water conditions in Utah, spring of 2009. Cooperative Investigations Report 50, Utah Department of Natural Resources, Salt Lake City, UT, 29 pp
Doherty J (2010) PEST: model independent parameter estimation. Watermark, Brisbane, Australia
Google Scholar
Doherty J (2014) Addendum to the PEST manual. Watermark, Brisbane, Australia
Google Scholar
Doherty J, Hunt RJ (2009) Two statistics for evaluating parameter identifiability and error reduction. J Hydrol 366:119–127
Article Google Scholar
Doherty J, Hunt RJ (2010a) Approaches to highly parameterized inversion: a guide to using PEST for groundwater-model calibration. US Geol Surv Sci Invest Rep 2010-5169, 59 pp
Doherty J, Hunt RJ (2010b) Response to comment on “two statistics for evaluating parameter identifiability and error reduction”. J Hydrol 380:489–496
Engesgaard P, Jensen KH, Molson J, Frind EO, Olsen H (1996) Large-scale dispersion in a sandy aquifer: simulation of subsurface transport of environmental tritium. Water Resour Res 32:3253–3266
Freethey GW, Spangler LE, Monheiser WJ (1994) Determination of hydrologic properties needed to calculate average linear velocity and travel time of ground water in the principal aquifer underlying the southeastern part of Salt Lake Valley, Utah. US Geol Surv Water Resour Invest Rep 92-4085, 30 pp
Furlong BV, Riley MS, Herbert AW, Ingram JA, Mackay R, Tellam JH (2011) Using regional groundwater flow models for prediction of regional wellwater quality distributions. J Hydrol 398:1–16
Article Google Scholar
Green CT, Böhlke JK, Bekins BA, Phillips SP (2010) Mixing effects on apparent reaction rates and isotope fractionation during denitrification in a heterogeneous aquifer. Water Resour Res 46:W08525. doi:10.1029/2009WR008903
Google Scholar
Harbaugh A W (2005) MODFLOW-2005, The U.S. Geological Survey modular ground-water model: the ground-water flow process. US Geol Surv Tech Methods 6 A16, variously paginated
Herweijer JC, Van Luijn GA, Appelo CAJ (1985) Calibration of a mass transport model using environmental tritium. J Hydrol 78:1–17
Hill MC (2010) Comment on “two statistics for evaluating parameter identifiability and error reduction” by John Doherty and Randall J Hunt. J Hydrol 380:481–488
Article Google Scholar
Hill MC, Tiedeman CR (2007) Effective groundwater model calibration with analysis of data, sensitivities, and uncertainty. Wiley, Hoboken, NJ, 455 pp
Book Google Scholar
Hill MC, Banta ER, Harbaugh AW, Anderman ER (2000) MODFLOW-2000, The U.S. Geological Survey modular ground-water model: user guide to the observation, sensitivity, and parameter-estimation processes and three post-processing programs. US Geol Surv Open-File Rep 00-184, 209 pp
Kauffman LJ, Baehr AL, Ayers MA, Stackelberg PE (2001) Effects of land use and travel time on the distribution of nitrate in the Kirkwood-Cohansey aquifer system in southern New Jersey. US Geol Surv Water Resour Invest Rep 01-4117
Konikow LF (2011) The secret to successful solute-transport modeling. Ground Water 49:144–159
Article Google Scholar
Konikow LF, Hornberger GZ, Halford KJ, Hanson RT (2009) Revised multi-node well (MNW2) package for MODFLOW ground-water flow model. US Geol Surv Tech Methods 6-A30, 67 pp
Lambert PM (1995) Numerical simulation of ground-water flow in basin-fill material in Salt Lake Valley, Utah. Technical publication 110-B, Utah Department of Natural Resources, Salt Lake City, UT, 70 pp
Lambert PM (1996) Numerical simulation of the movement of sulfate in ground water in southwestern Salt Lake Valley, Utah. Technical publication 110-D, Utah Department of Natural Resources, Salt Lake City, UT, 70 pp
Manning AH (2002) Using noble gas tracers to investigate mountain-block recharge to an intermountain basin. PhD Thesis, Univ. of Utah, Salt Lake City, UT, USA
Manning AH, Clark JF, Diaz SH, Rademacher LK, Earman S, Plummer LN (2012) Evolution of groundwater age in a mountain watershed over a period of thirteen years. J Hydrol 460–461:13–28
Article Google Scholar
Masterson JP, Walter DA, LeBlanc DR (2004) Transient analysis of the source of water to wells: Cape Cod, Massachusetts. Ground Water 42(1):126–134
Article Google Scholar
McMahon PB, Burow KR, Kauffman LJ, Eberts SM, Böhlke JK, Gurdak JJ (2008) Simulated response of water quality in public supply wells to land use change. Water Resour Res 44:W00A06. doi:10.1029/2007WR006731
Google Scholar
Murphy S, Ouellon T, Ballard J-M, Lefebvre R, Clark ID (2011) Tritium-helium groundwater age used to constrain a groundwater flow model of a valley-fill aquifer contaminated with trichloroethylene (Quebec, Canada). Hydrogeol J 19:195–207
Article Google Scholar
Niswonger RG, Panday S, Ibaraki M (2011) MODFLOW-NWT: a Newton formulation for MODFLOW-2005. US Geol Surv Tech Methods 6-A37, 44 pp
Poeter EP, Hill MC, Banta ER, Mehl S, Christensen S (2005) UCODE_2005 and six other computer codes for universal sensitivity analysis, calibration, and uncertainty evaluation. US Geol Surv Tech Methods 6-A11
Pollock D W (2012) User guide for MODPATH Version 6: a particle-tracking model for MODFLOW. US Geol Surv Tech Methods 6-A41, 58 pp
Reilly TE, Pollock DW (1996) Sources of water to wells for transient cyclic systems. Ground Water 34:979–988
Article Google Scholar
Reilly TE, Plummer LN, Phillips PJ, Busenberg E (1994) The use of simulation and multiple tracers to quantify groundwater flow in a shallow aquifer. Water Resour Res 30(2):421–433
Article Google Scholar
Robinson BA, Dash ZV, Srinivasan G (2010) A particle tracking transport method for the simulation of resident and flux-averaged concentration of solute plumes in groundwater models. Comput Geosci 14(4):779–792
Article Google Scholar
Rock G, Kupfersberger H (2002) Numerical delineation of transient capture zones. J Hydrol 269:134–149
Article Google Scholar
Sanford WE (2011) Calibration of models using groundwater age. Hydrogeol J 19:13–16
Article Google Scholar
Sheets RA, Bair ES, Rowe GL (1998) Use of ³H/³He ages to evaluate and improve groundwater flow models in a complex buried-valley aquifer. Water Resour Res 34(5):1077–1089
Article Google Scholar
Srinivasan G, Keating E, Moulton JD, Dash ZV, Robinson BA (2011) Convolution-based particle tracking method for transient flow. Comput Geosci 16(3):1–13
Starn JJ, Brown CJ (2007) Simulations of ground-water flow and residence time near Woodbury, Connecticut. US Geol Surv Sci Invest Rep 2007-5210, 45 pp
Starn JJ, Bagtzoglou AC, Robbins GA (2012) Methods for simulating solute breakthrough curves in pumping groundwater wells. Comput Geosci 48:244–255. doi:10.1016/j.cageo.2012.01.011
Article Google Scholar
Starn JJ, Bagtzoglou AC, Robbins GA (2013) Uncertainty in simulated groundwater quality trends in transient flow. Hydrogeol J 21(4):813–827
Article Google Scholar
Starn JJ, Green CT, Hinkle SR, Bagtzoglou AC, Stolp BJ (2014) Simulating water-quality trends in public-supply wells in transient flow systems. Groundwater 52(S1):53–62. doi:10.1111/gwat.12230
Article Google Scholar
Stolp BJ (2007) Hydrogeologic setting and ground-water flow simulations of the Salt Lake Valley Regional Study Area, Utah. In: Paschke SS (ed) Hydrogeologic settings and ground-water flow simulations for regional studies of the transport of anthropogenic and natural contaminants to public-supply wells: studies begun in 2001. US Geol Surv Prof Pap 1737-A, pp 2-1–2-22 (Section 2)
Thiros SA (2003) Quality and sources of shallow ground water in areas of recent residential development in Salt Lake Valley, Salt Lake County, Utah. US Geol Surv Water Resour Invest Rep 03-4028, 74 pp
Thiros S, Spangler L (2010) Decadal-scale changes in dissolved-solids concentrations in groundwater used for public supply, Salt Lake Valley, Utah. US Geol Surv Fact Sheet FS-2010-3073
Tiedeman CR, Green CT (2013) Effect of correlated observation error on parameters, predictions, and uncertainty. Water Resour Res 49(10):6339–6355
Article Google Scholar
USGS (2015) USGS National water information system database. http://waterdata.usgs.gov/ut/nwis/qw/. Accessed 5 April 2015
Voss CI (2011a) Editor’s Message: Groundwater modeling fantasies—part 1, adrift in the details. Hydrogeol J 19:1281–1284
Article Google Scholar
Voss CI (2011b) Editor’s Message: Groundwater modeling fantasies—part 2, down to earth. Hydrogeol J 19:1455–1458
Article Google Scholar
Wellman TP, Kauffman LJ, Clark B (2012) A zonal evaluation of intrinsic susceptibility in selected principal aquifers of the United States. J Hydrol 440–441:36–51
Article Google Scholar
Welter DE, Doherty JE, Hunt RJ, Muffels CT, Tonkin MJ, Schreuder WA (2012) Approaches in highly parameterized inversion: PEST++, a parameter estimation code optimized for large environmental models. US Geol Surv Tech Methods, Book 7, Section C5, 47 pp
Zheng C (1994) Analysis of particle tracking errors associated with spatial discretization. Ground Water 32(5):821–828
Article Google Scholar

Download references

Acknowledgements

The authors greatly appreciate the thorough review by Richard Yager, USGS, and the comments by two anonymous reviewers.

Author information

Authors and Affiliations

US Geological Survey, 101 Pitkin Street, East Hartford, CT, 06108, USA
J. Jeffrey Starn
Department of Civil and Environmental Engineering, University of Connecticut, 261 Glenbrook Road, Unit 3037, Storrs, CT, 06269, USA
J. Jeffrey Starn & Amvrossios C. Bagtzoglou
US Geological Survey, Bldg 15 McKelvey Building, 345 Middlefield Road 496, Menlo Park, CA, 94025, USA
Christopher T. Green

Authors

J. Jeffrey Starn
View author publications
You can also search for this author in PubMed Google Scholar
Amvrossios C. Bagtzoglou
View author publications
You can also search for this author in PubMed Google Scholar
Christopher T. Green
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. Jeffrey Starn.

Additional information

Published in the theme issue “Optimization for Groundwater Characterization and Management”

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Starn, J.J., Bagtzoglou, A.C. & Green, C.T. The effects of numerical-model complexity and observation type on estimated porosity values. Hydrogeol J 23, 1121–1128 (2015). https://doi.org/10.1007/s10040-015-1289-3

Download citation

Received: 13 September 2014
Accepted: 21 June 2015
Published: 12 July 2015
Issue Date: September 2015
DOI: https://doi.org/10.1007/s10040-015-1289-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The effects of numerical-model complexity and observation type on estimated porosity values

Abstract

Résumé

Resumen

摘要

Περίληψη

Resumo

Similar content being viewed by others

Impact of model complexity and multi-scale data integration on the estimation of hydrogeological parameters in a dual-porosity aquifer

Geostatistical modeling of porosity and evaluating the local and global distribution

Vertical hydraulic conductivity and layered heterogeneity: from measurements to models

Background and motivation

Approach

Hydrogeologic setting and simulation of groundwater flow

Simulation and calibration of advective transport

Results and discussion

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The effects of numerical-model complexity and observation type on estimated porosity values

Abstract

Résumé

Resumen

摘要

Περίληψη

Resumo

Similar content being viewed by others

Impact of model complexity and multi-scale data integration on the estimation of hydrogeological parameters in a dual-porosity aquifer

Geostatistical modeling of porosity and evaluating the local and global distribution

Vertical hydraulic conductivity and layered heterogeneity: from measurements to models

Background and motivation

Approach

Hydrogeologic setting and simulation of groundwater flow

Simulation and calibration of advective transport

Results and discussion

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation