Skip to main content
Log in

Heuristic search strategy based on probabilistic and geostatistical simulation approach for simultaneous identification of groundwater contaminant source and simulation model parameters

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

In this study, a heuristic search strategy based on probabilistic and geostatistical simulation approach is developed for simultaneous identification of groundwater contaminant source and simulation model parameters. The numerical simulation model, which is repeatedly invoked to evaluate the likelihood in Bayesian formula, can be substituted by the surrogate system to reduce the huge computational load. To improve the approximation accuracy of the surrogate system to the simulation model, we employ Entropy Weight method as a novel method to establish a combined surrogate system by combining Gaussian process, support vector regression, and kernel extreme learning machine. A state evaluation (heuristic) function based on Bayesian formula and the surrogate system is introduced to quantify the approximation degree of variables in current state to true values for contaminant sources and simulation model parameters. Thereafter, a heuristic search iterative process related to artificial intelligence is designed for simultaneous identification, which takes full advantage of the guidance and correction role of actual field monitoring data. A multi-vector and variable-step size random walk method is proposed to select the candidate point. A Metropolis formula based on the state evaluation function is constructed, and the result is used as the judging criterion for the state transition. Finally, simultaneous identification results are obtained when the iteration reaches the convergence criteria. The proposed approaches are tested with a numerical case study. The results indicate that the heuristic search strategy can assist in identifying groundwater contaminant source and simulation model parameters simultaneously with high accuracy and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • Asher MJ, Croke BFW, Jakeman AJ, Peeters LJM (2015) A review of surrogate models and their application to groundwater modeling. Water Resour Res 51(8):5957–5973

    Google Scholar 

  • Atmadja J, Bagtzoglou AC (2001) State of the art report on mathematical methods for groundwater pollution source identification. Environ Forensics 2(3):205–214

    CAS  Google Scholar 

  • Behzadian K, Kapelan Z, Savic D, Ardeshir A (2009) Stochastic sampling design using a multi-objective genetic algorithm and adaptive neural networks. Environ Modell Softw 24(4):530–541

    Google Scholar 

  • Chen C, Li W, Su H, Liu K (2014) Spectral-spatial classification of hyperspectral image based on kernel extreme learning machine. Remote Sens 6(6):5795–5814

    Google Scholar 

  • Datta B, Chakrabarty D, Dhar A (2009) Simultaneous identification of unknown groundwater pollution sources and estimation of aquifer parameters. J Hydrol 376(1–2):48–57

    CAS  Google Scholar 

  • Datta B, Chakrabarty D, Dhar A (2011) Identification of unknown groundwater pollution sources using classical optimization with linked simulation. J Hydro-environ Res 5(1):25–36

    Google Scholar 

  • Davey KR (2008) Latin hypercube sampling and pattern search in magnetic field optimization problems. IEEE Trans Magn 44(6):974–977

    Google Scholar 

  • Guozhen W, Zhang C, Li Y, Haixing L, Zhou H (2016) Source identification of sudden contamination based on the parameter uncertainty analysis. J Hydroinform 18(6):919–927

    Google Scholar 

  • Haario H, Laine M, Mira A, Saksman E (2006) DRAM: efficient adaptive MCMC. Stat Comput 16(4):339–354

    Google Scholar 

  • Hastings WK (1970) Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57(1):97

    Google Scholar 

  • Hora SC, Helton JC (2003) A distribution-free test for the relationship between model input and output when using Latin hypercube sampling. Reliab Eng Syst Safe 79(3):333–339

    Google Scholar 

  • Hou Z, Lu W (2018) Comparative study of surrogate models for groundwater contamination source identification at DNAPL-contaminated sites. Hydrogeol J 26(3):923–932

    Google Scholar 

  • Hou Z, Lu W, Chu H, Luo J (2015) Selecting parameter-optimized surrogate models in DNAPL-contaminated aquifer remediation strategies. Environ Eng Sci 32(12):1016–1026

    CAS  Google Scholar 

  • Hu JN, Hu JJ, Lin HB, Li XP, Jiang CL, Qiu XH, Li WS (2014) State-of-charge estimation for battery management system using optimized support vector machine for regression. J Power Sources 269:682–693

    CAS  Google Scholar 

  • Jha MK, Datta B (2014) Linked simulation-optimization based dedicated monitoring network design for unknown pollutant source identification using dynamic time warping distance. Water Resour Manag 28(12):4161–4182

    Google Scholar 

  • Jiang X, Lu W, Na J, Hou Z, Wang Y, Chi B (2018) A stochastic optimization model based on adaptive feedback correction process and surrogate model uncertainty for DNAPL-contaminated groundwater remediation design. Stoch Environ Res Risk A 32:3195–3206

    Google Scholar 

  • Jin X, Mahinthakumar G, Zechman EM, Ranjithan RS (2009) A genetic algorithm-based procedure for 3D source identification at the Borden emplacement site. J Hydroinform 11(1):51–64

    Google Scholar 

  • Laloy E, Vrugt JA (2012) High-dimensional posterior exploration of hydrologic models using multiple-try DREAM(ZS) and high-performance computing. Water Resour Res 50(3):182–205

    Google Scholar 

  • Laloy E, Rogiers B, Vrugt JA, Mallants D, Jacques D (2013) Efficient posterior exploration of a high-dimensional groundwater model from two-stage Markov chain Monte Carlo simulation and polynomial chaos expansion. Water Resour Res 49(5):2664–2682

    Google Scholar 

  • Lapworth DJ, Baran N, Stuart ME, Ward RS (2012) Emerging organic contaminants in groundwater: a review of sources, fate and occurrence. Environ Pollut 163:287–303

    CAS  Google Scholar 

  • Lilliefors HW (1967) On the Kolmogorov–Smirnov test for normality with mean and variance unknown. J Am Stat Assoc 62(318):399–402

    Google Scholar 

  • Luo J, Lu W (2014) Comparison of surrogate models with different methods in groundwater remediation process. J Earth Syst Sci 123(7):1579–1589

    Google Scholar 

  • Mahar PS, Datta B (2001) Optimal identification of ground-water pollution sources and parameter estimation. J Water Res Plan Man 127(1):20–29

    Google Scholar 

  • Massey FJ (1951) The Kolmogorov–Smirnov test for goodness of fit. J Am Stat Assoc 46(253):68–78

    Google Scholar 

  • Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations by fast computing machines. J Chem Phys 21(6):1087–1092

    CAS  Google Scholar 

  • Michalak AM, Kitanidis PK (2003) A method for enforcing parameter nonnegativity in Bayesian inverse problems with an application to contaminant source identification. Water Resour Res 39(2):1033

    Google Scholar 

  • Mirghani BY, Zechman EM, Ranjithan RS, Mahinthakumar G (2012) Enhanced simulation-optimization approach using surrogate modeling for solving inverse problems. Environ Forensics 13(4):348–363

    Google Scholar 

  • Pinder GF, Bredehoeft JD (1968) Application of the digital computer for aquifer evaluation. Water Resour Res 4(5):1069–1093

    Google Scholar 

  • Prakash O, Datta B (2012) Sequential optimal monitoring network design and iterative spatial estimation of pollutant concentration for identification of unknown groundwater pollution source locations. Environ Monit Assess 185(7):5611–5626

    Google Scholar 

  • Nguyen TH, Nong D, Paustian K (2019) Surrogate-based multi-objective optimization of management options for agricultural landscapes using artificial neural networks. Ecol Model 400:1–13

    CAS  Google Scholar 

  • Queipo NV, Haftka RT, Wei S (2005) Surrogate-based analysis and optimization. Prog Aeosp Sci 41(1):1–28

    Google Scholar 

  • Sadegh M, Vrugt JA (2015) Approximate Bayesian computation using Markov Chain Monte Carlo simulation: DREAM(ABC). Water Resour Res 50(8):6767–6787

    Google Scholar 

  • Shi X, Ye M, Curtis GP, Miller GL, Meyer PD, Kohler M, Yabusaki S, Wu J (2014) Assessment of parametric uncertainty for groundwater reactive transport modeling. Water Resour Res 50(5):4416–4439

    Google Scholar 

  • Singh RM, Datta B (2006) Identification of groundwater pollution sources using GA-based linked simulation optimization model. J Hydrol Eng 11(2):1216–1227

    Google Scholar 

  • Smith T, Sharma A, Marshall L, Mehrotra R, Sisson S (2010) Development of a formal likelihood function for improved Bayesian inference of ephemeral catchments. Water Resour Res 46(12):3909–3914

    Google Scholar 

  • Srivastava D, Singh RM (2015) Groundwater system modeling for simultaneous identification of pollution sources and parameters with uncertainty characterization. Water Resour Manag 29(13):4607–4627

    Google Scholar 

  • Wang X, Han M (2014) Online sequential extreme learning machine with kernels for nonstationary time series prediction. Neurocomputing 145:90–97

    Google Scholar 

  • Wang H, Jin X (2013) Characterization of groundwater contaminant source using Bayesian method. Stoch Environ Res Risk A 27(4):867–876

    Google Scholar 

  • Woodbury A, Sudicky E, Ulrych TJ, Ludwig R (1998) Three-dimensional plume source reconstruction using minimum relative entropy inversion. J Contam Hydrol 32(1–2):131–158

    CAS  Google Scholar 

  • Xing Z, Qu R, Zhao Y, Fu Q, Ji Y, Lu W (2019) Identifying the release history of a groundwater contaminant source based on an ensemble surrogate model. J Hydrol 572:501–516

    CAS  Google Scholar 

  • Yang H, Huang K, Chan L, King I, Lyu MR (2004) Outliers treatment in support vector regression for financial time series prediction. Lecture Notes in Computer Science, pp 1260–1265

  • Yao Y, Shen R, Pennell KG, Suuberg EM (2013) Estimation of contaminant subslab concentration in vapor intrusion including lateralsource-building separation. Vadose Zone J 12(3):1–9

    Google Scholar 

  • Yao Y, Yang F, Suuberg EM, Provoost J, Liu W (2014) Estimation of contaminant subslab concentration in petroleum vapor intrusion. J Hazard Mater 279:336–347

    CAS  Google Scholar 

  • Zanini A, Woodbury AD (2016) Contaminant source reconstruction by empirical Bayes and Akaike’s Bayesian Information Criterion. J Contam Hydrol 185–186:74–86

    Google Scholar 

  • Zeng XK, Ye M, Wu JC, Wang D, Zhu XB (2018) Improved nested sampling andsurrogate-enabled comparison with other marginal likelihood estimators. Water Resour Res 54(2):797–826

    Google Scholar 

  • Zhang Y, Kimberg DY, Coslett HB, Schwartz MF, Wang Z (2014) Multivariate lesion-symptom mapping using support vector regression. Hum Brain Mapp 35(12):5861–5876

    Google Scholar 

  • Zhang J, Zeng L, Chen C, Chen D, Wu L (2015) Efficient Bayesian experimental design for contaminant source identification. Water Resour Res 51(1):576–598

    Google Scholar 

  • Zhang J, Li W, Lin G, Zeng L, Wu L (2017) Efficient evaluation of small failure probability in high-dimensional groundwater contaminant transport modeling via a two-stage Monte Carlo method. Water Resour Res 53(3):1948–1962

    Google Scholar 

  • Zhao Y, Lu W, Xiao C (2016) A Kriging surrogate model coupled in simulation-optimization approach for identifying release history of groundwater sources. J Contam Hydrol 185–186:51–60

    Google Scholar 

  • Zheng C, Wang PP (1999) MT3DMS: a modular three-dimensional multispecies transport model for simulation of advection, dispersion, and chemical reactions of contaminants in groundwater systems. Documentation and User's Guide, Alabama Univ University

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (No. 41972252) and the National Key Research and Development Program of China (No. 2018YFC1800405).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenxi Lu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, H., Lu, W., Chang, Z. et al. Heuristic search strategy based on probabilistic and geostatistical simulation approach for simultaneous identification of groundwater contaminant source and simulation model parameters. Stoch Environ Res Risk Assess 34, 891–907 (2020). https://doi.org/10.1007/s00477-020-01804-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-020-01804-1

Keywords

Navigation