Gaussian process based modeling and experimental design for sensor calibration in drifting environments

doi:10.1016/j.snb.2015.03.071

Sensors and Actuators B: Chemical

Volume 216, September 2015, Pages 321-331

https://doi.org/10.1016/j.snb.2015.03.071 Get rights and content

Abstract

It remains a challenge to accurately calibrate a sensor subject to environmental drift. The calibration task for such a sensor is to quantify the relationship between the sensor's response and its exposure condition, which is specified by not only the analyte concentration but also the environmental factors such as temperature and humidity. This work developed a Gaussian Process (GP)-based procedure for the efficient calibration of sensors in drifting environments. Adopted as the calibration model, GP is not only able to capture the possibly nonlinear relationship between the sensor responses and the various exposure-condition factors, but also able to provide valid statistical inference for uncertainty quantification of the target estimates (e.g., the estimated analyte concentration of an unknown environment). Built on GP's inference ability, an experimental design method was developed to achieve efficient sampling of calibration data in a batch sequential manner. The resulting calibration procedure, which integrates the GP-based modeling and experimental design, was applied on a simulated chemiresistor sensor to demonstrate its effectiveness and its efficiency over the traditional method.

Introduction

Chemical sensors have been widely used in indoor and outdoor environment monitoring, vehicle exhaust measurement, human breath detection, etc [1], [2], [3]. It has been long recognized that the responses of chemical sensors, especially chemiresistors, are affected by the drift of environmental factors such as temperature and humidity [4], [5], [6], [7]. To reduce detection errors and false alarm, it is important to accurately calibrate a sensor in a drifting environment, which primarily motivated this work. The environmental factors are denoted as the vector x, and the task of sensor calibration is to establish the functional dependence of the sensor response r upon the analyte concentration c as well as x.

Quantifying the c − x − r relationship is challenging due to two main reasons: First, the variables (c, x) may affect the response r in a nonlinear fashion and also interact nonlinearly with each other. The underlying mechanism is complicated [8], [9], [10], [11] and difficult to be adequately captured by traditional regression analysis [6]. Second, to estimate a calibration model of high dimension, an extremely large sample size is typically required by the classic design of experiments (DOE) [6], [12]. Thus, there is a need to develop new modeling and DOE methods for the efficient calibration of sensors subject to environmental drift.

While focusing on calibrating sensors with environmental drift, this work falls into the research efforts to calibrate sensors with general drifting behaviors, which can be classified into two categories [13], [14]: external (i.e. environmental) and internal drifts. The latter is caused by the physical and/or chemical changes of the sensor itself, and examples of such changes include re-organization of the sensing materials and irreversible interaction with analytes. When calibrating drifting sensors, most of the literature used a reference-based linear compensation or linear regression to quantify the drifting effects [15], [16], [17], [18]. Recognizing the possible nonlinear nature of sensor drifts, powerful nonlinear models have also been employed, such as neural network [6], [19], kernel ridge regression [20] and nonlinear supporting vector machine [21]. However, in this stream of nonlinear modeling work, no effort was ever made to quantify the uncertainty of the target estimates (e.g., the analyte concentration estimated by the calibration model from an observed sensor response). This is at least partly due to the difficulties in deriving valid statistical inference (i.e., quantifying model uncertainty) based on those models [22], [23]. It is known that statistical inference lays the basis for optimum DOE: Experiments are designed to minimize the uncertainty on the model estimates of interest [24], [25], [26]. Thus, optimum DOE is a research issue that has barely been touched in the nonlinear model-based sensor calibration.

In light of the discussions above, our objective is to develop a statistical procedure, which leads to a calibration model of the highest quality by using the least experimental effort. In this work, the calibration model assumes the form of a Gaussian process (GP), which is highly flexible and able to capture practically any continuous functional relationships. GP is chosen over other powerful nonlinear models because of its statistical inference capability [27], which allows for uncertainty quantification and provides the necessary basis for optimum DOE. For sensor calibration, the inference issues are further complicated by the coexistence of forward modeling and inverse estimation (as will become clearer in Section 2.1), and hence a GP-based bootstrap resampling method is developed in this work. The DOE is performed in a batch sequential manner to circumvent the dilemma that the optimum DOE depends on the true c − x − r relationship, which however, is unknown at the stage of designing experiments [28], [25], [24]. A learning process is allowed in such a sequential procedure: For the design of a new batch of experiments to be performed, all the information derived from the experimental data already collected is utilized to search for the optimum DOE of that new batch; and the DOE optimization seeks to minimize the calibration model uncertainty with a given batch size.

The remainder of the paper is organized as follows: Section 2 presents the formulation of the calibration model, which takes the form of a GP. The GP-based model fitting and statistical inference issues are discussed in Section 3. The batch sequential procedure for sensor calibration is described in Section 4. Section 5 is devoted to an empirical study to evaluate the effectiveness and efficiency of the calibration procedure. A brief summary is given in Section 6.

Section snippets

Calibration model

For a sensor exposed to drifting environments, its calibration model needs to functionally relate the sensor response r to the target analyte concentration c as well as the environmental factors x. For notational convenience, all the exposure-condition factors are denoted as the vector $w = {(c, x^{⊤})}^{⊤}$ of d dimension, with d being a positive integer. The sensor response can be generally written as

$r (w) = E [r (w)] + ϵ = F (w) + ϵ, w \in W$ where F(w) quantifies the expected sensor response E[r(w)] as a function of w.

Experimental data for sensor calibration

To calibrate a sensor, experimental data has to be collected at a range of exposure conditions. The calibration sample data can be represented as

${(w_{i}, r_{j} (w_{i})); i = 1, 2, \dots, I, j = 1, 2, \dots, n} .$

In (7), w_i denotes the ith design point (exposure condition at which experiments are performed) out of a total of I distinct design points; r_j(w_i) denotes the observed response from the jth replication at w_i, and n is the number of replications at each design point. The sample average at w_i can then be calculated as

$\bar{r} (w$

The batch sequential procedure

There are two questions that a general DOE typically addresses: (i) At what design points should samples be allocated? (ii) At each design point, how many replications should be assigned? For the physical and/or chemical experiments involved in sensor calibration, it is a common practice to have a predetermined and fixed number of replications assigned to a design point, each time that exposure condition is selected; this is to ensure the reliability of the response measurements which are

Empirical results

The GP-based procedure was applied to calibrate a chemiresistor sensor, whose response can be substantially drifted by the fluctuations in humidity and temperature. The effectiveness of the calibration procedure is illustrated, and its efficiency over the traditional DOE method is demonstrated.

Summary

To efficiently calibrate sensors in drifting environments, a Gaussian process (GP)-based batch sequential procedure is developed. A GP form is employed for the calibration model, and used to capture the possibly comprehensive and nonlinear relationship between the sensor's response and its exposure condition including the target analyte concentrations as well as other environmental factors. Based on GP modeling, a bootstrap resampling method is developed to quantify the uncertainty of the

Acknowledgments

Research reported in this publication was partially supported by the National Science Foundation under Award Number CMMI-1068131 and by the National Institute Of Neurological Disorders And Stroke of the National Institutes of Health (NIH) under Award Number R15NS087515. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF or NIH. We thank Drs. Hossein-Babaei and Ghafarinia for providing

Zongyu Geng is a Ph.D candidate in the Industrial and Management Systems Engineering Department at West Virginia University. His research work has been focused on statistics and chemometrics.

References (55)

J. Fonollosa et al.
Human activity monitoring using gas sensor arrays
Sens. Actuators B: Chem.
(2014)
Z. Geng et al.
Optimum design of sensor arrays via simulation-based multivariate calibration
Sens. Actuators B: Chem.
(2011)
N. Docquier et al.
Combustion control and sensors: a review
Prog. Energy Combust. Sci.
(2002)
G. Martinelli et al.
Thick-film gas sensors
Sens. Actuators B
(1995)
H. Meixner et al.
Metal oxide sensors
Sens. Actuators B
(1996)
F. Hossein-Babaei et al.
Compensation for the drift-like terms caused by environmental fluctuations in the responses of chemoresistive gas sensors
Sens. Actuators B
(2010)
N. Yamazoe et al.
Interactions of tin oxide with O₂, H₂O and H₂
Surf. Sci.
(1979)
P. Mielle
Managing dynamic thermal exchanges in commercial semiconducting gas sensors
Sens. Actuators B
(1996)
J.E. Haugen et al.
A calibration method for handling the temporal drift of solid state gas-sensors
Anal. Chim. Acta
(2000)
A.C. Romain et al.
Long term stability of metal oxide-based gas sensors for e-nose environmental applications: an overview
Sens. Actuators B
(2010)

M. Padilla et al.

Drift compensation of gas sensor array data by orthogonal signal correction

Chemomet. Intell. Lab. Syst.

(2010)

A. Ziyatdinov et al.

Drift compensation of gas sensor array data by common principal component analysis

Sens. Actuators B: Chem.

(2010)

M. Ghasemi-Varnamkhasti et al.

Aging fingerprint characterization of beer using electronic nose

Sens. Actuators B: Chem.

(2011)

M.L. Frank et al.

TiO₂-based sensor arrays modeled with nonlinear regression analysis for simultaneously determining CO and O₂ concentrations at high temperatures

Sens. Actuators B

(2002)

A. Vergara et al.

Chemical gas sensor drift compensation using classifier ensembles

Sens. Actuators B: Chem.

(2012)

B. Curry et al.

Model selection in neural networks: some difficulties

Eur. J. Oper. Res.

(2006)

P. Zhang et al.

High temperature sensor array for simultaneous determination of O₂, CO, and CO₂ with kernel ridge regression data analysis

Sens. Actuators B

(2007)

H. Lei et al.

Modeling carbon black/polymer composite sensors

Sensors and Actuators B

(2007)

Z. Geng et al.

A bootstrapping-based statistical procedure for multivariate calibration of sensor arrays

Sens. Actuators B

(2013)

J. Loeppky et al.

Batch sequential designs for computer experiments

J. Stat. Plan. Infer.

(2010)

W.C. van Beers et al.

Customized sequential designs for random simulation experiments: Kriging metamodeling and bootstrapping

Eur. J. Oper. Res.

(2008)

M. D’Apuzzo et al.

Design of experiments and data-fitting techniques applied to calibration of high-frequency electromagnetic field probes

Measurement

(2011)

I. Rodríguez-Luján et al.

On the calibration of sensor arrays for pattern recognition using the minimal number of experiments

Chemomet. Intell. Lab. Syst.

(2014)

M.E. Johnson et al.

Minimax and maximin distance designs

J. Stat. Plan. Infer.

(1990)

A. Vergara et al.

Demonstration of fast and accurate discrimination and quantification of chemically similar species utilizing a single cross-selective chemiresistor

Anal. Chem.

(2014)

J.F. Boyle et al.

The effects of CO water vapor and surface temperature on the conductivity of a SnO₂ gas sensor

J. Electron. Mater.

(1997)

N. Barsan et al.

Understanding the fundamental principles of metal oxide based gas sensors; the example of CO sensing with SnO₂ sensors in the presence of humidity

J. Phys.: Condens. Matter

(2003)

Cited by (25)

Current issues and perspectives in nanosensors-based artificial olfactory systems for breath diagnostics and environmental exposure monitoring
2024, TrAC - Trends in Analytical Chemistry
Artificial olfactory systems that can provide sustainable monitoring and non-invasive diagnostics are emerging for environmental exposure detection and exhaled breath diagnostics. In particular, gas-sensing platforms based on nanosensor technology show remarkable potential for integration into portable and wearable devices and are expected to become ubiquitous in our daily lives in the future. Progress in materials science has enabled nanosensors to achieve remarkable sensitivity and selectivity in gas detection. Currently, nanosensor engineering research mainly focuses on the development of artificial olfactory systems that enable multisensing and multianalysis through nanosensor arrays. This paper provides an update on nanosensor technologies and nanomaterials for gas detection, discusses the challenges faced by artificial olfactory systems, and suggests future perspectives. Furthermore, we describe the requirements for nanosensor technologies and their progress using a materials science approach. This suggested perspective highlights the importance of nanosensors for continuous environmental exposure monitoring and point-of-care testing through exhaled breath diagnostics.
Relative humidity control during shiitake mushroom (Lentinus edodes) hot air drying based on appearance quality
2022, Journal of Food Engineering
Citation Excerpt :
When different processes are similar, we can model all processes simultaneously and take the advantage of the common aspects to improve predictive performance (Luo et al., 2018). The GPR model has been applied to many engineering applications such as photovoltaic power forecasting (Geng et al., 2015), atmospheric temperature and humidity prediction (Zhou et al., 2018) and injection molding processes Luo et al. (2018). However, it is seldom used to predict changes of material quality in drying process.
A relative humidity (RH) control method was proposed based on changes of shiitake mushroom ratio of wrinkled surface area (RWSA) obtained by computer vision technology (namely 30 %-RWSA) during which dehumidification fan was turned on with a sudden increase of RWSA. Three other RH control methods of continuous fanning, setting constant target RH (namely Whole process 30 %) and controlling RH based on RH change (namely 30 %-RH) were also carried out for comparison. Drying time of 30 %-RWSA group was significantly (p ≤ 0.05) shortened, while shrinkage, RWSA, rehydration ratio were insignificantly different from (p > 0.05) those of 30 %-RH group. The cell aspect ratio of continuous fanning group was the lowest. The results showed that the qualities of shiitake mushrooms could not be improved by shortening RH keeping time, shiitake mushroom should be dried in low RH environment. The Gaussian process regression (GPR) model was effective for on-line prediction of RWSA.
Various uncertainties self-correction method for the supervisory control of a hybrid cooling system in data centers
2021, Journal of Building Engineering
Citation Excerpt :
Virtual in-situ self-correction of uncertain parameters refers to the correction of parameter errors/uncertainties by means of models and other available measurement parameters. It has been proposed in different fields due to the excellent performance of reducing the uncertainty of parameters, such as computer science [21,22], chemical [23], optical [24], etc. However, HVAC systems in buildings are not mass produced or well instrumented, but usually unique [25].
The uncertainties of the crucial parameters have a considerable influence on the supervisory control performance of the water side free cooling system in data centers, especially for the model-based optimal control. To reduce the uncertainty of the key parameters in an operational data center hybrid cooling system, this study proposed an uncertainties self-correction method based on Bayesian Inference and Markov Chain Monte Carlo (BI-MCMC) theory. Four novel enhanced self-correction strategies, including the benchmarks extended, adding sensitivity coefficient, local correction and prior distribution updated, were developed to handle the negative impacts caused by the various uncertain working conditions and complex relationships between parameters. The performance of the proposed method was fully investigated under the cases with single/multiple uncertain parameters with different degrees of uncertainty and deviation. For the uncertainties problem with single parameter error scenario, the basic BI-MCMC method reduced the error by at least 92.5%. For the multiple parameter uncertain scenarios, the designed enhanced strategies can significantly reduce the degree of the uncertainty, i.e. the correction accuracy is up to 98% and the absolute correction deviations are not more than 0.02. Therefore, the proposed method could provide a solution to the current challenges in handling the uncertainties and establishing a reliable model for the supervisory control.
In-situ sensor calibration in an operational air-handling unit coupling autoencoder and Bayesian inference
2020, Energy and Buildings
Citation Excerpt :
These calibrated datasets can be widely used for (1) monitoring/controlling systems, and (2) training/testing various data-driven applications, thereby improving their performance; in this regard, in-situ calibration potentially offer greater advantages when compared with SFDD. In various fields, in-situ calibration methods have been developed; for example, computer science [19,20], chemistry [21–23], optical engineering [24,25], etc. [26,27]. These methods provide good calibration performance when judged on the basis of dense deployment of sensors, reference sensor readings, known conditions, and use of additional equations; they effectively eliminate unknown parameters for a known (better) calibration environment.
Sensor errors have a considerable influence on the system operation and energy usage in an air handling unit. Sensor fault detection and diagnosis (SFDD) has been widely studied to handle the impacts of sensor errors on an air-handling unit (AHU). Beyond the SFDD, in-situ calibration can correct the faulty sensor (especially for systematic errors) automatically in field, thereby reducing the energy waste. In this study, we propose an advanced in-situ sensor calibration named virtual in-situ calibration in an operational AHU. The suggested method is intended to overcome the challenges of previous in-situ calibration methods by coupling the Bayesian inference and autoencoder. In a given sensor calibration domain of an AHU, based on the unsupervised learning neural network feature trained to duplicate their input variables, the autoencoder-coupled calibration can produce system and sensor models effectively without additional sensors and assumptions, which are the main limitations of the earlier methods. It improves the calibration performance and applicability in the AHU. In addition, a three-step strategy to construct autoencoder input variables and a new distance function to achieve successful calibration under various faulty conditions is proposed. In a case study, where the error in the cooling coil supply temperature (+2 °C) caused a total energy increase of 38%, the present method is shown to eliminate the sensor error and the energy waste completely. These results show the capabilities and potentials of the suggested method in the self-repair, diagnostics, and automation of a building energy sector.
Mixed-effects Gaussian process modeling approach with application in injection molding processes
2018, Journal of Process Control
We propose a new nonparametric approach for multi-process data analysis, in which each of the process is modeled as a combination of a fixed-effect and a random-effect Gaussian process (GP) regression model, namely, a mixed-effect Gaussian process (ME-GP) model. The ME-GP approach provides a flexible means to combine the common aspects of all processes and describe the heterogeneity among different processes. In particular, we model the mean and covariance structures of both the fixed- and random-effects simultaneously, and predict a future input using probability density distributions. We apply the ME-GP model to predict the melt-flow-length for filling of different molds in injection molding processes. It is shown that the ME-GP model obtains an improved performance against GP model only.
Extended virtual in-situ calibration method in building systems using Bayesian inference
2017, Automation in Construction
Citation Excerpt :
Several modeling-based methods have been proposed in different research fields to convert these to determined problems, such as an on-line calibration [17], a collaborative calibration [9], a blind calibration [18], and a self-calibration [19]. In chemistry, various calibration methods [20–23] use more than one reference sensor for benchmarks, which can be considered as a known calibration environment. Literature in this regard mainly comes from fields where sensor redundancy, high quality sensors, or known relationships between sensors do exist.
Measurements from sensors and knowledge of key parameters are of great importance in the operation of modern building systems. Accurate and reliable information as these serves as the base for ensuring the desired performance of control algorithms, fault detection and diagnostics rules, analytical optimization strategies. They are also crucial for developing trust-worthy building models. However, unlike mass produced industrial devices, building systems are generally one of a kind and sparsely instrumented. Despite the indispensable need, dense deployment of sensors or a periodic manual calibration for ensuring the quality of thousands variables in building systems is not practical. To address the challenge, we extend our virtual in-situ calibration method by marrying it with Bayesian inference, which has a better capability in handling uncertainties. Strategies, including local, global, and combined calibration, are evaluated in a case with various sensor errors and uncertain parameters. The detailed procedure and results are presented.

View all citing articles on Scopus

Zongyu Geng is a Ph.D candidate in the Industrial and Management Systems Engineering Department at West Virginia University. His research work has been focused on statistics and chemometrics.

Feng Yang is currently an associate professor in the Industrial and Management Systems Engineering Department at West Virginia University. She received her Ph.D. degree in Industrial Engineering and Management Sciences from Northwestern University in 2006. Her research interests include stochastic simulation and metamodeling, design of experiments, and applied statistics.

Xi Chen is an assistant professor in the Department of Industrial and Systems Engineering at Virginia Tech. Her research interests include stochastic modeling and simulation, applied probability and statistics, computer experiment design and analysis, and simulation optimization.

Nianqiang (Nick) Wu received a Ph.D degree in materials science and engineering in 1997. He worked at Keck Interdisciplinary Surface Science Center in Northwestern University from 2001 to 2005. Currently he is a professor of materials science in Mechanical Engineering and Aerospace Engineering, West Virginia University. His research interest lies in low-dimensional nanomaterials, chemical sensors and biosensors, photocatalysts, photoelectrochemical cells and solar cells.

View full text

Gaussian process based modeling and experimental design for sensor calibration in drifting environments

Abstract

Introduction

Section snippets

Calibration model

Experimental data for sensor calibration

The batch sequential procedure

Empirical results

Summary

Acknowledgments

Sens. Actuators B: Chem.

Sens. Actuators B: Chem.

Prog. Energy Combust. Sci.

Sens. Actuators B

Sens. Actuators B

Sens. Actuators B

Surf. Sci.

Sens. Actuators B

Anal. Chim. Acta

Sens. Actuators B

Chemomet. Intell. Lab. Syst.

Sens. Actuators B: Chem.

Sens. Actuators B: Chem.

Sens. Actuators B

Sens. Actuators B: Chem.

Eur. J. Oper. Res.

Sens. Actuators B

Sensors and Actuators B

Sens. Actuators B

J. Stat. Plan. Infer.

Eur. J. Oper. Res.

Measurement

Chemomet. Intell. Lab. Syst.

J. Stat. Plan. Infer.

Demonstration of fast and accurate discrimination and quantification of chemically similar species utilizing a single cross-selective chemiresistor

Anal. Chem.

The effects of CO water vapor and surface temperature on the conductivity of a SnO2 gas sensor

J. Electron. Mater.

Understanding the fundamental principles of metal oxide based gas sensors; the example of CO sensing with SnO2 sensors in the presence of humidity

J. Phys.: Condens. Matter

The effects of CO water vapor and surface temperature on the conductivity of a SnO₂ gas sensor

Understanding the fundamental principles of metal oxide based gas sensors; the example of CO sensing with SnO₂ sensors in the presence of humidity