Statistical monitoring of dynamic processes based on dynamic independent component analysis

doi:10.1016/j.ces.2004.04.031

Chemical Engineering Science

Volume 59, Issue 14, July 2004, Pages 2995-3006

https://doi.org/10.1016/j.ces.2004.04.031 Get rights and content

Abstract

Most multivariate statistical monitoring methods based on principal component analysis (PCA) assume implicitly that the observations at one time are statistically independent of observations at past time and the latent variables follow a Gaussian distribution. However, in real chemical and biological processes, these assumptions are invalid because of their dynamic and nonlinear characteristics. Therefore, monitoring charts based on conventional PCA tend to show many false alarms and bad detectability. In this paper, a new statistical process monitoring method using dynamic independent component analysis (DICA) is proposed to overcome these disadvantages. ICA is a recently developed technique for revealing hidden factors that underlies sets of measurements followed on a non-Gaussian distribution. Its goal is to decompose a set of multivariate data into a base of statistically independent components without a loss of information. The proposed DICA monitoring method is applying ICA to the augmenting matrix with time-lagged variables. DICA can show more powerful monitoring performance in the case of a dynamic process since it can extract source signals which are independent of the auto- and cross-correlation of variables. It is applied to fault detection in both a simple multivariate dynamic process and the Tennessee Eastman process. The simulation results clearly show that the method effectively detects faults in a multivariate dynamic process.

Introduction

In most chemical plants, on-line monitoring and fault diagnosis of the process operating performance are gaining importance for plant safety and the maintenance of yield and quality in a process. An important aspect for the safe operation of chemical processes is the rapid detection of faults or process upsets and the removal of the factors causing such events. Traditionally, statistical process control (SPC) has been used to monitor individual process signals to detect trends, outliers and other anomalies. However, these procedures are of limited use with high-dimensional multivariate data that are strongly cross-correlated. The need to monitor such multivariate processes has led to the development of many process monitoring schemes that use multivariate statistical methods based on principal component analysis (PCA) and partial least squares (PLS). These methods have been used and extended in various applications (Nomikos and MacGregor, 1994; Wise and Gallagher, 1996; Dong and McAvoy, 1996; Bakshi, 1998; Li et al., 2000).

Most multivariate statistical monitoring methods based on PCA assume implicitly that the observations at one time are statistically independent to observations at past time and the latent variables follow a Gaussian distribution. However, in chemical processes, variables rarely remain at a steady state but rather are driven by random noise and uncontrollable disturbances. These effects make the variables have autocorrelation and the system have dynamic properties. This suggests that a method taking into account the serial correlations in the data is needed in order to implement a process monitoring method. Ku et al. (1995) proposed dynamic PCA (DPCA) that uses an augmenting matrix with time-lagged variables. DPCA can extract the time-series model from the eigenvectors of the covariance matrix that corresponds to zero eigenvalues. For its simplicity, DPCA has been used in many cases with other developed methods. Luo et al. (1999) used multiscale analysis and DPCA for sensor fault detection. Tsung (2000) provided an integrated approach to simultaneously monitor and diagnose an automatic controlled process by using DPCA and minimax distance classifier. Yoo et al. (2002) proposed a dynamic monitoring method for multiscale fault detection and diagnosis in the wastewater treatment process, which is based on DPCA, the D statistic, and the monitoring of individual eigenvalues of generic dissimilarity measure (GDM).

Recently, several works using a state space model have been proposed to capture process dynamics. Negiz and Cinar (1997) proposed a monitoring method that utilizes a state space identification technique based on canonical variate analysis (CVA) to solve the dynamic problem. This method takes serial correlations into account during the dimension reduction step, like DPCA, and uses the state variables for computing the monitoring statistic in order to remove the serial correlation. Russell et al. (2000) evaluated and compared the performance of PCA, DPCA and CVA for detecting faults in a realistic chemical process simulation. They also suggested a CVA-based residual space statistic (T_r²) that gave better overall sensitivity and promptness than the existing PCA, DPCA, and CVA statistics. Simoglou et al. (2002) identified the system states and the state space model parameters using the multivariate statistical projection techniques of CVA and PLS. In their paper, a number of metrics based on Hotelling's T² statistic are proposed for the monitoring of the state of the system and the confidence limits for these metrics are calculated using the empirical reference distribution.

There are other approaches for monitoring the dynamic process efficiently. Kano et al. (2002) suggested a statistical process monitoring based on the dissimilarity of process data. It is based on the idea that a change of operating condition can be detected by monitoring the distribution of time-series process data because the distribution reflects the corresponding operating condition. Chiang and Braatz (2003) proposed an advanced method to compare distribution, where the modified distance (DI), based on Kullback-Libler information distance, is used to measure the similarity of the measured variable between the current operating conditions and the historical operating conditions. They also suggested the modified causal dependency (CD) to measure the causal dependency of two variables. Chen and Liao (2002) developed a new monitoring method, NNPCA, which integrates two data driven techniques, neural network (NN) and PCA, to handle the nonlinear dynamic process. The proposed technique uses NN as the nonlinear dynamic operator to remove the nonlinear and dynamic characteristics and applies PCA to generating simple monitoring charts based on the multivariable residuals derived from the difference between the process measurements and the neural network prediction.

More recently, monitoring methods based on independent component analysis (ICA) have been developed (Kano et al., 2003; Lee 2003a, Lee 2003b). The goal of ICA is to decompose observed data into linear combinations of statistically independent components. PCA can only impose independence up to second order statistics information (mean and variance) whereas ICA involves higher-order statistics, i.e., it not only decorrelates the data (second order statistics) but also reduces higher order statistical dependencies (Lee, 1998). Therefore, an ICA based monitoring method can give more sophisticated results than a PCA based one since ICA can extract the essential independent components that drive a process.

In this paper, ICA monitoring on lagged variables, called DICA monitoring, is suggested for developing dynamic models and improving the monitoring performance. In order to consider auto correlation, the time-lagged extension of the data matrix is performed before applying ICA. This paper is organized as follows. PCA and DPCA monitoring methods are introduced in Section 2. In Section 3, the DICA monitoring method is explained in detail with ICA algorithm and monitoring statistics. The superiority of DICA monitoring method over PCA, DPCA, and ICA ones is illustrated in Section 4 through two examples of a simple multivariate process and the Tennessee Eastman process. Finally, conclusion will be presented in Section 5.

Section snippets

PCA and DPCA monitoring

PCA has been widely used in the field of process monitoring since it can handle high dimensional, noisy, and correlated data by projecting the data onto a lower dimensional subspace which contains most of the variance of the original data (Wise and Gallagher, 1996). It decomposes the data matrix into the sum of the outer product of score vectors and loading vectors. Two typical statistical indices of T² and squared prediction error (SPE) are used in the PCA monitoring (Kresta et al., 1991). A

Independent component analysis (ICA)

ICA is a statistical technique for revealing hidden independent components that underlie sets of random variables, measurements, or signals. In the ICA algorithm, it is assumed that at time k the observed d-dimensional data vector $x (k)=[x_{1} (k),…,x_{d} (k)]^{T}$ can be expressed as linear combinations of m unknown independent components, s₁(k),…,s_m(k), given by the model, $x (k)= As (k)+ e (k),$ where $A ∈R^{d×m}$ is the unknown mixing matrix, $s (k)=[s_{1} (k),…,s_{m} (k)]^{T}$ is the independent component vector and $e (k)$ is the

Application

In this section, several monitoring methods, including PCA, DPCA, ICA, and DICA, are applied to monitoring problems of a simple multivariate dynamic process and the Tennessee Eastman process.

Conclusions

In this paper, a new statistical process monitoring method using dynamic independent component analysis is proposed to monitor a process with auto- and cross-correlated variables. Since the goal of ICA is to find a linear representation of non-Gaussian data so that the components are statistically independent up to more than second order statistics, ICA can reveal more useful information than PCA. The proposed monitoring method, DICA, using ICA to the augmenting matrix with time-lagged

Acknowledgements

This work was supported by a grant No. (R01-2002-000-00007-0) from Korea Science & Engineering Foundation.

References (33)

G Chen et al.
Predictive on-line monitoring of continuous processes
Journal of Process Control
(1998)
J Chen et al.
Dynamic process fault monitoring based on neural network and PCA
Journal of Process Control
(2002)
L.H Chiang et al.
Process monitoring using causal map and multivariate statisticsfault detection and identification
Chemometrics and Intelligent Laboratory Systems
(2003)
L.H Chiang et al.
Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis
Chemometrics and Intelligent Laboratory Systems
(2000)
D Dong et al.
Nonlinear principal component analysis—based on principal curves and neural networks
Computers and Chemical Engineering
(1996)
A Hyvärinen et al.
Independent component analysisalgorithms and applications
Neural Networks
(2000)
M Kano et al.
Evolution of multivariate statistical process control: application of independent component analysis and external analysis
Computers and Chemical Engineering
(2004)
W Ku et al.
Disturbance detection and isolation by dynamic principal component analysis
Chemometrics and Intelligent Laboratory Systems
(1995)
W Li et al.
Recursive PCA for adaptive process monitoring
Journal of Process Control
(2000)
P.R Lyman et al.
Plant-wide control of the Tennessee eastman problem
Computers and Chemical Engineering
(1995)

E.B Martin et al.

Non-parametric confidence bounds for process performance monitoring charts

Journal of Process Control

(1996)

A.C Raich et al.

Multivariate statistical methods for monitoring continuous processesassessment of discriminatory power disturbance models and diagnosis of multiple disturbances

Chemometrics and Intelligent Laboratory Systems

(1995)

E.L Russell et al.

Fault detection in industrial processes using canonical variate analysis and dynamic principal component analysis

Chemometrics and Intelligent Laboratory Systems

(2000)

A Simoglou et al.

Statistical performance monitoring of dynamic multivariate processes using state space modeling

Computers and Chemical Engineering

(2002)

B.M Wise et al.

The process chemometrics approach to process monitoring and fault detection

Journal of Process Control

(1996)

B.R Bakshi

Multiscale PCA with application to multivariate statistical process monitoring

American Institute of Chemical Engineering Journal

(1998)

Cited by (327)

Dynamic multiobjective optimization with varying number of objectives assisted by dynamic principal component analysis
2024, Information Sciences
Dynamic multi-objective optimization problems, which are equipped with the increment or decrement number of time-varying objective functions, have been hardly researched in recent decades. Different from other dynamism handling approaches, we propose a new framework incorporated with the dynamic principal component analysis technique, which embeds the acquired knowledge of Pareto optimal set during the evolutionary search process. As the environmental changes occur, the dynamic principal component analysis technique learns the global structure of Pareto optimal set incrementally as newly generated data are collected to depict the manifold contour. In addition, this method constructs high-quality solutions on the basis of obtained knowledge, which in turn captures the main structure of previous solutions. We undertake comprehensive experiments in which the benchmark instances are given with a varying number of objective functions and the computational values are assessed with respect to performance metrics. The obtained statistical findings with 70% improvement fully demonstrate that our proposed algorithm is efficient and effective for solving dynamic multi-objective optimization problems.
Incipient fault detection enhancement based on spatial-temporal multi-mode siamese feature contrast learning for industrial dynamic process
2024, Computers in Industry
Incipient faults are characterized by low-amplitude, unclear fault features, which are susceptible to unknown disturbances, leading to unsatisfactory detection performance. In this paper, an incipient fault detection enhancement method based on siamese spatial-temporal multi-mode feature contrast learning method is proposed. Firstly, we design a novel siamese spatial-temporal multi-mode convolutional neural network model consisting of two weight-shared spatial-temporal multi-mode convolutional neural networks and one feature discrimination measure operator, which are then used to extract the spatial-temporal multi-mode features of two datasets and to measure the distance between them. Then, an incipient fault feature discrimination intensification training strategy is developed to enhance the incipient fault detection performance. Specifically, this strategy intends to maximize the feature distance between the normal data and the incipient fault data, as well as that between different incipient faults, while minimizing the feature distance between the normal data and between the same incipient faults. Moreover, due to the long-term slow change characteristic of the incipient fault, the multi-head self-attention Long Short-Term Memory is presented as a dynamic feature learning model to further lopsidedly learn the incipient fault temporal long-term dependency according to attention weights utilizing the scaled dot-product multi-head self-attention mechanism. Finally, the performance of the proposed method is demonstrated on two industrial cases.
An industrial process fault diagnosis method based on independent slow feature analysis and stacked sparse autoencoder network
2024, Journal of the Franklin Institute
Deep learning, with its powerful multilayer nonlinear representation of deep neural networks, enables models trained based on deep learning to describe the true distribution of data more accurately and thus have better generalization capabilities. However, the training data are usually assumed to obey independent identical distribution. Industrial process data nowadays usually have complex characteristics such as non-Gaussian, nonlinear, dynamic, strong correlation, and high-dimensional, which make it difficult to meet the assumptions of deep learning. Using industrial process data directly would result in low model accuracy and bias in the model output. This paper proposes a fault diagnosis method based on independent slow feature analysis (ISFA) and stacked sparse autoencoder (SSAE) network. The statistical properties possessed by ISFA are used to perform preliminary feature extraction of industrial process data, and then the extracted preliminary features are input to the SSAE network for fine-grained feature extraction. The proposed two-step feature extraction method not only makes the data input to the SSAE network as close as possible to obey independent identical distribution and provides convincing fault diagnosis results, but also makes full use of the powerful feature extraction capability of the SSAE network, which makes the proposed method also significantly improve the detection accuracy of incipient faults. The feasibility and superiority of the proposed method is verified through the TE process.
Smart batch process: The evolution from 1D and 2D to new 3D perspectives in the era of Big Data
2023, Journal of Process Control
Big Data will revolutionize modern industry by improving process optimization, facilitating insight discovery, and improving decision-making. This big data revolution presents a multitude of possibilities and challenges in evolving from traditional batch processes to smart batch processes. This tremendous potential requires the ability to extract value from vast amounts of industrial process data. Using a new three-dimensional (3D) perspective of time, batch, and operational context, this paper explores smart batch processes with higher efficiency, greater profitability, and longer sustainability. First, we review the traditional one-dimensional (1D) perspective on batch processes and summarize the existing two-dimensional (2D) perspectives on batch processes, i.e., modeling, monitoring, control, and optimization methods. Based on those results, the spotlight will focus on how big data can be used to achieve smart batch processes using the 3D perspective. This will include detailed discussions of definitions and concepts, operational mechanisms, and the benefits and advantages of smart batch processes. For further implementation of the 3D perspective, we present several monitoring and control methodologies. Next, we analyze several challenges and issues in implementing smart batch processes in the era of Big Data. In conclusion, we provide both a novel viewpoint and encouragement for future research into batch process automation from the 3D perspective.
Uncovering sensor faults in wind turbines: An improved multivariate statistical approach for condition monitoring using SCADA data
2023, Sustainable Energy, Grids and Networks
Fault detection in wind turbines is essential for ensuring their safety, reliability, and optimal performance. However, some of the existing approaches for sensor fault detection in wind turbines face several challenges that hinder their effectiveness. These challenges include handling multivariate and non-Gaussian data, low sensitivity to small changes, and setting an appropriate detection threshold to avoid false alarms. Additionally, constructing an analytical model for monitoring wind turbines becomes particularly challenging and time-consuming, especially for large-scale wind turbines. This paper proposes a semi-supervised data-driven approach for sensor fault detection in wind turbines using supervisory control and data acquisition (SCADA) data. The proposed approach combines the advantages of independent component analysis (ICA) and the Kantorovich Distance (KD)-based fault detection scheme. ICA enables efficient handling of multivariate non-Gaussian data, while the KD scheme provides a sensitive indicator for assessing the residuals obtained from ICA. The ICA-based KD scheme needs only fault-free data in training, making it more attractive for fault detection in practice. Kernel density estimation is employed to compute the detection threshold of the KD scheme, making it more flexible. Experimental evaluations using simulated sensor faults based on real wind turbine data demonstrate the superior detection performance of the proposed approach, achieving an average F1-score of approximately 0.96 and outperforming conventional approaches.
A novel white component analysis for dynamic process monitoring
2023, Journal of Process Control
Dynamic principal component analysis has long been a popular multivariate statistical process monitoring method. However, the resulting residuals are typically subject to serial correlations, thereby compromising the detection capability in face of dynamic data. In this paper, a novel white component analysis (WCA) model is proposed to enforce latent variables to have noise-like properties, thereby well catering to the independent assumption that is critical in monitoring statistic design. A new whiteness index for finite-length time series is put forward, which acts as the objective of each component. To solve the optimization problem, a tailored algorithm based on alternating direction method of multipliers is developed. Using “white” components, a new $W$ -statistic is coined to effectively detect violation of dynamic relations by inspecting the presence of unmodeled dynamics. By incorporating classical variance-based statistics, we arrive at a new dynamic process monitoring scheme that offers deep insights into abnormal situations. Comprehensive case studies corroborate the validity of the WCA-based process monitoring approach, and in particular, the sensitivity of $W$ -statistic to dynamics anomalies.

View all citing articles on Scopus

View full text

Statistical monitoring of dynamic processes based on dynamic independent component analysis

Abstract

Introduction

Section snippets

PCA and DPCA monitoring

Independent component analysis (ICA)

Application

Conclusions

Acknowledgements

Journal of Process Control

Journal of Process Control

Chemometrics and Intelligent Laboratory Systems

Chemometrics and Intelligent Laboratory Systems

Computers and Chemical Engineering

Neural Networks

Computers and Chemical Engineering

Chemometrics and Intelligent Laboratory Systems

Journal of Process Control

Computers and Chemical Engineering

Journal of Process Control

Chemometrics and Intelligent Laboratory Systems

Chemometrics and Intelligent Laboratory Systems

Computers and Chemical Engineering

Journal of Process Control

Multiscale PCA with application to multivariate statistical process monitoring

American Institute of Chemical Engineering Journal