Measuring and comparing the resolution performance and the extent of rotation ambiguities of some bilinear modeling methods

https://doi.org/10.1016/j.chemolab.2015.08.005Get rights and content

Highlights

  • Results from different bilinear resolution methods are compared.

  • Rotation ambiguities are ubiquitously present and they should be considered.

  • Extent of rotation ambiguities and AFS are calculated for different datasets.

  • The results estimated by the MCR-BANDS and FAC-PACK are in agreement

Abstract

Bilinear models are often used in the analysis of datasets from spectroscopy and chromatography. Whenever bilinear soft modeling approaches are applied, rotation ambiguities are ubiquitously present and they should be considered. In this work, results obtained by the application of different methods like independent component analysis (ICA), principal component analysis (PCA), and minimum volume simplex analysis (MVSA) are compared with those obtained by multivariate curve resolution (MCR). In order to do this comparison, mutual information (MI), Amari index (AI), and lack-of-fit (lof) parameters are used for the evaluation of the different methods, and the corresponding areas or regions of feasible solutions (AFS) and their boundaries are investigated in each case. The results obtained by the MCR-BANDS method in the calculation of the extension of rotation ambiguities are discussed and compared with those obtained by the FAC-PACK method, which has been recently proposed for the estimation of the whole range of feasible solutions.

Introduction

Chemometric methods provide powerful tools to analyze multi- and megavariate data from modern analytical instruments. Some of these chemometric methods, in particular multivariate curve resolution (MCR) methods, have been proposed for the resolution of chemical data obtained from chromatography [1], spectroscopy [2], nuclear magnetic resonance [3], hyperspectral imaging [4], voltammetry [5], omics microarray [6], and LC-MS [7]data, among others [8], [9]. MCR methods are a group of methods based on the fulfillment of a bilinear model which attempt the extraction of the true underlying sources of chemical variation using a minimum amount of prior assumptions about the process under investigation. For the analysis of complex multi-component mixture systems, they offer the possibility of resolution, identification, and also quantification [10] of the different components present in an unknown mixture, without needing their previous chemical and physical separation.

MCR chemometric methods have their intrinsic drawbacks, especially that they cannot assure encountering a unique solution to explain the measured experimental variation in the data and that a range of feasible solutions may be obtained by their application. Ambiguities appear because different linear combinations of the component profiles fulfilling the constraints of the system fit equally well the data [11]. Unfortunately, the presence of rotation ambiguities and of non-unique solutions decreases the reliability of MCR methods and makes their assessment more difficult. The only way to reduce the extent of rotation ambiguities and to obtain solutions closer to true ones is by the application of additional constraints (soft or hard) which implies using more knowledge about the data system, or also moving from bilinear modeling to multilinear modeling [12].

Bilinear modeling methods like minimum volume simplex analysis (MVSA), independent component analysis (ICA), principal component analysis (PCA), and multivariate curve resolution-function minimization (MCR-FMIN) have already been compared in previously published papers [13], [14]. MVSA initially was developed for satellite imaging individual component (end member) resolution, and more recently, it has been also proposed in analytical chemistry [15]. PCA considers the information between the different components to be orthogonal or linearly uncorrelated [16]. ICA assumes that the components are mutually statistically independent [17]. These assumptions are statistically different (the latter is more restricted than the first), and therefore, the results are different. In particular, ICA and PCA can be used for different purposes like data preprocessing, exploration, classification, regression, and resolution. All these methods have been proposed for analytical chemistry purposes, and some authors have investigated whether one method is better than the other. Different from these approaches, based on statistical assumptions, multivariate curve resolution methods, especially those based in alternating least squares (MCR-ALS), use more natural and physically and chemically meaningful assumptions by means of constraints, like non-negativity, unimodality, closure, selectivity, or local rank, and by means of other constraints related to the data structure (like trilinearity or multilinearity) and find an optimum solution from a least squares fitting convergence criterium [18]. MCR-FMIN has been also proposed as a different way for multivariate curve resolution and it is based on non-linear optimization algorithms using non-linear constraints [19]. MCR-FMIN uses PCA scores and loadings to define the subspace of MCR solutions and rotates them to fulfill the constraints of the system. Therefore, it is also interesting to compare their solutions with those obtained by PCA and MCR-ALS.

In order to evaluate the effect of rotation ambiguities associated to a particular MCR solution and to measure its extent, Lawton and Sylvestre [20] already proposed a first algorithm for determining the area of feasible solutions (AFS) in two-component systems under the assumption of non-negative spectra and concentration profiles. Borgen et al. [21] extended Lawton and Sylvestre's method to three component systems and proposed a linear programming optimization method to calculate the permitted ranges of pure component spectra using tangent and simplex rotation algorithms. Rajkó and István [22] revised Borgen's study and used computational geometry tools to draw Borgen plots of three-component systems. Leger and Wentzell developed a dynamic Monte Carlo SMCR method [23] which seeks to define the boundaries of allowable pure component profiles. For the calculation of the whole range of feasible solutions, a systematic grid search method based on species-based particle swarm optimization has been proposed for three-component systems by H. Abdollahi et al [24], [25]. A. Golshan et al. have also developed a method that finds the simplex volume containing all feasible solutions and facilitate the determination and visualization of rotational ambiguities of four-component mixture [25].

R. Tauler developed the MCR-BANDS method [26] based on a previous idea of P. Gemperline [27], for the calculation of the extension of rotation ambiguities, based on the fast maximization and minimization of a function defined by the relative signal component contribution (SCCF) of each component [11], [23]. This method has no limitation for the number of components and it uses the same constraints as those applied to find out the MCR solution. It gives a simple evaluation of the extent of rotation ambiguity from the difference between the maximum and minimum values of the SCCF function. Recently, Sawall et al. suggested a fast accurate algorithm to find the AFS for two- and three-component systems based on the use of a polygon inflation algorithm [28]. FAC-PACK is an interactive MATLAB toolbox for the computation of non-negative multi-component factorizations and for the numerical approximation of the area of feasible solutions using the inflation polygon algorithm [28].

In this work, FAC-PACK results are compared to those obtained by MCR-BANDS, and with the solutions obtained by different bilinear model methods such as PCA, ICA, MVSA, MCR-FMIN, and MCR-ALS. The aim of this work is to get a deeper understanding of MCR methods and evaluate their performance under different constraints. In addition, the extension of rotation ambiguities associated to MCR solutions is investigated by the MCR-BANDS and FAC-PACK methods. The comparison of results obtained by these two methods can help to evaluate the reliability of their results and to get a deeper understanding of their principles.

Section snippets

Theory

Second-order data (a data matrix) generally can be decomposed by bilinear model-based methods according to Eq. (1).D=CST+E=D*+Ewhere D (I,J) is the experimental data matrix corresponding to a bilinear system with I different samples and J different variables., C (I,N) is the contribution of the N components in each sample, S (J,N) is the pure response matrix of the N components, E (I,J)is the matrix associated to noise or experimental error. Given the data matrix D, the aim of bilinear model is

Principal component analysis (PCA)

PCA provides a mathematical and very efficient way to solve the bilinear model and perform the matrix decomposition given in Eq. (1). PCA decomposes the measurement matrix D into the scores CPCA ܂and loadings SPCA orthogonal factor matrices, and a reduced number of components are selected which explain maximum data variance. The aim of the method is to maximize the explained variance in the data with a minimum number of components. Due to the applied constraints during the PCA bilinear

Independent component analysis (ICA)

The aim of ICA is the decomposition of the measured multivariate signals into statistically independent component contributions with a minimum loss of information. ICA assumes that the mixing vectors in C are linearly independent and that the components in S are mutually statistically independent, as well as independent of noise components. This goal is equivalent to finding an unmixing matrix W that satisfiesWX=ST^where Ŝ is the estimation of the S. The main task of ICA is to find out the

Minimum volume simplex analysis (MVSA) method

MVSA also considers that the underlying mixing model is bilinear, i.e. that the measured spectral vectors are a linear combination of signatures (spectra) of pure components. MVSA is a method that finds the pure components (end members) by fitting the data to a minimum volume simplex, under some constraints, such as having for every pixel no less than zero abundance fractions (non-negativity constraint) and that their sum should be equal to one (closure). The MVSA method starts with an estimate

Multivariate curve resolution-alternating least squares (MCR-ALS)

MCR-ALS solves Eq. (1) iteratively using an ALS algorithm, which optimally fits the experimental data matrix D, and resolves the ‘true’ pure response profiles, in concentration C and pure spectra ST matrices. This optimization is carried out for a proposed number of components using initial estimates of either C or ST. These initial estimates of C or ST can be extracted using procedures for purest variables selection, such as SIMPLISMA [32]. During the ALS optimization, several constraints can

Multivariate curve resolution-objective function minimization (MCR-FMIN)

MCR-FMIN is based on the minimization of an objective function defined directly from the non-fulfillment of constraints and being always in the subspace spanned by PCA solutions. In other words, an appropriate rotation of the PCA solutions is performed searching for physically meaningful solutions (like non-negative). The function can be defined asfT=cnormT+cnonnegTwhere f(T) is the objective scalar function to be minimized and cnorm(T), and cnon  neg(T) are scalar functions for the

MCR-BANDS: Calculation of the extent of rotation ambiguities

In order to evaluate the extent of rotation ambiguities associated to a particular MCR solution under a set of constraints, the MCR-BANDS procedure has been proposed [11]. In this paper, it is applied to evaluate the extent of rotation ambiguities associated to MCR-ALS solutions, and as extension, also to those obtained by MCR-FMIN, PCA, ICA, and MVSA methods previously explained. In addition, MCR-BAND results are compared to those obtained by the FAC-PACK method (see below). MCR-BANDS method

FAC-PACK: Evaluation of the area of feasible solutions (AFS)

The area of feasible solutions (AFS) is a subset of the two-dimensional plane consisting of all pairs of solutions which represent non-negative spectra (case of the spectral AFS) or non-negative concentration profiles (case of the concentration AFS). In FAC-PACK toolbox for MATLAB, AFS is computed using a polygon inflation algorithm and its more recent implementation, the inverse polygon inflation algorithm. The polygon inflation method approximates the border of each AFS segment by a sequence

Mutual information (MI) evaluation

To estimate the degree of independence between component profiles, mutual information (MI) values are used. MI values are defined as proposed in previous works as a natural measure of the mutual independence between two variables [36]. MI between two variables can be expressed using the joint probability density function p(x1, x2) and the marginal probability density function p(x1) and p(x2).Ix1x2=dx1dx2px1x2logpx1x2px1px2

Krasakov et al. have proposed an efficient method of estimating MI values.

Concentration profiles recovery evaluation using Amari index

To estimate the reliability of the results obtained by the different methods applied here, concentration profiles were compared to the correct ones (if available) using the Amari index [13]. It shows the reliability between the compared concentration profiles, and it is defined as follows:P=12NI,J=1Npijmaxkpik+pijmaxkpkj1where pij = (Ĉ+C)i,j, C are the true concentration profiles and Ĉ+ is the pseudoinverse of calculated ones using the considered method. The Amari index is equal to zero when

Data fitting: Calculation of lack-of-fit values

Lack-of-fit values are defined form the difference between input data D values and their reproduced values obtained using a particular method. To evaluate the quality of the data fitting finally achieved after application of the different bilinear method, the percentages of lack of fit (lof) are calculated according to the following equation:lof%=100×ijdijd^ij2ijdij2where dij are the elements of the data matrix D, and d^ij are the corresponding elements recalculated by the considered method,

Datasets

In this work, similar synthetic datasets to those described in a previous work [13] were used. Both reaction and chromatographic type of profiles of two, three, and four components were used for the purposes of this work. The different sets of concentration and spectra profiles used as examples are given in Fig. 1.

Pure spectra were normalized to unit area and relative concentration profiles were always scaled between 0 and 1 in all the cases (white Gaussian noise with zero-mean and constant

Software

All calculations were performed in MATLAB R2013a (Mathworks Inc., Natick, MA, USA) for windows.

MF-ICA [29] methods were obtained from the ICA MATLAB toolbox V3 (ICA Toolbox Homepage (http://isp.imm.dtu.dk/toolbox/ica). MVSA MATLAB codes were downloaded from http://www.lx.it.pt/~bioucas/code.htm. MCR-ALS and MCR-BANDS algorithm code and GUI for MATLAB are freely available from the home page of MCR at http://www.mcrals.info/. FAC-PACK GUI for MATLAB is freely available from webpage //www.math.uni-rostock.de/FAC-PACK/

Results and discussion

Table 1 shows MI values of all spectra profiles, as well as the Amari index (AI) for all concentration profiles and the lack-of-fit values obtained for the different tested methods (MCR-ALS, ICA, PCA, MCR-FMIN, and MVSA) in the analysis of all datasets previously described. Data values in bold font in Table 1 correspond to lowest lack of fit, mutual information, and Amari index, respectively. Fig. 2(a–c) gives the graphical representation of lack-of-fit, MI, and Amari index values in Table 1.

Evaluation of the extension of rotation ambiguities in MCR and other methods

MCR solutions have in general a certain degree of ambiguity, and this can be evaluated by methods like MCR-BANDS. MCR-BANDS is based on the optimization of the function SCCF defined by the relative signal contribution of every component in relation to all the other components in the mixture (see Eq. (2)). In this work, relative signal contributions of all the components measured by SCCF using the different investigated methods such as MCR-ALS, MVSA, PCA, ICA, and MCR-FMIN are evaluated and

Conclusions

When multivariate curve resolution methods are applied to two-way data and local rank/selectivity conditions are not present in the data, there is no way in general to ascertain whether the solutions provided by these methods are correct (equal to the true expected one). Instead of a unique solution, a range of feasible solutions is encountered, all of them equally good from a data fitting point of view and fulfilling the required constraints. Different curve resolution methods may give

Conflict of interest

There is no conflict of interest.

Acknowledgements

Authors acknowledge research grant CTQ2012-3816 from Ministerio de Economía y competividad, Spain. Xin Zhang thanks the grant from China Scholarship Committee (2011637007).

References (38)

Cited by (26)

  • Mutual information map as a new way for exploring the independence of chemically meaningful solutions in two-component analytical data

    2022, Analytica Chimica Acta
    Citation Excerpt :

    MCR decomposition methods are very suitable for solving the linear unmixing problems in chemistry, however, they are confronted with rotational ambiguities [14] which means that, in practice, a range of feasible solutions is obtained instead of a unique solution, and this range of solutions cannot be estimated in the general case. However, for problems of limited complexity, different methods exist to estimate the extent of rotational ambiguities [15,16] in MCR such as Lawton-Sylvester method [17] and Borgen method [18] to name a few. Nevertheless, trying to resolve more independent component profiles is not an optimal approach for the resolution of “component profiles”, since non-negative chemical profiles do not, in general, conform to the stipulated condition of independence in ICA methods.

  • How noise affects the band boundaries in multivariate curve resolution

    2022, Chemometrics and Intelligent Laboratory Systems
    Citation Excerpt :

    In fact, for two-component systems it has been proved that the extreme profiles obtained from both N-BANDS and MCR-BANDS can be compared with those obtained by calculating the AFS. For more than two components, the extreme profiles obtained by N-BANDS and MCR-BANDS cannot geometrically enclose the full set of feasible profiles, and thus the comparison with AFS estimations cannot be directly performed in the same way [10–12]. MCR-BANDS and N-BANDS are based on computing the maximum and minimum values of a signal contribution function (see below), whereas the AFS describes the whole space of feasible solutions.

  • A down-to-earth analyst view of rotational ambiguity in second-order calibration with multivariate curve resolution − a tutorial

    2021, Analytica Chimica Acta
    Citation Excerpt :

    The newly developed N-BANDS uses the area-related parameter α for maximization/minimization in the presence of instrumental noise [46], and has been incorporated in the latest version of the graphical user interface MVC2 for second-order multivariate calibration, which is freely available at www.iquir-conicet.gov.ar (MATLAB version) and at https://www.dropbox.com/sh/nruf3lp0ge1gbww/AAAj6r97UBMIhgQmukRGYFPKa?dl=0 (stand-alone compiled version) [47]. Applications of the latter concepts can be found in refs. [48,49]. A literature discussion concerns the question of whether the extreme feasible solutions, computed in any of the three manners discussed above, are able to enclose the full set of feasible solutions, and/or to measure the extent of rotational ambiguity.

View all citing articles on Scopus
View full text