Introduction

Sensing multiple analytes at the same time and place has long been a key challenge in sensor development. Especially for biotechnological [1, 2], environmental [3-5], and medical [6] applications, where intertwined biological processes lead to analyte transformations and the establishment of chemical gradients, multi-analyte sensors based on luminescent optical chemical sensors (so-called optodes) have proven beneficial [7, 8] and are therefore in high demand. For instance, in heterogeneous systems such as biofilms, fragmented profiling of pH and O2 does not reflect the entire heterogeneous distribution within the biofilm, nor can monitoring with two individual sensors capture the interdependence of these analytes and their combined influence on the biofilm [9, 10].

Hence, various approaches for luminescence-based optical chemical sensors are currently being investigated, all aiming at the simultaneous detection of multiple analytes at the exact same position with less complex and affordable equipment. The approaches range from single indicators that are sensitive to multiple analytes [11], to multi-layered systems that meet the spectral requirements of a given read-out system [12-16] (a color camera with 3 to 4 channels), to the further development of existing read-out instrumentation [17]. However, despite recent progress in the field, certain limitations remain. Specialized indicators normally require complex synthesis and are rarely commercially available. The combination of multiple indicators into a single sensor often leads to interactions between the respective indicators, such as energy transfer reactions, or to problems regarding the spectral separation of the overlapping emissions. Recently, we have shown that the latter issue of overlapping emissions can be overcome by using hyperspectral imaging systems and spectral unmixing [17]. At the same time, we had to realize that while conventional methods of statistical data analysis are suitable for simple multi-analyte sensor systems in which only the luminescence intensity of the indicators changes as a function of analyte concentration [17], these methods fail when the indicators additionally undergo a spectral shift. In this case, the interactions and dependencies of the indicators become too complex, and analysts are no longer able to deduce an unambiguous and universal model that considers all potential cross-sensitivities.

To overcome this and decipher such complex and nested datasets, machine learning (ML) algorithms offer great potential. ML exploits the ability of computers to learn from (training) data, recognize patterns in nested datasets, and automate the construction of analytical models. Since their emergence in the second half of the twentieth century, ML models have been applied in a variety of fields, including life and environmental sciences, for predicting extreme natural events using remote sensing [18], enabling smart sensor systems [19], and drug delivery [20, 21]. Interesting work using ML approaches has already been done in relation to optical sensors [22, 23]. Expanding on this work, we now apply ML models to enable multi-analyte imaging in 2D, i.e., to visualize the heterogeneity of biological environments and the distributions of multiple analytes in two dimensions simultaneously. While other sensing approaches, especially fiber-based single-point sensors, may face the operational challenge of creating the large (training) datasets required to train ML models in accordance with the law of large numbers [24, 25], it is the inherent nature of imaging to record hundreds of quality samples within a single image acquisition.

Therefore, we present a novel proof-of-concept approach for optical chemical multi-analyte imaging using a machine learning (ML) model. Using a dual analyte sensor for pH and dissolved oxygen, we demonstrate the potential of ML for the nested and intercoupled emission spectra of optical chemical sensors. A hyperspectral camera as the read-out system provides a sufficiently large amount of data within a single image acquisition, where each image pixel contains high-quality information over the entire spectral range between 470 and 900 nm. In the following, we first introduce the problem of a complex and nested dataset for the dual analyte sensor, which cannot be solved with conventional statistical models. We then describe the ML model as well as its performance and conclude with a discussion of the benefits and risks of the novel approach for optical chemical multi-analyte sensors.

Material and methods

Refer to the supplementary materials for more information on algorithm optimization, validation of the final ML model, and its visualization. In addition, examples of the raw calibration data for the two-layered optical chemical sensor as well as a spreadsheet containing the prepared calibration data can be downloaded from the Mendeley Data repository [26]. Due to space limitations at the repository, we are only able to share examples of the original hyperspectral fluorescence images.

Materials

The O2-sensitive indicator dye platinum(II)-meso-tetraphenyl-tetrabenzoporphyrin (Pt-TPTBP) and the reference dye Macrolex Fluorescence Yellow (MFY 10GN) were purchased from Frontier Scientific (frontiersci.com, Logan, USA) and Lanxess AG (lanxess.com, Köln, Germany), respectively. The lipophilic pH indicator HPTS (1-hydroxypyrene-3,6,8-tris-bis(2-ethylhexyl)sulfonamide) was provided by Dr. Sergey Borisov (Graz University of Technology, Austria) [27]. Additional chemicals for sensor fabrication and calibration, such as polystyrene (PS, MW 192,000 g·mol−1), polyurethane-based hydrogel (HydroMed D4), sodium sulfite (Na2SO3), ethanol, and toluene, were bought from Sigma-Aldrich (sigmaaldrich.com, St. Louis, USA), AdvanSource Biomaterials (advbiomaterials.com, MA, USA), and Merck KGaA (merckgroup.com, Darmstadt, Germany). The monocrystalline diamond powder was purchased from Pureon (pureon.com, Lengwil, Switzerland). All buffer materials (sodium phosphate monobasic monohydrate NaH2PO4 · H2O and dihydrate NaH2PO4 · 2H2O) were obtained from Sigma-Aldrich (sigmaaldrich.com, St. Louis, USA). The PET support foil (Lumirror 4001, 125 µm) was obtained from Puetz Folien (puetz-folien.com, Taunusstein, Germany). All chemicals were used as received.

Optode fabrication

A sensor cocktail was prepared for the fabrication of the optodes according to the literature [28]. First, the O2-sensitive layer was prepared, for which 0.94 mg of the Pt-TPTBP indicator and 0.86 mg of the MFY reference dye were dissolved in 1 g of a 10% w/w PS solution (in toluene). The sensor cocktail was knife-coated onto a dust-free PET support foil using a film applicator (Byk-Gardner GmbH, Germany), yielding a ~ 12-µm-thick sensor layer after solvent evaporation. For the pH-sensitive layer, 0.95 mg of the lipophilic HPTS and 48 mg of monocrystalline diamond powder, serving as a signal enhancer, were added to 1 g of a 10% w/w solution of D4 (in ethanol:water, 9:1 w/w). This sensor cocktail was knife-coated on top of the well-dried O2-sensitive layer, yielding a ~ 10-µm-thick pH layer after solvent evaporation. The total thickness of the dual analyte optode was thus ~ 22 µm. In addition to the dual analyte sensor, single sensors consisting of only one layer, sensitive to either pH or O2, were coated using the same cocktail compositions as described above.

Imaging setup and optode calibration

The setup was built similarly to that described in our previous paper, with some adaptations to suit the current dual analyte sensor [17]. A schematic of the imaging setup is shown in Fig. 1 for clarification. In short, the setup consisted of a hyperspectral camera (imec SnapScan VNIR camera; imec-int.com, Belgium) equipped with a color-corrected objective (Apo-Xenoplan lens, f2.0; Schneider-Kreuznach GmbH, Germany). A plastic filter (#10 medium yellow; LEEfilters.com, UK) was placed in front of the objective to reduce background fluorescence. The camera was connected to a PC and controlled using the manufacturer's hyperspectral image-recording software (HSI Snapscan v1.4.1.0; imec-int.com, Belgium). For image acquisition, the camera was set to scan the full image frame (1088 × 2048 pixels) and the full spectral wavelength range (470–900 nm) with a pixel step of 3 nm and an integration time of 5 ms. The pixel blur and binning were set to 0 and 1, respectively. The dual analyte sensor foil was excited with a high-power LED light source (460 nm; LED Hub, Omicron Laserage Laserprodukte GmbH, Rodgau, Germany) equipped with a 1-m liquid light guide and a collimating lens. The LED light source was controlled via a PC running the manufacturer's software. The dual analyte sensor foil of approximately 2.5 × 8 cm2 was taped to the inner transparent glass wall of a buffer-filled measurement chamber. Excitation and imaging of the sensor foil were done frontally through the chamber wall.

Fig. 1
figure 1

Schematic representation of the measurement setup used for measuring and calibrating the optical chemical dual-sensor for pH and dissolved O2 (A) and real image of the fluorescence of the dual analyte optode upon excitation with a high-power LED (B)

Calibration of the dual analyte optode was performed similarly to procedures described in the literature, with some adjustments to suit the dual analyte optode [14, 28]. For pH calibration, the pH of the phosphate buffer (0.1 mol·L−1 with an ionic strength of 0.377 mol·L−1) was adjusted using 1 mol·L−1 HCl and NaOH solutions. Oxygen levels were altered by using compressed O2 and N2 (Air Liquide S.A., airliquide.dk; Taastrup, Denmark), which were mixed with a gas mixer (Red-y compact; Vögtlin Instruments GmbH, Muttenz, Switzerland). At the lowest O2 calibration points, sodium sulfite was added as an additional O2 scavenger to ensure fully anoxic conditions. All measurements were performed at the same constant temperature (22.5 ± 0.5 °C). A pH meter (PHM210 Meterlab, Radiometer Analytical, Lyon, France) was used to monitor the pH throughout the calibration. O2 levels and temperature were monitored with a fiber-optic O2 phase fluorimeter (FireSting GO2; PyroScience GmbH, Aachen, Germany) equipped with a robust O2 sensor (OXROB3; PyroScience GmbH, Aachen, Germany).

Spectral characterization of individual layers of the dual analyte optode

For full spectral characterization of the single and dual analyte optodes, additional fluorescence and excitation spectra were acquired with a ClarioStar Plus plate reader (BMG Labtech, Ortenberg, Germany) at room temperature under different pH and oxygen conditions. The optodes were taped into a 12-well plate, the wells were filled with 1 mL of phosphate buffer, and the pH was adjusted using 1 M HCl and NaOH solutions. The O2 levels were adjusted either by shaking the buffer solution before filling it into the wells or by adding a few drops of a 2% sodium sulfite solution. For the excitation spectra, the excitation wavelength was scanned between 350 and 700 nm (slit width 10 nm, increment 2 nm), while the emission wavelength was set to 770 nm (slit width 10 nm). To record the fluorescence spectra, the emission wavelength was scanned between 420 and 840 nm (slit width 10 nm, increment 2 nm), while the excitation wavelength was set to 380 nm (slit width 10 nm).

Image analysis and data processing

Required programming packages

The radiometric correction of the raw hyperspectral images is done using a MATLAB script that can be obtained from the camera manufacturer upon request (hsisupport@imec.be). The image analysis, data processing, and the ML model were coded in Python 3.7.4 (python.org) using the following Python packages: for loading and processing hyperspectral images, we used Spectral Python (SPy, spectralpython.net), matplotlib (matplotlib.org), and the Python Imaging Library (PIL; pypi.org/project/Pillow); for spectral fitting and for solving the integration and optimization problems, SciPy (scipy.org) and the nonlinear least-squares fitting package lmfit (lmfit.github.io) were used. Further required packages and standard-library modules are NumPy, pandas, math, random, time, glob, pathlib, os, h5py, and xlrd. All required libraries were up to date at the time the paper was submitted. The Python code is openly accessible and can be downloaded from GitHub (github.com/silviaelisabeth/ML_for_pHandO2).
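To illustrate how the corrected hyperspectral cubes can be handled with these packages, the following sketch loads a cube with Spectral Python and plots a single pixel spectrum. It assumes the corrected cube is stored in ENVI format; the file names, pixel coordinates, and plot labels are placeholders and not part of the original pipeline.

```python
# Minimal sketch (not the original pipeline): load a radiometrically corrected
# hyperspectral cube stored in ENVI format and inspect one pixel spectrum.
# File names and pixel coordinates are placeholders.
import numpy as np
import matplotlib.pyplot as plt
import spectral.io.envi as envi

img = envi.open('optode_calibration.hdr', 'optode_calibration.img')
cube = img.load()                            # shape: (rows, cols, bands)
wavelengths = np.array(img.bands.centers)    # band centers in nm (470-900 nm)

# Plot the spectrum of a single pixel as a quick quality check
row, col = 500, 1000
plt.plot(wavelengths, np.squeeze(cube[row, col, :]))
plt.xlabel('wavelength / nm')
plt.ylabel('fluorescence intensity / a.u.')
plt.show()
```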

Performance analysis

While classification models in ML can be assessed straightforwardly based on performance measures such as accuracy, this is not the case for regression models. In regression, the model performance is reported as its deviation, or error, from the expected target values. While there are various approaches to assessing the regression performance of a model, the commonly used error metrics are the root mean square error (RMSE) and the mean absolute error (MAE) [29, 30].

Root mean square error (RMSE)

The root mean square error, also called root mean square deviation, measures the difference between the estimated (yi) and the expected target (xi) values. The differences are first squared and then averaged across all samples; finally, the square root is taken. The RMSE quantifies the average magnitude of the error and is a negatively oriented score, i.e., the lower the error, the better the model prediction performance. However, the RMSE is comparatively sensitive to outliers:

$$RMSE=\sqrt{\frac{\sum_{i=1}^{N}\left(y_{i}-x_{i}\right)^{2}}{N}}$$
(1)

with yi being the estimated value and xi being the expected value for the ith sample. N is the number of samples in the given dataset.

Mean absolute error (MAE)

The mean absolute error, in contrast, uses the absolute value of the difference between estimated and expected values rather than its square. It is thus more robust towards outliers and does not penalize larger errors more than smaller ones:

$$MAE= \frac{{\sum }_{i=1}^{N}\left|{y}_{i}-{x}_{i}\right|}{N}$$
(2)

with yi being the estimated value and xi being the expected value for the ith sample. N is the number of samples in the given dataset.
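For readers who prefer code to formulas, both metrics of Eqs. (1) and (2) can be computed in a few lines of NumPy; the values in the example below are synthetic and serve only to show the calculation.

```python
import numpy as np

def rmse(y_est, x_target):
    """Root mean square error, Eq. (1)."""
    d = np.asarray(y_est) - np.asarray(x_target)
    return np.sqrt(np.mean(d ** 2))

def mae(y_est, x_target):
    """Mean absolute error, Eq. (2)."""
    d = np.asarray(y_est) - np.asarray(x_target)
    return np.mean(np.abs(d))

# Synthetic example: predicted vs. expected pH values
y = [7.02, 6.95, 8.10, 5.88]   # estimated values y_i
x = [7.00, 7.00, 8.00, 6.00]   # expected target values x_i
print(rmse(y, x), mae(y, x))
```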

Results and discussion

In optical chemical sensing, where the changing spectral properties of an analyte-sensitive indicator are correlated with the analyte concentration, the situation can quickly become complex. Not only can effects such as leaching or bleaching alter the sensor over time, but the individual components within the sensor can also interact with each other through energy or electron transfer reactions. This makes the evaluation of luminescence spectra more complex, since these alterations and cross-interferences must be considered when calibrating the indicators, especially if the cross-interferences do not remain constant [31].

Figure 2 displays such a complex situation for the simultaneous imaging of pH and dissolved O2. The figure depicts the spectral excitation/emission characteristics of the individual layers of the optical chemical dual analyte sensor, i.e., the pH-sensitive layer (Fig. 2A–B) and the O2-sensitive layer (Fig. 2C–D). While the two-layered structure of the optode should prevent close-proximity energy transfer reactions, such as Förster resonance energy transfer (FRET) or photoinduced electron transfer (PET), reabsorption of luminescence can still occur when the excitation and emission spectra of the indicators involved overlap [32].

Fig. 2
figure 2

Spectral characterization of the single optode layers recorded on the ClarioStar Plus plate reader under different pH and O2 conditions. The excitation spectra of A lipophilic HPTS as the pH-sensitive dye and C Pt-TPTBP as the O2-sensitive dye are shown in the left panels, while the emission spectra of B the pH indicator and D the O2 indicator are shown in the right panels. Note that the O2-sensitive sensor layer also contains Macrolex Fluorescence Yellow as the reference dye. While in A, C, and D, the fluorescence intensity is displayed relative to the maximum intensity, in B, the intensity is displayed relative to the isosbestic point at 530 nm

Figure 2A and D reveal that reabsorption may occur to some extent, particularly under basic conditions, as the reference indicator Macrolex Fluorescence Yellow emits between 500 and 600 nm (Fig. 2D), which overlaps with the absorption of lipophilic HPTS (yellow and green curves in Fig. 2A). However, reabsorption by the O2 sensor layer (the combination of the Macrolex Fluorescence Yellow reference dye and the oxygen-sensitive Pt-TPTBP dye, Fig. 2C) is predominant, because its excitation overlaps with the emission of the lipophilic HPTS dye (Fig. 2B). Particularly under acidic conditions, when the pH indicator emits between 400 and 550 nm, reabsorption by the O2 sensor layer (Fig. 2C) can occur. At higher pH values, this reabsorption is reduced since the overlap between emission and absorption is smaller. The complex and nested combination of these effects creates a situation that cannot be predicted and accounted for in one or a few polynomial functions, as required by conventional approaches to signal deconvolution.

Figure 3 subsequently illustrates this nested and intercoupled situation with spectral cross-interferences when it comes to calibrating the different analytes. While Fig. 3A displays pH calibration data of the dual analyte optode at two different O2 concentrations (anoxic and air-saturated), Fig. 3B displays O2 calibration data at two different pH values (4 and 8). The dashed curves in the panels represent hypothetical calibration curves obtained if the respective standard calibration fit functions were applied to the calibration data to calibrate the individual analytes (an illustrative sketch of these conventional fits is given after Fig. 3). As can be seen from the graphs, and especially from Fig. 3A, the fitted calibration curves fail to describe the calibration data well, as there is cross-dependence in both calibrations. While most of the calibration data might be described individually by conventional fit functions, the pH calibration under anoxic conditions, as well as the interpolation of all other analyte combinations, can hardly be handled by conventional analysis methods and calibration functions. It is important to note that the indicators chosen in this example show spectral overlap and were deliberately selected to demonstrate the limitations of working with commonly used (commercially available) indicators. Combining other indicators with less spectral overlap could eliminate or reduce this issue; however, this often requires the synthesis of specialized indicators, which for various reasons is not always possible.

Fig. 3
figure 3

pH and O2 calibration of the dual analyte optode under constant conditions of the respective other analyte. A pH calibration between pH 4 and 11 under anoxic (0 hPa) and air-saturated (195 hPa) conditions. B O2 calibration displayed as the ratiometric intensity relative to the reference indicator, Macrolex Fluorescence Yellow, while the pH is kept constant at either pH 4 or pH 8. The dashed curves in both panels represent the hypothetical calibration curves of the analytes if the respective standard calibration functions for the individual analytes, i.e., a Boltzmann fit for the pH calibration and a simplified Stern–Volmer fit for the O2 calibration, were applied to the calibration points
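To make the limitation concrete, the sketch below shows how the two conventional single-analyte calibration functions named in the caption could be fitted with SciPy. The exact functional forms (a sigmoidal Boltzmann-type fit and a simplified Stern–Volmer relation) and the calibration points are illustrative assumptions, not the measured data; the point is that each fit treats its analyte in isolation and therefore cannot capture the cross-dependence visible in Fig. 3.

```python
# Illustration only: conventional single-analyte calibration fits as named in
# the Fig. 3 caption, applied to synthetic calibration points. The functional
# forms and data are assumptions for demonstration purposes.
import numpy as np
from scipy.optimize import curve_fit

def boltzmann(pH, top, bottom, pKa, slope):
    """Sigmoidal (Boltzmann-type) pH calibration."""
    return bottom + (top - bottom) / (1.0 + 10 ** ((pKa - pH) / slope))

def stern_volmer(pO2, i0, ksv):
    """Simplified Stern-Volmer quenching: I = I0 / (1 + Ksv * pO2)."""
    return i0 / (1.0 + ksv * pO2)

# Synthetic calibration points (placeholders)
pH = np.array([4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0])
r_ph = np.array([0.05, 0.10, 0.30, 0.65, 0.90, 0.98, 1.00, 1.00])
pO2 = np.array([0.0, 25.0, 50.0, 100.0, 150.0, 195.0])
i_o2 = np.array([1.00, 0.55, 0.38, 0.24, 0.17, 0.14])

popt_ph, _ = curve_fit(boltzmann, pH, r_ph, p0=[1.0, 0.0, 7.0, 1.0])
popt_o2, _ = curve_fit(stern_volmer, pO2, i_o2, p0=[1.0, 0.02])
print('Boltzmann parameters:', popt_ph)
print('Stern-Volmer parameters:', popt_o2)
```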

This is where ML modeling comes into play as an alternative approach. The advantage of ML models lies in their capability to find underlying patterns and parameter correlations in nested and interconnected datasets whose complexity and dimensionality are beyond human intuition. For the modeling, we decided to use the absolute fluorescence response of the dual analyte optode, as opposed to the usual approach in optical chemical sensing, which uses the ratiometric intensity relative to the reference indicator. This decision was based on the fact that, in our tests, the former approach yielded slightly better results than the latter.

In the following, we first explain data extraction and preparation, which is a crucial step in modeling, and describe important aspects that can affect model performance. We then describe the process of model building and optimization, followed by a description and validation of the final model for simultaneous imaging of pH and dissolved O2. Especially during the validation step, the advantage of ML modeling becomes clear; nevertheless, at each step, we emphasize the risks of bias that can impact the overall model performance. Scheme 1 provides an overview of the complete workflow from initial data acquisition to the final machine learning model.

Scheme 1
scheme 1

Overview of the workflow conducted to build up the multi-layered machine learning model for simultaneous detection of pH and dissolved O2

Data preparation

The first key step in building a strong ML model is to provide a suitable dataset on which the algorithm can train and from which it can deduce an underlying (hidden) pattern. For a well-performing ML model, a suitable dataset means a sufficiently large set of samples containing balanced and high-quality information, in accordance with the law of large numbers [24, 25, 33]. Although the required size depends on the individual problem and its complexity, a common rule of thumb among computer scientists is at least 1000 samples. In some cases, when the amount of data is not a limiting factor, researchers may apply dimension reduction techniques such as principal component analysis, factor analysis, or linear discriminant analysis to enhance the information density and remove unwanted noise from random variables before applying further regression algorithms [34, 35]. However, dimension reduction may also filter out information that subsequent regression algorithms need to find the underlying patterns. Therefore, we opted for an outlier removal test to ensure data quality instead of a dimension reduction technique. In addition, our preliminary tests (not shown here) demonstrated that this approach led to better results without sacrificing relevant information.

Hence, to meet the first requirement and extract a sufficiently large amount of spectral data from the optode images, we selected homogeneous regions of interest (RoIs) from the optode calibration images. However, unlike the usual approach in optical chemical imaging, we did not average over a larger area of the optode but chose small sections of 5 × 5 pixels as RoIs, removed outliers using an interquartile-range filter, and calculated the median of each RoI. In this way, we obtained 7196 oxygen samples and 6476 pH samples while mitigating the noise of the optode images. A table was compiled from these processed data, and the interested reader may download the calibration data from the publicly accessible Mendeley Data repository [26].
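The following sketch outlines the per-RoI processing described above: splitting a homogeneous image region into 5 × 5 pixel RoIs, removing outlier pixels with an interquartile-range filter, and keeping the median spectrum of each RoI. Array shapes, the 1.5 × IQR threshold, and the outlier criterion (total pixel intensity) are illustrative choices, not necessarily those used in the original script.

```python
# Simplified sketch of the RoI processing: 5 x 5 pixel RoIs, IQR-based outlier
# removal, and one median spectrum per RoI. Shapes and thresholds are
# illustrative.
import numpy as np

def roi_medians(region, roi=5, k=1.5):
    """region: (rows, cols, bands) array of fluorescence spectra."""
    rows, cols, bands = region.shape
    samples = []
    for r in range(0, rows - roi + 1, roi):
        for c in range(0, cols - roi + 1, roi):
            block = region[r:r + roi, c:c + roi, :].reshape(-1, bands)
            # flag pixels whose total intensity lies outside Q1/Q3 -/+ k*IQR
            total = block.sum(axis=1)
            q1, q3 = np.percentile(total, [25, 75])
            iqr = q3 - q1
            keep = (total >= q1 - k * iqr) & (total <= q3 + k * iqr)
            if keep.any():
                samples.append(np.median(block[keep], axis=0))
    return np.asarray(samples)   # one median spectrum per RoI

# Example with a synthetic 50 x 50 pixel region and 144 spectral bands
rng = np.random.default_rng(0)
region = rng.normal(1000.0, 50.0, size=(50, 50, 144))
print(roi_medians(region).shape)   # -> (100, 144)
```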

In the next step, we examined the distribution of sample points across the calibration points to obtain a balanced dataset, i.e., one in which each calibration point is almost equally represented. This is critical to avoid bias in model accuracy and to prevent the model from being trained on a hidden bias that stems from artifacts rather than the actual feature correlations (overfitting). As can be seen in Fig. 4, the distribution of the sample data obtained for each calibration point (original dataset shown in bright colors) is highly imbalanced, notably for the oxygen calibration, where calibration points at air saturation and under anoxic conditions prevail. Thus, we reduced the number of over-represented samples by averaging larger groups of pixels and ultimately obtained 2506 samples for oxygen and 4450 samples for pH (a simplified sketch of this balancing step is given after Fig. 4). The distribution of the final dataset is shown in dark colors in Fig. 4.

Fig. 4
figure 4

Adjustment of the unbalanced calibration dataset by reducing the number of data points where data prevail. The number of data points is adjusted to the overall median, and the adjustment is performed separately for each analyte. For each panel, the original distribution of the dataset is shown in light colors, while the more balanced dataset is shown in dark colors, i.e., (A) in orange for pH and (B) in gray for O2
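A simplified sketch of this balancing step is given below. It assumes the prepared data are stored in a pandas table with one spectrum per row and a label column for the calibration point; over-represented calibration points are merged down to the overall median count by averaging small random groups of rows. The column name and grouping strategy are illustrative, not necessarily those of the original script.

```python
# Illustration of the balancing step: calibration points with more samples
# than the overall median are reduced by averaging small random groups of
# rows. The label column name ('pO2') is a placeholder.
import numpy as np
import pandas as pd

def balance(df, label_col='pO2'):
    target_n = int(df[label_col].value_counts().median())
    balanced = []
    for _, grp in df.groupby(label_col):
        if len(grp) <= target_n:
            balanced.append(grp)
            continue
        # shuffle, then average consecutive chunks down to ~target_n rows
        grp = grp.sample(frac=1.0, random_state=0)
        chunk_idx = np.array_split(np.arange(len(grp)), target_n)
        balanced.append(pd.DataFrame(
            [grp.iloc[idx].mean(numeric_only=True) for idx in chunk_idx]))
    return pd.concat(balanced, ignore_index=True)
```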

Machine learning regression model

When screening the literature for an appropriate machine learning algorithm, one comes across a great variety of algorithms applied to a wide range of topics and problems, including research questions in life and environmental sciences [1, 2]. The field is constantly evolving, with new algorithms being introduced to solve increasingly complex problems in less time. Each has different strengths and potentials, but not all are applicable to every research question; some are unnecessarily complex in terms of computational power or do not match the given data or problem at hand. Interested readers can read more about other ML models in the referenced publications [29, 33–37]. Below, we describe how we selected and optimized an appropriate ML model based on the given dataset and validated its final performance.

Model identification

The measured calibration dataset is best described as a structured dataset summarizing the spectral responses of the dual analyte sensor along the entire wavelength range between 470 and 900 nm at different pH and O2 conditions. In addition to the spectral response of the dual analyte sensor (the so-called features of the dataset), the specific pH and O2 concentrations during calibration are known. Thus, the calibration dataset can be described as a labeled, structured dataset with an additional target vector, which allows the use of supervised ML algorithms. Another important point is that although the structured dataset provides discrete calibration points, it must be possible to obtain continuous results in subsequent measurements, which requires a regression model rather than a classification model. However, even though the problem can be narrowed down to a supervised regression problem, there remain a myriad of different approaches and algorithms. Moreover, since the dual analyte sensor is sensitive to two analytes simultaneously, the algorithm should reflect this and output both analyte values at the same time. Therefore, we decided to build a multi-layered ML model that first finds the pH that best fits a given spectral response of the dual analyte sensor and then iteratively finds a solution for dissolved O2. The reason for this order was that the dual analyte sensor appears to be more sensitive to changes in O2 concentration, and cross-interactions such as FRET impact the overall sensor response more than changes in pH (see Fig. 3). Furthermore, note that we used the absolute fluorescence spectra instead of the ratiometric ones, as is usually done in optical chemical imaging [12].

To find the best ML regression algorithm, we assessed different options and determined the performance of the overall model for the given data using different loss functions. For applied ML, the choice of the loss function can be crucial and can favor different algorithms depending on where the focus lies for a given problem, i.e., whether, for example, accuracy is more relevant than the sensitivity or selectivity of the sensor. One common way of describing the performance of a regression model is to determine its accuracy and dispersion in terms of mean absolute error (MAE) and root mean square error (RMSE), respectively [30]. However, while these performance measures describe the overall performance of the algorithm on a given set, one cannot rule out that the data carry a hidden bias to which the algorithm mainly responds and trains. Thus, to prevent overfitting, the dataset is typically split into training and validation datasets. The former is used to train the model and describe the overall model performance, while the latter is used to assess its performance on data it has never seen before. The entire dataset was split into random subsets using a standard split function with a user-defined ratio, in our case 80:20. At first glance, this may seem counterintuitive compared to conventional validation tests, where individual calibration points are removed for validation. However, the ML model is not based on a single deduced fit function and should thus be validated randomly over the entire calibration range. To find the optimal regression algorithm, all performance measures should be as low as possible. Tables 1 and 2 give an overview of the performance measures of the different ML algorithms for each individual analyte.

Table 1 Performance of different ML regression algorithms assessed for training data as well as for validation data for the separate identification of pH
Table 2 Performance of different ML regression algorithms assessed for training data as well as for validation data for the separate identification of dissolved O2

As can be seen from Tables 1 and 2, no single regression algorithm is best suited and provides optimal results for both analytes. However, the algorithms that perform best for both the training and the validation data are the following regressors: decision tree (DT), random forest (RF), and XGBoost (XGB). Consequently, these three regressors were selected as potential candidates for the ML model, and their respective parameters were further optimized.
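Such a screening can be reproduced along the lines of the sketch below, which uses scikit-learn's train_test_split as a stand-in for the 80:20 split described above and reports MAE and RMSE on the training and validation subsets for the three short-listed regressors. X is assumed to hold one spectrum per row and y the corresponding pH (or pO2) label; this is an illustration, not the exact screening script.

```python
# Sketch of the regressor screening: 80:20 split, then MAE/RMSE on training
# and validation data for the three short-listed tree-based regressors.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error, mean_squared_error
from xgboost import XGBRegressor

def screen_regressors(X, y):
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2,
                                                random_state=42)
    models = {'DT': DecisionTreeRegressor(random_state=42),
              'RF': RandomForestRegressor(random_state=42),
              'XGB': XGBRegressor(random_state=42)}
    for name, model in models.items():
        model.fit(X_tr, y_tr)
        for split, (Xs, ys) in {'train': (X_tr, y_tr),
                                'valid': (X_val, y_val)}.items():
            pred = model.predict(Xs)
            print(name, split,
                  'MAE %.3f' % mean_absolute_error(ys, pred),
                  'RMSE %.3f' % np.sqrt(mean_squared_error(ys, pred)))
```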

Model optimization

To fine-tune an algorithm and optimize its performance, there are several set screws that define the algorithm, control its learning process, and constrain it in minimizing a predefined loss function. These so-called hyperparameters can be optimized in a process called hyperparameter optimization (HPO), which was done for all three candidates and each analyte. Traditional approaches to HPO are either a parameter sweep (grid search), in which all combinations over a manually specified subset of the hyperparameter space are enumerated exhaustively, or a random search, in which a subset of parameter combinations is sampled randomly [38]. The supplementary information provides a detailed summary of the HPO process for all three algorithms and both analytes, while Table 3 summarizes the final performance of the optimized algorithms using the same performance measures (MAE and RMSE) as described previously. The attached Excel file contains a very detailed summary of all HPO processes, intended as a guide for readers new to ML modeling who wish to replicate the optimization steps, while the Word document summarizes the most important intermediate results for a quick overview.
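As an illustration of such an HPO step, the sketch below sets up a random search for the XGBoost regressor with scikit-learn's RandomizedSearchCV. The searched parameter ranges are placeholder values; the finally selected hyperparameters are those reported in Table 4.

```python
# Minimal sketch of a random-search HPO for the XGBoost regressor.
# Parameter ranges are illustrative; the tuned values are given in Table 4.
from sklearn.model_selection import RandomizedSearchCV
from xgboost import XGBRegressor

param_space = {'n_estimators': [100, 300, 500, 1000],
               'max_depth': [3, 5, 7, 9],
               'learning_rate': [0.01, 0.05, 0.1, 0.3],
               'subsample': [0.6, 0.8, 1.0],
               'colsample_bytree': [0.6, 0.8, 1.0]}

search = RandomizedSearchCV(XGBRegressor(random_state=42), param_space,
                            n_iter=50, cv=5,
                            scoring='neg_mean_absolute_error',
                            random_state=42, n_jobs=-1)
# search.fit(X_train, y_train)   # training data prepared as described above
# print(search.best_params_, -search.best_score_)
```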

Table 3 Performance of ML regression algorithms optimized in an HPO process. Performance is assessed for both the training data and for the validation data for the separate identification of pH and O2, respectively

As shown in Table 3, the optimal regression algorithm for pH prediction is XGBoost, a scalable decision-tree-based ensemble ML algorithm that uses a gradient boosting framework and provides parallel tree boosting [39]. This is not surprising, since for small- to medium-sized structured data, decision-tree-based algorithms such as XGBoost are considered to perform best. While the performance measures for pH prediction are clearly in favor of the XGBoost regression algorithm, this is less clear for the prediction of dissolved O2. We therefore decided to use the XGBoost regression algorithm for the prediction of dissolved O2 as well. Table 4 summarizes the optimized hyperparameters for each analyte that yield the performance metrics described before (Table 3).

Table 4 Optimized hyperparameters for each XGBoost regressor

Final ML model and model validation

After several optimization and screening procedures, the model for simultaneous detection of pH and dissolved O2 using optical chemical sensors initially consisted of a two-layer ML model based on XGBoost algorithms: first, the pH value is predicted and, subsequently, the O2 concentration with conditional knowledge of the pH value. However, since the prediction of dissolved O2 appeared rather uncertain, with some outliers and larger uncertainties, an additional ML layer using an XGBoost regression algorithm was added to iteratively refine the O2 prediction. Thus, the final ML model comprises three XGBoost layers for pH and O2 prediction. The final algorithm can be downloaded from GitHub (github.com/silviaelisabeth/ML_for_pHandO2) and is freely available.
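To clarify the chaining idea, the sketch below outlines the structure of the three-layer model: a first XGBoost layer predicts the pH from the spectra, a second layer predicts O2 with the predicted pH appended as an additional feature, and a third layer refines the O2 estimate using the first O2 prediction as a further input. This is a structural illustration only; the actual implementation is the one available on GitHub.

```python
# Structural sketch of the three-layer model (illustration; see GitHub for
# the actual implementation). Layer 1: pH from spectra. Layer 2: O2 from
# spectra + predicted pH. Layer 3: refined O2 from spectra + predicted pH
# + first O2 estimate.
import numpy as np
from xgboost import XGBRegressor

class DualAnalyteModel:
    def __init__(self, **xgb_kwargs):
        self.ph_model = XGBRegressor(**xgb_kwargs)
        self.o2_model = XGBRegressor(**xgb_kwargs)
        self.o2_refine = XGBRegressor(**xgb_kwargs)

    def fit(self, X, y_ph, y_o2):
        self.ph_model.fit(X, y_ph)
        ph_hat = self.ph_model.predict(X).reshape(-1, 1)
        X_o2 = np.hstack([X, ph_hat])
        self.o2_model.fit(X_o2, y_o2)
        o2_hat = self.o2_model.predict(X_o2).reshape(-1, 1)
        self.o2_refine.fit(np.hstack([X_o2, o2_hat]), y_o2)
        return self

    def predict(self, X):
        ph_hat = self.ph_model.predict(X).reshape(-1, 1)
        X_o2 = np.hstack([X, ph_hat])
        o2_hat = self.o2_model.predict(X_o2).reshape(-1, 1)
        o2_refined = self.o2_refine.predict(np.hstack([X_o2, o2_hat]))
        return ph_hat.ravel(), o2_refined
```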

As mentioned several times in this publication, the validation of an ML model is crucial during the building process to ensure its accuracy and to detect any bias in the dataset. Besides validation by one-time sub-sampling, as performed previously, another option is cross-validation [36]. Cross-validation is a resampling method that uses either individual samples or larger subsamples of the entire dataset to validate a model over different iterations. During the validation process, the subsamples are removed from the training dataset to avoid a prediction bias. Because of this, cross-validation requires more computational power than validation by one-time sub-sampling, but it also assesses the performance of the model much more accurately. An extremely accurate but, due to the large size of our dataset, computationally intensive option would be leave-one-out cross-validation, in which all possible combinations are trained and tested. A good balance between these two extremes, one-time sub-sampling and leave-one-out cross-validation, is k-fold cross-validation, a non-exhaustive cross-validation in which the dataset is randomly partitioned into k equally sized subsamples. One of the k subsamples is used as validation data, while the remaining subsamples are used as training data; this process is then repeated k times. In ML modeling, it is common practice to perform 10-fold cross-validation [36]. Following cross-validation, MAE and RMSE can be determined as performance metrics as described before. A summary of the validation process for the ML model is listed in Table 5, while detailed information about the variance of the predicted and target pH and dissolved O2 values can be found in the supplemental information. To illustrate the benefits of the iterative O2 prediction, Table 5 lists the performance metrics for the initial O2 prediction as well as those obtained after adding the additional layer for iterative O2 prediction. Please note that the 10-fold cross-validation indicates the minimum performance of the model; its true performance is better and achieves lower dispersions.
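A k-fold cross-validation reporting MAE and RMSE averaged over the folds can be sketched as follows; the fold-wise procedure mirrors the 10-fold validation described above, while the model factory and data arrays are placeholders.

```python
# Sketch of a 10-fold cross-validation with MAE and RMSE averaged over folds.
import numpy as np
from sklearn.model_selection import KFold
from sklearn.metrics import mean_absolute_error, mean_squared_error

def cross_validate(model_factory, X, y, k=10):
    maes, rmses = [], []
    for train_idx, test_idx in KFold(n_splits=k, shuffle=True,
                                     random_state=42).split(X):
        model = model_factory()                 # fresh model for each fold
        model.fit(X[train_idx], y[train_idx])
        pred = model.predict(X[test_idx])
        maes.append(mean_absolute_error(y[test_idx], pred))
        rmses.append(np.sqrt(mean_squared_error(y[test_idx], pred)))
    return np.mean(maes), np.mean(rmses)

# Example call (X, y as numpy arrays prepared above):
# mae_cv, rmse_cv = cross_validate(lambda: XGBRegressor(random_state=42), X, y)
```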

Table 5 Performance metrics of the multi-layered ML model for simultaneous prediction of pH and dissolved O2 concentration determined based on a 10-fold cross-validation

As can be seen from Table 5, the iterative process of O2 prediction leads to particularly small performance measures, with a mean absolute error of < 4.50·10−2 (compared to < 1.57 for the non-iterative O2 prediction) and a root mean square error of < 2.12·10−1 (< 1.25 previously). This reduces the uncertainty of the O2 prediction, on average, to the second decimal place. As described in the supplemental information (cf. Section 4), the iterative approach for the prediction of dissolved O2 helps circumvent potential reabsorption and interference artifacts. In contrast, the performance measures for pH prediction are considerably worse, with a mean absolute error of < 1.96·10−1 and a root mean square error of < 4.42·10−1. Thus, the pH prediction is on average less accurate with this model and varies in the first decimal place. A detailed discussion of the deviation of the predicted pH values from the target pH values can be found in the supplemental information (see Section 3). As shown in Fig. 5, as well as in Figure S2 and Figure S3, the deviations occur mainly at lower pH values, where reabsorption effects and indicator interactions are more dominant. Furthermore, it should be noted that although the calibration was performed over the entire pH range, the dynamic range of the pH-sensing layer is limited to ± 2 pH units around the pKa value, i.e., to a range between pH 5 and 9. pH values outside this range are not considered reliable, even if the ML algorithm were able to find a pattern there. However, we have not yet performed additional experiments to investigate the limitations of the ML model in this regard and therefore recommend using the pH-sensing layer only within the known dynamic range. Moreover, where necessary, the performance of the pH prediction could be optimized with an additional model layer, as was done for the prediction of dissolved O2. It should be emphasized again that these values are minimum values; the actual performance of the model is better.

Fig. 5
figure 5

Overall model performance for predicting the pH (A) and the dissolved O2 concentration (B), respectively, assessed against test data that the algorithm has never seen before. The main plots compare the predicted and the respective target values for the entire calibration range using the multi-layered ML model based on XGBoost. The insets of each panel display the dispersion around the target value as black dotted markers; the target value is indicated as an orange solid line. C and D display examples of optode images before and after data analysis. C shows the absolute fluorescence intensity of the dual analyte optode at 773 nm, whereas D shows the chemical images in which the absolute fluorescence intensity has been translated into the corresponding pH and O2 concentration (in hPa) in each pixel

Conclusion

We have developed a novel approach to multi-analyte optical chemical imaging in 2D using a machine learning model and outlined the building process of the multi-layered ML model for complex and coupled data. While the dual analyte optode calibration data for simultaneous imaging of pH and dissolved O2 cannot be explained by conventional multivariate analysis methods, machine learning algorithms have proven useful. Consequently, we were able to build a three-layered model with individual pH and iterative O2 prediction based on a decision-tree-based algorithm (XGBoost). Figure 5C–D illustrates the conversion of the absolute fluorescence intensity emitted by the optode into the corresponding pH and concentration of dissolved O2 in each pixel. While dissolved O2 can thus be predicted with an average error of < 0.045 (MAE) and < 0.212 (RMSE), pH is predicted with an average error of < 0.196 (MAE) and < 0.442 (RMSE), respectively. In other words, the iterative prediction of dissolved O2 works excellently, while the pH prediction can be improved, if necessary, as outlined in the discussion.

While our contribution demonstrates the advantages of ML models for nested and intercoupled datasets that cannot be solved with conventional statistical models, we also highlighted the risks involved in the ML modeling process. For each step of ML model development, we pointed out different risks that researchers should consider when building their own ML model, including data preparation (outlier test vs. dimension reduction), establishing a balanced training dataset, and identifying and validating an appropriate ML model for the question at hand. Researchers need to be aware that while it is inherent to ML models to find patterns, it is our responsibility not to indulge in p-hacking or data dredging and follow patterns that in reality do not exist.

Despite the risks that come with ML modeling, we should dare to bring data analysis out of its shadowy existence and give it due attention if we want to advance multi-analyte imaging. From a practical perspective in particular, this approach appears very appealing. ML can help construct multi-analyte sensors from already existing indicators and circumvents the need to find indicators with limited spectral overlap or other types of interactions. Converting acquired images into quantitative data is often cumbersome, especially when additional signal deconvolution is required. ML algorithms show clear advantages in deciphering intercoupled datasets with high dimensionality and complexity, where human intuition and conventional methods fail to find the underlying correlations. However, we must not apply ML algorithms blindly, but must be aware of the possible biases and risks when setting up training and validation data.