Extended automated quantification algorithm (AQuA) for targeted 1H NMR metabolomics of highly complex samples: application to plant root exudates

Alexandersson, Elin; Sandström, Corine; Meijer, Johan; Nestor, Gustav; Broberg, Anders; Röhnisch, Hanna E.

doi:10.1007/s11306-023-02073-z

Original Article
Open access
Published: 23 December 2023

Volume 20, article number 11, (2024)

Metabolomics

Abstract

Introduction

The Automated Quantification Algorithm (AQuA) is a rapid and efficient method for targeted NMR-based metabolomics, currently optimised for blood plasma. AQuA quantifies metabolites from 1D-¹H NMR spectra based on the height of only one signal per metabolite, which minimises the computational time and workload of the method without compromising the quantification accuracy.

Objectives

To develop a fast and computationally efficient extension of AQuA for quantification of selected metabolites in highly complex samples, with minimal prior sample preparation. In particular, the method should be capable of handling interferences caused by broad background signals.

Methods

An automatic baseline correction function was combined with AQuA into an automated workflow, the extended AQuA, for quantification of metabolites in plant root exudate NMR spectra that contained broad background signals and baseline distortions. The approach was evaluated using simulations as well as a spike-in experiment in which known metabolite amounts were added to a complex sample matrix.

Results

The extended AQuA enables accurate quantification of metabolites in 1D-¹H NMR spectra with varying complexity. The method is very fast (< 1 s per spectrum) and can be fully automated.

Conclusions

The extended AQuA is an automated quantification method intended for 1D-¹H NMR spectra containing broad background signals and baseline distortions. Although the method was developed for plant root exudates, it should be readily applicable to any NMR spectra displaying similar issues as it is purely computational and applied to NMR spectra post-acquisition.

1 Introduction

Nuclear magnetic resonance (NMR) spectroscopy is commonly used in metabolomics for identification and quantification of metabolites in different biological samples (Crook & Powers, 2020). NMR has many advantages; it is inherently quantitative, highly reproducible, non-destructive, and enables analysis of compounds with different chemical properties in one single experiment. However, the complex mixtures of natural products that are studied in metabolomics typically yield complicated 1D-¹H NMR spectra with extensive spectral overlap, which can make both identification and quantification of individual metabolites challenging. Spectral overlap occurs because one metabolite can generate several NMR signals, and signals from different compounds often appear at similar chemical shifts. The resulting signal interferences are especially problematic for quantitative studies because concentrations of individual metabolites will be overestimated unless the interferences are properly accounted for. Two-dimensional NMR experiments can be used to increase signal dispersion, but 2D spectra typically take longer both to acquire and to analyse than 1D spectra. Furthermore, quantification based on 2D spectra is not straightforward since the intensity of individual peaks is influenced by their coupling constants and transverse relaxation times. Accordingly, calibration with pure reference compounds, either externally or internally, is required for accurate quantification (Crook & Powers, 2020; Martineau et al., 2020). Therefore, 1D-¹H NMR experiments are still the most common in high-throughput studies and there continues to be a high demand for methods that can accurately quantify metabolites based on 1D-¹H spectra. Various approaches have been developed, both manual (Weljie et al., 2006) and automated (Zheng et al., 2011; Hao et al., 2012; Ravanbakhsh et al., 2015; Tardivel et al., 2017; Lefort et al., 2019; Häckl et al., 2021; Rout et al., 2023).

An Automated Quantification Algorithm (AQuA) for targeted metabolomics has previously been developed in our group (Röhnisch et al., 2018, 2021). This method quantifies metabolites from 1D-¹H NMR spectra using only one signal per metabolite, which reduces the computational time and workload substantially compared to e.g. curve-fitting quantification algorithms (Zheng et al., 2011; Hao et al., 2012; Ravanbakhsh et al., 2015; Tardivel et al., 2017). At the same time, AQuA corrects for signal interferences between different metabolites as well as inter-spectral variation in signal position. Currently, AQuA is optimised for ultra-filtered human plasma samples but it would be desirable to extend its use to other, more heterogeneous, sample types as well, preferably without any time-consuming sample preparation.

Whereas human blood plasma and serum are well studied by NMR and the majority of signals have been assigned (Psychogios et al., 2011; Nagana Gowda et al., 2015), many other biological samples are less well characterised. Plant samples, for example, are very complex with numerous different metabolites of widely different concentrations, which complicates NMR analysis (Deborde et al., 2017). In the present study, aqueous oilseed rape (Brassica napus) root exudate samples were used as a model system to develop the proposed workflow (Fig. 1). Root exudates consist of all substances that are excreted by plant roots during growth, including sugars, organic acids, and amino acids (Vives-Peris et al., 2020). In addition, the samples used in this study all contained various unknown compounds, likely lipids, that gave rise to broad signals in the spectra (Fig. 1b and c). Before accurate quantification can be performed, these signals need to be accounted for in some way.

In blood plasma and serum, macromolecules giving rise to broad signals are routinely removed by ultrafiltration or precipitation with organic solvents before NMR analysis (Daykin et al., 2002; Nagana Gowda & Raftery, 2014). Other options are to use certain NMR experiments that target broad signals, such as the Carr-Purcell-Meiboom-Gill (CPMG) pulse sequence (Carr & Purcell, 1954; Meiboom & Gill, 1958) or diffusion-edited experiments (Liu et al., 1996; de Graaf & Behar, 2003; Bliziotis et al., 2020). There are also methods solely based on computations, such as the Small Molecule Enhancement Spectroscopy (SMolESY) method (Takis et al., 2020, 2021) that utilises the first derivative of the imaginary part of the NMR data to generate a spectrum devoid of broad signals. SMolESY is capable of performing automated relative quantification in blood samples, but for more complex spectra remaining metabolite signal interferences may appear. Because the NMR signals in a SMolESY spectrum are not Lorentzian shaped, standard spectral libraries cannot be used to model these interferences to obtain absolute concentrations. Another strategy is to include broad signals in the quantification methods, either by modelling them as signals using e.g. wavelets (Hao et al., 2012) or Lorentzians (de Graaf et al., 2015), or by treating them as baseline distortions and removing their interference by approximating a baseline correction function through the broad signal (Zheng et al., 2011; Jacob et al., 2017). Most of these methods are developed for plasma, but could potentially also be applied to the plant root exudate samples used as test system in the current study.

The aim of the current study was to develop a rapid, straightforward, and computationally efficient extension of AQuA for absolute quantification of selected metabolites in highly complex spectra containing broad background signals. The method should require minimal sample preparation without compromising the quantitative accuracy. For reasons of speed and computational cost, we decided to remove the interferences caused by the broad signals before the AQuA computation. This was done using an automatic baseline correction function; here we employed the widely used adaptive iteratively reweighted penalised least squares (airPLS) algorithm (Zhang et al., 2010). The combined method, called extended AQuA, was evaluated using simulations as well as a spike-in experiment performed in a complex sample matrix. This evaluation showed that the approach is accurate, linear, and robust. Furthermore, the proposed workflow is fast and flexible and can easily be fine-tuned for individual samples.

2 Materials and methods

2.1 Root exudate collection

Seeds of various spring varieties of oilseed rape (Brassica napus) were kindly provided by Scandinavian Seed AB and Lantmännen Seed AB. All glassware was rinsed extensively with MilliQ water and autoclaved before use to minimise traces of detergents. Seeds were surface sterilised (10% chlorine bleach for 5 min with mild shaking) and then rinsed with autoclaved MilliQ water four times. The seeds were germinated on petri dishes containing 0.5× Murashige-Skoog medium, including vitamins (MS0222, Duchefa Biochemie B.V., Haarlem, Netherlands) and 0.6% bacto agar, in a growth chamber at 22/20 °C (day/night), 16/8 h photoperiod with 110 µE. After three to five days of germination, when cotyledons and rootlets were expanded, plantlets (n = 8) were transferred to sterile plastic nets attached to 50 ml plastic tubes filled with autoclaved MilliQ water, so that the seedling roots were immersed into the water. This procedure was done in a sterile laminar flow hood. The samples were placed in a sterilised transparent plastic box and kept for four days with slow agitation in a growth chamber at 22/20 °C (day/night), 16/8 h photoperiod with 110 µE. Exudates were collected into glass bottles in a sterile laminar flow hood, shell frozen and lyophilised in darkness. Aliquots of the exudates were spread on plates containing LB agar or 0.5× Murashige-Skoog agar and stored for 48 h to assess any microbial contamination. Blank samples did not contain any seedlings but were otherwise treated as described above.

Lyophilised root exudate and blank samples were dissolved in a few millilitres of MilliQ water, transferred to 15 ml plastic tubes, and dried in a vacuum centrifuge. Dried samples were stored in a desiccator until use.

2.2 Sample preparation

NMR samples were prepared in a similar fashion to a previously published protocol (Kim et al., 2010). All experimental work was performed at room temperature. To each sample, 750 µl KH₂PO₄ buffer in D₂O (45 mM, pD 7.0 (apparent pH 6.6)) containing approximately 0.29 mM DSS-d₆ (sodium 3-(trimethylsilyl)propane-1-sulfonate-d₆) was added. The samples were vortexed for 30 s followed by 10 min of ultrasonication. This procedure was repeated once. The samples were then transferred to 1.5 ml plastic tubes and centrifuged for 10 min at 17 000×g. For each sample, 600 µl of the supernatant was added to a 5 mm NMR tube.

2.3 NMR spectroscopy and spectral processing

NMR spectra were acquired on a Bruker Avance III 600 MHz spectrometer with a 5 mm ¹H/¹³C/¹⁵N/³¹P inverse detection cryoprobe equipped with a z gradient. 1D-¹H NMR spectra (256 transients) were recorded at 25 °C using a NOESY presaturation pulse sequence (Bruker’s noesypr1d) with 1 s relaxation delay, 100 ms mixing time, 4.5 s acquisition time, and 12 ppm spectral width, to enable absolute quantification based on the Chenomx library. 65 536 data points were collected and the carrier frequency was placed on the HDO signal (4.70 ppm). After acquisition, an exponential line broadening of 0.3 Hz was applied and the spectral quality was evaluated by assessing the full width at half maximum (FWHM) of the DSS signal. If FWHM_DSS was greater than 1.20 Hz, a new spectrum was recorded. Spectra were processed (zero-filling, line broadening, phase correction, crude baseline correction) using the Chenomx NMR Suite Professional software package (version 8.6, Chenomx Inc., Edmonton, Canada). The line-broadening factor was adjusted for each spectrum to obtain FWHM_DSS = 1.20 Hz. If necessary, a crude baseline correction was applied to obtain a flat baseline around the internal standard signal before determining FWHM_DSS. The processed spectra were subjected to spectral binning (−0.50 to 4.68 ppm and 4.98 to 10.00 ppm, 0.0002 ppm/bin, 51 000 bins in total) and imported to MATLAB (version R2020a, MathWorks Inc., Natick (MA), USA).
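As a quick consistency check on the binning described above, the bin count follows directly from the two retained regions and the bin width. The short Python sketch below is illustrative only; the variable names are hypothetical and not part of the published workflow.

```python
# Retained spectral regions in ppm (the gap around the HDO signal is excluded)
regions = [(-0.50, 4.68), (4.98, 10.00)]
bin_width = 0.0002  # ppm per bin

# round() guards against floating-point error in the division
bins_per_region = [round((hi - lo) / bin_width) for lo, hi in regions]
total_bins = sum(bins_per_region)
print(bins_per_region, total_bins)
```

The two regions contribute 25 900 and 25 100 bins, which together give the 51 000 bins stated in the text.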

To verify metabolite identification, ¹H,¹H-TOCSY (Bruker pulse sequence dipsi2gpphpr) and ¹H,¹³C-HSQC (Bruker pulse sequence hsqcedetgpsisp.2) spectra were recorded for some of the samples. These spectra were processed with TopSpin 4.0.6 (Bruker BioSpin).

2.4 Metabolite identification and quantification

AQuA does not attempt automated metabolite identification, hence metabolite signals have to be selected prior to AQuA computation. Here, identification of metabolites was based on previous literature (Vives-Peris et al., 2020) and reference NMR spectra included in the Chenomx library. The identity of the metabolites was verified with ¹H,¹H-TOCSY and ¹H,¹³C-HSQC NMR spectra recorded for some of the samples. ¹³C NMR chemical shifts were compared with those available in the Biological Magnetic Resonance Data Bank (Ulrich et al., 2008). Only metabolites verified by 2D NMR experiments, or displaying an excellent fit for several signals with the Chenomx library, were included in the quantification model. Furthermore, only primary metabolites were included since one of the aims was to develop a method capable of quantifying only a subset of all metabolites in an NMR spectrum.

Binned processed NMR spectra (see Sect. 2.3) were imported to MATLAB and subjected to the airPLS algorithm (Zhang et al., 2010) to fine-tune the baseline where affected by irregularities or the presence of broad signals. As default, the airPLS smoothing factor λ was set to 1 × 10⁷, but a local value was determined for spectral regions where the default λ failed to yield a satisfactory baseline correction. The other parameters in the airPLS algorithm were kept at their default values (order = 2, weight exception proportion = 0.1, asymmetry parameter = 0.05, and maximum number of iterations = 20). The airPLS algorithm, using the optimised λ values, was incorporated in an automated joint workflow with AQuA in MATLAB. This workflow is referred to as the extended AQuA. Metabolite quantification using AQuA was performed on the corrected spectra according to the strategy previously described (Röhnisch et al., 2018), using the Chenomx library as a basis to model metabolite signals. In total, 24 metabolites were targeted for quantification, including various amino acids, organic acids, and sugars (Table S1). One reporter signal to be used for quantification was selected for each metabolite (Table S1). Additionally, a few unknown signals were included in the model as Lorentzians generated in Chenomx (Fig. S1).
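To illustrate the kind of baseline correction performed in this step, the following is a minimal Python sketch of an airPLS-style iteration (Zhang et al., 2010). It is not the MATLAB code used in the study: it uses dense NumPy matrices for brevity (a sparse solver would be needed for 51 000-point spectra), it omits the weight exception proportion and asymmetry parameter, and the function names and the synthetic test spectrum are hypothetical. Because λ depends on the point density, the demo uses a smaller λ than the default quoted above.

```python
import numpy as np

def whittaker_smooth(x, w, lam, order=2):
    """Weighted penalised least squares: minimise
    sum(w * (x - z)**2) + lam * sum((D z)**2), with D the order-th difference."""
    n = x.size
    d_mat = np.diff(np.eye(n), n=order, axis=0)   # (n - order) x n difference operator
    a_mat = np.diag(w) + lam * d_mat.T @ d_mat
    return np.linalg.solve(a_mat, w * x)

def airpls(x, lam=1e7, order=2, itermax=20):
    """Simplified airPLS: iteratively down-weight points lying above the fit."""
    x = np.asarray(x, dtype=float)
    w = np.ones(x.size)
    z = x.copy()
    for i in range(1, itermax + 1):
        z = whittaker_smooth(x, w, lam, order)
        d = x - z
        below = d < 0                      # points under the current baseline
        dssn = abs(d[below].sum())
        if not below.any() or dssn < 0.001 * np.abs(x).sum():
            break                          # negligible negative residual: converged
        w[~below] = 0.0                    # treat points above the fit as signal
        w[below] = np.exp(i * np.abs(d[below]) / dssn)
        w[0] = w[-1] = np.exp(i * d[below].max() / dssn)
    return z

# Synthetic demo (hypothetical data): broad Gaussian hump plus one narrow Lorentzian
idx = np.arange(400)
background = 2.0 * np.exp(-0.5 * ((idx - 200) / 80.0) ** 2)
peak = 5.0 * 2.0 ** 2 / ((idx - 200) ** 2 + 2.0 ** 2)   # true height 5.0
spectrum = background + peak

baseline = airpls(spectrum, lam=1e6)
corrected = spectrum - baseline
print(spectrum[200], corrected[200])  # raw vs. corrected height at the peak position
```

In this toy case the raw peak height is inflated by the broad hump, and subtracting the fitted baseline brings it closer to the true height; how close depends on λ, which is why the text stresses region-specific optimisation.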

2.5 Simulations

A simple smoothing algorithm developed in-house was applied to one root exudate spectrum to model the spectral background. The algorithm was built in MATLAB based on the ‘smooth’ function. In short, the following steps were employed: (1) localisation of narrow high-intensity signals (spikes), (2) determination of spike borders, (3) spike depletion by linear regression inside spike borders, and (4) average-based smoothing of the spike-depleted spectrum (for more information and a visual description of the process, see Supplementary Information Sect. 3, especially Fig. S5). In the final step, three levels of smoothing (low, medium, and high) were used to obtain three distinct spectral background models (referred to as A, B, and C, respectively, see Fig. S6). Normalised reference spectra of 24 metabolites (Table S1) were summed together and added to each spectral background in seven different scaling levels, thus yielding 21 simulated spectra. The spectra were corrected with the airPLS algorithm using three different λ values (1 × 10⁶, 1 × 10⁷, and 1 × 10⁸) applied to the whole spectra. Peak picking of one signal per metabolite was performed as previously described (Röhnisch et al., 2018) to obtain signal intensities in the corrected spectra.
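The four steps above can be sketched roughly as follows. This Python illustration is an assumption-laden simplification, not the in-house MATLAB algorithm (which is documented in the Supplementary Information): spikes are detected with a fixed, hypothetical threshold, spike depletion uses a straight line between the border points rather than a fitted regression, and the smoothing spans and scaling levels are made-up values.

```python
def deplete_spikes(y, threshold):
    """Steps 1-3 (simplified): locate runs of points above `threshold` and
    replace each run by a straight line between its two border points."""
    out = list(y)
    n = len(y)
    i = 0
    while i < n:
        if y[i] <= threshold:
            i += 1
            continue
        start = i
        while i < n and y[i] > threshold:
            i += 1
        # Border values just outside the run; fall back gracefully at the edges
        left = y[start - 1] if start > 0 else (y[i] if i < n else threshold)
        right = y[i] if i < n else left
        width = i - start + 1
        for k in range(start, i):
            t = (k - start + 1) / width
            out[k] = left + t * (right - left)
    return out

def moving_average(y, half_width):
    """Step 4: average-based smoothing with a window of 2*half_width + 1 points."""
    n = len(y)
    out = []
    for i in range(n):
        lo, hi = max(0, i - half_width), min(n, i + half_width + 1)
        out.append(sum(y[lo:hi]) / (hi - lo))
    return out

# Toy data (hypothetical): flat background with one two-point spike
exudate = [1, 1, 1, 10, 12, 1, 1, 1]
depleted = deplete_spikes(exudate, threshold=5)

# Three smoothing levels give three background models (A = low, B = medium, C = high)
backgrounds = {name: moving_average(depleted, hw)
               for name, hw in (("A", 1), ("B", 2), ("C", 3))}

# Summed narrow-signal spectrum added at seven (hypothetical) scaling levels
narrow = [0, 0, 0, 1, 0.5, 0, 0, 0]
scales = [0.25, 0.5, 1, 2, 4, 8, 16]
simulated = {(name, s): [b + s * v for b, v in zip(bg, narrow)]
             for name, bg in backgrounds.items() for s in scales}
print(len(simulated))  # 3 backgrounds x 7 scaling levels
```

Because the narrow-signal contribution of each simulated spectrum is known exactly, the heights recovered after baseline correction can be compared directly with their true values.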

2.6 Spike-in experiment

Six of the analysed root exudate samples were pooled together and then divided into five portions. Five metabolites (γ-aminobutyric acid (GABA), DL-asparagine, L(+)-tartaric acid, L-threonine, and D-xylose) not present in the pooled sample were added at different concentrations. As a control, five identical blank samples were spiked in the same way. The chosen metabolites have signals in different spectral regions with different multiplicities and differ in how much they are affected by broad signals or baseline distortions. The large variation in concentration (10–3200 µM) between the spiked metabolites reflects the large dynamic range observed in the experimental data set, both between different metabolites in the same sample and between the same metabolite in different samples. See Supplementary Information, Sect. 4.1, for more details about the design of the spike-in experiment.

The spiked root exudate samples were analysed as described above, i.e. NMR analysis, spectral processing, and metabolite quantification using an airPLS-extended AQuA, which had been adjusted to include all spiked metabolites (Table S1).

The spectra of the spiked blank samples were carefully baseline corrected in Chenomx. For the analysis of these spectra, the airPLS step was omitted and an AQuA that only targeted the five spiked metabolites plus lactic acid was used to calculate metabolite concentrations.

3 Results and discussion

3.1 Extended AQuA: workflow, parameter optimisation, and general considerations

The 1D-¹H NMR spectra of oilseed rape root exudates displayed baseline irregularities, including broad background signals, that would impair metabolite concentration estimates if not properly accounted for (Fig. 1). The broad signals in the low-frequency part of the spectra were the most problematic distortions, due to their interference with several amino acid signals. Different methods for elimination of the baseline distortions were evaluated, utilising sample preparation, spectral editing, and computations, respectively (see Supplementary Information, Sect. 2). It was found that an automatic baseline correction function such as the airPLS algorithm (Zhang et al., 2010) could be employed to yield root exudate spectra suitable for targeted metabolomics, i.e. with well-preserved metabolite signal line shapes, a flat baseline, no pronounced residual broad signals, and no severe intensity modulation (see Fig. S2). Manual baseline correction was not considered feasible due to the complexity of the spectra.

The airPLS algorithm was combined with AQuA into a joint automated workflow, i.e. the extended AQuA, for quantification of metabolites in experimental ¹H NMR spectra of root exudates, acquired with minimal prior sample preparation (Fig. 2). The identity of the metabolites was confirmed with ¹H,¹H-TOCSY and ¹H,¹³C-HSQC experiments. Because the AQuA quantification is based on just one signal per metabolite, the airPLS algorithm was used to obtain a good baseline around these signals only, rather than aiming for a perfect baseline in the entire spectrum. The spectral library used here was created from Chenomx but other sources, e.g. in-house libraries, can be used instead if desired.

For the airPLS algorithm to work properly, the smoothing factor λ needs to be optimised. This parameter, which can be set to any value between 1 and 1 × 10⁹ (Zhang et al., 2010), strongly affects the result of the baseline correction. If λ is set too high, the fitted baseline does not include enough of the background, whereas if it is set too low, the algorithm starts to remove parts of the metabolite signals (Fig. 3). Here, due to the non-uniform distribution of broad signals and other baseline distortions, a single λ value was not used for an entire spectrum; instead, different λ values were used for different spectral regions (see Sect. 3.2). Despite the virtually unlimited number of options, it was neither difficult nor time-consuming to find suitable λ values. Importantly, the optimised λ values could be kept fairly constant throughout each data set and could thereby be included in the automated workflow. Before applying the extended AQuA to a data set, the result of the baseline correction should be assessed carefully on a representative subset of the spectra, although one has to keep in mind that the procedure is inevitably an estimation and may not exactly match the actual baseline of the spectrum. However, this is true for all baseline correction methods, regardless of whether they are manual or automated.

In addition to baseline distortions, interference can also be caused by spectral overlap with narrow unknown signals. In the current study, the aim was to quantify a preselected subset of metabolites while leaving remaining signals in the spectra untargeted. However, other signals that interfere with the metabolite signals used in AQuA need to be included in the quantification model to avoid overestimating the metabolite concentrations. Here, four unknown signals between 0.93 and 0.97 ppm were added to the quantification model as single Lorentzians to obtain a more accurate concentration estimate of leucine based on the signal at 0.96 ppm (Table S1 and Fig. S1).
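The effect of including an interfering signal in the model can be illustrated with a two-signal toy example. The Python sketch below assumes that overlapping signals add linearly and that each is a single Lorentzian; the exact formulation used by AQuA is given in Röhnisch et al. (2018) and is not reproduced here. The position of the unknown signal, the heights, and all names are hypothetical; the line width corresponds to FWHM = 1.20 Hz at 600 MHz (0.002 ppm).

```python
def lorentzian(ppm, centre, hwhm):
    """Unit-height Lorentzian line shape evaluated at `ppm`."""
    return hwhm ** 2 / ((ppm - centre) ** 2 + hwhm ** 2)

HWHM = 0.001         # ppm, half of FWHM = 1.20 Hz at 600 MHz
TARGET_PPM = 0.960   # reporter signal position (treated as one Lorentzian here)
UNKNOWN_PPM = 0.955  # hypothetical position of an interfering unknown signal

# a[i][j]: height contributed at position i by a unit-height signal j
positions = (TARGET_PPM, UNKNOWN_PPM)
a = [[lorentzian(p, c, HWHM) for c in positions] for p in positions]

# Made-up true heights: weak target next to a five-fold stronger unknown
true_heights = (1.0, 5.0)
observed = [sum(a[i][j] * true_heights[j] for j in range(2)) for i in range(2)]

# Ignoring the unknown: the observed height is taken at face value
naive_target = observed[0]

# Including the unknown: solve the 2 x 2 linear system by Cramer's rule
det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
target = (observed[0] * a[1][1] - a[0][1] * observed[1]) / det
unknown = (a[0][0] * observed[1] - observed[0] * a[1][0]) / det
print(naive_target, target, unknown)
```

In this toy case the tail of the unknown signal inflates the naive target height by roughly 19%, whereas solving for both signals together recovers the true heights.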

3.2 Evaluation of the extended AQuA

3.2.1 Simulations

The extended AQuA was first evaluated using simulated spectra of root exudates where the contributions of the broad signal background and the narrow metabolite signals were exactly known (Fig. 4). To test how well the method can handle different types of spectra, three different spectral background models (A, B, and C) with varying smoothness were created (Figs. S5 and S6) and a simulated narrow signal spectrum was added to the backgrounds in seven different intensity levels. In total, 21 simulated spectra were thus obtained with differences in their spectral backgrounds as well as in their ratio between narrow and broad signals (Figs. S7–S10 and Table S2). For reference, Fig. 4a depicts the simulated spectrum created with the medium-smooth background B (Fig. 4b) and an intermediate intensity of the narrow signal spectrum (Fig. 4c). The airPLS algorithm was applied three times to all spectra, with three different λ values, to evaluate the robustness of the method. The signal heights in the airPLS corrected spectra (Fig. 4d) were compared to those in the corresponding narrow signal spectra (Fig. 4c) using linear regression. Thereby, it was possible to precisely assess how well the airPLS algorithm could remove interferences caused by broad signals and baseline irregularities, and to what extent the narrow signal part of the spectrum was affected by the procedure.

In general, the agreement between the intensities in the baseline corrected spectra and the original narrow signal spectra was good for the signals used in AQuA, as indicated by slopes and R² coefficients close to one and intercepts close to zero (Fig. 5 and Table S3). This suggests that the airPLS feature specifically corrected the baseline and removed broad background signals without notably affecting the selected metabolite signals. Percentage differences (Table S3) were calculated to condense the accuracy estimate into a single variable. For most metabolites, the difference was less than 10% with at least one of the λ values. The smoother backgrounds B and C were easier to fit than the rougher background A, hence the smaller intercepts (Fig. 5b and c). Overall, when the airPLS algorithm was applied to spectra created using background A the λ value needed to be smaller than for spectra based on background B or C. For metabolite signals situated in spectral regions without background interference, e.g. formic acid and fumaric acid, the accuracy was good for all spectra regardless of which λ value was being used (Fig. 5 and Table S3). In contrast, some metabolites had signal intensities in the corrected spectra that deviated substantially from their true values. Often, this coincided with a pronounced interference from the spectral background (see Table S2 and Fig. S10). For example, the signals of fructose, glyceric acid, lactic acid, and threonine were all highly influenced by the spectral background and so the quantification accuracy of these metabolites was strongly dependent on the performance of the baseline correction. Because of the large variation in signal intensity in the simulated spectra, the results could have been more accurate if the λ value had been optimised for each individual spectrum (see next section). However, in an experimental data set, the inter-spectral variation is usually not as big. 

Furthermore, the quantification accuracy for a given metabolite generally increased with increasing signal intensity relative to the spectral background. Thus, the lower the intensity of the narrow signals and the higher the intensity of the background, the more critical it becomes to optimise the method parameters to avoid quantification errors. Ideally, the signals used in the AQuA computation should all have a low degree of interference and a high signal-to-noise ratio (Röhnisch et al., 2018); however, this is not possible for all metabolites. Still, the proposed method appears to be both linear and accurate for most metabolites.

3.2.2 Spike-in experiment

A spike-in experiment was conducted to further evaluate the extended AQuA (Tables S4, S5 and Fig. S11). Five metabolites (asparagine, GABA, tartaric acid, threonine, and xylose) were added both to blank samples and to aliquots of a pooled root exudate sample in concentrations above the limit of quantification (10 × S/N) for each metabolite. The blank spectra displayed minimal signal interference and lacked the broad background signals and baseline distortions that were present in the root exudate spectra. Therefore, these spectra were only subjected to manual baseline correction before the AQuA computation. The spiked root exudate spectra, on the other hand, were baseline corrected with the airPLS algorithm to remove broad background signals. Here, the default λ value gave a satisfactory correction for all metabolites except threonine and GABA, as evaluated by manual inspection. Threonine was the most challenging metabolite to quantify in the spiked root exudate spectra because its selected signal overlapped both with the signal of the methyl group of lactic acid and with a broad signal that was not assigned unambiguously but can be tentatively attributed to a lipid methylene signal (Fig. S11). The latter could not be correctly suppressed unless a lower λ value was used (see Fig. 3). The GABA signal is a broad quintet whose intensity was slightly reduced with the default λ value because the fitted baseline removed a small portion of the signal (Fig. S12). Therefore, the size of λ was increased for this spectral region.

After baseline correction, AQuA computation was performed on both the spectra from the spiked root exudates and the spiked blank samples, and the results were compared with each other using linear regression as well as percent differences (Table 1). The calculated concentrations are listed in Table S6. Because the same amounts of metabolites were added to both sample sets, all slopes should theoretically be equal to one, and all intercepts should be equal to zero as none of the spiked metabolites were present in the samples initially. However, since the sample matrices differed somewhat and all metabolite additions were done manually, some deviations could be expected. Still, as shown in Table 1, the R² values were > 0.999, all intercepts were close to the origin, and the percent differences were generally small. This was in agreement with the results from the simulations. To enable comparison of the intercepts amongst the different metabolites despite the big differences in concentration, the intercepts are reported both as the actual value and as percent of the highest concentration for each metabolite. The tartaric acid signal was consistently more intense in the spectra of the spiked root exudate samples than in the spectra of the corresponding blank samples (Fig. S13), hence the large slope and percent differences. An experimental error probably occurred when tartaric acid was added to the root exudate samples since the calculated concentration of tartaric acid in the blank samples, but not the root exudate samples, agreed well with the actual concentrations (Tables S6–S8). The values of the intercepts and slopes for the other metabolites indicated that there was no clear, systematic over- or underestimation of the concentrations obtained using the proposed method compared to when the same metabolites, in the absence of baseline distortions, were quantified with the non-extended AQuA.
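The comparison described above amounts to an ordinary least-squares fit of the root exudate concentrations against the blank concentrations. The Python sketch below shows one way to compute the slope, intercept, R², and the intercept expressed as a percentage of the highest concentration. The concentration values are invented for illustration and are not taken from Table S6; the function name and the choice of the blank series as the reference for the percentage are assumptions.

```python
def linear_fit(x, y):
    """Ordinary least squares fit y = slope * x + intercept, with R^2."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot

# Hypothetical concentrations (µM) of one spiked metabolite in the five samples
blank = [10.0, 50.0, 100.0, 500.0, 1000.0]     # spiked blank samples
exudate = [12.0, 52.0, 102.0, 502.0, 1002.0]   # spiked root exudate samples

slope, intercept, r_squared = linear_fit(blank, exudate)

# Intercept relative to the highest concentration, to compare across metabolites
intercept_percent = 100.0 * intercept / max(blank)
print(slope, intercept, r_squared, intercept_percent)
```

Expressing the intercept relative to the highest concentration makes a 2 µM offset for a metabolite spiked up to 1000 µM comparable with offsets for metabolites spiked at much lower or higher levels.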

Table 1 Comparison of the concentrations obtained for the spiked blank samples and the concentrations obtained for the spiked root exudate samples^a

3.3 Application to plant root exudates

The extended AQuA was applied to a data set consisting of 50 NMR spectra from oilseed rape root exudates and 7 blank spectra. Concentration estimates were computed for 24 metabolites (Table S1). Additionally, four unknown signals were included to model signal interferences (Fig. S1) but they were not quantitatively interpreted.

The extended AQuA process (i.e. baseline correction followed by the quantification of 24 target metabolites) applied to all 57 spectra was typically completed in less than 30 s on a standard personal computer. The same method parameters were used for all spectra. In addition to the default λ value, two local values were used in the airPLS baseline correction (λ = 1 × 10⁶ for the spectral region 0.899–0.967 ppm and λ = 1 × 10⁵ for the region 1.225–1.334 ppm). If only one λ value was used, the total computation time decreased to around 10 s. It has been shown that AQuA requires less than one second to quantify 67 metabolites in 1342 spectra (Röhnisch et al., 2018). Introducing the airPLS step thus increases the computation time but the combined method is still very rapid. Because the airPLS algorithm is the rate-limiting step, the computation time increases notably with the number of spectra and λ values whereas it is negligibly affected by the number of metabolites targeted for quantification.
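The region-specific λ values quoted above can be represented as a simple lookup, shown here as a hypothetical Python sketch. How the regional baselines are computed and combined in the published MATLAB workflow is not described in this section, so only the parameter selection and the per-spectrum timing arithmetic are illustrated; all names are assumptions.

```python
DEFAULT_LAMBDA = 1e7

# (ppm range, local lambda) pairs as reported for this data set
LOCAL_LAMBDAS = [
    ((0.899, 0.967), 1e6),
    ((1.225, 1.334), 1e5),
]

def lambda_for(ppm):
    """Return the airPLS smoothing factor to use at a given chemical shift."""
    for (lo, hi), lam in LOCAL_LAMBDAS:
        if lo <= ppm <= hi:
            return lam
    return DEFAULT_LAMBDA

# Upper bound on the average run time: under 30 s for 57 spectra
seconds_per_spectrum = 30 / 57
print(lambda_for(0.96), lambda_for(1.33), lambda_for(3.00), seconds_per_spectrum)
```

The timing figure works out to roughly half a second per spectrum, consistent with the sub-second run time reported for the method.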

3.4 Advantages and limitations

The method described here allows for quantification of metabolites in complex spectra that contain broad signals and baseline distortions. Only minimal sample preparation is required and because the method is purely computational, knowledge about the compounds causing the broad signals is not needed. However, in case of binding interactions between metabolites and other compounds such as proteins, application of the method would be more challenging. The occurrence of such interactions can be estimated by assessing the line width and shape of the internal standard signal, since both DSS and TSP are known to interact with macromolecules (Bell et al., 1989; Kriat et al., 1992; Shimizu et al., 1994; Nowick et al., 2003). Here, both metabolite signals and the internal standard signal were narrow and symmetric, which indicated that no significant macromolecular interaction was taking place.

As shown here, the baseline correction method airPLS and the quantification method AQuA can be combined into a fully automated workflow, provided that prior metabolite identification and parameter optimisation have been conducted. Optimising the airPLS algorithm is straightforward and depends only on the parameter λ. Here, we did not strive for an optimal baseline in the whole spectrum but only in regions containing signals used in AQuA, which reduced the optimisation time and effort. The combined method is extremely fast and typically requires less than one second per spectrum. This is due partly to the sparse matrix character of the airPLS algorithm but, more importantly, to the AQuA data reduction strategy. AQuA only considers a set of pre-selected signals in the quantitative process, one for each metabolite, which facilitates very rapid computations whilst still accounting for interferences between metabolites.
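
One way to picture the data reduction is as a small linear system: with one selected signal per metabolite, the measured heights are modelled as a sum of contributions from all metabolites at those positions, and the concentrations follow from a single solve. The sketch below is an interpretation for illustration, not the published AQuA implementation. The `library` matrix, the three hypothetical metabolites and all numerical values are assumptions.

```python
def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


# Hypothetical library matrix: entry [i][j] is the height that one unit of
# metabolite j contributes at the position of the signal chosen for metabolite i.
library = [
    [1.00, 0.20, 0.00],
    [0.00, 1.00, 0.10],
    [0.05, 0.00, 1.00],
]
true_conc = [2.0, 1.0, 4.0]

# Heights that would be measured at the three selected positions.
heights = [sum(library[i][j] * true_conc[j] for j in range(3)) for i in range(3)]

# Ignoring interferences overestimates every concentration in this example.
naive = [heights[i] / library[i][i] for i in range(3)]

# Solving the full system recovers the concentrations used to build the data.
solved = solve_linear(library, heights)
```

The size of the system equals the number of targeted signals rather than the number of spectral data points, which is why this step remains cheap even for large metabolite panels.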

Here, we chose to use the airPLS algorithm for baseline correction but it is possible to use other methods instead, as long as they are compatible with AQuA. The metabolite library can also be exchanged if e.g. in-house spectral libraries are preferred.

There are also some limitations of the method. Relying on the height of a single metabolite signal for deriving concentrations may make the method more sensitive to systematic errors caused by database discrepancies than when several signals are used (see Supplementary Information, Sect. 4.2). However, this has not been fully evaluated, nor have we investigated whether other quantification methods are less susceptible to this kind of error. Since AQuA is not an identification method, there is also a risk of erroneous metabolite quantification if the chemical shift windows have not been properly selected or if there are unknown signals present in some spectra that have not been accounted for. If a signal from another compound (metabolite or impurity) with higher intensity than the intended metabolite signal resides in the chemical shift window, the algorithm will pick this signal for quantification instead. For reliable results, metabolite identification should ideally be assessed manually. However, the problem of possible false identification of metabolites is not unique to AQuA, especially when the targeted metabolite signals are singlets.
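
The window-related failure mode can be reproduced with a few lines of Python. The sketch below assumes that the signal is taken as the highest data point inside the chemical shift window, as described above. The function name `pick_signal`, the Lorentzian line shapes and all shifts, widths and heights are hypothetical.

```python
def pick_signal(ppm, intensity, window):
    """Return (position, height) of the highest point inside a chemical shift window."""
    lo, hi = min(window), max(window)
    inside = [(h, p) for p, h in zip(ppm, intensity) if lo <= p <= hi]
    if not inside:
        raise ValueError("no data points inside the window")
    height, position = max(inside)
    return position, height


def lorentzian(p, centre, width, height):
    """Lorentzian line with the given centre, half-width and height."""
    return height / (1.0 + ((p - centre) / width) ** 2)


ppm_axis = [1.30 + i / 1000 for i in range(41)]  # 1.300 to 1.340 ppm
window = (1.31, 1.34)

# Spectrum containing only the intended metabolite signal at 1.320 ppm.
clean = [lorentzian(p, 1.320, 0.002, 1.0) for p in ppm_axis]

# Same spectrum with a taller, unaccounted-for signal at 1.335 ppm.
contaminated = [c + lorentzian(p, 1.335, 0.001, 1.5) for p, c in zip(ppm_axis, clean)]

pos_clean, height_clean = pick_signal(ppm_axis, clean, window)
pos_bad, height_bad = pick_signal(ppm_axis, contaminated, window)
```

In the contaminated spectrum the picked position moves to the interfering signal, which is the situation that narrower windows or manual inspection are meant to catch.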

4 Conclusions

We have here presented a fast and accurate approach for automated quantification of selected metabolites in complex NMR spectra. The spectra of minimally handled plant root exudate samples were successfully analysed with the proposed method, despite the presence of unknown broad signals, baseline distortions, and extensive spectral overlap. Although not evaluated here, the method is theoretically applicable to any spectrum with similar characteristics, as long as the metabolite signals are unaffected by macromolecular interactions.

Data availability

The NMR data have been uploaded to the Swedish National Data Service and are available at https://doi.org/10.5878/8t82-d090.

References

Bell, J. D., Brown, J. C. C., & Sadler, P. J. (1989). NMR studies of body fluids. NMR in Biomedicine,2(5–6), 246–256. https://doi.org/10.1002/nbm.1940020513
Bliziotis, N. G., Engelke, U. F. H., Aspers, R. L. E. G., Engel, J., Deinum, J., Timmers, H. J. L. M., Wevers, R. A., & Kluijtmans, L. A. J. (2020). A comparison of high-throughput plasma NMR protocols for comparative untargeted metabolomics. Metabolomics,16(5), 64. https://doi.org/10.1007/s11306-020-01686-y
Carr, H. Y., & Purcell, E. M. (1954). Effects of diffusion on free precession in nuclear magnetic resonance experiments. Physical Review,94(3), 630–638. https://doi.org/10.1103/PhysRev.94.630
Crook, A. A., & Powers, R. (2020). Quantitative NMR-based biomedical metabolomics: Current status and applications. Molecules,25(21), 5128. https://doi.org/10.3390/molecules25215128
Daykin, C. A., Foxall, P. J. D., Connor, S. C., Lindon, J. C., & Nicholson, J. K. (2002). The comparison of plasma deproteinization methods for the detection of low-molecular-weight metabolites by ¹H nuclear magnetic resonance spectroscopy. Analytical Biochemistry,304(2), 220–230. https://doi.org/10.1006/abio.2002.5637
de Graaf, R. A., & Behar, K. L. (2003). Quantitative ¹H NMR spectroscopy of blood plasma metabolites. Analytical Chemistry,75(9), 2100–2104. https://doi.org/10.1021/ac020782+
de Graaf, R. A., Prinsen, H., Giannini, C., Caprio, S., & Herzog, R. I. (2015). Quantification of ¹H NMR spectra from human plasma. Metabolomics,11(6), 1702–1707. https://doi.org/10.1007/s11306-015-0828-1
Deborde, C., Moing, A., Roch, L., Jacob, D., Rolin, D., & Giraudeau, P. (2017). Plant metabolism as studied by NMR spectroscopy. Progress in Nuclear Magnetic Resonance Spectroscopy,102–103, 61–97. https://doi.org/10.1016/j.pnmrs.2017.05.001
Häckl, M., Tauber, P., Schweda, F., Zacharias, H. U., Altenbuchinger, M., Oefner, P. J., & Gronwald, W. (2021). An R-package for the deconvolution and integration of 1D NMR data: MetaboDecon1D. Metabolites,11(7), 452. https://doi.org/10.3390/metabo11070452
Hao, J., Astle, W., De Iorio, M., & Ebbels, T. M. D. (2012). BATMAN—An R package for the automated quantification of metabolites from nuclear magnetic resonance spectra using a bayesian model. Bioinformatics,28(15), 2088–2090. https://doi.org/10.1093/bioinformatics/bts308
Jacob, D., Deborde, C., Lefebvre, M., Maucourt, M., & Moing, A. (2017). NMRProcFlow: A graphical and interactive tool dedicated to 1D spectra processing for NMR-based metabolomics. Metabolomics,13(4), 36. https://doi.org/10.1007/s11306-017-1178-y
Kim, H. K., Choi, Y. H., & Verpoorte, R. (2010). NMR-based metabolomic analysis of plants. Nature Protocols,5(3), 536–549. https://doi.org/10.1038/nprot.2009.237
Kriat, M., Confort-Gouny, S., Vion-Dury, J., Sciaky, M., Viout, P., & Cozzone, P. J. (1992). Quantitation of metabolites in human blood serum by proton magnetic resonance spectroscopy. A comparative study of the use of formate and TSP as concentration standards. NMR in Biomedicine,5(4), 179–184. https://doi.org/10.1002/nbm.1940050404
Lefort, G., Liaubet, L., Canlet, C., Tardivel, P., Père, M. C., Quesnel, H., Paris, A., Iannuccelli, N., Vialaneix, N., & Servien, R. (2019). ASICS: An R package for a whole analysis workflow of 1D ¹H NMR spectra. Bioinformatics,35(21), 4356–4363. https://doi.org/10.1093/bioinformatics/btz248
Liu, M., Nicholson, J. K., & Lindon, J. C. (1996). High-resolution diffusion and relaxation edited one- and two-dimensional ¹H NMR spectroscopy of biological fluids. Analytical Chemistry,68(19), 3370–3376. https://doi.org/10.1021/ac960426p
Martineau, E., Dumez, J. N., & Giraudeau, P. (2020). Fast quantitative 2D NMR for metabolomics and lipidomics: A tutorial. Magnetic Resonance in Chemistry,58(5), 390–403. https://doi.org/10.1002/mrc.4899
Meiboom, S., & Gill, D. (1958). Modified spin-echo method for measuring nuclear relaxation times. Review of Scientific Instruments,29(8), 688–691. https://doi.org/10.1063/1.1716296
Nagana Gowda, G. A., Gowda, Y. N., & Raftery, D. (2015). Expanding the limits of human blood metabolite quantitation using NMR spectroscopy. Analytical Chemistry,87(1), 706–715. https://doi.org/10.1021/ac503651e
Nagana Gowda, G. A., & Raftery, D. (2014). Quantitating metabolites in protein precipitated serum using NMR spectroscopy. Analytical Chemistry,86(11), 5433–5440. https://doi.org/10.1021/ac5005103
Nowick, J. S., Khakshoor, O., Hashemzadeh, M., & Brower, J. O. (2003). DSA: A new internal standard for NMR studies in aqueous solution. Organic Letters,5(19), 3511–3513. https://doi.org/10.1021/ol035347w
Psychogios, N., Hau, D. D., Peng, J., Guo, A. C., Mandal, R., Bouatra, S., Sinelnikov, I., Krishnamurthy, R., Eisner, R., Gautam, B., Young, N., Xia, J., Knox, C., Dong, E., Huang, P., Hollander, Z., Pedersen, T. L., Smith, S. R., Bamforth, F., … Wishart, D. S. (2011). The human serum metabolome. PLoS ONE,6(2), e16957. https://doi.org/10.1371/journal.pone.0016957
Ravanbakhsh, S., Liu, P., Bjordahl, T. C., Mandal, R., Grant, J. R., Wilson, M., Eisner, R., Sinelnikov, I., Hu, X., Luchinat, C., Greiner, R., & Wishart, D. S. (2015). Accurate, fully-automated NMR spectral profiling for metabolomics. PLoS ONE,10(5), e0124219. https://doi.org/10.1371/journal.pone.0124219
Röhnisch, H. E., Eriksson, J., Müllner, E., Agback, P., Sandström, C., & Moazzami, A. A. (2018). AQuA: An automated quantification algorithm for high-throughput NMR-based metabolomics and its application in human plasma. Analytical Chemistry,90(3), 2095–2102. https://doi.org/10.1021/acs.analchem.7b04324
Röhnisch, H. E., Eriksson, J., Tran, L. V., Müllner, E., Sandström, C., & Moazzami, A. A. (2021). Improved automated quantification algorithm (AQuA) and its application to NMR-based metabolomics of EDTA-containing plasma. Analytical Chemistry,93(25), 8729–8738. https://doi.org/10.1021/acs.analchem.0c04233
Rout, M., Lipfert, M., Lee, B. L., Berjanskii, M., Assempour, N., Vazquez Fresno, R., Serra Cayuela, A., Dong, Y., Johnson, M., Shahin, H., Gautam, V., Sajed, T., Oler, E., Peters, H., Mandal, R., & Wishart, D. S. (2023). MagMet: A fully automated web server for targeted nuclear magnetic resonance metabolomics of plasma and serum. Magnetic Resonance in Chemistry. https://doi.org/10.1002/mrc.5371
Shimizu, A., Ikeguchi, M., & Sugai, S. (1994). Appropriateness of DSS and TSP as internal references for ¹H NMR studies of molten globule proteins in aqueous media. Journal of Biomolecular NMR,4(6), 859–862. https://doi.org/10.1007/BF00398414
Takis, P. G., Jiménez, B., Al-Saffar, N. M. S., Harvey, N., Chekmeneva, E., Misra, S., & Lewis, M. R. (2021). A computationally lightweight algorithm for deriving reliable metabolite panel measurements from 1D ¹H NMR. Analytical Chemistry,93(12), 4995–5000. https://doi.org/10.1021/acs.analchem.1c00113
Takis, P. G., Jiménez, B., Sands, C. J., Chekmeneva, E., & Lewis, M. R. (2020). SMolESY: An efficient and quantitative alternative to on-instrument macromolecular ¹H-NMR signal suppression. Chemical Science,11(23), 6000–6011. https://doi.org/10.1039/D0SC01421D
Tardivel, P. J. C., Canlet, C., Lefort, G., Tremblay-Franco, M., Debrauwer, L., Concordet, D., & Servien, R. (2017). ASICS: An automatic method for identification and quantification of metabolites in complex 1D ¹H NMR spectra. Metabolomics,13(10), 109. https://doi.org/10.1007/s11306-017-1244-5
Ulrich, E. L., Akutsu, H., Doreleijers, J. F., Harano, Y., Ioannidis, Y. E., Lin, J., Livny, M., Mading, S., Maziuk, D., Miller, Z., Nakatani, E., Schulte, C. F., Tolmie, D. E., Wenger, R. K., Yao, H., & Markley, J. L. (2008). BioMagResBank. Nucleic Acids Research,36(suppl_1), D402–D408. https://doi.org/10.1093/nar/gkm957
Vives-Peris, V., de Ollas, C., Gómez-Cadenas, A., & Pérez-Clemente, R. M. (2020). Root exudates: From plant to rhizosphere and beyond. Plant Cell Reports,39(1), 3–17. https://doi.org/10.1007/s00299-019-02447-5
Weljie, A. M., Newton, J., Mercier, P., Carlson, E., & Slupsky, C. M. (2006). Targeted profiling: Quantitative analysis of ¹H NMR metabolomics data. Analytical Chemistry,78(13), 4430–4442. https://doi.org/10.1021/ac060209g
Zhang, Z. M., Chen, S., & Liang, Y. Z. (2010). Baseline correction using adaptive iteratively reweighted penalized least squares. The Analyst,135(5), 1138–1146. https://doi.org/10.1039/B922045C
Zheng, C., Zhang, S., Ragg, S., Raftery, D., & Vitek, O. (2011). Identification and quantification of metabolites in ¹H NMR spectra by bayesian model selection. Bioinformatics,27(12), 1637–1644. https://doi.org/10.1093/bioinformatics/btr118

Acknowledgements

The authors thank Dr Jan Eriksson for giving valuable comments about the manuscript.

Funding

Open access funding provided by Swedish University of Agricultural Sciences. Strategic funding for metabolomics-based research was provided by the Swedish University of Agricultural Sciences.

Author information

Authors and Affiliations

Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala, Sweden
Elin Alexandersson, Corine Sandström, Gustav Nestor, Anders Broberg & Hanna E. Röhnisch
Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
Johan Meijer

Contributions

Conceptualization: CS, EA, HER; Methodology: AB, CS, EA, HER, GN, JM; Formal analysis: EA, HER; Investigation: EA, JM; Writing—original draft preparation: EA; Writing—review and editing: AB, CS, EA, HER, GN, JM; Funding acquisition: CS; Resources: CS, JM; Supervision: AB, CS, HER, GN, JM. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Elin Alexandersson.

Ethics declarations

Conflict of interest

There are no conflicts of interest to declare.

Ethical approval

This article does not contain any studies with human or animal participants performed by any of the authors.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 2820.5 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Alexandersson, E., Sandström, C., Meijer, J. et al. Extended automated quantification algorithm (AQuA) for targeted ¹H NMR metabolomics of highly complex samples: application to plant root exudates. Metabolomics 20, 11 (2024). https://doi.org/10.1007/s11306-023-02073-z

Received: 22 June 2023
Accepted: 28 November 2023
Published: 23 December 2023
DOI: https://doi.org/10.1007/s11306-023-02073-z
