On feature extraction for noninvasive kernel estimation of left ventricular chamber function indices from echocardiographic images

doi:10.1016/j.dsp.2014.12.012

Digital Signal Processing

Volume 39, April 2015, Pages 63-79

https://doi.org/10.1016/j.dsp.2014.12.012 Get rights and content

Highlights

•
Color-Doppler M-mode images are used to characterize left ventricular function.
•
Estimation of end-systolic peak elastance and time-constant of relaxation rate.
•
Comparison and interpretation of different linear estimators.
•
Conditions where non-linear estimators outperform linear ones.
•
High-fidelity study on mini-pigs with echocardiographic images and invasive measures.

Abstract

Two reference indices used to characterize left ventricular (LV) global chamber function are end-systolic peak elastance ( $E_{\max}$ ) and the time-constant of relaxation rate (τ). However, these two indices are very difficult to obtain in the clinical setting as they require invasive high-fidelity catheterization procedures. We have previously demonstrated that it is possible to approximate these indices noninvasively by digital processing color-Doppler M-mode (CDMM) images. The aim of the present study was twofold: (1) to study which feature extraction from linearly reduced input spaces yields the most useful information for the prediction of the haemodynamic variables from CDMM images; (2) to verify whether the use of nonlinear versions of those linear methods actually improves the estimation. We studied the performance and interpretation of different linearly transformed input spaces (raw image, discrete cosine transform (DCT) coefficients, partial least squares, and principal components regression), and we compared whether nonlinear versions of the above methods provided significant improvement in the estimation quality. Our results showed that very few input features suffice for providing a good (medium) quality estimator for $E_{\max}$ (for τ), which can be readily interpreted in terms of the measured flows. Additional covariates should be included to improve the prediction accuracy of both reference indices, but especially in τ models. The use of efficient nonlinear kernel algorithms does improve the estimation quality of LV indices from CDMM images when using DCT input spaces that capture almost all energy.

Introduction

Characterization of left ventricular (LV) systolic and diastolic chamber function is still a pending issue in the clinical setting. In experimental physiology, peak end-systolic elastance ( $E_{\max}$ ) is well established as the best available index to measure systolic performance of the LV chamber. In turn, the time-constant of LV relaxation (τ) is accepted as the gold standard method accounting for the rate of relaxation of the chamber, one of the main diastolic properties of LV function. Measuring $E_{\max}$ requires complex measurements of instantaneous pressure and volume inside the LV chamber as well as preload intervention maneuvers. Measuring τ requires invasive catheterization of the LV using high-fidelity micromanometers. For these reasons, neither $E_{\max}$ nor τ are only measured in patients for research purposes.

A number of noninvasive methods have been developed to obtain surrogate indices that correlate with $E_{\max}$ and τ. Among them, most research has focused on Doppler-echocardiography, because it is a fully noninvasive, non ionizing, cheap and readily available at the patient's bedside. In a previous work we have shown that τ and $E_{\max}$ can be reasonably approximated from CDMM images. Using a fluid-dynamic approach we have shown that $E_{\max}$ correlates closely to the peak-ejection pressure difference developed inside the ventricle, which can be computed by solving Euler's equation from the CDMM velocity data [27]. Similarly, τ can be approximated by the peak reverse end-ejection pressure difference with reasonable accuracy [26]. Importantly, using a learning from samples approach we have obtained similar approximations without the need of complex fluid-dynamic modeling [13]. Hence, an experimental (animal) setup was used to simultaneously measure the catheter-based curves (pressure and flow) and acquire the CDMM images; and a machine-learning model was designed for straightforward estimation of τ and $E_{\max}$ parameters from an input space given by the diastolic period of the digitized CDMM image.

In that precedent work, a linear estimator was used for the image input space, which raises several questions. On the one hand, nonlinear relations between CDMM images and indices can be expected, as the haemodynamic variables in the cardiac circulatory system are known to be mostly interrelated by nonlinear fluid dynamic equations. On the other hand, linear kernel estimators are often suggested in the machine learning literature as the most appropriate choice for high-dimensional input spaces, and they also provide with easier to interpret, black-box models than their nonlinear counterparts. Therefore, our aim was to test whether alternative algorithms on the machine learning specifications could improve the prediction of invasive indices $E_{\max}$ and τ from CDMM images. First, we wanted to study which feature extraction from linearly reduced input spaces yields the most useful information for the prediction of the haemodynamic variables from CDMM images. Second, we wanted to verify whether the use of nonlinear algorithmic versions of those linear methods actually improves the estimation. Accordingly, we benchmarked the performance of several linear kernel estimators, in terms of linear feature extraction transformations, and in addition we analyzed the physical and clinical meaning of the relevant features in these transformed spaces, when possible. We also benchmarked the nonlinear kernel versions of the above analyzed estimators, hence determining the actual improvement obtained by the consideration of nonlinearity in the estimation kernel machine. For this purpose, we chose several kernel methods, namely, Support Vector Regression (SVR), Principal Component Regression (PCR), and Partial Least Squares (PLS), according to different levels of algorithm complexity in terms of the multidimensional output estimation from the multidimensional input. SVR performs one dimensional output robust estimation, PCR performs dimensionality reduction and multidimensional output estimation, and PLS performs a dimensionality reduction according input–output covariance.

The rationale for the chosen input features was as follows. First, the RAW input space conveys all the image information, hence it represents a necessary benchmarking. Also, a linear machine working on the input space will be easy to interpret, in terms of the relative temporal and spatial position of the linear weights. Second, DCT input space is a widespread used frequency transform in image problems, and given the smoothness and low-pass frequency content of CDMM images, it can be expected to work well from an image information compression point of view. Third, PCR provides us with features from an intrinsic image decomposition (different from frequency decompositions), with a decoupled regression stage. And finally, PLS provides us with features from an intrinsic image decomposition in which the regression output quality is an embedded optimization criterion.

The scheme of the paper is as follows. In the next section, the fundamental theory of the multidimensional kernel machines is summarized for the SVR, PCR, and PLS algorithms. Then, a detailed set of experiments is presented for benchmarking and interpretation of linear vs nonlinear kernel versions of the estimators. Finally, conclusions are drawn.

Section snippets

Multidimensional kernel machines

This section first describes the basic equations of SVR, PCR and PLS. These methods allow both linear and nonlinear estimation without explicitly extracting features from the images. Both PCR and PLS implicitly extract features (components or latent vectors) previous to the estimation problem. PCR performs a feature extraction such every new feature captures as much as possible of the remaining variance of the input data, where PLS extracts features that maximize the covariance of the input

Experiments

The experimental design for data processing is as follows. In Experiment 1, linear SVR was built from raw input space (CDMM images without transformations), both for estimating $E_{\max}$ and τ. Attention was paid to the free parameters search. Error analysis and model diagnostic is given. Weights are plotted and analyzed from a physiological perspective. In Experiment 2, we used DCT coefficients of raw images as input space. Linear SVR was used to analyze the resulting images and the dependence on

High-dimensional input spaces and overfitting

Among the kernel methods community, it is generally accepted that linear Support Vector Machine (SVM) is better than nonlinear SVM for high dimensional input spaces like text categorization [25] or fMRI data [4], [9], [12]. This is supported by two theoretical facts: Cover's Theorem [3] and Vapnik–Chervonenkis VC-dimension [22]. Cover's Theorem discusses the probability of separating a set of n samples into two sets in a general setting. He showed that the natural capacity of a transformation

Acknowledgements

This study was partially supported by research projects TEC2010-19263 (EXCALIVUR) from Ministerio de Ciencia e Innovación, TEC2013-48439-C4-1-R (PRINCIPIAS) from Ministerio de Economía y Competitividad, PRIN13_IYA12 from Universidad Rey Juan Carlos and grants PIS09/02603, PS09/02602, and RD12/0042 (Red de Investigación Cardiovascular), from the Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (I+D+I), Instituto de Salud Carlos III–Ministerio de Economía y

References (27)

J. Bermejo et al.
Spatio-temporal mapping of intracardiac pressure gradients. A solution to Euler's equation from digital postprocessing of color Doppler M-mode echocardiograms
Ultrasound Med. Biol.
(2001)
D.D. Cox et al.
Functional magnetic resonance imaging (fMRI) “brain reading”: detecting and classifying distributed patterns of fMRI activity in human visual cortex
NeuroImage
(2003)
S. de Jong
SIMPLS: An alternative approach to partial least squares regression
Chemom. Intell. Lab. Syst.
(1993)
S. LaConte et al.
Support vector machines for temporal classification of block design fMRI data
NeuroImage
(2005)
J. Mourão Miranda et al.
Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data
NeuroImage
(2005)
M. Misaki et al.
Comparison of multivariate classifiers and response normalizations for pattern-information fMRI
NeuroImage
(2010)
S. Wold et al.
PLS-regression: a basic tool of chemometrics
Chemom. Intell. Lab. Syst.
(2001)
V. Cherkassky et al.
Model complexity control for regression using VC generalization bounds
IEEE Trans. Neural Netw.
(1999)
T.M. Cover
Geometrical and statistical properties of systems of linear inequalities with applications in pattern recognition
IEEE Trans. Electron. Comput.
(1965)
A.S. Hadi et al.
Some cautionary notes on the use of principal components regression
Am. Stat.
(1998)

A. Höskuldsson

PLS regression methods

J. Chemom.

(1988)

I.T. Jolliffe

A note on the use of principal components in regression

J. R. Stat. Soc., Ser. C, Appl. Stat.

(1982)

W.F. Massy

Principal components regression in exploratory statistical research

J. Am. Stat. Assoc.

(1965)

Cited by (0)

Ricardo Santiago-Mozos received the Ph.D. degree in signal processing and communications from the Carlos III University of Madrid, Madrid, Spain, in 2009. He has just finished a Postdoctoral Fellowship of the Spanish Program for Recruitment and Incorporation of Human Resources at the University Rey Juan Carlos of Madrid, Madrid, Spain. He has been a Spanish Foundation for Science and Technology Postdoctoral Fellow at the Machine Learning and Data Mining Group in National University of Ireland, Galway, Ireland, where he had also completed a Marie Curie Fellowship. His research interests include machine learning and medical applications and he has coauthored more than 20 papers appearing in refereed journals and conference proceedings.

José Luis Rojo-Álvarez (born in Bilbao, Spain, 1972) received the Telecommunication Engineering Degree in 1996 from University of Vigo, Spain, and the Ph.D. in Telecommunication in 2000 from the Polytechnical University of Madrid, Spain. Since 2006, he has been an Associate Professor in the Department of Signal Theory and Communications, University Rey Juan Carlos, Madrid, Spain. He has published more than 80 papers in JCR journals and more than 140 (inter)national conference communications. He has participated in more than 50 projects (with public and private fundings), and directed more than 10 of them, including several actions in the National Plan for Research and Fundamental Science. He is a Senior Researcher at the Prometeo program in Ecuador (Army University). His main research interests include statistical learning theory, digital signal processing, and complex system modelling, with applications to digital communications and to cardiac signal and image processing.

J. Carlos Antoranz graduated in Physics in 1977 at Universidad Autonoma Madrid and obtained his Ph.D. in 1982 at Spanish Open University. He works as Physics Full Professor since 1996 at UNED. He has been visiting Professor at University of Oslo, Norway (Mathematics), Kysuhu University, Japan (Physics), IBM Research Center in Kingston, USA, Non-linear Center at Los Alamos National Laboratory, USA, Institute de Mechanique des Fluides, Marseille, France. He has published more than 200 papers.

Mar Desco received her Ph.D. in Medicine and Surgery from Universidad Complutense de Madrid. Her interests include anatomical pathology and circulatory mechanic assistance. She has published more than 10 journal articles.

Daniel Rodríguez-Perez received his M.Sc. (1998) and Ph.D. (2005) in Physics from UNED. He is a Senior Lecturer at UNED Medical Physics Master. He works, among other areas, in mathematical and physical models of biomedical signals and processes. He has coauthored more than 20 journal papers.

Raquel Yotti (Madrid 1973) is a clinical cardiologist with expertise in cardiac imaging that coordinates the Complex Structural Heart Disease Program at the Hospital General Universitario Gregorio Marañón. She obtained the Ph.D. degree in Medicine in 2006 from the Universidad Complutense Madrid. Her research has dedicated to the development and clinical application of new diagnostic tools based on cardiac imaging, to improve the characterization of physiology and hemodynamics in patients with different cardiovascular diseases. She is author of more than 30 articles in this field.

Javier Bermejo (London, 1966) is the Director of Noninvasive Cardiology and Cardiovascular Imaging of the Hospital General Universitario Gregorio Marañón, Madrid, Spain, and Associate Professor of Medicine in the Universidad Complutense de Madrid in 2013. His research has focused on the invasive and noninvasive assessment of cardiac hemodynamics. He leads a multidisciplinary research team integrated by clinical cardiologists, surgeons, engineers and physicists. He has received specific training in a number of fields of biomedical engineering, including advanced biostatistics, computer algorithm programming, image processing, and basic biomechanics. Dr. Bermejo is author of more than 80 articles in the field and external reviewer for major scientific societies and more than 20 scientific journals. Current areas of research are focused on imaging-derived biomechanics in the field of heart failure and valvular heart disease.

View full text

On feature extraction for noninvasive kernel estimation of left ventricular chamber function indices from echocardiographic images

Highlights

Abstract

Introduction

Section snippets

Multidimensional kernel machines

Experiments

High-dimensional input spaces and overfitting

Acknowledgements

Ultrasound Med. Biol.

NeuroImage

Chemom. Intell. Lab. Syst.

NeuroImage

NeuroImage

NeuroImage

Chemom. Intell. Lab. Syst.

Model complexity control for regression using VC generalization bounds

IEEE Trans. Neural Netw.

Geometrical and statistical properties of systems of linear inequalities with applications in pattern recognition

IEEE Trans. Electron. Comput.

Some cautionary notes on the use of principal components regression

Am. Stat.

PLS regression methods

J. Chemom.

A note on the use of principal components in regression

J. R. Stat. Soc., Ser. C, Appl. Stat.

Principal components regression in exploratory statistical research

J. Am. Stat. Assoc.