Classification of electrocardiogram and auscultatory blood pressure signals using machine learning models

https://doi.org/10.1016/j.eswa.2014.12.023Get rights and content

Highlights

Abstract

In this paper, two real-world medical classification problems using electrocardiogram (ECG) and auscultatory blood pressure (Korotkoff) signals are examined. A total of nine machine learning models are applied to perform classification of the medical data sets. A number of useful performance metrics which include accuracy, sensitivity, specificity, as well as the area under the receiver operating characteristic curve are computed. In addition to the original data sets, noisy data sets are generated to evaluate the robustness of the classifiers against noise. The 10-fold cross validation method is used to compute the performance statistics, in order to ensure statistically reliable results pertaining to classification of the ECG and Korotkoff signals are produced. The outcomes indicate that while logistic regression models perform the best with the original data set, ensemble machine learning models achieve good accuracy rates with noisy data sets.

Introduction

Data classification constitutes one of the fundamental requirements in undertaking many decision-making tasks (Örkcü & Bal, 2011). A classification task involves building a model that depicts a mapping from the input feature space to the target output space (Oza & Tumer, 2008). In general, there are a number of classification methods, which include statistical methods, mathematical programming methods, and a variety of machine learning methods (Örkcü & Bal, 2011). Researchers in the medical domain have used many methods to perform data classification. Methods with higher classification accuracy are desirable to correctly identify potential diseases; therefore improving diagnosis accuracy (Fan, Chang, Lin, & Hsieh, 2011).

The main contribution of this study is a comprehensive performance evaluation and analysis pertaining to a number of machine learning models for undertaking real medical data classification problems. Specifically, we use two sets of real data collected from patients, i.e., the electrocardiogram (ECG) and auscultatory blood pressure (Korotkoff) signals. ECGs are signals related to electrical activity of the heart, which can be recorded by placing surface electrodes on a patient’s body (Mitra, Mitra, & Chaudhuri, 2006). It is an effective non-invasive clinical tool for the diagnosis of certain cardiovascular diseases, and it provides useful information pertaining to pathological physiology of heart activity (Chen & Yu, 2012). ECG signals carry valuable information about the heart function, and provide a cardiologist with useful insight about the rhythm and functioning of the heart (Chen & Yu, 2012).

As stated in Mele (2008), an estimated 300 million ECGs are performed each year. As such, there is a clear need for reliable and accurate interpretation tools of ECG readings. Although trained cardiologists can discover different cardiac abnormalities in ECG recordings, it is time-consuming and laborious for them to examine a large number of ECG recordings (Kiranyaz, Ince, Pulkkinen, & Gabbouj, 2011). Moreover, visual inspection can take considerable time, and some vital information can be neglected due to fatigue in carrying out the tedious manual procedure (Sun, Lu, Yang, & Li, 2012). As such, automated tools to help accurately analyze a large number of ECG data samples are required. Similarly, blood pressure (BP) is an established link in determining coronary heart disease and cardiovascular incidents (Mendiola, Luna, Guerra, & Ramírez, 2013). The most commonly used methods for measuring BP in clinical activities is the use of a manual sphygmomanometer and a stethoscope to detect the Korotkoff sounds (Kurl et al., 2001). Korotkoff waveforms are one of the most reliable means for monitoring blood pressure (Mendiola et al., 2013). To exploit the advantages of using computerized tools to help medical prognosis and diagnosis tasks, an empirical study to evaluate a variety of machine learning models for classification of both ECG and Korotkoff signals is undertaken in this study.

In respect to machine learning models, the artificial neural networks (ANNs) are popular methods for tackling medical diagnosis problems (Al-Shayea, 2011). There are a number of advantages of using ANNs for medical data classification, e.g. a detailed mathematical model that relates the input features and the target outputs is not necessary. In addition, ANNs have the ability to learn complex relationship in data samples. Recent advancements in ANNs have shown their usefulness in analyzing signals, which has opened up the possibility of solving problems typically not possible with some existing signal processing techniques (Übeyli & Güler, 2005). On the other hand, model-based signal processing is a new method to describe physiological systems (Porta, Baselli, & Cerutti, 2006). The method is useful for short-term cardiovascular control and analysis of cardiovascular regulation mechanisms (Porta et al., 2006). Besides that, other medical signal processing methods have been used in many different application, e.g. in synthesized and real biomedical signals using frequency-domain methods (Mitov, 1998), medical ultrasound signals using a fast wavelet-based edge method (Nes, 2012), and a multi-sensor fetal movement detection system based on a time–frequency signal processing method (Boashash, Khlif, Ben-Jabeur, East, & Colditz, 2014). A review of different machine learning and related methods for medical applications is presented in Section 2.

The organization of this paper is as follows. After a literature review in Section 2, a description on signal pre-processing, experimental setup, and the background of different classifiers used for experimentation is presented in Section 3. In Section 4, two real medical case studies consisting of ECG and Korotkoff signals are detailed. Finally, conclusions and suggestions for further work are presented in Section 5.

Section snippets

Literature review

Biomedical signal processing is becoming an essential feature in many advanced medical equipment, and is widely used in clinical and biomedical research (Simpson, De Stefano, Allen, & Lutman, 2005). In this review, a number of statistical methods, machine learning models, and other related techniques for medical signal processing are reviewed. The details are as follows.

Data pre-processing and classification

In this section, the acquisition and pre-processing steps of the ECG and Korotkoff signals are detailed. This is then followed by an explanation on the types of classifiers used in the study.

Results and discussion

The experimental study was conducted using Weka (Waikato Environment for Knowledge Analysis) (Hall et al., 2009), a popular suite of machine learning and other models. The k-fold cross validation technique, where k = 10, was employed. As such, the data samples were split into ten equal subsets. Each subset contained approximately the same proportion of data samples from each target class. Nine subsets were used for training, while the remaining was used for performance evaluation. This procedure

Conclusions

Machine learning models are useful for biomedical signal processing as presented in this study whereby abnormal patterns suggesting risks of cardiovascular diseases can be recognized. The main contributions of this study are twofold. The first contribution is a comprehensive review focusing on recent advances pertaining to biomedical signal processing, as presented in Section 2 and summarized in Table 1. The second contribution is an empirical extended evaluation and analysis pertaining to a

Acknowledgment

This research is supported partially by University of Malaya Grant UM.C/HIR/MOHE/ENG/50.

References (67)

  • L. Kang et al.

    Estimation of the volume under the ROC surface with three ordinal diagnostic categories

    Computational Statistics & Data Analysis

    (2013)
  • S. Kiranyaz et al.

    Personalized long-term ECG classification: A systematic approach

    Expert Systems with Applications

    (2011)
  • M. Korürek et al.

    ECG beat classification using particle swarm optimization and radial basis function neural network

    Expert Systems with Applications

    (2010)
  • I. Kurt et al.

    Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease

    Expert Systems with Applications

    (2008)
  • J. Lee et al.

    Atrial flutter and atrial tachycardia detection using Bayesian approach with high resolution time–frequency spectrum from ECG recordings

    Biomedical Signal Processing and Control

    (2013)
  • Q. Li et al.

    A machine learning approach to multi-level ECG signal quality classification

    Computer Methods and Programs in Biomedicine

    (2014)
  • E.J.D.S. Luz et al.

    ECG arrhythmia classification based on optimum-path forest

    Expert Systems with Applications

    (2013)
  • D. Mantzaris et al.

    Genetic algorithm pruning of probabilistic neural networks in medical disease estimation

    Neural Networks

    (2011)
  • P. Mele

    Improving electrocardiogram interpretation in the clinical setting

    Journal of Electrocardiology

    (2008)
  • I.P. Mitov

    A method for assessment and processing of biomedical signals containing trend and periodic components

    Medical Engineering & Physics

    (1998)
  • P.G. Nes

    Fast multi-scale edge-detection in medical ultrasound signals

    Signal Processing

    (2012)
  • H.H. Örkcü et al.

    Comparing performances of backpropagation and genetic algorithms in the data classification

    Expert Systems with Applications

    (2011)
  • N.C. Oza et al.

    Classifier ensembles: Select real-world applications

    Information Fusion

    (2008)
  • Y. Özbay et al.

    A new method for classification of ECG arrhythmias using neural network with adaptive activation function

    Digital Signal Processing

    (2010)
  • Y. Özbay et al.

    Integration of type-2 fuzzy clustering and wavelet transform in a neural network based ECG classifier

    Expert Systems with Applications

    (2011)
  • S. Poungponsri et al.

    An adaptive filtering approach for electrocardiogram (ECG) signal noise reduction using neural networks

    Neurocomputing

    (2013)
  • S. Sargolzaei et al.

    Scalp EEG brain functional connectivity networks in pediatric epilepsy

    Computers in Biology and Medicine

    (2015)
  • F. Scalzo et al.

    Intracranial hypertension prediction using extremely randomized decision trees

    Medical Engineering & Physics

    (2012)
  • R. Sharma et al.

    Classification of epileptic seizures in EEG signals based on phase space representation of intrinsic mode functions

    Expert Systems with Applications

    (2015)
  • D.M. Simpson et al.

    Demystifying biomedical signals: A student centred approach to learning signal processing

    Medical Engineering & Physics

    (2005)
  • N. Torbati et al.

    An efficient neural network based method for medical image segmentation

    Computers in Biology and Medicine

    (2014)
  • E.D. Übeyli et al.

    Improving medical diagnostic accuracy of ultrasound Doppler signals by combining neural network models

    Computers in Biology and Medicine

    (2005)
  • M.H. Vafaie et al.

    Heart diseases prediction based on ECG signals’ classification using a genetic-fuzzy system and dynamical model of ECG signals

    Biomedical Signal Processing and Control

    (2014)
  • Cited by (0)

    View full text