Phonocardiographic Signal and Electrocardiographic Signal Analysis for the Detection of Cardiovascular Diseases

Published by Oriental Scientific Publishing Company © 2018 This is an Open Access article licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-nc-sa/4.0/ ), which permits unrestricted Non Commercial use, distribution and reproduction in any medium, provided the original work is properly cited. Phonocardiographic Signal and Electrocardiographic Signal Analysis for the Detection of Cardiovascular Diseases

Heart disease is a major health problem and it is the leading cause of death throughout the world.Early detection of cardiovascular diseases has become an important issue in the medical research field.Most of the heart valve disorders are reflected to heart sounds and can be detected through the phonocardiography (PCG) signal analysis.PCG signal is the recording of sound variations of heart and blood flow that provides valuable information for detecting pathological heart sounds.The Figure 1 shows the phonocardiographic signal of sample heart sounds.
Heart sounds are generated by the heart beating and the resultant flow of blood through it.The two normal heart sounds are S1 and S2, and are low frequency signals.The heart sounds with higher frequency are called murmurs.Murmurs are usually generated by the turbulent blood flow through the heart.The frequency ranges of the normal heart sounds and the murmurs are shown in the Table 1.
Similarly, the electrocardiographic signal which records the electrical activity of the human heart helps in the diagnosis of heart disorders.The electrocardiographic signal consists of P-wave, QRS-complex and T-wave.The P-wave is due to the depolarization of atria in response to the sinoatrial node triggering.The QRS-complex is due to the depolarization of ventricles and T-wave is due to ventricular repolarization.
Figure 2 shows the comparison between ECG and ECG.The interval between two consecutive R-peaks is the RR interval which is same as that of the S1-S1 interval.Any deviation from the normal range of these intervals results in the abnormality of the heart.
The detection and classification of heart diseases by analysing the phonocardiographic signal manually is complicated.So, automatic heart sound analysis in computer systems would be very helpful for accurate detection of heart diseases.The proposed heart sound analysis system compares the RR interval of the electrocardiographic signal with the S1-S1 interval of the phonocardiographic signal.

Related work
An extensive survey has been taken, focussing on the detailed description about the pre-processing of the heart sound signal, its segmentation, feature extraction, attribute selection and the classification methods.The separation of murmur from the recorded heart sounds has been performed using the method singular spectrum analysis [9].The segmentation of the heart sound signal into S1, systolic interval, S2 and diastolic interval has been done using tunable-Q wavelet transform [12],boundary detection method [2] and moment based method [18].The features were extracted by discrete wavelet transform.The needed features were selected using multi-level basis selection [8] and genetic algorithm [14].The support vector machine has been used for the classification of heart diseases [15].From the literature survey analysis, it is concluded that in the existing system, heart sound signal alone has been analysed for the detection of heart diseases.Also, ECG signal alone has been used for the detection of cardiac diseases.But in the proposed system, both PCG and ECG signals are analyzed and compared, both features are correlated for the detection of cardiac arrhythmia diseases.

Heart sound analysis system
The ultimate goal of this proposed system is to develop an automated real time heart sound analysis system [1] for the detection of cardiovascular diseases using the PCG and ECG.The system involves the process of de-noising, segmentation, feature extraction and classification of both phonocardiographic and electrocardiographic signals.The development of the system involves Pre-processing, Segmentation [3] Feature extraction and Feature selection and Classification.The initial process deals with the de-noising of the heart sound signal and the signal is decomposed into several sub-bands.Also, the ECG signal is de-noised.The de-noised heart sound signal is segmented into the basic heart sounds S1 and S2, along with the systolic and diastolic interval.Meanwhile, ECG signal also is preprocessed and time domain and frequency domain features are extracted.The R-peaks are identified from the ECG signal and RR interval is obtained.The attribute selection is to find the best attribute values, unwanted attributes are removed to reduce the processing time.It is finally used for the classification process [4].The Figure 3 shows the proposed system.This work is implemented by using MATLAB software.

Database
The heart sound signals are collected from the SoundJay database (http://www.soundjay.com/heartbeat-sound-effect.html),Michigan Heart Sound University (http://www.med.umich.edu/lrc/psb/heartsounds/) and Frontiers in Bioscience (http://int-prop.lf2.cuni.cz/heart_sounds/h14/sound.htm).This database is the record of the sound of heart beating, the sound of heartbeat speeding up, the sound of heart starts beating and speeds up, and various murmur heart sounds.The ECG signals are collected from the Physionet bank (http://physionet.org/cgi-bin/atm/ATM).

Pre-processing
The raw heart sound signals may contain some low and high frequency noises.These noises may be due to the operation of lungs, the muscle contraction and the movement of other parts of the body [21].The low frequency noises affect the edge function of the signal and the high frequency noises affect the structure of the wave form.The presence of these noises makes a difference to the feature extraction process which in turn has an impact on the classification process.These noises are removed by using the Butterworth low pass filter.The Butterworth filter is recursive, that is, past values of the output are used as input, so a phase shift is introduced unless the data are filtered forward and backward through the same filter.When this is done, the filtered signal differs only slightly from that of other low-pass filters.The Butterworth filter is used for the reduction in computation time over other low-pass filters.The four levels of the low pass filters have been used.The amplitude above a particular range is eliminated without changing or disturbing the characteristics of the heart sound signal.All the four levels have been applied.At each level the response rolls off to a certain limit.The first order response rolls off at 6dB per octave.The second order response decreases at 12dB per octave whereas the third and fourth order response decreases at 18dB and 24dB per octave respectively.Meanwhile, high pass and low pass filters are applied to the ECG signals and noises are removed.The low pass filter passes low frequency signals and attenuates signals with frequencies higher than the cut-off frequency.The high pass filter is an electronic filter that passes high frequency signals but attenuates signals with frequencies lower than the cut-off frequency.The low pass filter reduces the high frequency components whereas the high pass filter reduces the low frequency components.

Wavelet Decomposition
The de-noised signal is applied with the Discrete Wavelet Transform and decomposed into several sub-bands.The Discrete Wavelet Transform (DWT) is used to decompose the heart sound signal into various sub-bands [18].The basic principle of DWT is to decompose the signal into finer details using high-pass and low-pass filters.At the first level of decomposition, two signals representing the detail (high frequency) and the approximate (low frequency) will be obtained.The approximate signals are decomposed into the detail and approximate at the further levels and the process continues.Among the various wavelet bases, the daubechies family of wavelet is very efficient.In this paper, the daubechies family of wavelet db4 is used for decomposition.The various detail level coefficients have been obtained.The Figure 4 shows a three level wavelet decomposition.

Segmentation
At the next stage, the basic heart sounds of the heart sound signal are segmented, that is, the heart sound signal is segmented into S1, systolic interval, S2 and diastolic interval.The interval between S1 and S2 is called the systolic interval whereas the interval between S2 and the next S1 is called the diastolic interval.The diastolic interval is usually larger than the systolic interval.At first, the envelope of the heart sound signal is extracted and the maximum peak values are identified [3].Then, based on the systolic and diastolic interval, the maximum peaks are determined to be the basic heart sounds S1 and S2.The normal range of systolic interval is between 100ms and 300ms and the diastolic interval ranges from 300ms to 600ms.
From the de-noised ECG signal, the maximum peaks above a certain threshold are detected as R-peaks.

Feature Extraction and Feature Selection
The third module deals with the extraction of features from the heart sound signal.The various time domain features and the frequency domain features are extracted.The time domain features calculated are mean of the peak S1 (AVGS1), mean of the peak S2 (AVGS2), mean of the systolic interval (AVGSYS), mean of the diastolic interval (AVGDIAS), total average of systolic and diastolic interval (AVG), standard deviation of peaks S1 and S2 (SDS1 and SDS2), standard deviation of NN intervals (SDSYS and SDDIAS), the Root of mean of sum of Squares of difference of adjacent NN intervals (RMSSDSYS and RMSSDDIAS), mean of R-peak (AVGR), mean of RR-interval (AVGRR), standard deviation of RR intervals (SDRR) and the Root of mean of sum of Squares of difference of adjacent RR intervals(RMSSDRR).Frequency domain features are calculated using Power Spectral Density.The features are Very Low Frequency (VLF) which lies between 0 and 0.04, Low Frequency (LF) between 0.04 and 0.15, High Frequency (HF) between 0.15 and 0.4 and the total power between 0 and 0.4.Also the ratio of Low frequency and high frequency (LF/HF) is calculated.The disease identification mainly depends on these attribute values.Giving in all the extracted features to a classifier makes the classification process a complicated one.To avoid this complication, the feature selection process forms a reduced feature set.The most informative features are selected and the redundant features are eliminated by the feature selection process [8].

Classification
Based on the reduced feature set, the classifier classifies the captured heart signal into normal and abnormal.The features are applied to three different classifiers namely K-Nearest Neighbour Classifier, K-Means Clustering and the Classification and Regression Tree (CART) Classifier.KNN is a method for classifying objects based on closest training examples in the feature space.An object is classified by a majority vote of its Neighbours.K is always a positive integer.The Neighbors from a set of objects are considered in which the correct classification is done.The K-Nearest Neighbor classifier is commonly based on the Euclidean distance between a test sample and the specified training samples.
K-means clustering is a method of classifying items into k groups where k is the number of pre-chosen groups.The grouping is done by minimizing the sum of squared distances between items and the corresponding centroid.A centroid is the centre of mass of a geometric object of uniform density.
CART is a recursive partitioning method that builds classification and regression trees for predicting continuous dependent variables (regression) and categorical predictor variables (classification).In the classification trees, the target is categorical and it is used to identify the class.An internal node is a decision-making unit that evaluates a decision function to determine which child node to visit next.A terminal node has no child nodes and is associated with one of the partitions of the input space.Each terminal node has a unique path that leads from the root to itself.In classification trees, each terminal node has a label that indicates the class for the associated input.

RESULTS
The proposed Heart Sound Signal Analysis System de-noises the input PCG and ECG signals, detect the peaks, extract the time domain and frequency domain features, form a reduced feature set and then classify using three different classifiers.
Table 1 shows the extracted time domain features such as mean of peaks and the intervals of heart sound signal The Table 2 shows the extracted time domain features such as standard deviation and root mean square of peaks and the intervals of heart sound signal.
The Table 3 shows the extracted frequency domain features of heart sound signal.
The Table 4 shows the features that are extracted from the ECG signals.
From the above tables, reduced feature set is formed and given to three different classifiers.The performance of three classifiers are determined.
The Table 5 shows the sensitivity, specificity and accuracy of the classifiers used for analyzing the heart sound signals.
The Figure 5 shows the comparison result of the accuracy of the three different classifiers used.
The Table 6 shows the sensitivity, specificity and accuracy of the classifiers used for analyzing the ECG signals.
The Figure 6 shows the comparison result of the accuracy of the three different classifiers used.

CONCLUSION
The detection of cardiovascular diseases from the heart sounds is difficult because the doctors found it difficult to listen to the heart sounds accurately during auscultation.To make it simple and accurate, the phonocardiographic signal features are used.The phonocardiographic features and the electrocardiographic features are compared which helps the physicians to detect the presence of heart diseases.The heart sound signal is first pre-processed to remove all the unwanted sounds.The Butterworth low pass filter is used for removing the noise.The heart sound signal is decomposed using Discrete Wavelet Transform and then segmented into the S1, the systolic interval, the S2, and the diastolic interval.Also, the electrocardiographic signals are filtered using high pass and low pass filters, and the R-peaks are detected.Then, various features of the heart sound signal and the electrocardiographic signal are extracted, and some among them are chosen for the classification process.The classifiers classify the given input signals into normal and abnormal.This automated system for the detection of cardiovascular disease helps the doctors to detect the presence of heart diseases accurately.In future, the system can be extended to identify the type of abnormalities in the arrhythmia diseases

Fig. 3 .
Fig. 3.The Framework of Heart Sound Analysis System

Fig. 5 .Fig. 6 .
Fig. 5. Comparison of Classification Techniques with Accuracy for PCG Fig. 6.Comparison of Classification Techniques with Accuracy for ECG

Table 1 .
Heart Sounds Frequency Range

Table 1 .
Time Domain Features of PCG-Average

Table 2 .
Time Domain Features of PCG signal -SD and RMSSD

Table 3 .
Frequency Domain Features of PCG signal

Table 4 .
Feature Extraction of ECG Signals

Table 5 .
Performance Measures of PCG Signals

Table 6 .
Performance Measures of ECG Signals