Skip to main content

ORIGINAL RESEARCH article

Front. Physiol., 03 March 2023
Sec. Computational Physiology and Medicine
This article is part of the Research Topic Artificial Intelligence in Bioimaging and Signal Processing View all 8 articles

A learning-based image processing approach for pulse wave velocity estimation using spectrogram from peripheral pulse wave signals: An in silico study

  • 1Computer, Electrical, and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Makkah, Saudi Arabia
  • 2Electrical Engineering Department, Alfaisal University, Riyadh, Saudi Arabia
  • 3National Institute for Research in Digital Science and Technology INRIA, Saclay, France

Carotid-to-femoral pulse wave velocity (cf-PWV) is considered a critical index to evaluate arterial stiffness. For this reason, estimating Carotid-to-femoral pulse wave velocity (cf-PWV) is essential for diagnosing and analyzing different cardiovascular diseases. Despite its broader adoption in the clinical routine, the measurement process of carotid-to-femoral pulse wave velocity is considered a demanding task for clinicians and patients making it prone to inaccuracies and errors in the estimation. A smart non-invasive, and peripheral measurement of carotid-to-femoral pulse wave velocity could overcome the challenges of the classical assessment process and improve the quality of patient care. This paper proposes a novel methodology for the carotid-to-femoral pulse wave velocity estimation based on the use of the spectrogram representation from single non-invasive peripheral pulse wave signals [photoplethysmography (PPG) or blood pressure (BP)]. This methodology was tested using three feature extraction methods based on the semi-classical signal analysis (SCSA) method, the Law’s mask for texture energy extraction, and the central statistical moments. Finally, each feature method was fed into different machine learning models for the carotid-to-femoral pulse wave velocity estimation. The proposed methodology obtained an $R2\geq0.90$ for all the peripheral signals for the noise-free case using the MLP model, and for the different noise levels added to the original signal, the SCSA-based features with the MLP model presented an $R2\geq0.91$ for all the peripheral signals at the level of noise. These results provide evidence of the capacity of spectrogram representation for efficiently assessing the carotid-to-femoral pulse wave velocity estimation using different feature methods. Future work will be done toward testing the proposed methodology for in-vivo signals.

1 Introduction

Carotid-to-femoral pulse wave velocity (cf-PWV) is considered a critical index to evaluate arterial stiffness. For this reason, estimating Carotid-to-femoral pulse wave velocity (cf-PWV) is essential for diagnosing and analyzing different cardiovascular diseases. Despite its broader adoption in the clinical routine, the measurement process of cf-PWV is considered a demanding task for clinicians and patients making it prone to inaccuracies and errors in the estimation. A smart non-invasive, and peripheral measurement of cf-PWV could overcome the challenges of the classical assessment process and improve the quality of patient care. This paper proposes a novel methodology for the cf-PWV estimation based on the use of the spectrogram representation from single non-invasive peripheral pulse wave signals [photoplethysmography (PPG) or blood pressure (BP)]. This methodology was tested using three feature extraction methods based on the semi-classical signal analysis (SCSA) method, the Law’s mask for texture energy extraction, and the central statistical moments. Finally, each feature method was fed into different machine learning models for the cf-PWV estimation. The results obtained for each feature method provide evidence of the capacity of spectrogram representation combined with machine learning models as an intelligent tool for efficiently assessing the cf-PWV estimation. Cardiovascular diseases (CVDs) are the leading cause of mortality worldwide, with 17.9 million deaths in 2019, representing 32% of all global deaths, (Mensah et al., 2019). Patients at risk of evolving CVDs are assessed by evaluating different bio-markers ranging from age and sex to arterial stiffness (AS), (Gade et al., 2021). Arterial stiffness is considered one of the highest risk markers and has attracted much attention in clinical and experimental studies, (van Sloten et al., 2014). Arterial stiffness depicts the rigidity of the arterial vessels, positively associated with arterial pulse pressure, which can significantly affect the heart and vascular physiology.

Over the last decades, a myriad of techniques for evaluating AS have been explored and validated, some of which are more widely functional nowadays in clinical practice than others, for example, the cardio-ankle vascular index that reflects the stiffness from the ascending aorta to the ankle arteries (Matsushita et al., 2019), the pulse pressure defined as the difference between the diastolic and systolic pressure (Mackenzie et al., 2002) assessing arterial stiffness. It is usually evaluated by dividing the distance traveled by the pulse wave between two arterial sites divided by the time taken to travel the distance (path length between the two sites). When the two arterial sites are the carotid and femoral sites, then we refer to the Carotid to femoral pulse wave velocity (cf-PWV) and usually provide information on the central arterial stiffness. The feasibility of cf-PWV in evaluating vascular stiffness has been validated through a strong correlation with major parameters and conditions such as hypertension severity levels, vascular aging, and atherosclerosis (Blacher et al., 1999; Shokawa et al., 2005; Mattace-Raso et al., 2006; Willum Hansen et al., 2006; Choi et al., 2007; Kim and Kim, 2019). Despite the crucial role of cf-PWV, there is no reliable method for estimating the cf-PWV. Most methods that exist in the literature have limitations. For instance, they rely on experienced personnel to realize the correct measurement, consisting in acquiring the carotid and femoral pressure waveform and measuring the traveling distance, as highlighted in (Matsushita et al., 2019). The measurement can therefore be subject to errors and inaccuracies, in particular, when evaluating the path length between the carotid and femoral sites (Tavallali et al., 2018). Further discussions on the measurement modalities and the main advantages and limitations of cf-PWV measurements can be read in (Rajzer et al., 2008).

Recent papers have investigated the use of Artificial Intelligence (AI) in estimating pulse wave velocity where non-invasive available measurements are used. AI-based approaches present the advantage of estimating the cf-PWV from non-invasive measurements which can be incorporated into the clinical routine without involving any complex protocol or experienced personnel. For instance, a non-calibrated carotid tonometry pressure waveform has been combined with a clinical routine variable to feed a machine learning model with Intrinsic Frequency features (Tavallali et al., 2015). Recently, a multi-layers perceptron-based cf-PWV estimation using fiducial points-based features extracted from the photoplethysmogram (PPG) signal and its first, second, and third derivatives, has been proposed. Another investigation by Weiwei et al., in (Jin et al., 2021), has proposed two machine learning pipelines, namely the Gaussian process regression and Recurrent Neural Network for the cf-PWV estimation from the radial blood pressure waveform. The two proposed machine learning pipelines used key features generated from the timing and magnitude of the fiducial points and the heart rate. More recently, in 2022, (Garcia et al., 2022) used a Multiple Linear Regression model to study the feasibility of the Semi-Classical Signal Analysis (SCSA)-based features extracted from Blood Pressure (BP) and PPG signals extracted from peripheral locations. In this study, feature extraction from a two-dimensional signal representation of the BP and PPG signals improved the estimation accuracy and robustness compared to the original one-dimensional signals’ results. Finally, (Li et al., 2022) proposed the cf-PWV prediction based on the XGBoost algorithm using wrist photoplethysmogram (wPPG) signals acquired from wearable devices. Despite the promising results, AI-based algorithms are not yet reliable and require improvements in terms of accuracy but also in terms of the universality of the algorithms, which refers to the fact that they can perform well for data that have not been considered in the training of the machine learning model.

In this paper, our objective is to contribute to improving AI-based algorithms for the estimation of the cf-PWV by proposing the use of spectrograms of pulse wave signals instead of one-dimensional signals. We believe that using the spectrogram, which provides both temporal and frequency dimensions of the signal, will help in improving the accuracy of measuring cf-PWV and therefore would help in including the cf-PWV measure in the clinical routine practice without the need for an expert. The use of spectrogram representation on PPG signals has been studied in the past proving good performance over different applications. In 2020, (Donida Labati et al., 2021) used a SVM model with features extracted from the PPG spectrogram for biometric recognition. Another use of PPG spectrogram representation is presented by (Siam et al., 2021) where they use the spectrogram as an input image for Blood Pressure estimation using Siamese networks and Convolutional neural networks (CNN).

This work used the spectrogram representation from peripheral signals for cf-PWV estimation using three feature methods. The first feature type was based on the Semi-Classical Signal Analysis (SCSA) method that relies on the Schrodinger operator’s spectral problem. The second type was based on the Law’s mask filters that compute the energy texture of an image, and the third was based on the central moments that give a statistical description of the image. Finally, these features were fed individually to different machine learning models to obtain the final estimation. Figure 1 illustrates the proposed estimation pipeline of the cf-PWV.

FIGURE 1
www.frontiersin.org

FIGURE 1. Schematic illustration of the estimation pipeline of the carotid-to-femoral pulse wave velocity based on pulse wave images. PPG indicates the photoplethysmography signals, BP the blood pressure signals and ML indicates machine learning models.

2 Materials and methods

2.1 Materials

2.1.1 Dataset

Due to the absence of real hemodynamic data to validate and test the proposed approach, in this paper, we used an in silico hemodynamic public database1. A pre-validated one-dimensional model has been used for generating the database of simulated pulse wave signals at different arterial locations. This model generates the signals based on different cardiovascular properties such as age, heart rate, blood density, and arterial diameter among other cardiovascular parameters (Charlton et al., 2019). These signals are often used to evaluate various pre-clinical assessment studies and hemodynamic analyzer algorithms, such as assessing pulse wave velocity. The database emulates pulse wave signals of one cardiac cycle of length from 4,374 virtual healthy adults with different ages between 25 and 75 in 10-year increments (six age groups) and heart rate between 66 bpm and 86 bpm, using a sampling frequency of 500 Hz for each signal. Each group has 729 virtual subjects’ pulse waves with distinct cardiac and arterial parameters like arterial stiffness and heart rate within normal ranges. In this study, PPG and BP waveforms at the level of the brachial, radial, and digital arteries were used to create the spectrograms and estimate the cf-PWV.

2.2 Method

The proposed methodology is summarized in Figure 1. A spectrogram is created using pulse wave signals from the in silico data. Then, features are extracted from three different methods and are fed individually into the learning approach stage. The best features were selected to train the model and estimate the cf-PWV value.

2.2.1 Spectrogram generation

The spectrogram is a time-frequency representation used to analyze the change of frequency with respect to time from a given input signal. In this paper, spectrograms are generated using the function spectrogram from MATLAB. It is well known that the selection of parameters involved in the creation of the spectrogram will define the quality of the representation, as shown in (Jablonski and Dziedziech, 2022). For this reason, windows’ parameters, spectrogram’s shape, and overlapping percentage were finely tuned.

Regarding the window type, it has been shown that Hamming and Kaiser’s windows are good options for generating spectrograms from pulse wave signals such as PPG (Zong and Jafari, 2015; Esgalhado et al., 2021). Overlapping percentage values of 0, 60, and 95 for Hamming windows and 0, 61, and 70 for Kaiser, were selected based on the values reported in (Trethewey, 2000) and (Heinzel et al., 2002). However, for the Kaiser window, it is necessary to define an extra parameter called α which changes depending on the overlapping percentage (Heinzel et al., 2002). Values of 0.5, 3, and 5 were used for the 0, 61, and 70 overlapping percentages, respectively. Additionally, the shape selection was made, considering the requirement from the 2D-SCSA feature extraction method to use square images (Kaisserli and Laleg-Kirati, 2014). For this reason, a squared spectrogram of sizes 250, 166, 100, 50, and 20 was used.

Finally, to select the final values, the spectrogram quality coefficients used in (Jablonski and Dziedziech, 2022) were calculated for each combination of parameters, as follows,

Qf=1Ff=1FσffμffQt=1Tt=1TσttμttQtf=QfQt(1)

where t and f represent the time (rows) and frequency (columns) of the spectrogram, and T and F are the number of time and frequency points respectively. σ represents the standard derivation, μ is the mean. Higher values in these metrics represent a better capacity of the spectrogram to represent the variability of a given signal.

Finally, the combination of parameters with the higher values for the metrics was obtained using the Hamming window, with 0% of overlapping and an image size of 250 × 250 pixels. More details on the parameters tuning and obtained results for all the combinations can be found in the Supplementary Material.

2.2.2 Semi-classical signal analysis method

2.2.2.1 Definition

The semi-classical signal analysis (SCSA) method has been proposed in (Laleg-Kirati et al., 2013) for pulse-shaped signal reconstruction, denoising, and characterization, where the signal is decomposed into a set of signal-dependent adaptive squared eigenfunctions of the Schrödinger operator. The SCSA method has been successfully used for features extraction of blood pressure (BP) and PPG signals by (Laleg-Kirati et al., 2013; Li and Laleg-Kirati, 2021a; Garcia et al., 2022), showing the feasibility of this method to provide useful information on the shape of the input signal which helps to detect morphology changes in the signal. This method has been extended to image representation (Kaisserli and Laleg-Kirati, 2014), denoising (Chahid et al., 2017; Chahid et al., 2018), and feature extraction (Garcia et al., 2022).

Definition 1. Let I (x, y) be a positive real valued square matrix, the image representation I2h of I (x, y) using the 2D-SCSA is defined as follows:

I2hx,y=h2L2,γclm=1MhλmhγΨmh2x,y1γ+1(2)

where hR+* is known as the semi-classical signal parameter, γR+ .λmh are the negative eigenvalues, and Ψ1h,Ψ2h,,ΨMh correspond to their associated L2-normalized eigenfunctions (m = 1, , Mh the number of eigenvalues) extracted from the two-dimensional semi-classical Schrodinger operator described as follows:

H2,hI2ψ=h22ψx2+2ψy2Iψ(3)

and L2,γcl is the suitable semi-classical constant defined as:

L2,γcl=12π2Γγ+1Γγ+2(4)

where Γ is the Gamma function.

2.2.2.2 Numerical computation

The 2D-SCSA requires the computation of eigenvalues and eigenfunctions from a 2D operator, leading to a complex and time-consuming process. To reduce the computational burden, a separation of variables approach has been proposed in (Kaisserli and Laleg-Kirati, 2014) where the standard 1D-SCSA is used for each row and each column; the results are then combined using the following formula: (Kaisserli and Laleg-Kirati, 2014):

Ih,γi,j=h2L2,γclk=1Khr=1Rhβi,k,h+ρj,r,hγ×ψi,k,h2jϕj,r,h2i11+γ(5)

where βi,k,h, k = 1, …, Kh and ρj,m,h, m = 1, …, Mh are the eigenvalues for each row and each column respectively, with ψkh2[j] and ϕmh2[i] are the corresponding eigenvectors.

2.2.2.3 Parameters selection

The semi-classical parameter h and the parameter γ play crucial roles in the SCSA representation as described in (Laleg-Kirati et al., 2013; Kaisserli and Laleg-Kirati, 2014). When the h value tends to 0, the SCSA reconstruction converges to the original image producing the best result. However, it has been noticed that the number of eigenvalues is restricted by the number of samples (Piliouras, 2020). Even knowing that the SCSA representation improves when h tends to 0, this value cannot be very small as it also depends on the number of samples. In addition, selecting the parameter γ given a specific h is also important since it affects the intensity values of the reconstructed images. It has been found that for small h values, γ tends to increase; in contrast, when h increases, γ tends to decrease.

An appropriate h interval has been proposed in (Piliouras, 2020) for 1D-signals, where a minimum value for h based on the sampling theorem is introduced. This value had been successfully used in (Piliouras, 2020; Li et al., 2021), providing good accuracy for signal representation. This minimum value is defined as:

hmin=TsπVmax(6)

where Vmax is the maximum value of the input signal and Ts is the sampling period of the images.

In this paper, we propose to extend the idea of using the hmin to image representation. hmin is computed for all the rows (hrmin) and columns (hcmin) in the image, as is shown in Figure 2. However, given that each pixel is affected by the h value taken from columns and rows, the mean between these two values was computed to obtain the h value (hmmin) for each pixel in the image.

hmmini,j=hrmini+hcminj2i=1..Nj=1..M(7)

FIGURE 2
www.frontiersin.org

FIGURE 2. Computation of the 2D-SCSA’s design parameters h. hrmin and hcmin represent the hmin values extracted from every row and every column of the spectrogram respectively.

To obtain a single h value for the entire image (ĥmin), the maximum value from hmmin matrix was selected to avoid a bad representation or aliasing in the representation.

ĥmin=maxhmmin(8)

The value of the γ parameter was selected by a sensitivity analysis. Different values of γ have been tested to maximize the structural similarity index measure (SSIM) and the Peak Signal to Noise ratio (PSNR) between the original spectrogram and the reconstructed one using 2D-SCSA. More information about the obtained results from the sensitivity analysis is presented in the Supplementary Material.

2.2.3 Features extraction

In this study, three different feature methods were computed. Obtaining 36 SCSA-based features, 102 Energy-based features, and 6 Statistic based features for the PPG and BP spectrograms for the Radial, Digital, and Brachial locations. Each feature was fed separately into the feature selection method to finally be combined with the different machine learning algorithms to estimate the cf-PWV values. The features used in this study are shown in Table 1, and with the relevant advantages and disadvantages of the three types of features.

TABLE 1
www.frontiersin.org

TABLE 1. Features computed by each type.

2.2.3.1 SCSA-based features

2D-SCSA features were considered to be the eigenvalues computed from each of the three following eigenvalues matrices obtained after applying the 2D-SCSA on the spectrogram:

1. Matrix composed by the extracted eigenvalues from the rows of the spectrogram.

2. Matrix composed of the extracted eigenvalues from the columns of the spectrogram.

3. Matrix composed by the sum of the matrices above.

The first features calculated were the three first invariants proposed by (Laleg-Kirati et al., 2013), consisting of some momentum of the negative eigenvalues. In this work, we compute the invariants for all three cases above: row eigenvalues, column eigenvalues, and combined eigenvalues:

INV1=4ĥminn=1Nhm=1Mhκhm,n,INV2=16ĥmin3n=1Nhm=1Mhκhm,n3,(9)
INV3=256ĥmin7n=1Nhm=1Mhκhm,n7(10)

where ĥmin is the semi-classical constant used for the reconstruction as described in the previous section. Nh is the number of eigenvalues (or columns) and Mh represents the number of rows in the previous eigenvalue matrices. For each matrix, we denote appropriate κm, n, h values depending on the corresponding eigenvalue problem (row, column, or combined) as follows:

κm,nh=λmhγ(11)

These invariant parameters were used by (Laleg-Kirati et al., 2010; Li and Laleg-Kirati, 2021b; Garcia et al., 2022) for signal processing features extraction and in (Garcia et al., 2022) for image processing features extraction. This shows the feasibility of these invariants to obtain relevant information from pulse wave signals such as BP and PPG.

Furthermore, based on the first three eigenvalues that approximate the general profile of the image (Laleg-Kirati et al., 2013; Li and Laleg-Kirati, 2021a), the mean value of the three first eigenvalues and κ for each matrix were calculated as follows:

Kn=1Mhm=1Mhκhi,n,En=1Mhm=1Mhκhi,n1γn=1,2,3(12)

We also considered other features as described in (Li et al., 2021) and which consists of the ratio between the first κ of the eigenvalues matrix and ĥmin (Rh) and the ratio between the median of the κ of all eigenvalues (MRh), as following:

Rh=κ1hĥmin,MRh=median(κmh)ĥmin(13)

The mean number of eigenvalues Mh obtained in each eigenvalue matrices was used as a feature since this value gives valuable information on signal shape (Li et al., 2021), helping to identify changes in the morphology. Finally, the mean and standard deviation of the κ were used as descriptors of the pixel distribution of the eigenvalues matrices.

2.2.3.2 Energy-based features

Laws’ mask features are standard image processing based features used to measure the “Texture energy” of a group of pixels in an image. This method has been used in the past for feature extraction for biomedical images (Rachidi et al., 2008), speech recognition using spectrograms (Wang, 2014), texture classification (Dash and Jena, 2017), and as a method of segmentation based on the texture presented on the image (Kvyetnyy et al., 2017).

The principle of this method is to estimate the texture features using a set of texture energy transformations (Laws, 1980). This transformation detects the variation within a fixed-size window using different convolution masks that compute the energy of the image. This group of masks is invariant to changes in luminance, contrast, and rotation that allows the detection of textures under different conditions (Laws, 1980; Stockman and Shapiro, 2001).

All the convolution masks used for the energy texture estimation came from the following set of one-dimensional (1-D) kernels of five or three pixels:

• Kernels with length 3

L3=1,2,1,E3=1,0,1,S3=1,2,1(14)

• kernels with length 5

L5=1,4,6,4,1,E5=1,2,0,2,1,S5=1,0,2,0,1,W5=1,2,0,2,1,R5=1,4,6,4,1

where L (Level) detects the average grey level, E (Edge) extracts edge features, S (Spot) extracts spots, W (Wave) extracts wave features, and R (Ripple) extracts ripples in the image (Laws, 1980). The convolution masks used for the feature extraction were generated by convoluting any vertical one-dimensional vector with a horizontal vector to generate the following 3 × 3 and 5 × 5 filters:

• 3 × 3 filters

L3TL3E3TL3S3TL3L3TE3E3TE3S3TE3L3TS3E3TS3S3TS3

• 5 × 5 filters

L5TL5E5TL5S5TL5W5TL5R5TL5L5TE5E5TE5S5TE5W5TE5R5TE5L5TS5E5TS5S5TS5W5TS5R5TS5L5TW5E5TW5S5TW5W5TW5R5TW5L5TR5E5TR5S5TR5W5TR5R5TR5

The images obtained after convolution between each mask and the images should be normalized to make the descriptors contrast-independent. The normalization was made based on the implementation made by (Miroslav and Rodojevi´c, 2007), where all the images were normalized using the image min-max normalization as follows

Îmask=ImaskminImaskmaxImaskminImask(15)

After the normalization, each outputs Imask were converted to a texture energy image (Emask) by using a moving non-linear window average of absolutes (Rachidi et al., 2008)

Emaskr,c=j=c7c+7i=r7r+7|Imaski,j|(16)

Finally, the mean (Eq. 17), standard deviation (Eq. 18), and entropy (Eq. 19) to each of the texture energy images obtained was computed to obtain a measurement of the global energy texture for each mask (Rachidi et al., 2008).

MEEmask=i=0Mj=0NEmaski,jM×N(17)
STDEmask=i=0Mj=0NEmaski,jMean2M×N(18)
ENEmask=i=0Mj=0NEmaski,j2M×N(19)

2.2.3.3 Central moment features

Statistical central moments are a set of features used to describe the spread and shape of the pixel’s distribution in an image (Grubbström and Tang, 2006) and are computed as

μ̂k=EIμkk=1,2,3,4(20)

where μk represents the kth central moment about the mean μ of the spectrogram image I.

These features have been used for different applications where spectrograms are involved such as event classification and inference detection (Dennis et al., 2011; Mulimani and Koolagudi, 2018; Oh and Kim, 2019).

In this work, we extracted a set of features inspired by features used for audio event classification in (Mulimani and Koolagudi, 2018) derived from the second, third, and fourth central moment, described as follows:

F1=log10μ2,F2=μ3,F3=log10μ4,F4=μ2̂,F5=μ3̂,F6=μ4̂,(21)

where μ2 represents the standard derivation, μ3 is the skewness, and μ4 represents the kurtosis which indicates the flatness of the image histogram. The μk̂ with k = 1, 2, 3, 4 represent the central moments computed from the normalized spectrogram (Î).

Ii,ĵ=Ii,jminImaxIminI(22)

2.3 Feature selection

The feature selection is a technique used to reduce the number of features by eliminating the irrelevant, redundant, and noisy features to improve the model performance (Kumar and Minz, 2014). In this study, we use the Maximum Relevance—Minimum Redundancy (MRMR) algorithm, which is a feature selection method that chooses S̈ number of features that has maximum relevance with respect to the target variable and minimum redundancy with respect to the features that have been selected at previous iterations (Zhao et al., 2019). We relied on the F-test correlation quotient (FCQ) variant of the Maximum Relevance—Minimum Redundancy (MRMR) algorithm to rank the features. This variant is based on the relevance of a feature to predict the desired variable, measured by the F-statistic between the feature and the target variable and the redundancy of the feature computed by the average Pearson correlation between the feature and all the other features.

FCQscoref=FY,f1S̈sSρf,s(23)

where ρ(f, s) is the Pearson correlation, F(Y, f) is the F-statistic and Y is the target variable to estimate, S the set of selected features, S̈ is the number of feature selected, s is a feature such as sS and f denotes a feature currently not selected (fS) (Zhao et al., 2019).

Finally, the number of features (S̈) to be selected was chosen by a sensitivity analysis taking a different number of features which range between 1 and the total number of features to predict the cf-PWV using the proposed machine learning models with the default hyperparameters defined by the python library scikit-learn. Then, the set of features that produce the best mean R2 performance of the models is selected to ensure high performance. Table 2 shows the number of features selected for each feature type, details on the feature selection can be found in the Supplementary Material.

TABLE 2
www.frontiersin.org

TABLE 2. Number of features selected for BP and PPG spectrogram.

2.4 Machine learning models

To create the training and testing dataset for supervised machine learning models, the dataset was split into two different groups where the 70% of the total dataset was used for the training set, and the 30% left was used for the testing set. Finally, each of the three different features types were fed into the following machine learning methods: Random forest regression (RF), Gradient Boost Regressor (GBR), multilayer perceptron (MLP), Multiple Linear regression (MLR), and Suppor Vector Regression (SVR).

2.4.1 Model training

A common practice in machine learning to increase the performance of the models is to standardize the features to have mean 0 and variance 1 using the z-score defined as follows,

x̃ij=xijμiσi(24)

where xi(j) represents the value of the ith feature of the jth data point, μi represents the mean of each feature, and σi is the standard deviation of each feature. In addition, tuning the hyperparameters of the models helps to maximize the performance on the test data for given a specific problem (Elgeldawi et al., 2021). In this project, the hyperparameters optimization of the Machine learning models was made using a random search that has been used in the past for hyperparameter tuning (Jin et al., 2021; Garcia et al., 2022). This algorithm randomly selects different combinations of hyperparameters from a predefined space of values and tests the model’s performance model. Finally, the combination of hyperparameters with the best performance was selected. More information about the hyperparameters space and the values selected for each model can be found in the supplementary material. In combination with the hyperparameter tuning, a 5-fold cross-validation method was implemented to avoid over-fitting during the models’ training and hyperparameter tuning and increase the generalization capacity.

2.4.2 Model evaluation

To evaluate the performance of the models, we used the R-squared (R2) value and the root mean square error (RMSE) between the actual value and the predicted by the model (Bahloul et al., 2021; Garcia et al., 2022).

R2=1n=1NcfPWVrealncfPWVpredictedn2n=1NcfPWVrealnμcfPWVreal2,(25)
RMSE=n=1NcfPWVrealncfPWVpredictedn2N,(26)

where μ is a function that evaluates the mean of cf − PWVreal over N subjects.

2.5 Noise addition

To test the performance of the proposed methodology against noisy data, a high-frequency Gaussian white noise was added to the pulse wave signals to simulate the electrical noise found during the recording (Ban and Kwon, 2016). The typical cause for this type of noise is radio, TV, cellular, and distant lightning (Kularatna et al., 2019). The intensity of the noise was defined using the signal-to-noise ratio (SNR) defined as follows:

SNR=PsPn,(27)

where Ps and Pn correspond to the power of the signal and Gaussian white noise, respectively (Bahloul et al., 2021; Garcia et al., 2022). The selected noise intensity for the BP signals is 20, 10, and 5 dB based on the values used by (Jin et al., 2021). For the PPG signals, we define the values as 65, 45, and 30 dB based on the values reported by (Maxim integrated, 2017) (Elsamnah et al., 2019).

3 Results

3.1 Noise-free case

This project proposed a novel methodology based on spectrogram representation of the signals to estimate the cf-PWV. Table 3 shows the result for the PPG spectrograms where the MLP and SVR models obtained the best results with a R2 = 0.90 or higher and a RMSE = 0.71 or lower for the three features types, producing the best performance values of R2 = 0.99 and RMSE = 0.16 for the SVR with energy features applied to the brachial location. In contrast, the MLR model shows the worst performance of R2 = 0.73 and RMSE = 1.09 using statistical features extracted from the Brachial location. However, it is important to notice that in the case of the SCSA and energy features, the MLR models presented R2 = 0.90 for higher and RMSE = 0.66 or lower, obtaining the best results of R2 = 0.95 and RMSE = 0.47 for the energy features extracted from the Radial location. The feature type with the better overall performance in the estimation was the energy feature presenting a mean of R2 = 0.97 and RMSE = 0.32 for all the models in the three different locations. Similarly, the result obtained for the BP spectrograms showed the best performance for the SVR and MLP models with at least a R2 = 0.97 and a RMSE = 0.36 for all the features with a maximum difference of 0.02 for the R2 and 0.19 between the features. On the other hand, the worst results were presented for the MLR models with a lower value of R2 = 0.77 and RMSE = 1.00 for the Brachial location using the statistical features. As in the PPG spectrogram, the energy features presented the best overall result with a mean value of R2 = 0.97 and RMSE = 0.36 for all the models in the three different locations. Finally, it is important to notice that all the features obtained a performance of R2 between 0.90 and 0.99 for the BP and PPG spectrograms in the different locations.

TABLE 3
www.frontiersin.org

TABLE 3. Free-noise results obtained for PPG and BP spectrogram.

These results show the capacity of the spectrogram as signal representation for cf-PWV estimation using noise-free PPG and BP signals from the Radial, Brachial, and Digital locations. It is important to notice that BP shows a better performance with all three different features compared with the results obtained for the PPG where the energy-based features performed better than the other features for the three locations proposed. It is important to notice that the MLR model with SCSA and energy features obtained values of R2 ≥ 0.90 showing a great capacity to obtain a linear relationship between the features extracted from the spectrogram and the cf-PWV. This is a great advantage since these linear models can allow an easier implementation of the model in real-life applications.

3.2 Noisy case

Table 4 shows the results obtained for the different levels of noise. The highest results obtained for each of the nose levels were R2 = 0.98 and RMSE = 0.33 for the SNR = 65 using the MLP models with the energy features applied to the Brachial location. In addition, SCSA features applied in the brachial location presented the best result for the SNR = 45 with values R2 = 0.96 and RMSE = 0.43 using the SVR model, and values of R2 = 0.92 and RMSE = 0.60 for the SNR = 30 using the MLP and SVR models. In contrast, the worst results were obtained in all the cases by the MLP models using statistic features with values of R2 = 0.76 and RMSE = 1.01 for the Radial location with SNR = 0.65, R2 = 0.68 and RMSE = 1.17 for Digital and brachial location with SNR = 0.45, and R2 = 0.65 and RMSE = 1.23 for Digital and brachial location with SNR = 0.30. For the BP spectrogram, The best values obtained were applied for Radial locations with values of R2 = 0.98 and RMSE = 0.26 for the SNR = 20 using the GBR model with the energy features, R2 = 0.98 and RMSE = 0.32 for the SNR = 10 using the SVR model with the SCSA features, and R2 = 0.97 and RMSE = 0.38 for the SNR = 5 using the SVR model with the SCSA features. In contrast, similar to the PPG spectrograms, the lower results were obtained for the MLR models using the statistical features obtaining values of R2 = 0.70 and RMSE = 1.14 for SNR = 20, R2 = 0.68 and RMSE = 1.18 for SNR = 10, and R2 = 0.69 and RMSE = 1.16 for SNR = 5.

TABLE 4
www.frontiersin.org

TABLE 4. Results obtained for noisy data.

These results show that even with different levels of white gaussian noise added to the original signals, the proposed methodology obtained R2 ≥ 0.90 for the PPG signals with the different levels of noise, using the MLP and SVR models for the different locations. In contrast, similar to the case of noise-free, the BP signals obtained better results, presenting an R2 ≥ 0.90 using the MLP model for the three different features in the proposed locations. Furthermore, in the case of BP signals, the MLR model obtained accurate results for the cf-PWV.

4 Discussion

This study investigated a novel methodology to estimate the cf-PWV based on the application of the spectrogram representation of single PPG or BP signals extracted from a peripheral location. The use of the spectrogram representation for the analysis of biomedical signals such as PPG had been studied before as input for data-driven approaches like the classification of peripheral diseases by (Allen et al., 2021), or biometric recognition, (Donida Labati et al., 2021). For this reason, in this project, the use of the spectrogram from BP or PPG signals to estimate the cf-PWV values is investigated as a novel methodology to take advantage of the frequency and temporal information encoded in the spectrogram matrix.

In this project, three feature types based on the Schrodinger spectrum, the image’s energy texture, and the image’s statistical distribution were combined with different machine learning algorithms to estimate the carotid-to-femoral pulse wave velocity (cf-PWV). The results for the noisy-free signals presented a mean value of R2 = 0.92 and RMSE = 0.54 for the PPG spectrograms and R2 = 0.95 and RMSE = 0.44 for the BP spectrograms for the noisy-free signals extracted from the Radial, Digital and Brachial location, showing the spectrogram’s capacity to encode valuable information that can be extracted to estimate the cf-PWV presenting. The energy-based features using Law’s masks presented the best performance for the PPG and BP signals with values of R2 = 0.99 and RMSE = 0.16 for the SVR applied to PPG spectrogram the Brachial location and R2 = 0.99 and RMSE = 0.17 for the SVR applied to BP spectrogram the Brachial location. Nevertheless, it is important to recall that all three different feature types obtained one or more models with at least a R2 = 0.90 and RMSE = 0.64 for all the locations. These results show the feasibility of the different types of features to extract valuable information from a spectrogram created using noisy-free signals to estimate the cf-PWV values.

In contrast, for the noisy cases, the PPG spectrograms presented values of R2 = 0.92 and RMSE = 0.56 for the SNR = 65, R2 = 0.88 and RMSE = 0.70 for the SNR = 45, and R2 = 0.82 and RMSE = 0.84 for the SNR = 30, showing a decrease in the performance for the SNR = 45 and SNR = 30 cases of 0.04 and 0.10 for the R2, and 0.16 and 0.30 for the RMSE. It is important to notice that the SCSA features presented the best overall performance for the noisy cases, obtaining the best mean results value of R2 = 0.94 and RMSE = 0.50 for MLP and SVR models applied for the different noisy PPG spectrograms and R2 = 0.97 and RMSE = 0.34 for the SVR model applied for the noisy BP spectrograms, obtaining in a value of R2 = 0.90 or higher for each of the noise level presented in this study. However, it is important to notice that the SCSA method presented a high computational complexity to compute the features for each combination of signals (PPG or BP) and location (Radial, Digital, Brachial). For this reason, it is essential to develop future works to reduce this complexity to extend this method for real applications where time and computational cost play an essential role.

Previous studies have been using machine learning or deep learning models to estimate the cf-PWV based on PPG or BP signals (Tavallali et al., 2015; Jin et al., 2021; Li et al., 2022). However, a direct comparison between our work and many of the previous studies cannot be made given that these studies use real data for the estimation, in contrast with the in silico data used in this study. Nevertheless, the study made by (Jin et al., 2021) used the same Blood Pressure in silico signals from the Radial location for the noisy case. In this study, the authors proposed the use of an LSTM deep-learning model to estimate the cf-PWV, obtaining an R2 ≥ 0.98 and a RMSE ≤ 0.24. In contrast, the proposed method obtained a similar performance of an R2 ≥ 0.97 and a RMSE ≤ 0.38.

It is important to notice that the model parameters used for the generation of the in silico pulse wave signals were changed with age, allowing the investigation of the effects of aging in the estimation of cf-PWV. Previous studies had demonstrated that there could exist a decrease in the performance of the estimation for high PWV values associated with the sensitivity to variations in the transit time during the cf-PWV estimation (Li et al., 2022; Jin et al., 2021). This same behavior was noticed in this project for some of the models where there is an increase of the error estimation for higher values of cf-PWV (usually higher than 9 ms), these values are presented for virtual patients between 55 and 75 years old (Charlton et al., 2019). h Nevertheless, the models with the best performances (R2 ≥ 0.98) do not present this increase of error for the high cf-PWV, showing a great capacity to estimate the cf-PWV for all the different ages (25–75) without presenting an important increase in the error produced by the age of the virtual patient. These results are similar to the results reported in Jin et al. (2021) where they use the LSTM for the noisy data estimation obtaining a great capacity to estimate the cf-PWV regarding the age of the virtual patient.

Even if these results are promising, it is crucial to consider the different limitations presented in this project. The principal limitation is the use of in silico data rather than real data collected from a specific human population. Nevertheless, the in silico data allows us to achieve an initial validation of the proposed methodology, whose results will permit us to proceed with the use of real data. Another limitation of this project is the spectrogram representation made using one-cycle signals; given that the real PPG and BP signals present multiple cycles, this will change the spectrogram image obtained, and this could make the features computed in this work may not work for the multi-cycle representation. However, future work will be done toward solving this limitation to validate the proposed methodology to obtain a more realistic analysis of the feasibility of the spectrogram to estimate the cf-PWV.

5 Conclusion

This paper investigates a new methodology to estimate cf-PWV based on the spectrogram representation obtained from BP or PPG signals taken from peripheral signals using machine learning models. The proposed approach incorporates three different types of features to probe the feasibility of the spectrogram to accurately estimate the cf-PWV. The results prove that the three different methods could obtain good performance, where the energy features showed the best performance for all the models without noise and the SCSA presented the best results against the noise levels proposed in the study. In the future, further validation of the proposed methodology in real human signals needs to be conducted to overcome the limitation of using in silico data with one cardiac cycle. This proposed method may be implemented for personal healthcare applications upon successful clinical validation. Also, it can open the door for future investigations of new machine learning methods and feature extraction techniques to improve the estimation of cf-PWV based on spectrogram representation. The code is available at https://github.com/EMANG-KAUST/Spectrogram_AS_Frontiers. We welcome developments to the existing code or contributions of new algorithms for inclusion in future versions of the arterial stiffness prediction platform.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author contributions

The authors confirm their contribution to the paper as follows: study conception and design: MB, JV, and T-ML-K; data collection: MB and JV; simulations and coding: JV, analysis and interpretation of results: MB, JV, and T-ML-K; draft manuscript preparation: MB, JV, and T-ML-K. All authors reviewed the results and approved the final version of the manuscript.

Funding

This work has been supported by the King Abdullah University of Science and Technology (KAUST), Saudi Arabia, Base Research Fund (BAS/1/1627- 01-01) to Taous Meriem Laleg and KAUST.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2023.1100570/full#supplementary-material

Footnotes

1https://peterhcharlton.github.io/pwdb/index.html

References

Allen, J., Liu, H., Iqbal, S., Zheng, D., and Stansby, G. (2021). Deep learning-based photoplethysmography classification for peripheral arterial disease detection: A proof-of-concept study. Physiol. Meas. 42, 054002. doi:10.1088/1361-6579/abf9f3

CrossRef Full Text | Google Scholar

Bahloul, M. A., Chahid, A., and Laleg-Kirati, T.-M. (2021). “A multilayer perceptron-based carotid-to-femoral pulse wave velocity estimation using ppg signal,” in 2021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI IEEE), 1–6.

CrossRef Full Text | Google Scholar

Ban, D., and Kwon, S. (2016). “Movement noise cancellation in ppg signals,” in 2016 IEEE International Conference on Consumer Electronics (ICCE) (Institute of Electrical and Electronics Engineers Inc), 47–48. doi:10.1109/ICCE.2016.7430517

CrossRef Full Text | Google Scholar

Blacher, J., Guerin, A. P., Pannier, B., Marchais, S. J., Safar, M. E., and London, G. M. (1999). Impact of aortic stiffness on survival in end-stage renal disease. Circulation 99, 2434–2439. doi:10.1161/01.cir.99.18.2434

PubMed Abstract | CrossRef Full Text | Google Scholar

Chahid, A., Serrai, H., Achten, E., and Laleg-Kirati, T.-M. (2018). “A new roi-based performance evaluation method for image denoising using the squared eigenfunctions of the Schrödinger operator,” in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 5579–5582. doi:10.1109/EMBC.2018.8513615

PubMed Abstract | CrossRef Full Text | Google Scholar

Chahid, A., Serrai, H., Achten, E., and Laleg-Kirati, T.-M. (2017). “Adaptive method for mri enhancement using squared eigenfunctions of the Schrödinger operator,” in 2017 IEEE Biomedical Circuits and Systems Conference (BioCAS), 1–4. doi:10.1109/BIOCAS.2017.8325107

CrossRef Full Text | Google Scholar

Charlton, P. H., Harana, J. M., Vennin, S., Li, Y., Chowienczyk, P. J., and Alastruey, J. (2019). Modeling arterial pulse waves in healthy aging: A database for in silico evaluation of hemodynamics and pulse wave indexes. Am. J. Physiology - Heart Circulatory Physiology 317, H1062–H1085. doi:10.1152/ajpheart.00218.2019

PubMed Abstract | CrossRef Full Text | Google Scholar

Choi, C. U., Park, E. B., Suh, S. Y., Kim, J. W., Kim, E. J., Rha, S.-W., et al. (2007). Impact of aortic stiffness on cardiovascular disease in patients with chest pain: Assessment with direct intra-arterial measurement. Am. J. Hypertens. 20, 1163–1169. doi:10.1016/j.amjhyper.2007.07.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Dash, S., and Jena, U. R. (2017). Multi-resolution laws’ masks based texture classification. J. Appl. Res. Technol. 15, 571–582. doi:10.1016/j.jart.2017.07.005

CrossRef Full Text | Google Scholar

Dennis, J., Tran, H. D., and Li, H. (2011). Spectrogram image feature for sound event classification in mismatched conditions. IEEE Signal Process. Lett. 18, 130–133. doi:10.1109/LSP.2010.2100380

CrossRef Full Text | Google Scholar

Donida Labati, R., Piuri, V., Rundo, F., Scotti, F., and Spampinato, C. (2021). “Biometric recognition of ppg cardiac signals using transformed spectrogram images,” in Pattern recognition. ICPR international workshops and challenges. Editors A. Del Bimbo, R. Cucchiara, S. Sclaroff, G. M. Farinella, T. Mei, M. Bertiniet al. (Cham: Springer International Publishing), 244–257.

CrossRef Full Text | Google Scholar

Elgeldawi, E., Sayed, A., Galal, A. R., and Zaki, A. M. (2021). Hyperparameter tuning for machine learning algorithms used for Arabic sentiment analysis. Informatics 8, 79. doi:10.3390/informatics8040079

CrossRef Full Text | Google Scholar

Elsamnah, F., Bilgaiyan, A., Affiq, M., Shim, C. H., Ishidai, H., and Hattori, R. (2019). Comparative design study for power reduction in organic optoelectronic pulse meter sensor. Biosensors 9, 48. doi:10.3390/bios9020048

PubMed Abstract | CrossRef Full Text | Google Scholar

Esgalhado, F., Fernandes, B., Vassilenko, V., Batista, A., and Russo, S. (2021). The application of deep learning algorithms for ppg signal processing and classification. Computers 10, 158. doi:10.3390/computers10120158

CrossRef Full Text | Google Scholar

Gade, J.-L., Thore, C.-J., Sonesson, B., and Stålhand, J. (2021). In vivo parameter identification in arteries considering multiple levels of smooth muscle activity. Biomechanics Model. Mechanobiol. 1–13, 1547–1559. doi:10.1007/s10237-021-01462-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Garcia, J. M. V., Bahloul, M. A., and Laleg-Kirati, T.-M. (2022). “A multiple linear regression model for carotid-to-femoral pulse wave velocity estimation based on schrodinger spectrum characterization,” in 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 143–147. doi:10.1109/EMBC48229.2022.9871031

CrossRef Full Text | Google Scholar

Grubbström, R. W., and Tang, O. (2006). The moments and central moments of a compound distribution. Eur. J. Operational Res. 170, 106–119. doi:10.1016/j.ejor.2004.06.012

CrossRef Full Text | Google Scholar

Heinzel, G., Rüdiger, A., and Schilling, R. (2002). Spectrum and spectral density estimation by the discrete fourier transform (dft), including a comprehensive list of window functions and some new flat-top windows.

Google Scholar

Jablonski, A., and Dziedziech, K. (2022). Intelligent spectrogram – A tool for analysis of complex non-stationary signals. Mech. Syst. Signal Process. 167, 108554. doi:10.1016/j.ymssp.2021.108554

CrossRef Full Text | Google Scholar

Jin, W., Chowienczyk, P., and Alastruey, J. (2021). Estimating pulse wave velocity from the radial pressure wave using machine learning algorithms. Plos one 16, e0245026. doi:10.1371/journal.pone.0245026

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaisserli, Z., and Laleg-Kirati, T.-M. (2014). Image representation and denoising using squared eigenfunctions of schrodinger operator. arXiv preprint arXiv:1409.3720.

Google Scholar

Kim, H.-L., and Kim, S.-H. (2019). Pulse wave velocity in atherosclerosis. Front. Cardiovasc. Med. 6, 41. doi:10.3389/fcvm.2019.00041

PubMed Abstract | CrossRef Full Text | Google Scholar

Kularatna, N., Ross, A. S., Fernando, J., and James, S. (2019). Background to surge protection. Elsevier. doi:10.1016/b978-0-12-811664-7.00001-x

CrossRef Full Text | Google Scholar

Kumar, V., and Minz, S. (2014). Feature selection: A literature review. SmartCR 4, 211–229. doi:10.6029/smartcr.2014.03.007

CrossRef Full Text | Google Scholar

Kvyetnyy, R., Sofina, O., Olesenko, A., Komada, P., Sikora, J., Kalizhanova, A., et al. (2017). “Method of image texture segmentation using laws’ energy measures,” in In Photonics Applications in Astronomy, Communications, Industry, and High Energy Physics Experiments 2017 (SPIE), 10445, 1784–1792.

Google Scholar

Laleg-Kirati, T. M., Crépeau, E., and Sorine, M. (2013). Semi-classical signal analysis. Math. Control, Signals, Syst. 25, 37–61. doi:10.1007/s00498-012-0091-1

CrossRef Full Text | Google Scholar

Laleg-Kirati, T. M., Médigue, C., Papelier, Y., Cottin, F., and Van De Louw, A. (2010). Validation of a semi-classical signal analysis method for stroke volume variation assessment: A comparison with the PiCCO technique. Ann. Biomed. Eng. 38, 3618–3629. doi:10.1007/s10439-010-0118-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Laws, K. I. (1980). Textured image segmentation. University of Southern California. IPI Report 940.

Google Scholar

Li, P., and Laleg-Kirati, T. M. (2021a). Central blood pressure estimation from distal PPG measurement using semiclassical signal analysis features. IEEE Access 9, 44963–44973. doi:10.1109/ACCESS.2021.3065576

CrossRef Full Text | Google Scholar

Li, P., and Laleg-Kirati, T. (2021b). Signal denoising based on the Schrödinger operator’s eigenspectrum and a curvature constraint. IET Signal Process. 15, 195–206. doi:10.1049/sil2.12023

CrossRef Full Text | Google Scholar

Li, P., Piliouras, E., Poghosyan, V., AlHameed, M., and Laleg-Kirati, T.-M. (2021). “Automatic detection of epileptiform eeg discharges based on the semi-classical signal analysis (scsa) method,” in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC IEEE), 928–931.

CrossRef Full Text | Google Scholar

Li, Y., Xu, Y., Ma, Z., Ye, Y., Gao, L., and Sun, Y. (2022). An xgboost-based model for assessment of aortic stiffness from wrist photoplethysmogram. Comput. Methods Programs Biomed. 226, 107128. doi:10.1016/j.cmpb.2022.107128

PubMed Abstract | CrossRef Full Text | Google Scholar

Mackenzie, I., Wilkinson, I., and Cockcroft, J. (2002). Assessment of arterial stiffness in clinical practice. QJM Int. J. Med. 95, 67–74. doi:10.1093/qjmed/95.2.67

PubMed Abstract | CrossRef Full Text | Google Scholar

Matsushita, K., Ding, N., Kim, E. D., Budoff, M., Chirinos, J. A., Fernhall, B., et al. (2019). Cardio-ankle vascular index and cardiovascular disease: Systematic review and meta-analysis of prospective and cross-sectional studies. J. Clin. Hypertens. 21, 16–24. doi:10.1111/jch.13425

PubMed Abstract | CrossRef Full Text | Google Scholar

Mattace-Raso, F. U., van der Cammen, T. J., Hofman, A., van Popele, N. M., Bos, M. L., Schalekamp, M. A., et al. (2006). Arterial stiffness and risk of coronary heart disease and stroke: The rotterdam study. Circulation 113, 657–663. doi:10.1161/CIRCULATIONAHA.105.555235

PubMed Abstract | CrossRef Full Text | Google Scholar

Maxim integrated (2017). Signal-to-noise ratio as a quantitative measure for optical biosensors.

Google Scholar

Mensah, G. A., Roth, G. A., and Fuster, V. (2019). The global burden of cardiovascular diseases and risk factors: 2020 and beyond.

Google Scholar

Miroslav, G. L., and Rodojevic, R. (2007). Texture segmentation: Co-Occurrence matrix and laws’ texture masks methods.

Google Scholar

Mulimani, M., and Koolagudi, S. G. (2018). “Acoustic event classification using spectrogram features,” in TENCON 2018 - 2018 IEEE Region 10 Conference, 1460–1464. doi:10.1109/TENCON.2018.8650444

CrossRef Full Text | Google Scholar

Oh, M., and Kim, Y.-H. (2019). Statistical approach to spectrogram analysis for radio-frequency interference detection and mitigation in an l-band microwave radiometer. Sensors 19, 306. doi:10.3390/s19020306

PubMed Abstract | CrossRef Full Text | Google Scholar

Piliouras, E. (2020). Contributions to the semi-classical signal analysis method: The arterial stiffness assessment case study. doi:10.25781/KAUST-NVCZ0

CrossRef Full Text | Google Scholar

Rachidi, M., Chappard, C., Marchadier, A., Gadois, C., Lespessailles, E., and Benhamou, C. (2008). “Application of laws’ masks to bone texture analysis: An innovative image analysis tool in osteoporosis,” in 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 1191–1194. doi:10.1109/ISBI.2008.4541215

CrossRef Full Text | Google Scholar

Rajzer, M. W., Wojciechowska, W., Klocek, M., Palka, I., Brzozowska-Kiszka, M., and Kawecka-Jaszcz, K. (2008). Comparison of aortic pulse wave velocity measured by three techniques: Complior, sphygmocor and arteriograph. J. Hypertens. 26, 2001–2007. doi:10.1097/HJH.0b013e32830a4a25

PubMed Abstract | CrossRef Full Text | Google Scholar

Shokawa, T., Imazu, M., Yamamoto, H., Toyofuku, M., Tasaki, N., Okimoto, T., et al. (2005). Pulse wave velocity predicts cardiovascular mortality findings from the Hawaii-los angeles-hiroshima study. Circulation J. 69, 259–264. doi:10.1253/circj.69.259

PubMed Abstract | CrossRef Full Text | Google Scholar

Siam, A. I., Sedik, A., El-Shafai, W., Elazm, A. A., El-Bahnasawy, N. A., El Banby, G. M., et al. (2021). Biosignal classification for human identification based on convolutional neural networks. Int. J. Commun. Syst. 34. doi:10.1002/dac.4685

CrossRef Full Text | Google Scholar

Stockman, G., and Shapiro, L. G. (2001). Computer vision. Upper Saddle River, NJ, USA: Prentice Hall PTR.

Google Scholar

Tavallali, P., Hou, T. Y., Rinderknecht, D. G., and Pahlevan, N. M. (2015). On the convergence and accuracy of the cardiovascular intrinsic frequency method, 2. Royal Society open science, 150475.

CrossRef Full Text | Google Scholar

Tavallali, P., Razavi, M., and Pahlevan, N. M. (2018). Artificial intelligence estimation of carotid-femoral pulse wave velocity using carotid waveform. Sci. miscs 8 (1). doi:10.1038/s41598-018-19457-0

CrossRef Full Text | Google Scholar

Trethewey, M. W. (2000). Window and overlap processing effects on power estimates from spectra. Mech. Syst. Signal Process. 14, 267–278. doi:10.1006/mssp.1999.1274

CrossRef Full Text | Google Scholar

van Sloten, T. T., Schram, M. T., van den Hurk, K., Dekker, J. M., Nijpels, G., Henry, R. M., et al. (2014). Local stiffness of the carotid and femoral artery is associated with incident cardiovascular events and all-cause mortality: The hoorn study. J. Am. Coll. Cardiol. 63, 1739–1747. doi:10.1016/j.jacc.2013.12.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, K.-C. (2014). The feature extraction based on texture image information for emotion sensing in speech. Sensors 14, 16692–16714. doi:10.3390/s140916692

PubMed Abstract | CrossRef Full Text | Google Scholar

Willum Hansen, T., Staessen, J. A., Torp-Pedersen, C., Rasmussen, S., Thijs, L., Ibsen, H., et al. (2006). Prognostic value of aortic pulse wave velocity as index of arterial stiffness in the general population. Circulation 113, 664–670. doi:10.1161/CIRCULATIONAHA.105.579342

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, Z., Anand, R., and Wang, M. (2019). “Maximum relevance and minimum redundancy feature selection methods for a marketing machine learning platform,” in 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 442–452. doi:10.1109/DSAA.2019.00059

CrossRef Full Text | Google Scholar

Zong, C., and Jafari, R. (2015). “Robust heart rate estimation using wrist-based ppg signals in the presence of intense physical activities,” in 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 8078–8082. doi:10.1109/EMBC.2015.7320268

CrossRef Full Text | Google Scholar

Keywords: pulse wave velocity, spectrogram, PPG, distal blood pressure, machine learning (ML), image processing, semi-classical signal analysis

Citation: Vargas JM, Bahloul MA and Laleg-Kirati T-M (2023) A learning-based image processing approach for pulse wave velocity estimation using spectrogram from peripheral pulse wave signals: An in silico study. Front. Physiol. 14:1100570. doi: 10.3389/fphys.2023.1100570

Received: 16 November 2022; Accepted: 31 January 2023;
Published: 03 March 2023.

Edited by:

Abdul Wahab, Nazarbayev University, Kazakhstan

Reviewed by:

Haipeng Liu, Coventry University, United Kingdom
Pranjali Gajbhiye, Nirvesh Enterprises Private Limited, India

Copyright © 2023 Vargas, Bahloul and Laleg-Kirati. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Taous-Meriem Laleg-Kirati, taousmeriem.laleg@kaust.edu.sa

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.