Blood Pressure Morphology Assessment from Photoplethysmogram and Demographic Information Using Deep Learning with Attention Mechanism

Aguirre, Nicolas; Grall-Maës, Edith; Cymberknop, Leandro J.; Armentano, Ricardo L.

doi:10.3390/s21062167

¹ GIBIO, Facultad Regional Buenos Aires, Universidad Tecnológica Nacional, Ciudad Autónoma Buenos Aires C1179AAQ, Argentina

² LIST3N, Université de Technologie de Troyes, 10004 Troyes, France

* Author to whom correspondence should be addressed.

Sensors 2021, 21(6), 2167; https://doi.org/10.3390/s21062167

Submission received: 16 February 2021 / Revised: 10 March 2021 / Accepted: 15 March 2021 / Published: 19 March 2021

(This article belongs to the Section Biosensors)

Abstract

Arterial blood pressure (ABP) is an important vital sign from which valuable information about the subject’s health can be extracted. Studying its morphology makes it possible to diagnose cardiovascular diseases such as hypertension, so routine ABP control is recommended. The most common method of controlling ABP is the cuff-based method, which yields only the systolic and diastolic blood pressure (SBP and DBP, respectively). This paper proposes a cuff-free method to estimate the morphology of the average ABP pulse ($\bar{ABPM}$) through a deep learning model based on a seq2seq architecture with attention mechanism. It needs only raw photoplethysmogram (PPG) signals from the finger and can integrate both categorical and continuous demographic information (DI). The experiments were performed on more than 1100 subjects from the MIMIC database, for which the corresponding age and gender were retrieved. Without allowing data from the same subject to be used for both training and testing, the mean absolute errors (MAE) were 6.57 ± 0.20 and 14.39 ± 0.42 mmHg for DBP and SBP, respectively. For $\bar{ABPM}$, the correlation coefficient R and the MAE were 0.98 ± 0.001 and 8.89 ± 0.10 mmHg, respectively. In summary, this methodology is capable of transforming PPG into an ABP pulse and obtains better results when the DI of the subjects is used, which is potentially useful at a time when wireless devices are becoming more popular.

Keywords: photoplethysmography; continuous arterial blood pressure; cuff-less calibration; deep learning

1. Introduction

Cardiovascular diseases (CVDs) remain the most common cause of morbidity and mortality worldwide [1]. One of their main risk factors, affecting at least 1.3 billion people, is high blood pressure (BP), or hypertension [2]. Unfortunately, most of the population is not aware of suffering from a CVD until an event such as arrhythmia, heart attack, or stroke occurs. In this context, regular BP monitoring becomes an essential strategy for prevention, detection, and control.

Methods for measuring BP are divided into noninvasive and invasive. The traditional noninvasive method is sphygmomanometry. In general, the measurement is carried out by a physician or another member of the clinical staff, and the subject rests for a few minutes beforehand so that their BP stabilizes. Because it depends on an inflatable cuff, it cannot serve as a continuous measurement method, since only two values are obtained: diastolic BP (DBP) and systolic BP (SBP). Invasive methods are performed by inserting intravascular catheters with pressure transducers. They have the disadvantage of exposing the subject to bleeding and infections. Their advantage is access to the continuous arterial BP (ABP) morphology, the gold standard for BP monitoring. Additionally, a noninvasive way to estimate the ABP is the tonometry technique in combination with the cuff sphygmomanometer: tonometry provides the estimate of the waveform and the cuff sphygmomanometer provides the calibration values [3].

ABP morphology (ABPM) is defined by the mechanical interaction between the blood flow, originating in the heart, and the arteries. The DBP corresponds to the minimum value of BP and is related to the opening of the aortic valve for blood ejection. The SBP is defined as the maximum pressure value applied by the left ventricle during the cardiac cycle. It is the result of the interaction between the blood ejected into the arterial tree and the reflected waves [4]. The dicrotic notch (DN) represents the closure of the aortic valve and is used to calculate the duration of the ejection period and the beginning of the diastolic phase. ABPM can undergo transient alterations, such as those induced by the respiratory rhythm or specific vascular test maneuvers. On the other hand, permanent alterations can be observed as a result of advanced age or the appearance of vascular pathologies such as arterial stiffness [5]. In addition, the ABPM changes according to the site of the arterial tree at which it is measured. However, if both the waveform and the calibration values are known, generalized transfer functions can be used to estimate the ABPM at another site [6]. Furthermore, it is known that ABPM may be more predictive of cardiovascular events than cuff-pressure values alone [7,8,9], may warn of CVDs such as diastolic dysfunction [10], and could be a valuable measure of the response to the treatment of obstructive sleep apnea [11]. In this sense, the analysis of the ABPM makes it possible to derive many features related to the health of the cardiovascular system [3]. Some of them correspond specifically to ABP values and others to temporal occurrences.

An important temporal feature introduced in [12] and studied in more depth in Mukkamala et al. [13] is the pulse transit time (PTT). It is defined as the time between the beginning of a pulse originating in the heart and its arrival at a specific point on the periphery of the arterial tree. PTT is related to the arterial pulse wave velocity (PWV) through the Moens–Korteweg equation:

$$PWV = \frac{L}{PTT} = \sqrt{\frac{E h}{2 \rho r}} \qquad (1)$$

in which $L$ is the distance travelled by the pulse, $E$ is the elastic modulus, $h$ is the arterial wall thickness, $\rho$ is the blood density, and $r$ is the radius of the vessel. PWV can in turn be related to ABP through the Hughes equation [14]:

$$E = E_{0} e^{\alpha P} \qquad (2)$$

where both $\alpha > 0$ and $E_{0}$ are subject-specific constants. $E_{0}$ corresponds to the zero-pressure modulus of the vessel wall and $P$ refers to BP. PTT can also be defined as the difference between the pulse arrival time (PAT) and the pre-ejection period (PEP). In this sense, PAT can be assessed as the time delay between the R-peak of the electrocardiogram (ECG) and the BP pulse onset. However, PAT does not appear in Equation (1) and cannot be related directly to BP. Furthermore, it has been shown that PEP represents a significant and variable proportion of PTT, from 10% to 30% [15]. Nevertheless, PAT is widely used by researchers as a good approximation of PTT, mainly because it is easy to measure [13].
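To make the link between Equations (1) and (2) concrete, the following minimal sketch chains them to show that PTT shortens as pressure rises. The function names and all vessel parameters are hypothetical, illustrative values chosen here; they are not taken from the paper.

```python
import math

def pulse_wave_velocity(pressure, e0, alpha, h, rho, r):
    """PWV from Eqs. (1)-(2): E = E0*exp(alpha*P), PWV = sqrt(E*h / (2*rho*r))."""
    elastic_modulus = e0 * math.exp(alpha * pressure)
    return math.sqrt(elastic_modulus * h / (2.0 * rho * r))

def pulse_transit_time(pressure, length, **vessel):
    """PTT = L / PWV for a vessel segment of the given length."""
    return length / pulse_wave_velocity(pressure, **vessel)

# Hypothetical vessel: E0 in Pa, alpha in 1/mmHg, h and r in m, rho in kg/m^3.
vessel = dict(e0=1.0e5, alpha=0.017, h=1.0e-3, rho=1060.0, r=5.0e-3)
for p in (80.0, 120.0, 160.0):  # pressure in mmHg
    print(p, pulse_transit_time(p, length=0.6, **vessel))
```

Under these assumed parameters the printed PTT decreases monotonically with pressure, which is the qualitative relation that PTT-based BP estimation relies on.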

Following this approach, recent years have seen an increase in the number of publications on estimating BP values noninvasively and in real time, also called "cuff-less calibration". In this context, the finger photoplethysmography (PPG) signal, owing to its similarities with BP in the time and frequency domains [16], emerges as an interesting measurement for estimating BP. PPG is an optical technique that measures the change of blood volume in the vessel. Its advantages are low cost, simplicity, and portability, which are very attractive characteristics for wearable devices [17]. Its disadvantages are its sensitivity to noise and to artifacts caused by subject movement; therefore, signal processing must generally be applied.

Unfortunately, the PTT approach to estimating BP cannot be directly applied to the PPG signal morphology. To deal with this issue, different machine learning approaches, such as linear regression [18], AdaBoost [19], classical fully-connected neural networks (NN) [20], and Gaussian process regression (GPR) [21], were proposed to model the subject-specific relation between PPG and BP. These techniques focused on feature extraction from PPG and ECG. In particular, in Monte-Moreno [22] and Ruiz-Rodríguez et al. [23], Random Forest (RF) and Deep Belief Network-Restricted Boltzmann Machine techniques, respectively, were applied to estimate SBP and DBP from features extracted from the PPG. In contrast, with the rise of deep learning techniques, the feature extraction step can be delegated to the NN. In Eom et al. [24], raw ECG, PPG, and ballistocardiogram (BCG) signals were used in a combined convolutional NN (CNN) and recurrent NN (RNN) model. Furthermore, some studies proposed to work only with the PPG time series [25] and its derivatives [26]. In Liang et al. [25], a pretrained CNN was used to classify three levels of hypertension based on the scalogram of the PPG, and in Slapniča et al. [26], a spectro-temporal ResNet model was proposed to estimate the DBP and SBP values using the PPG together with its first and second derivatives (PPG′ and PPG″, respectively). The latter was also a combination of CNN and RNN with gated recurrent units (GRU). To the best of our knowledge, few studies address the hard task of directly estimating the continuous ABP. In Sideris et al. [27] and Sadrawi et al. [28], an RNN model with long short-term memory (LSTM) units and a deep convolutional auto-encoder (DCAE) model, respectively, were proposed to transfer signals from PPG to ABP. General surveys of the existing and emerging approaches in this field can be found in Hosanee et al. [29] and Kyriacou [30].

In this context, and considering the recommendations from Elgendi et al. [17], the collaborative spirit of Slapniča et al. [26], and the requirements for working with the MIMIC-III Matched Waveform Database (MWDB) and the MIMIC-III Clinical Database (CDB) [31], the contributions of our work can be summarized as follows:

Morphology of the average ABP pulse ($\bar{ABPM}$): The proposed methodology can estimate $\bar{ABPM}$, from which DBP, DN, and SBP values are then extracted.
Raw PPG signal and demographic information (DI): The proposed deep learning architecture combines the raw PPG signal and the DI (age and gender) of each subject in the same model. The addition of DI improves the estimation of $\bar{ABPM}$.
Limited bias: The number of records per subject and the signal durations are limited to reduce per-subject bias.
Reproducibility: The processed dataset, subject IDs, temporal information of each signal, model architecture, and training source code are available for reproducibility (see the Supplementary Materials section). Because of the requirements from [31], the DI used is not shared, but the code to extract it is also available for those whose request to access MIMIC-III CDB is accepted.

2. Materials and Methods

Figure 1 shows a block diagram of the proposed methodology. Data for this work come from two publicly available databases: MIMIC-III MWDB and MIMIC-III CDB. The first contains over 20,000 waveform records digitized at 125 Hz from more than 10,000 distinct patients in intensive care units, and the second includes information such as demographics, laboratory and microbiology test results, cardiology and radiology reports, and diagnostics. In the preprocessing stage, records from MIMIC-III MWDB with invasive ABP and fingertip PPG signals are selected, and the corresponding subject’s age and gender are obtained from MIMIC-III CDB. In the processing stage, only segments with sufficient signal quality (SQ) are kept and the morphology of the average ABP pulse ($\bar{ABPM}$) is computed for each. The deep learning stage comprises the model architecture, hyperparameter settings, and training phase used to obtain the estimated $\bar{ABPM}$ ($\hat{ABPM}$) from the PPG signal. Finally, the values and time occurrences from $\hat{ABPM}$ are evaluated. Each stage is detailed in the following subsections.

2.1. Preprocessing

The IDs of the records with a minimum duration of 15 min and with both ABP and PPG signals were kept. A 10 min interval was defined to consider the subject to be in a rest condition, and 5 min was defined as the gap between different segments of the record. In this work, only age and gender were extracted from MIMIC-III CDB. The age range for the analysis was set between 18 and 89 years.

2.2. Processing

Figure 2a summarizes the processing stage. Part of this section was inspired by the code released by Slapniča et al. [26]. Each record was loaded with the WFDB Toolbox [32] for Matlab, and two 15-s segments spaced 5 min apart were analyzed (Figure 2b). Each time a segment was rejected for not meeting the requirements described below, the analysis moved forward one minute before analyzing two new segments. If a record could not meet the criteria, it was excluded from the analysis and the next record was evaluated. If the criteria were met, a structured file containing both the raw segments and the processed pulses was generated.

The main steps were called Flat, Peak, PPG-SQ, and ABP-SQ. Several thresholds were set to ensure the quality of each segment and pulse, at least as strict as those of Slapniča et al. [26]. In particular, further thresholds were added for PPG-SQ and ABP-SQ. The pulse durations were limited to the range [0.5, 1.5] s, considering normal physiological limits at rest. The number of pulses per analyzed segment was limited to [10, 30]. To be accepted, the difference SBP − DBP and the moment coefficient of skewness [33] had to be higher than 10 mmHg and zero, respectively.

In more detail, Flat and Peak detect null data and saturated points in the valleys and peaks of the signals, respectively. Then, a Butterworth filter with cutoff frequencies [0.5, 8] Hz and MinMax normalization were applied only to the PPG segment. A pulse-by-pulse analysis was done with the marker proposed in Li et al. [34]. It is important to clarify that PPG-SQ corresponds to part of the feature extraction step in Slapniča et al. [26], but in this work it was used only for signal quality purposes. Once PPG-SQ succeeded, the raw PPG segment was saved. The $\bar{ABPM}$ were calculated in the ABP-SQ step. The ABP pulses were synchronized with respect to their onsets. For each time-step $t = i \Delta t$, the mean ($\mu_{ABPM_{i}}$) and standard deviation ($\sigma_{ABPM_{i}}$) were calculated. Finally, $\bar{ABPM}$ was computed only with the points in the range $\mu_{ABPM_{i}} \pm 1.25\,\sigma_{ABPM_{i}}$, as shown in Figure 2c. For the deep learning stage explained below, the class of each point, related to the different stages of the cardiac cycle, was defined as one of the following intervals: [onset, systolic peak] ($C_{[O, SP]}$), [systolic peak, dicrotic notch] ($C_{[SP, DN]}$), or [dicrotic notch, end] ($C_{[DN, E]}$).
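The averaging rule described above can be sketched as follows. This is an illustrative reading, not the released processing code: the function name is hypothetical, pulses are assumed to be already aligned at their onsets, the population standard deviation is assumed, and truncating to the shortest pulse is a simplification chosen here.

```python
from statistics import mean, pstdev

def average_pulse(pulses, k=1.25):
    """Average onset-aligned pulses, ignoring points outside mean +/- k*std."""
    length = min(len(p) for p in pulses)  # assumption: truncate to the shortest pulse
    averaged = []
    for i in range(length):
        column = [p[i] for p in pulses]          # all pulses at time-step i
        mu, sigma = mean(column), pstdev(column)
        kept = [v for v in column if abs(v - mu) <= k * sigma]
        averaged.append(mean(kept))
    return averaged

# Four similar pulses and one outlier pulse (values in mmHg, made up for illustration).
pulses = [
    [80.0, 120.0, 100.0],
    [82.0, 122.0, 102.0],
    [78.0, 118.0, 98.0],
    [81.0, 121.0, 101.0],
    [150.0, 200.0, 170.0],
]
print(average_pulse(pulses))
```

In this toy input the fifth pulse lies outside the 1.25 standard deviation band at every time-step, so the average is formed from the remaining four pulses only.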

Once all the records were processed, and after a visual inspection, ABP and pulse duration were limited to 180 mmHg and 1.2 s, respectively. In addition, only pulses with skewness greater than 0.2 were accepted. At this point, there were 10,696 segments corresponding to 1131 subjects, where 169 subjects accounted for more than 50% of the segments. To reduce subject bias, the number of segments per subject was limited to 10. Finally, there were 6478 segments, where 333 subjects represented 50% of the segments (Figure 3a). Figure 3b,c show the age and gender distributions and the DBP and SBP distributions, respectively, of the selected dataset. There were 464 females and 667 males, while the mean and standard deviation of age, DBP, and SBP were 58.6 ± 14.1 years, 64.48 ± 9.51 mmHg, and 130.84 ± 20.27 mmHg, respectively.

The raw PPG segment saved during processing was filtered before being used as input. A band-pass Butterworth filter with cutoff frequencies [0.5, 45] Hz was applied. As mentioned before, PPG′ provides information that could improve $\hat{ABPM}$. PPG′ was computed using a Savitzky–Golay filter [35]. The window size and the polynomial degree were 7 and 3, respectively. In addition, one second was removed at the beginning and at the end of the segment to avoid artifacts caused by the two filters just mentioned. In summary, the dataset available for the deep learning stage consisted of 6478 segments of 13 s each, equivalent to 23.4 h.
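As an illustration of a Savitzky–Golay derivative with a 7-point window and a cubic fit, the sketch below applies the tabulated first-derivative convolution weights for that configuration. It is a stand-alone approximation of the idea, not the authors' code: the function name is hypothetical, and edge samples are simply dropped here, which differs from how library implementations pad the signal.

```python
# Tabulated Savitzky-Golay first-derivative weights for a 7-point window and a
# cubic fit (sample offsets -3..3); the weighted sum is divided by 252.
SG7_CUBIC_D1 = (22, -67, -58, 0, 58, 67, -22)

def sg_first_derivative(signal, fs=125.0):
    """Return d(signal)/dt at interior samples (3 samples are lost at each edge)."""
    half = len(SG7_CUBIC_D1) // 2
    out = []
    for n in range(half, len(signal) - half):
        acc = sum(w * signal[n + k - half] for k, w in enumerate(SG7_CUBIC_D1))
        out.append(acc * fs / 252.0)  # scale from per-sample to per-second
    return out

# A cubic is differentiated exactly by a cubic-fit filter: d/dn n^3 = 3 n^2.
cubic = [float(n ** 3) for n in range(10)]
print(sg_first_derivative(cubic, fs=1.0))
```

Because the filter fits a cubic inside each window, it reproduces the derivative of any cubic exactly, which makes this a convenient sanity check before applying it to a noisy PPG segment.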

2.3. Deep Learning

The proposed deep learning architecture is inspired by seq2seq encoder-decoder models [36] with attention mechanism [37,38] from the natural language processing domain. Before its detailed description in Section 2.3.3, a few concepts related to this model are described in Section 2.3.1, and some considerations about the input data are presented in Section 2.3.2.

2.3.1. RNN Encoder-Decoder

The encoder reads each input from a variable-length source sequence and encodes it into a fixed-length vector representation, also called the hidden state. The decoder then initializes its own hidden state with that of the encoder and generates an output at each time step. Figure 4a illustrates the encoder-decoder model using RNNs, where the type of RNN selected for this work is the "gated recurrent unit" (GRU). The structure of the GRU is shown in Figure 4b.

The GRU structure was proposed by Cho et al. [36] to mitigate the vanishing/exploding gradient problem of RNNs. The input vectors of each GRU unit are the previous hidden state $h_{t-1}$ and the current input $x_{t}$, while the current hidden state $h_{t}$ corresponds to the output. $h_{t}$ is computed according to the relations given by Equation (3):

$$\begin{aligned} z_{t} &= \sigma(W_{z} \cdot [h_{t-1}, x_{t}]) = \sigma(W_{hz} h_{t-1} + W_{tz} x_{t}) \\ r_{t} &= \sigma(W_{r} \cdot [h_{t-1}, x_{t}]) = \sigma(W_{hr} h_{t-1} + W_{tr} x_{t}) \\ \tilde{h}_{t} &= \tanh(W_{h} \cdot [r_{t} \otimes h_{t-1}, x_{t}]) = \tanh(W_{hh} (r_{t} \otimes h_{t-1}) + W_{th} x_{t}) \\ h_{t} &= (1 - z_{t}) \otimes h_{t-1} + z_{t} \otimes \tilde{h}_{t} \end{aligned} \qquad (3)$$

where $r_{t}$ and $z_{t}$ denote the reset gate and the update gate, respectively. $W_{z}$, $W_{r}$, and $W_{h}$ are learnable weight matrices and $\tilde{h}_{t}$ is the proposed hidden state. $\sigma(\cdot)$ and $\tanh(\cdot)$ correspond to the logistic sigmoid and hyperbolic tangent functions, respectively, and $\otimes$ is the symbol for element-wise multiplication.
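A single GRU update as written in Equation (3) can be sketched in plain Python as below. This is only a didactic, bias-free transcription of the four relations: the helper names and the dictionary layout of the weights are hypothetical, and a real model would rely on a deep learning framework rather than list arithmetic.

```python
import math

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def sigmoid(vector):
    return [1.0 / (1.0 + math.exp(-a)) for a in vector]

def gru_step(h_prev, x, w):
    """One GRU update following Eq. (3); `w` maps names such as 'hz' to matrices."""
    z = sigmoid(add(matvec(w["hz"], h_prev), matvec(w["tz"], x)))  # update gate
    r = sigmoid(add(matvec(w["hr"], h_prev), matvec(w["tr"], x)))  # reset gate
    gated = [ri * hi for ri, hi in zip(r, h_prev)]                 # r_t (x) h_{t-1}
    h_cand = [math.tanh(a) for a in add(matvec(w["hh"], gated), matvec(w["th"], x))]
    # Blend the previous state and the proposed state with the update gate.
    return [(1.0 - zi) * hp + zi * hc for zi, hp, hc in zip(z, h_prev, h_cand)]

# Two hidden units, one input feature, all-zero weights (illustrative only).
weights = {k: [[0.0, 0.0], [0.0, 0.0]] for k in ("hz", "hr", "hh")}
weights.update({k: [[0.0], [0.0]] for k in ("tz", "tr", "th")})
print(gru_step([0.4, -0.2], [1.0], weights))
```

With all-zero weights both gates equal 0.5 and the proposed state is zero, so the new hidden state is half of the previous one, which is an easy case to verify by hand against Equation (3).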

Both encoder and decoder are RNNs and they are jointly trained to predict the next value of a target sequence given a source sequence. In particular, two loss functions were used to achieve a multitask objective. $\bar{ABPM}$ and $\hat{ABPM}$ points were decomposed by their values $\bar{ABPM}^{v}$ and $\hat{ABPM}^{v}$, respectively, and classes $\bar{ABPM}^{c}$ and $\hat{ABPM}^{c}$, respectively.

2.3.2. Model Inputs

Model inputs were defined as random 5-s signal windows. PPG and PPG′ were scaled on-the-fly, independently, to the range [0, 1]. $\bar{ABPM}^{v}$ was also scaled to [0, 1], but using the global minimum and maximum values of the $\bar{ABPM}$ dataset. Because the $\bar{ABPM}$ have different durations, a homogenization step was performed. The fixed length was obtained by adding 0.12 s (15 time-steps) to the duration of the longest $\bar{ABPM}$, and was thus set to 1.312 s (164 time-steps). Each $\bar{ABPM}$ was repeated until the fixed length was reached, as shown in Figure 5. All repeated points were assigned a new class called [ended] ($C_{[ED]}$), thus increasing the number of classes to 4.

To accelerate the training, as the objective was to predict only one $\hat{ABPM}$, a mask vector of ones and zeros was created. The $\hat{ABPM}^{v}$ error was masked with it so that only the nonrepeated part of $\bar{ABPM}^{v}$, plus an additional 0.12 s (15 time-steps), was penalized. This mask is used in Section 2.3.4. An example of the limit of the mask is shown in Figure 5 with a vertical magenta dotted line. Nevertheless, $\hat{ABPM}^{c}$ was penalized over the whole fixed-size target signal to force the prediction of the $C_{[ED]}$ class.
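The homogenization and masking steps of this subsection can be sketched as follows. The sketch assumes that "repeated" means cyclic repetition of the pulse from its onset, and the integer label used for the [ended] class is a hypothetical choice; the fixed length and the 15 extra time-steps are the values stated above.

```python
FIXED_LEN = 164   # 1.312 s at 125 Hz
EXTRA = 15        # 0.12 s of repeated samples still penalized by the value mask
C_ED = 3          # hypothetical integer label for the [ended] class

def homogenize(values, classes):
    """Tile one pulse up to FIXED_LEN, label repeats as C_ED, and build the mask."""
    n = len(values)
    tiled_values = [values[i % n] for i in range(FIXED_LEN)]
    tiled_classes = [classes[i] if i < n else C_ED for i in range(FIXED_LEN)]
    mask = [1 if i < n + EXTRA else 0 for i in range(FIXED_LEN)]
    return tiled_values, tiled_classes, mask

# A made-up 100-sample pulse with three cardiac-cycle classes (0, 1, 2).
pulse_values = [float(i) for i in range(100)]
pulse_classes = [0] * 30 + [1] * 30 + [2] * 40
values, classes, mask = homogenize(pulse_values, pulse_classes)
print(len(values), sum(mask), classes[99], classes[100])
```

For this 100-sample pulse the value mask covers 115 time-steps, while the class target spans all 164 time-steps so that the [ended] class is also learned.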

2.3.3. Model Architecture

Figure 6 shows the model architecture. It consists of three main parts: encoder, decoder, and attention modules. The encoder consists of three bidirectional GRU (Bi-GRU) layers, while the decoder consists of three GRU layers and two multiperceptron layers (MPL) ($MPL^{v}$ and $MPL^{c}$, respectively). Both encoder and decoder have dense connections [39] to improve the information flow between layers (blue arrows, Figure 6). The input $X_{l}$, with $l \in [1, L]$, is the aforementioned 5-s PPG and PPG′ input signal. All the encoder outputs ($\bar{h}_{s}$) go to the attention module. In addition, the last hidden state ($h_{s_{L}}$) of each encoder GRU layer is used to initialize the hidden state of the corresponding decoder GRU layer (red arrows, Figure 6). The output of the last decoder GRU layer ($h_{t_{i}}$) is sent to both the attention module and the $MPL^{c}$ layer. The context vector ($c_{i}$) and $h_{t_{i}}$ are concatenated and passed to $MPL^{v}$. Finally, the $MPL^{v}$ and $MPL^{c}$ outputs are concatenated (orange arrows, Figure 6) to produce a prediction time-step ($y_{i}$). The age and gender demographic information vector ($X_{DI}$) is concatenated at each time-step with $y_{i}$ to form the decoder inputs ($y_{i \& DI}$). In particular, $y_{0}$ is a vector of ones used to indicate the start of a prediction.

In detail, the attention mechanism used in this work is the Luong attention [38], where $c_{i}$ is the weighted sum of the encoder outputs $h_{s}$, with weights given by an attention weight vector ($a_{i}$) computed from $h_{t_{i}}$:

$$c_{i} = \sum_{s} a_{i}(s)\, h_{s} \qquad (4)$$

where $a_{i}$ is computed and normalized using the softmax function:

$$a_{i}(s) = \frac{\exp(score(h_{t_{i}}, h_{s}))}{\sum_{s'} \exp(score(h_{t_{i}}, h_{s'}))} \qquad (5)$$

where $h_{s}$ is each encoder output and $score(h_{t_{i}}, h_{s})$ is the general context-based function:

$$score(h_{t_{i}}, h_{s}) = h_{t_{i}}^{\top} W h_{s} \qquad (6)$$

in which $W$ is also the weight matrix of an MPL.
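The attention step can be sketched in plain Python as below, following the general form of Luong attention [38]: a bilinear score between the decoder state and each encoder output, a softmax over the encoder positions, and a context vector formed as the weighted sum of the encoder outputs. The function name is hypothetical and the matrices are toy values.

```python
import math

def luong_context(h_t, encoder_outputs, w):
    """Return (context, weights) using the general score h_t^T W h_s."""
    # W h_s for every encoder output, then the dot product with h_t.
    w_hs = [[sum(a * b for a, b in zip(row, h_s)) for row in w] for h_s in encoder_outputs]
    scores = [sum(a * b for a, b in zip(h_t, v)) for v in w_hs]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(encoder_outputs[0])
    context = [sum(a * h_s[d] for a, h_s in zip(weights, encoder_outputs)) for d in range(dim)]
    return context, weights

identity = [[1.0, 0.0], [0.0, 1.0]]
context, weights = luong_context([1.0, 0.0], [[2.0, 0.0], [0.0, 2.0]], identity)
print(weights, context)
```

In this toy call the first encoder output is aligned with the decoder state, so it receives the larger attention weight and dominates the context vector.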

2.3.4. Loss Functions

As mentioned before, the model was trained to produce $\hat{ABPM}^{v}$ and $\hat{ABPM}^{c}$. The difference between $\hat{ABPM}^{v}$ and $\bar{ABPM}^{v}$ was penalized using the mean squared error (MSE) function:

$$MSE = \frac{1}{N} \sum_{j=1}^{N} \frac{1}{M} \sum_{i=1}^{M} \left(\bar{ABPM}^{v}_{ji} - \hat{ABPM}^{v}_{ji}\right)^{2} \qquad (7)$$

while the difference between $\hat{ABPM}^{c}$ and $\bar{ABPM}^{c}$ was penalized with the categorical cross-entropy (CE) function [40]:

$$CE = -\frac{1}{N} \frac{1}{T} \sum_{j=1}^{N} \sum_{i=1}^{T} \bar{ABPM}^{c}_{ji} \log\left(\hat{ABPM}^{c}_{ji}\right) \qquad (8)$$

where, for Equations (7) and (8), N, M, and T correspond, respectively, to the number of samples, the mask length previously mentioned, and the fixed input length. Finally, the training loss function was defined as:

$$Loss_{train} = MSE + \lambda\, CE \qquad (9)$$

in which $\lambda$ was a constant empirically set to 0.01.
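For a single sample, the combined objective can be sketched as below: a masked mean squared error on the values plus a weighted cross-entropy on the classes. The function name and argument layout are hypothetical, the cross-entropy uses the usual negative log-likelihood convention with integer class labels, and averaging over a batch is left out for brevity.

```python
import math

def training_loss(v_true, v_pred, c_true, c_prob, mask, lam=0.01):
    """Masked MSE on values plus lam * cross-entropy over all class points."""
    m = sum(mask)  # number of penalized value points
    mse = sum(k * (a - b) ** 2 for a, b, k in zip(v_true, v_pred, mask)) / m
    # c_true holds integer labels; c_prob holds one probability vector per point.
    ce = -sum(math.log(p[c]) for c, p in zip(c_true, c_prob)) / len(c_true)
    return mse + lam * ce

loss = training_loss(
    v_true=[0.0, 1.0, 1.0],
    v_pred=[0.0, 0.0, 5.0],          # the last point is masked out
    c_true=[0, 1, 2],
    c_prob=[[0.5, 0.5, 0.0, 0.0], [0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]],
    mask=[1, 1, 0],
)
print(loss)
```

The large error at the masked third point does not contribute to the value term, while the class term is still evaluated over every time-step.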

2.4. Hyperparameters and Experimental Settings

The encoder Bi-GRU layers have 4, 20, and 100 units, respectively. Similarly, the decoder GRU layers have 8, 40, and 200 units. In addition, the $MPL^{v}$ and $MPL^{c}$ layers have 1 and 4 units, respectively, with ELU [41] activation functions. The $MPL^{v}$ and $MPL^{c}$ outputs were assigned to $\hat{ABPM}^{v}$ and $\hat{ABPM}^{c}$, respectively. In particular, the $MPL^{c}$ output was first normalized with a softmax function. The Adam optimizer [42] was chosen to update the model parameters, with a learning rate (LR) of $10^{-3}$. The LR was scaled by 50% after a patience of 25 epochs without improvement in the loss, and training was stopped when the patience reached 50 epochs. The batch size was set to 48.
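One possible reading of the learning-rate and stopping rule is sketched below by replaying a loss history. The text leaves some details open, so this is an assumption-laden illustration: the function name is hypothetical, a single counter of epochs without improvement drives both the halving (every 25 stale epochs) and the stop (at 50 stale epochs), and any strictly lower loss counts as an improvement.

```python
def run_schedule(losses, lr=1e-3, reduce_after=25, stop_after=50, factor=0.5):
    """Replay a loss history and return (epochs_run, final_lr)."""
    best, stale = float("inf"), 0
    for epoch, loss in enumerate(losses, start=1):
        if loss < best:
            best, stale = loss, 0          # improvement resets the patience counter
        else:
            stale += 1
            if stale == stop_after:        # early stopping
                return epoch, lr
            if stale % reduce_after == 0:  # halve the learning rate
                lr *= factor
    return len(losses), lr

# One good epoch followed by a long plateau.
print(run_schedule([1.0] + [2.0] * 100))
```

With this history the learning rate is halved once, at epoch 26, and training stops at epoch 51.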

The weights of the $MPL^{v}$, $MPL^{c}$, and attention layers are initialized from $U(-\sqrt{w}, \sqrt{w})$, and the weights of the GRU layers are initialized from $U(-\sqrt{k}, \sqrt{k})$, where

$$w = \frac{1}{\#\,\text{layer input size}}, \qquad k = \frac{1}{\#\,\text{GRU units}} \qquad (10)$$

In particular, for the weights corresponding to the transition matrices of the GRU layers ($W_{hz}$, $W_{hr}$, $W_{hh}$, from Equation (3)), a random orthogonal initialization scheme was selected [43].

Three scenarios were proposed to evaluate the impact of $X_{DI}$ and of the split of segments by subject. For the first and second scenarios, mixing segments from the same subject between the train and test sets was not allowed ($Mix_{no}$). In the first scenario, $X_{DI}$ was not provided. In the second scenario, the $X_{DI}$ information was added. Finally, in the third scenario, the train and test sets were also allowed to contain segments from the same subjects ($Mix_{yes}$). The scenarios were named $Mix_{no}$, $Mix_{no} + DI$, and $Mix_{yes} + DI$, respectively. For scenarios $Mix_{no}$ and $Mix_{no} + DI$, the test set was formed by the segments corresponding to 20% of the subjects. For scenario $Mix_{yes} + DI$, the test set was formed by 20% of the segments, independently of the subjects. Each scenario was cross-validated 5 times.
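The difference between the two splitting strategies can be sketched as follows. The function names, the random seed, and the toy subject IDs are hypothetical; the sketch shows a single 80/20 split rather than the full 5-fold cross-validation.

```python
import random

def split_by_subject(segment_subjects, test_fraction=0.2, seed=0):
    """Mix_no: hold out whole subjects, so no subject appears in both sets."""
    subjects = sorted(set(segment_subjects))
    random.Random(seed).shuffle(subjects)
    held_out = set(subjects[: round(test_fraction * len(subjects))])
    test = [i for i, s in enumerate(segment_subjects) if s in held_out]
    train = [i for i, s in enumerate(segment_subjects) if s not in held_out]
    return train, test

def split_by_segment(n_segments, test_fraction=0.2, seed=0):
    """Mix_yes: hold out segments regardless of the subject they belong to."""
    order = list(range(n_segments))
    random.Random(seed).shuffle(order)
    cut = round(test_fraction * n_segments)
    return sorted(order[cut:]), sorted(order[:cut])

# Ten made-up subjects with three segments each.
subject_of_segment = [s for s in range(10) for _ in range(3)]
train_idx, test_idx = split_by_subject(subject_of_segment)
print(len(train_idx), len(test_idx))
```

Splitting by subject guarantees that the test subjects are unseen during training, which is what distinguishes the $Mix_{no}$ scenarios from $Mix_{yes} + DI$.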

2.5. Evaluation

Firstly, to evaluate the cuff-less calibration with respect to the real ABP, $\bar{ABPM}^{v}$ and $\hat{ABPM}^{v}$ were restored to the global minimum and maximum ABP scale; then, using $\hat{ABPM}^{v}$, the DBP, DN, and SBP were computed ($\hat{ABPM}^{DBP}$, $\hat{ABPM}^{DN}$, and $\hat{ABPM}^{SBP}$, respectively). $\hat{ABPM}^{DBP}$ was computed as the mean of the first and last values of $\hat{ABPM}^{v}$. $\hat{ABPM}^{DN}$ and $\hat{ABPM}^{SBP}$ were taken as the last occurrence of the class $C_{[SP, DN]}$ in $\hat{ABPM}^{c}$ and the maximum value in $\hat{ABPM}^{v}$, respectively. The evaluation metrics for $\hat{ABPM}^{DBP}$, $\hat{ABPM}^{DN}$, and $\hat{ABPM}^{SBP}$ were the root mean squared error ($RMSE$), the mean absolute error ($MAE$), the standard deviation of the errors (STD), and the coefficient of determination ($R^{2}$):

$$RMSE = \sqrt{MSE} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (z_{i} - \hat{z}_{i})^{2}} \qquad (11)$$

$$MAE = \frac{1}{N} \sum_{i=1}^{N} |z_{i} - \hat{z}_{i}| \qquad (12)$$

$$R^{2} = 1 - \frac{\sum_{i=1}^{N} (z_{i} - \hat{z}_{i})^{2}}{\sum_{i=1}^{N} (z_{i} - \bar{z})^{2}} \qquad (13)$$

where $\bar{z}$ is the mean of $z_{i}$ and $\hat{z}_{i}$ is the estimated value. Secondly, the DN time occurrence ($DN^{TO}$) and the pulse duration were also evaluated with the $RMSE$, $MAE$, and $R^{2}$ metrics. $DN^{TO}$ and the pulse duration were computed as the last and first occurrences of classes $C_{[DN, E]}$ and $C_{[ED]}$, respectively, in $\hat{ABPM}^{c}$. Finally, the $\hat{ABPM}$ pulse values were evaluated with $RMSE$ and $MAE$, while the $\hat{ABPM}$ pulse waveforms were evaluated with Pearson’s correlation coefficient (R). When $\bar{ABPM}$ and $\hat{ABPM}$ had different durations, the shorter one was considered for the evaluation. R is defined as:

$$R = \frac{\sum_{i=1}^{T} (x_{i} - \bar{x})(y_{i} - \bar{y})}{\sqrt{\sum_{i=1}^{T} (x_{i} - \bar{x})^{2} \sum_{i=1}^{T} (y_{i} - \bar{y})^{2}}} \qquad (14)$$

where $x$ and $y$ correspond to $\bar{ABPM}$ and $\hat{ABPM}$, $\bar{x}$ and $\bar{y}$ are their means, and $T$ is the considered duration.
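The evaluation metrics of this subsection can be sketched in a few lines of plain Python, using the standard squared-sum forms of the coefficient of determination and of Pearson's R. The function names are hypothetical, and truncating both pulses to the shorter one mirrors the rule stated above.

```python
import math

def rmse(z, z_hat):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(z, z_hat)) / len(z))

def mae(z, z_hat):
    return sum(abs(a - b) for a, b in zip(z, z_hat)) / len(z)

def r_squared(z, z_hat):
    z_bar = sum(z) / len(z)
    ss_res = sum((a - b) ** 2 for a, b in zip(z, z_hat))  # residual sum of squares
    ss_tot = sum((a - z_bar) ** 2 for a in z)             # total sum of squares
    return 1.0 - ss_res / ss_tot

def pearson_r(x, y):
    t = min(len(x), len(y))  # compare over the shorter duration
    x, y = x[:t], y[:t]
    x_bar, y_bar = sum(x) / t, sum(y) / t
    cov = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y))
    var_x = sum((a - x_bar) ** 2 for a in x)
    var_y = sum((b - y_bar) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

reference = [1.0, 2.0, 3.0, 4.0]
estimate = [1.0, 2.0, 3.0, 6.0]
print(rmse(reference, estimate), mae(reference, estimate), r_squared(reference, estimate))
```

Note that Pearson's R is insensitive to offset and scale, which is why the waveform shape is assessed with R while the calibrated values are assessed with RMSE and MAE.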

3. Results

Figure 7 shows an input segment, the attention weights, and an output test example from the $Mix_{no} + DI$ scenario. The upper left plot compares $\bar{ABPM}^{v}$ with $\hat{ABPM}$. Red, green, and blue points represent $\hat{ABPM}^{c}$, while the black line is $\bar{ABPM}^{v}$. In the lower left plot, the intensity of the heat map points indicates the level of attention applied by the model to the input (lower right plot) to produce each $\hat{ABPM}$ point. In addition, Figure 8 shows other test samples for which very different morphologies were predicted. This is important to corroborate that the model did not learn a global average morphology.

Table 1 shows the results obtained for $\hat{ABPM}^{DBP}$, $\hat{ABPM}^{DN}$, and $\hat{ABPM}^{SBP}$. In particular, the assessment of $\hat{ABPM}^{DBP}$ and $\hat{ABPM}^{SBP}$ corresponds to a cuff-less calibration task. In ascending order of performance, the scenarios were $Mix_{no}$, $Mix_{no} + DI$, and $Mix_{yes} + DI$. Regarding the assessment of time occurrences in Table 2, there was no clear difference between the $Mix_{no}$ and $Mix_{no} + DI$ scenarios. Although the mean of the metrics was slightly better for the $Mix_{no}$ scenario, they also showed a larger standard deviation. In contrast, the $Mix_{yes} + DI$ scenario showed better results. Regarding the evaluation of the waveforms and values of $\hat{ABPM}$ presented in Table 3, there was again a clear improvement in performance for the $Mix_{yes} + DI$ scenario, followed by the $Mix_{no} + DI$ and $Mix_{no}$ scenarios.

Table 4 shows the results with respect to the British Hypertension Society (BHS) standards [44], using the predictions of each fold per scenario. The BHS defines thresholds (i.e., 5, 10, and 15 mmHg) to report the cumulative error percentage and determine the grade of a device when BP is measured. DBP estimation in the $Mix_{yes} + DI$ scenario achieves grade B, falling 3.4% short in the <5 mmHg range of achieving grade A. The $Mix_{no}$ and $Mix_{no} + DI$ scenarios achieve grade C, lacking 8.2% and 4.5%, respectively, in the <5 mmHg range to achieve grade B.
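The BHS grading rule can be sketched as below. The cumulative percentage requirements used here are the usual BHS protocol values (grade A: 60/85/95%, grade B: 50/75/90%, grade C: 40/65/85% of absolute errors within 5/10/15 mmHg); the function name is hypothetical, the errors are made-up numbers, and counting errors equal to a threshold as within it is a convention chosen here.

```python
BHS_GRADES = (("A", (60, 85, 95)), ("B", (50, 75, 90)), ("C", (40, 65, 85)))

def bhs_grade(errors):
    """Return (grade, cumulative percentages within 5, 10 and 15 mmHg)."""
    n = len(errors)
    pct = tuple(100.0 * sum(abs(e) <= t for e in errors) / n for t in (5, 10, 15))
    for grade, required in BHS_GRADES:
        # A grade is reached only if all three cumulative percentages are met.
        if all(p >= q for p, q in zip(pct, required)):
            return grade, pct
    return "D", pct

# Made-up errors in mmHg: 55% within 5, 80% within 10, 95% within 15.
errors = [1.0] * 55 + [8.0] * 25 + [12.0] * 15 + [20.0] * 5
print(bhs_grade(errors))
```

In this toy case the 10 and 15 mmHg percentages come close to or meet grade A, but the 5 mmHg percentage is 5% short, so the result is grade B.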

Bland–Altman plots were produced using the SBP and DBP predictions of each of the 5 folds per scenario. The Bland–Altman results are shown in Table 5 in terms of the mean ($\mu$) and the limits of agreement ($\mu \pm 1.96\,\sigma$). In particular, Figure 9 shows regression plots, Bland–Altman plots, and histograms of errors corresponding to the $Mix_{no} + DI$ scenario.
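The Bland–Altman summary values can be sketched as follows. The function name is hypothetical, the sign convention (estimate minus reference) and the use of the sample standard deviation are assumptions made here, and the pressures are made-up numbers.

```python
from statistics import mean, stdev

def bland_altman(reference, estimate):
    """Return (bias, lower limit, upper limit) with limits at bias +/- 1.96 * SD."""
    diffs = [e - r for r, e in zip(reference, estimate)]
    bias, sd = mean(diffs), stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

reference_sbp = [100.0, 110.0, 120.0, 130.0]   # made-up values in mmHg
estimated_sbp = [102.0, 108.0, 123.0, 129.0]
print(bland_altman(reference_sbp, estimated_sbp))
```

The returned bias corresponds to $\mu$ and the two limits to $\mu \pm 1.96\,\sigma$, the quantities reported in a Bland–Altman analysis.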

4. Discussion

In the present work, the $\bar{ABPM}$ was estimated by combining the PPG time series and the DI of each subject. Firstly, the $\bar{ABPM}$ signal was computed and paired with its corresponding PPG signal and DI. Secondly, a model with a sequence-to-sequence architecture and attention mechanism was proposed to transfer the information from the optical domain to the pressure domain. The results show the capacity of the proposed method to simultaneously estimate both the morphology and the calibration values of the ABP signal.

To the best of our knowledge, a distinction is made in the literature between calibration methods. Depending on whether or not data from the same subject are used for training and testing, they are called calibration-based (cal-based) or calibration-free (cal-free), respectively. Hereafter, the $Mix_{yes} + DI$ and $Mix_{no} + DI$ scenarios are referred to as cal-based and cal-free, respectively. Table 6 presents a comparison, in calibration terms, with other studies. Nevertheless, because of the different evaluation metrics, dataset sizes, and signal sources, the comparison is neither easy nor direct. The studies that reported the lowest errors are those with the fewest subjects and in which the restriction on using a subject’s data in both the training and test sets was not explicit or was not applied. In particular, in Chan et al. [18], the mean error (ME) was used as a metric and the dataset was unspecified. In Kurylyak et al. [20], although only the PPG signal was used, the dataset consisted of only 15,000 beats and no information about the number of subjects was given. In Chowdhury et al. [21], the dataset consisted of 226 records, with a signal duration of 2.1 s, corresponding to 126 subjects. The methods in Chan et al. [18], Kurylyak et al. [20], and Chowdhury et al. [21] use the feature extraction approach. In contrast, in Eom et al. [24], a deep learning model able to take raw multi-signal inputs was proposed. However, the dataset was composed of only 15 subjects, without restricting the use of data from the same subjects to train and test.

The works that used the largest numbers of subjects were [19,22,23,26] (1000, 410, 572, and 510 subjects, respectively). In Monte-Moreno [22], estimations of SBP and DBP were obtained using only features extracted from the PPG, combined with the age, weight, and body mass index of the subjects. The author did not state any restriction on subject data (cal-based scenario). Assessments were reported in terms of the $R^{2}$ metric, and the results reached grade B under the standards. In contrast, [19,23,26] reported results much more similar to those obtained in the present work and also stated a subject data restriction between the training and test sets. In Ruiz-Rodríguez et al. [23], only a cal-free scenario was reported and errors were given in terms of a Bland–Altman analysis. Limits of agreement for SBP and DBP were [−40.91, 34.94] and [−20.68, 13.38] mmHg, respectively, and mean values were −2.98 and −3.65 mmHg, respectively. Although a large amount of clinical information was available in their database, it was not included in their model when estimating BP values. Kachuee et al. [19] reported the lowest errors in both cal-free and cal-based scenarios. Nevertheless, ECG information was required together with the PPG time series, followed by a feature extraction step to obtain the estimations. In contrast, in Slapničar et al. [26], where leave-one-subject-out experiments were performed, only the raw PPG signal was necessary. The dataset used in [26] was nearly half the size of the one used in this work and, except for the $S B P_{M A E}$ evaluation in the cal-based scenario, the results presented here are better. Additionally, none of the studies [19,22,23,26] reported a limit on the number of records per subject or on the total duration per record. In these terms, we suggest our work is less biased.

Table 7 compares methods focused on the continuous ABP waveform with our results given in Table 3. In Sideris et al. [27], 42 subjects and records from the MIMIC database were analyzed, and each record was composed of two segments. Furthermore, a completely personalized approach was proposed, in which a separate model was created for each subject. In contrast, our approach requires training only a single model. In Sadrawi et al. [28], the proposed DCAE model was trained with 18 subjects from a closed dataset. Additionally, while the DCAE model accepts only a fixed input length, the methodology proposed in the present work does not have that limitation. Although the MAE and RMSE values in Sideris et al. [27] and Sadrawi et al. [28] were lower than those in the present work, the number of subjects evaluated was smaller and there was no subject data restriction between the training and test sets.

It is important to mention that while all PPG signals came from the finger, the specific site of the arterial tree from which the BP signals were recorded is unknown. Future studies could improve the results by specifying the sites of the source and target signals. Furthermore, the devices and filters used during data collection are also unknown for both PPG and ABP signals. Therefore, and given that the drugs administered and any pre-existing pathologies are also unknown, the scenario does not meet the standards of a rigorous medical protocol.

Finally, compared with previous studies found in the literature, the presented architecture allows the use of both raw signals and DI (age and gender) as inputs. An improvement in results can be observed when DI is considered (Table 1 and Table 3). Furthermore, without any modification to the architecture, other characteristics of the subject could be incorporated, such as ethnicity, weight, or height. Although many of them are present in the MIMIC-III CDB, the final number of subjects with this extra information and good-quality records was less than 30% of the total used. Pre-existing conditions such as diabetes, chronic kidney disease, smoking, and dyslipidemia could also be incorporated.

5. Conclusions

In this paper, a new deep learning architecture to estimate the average arterial blood pressure morphology ($\bar{A B P M}$) is proposed. For each point of the $\bar{A B P M}$, the proposed methodology estimates the blood pressure value and classifies it according to the stage of the cardiac cycle to which it belongs. To the best of our knowledge, this is a contribution to the literature because most existing approaches estimate only diastolic and systolic values. The methodology presented here also allows the simultaneous use of subject demographic information and the raw photoplethysmogram signal from the finger as model inputs. Further studies with more specific databases are needed in order to extend the presented results. In addition, the source code is shared to support the reproducibility of the results. Finally, as a potential research direction, this methodology could be adapted to mobile devices, where only one source signal is required.

Supplementary Materials

The source code is available at https://github.com/AguirreNicolas/PPG2IABP and the processed dataset is available at https://doi.org/10.5281/zenodo.4598938.

Author Contributions

Conceptualization, N.A., E.G.-M., and L.J.C.; methodology and software, N.A.; funding acquisition, E.G.-M. and R.L.A.; supervision, E.G.-M. and L.J.C.; writing (original draft preparation), N.A.; writing (review and editing), E.G.-M. and L.J.C.; project administration, R.L.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially funded by Universidad Tecnológica Nacional (Grant: ICUTIBA7647 R&D projects) and by the ML-Cardyn project, cofunded by the European Union. The authors would like to thank Europe for its commitment in Champagne-Ardenne with the European Regional Development Fund (FEDER).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

ABP	Arterial blood pressure
ABPM	Arterial blood pressure morphology
$\bar{A B P M}$	Average arterial blood pressure pulse morphology
$\hat{A B P M}$	Estimated arterial blood pressure pulse morphology
BP	Blood pressure
BCG	Ballistocardiogram
BHS	British Hypertension Society
CDB	Clinical database
CNN	Convolutional neural network
CVDs	Cardiovascular diseases
DCAE	Deep convolutional auto-encoder
DI	Demographic information
DN	Dicrotic notch
$D N^{TO}$	Dicrotic notch time occurrence
GPR	Gaussian process regression
GRU	Gated recurrent unit
LSTM	Long short-term memory
LR	Learning rate
MAE	Mean absolute error
NN	Neural network
R	Pearson’s correlation coefficient
RNN	Recurrent neural network
RMSE	Root mean squared error
$R^{2}$	Coefficient of determination
SQ	Signal quality
STD	Standard deviation of the errors
PAT	Pulse arrival time
PEP	Pre-ejection period
PPG	Photoplethysmography
PPG’	1st derivative of the photoplethysmogram
PPG”	2nd derivative of the photoplethysmogram
PTT	Pulse transit time
PWV	Pulse wave velocity
MWDB	Matched waveform database

References

1. Virani Salim, S.; Alvaro, A.; Benjamin Emelia, J.; Bittencourt Marcio, S.; Callaway Clifton, W.; Carson April, P.; Chamberlain Alanna, M.; Chang Alexander, R.; Susan, C.; Delling Francesca, N.; et al. Heart Disease and Stroke Statistics–2020 Update: A Report From the American Heart Association. Circulation 2020, 141, e139–e596.
2. World Health Organization. Hypertension. Available online: https://www.who.int/news-room/fact-sheets/detail/hypertension (accessed on 20 October 2020).
3. Avolio, A.P.; Butlin, M.; Walsh, A. Arterial blood pressure measurement and pulse wave analysis–Their role in enhancing cardiovascular assessment. Physiol. Meas. 2010, 31, R1–R47.
4. Salvi, P. Pulse Waves: How Vascular Hemodynamics Affects Blood Pressure, 2nd ed.; Springer International Publishing: Cham, Switzerland, 2012.
5. O’Rourke, M.F.; Staessen, J.A.; Vlachopoulos, C.; Duprez, D.; Plante, G.E. Clinical applications of arterial stiffness; definitions and reference values. Am. J. Hypertens. 2002, 15, 426–444.
6. Pauca Alfredo, L.; O’Rourke Michael, F.; Kon Neal, D. Prospective Evaluation of a Method for Estimating Ascending Aortic Pressure From the Radial Artery Pressure Waveform. Hypertension 2001, 38, 932–937.
7. Bryan, W.; Lacy Peter, S.; Thom Simon, M.; Kennedy, C.; Alice, S.; David, C.; Hughes Alun, D.; Thurston, H.; O’Rourke, M. Differential Impact of Blood Pressure-Lowering Drugs on Central Aortic Pressure and Clinical Outcomes. Circulation 2006, 113, 1213–1225.
8. Hashimoto, J.; Imai, Y.; O’Rourke, M.F. Indices of Pulse Wave Analysis Are Better Predictors of Left Ventricular Mass Reduction Than Cuff Pressure. Am. J. Hypertens. 2007, 20, 378–384.
9. Nelson, M.R.; Stepanek, J.; Cevette, M.; Covalciuc, M.; Hurst, R.T.; Tajik, A.J. Noninvasive Measurement of Central Vascular Pressures With Arterial Tonometry: Clinical Revival of the Pulse Pressure Waveform? In Mayo Clinic Proceedings; Elsevier: Amsterdam, The Netherlands, 2010; Volume 85, pp. 460–472.
10. Weber, T.; Auer, J.; O’Rourke, M.F.; Punzengruber, C.; Kvas, E.; Eber, B. Prolonged mechanical systole and increased arterial wave reflections in diastolic dysfunction. Heart 2006, 92, 1616–1622.
11. Noda, A.; Nakata, S.; Fukatsu, H.; Yasuda, Y.; Miyao, E.; Miyata, S.; Yasuma, F.; Murohara, T.; Yokota, M.; Koike, Y. Aortic Pressure Augmentation as a Marker of Cardiovascular Risk in Obstructive Sleep Apnea Syndrome. Hypertens. Res. 2008, 31, 1109–1114.
12. Geddes, L.A.; Voelz, M.H.; Babbs, C.F.; Bourland, J.D.; Tacker, W.A. Pulse Transit Time as an Indicator of Arterial Blood Pressure. Psychophysiology 1981, 18, 71–74.
13. Mukkamala, R.; Hahn, J.; Inan, O.T.; Mestha, L.K.; Kim, C.; Töreyin, H.; Kyal, S. Toward Ubiquitous Blood Pressure Monitoring via Pulse Transit Time: Theory and Practice. IEEE Trans. Biomed. Eng. 2015, 62, 1879–1901.
14. Hughes, D.J.; Babbs, C.F.; Geddes, L.A.; Bourland, J.D. Measurements of Young’s modulus of elasticity of the canine aorta with ultrasound. Ultrason. Imaging 1979, 1, 356–367.
15. Payne, R.A.; Symeonides, C.N.; Webb, D.J.; Maxwell, S.R.J. Pulse transit time measured from the ECG: An unreliable marker of beat-to-beat blood pressure. J. Appl. Physiol. 2006, 100, 136–141.
16. Martínez, G.; Howard, N.; Abbott, D.; Lim, K.; Ward, R.; Elgendi, M. Can Photoplethysmography Replace Arterial Blood Pressure in the Assessment of Blood Pressure? J. Clin. Med. 2018, 7, 316.
17. Elgendi, M.; Fletcher, R.; Liang, Y.; Howard, N.; Lovell, N.H.; Abbott, D.; Lim, K.; Ward, R. The use of photoplethysmography for assessing hypertension. NPJ Digit. Med. 2019, 2, 1–11.
18. Chan, K.; Hung, K.; Zhang, Y. Noninvasive and cuffless measurements of blood pressure for telemedicine. In Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Istanbul, Turkey, 25–28 October 2001; Volume 4, pp. 3592–3593.
19. Kachuee, M.; Kiani, M.M.; Mohammadzade, H.; Shabany, M. Cuffless Blood Pressure Estimation Algorithms for Continuous Health-Care Monitoring. IEEE Trans. Biomed. Eng. 2016, 64, 859–869.
20. Kurylyak, Y.; Lamonaca, F.; Grimaldi, D. A Neural Network-based method for continuous blood pressure estimation from a PPG signal. In Proceedings of the 2013 IEEE International Instrumentation and Measurement Technology Conference (I2MTC), Minneapolis, MN, USA, 6–9 May 2013; pp. 280–283.
21. Chowdhury, M.H.; Shuzan, M.N.I.; Chowdhury, M.E.H.; Mahbub, Z.B.; Uddin, M.M.; Khandakar, A.; Reaz, M.B.I. Estimating Blood Pressure from the Photoplethysmogram Signal and Demographic Features Using Machine Learning Techniques. Sensors 2020, 20, 3127.
22. Monte-Moreno, E. Non-invasive estimate of blood glucose and blood pressure from a photoplethysmograph by means of machine learning techniques. Artif. Intell. Med. 2011, 53, 127–138.
23. Ruiz-Rodríguez, J.C.; Ruiz-Sanmartín, A.; Ribas, V.; Caballero, J.; García-Roche, A.; Riera, J.; Nuvials, X.; de Nadal, M.; de Sola-Morales, O.; Serra, J.; et al. Innovative continuous non-invasive cuffless blood pressure monitoring based on photoplethysmography technology. Intensive Care Med. 2013, 39, 1618–1625.
24. Eom, H.; Lee, D.; Han, S.; Hariyani, Y.S.; Lim, Y.; Sohn, I.; Park, K.; Park, C. End-to-End Deep Learning Architecture for Continuous Blood Pressure Estimation Using Attention Mechanism. Sensors 2020, 20, 2338.
25. Liang, Y.; Chen, Z.; Ward, R.; Elgendi, M. Photoplethysmography and Deep Learning: Enhancing Hypertension Risk Stratification. Biosensors 2018, 8, 101.
26. Slapničar, G.; Mlakar, N.; Luštrek, M. Blood Pressure Estimation from Photoplethysmogram Using a Spectro-Temporal Deep Neural Network. Sensors 2019, 19, 3420.
27. Sideris, C.; Kalantarian, H.; Nemati, E.; Sarrafzadeh, M. Building Continuous Arterial Blood Pressure Prediction Models Using Recurrent Networks. In Proceedings of the 2016 IEEE International Conference on Smart Computing (SMARTCOMP), St. Louis, MO, USA, 18–20 May 2016; pp. 1–5.
28. Sadrawi, M.; Lin, Y.T.; Lin, C.H.; Mathunjwa, B.; Fan, S.Z.; Abbod, M.F.; Shieh, J.S. Genetic Deep Convolutional Autoencoder Applied for Generative Continuous Arterial Blood Pressure via Photoplethysmography. Sensors 2020, 20, 3829.
29. Hosanee, M.; Chan, G.; Welykholowa, K.; Cooper, R.; Kyriacou, P.A.; Zheng, D.; Allen, J.; Abbott, D.; Menon, C.; Lovell, N.H.; et al. Cuffless Single-Site Photoplethysmography for Blood Pressure Monitoring. J. Clin. Med. 2020, 9, 723.
30. El-Hajj, C.; Kyriacou, P.A. A review of machine learning techniques in photoplethysmography for the non-invasive cuff-less measurement of blood pressure. Biomed. Signal Process. Control 2020, 58, 101870.
31. Johnson, A.E.W.; Pollard, T.J.; Shen, L.; Lehman, L.W.H.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Anthony Celi, L.; Mark, R.G. MIMIC-III, a freely accessible critical care database. Sci. Data 2016, 3, 160035.
32. Goldberger Ary, L.; Amaral Luis, A.N.; Leon, G.; Hausdorff Jeffrey, M.; Ivanov Plamen, C.; Mark Roger, G.; Mietus Joseph, E.; Moody George, B.; Chung-Kang, P.; Eugene, S.H. PhysioBank, PhysioToolkit, and PhysioNet. Circulation 2000, 101, e215–e220.
33. Elgendi, M. Optimal Signal Quality Index for Photoplethysmogram Signals. Bioengineering 2016, 3, 21.
34. Li, B.N.; Dong, M.C.; Vai, M.I. On an automatic delineator for arterial blood pressure waveforms. Biomed. Signal Process. Control 2010, 5, 76–81.
35. Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639.
36. Cho, K.; van Merrienboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv 2014, arXiv:1406.1078.
37. Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv 2014, arXiv:1409.0473.
38. Luong, M.T.; Pham, H.; Manning, C.D. Effective Approaches to Attention-based Neural Machine Translation. arXiv 2015, arXiv:1508.04025.
39. Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. arXiv 2017, arXiv:1608.06993.
40. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016.
41. Clevert, D.A.; Unterthiner, T.; Hochreiter, S. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs). arXiv 2015, arXiv:1511.07289.
42. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980.
43. Saxe, A.M.; McClelland, J.L.; Ganguli, S. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. arXiv 2013, arXiv:1312.6120.
44. O’Brien, E.; Petrie, J.; Littler, W.; de Swiet, M.; Padfield, P.L.; O’Malley, K.; Jamieson, M.; Altman, D.; Bland, M.; Atkins, N. The British Hypertension Society protocol for the evaluation of automated and semi-automated blood pressure measuring devices with special reference to ambulatory systems. J. Hypertens. 1990, 8, 607–619.
45. Liang, Y.; Elgendi, M.; Chen, Z.; Ward, R. An optimal filter for short photoplethysmogram signals. Sci. Data 2018, 5, 180076.

Figure 1. Block diagram of the proposed methodology.

Figure 2. (a) Summarized processing stage. (b) In blue, two 15-s segments analyzed with a 5 min gap between each other, in black. (c) An example of a computed $\bar{A B P M}$. Red, green, and blue lines correspond to the classes [onset - systolic peak], [systolic peak - dicrotic notch], and [dicrotic notch - end], respectively, while gray lines represent ABP pulses and dashed magenta lines represent the limits to consider the average pulse.

Figure 3. Selected dataset distributions: (a) Number of subjects with their corresponding quantity of segments (gray bars) and segments cumulative percentage (red line). (b) Age and gender. (c) Systolic blood pressure (SBP) and diastolic blood pressure (DBP).

Figure 4. (a) Recurrent NN (RNN) Encoder-Decoder architecture. (b) Structure of a gated recurrent unit (GRU).

Figure 5. Fixed-size target signal composed of a complete $\bar{A B P M}$ and its repetition. Red, green, blue, and black lines correspond to classes [onset - systolic peak], [systolic peak - dicrotic notch], [dicrotic notch - end], and [Ended], respectively. The vertical magenta dotted line represents the end of the error mask.

Figure 6. Model architecture.

Figure 7. Relationship between photoplethysmogram signals (PPG) and PPG input signals and $\hat{A B P M}$ via attention weight’s heatmap. Additionally, ${\bar{A B P M}}^{v}$ is shown in comparison with $\hat{A B P M}$.

Figure 8. Comparison between different ${\bar{A B P M}}^{v}$ and $\hat{A B P M}$ examples.

Figure 9. Regression plots (a,d), Bland–Altman plots (b,e), and histograms of errors (c,f), corresponding to the $M i x_{n o} + D I$ scenario and DBP and SBP values.

Table 1. Mean and standard deviation of the metrics used to evaluate the diastolic (DBP), dicrotic notch (DN), and systolic blood pressure (SBP) errors.

Marker	Scenario	$R^{2}$	$RMSE$	$MAE$	$STD$
DBP	$M i x_{n o}$	0.10 ± 0.03	8.88 ± 0.27	7.01 ± 0.23	8.84 ± 0.25
	$M i x_{n o} + D I$	0.19 ± 0.04	8.47 ± 0.29	6.57 ± 0.20	8.43 ± 0.29
	$M i x_{y e s} + D I$	0.41 ± 0.04	7.40 ± 0.20	5.56 ± 0.18	7.32 ± 0.17
DN	$M i x_{n o}$	0.29 ± 0.02	11.23 ± 0.44	8.72 ± 0.31	11.15 ± 0.38
	$M i x_{n o} + D I$	0.32 ± 0.04	10.95 ± 0.27	8.54 ± 0.37	10.84 ± 0.26
	$M i x_{y e s} + D I$	0.50 ± 0.02	9.67 ± 0.17	7.08 ± 0.19	9.63 ± 0.15
SBP	$M i x_{n o}$	0.17 ± 0.04	18.20 ± 0.52	14.55 ± 0.56	18.04 ± 0.54
	$M i x_{n o} + D I$	0.19 ± 0.05	18.07 ± 0.60	14.39 ± 0.42	17.87 ± 0.40
	$M i x_{y e s} + D I$	0.39 ± 0.05	15.96 ± 0.60	12.08 ± 0.36	15.67 ± 0.50

Root mean squared error (RMSE), mean absolute errors (MAE), and standard deviation of the errors (STD) in mmHg.
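For reference, the four metrics reported in Table 1 can be computed as in the following minimal sketch. It is not taken from the authors' code: the function name, the sign convention (estimate minus reference), the use of the population standard deviation, and the toy pressure values are all assumptions made for illustration.

```python
import math


def error_metrics(y_true, y_pred):
    """Return R2, RMSE, MAE and STD of the errors for paired values."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mean_err = sum(errors) / n
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    # Population standard deviation of the errors (assumed convention).
    std = math.sqrt(sum((e - mean_err) ** 2 for e in errors) / n)
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot
    return {"R2": r2, "RMSE": rmse, "MAE": mae, "STD": std}


# Hypothetical reference and estimated pressures in mmHg.
reference = [70.0, 80.0, 90.0, 100.0]
estimate = [72.0, 79.0, 93.0, 98.0]
metrics = error_metrics(reference, estimate)
```

With this convention, RMSE² equals STD² plus the squared mean error, so RMSE and STD are close whenever the bias is small.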

Table 2. Mean and standard deviation of the metrics used to evaluate the errors in the dicrotic notch time occurrence ($D N^{TO}$) and the pulse duration from the estimated mean arterial blood pressure pulse morphology ($\hat{A B P M}$).

	$D N^{TO}$			Pulse Duration
Scenario	$R^{2}$	$RMSE$	$MAE$	$R^{2}$	$RMSE$	$MAE$
$M i x_{n o}$	0.55 ± 0.10	33 ± 3	24 ± 2	0.97 ± 0.02	22 ± 8	15 ± 9
$M i x_{n o} + D I$	0.54 ± 0.05	35 ± 3	25 ± 1	0.97 ± 0.01	24 ± 5	16 ± 4
$M i x_{y e s} + D I$	0.61 ± 0.02	33 ± 1	23 ± 1	0.98 ± 0.01	18 ± 2	11 ± 1

RMSE and MAE in ms.

Table 3. Mean and standard deviation of the metrics used to evaluate the waveform and value errors for each estimated arterial blood pressure pulse morphology ($\hat{A B P M}$).

	$\hat{A B P M}$
Scenario	$R$	$RMSE$	$MAE$
$M i x_{n o}$	0.98 ± 0.002	10.39 ± 0.11	9.06 ± 0.09
$M i x_{n o} + D I$	0.98 ± 0.001	10.26 ± 0.11	8.89 ± 0.10
$M i x_{y e s} + D I$	0.98 ± 0.001	8.65 ± 0.20	7.37 ± 0.21

RMSE and MAE in mmHg.

Table 4. Comparison with the British Hypertension Society (BHS) Standard.

		Cumulative Error Percentage
Scenario	Marker	$< 5$ mmHg	$< 10$ mmHg	$< 15$ mmHg
$M i x_{n o}$	DBP	41.8%	76.6%	92.9%
$M i x_{n o}$	SBP	21.6%	42.0%	59.3%
$M i x_{n o} + D I$	DBP	45.5%	80.2%	93.5%
$M i x_{n o} + D I$	SBP	21.3%	41.9%	58.6%
$M i x_{y e s} + D I$	DBP	56.6%	86.0%	95.5%
$M i x_{y e s} + D I$	SBP	29.6%	53.2%	70.3%
BHS [44]	Grade A	60%	85%	95%
	Grade B	50%	75%	90%
	Grade C	40%	65%	85%
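The grading in Table 4 can be reproduced with a short sketch such as the one below. It is illustrative only: the function names are hypothetical, the rule that a grade requires all three cumulative percentages to reach the tabulated thresholds is our reading of the BHS table, and the label "D" for results below grade C is an assumption.

```python
# Minimum cumulative percentages (< 5, < 10, < 15 mmHg) taken from Table 4.
BHS_THRESHOLDS = {
    "A": (60.0, 85.0, 95.0),
    "B": (50.0, 75.0, 90.0),
    "C": (40.0, 65.0, 85.0),
}


def cumulative_percentages(abs_errors, limits=(5.0, 10.0, 15.0)):
    """Percentage of absolute errors strictly below each limit (mmHg)."""
    n = len(abs_errors)
    return tuple(100.0 * sum(e < lim for e in abs_errors) / n for lim in limits)


def bhs_grade(percentages):
    """Best grade whose three thresholds are all reached, else 'D'."""
    for grade in ("A", "B", "C"):
        thresholds = BHS_THRESHOLDS[grade]
        if all(p >= t for p, t in zip(percentages, thresholds)):
            return grade
    return "D"


# DBP row of the Mix_yes + DI scenario in Table 4.
dbp_cal_based_grade = bhs_grade((56.6, 86.0, 95.5))
```

Under these assumptions, the DBP rows of Table 4 map to grade C for both $M i x_{n o}$ scenarios and grade B for $M i x_{y e s} + D I$, while every SBP row falls below grade C.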

Table 5. Limits of agreement ($\mu \pm 1.96 \sigma$) and means ($\mu$) for Bland–Altman plots.

	Scenario	Limits	Mean
DBP	$M i x_{n o}$	[−17.80, 16.77]	−0.52
	$M i x_{n o} + D I$	[−16.87, 15.90]	−0.49
	$M i x_{y e s} + D I$	[−14.23, 14.48]	0.13
SBP	$M i x_{n o}$	[−32.85, 37.27]	2.21
	$M i x_{n o} + D I$	[−34.36, 35.60]	0.62
	$M i x_{y e s} + D I$	[−28.78, 32.69]	1.95

Limits and Mean in mmHg.
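The quantities in Table 5 follow directly from the paired differences, as the following minimal sketch shows. The function name, the sign convention (estimate minus reference), the use of the sample standard deviation, and the toy values are assumptions and are not taken from the authors' code.

```python
import math


def bland_altman(reference, estimate):
    """Return the mean difference and the limits mu +/- 1.96 sigma."""
    diffs = [e - r for r, e in zip(reference, estimate)]
    n = len(diffs)
    mu = sum(diffs) / n
    # Sample standard deviation of the differences (assumed convention).
    sigma = math.sqrt(sum((d - mu) ** 2 for d in diffs) / (n - 1))
    return mu, (mu - 1.96 * sigma, mu + 1.96 * sigma)


# Hypothetical reference and estimated pressures in mmHg.
reference = [70.0, 80.0, 90.0, 100.0]
estimate = [72.0, 79.0, 93.0, 98.0]
mean_diff, (lower, upper) = bland_altman(reference, estimate)
```

By construction the limits are symmetric about the mean difference, which is consistent with the rows of Table 5 (for example, the midpoint of [−16.87, 15.90] is approximately −0.49).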

Table 6. Comparison with related works in terms of cuff-less calibration results.

						Error
Author	Dataset	Method	Input	Signals	Calibration	DBP	SBP
Chan et al. [18]	Unspecified	Linear regression	Feature	ECG, PPG	Cal-based	ME: 4.08	ME: 7.49
						STD: 5.62	STD: 8.82
Kurylyak et al. [20]	MIMIC (15,000 beats)	Neural network	Feature	PPG	Cal-based	MAE: 2.21	MAE: 3.80
						STD: 2.09	STD: 3.46
Chowdhury et al. [21]	Dataset from [45] (126 subjects)	Gaussian process regression (GPR)	Feature	PPG	Cal-based	MAE: 1.74	MAE: 3.02
						RMSE: 3.59	RMSE: 6.74
						R: 0.96	R: 0.95
Eom et al. [24]	Own data (15 subjects)	Deep learning (CNN + GRU + Attention)	Raw	ECG, BCG, PPG	Cal-based	MAE: 3.33	MAE: 4.06
						RMSE: 3.42	RMSE: 4.04
Monte-Moreno [22]	Own data (410 subjects)	Random Forest (RF)	Feature	PPG	Cal-free	$R^{2}$: 0.89	$R^{2}$: 0.91
Kachuee et al. [19]	MIMIC-II (1000 subjects)	AdaBoost	Feature	ECG, PPG	Cal-free	MAE: 5.35	MAE: 11.17
						STD: 6.14	STD: 10.09
						R: 0.48	R: 0.59
					Cal-based	MAE: 4.31	MAE: 8.21
						STD: 3.52	STD: 5.43
						R: 0.57	R: 0.54
Slapničar et al. [26]	MIMIC-III (510 subjects)	Deep learning (ResNet)	Raw	PPG	Cal-free	MAE: 12.38	MAE: 15.41
					Cal-based	MAE: 6.88	MAE: 9.43
This work	MIMIC-III Matched Subset (1131 subjects)	Deep learning (Seq2seq + Attention)	Raw	PPG	Cal-free	MAE: 6.57	MAE: 14.39
						STD: 8.43	STD: 17.87
						RMSE: 8.47	RMSE: 18.07
						$R^{2}$: 0.19	$R^{2}$: 0.19
					Cal-based	MAE: 5.56	MAE: 12.08
						STD: 7.32	STD: 15.67
						RMSE: 7.40	RMSE: 15.96
						$R^{2}$: 0.41	$R^{2}$: 0.39

ME, RMSE, STD, and MAE in mmHg.

Table 7. Comparison with related works in terms of waveform results.

Author	Dataset	Method	Calibration	Error
Sideris et al. [27]	MIMIC (42 subjects)	LSTM	Cal-based	RMSE: 6.0
				STD: 3.26
				R: 0.95
Sadrawi et al. [28]	Own data (18 subjects)	DCAE	Cal-based	RMSE: 3.46
				MAE: 2.33
				R: 0.98
This work	MIMIC-III Matched Subset (1131 subjects)	Seq2seq+Attention	Cal-free	RMSE: 10.26
				MAE: 8.89
				R: 0.98
			Cal-based	RMSE: 8.65
				MAE: 7.37
				R: 0.98

RMSE, STD and MAE in mmHg.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

