Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression

Wang, Mengsi; Chen, Zhenlei; Zhan, Haoran; Zhang, Jiyu; Wu, Xinglong; Jiang, Dan; Guo, Qing

doi:10.3390/s23239576

Open AccessArticle

Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression^†

¹

School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu 611731, China

²

Aircraft Swarm Intelligent Sensing and Cooperative Control Key Laboratory of Sichuan Province, Chengdu 611731, China

³

School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

⁴

School of Instrumentation Science and Engineering, Harbin Institute of Technology, Harbin 150001, China

⁵

School of Mechanical and Electrical Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

^*

Author to whom correspondence should be addressed.

^†

Portions of this work were presented at the 8th International Conference on Advanced Robotics and Mechatronics in 2023, ‘Lower Limb Joint Torque Estimation by Neural Network and Sparse Gaussian Process with RIO Kernel’. This submission has been expanded to add value to the previous conference proceedings, by including more complete data and new verification results for the machine learning method.

Sensors 2023, 23(23), 9576; https://doi.org/10.3390/s23239576

Submission received: 27 October 2023 / Revised: 21 November 2023 / Accepted: 1 December 2023 / Published: 2 December 2023

(This article belongs to the Special Issue Wearable Sensors for Human Movement Analysis Related to Biomechanics and Exercise Physiology)

Download

Browse Figures

Versions Notes

Abstract

:

The accurate prediction of joint torque is required in various applications. Some traditional methods, such as the inverse dynamics model and the electromyography (EMG)-driven neuromusculoskeletal (NMS) model, depend on ground reaction force (GRF) measurements and involve complex optimization solution processes, respectively. Recently, machine learning methods have been popularly used to predict joint torque with surface electromyography (sEMG) signals and kinematic information as inputs. This study aims to predict lower limb joint torque in the sagittal plane during walking, using a long short-term memory (LSTM) model and Gaussian process regression (GPR) model, respectively, with seven characteristics extracted from the sEMG signals of five muscles and three joint angles as inputs. The majority of the normalized root mean squared error (NRMSE) values in both models are below

15 %

, most Pearson correlation coefficient (R) values exceed

0.85

, and most decisive factor (

R^{2}

) values surpass

0.75

. These results indicate that the joint prediction of torque is feasible using machine learning methods with sEMG signals and joint angles as inputs.

Keywords:

joint torque; electromyography signals; machine learning; long short-term memory; Gaussian process regression

1. Introduction

The real-time and accurate prediction of lower limb joint torque has important research significance in many fields. In sports rehabilitation, it serves as a foundation for the understanding of changes in people’s muscle strength, and it enables doctors to guide the process of rehabilitation training effectively [1,2]. In human–machine interaction systems, it forms the basis for machines to discern human motor intentions and adjust assistance strategies promptly [3,4,5]. Unfortunately, the use of mechanical torque sensors to measure subjects’ active joint torque is challenging because the sensed torque signals include undesired torque components, such as gravity torque, friction torque, and inertial torque, which need to be eliminated using complicated processing algorithms [6]. Currently, three methods are employed to compute joint torque, namely the inverse dynamics model, EMG-driven NMS model, and machine learning model.

The inverse dynamics model [7,8] is considered a standard method that requires a complex experimental environment and equipment. Two common approaches to developing an inverse dynamics model are the Newton–Euler equations and Lagrange equations. Both methods rely on GRF measurements, which are typically only available in laboratory settings.

The EMG-driven NMS model builds upon the muscle–tendon model, which can be traced back to the Hill-type muscle model proposed in 1938 [9]. At present, the improved model proposed by Buchanan and Lloyd is the most widely used [10,11]. However, the EMG-driven NMS model faces three major challenges. Firstly, measuring individual physiological parameters in vivo is difficult. Secondly, the model requires input from all muscles involved in joint motion. Thirdly, the iterative optimization process for model parameters is tedious.

In recent years, machine learning models have gained popularity in many research fields due to their ability to learn from large amounts of data without relying on explicit equations [12,13,14]. As sEMG signals exhibit smaller time delays and a higher signal-to-noise ratio (SNR), it may be an ideal method to use sEMG signals to estimate joint torque [6]. Meanwile, user-friendly wireless sEMG sensors make it possible to accurately measure sEMG signals during human movement. Therefore, various non-model-based machine learning regression methods, such as regression trees (RT) [15,16,17], support vector machines (SVM) [18,19,20], neural networks (NN) [21,22,23], and Gaussian process regression (GPR) [24,25,26], have been applied to predict joint torque using sEMG signals and motion information (joint angles, angular velocity, angular acceleration, etc.), which can be measured with portable devices such as sEMG sensors integrated with inertial measurement units (IMUs).

Among neural network models, the long short-term memory (LSTM) model has shown excellent performance in time series prediction tasks, as it effectively captures and remembers long-term dependencies through its gating unit design. The use of an LSTM model to predict joint torque with sEMG signals and motion information is a suitable choice, as all these variables are time series data related to human movement. Previous studies have demonstrated the effectiveness of using LSTM models for torque estimation. Siu et al. [27] found that an LSTM model used to estimate ankle torque with sEMG signals and accelerometry as inputs outperformed other methods, such as dense feedforward neural networks (FNN), convolutional neural networks (CNNs) [28,29], and neural ordinary differential equations (ODEs). Zhang et al. [30] observed that an LSTM model could predict lower limb joint torque during various activities accurately, with a relatively low error, using sEMG signals and joint angles as inputs. Truong et al. [31] extracted several sEMG features to predict joint angles and joint torque using an LSTM model when squatting, picking up an object, and sitting–standing.

Unlike the point prediction in NNs, GPR can not only predict the value of the target variable but also estimate the uncertainty of the prediction. It provides reliability to the prediction by calculating the confidence interval. Yang et al. [32] estimated the joint torque with a GPR model using the GRF and foot motion from wearable smart shoes while walking at three different speeds. Most

R^{2}

values in their experiment were higher than

0.8

, indicating good predictive performance. Ullauri et al. [33] compared a GPR model and pneumatic artificial muscle (PAM) model with muscle activation calculated by measuring sEMG signals to predict elbow torque, and they found that GPR provided relatively more favorable predictions.

Based on previous research about joint torque estimation by BPNN learning [34], this study aims to predict the joint torque during walking using an LSTM model and GPR model with sEMG signals and joint angles as inputs. Figure 1 shows the workflow of this work. This study can be referenced for the prediction of joint torque when only sEMG signals and joint angles are available and the GRF cannot be measured. The main contributions of this work are as follows:

The EMG signals, kinematics, and dynamics data are collected and processed during the normal walking of four subjects.
An LSTM model and GPR model are built to predict torque using EMG signals and joint angles without GRF.

This article is structured as follows. Section 2 describes the methods of data collection and data processing, and the principles of the LSTM model and GPR model. Section 3 shows the results of the two models. Section 4 discusses the performance and limitations of the two models. Conclusions and future work are given in Section 5.

2. Materials and Methods

2.1. Data Collection

Four healthy young male subjects (mean ± STD, age ± 0.5 years, height ± 4.30 kg, weight ± 3.0, Table 1) were recruited to participate in the data collection experiment. All subjects were asked to perform 5 walking trials at a speed of 0.8 m/s on a 5-m flat surface with 7 embedded force platforms. sEMG data, force data, and motion capture data were collected simultaneously. The collected data were divided according to the gait cycle (from left foot toe-off to the subsequent left toe-off). Multiple data of 5 complete walking gait cycles for each subject were acquired.

2.1.1. EMG Data Collection

sEMG signals were collected using 16 wireless sEMG sensors (Pico EMG, Cometa Systems, Inc., Newburg, MO, USA, Figure 2) at 2000 Hz. For each subject, 16 sEMG signals were collected simultaneously from the gluteus maximus (GMX), rectus femoris (RF), vastus medialis (VM), vastus lateralis (VL), biceps femoris (BF), semitendinosus (ST), tibialis anterior (TA), and gastrocnemius (GC) of both the left and right legs. Figure 3 illustrates the locations where the EMG sensors were placed on the subject’s body. sEMG signals from GMX, RF, BF, TA, and GC from the left leg were used in this study.

2.1.2. Force and Motion Data Collection

Force and motion data were collected using a 3D motion capture system (Qualisys, Goteborg, Sweden, Figure 4) at 100 Hz. Subjects were asked to perform movements with their legs stepping on different force plates. A total of 51 reflective markers were fixed on the subjects to record motion trajectories (Figure 3). The GRF data were collected by 7 platforms and the marker trajectories were captured by 41 cameras.

The sEMG signals, joint angles, and joint torque of the left leg of each subject were studied in this work.

2.2. Data Processing

2.2.1. sEMG Feature Extraction

The raw sEMG signals were band-pass filtered (20–450 Hz) by a 4th-order zero-lag Butterworth filter [35]. Then, 5 time domain features, namely the mean absolute value (MAV), root mean square (RMS), zero crossing (ZC), slope sign change (SSC), and waveform length (WL) [36,37,38], and 2 frequency domain features, namely the mean frequency (MNF) and median frequency (MDF) [39], were extracted. The time domain features were extracted using a 150-ms (300 points) movable overlapped window, and the frequency domain features were extracted using a 128-ms (256 points) movable overlapped window.

MAV:

$M A V = \frac{1}{N} \sum_{i = 1}^{N} | X_{i} |,$

(1)

where X represents the EMG signal in a movable window and N represents the window length.
RMS:

$R M S = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} X_{i}^{2}} .$

(2)
ZC:

$Z C = \sum_{i = 1}^{N - 1} s g n (X_{i} \cdot X_{i + 1}) \cap | X_{i} - X_{i + 1} | \geq 0,$

(3)

where

$s g n (x) = \{\begin{matrix} 1, & if x \geq 0 \\ 0, & otherwise . \end{matrix}$
SSC:

$S S C = \sum_{i = 2}^{N - 1} s g n ((X_{i} - X_{i - 1}) \cdot (X_{i} - X_{i + 1})) .$

(4)
WL:

$W L = \sum_{i = 1}^{N - 1} | X_{i} - X_{i + 1} | .$

(5)
MNF:

$M N F = \frac{\sum_{j = 1}^{M} f_{j} P_{j}}{\sum_{j = 1}^{M} P_{j}},$

(6)

where $f_{j}$ is the jth frequency component, $P_{j}$ is the power spectrum at $f_{j}$ , and M is the total number of frequency components.
MDF:

$\sum_{j = 1}^{M D F} P_{j} = \sum_{j = M D F}^{M} P_{j} = \frac{1}{2} \sum_{j = 1}^{M} P_{j} .$

(7)

EMG signals were downsampled to 100 Hz after feature extraction to correspond to force and motion data.

2.2.2. Inverse Kinematics

Joint angles were calculated by the inverse kinematics toolbox in OpenSim 4.4 (an open-source software system for biomechanical modeling, simulation, and analysis, SimTK, Stanford, CA, USA). The Gait2392_Simbody model was chosen for the research in this study. First, for each subject, a scaled model was built to match the markers on his body best. Then, the joint angles were obtained by optimizing the error between the virtual marker positions on the scaled model and the real marker positions from experiments. The detailed objective function is as follows:

\underset{q}{m i n} [\sum_{i \in m a r k e r s} ω_{i} ‖ {x_{i}^{e x p} - x_{i} (q) ‖}^{2}],

(8)

where

ω_{i}

is the weight of the marker i,

q

is the joint angle vector being solved for,

x_{i}^{e x p}

is the experimental position of marker i,

x_{i} (q)

is the virtual position of marker i at given joint angle vector

q

, and

∥ \cdot ∥

represents the standard Euclidean norm.

2.2.3. Inverse Dynamics

Joint torque was calculated by the inverse dynamics toolbox in OpenSim 4.4 after joint angles were obtained. The classical equations of motion can be written as follows:

M (q) \ddot{q} + C (q, \dot{q}) + G (q) = τ,

(9)

where

q

,

\dot{q}

,

\ddot{q}

are angles, angular velocities, and angular accelerations, respectively;

M (q)

is the mass matrix;

C (q, \dot{q})

is the vector of Coriolis and centrifugal torque;

G (q)

is the vector of gravitational torque; and

τ

is the vector of the unknown generalized torque.

The features of sEMG signals, the joint angles, and the joint torque were normalized by the maximum value of each channel. The input signal was 38-dimensional (7 features of 5 sEMG signals and 3 joint angles).

2.3. LSTM Neural Network Model

LSTM is a type of recurrent neural network (RNN) that is designed to overcome the limitations of traditional RNNs in capturing long-term dependencies in sequential data. LSTM units, which are the building blocks of an LSTM network, are more complex than traditional RNN units. A typical LSTM unit is composed of a cell state, an input gate, an output gate, and a forget gate [40,41], as Figure 5 shows. These components work together to process sequential data and maintain information over long periods.

The specific forward propagation formulas of an LSTM unit are as follows. First, the forget gate decides which information to discard in the last cell state,

f_{t} = σ (W_{f} \cdot x_{t} + U_{f} \cdot h_{t - 1} + b_{f}),

(10)

where

f_{t} \in {(0, 1)}^{h}

represents the forget gate’s activation vector,

x_{t} \in R^{d}

represents the input with d features at time t,

h_{t - 1} \in R^{h}

represents the output at time

t - 1

,

W_{f} \in R^{h \times d}

and

U_{f} \in R^{h \times h}

represent the weight matrix,

b_{f} \in R^{h}

represents the bias vector, and

σ

represents the sigmoid activation function. Second, the input gate determines which information to keep,

i_{t} = σ (W_{i} \cdot x_{t} + U_{i} \cdot h_{t - 1} + b_{i}),

(11)

{\tilde{c}}_{t} = t a n h (W_{c} \cdot x_{t} + U_{c} \cdot h_{t - 1} + b_{c}),

(12)

where

i_{t} \in {(0, 1)}^{h}

represents the update gate’s activation vector,

{\tilde{c}}_{t} \in {(- 1, 1)}^{h}

stands for the cell input activation vector,

W_{i}, W_{c} \in R^{h \times d}, U_{i}, U_{c} \in R^{h \times h}

represents the weight matrix,

b_{i}, b_{c} \in R^{h}

represents the bias vector, and

t a n h

represents the hyperbolic tangent activation function. Then, the cell state is updated from time

t - 1

to time t,

c_{t} = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ {\tilde{c}}_{t},

(13)

where

c_{t} \in R^{h}

is the cell state vector at time t, and

c_{t - 1}

is the cell state vector at time

t - 1

. Finally, the output is obtained,

o_{t} = σ (W_{o} \cdot x_{t} + U_{o} \cdot h_{t - 1} + b_{o}),

(14)

h_{t} = o_{t} ⊙ t a n h (c_{t}),

(15)

where

o_{t} \in {(0, 1)}^{h}

represents the output gate’s activation vector;

h_{t} \in {(- 1, 1)}^{h}

is the hidden state vector, also known as the output vector of an LSTM unit;

W_{o} \in R^{h \times d}

and

U_{o} \in R^{h \times h}

represent the weight matrices;

b_{o} \in R^{h}

represents the bias vector; and the operator ⊙ represents the Hadamard product. The initial values are

c_{0} = 0

and

h_{0} = 0

, and the parameters W, U, and b need to be learned during the training process.

In this study, a network model with 3 LSTM layers and a fully connected (FC) layer was built using the deep learning toolbox of Matlab 2021b. The architecture of the model is shown in Figure 6. Each LSTM layer consisted of 32, 16, and 8 hidden neuron units, respectively, and the output layer had 1 neuron. The output mode of the first and second LSTM layers was set to ‘sequence’, and the output mode of the third LSTM layer was set to ‘last’. A sigmoid layer was included between the third LSTM layer and the fully connected layer. The solver was set to be ‘adam’ and the training was performed for 500 epochs. To prevent the gradients from exploding during training, a gradient threshold of 1 was set. The initial learning rate was set to 0.005. After 125 epochs, the learning rate was dropped by multiplying it with a factor of 0.2. The format of the input data was transformed to time slices (sample number × features × time step). The time step was set to be 5, which was equivalent to 50 ms. The model was trained to predict the output at the next time step, which was 10 ms ahead of real time. For each joint of each subject, a model was trained individually. In this study, 12 different LSTM models were obtained.

2.4. GPR Model

The Gaussian process (GP) is a non-parametric learning method that has advantages over parametric methods when given a small training set [42]. Unlike point prediction in NNs, GP can quantify the uncertainty of the point prediction, which is favorable to decrease risks in decision making. In the regression model, decision makers can forecast the possible outcomes with an explicit probability when provided a 95% confidence interval around the prediction [43].

Any finite subset of a set of random variables

{f (x) | x \in X}

that follows a Gaussian distribution is known as a Gaussian process [44]. To specify a GP, the mean function

m (x)

and the covariance function

k (x, x^{^{'}})

should be defined, which contain a series of hyperparameters

θ

[45,46]. Then, the GP can be written as

f (x) \sim GP (m (x), k (x, x^{^{'}})) .

(16)

Given a training data set

D (X, y) = {(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{N}, y_{N})}

of N pairs of vectorial input

x_{i}

and noisy scalar output

y_{i}

,

y_{i}

is obtained by the latent function

f (x_{i})

with Gaussian noise

y_{i} = f (x_{i}) + ϵ_{i},

(17)

where

f (x) \sim GP (0, k (x, x^{^{'}}))

,

ϵ_{i} \sim N (0, σ^{2})

. We have

f \sim N (0, K (X, X)),

(18)

y \sim N (0, K (X, X) + σ^{2} I) .

(19)

For a testing data set

D = (X_{*}, y_{*})

of

N_{*}

points, the joint distribution

p (y, f_{*})

can be obtained as

[\begin{matrix} y \\ f_{*} \end{matrix}] \sim N (0, [\begin{matrix} K_{N N} + σ^{2} I & K_{N N^{*}} \\ K_{N^{*} N} & K_{N^{*} N^{*}} \end{matrix}]),

(20)

where

K_{N N^{*}}

denotes the

N \times N_{*}

covariance matrix of the training points and the testing points.

K_{N N}

,

K_{N^{*} N}

and

K_{N^{*} N^{*}}

are similar. The Gaussian predictive distribution

p (f_{*} | y)

can be obtained from the joint distribution

p (y, f_{*})

as

f_{*} | y \sim N ({\bar{f}}_{*}, c o v (f_{*})),

(21)

where

{\bar{f}}_{*} = K_{N^{*} N} {[K_{N N} + σ^{2} I]}^{- 1} y,

(22)

c o v (f_{*}) = K_{N^{*} N^{*}} - K_{N^{*} N} {[K_{N N} + σ^{2} I]}^{- 1} K_{N N^{*}} .

(23)

The training process of GP is actually to learn the hyperparameters (

θ, σ^{2}

) by maximizing the log-marginal likelihood

l o g p (y) = \int p (y | f) p (f) d f

, namely

l o g p (y) = - \frac{1}{2} y^{⊤} {(K + σ^{2} I)}^{- 1} y - \frac{1}{2} l o g | K + σ^{2} I | - \frac{n}{2} l o g 2 π,

(24)

where

K

denotes

K (X, X)

in (9).

In this study, the GPR model was implemented using the regression learner in Matlab 2021b. The goal was to predict the torque at the current time point based on input feature data from the previous time point, with a prediction horizon of 10 ms ahead of real time. After evaluating the performance of 4 kinds of kernel functions (rational quadratic, squared exponential, matern5/2, exponential), the exponential kernel function was finally selected. The basis function was set to be constant and the standardization was set to be true. For each joint of each subject, a separate GPR model was trained, resulting in a total of 12 distinct GPR models in this study.

2.5. Evaluation Protocol

For each subject, 5 complete gait cycles were acquired. Out of the 5 gait cycles, 4 cycles were used as the training set to train the models, while the remaining cycle was used as the testing set to evaluate the performance of the models.

The NRMSE is as follows:

N R M S E = \frac{R M S E}{y_{m a x} - y_{m i n}} \times 100 %,

(25)

where

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}

represents the root mean squared error, the Pearson correlation coefficient R,

R = \frac{\sum_{}^{} (y_{i} - {\bar{y}}_{i}) ({\hat{y}}_{i} - {\bar{\hat{y}}}_{i})}{\sqrt{\sum_{}^{} {(y_{i} - {\bar{y}}_{i})}^{2} {({\hat{y}}_{i} - {\bar{\hat{y}}}_{i})}^{2}}}

(26)

and the decisive factor

R^{2}

,

R^{2} = 1 - \frac{S S E}{S S T},

(27)

where

S S E = \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}

,

\sum_{i = 1}^{N} {(y_{i} - {\bar{y}}_{i})}^{2}

, are all used to evaluate the model performance. A lower NRMSE and higher R and

R^{2}

close to 1 indicate better regression performance.

3. Results

3.1. The Result of the LSTM Model

Table 2 shows the performance of the LSTM model in predicting joint torque for all subjects. For the hip joints of four subjects, the NRMSE values are less than

15 %

, the R values are more than

0.92

, and the

R^{2}

values are more than

0.79

. For the knee joint, the NRMSE values are more than

17 %

, the R values are more than

0.83

, and the

R^{2}

values are more than

0.62

. For the ankle joint, the NRMSE values are less than

8 %

, the R values are more than

0.95

, and the

R^{2}

values are more than

0.90

.

Certain differences exist between different people. The best hip prediction occurs on subject 4, of which the NRMSE value is

11.2691

, the R value is

0.9469

, and the

R^{2}

value is

0.8512

. The best knee prediction occurs on subject 1, of which the NRMSE value is

12.3561

, the R value is

0.9102

, and the

R^{2}

value is

0.8207

. The best ankle prediction occurs on subject 3, of which the NRMSE value is

7.6790

, the R value is

0.9837

, and the

R^{2}

value is

0.9646

. Figure 7 displays the real torque and the predicted torque of the LSTM model.

3.2. The Result of the GPR Model

Table 3 presents the performance of the GPR model in predicting the joint torque for all subjects. For the hip joints of four subjects, the NRMSE values are less than

17 %

, the R values are more than

0.91

, and the

R^{2}

values are more than

0.75

. For the knee joint, the NRMSE values are less than

18 %

, the R values are more than

0.85

, and the

R^{2}

values are more than

0.56

. For the ankle joint, the NRMSE values are less than

8 %

, the R values are more than

0.94

, and the

R^{2}

values are more than

0.81

.

The best hip prediction occurs on subject 1, of which the NRMSE value is

8.6833

, the R value is

0.9661

, and the

R^{2}

value is

0.9070

. The best knee prediction occurs on subject 3, of which the NRMSE value is

11.4735

, the R value is

0.9328

, and the

R^{2}

value is

0.7999

. The best ankle prediction occurs on subject 3, of which the NRMSE value is

6.2121

, the R value is

0.9801

, and the

R^{2}

value is

0.9590

. Figure 8 shows the real torque and the predicted torque of the GP model, with shaded areas representing the

95 %

confidence intervals of the predicted torque.

4. Discussion

In this study, an LSTM model and GPR model were used to predict joint torque in the sagittal plane during walking, using sEMG signals and joint angles as inputs. The relevant regression performance (NRMSE, R,

R^{2}

) indicated that the predicted torque was in good agreement with the torque calculated by inverse dynamics. In the LSTM model, the average regression performance for the ankle (NRMSE: 6.5850%, R: 0.9711,

R^{2}

: 0.9361), hip (NRMSE: 12.5728%, R: 0.9324,

R^{2}

: 0.8336), and knee (NRMSE: 14.4465%, R: 0.8763,

R^{2}

: 0.7560) decreased in sequence. In [27], Siu et al. took sEMG signals and accelerations as inputs, using an LSTM model to predict the ankle torque (average R of five subjects: 0.84) accurately. However, only the maximum value and area for each window of the sEMG signals were extracted. In our study, seven features of sEMG signals were extracted, and, in comparison, the average R was improved. In the GPR model, the average regression performance for the ankle (NRMSE: 6.4766%, R: 0.9636,

R^{2}

: 0.9018), hip (NRMSE: 12.3216%, R: 0.9391,

R^{2}

: 0.8371), and knee (NRMSE: 14.3591%, R: 0.8980,

R^{2}

: 0.7335) also decreased in order. In [32], Yang et al. used a GP model to predict joint torque with plantar pressure, orientations, and angular velocities as inputs. In the case of walking at the speed of 0.8 m/s, the prediction of ankle (average

R^{2}

: 0.8905), knee (average

R^{2}

: 0.7250), and hip (average

R^{2}

: 0.7390) torque was precise. However, it is inconvenient to measure plantar pressure with smart shoes. In our study, the measurement of the sEMG signal was more user-friendly. Portable wireless sEMG sensors integrated with IMUs made it possible to predict the joint torque in real time.

Although the low NRMSE and the high R and

R^{2}

in both the LSTM model and GPR model indicate that the joint torque has a relatively strong correlation and high consistency with sEMG signals and joint angles, there still exist some limitations. The performance of the hip and knee torque is not as good as that of the ankle torque. In [30], Zhang et al. also used an LSTM model to predict joint torque, with good performance. Their study indicated that the best prediction performance during walking was for the hip, followed by the ankle and finally the knee. Our result was not exactly consistent with this. We suppose that the prediction performance can be essentially attributed to the complexity of the output curve to be predicted, including the number of data points, number of peaks, and data volatility. The more complex the output curve, the more challenging it is to accurately predict. Therefore, one solution to fundamentally improve the prediction performance is to add more valid EMG signals.

There are certain differences in the physiological and movement characteristics of different subjects, reflected in the data and results. In the LSTM model, the ability to estimate the knee joint torque of subject 1 was better than for subject 4. In the GP model, the ability to estimate the hip joint torque of subject 1 was better than for subject 3. For a specific subject, the regression performance would be improved if the adjustable parameters in the model (number of network layers, number of neurons, etc.) were individually adjusted.

Additionally, the EMG signal itself lacks robustness. In this research, the measurement of sEMG signals posed a challenge. The absolute amplitude of the sEMG signal itself has little reference significance. Factors such as the sensor itself, where the sensor is attached, and the tightness of the sensor attachment all affect the quality of the sEMG signal. To ensure consistency and reliability, all data must be collected in the same experiment, minimizing variations caused by these factors.

5. Conclusions and Future Work

In this study, the hip, knee, and ankle torque were predicted during walking in four subjects by an LSTM model and GPR model. The results indicate that both the LSTM and GPR models performed well in predicting the joint torque given the sEMG signals and joint angles as inputs, with low NRMSE values and high R and

R^{2}

. For all joints of each model, the average NRMSE values were all less than

15 %

, the R were all more than

0.87

, and the

R^{2}

were all more than

0.73

. When using machine learning models like LSTM and GPR, on the one hand, there is no need to build dynamic models or measure the GRF compared to inverse dynamic models. On the other hand, the complex optimization process associated with NMS models is not necessary. The proposed models have potential application in various fields, including exoskeleton rehabilitation systems, exoskeleton assistance systems, and sports training. These models can provide accurate and real-time estimates of joint torque during walking when force plates are limited or unavailable.

In the future, our research group will conduct more in-depth research considering the following four aspects. Firstly, the diversity of the subjects should be increased. A larger data set including more subjects with different body types and different ages should be built. Additionally, studying subjects with lower limb disabilities has broader significance. Secondly, expanding the diversity of movements beyond walking would be beneficial. Including activities such as running, jumping, and cycling in daily life will provide a broader perspective on joint torque prediction across various dynamic movements. Thirdly, investigating inter-movement and inter-subject predictions is a more challenging task. It would involve using the current model and data set to predict joint torque for new movements and individuals. This research direction would contribute to a more comprehensive understanding of the generalizability and adaptability of the proposed model. Lastly, exploring torque prediction over a period of time, rather than at a single point in time, would be a valuable future direction.

Author Contributions

M.W. was responsible for the writing of this article; Q.G. proposed the conception of the method; Z.C. and H.Z. collected and processed related experimental data; J.Z. and X.W. verified the experimental results; D.J. was responsible for the writing supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant Nos. 52175046 and 52305054), Sichuan Science and Technology Program (Grant Nos. 2022JDRC0018 and 2022YFG0341), and China Postdoctoral Science Foundation (No. 2022M720024).

Informed Consent Statement

All subjects gave informed consent for inclusion before they participated in the data collection stage of the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Burnfield, J.M.; Josephson, K.R.; Powers, C.M.; Rubenstein, L.Z. The influence of lower extremity joint torque on gait characteristics in elderly men. Arch. Phys. Med. Rehabil. 2000, 81, 1153–1157. [Google Scholar] [CrossRef]
Kerrigan, D.C.; Lelas, J.L.; Goggins, J.; Merriman, G.J.; Kaplan, R.J.; Felson, D.T. Effectiveness of a lateral-wedge insole on knee varus torque in patients with knee osteoarthritis. Arch. Phys. Med. Rehabil. 2002, 83, 889–893. [Google Scholar] [CrossRef] [PubMed]
Guo, Q.; Chen, Z.; Yan, Y.; Xiong, W.; Jiang, D.; Shi, Y. Model identification and human-robot coupling control of lower limb exoskeleton with biogeography-based learning particle swarm optimization. Int. J. Control Autom. Syst. 2022, 20, 589–600. [Google Scholar] [CrossRef]
Chen, Z.; Guo, Q.; Li, T.; Yan, Y.; Jiang, D. Gait prediction and variable admittance control for lower limb exoskeleton with measurement delay and extended-state-observer. IEEE Trans. Neural Netw. Learn. Syst. 2022, 34, 8693–8706. [Google Scholar] [CrossRef] [PubMed]
Chen, Z.; Guo, Q.; Li, T.; Yan, Y. Output Constrained Control of Lower Limb Exoskeleton Based on Knee Motion Probabilistic Model With Finite-Time Extended State Observer. IEEE/ASME Trans. Mechatron. 2023, 28, 2305–2316. [Google Scholar] [CrossRef]
Gui, K.; Liu, H.; Zhang, D. A practical and adaptive method to achieve EMG-based torque estimation for a robotic exoskeleton. IEEE/ASME Trans. Mechatron. 2019, 24, 483–494. [Google Scholar] [CrossRef]
Winter, D.A. Biomechanics and Motor Control of Human Movement; John Wiley & Sons: Hoboken, NJ, USA, 2009. [Google Scholar]
Zajac, F.E.; Neptune, R.R.; Kautz, S.A. Biomechanics and muscle coordination of human walking: Part I: Introduction to concepts, power transfer, dynamics and simulations. Gait Posture 2002, 16, 215–232. [Google Scholar] [CrossRef] [PubMed]
Hill, A.V. The heat of shortening and the dynamic constants of muscle. Proc. R. Soc. Lond. Ser. B-Biol. Sci. 1938, 126, 136–195. [Google Scholar]
Lloyd, D.G.; Besier, T.F. An EMG-driven musculoskeletal model to estimate muscle forces and knee joint moments in vivo. J. Biomech. 2003, 36, 765–776. [Google Scholar] [CrossRef]
Buchanan, T.S.; Lloyd, D.G.; Manal, K.; Besier, T.F. Neuromusculoskeletal modeling: Estimation of muscle forces and joint moments and movements from measurements of neural command. J. Appl. Biomech. 2004, 20, 367–395. [Google Scholar] [CrossRef]
Lu, L.; Huang, C.; Song, X. Bifurcation control of a fractional-order PD control strategy for a delayed fractional-order prey–predator system. Eur. Phys. J. Plus 2023, 138, 1–11. [Google Scholar] [CrossRef]
Xu, C.; Liu, Z.; Li, P.; Yan, J.; Yao, L. Bifurcation mechanism for fractional-order three-triangle multi-delayed neural networks. Neural Process. Lett. 2022, 55, 6125–6151. [Google Scholar] [CrossRef]
Li, P.; Peng, X.; Xu, C.; Han, L.; Shi, S. Novel extended mixed controller design for bifurcation control of fractional-order Myc/E2F/miR-17-92 network model concerning delay. Math. Methods Appl. Sci. 2023, 46, 18878–18898. [Google Scholar] [CrossRef]
Shi, Y.; Dong, W.; Lin, W.; He, L.; Wang, X.; Li, P.; Gao, Y. Human Joint Torque Estimation Based on Mechanomyography for Upper Extremity Exosuit. Electronics 2022, 11, 1335. [Google Scholar] [CrossRef]
Al-Timemy, A.H.; Zonnino, A.; Sergi, F. Estimating wrist joint torque using regression ensemble of bagged trees under multiple wrist postures. In Proceedings of the 2020 8th IEEE RAS/EMBS International Conference for Biomedical Robotics and Biomechatronics (BioRob), New York, NY, USA, 29 November–1 December 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1152–1157. [Google Scholar]
Molinaro, D.D.; Kang, I.; Camargo, J.; Young, A.J. Biological hip torque estimation using a robotic hip exoskeleton. In Proceedings of the 2020 8th IEEE RAS/EMBS International Conference for Biomedical Robotics and Biomechatronics (BioRob), New York, NY, USA, 29 November–1 December 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 791–796. [Google Scholar]
Song, Q.; Sun, B.; Lei, J.; Gao, Z.; Yu, Y.; Liu, M.; Ge, Y. Prediction of human elbow torque from EMG using SVM based on AWR information acquisition platform. In Proceedings of the 2006 IEEE International Conference on Information Acquisition, Veihai, China, 20–23 August 2006; IEEE: Piscataway, NJ, USA, 2006; pp. 1274–1278. [Google Scholar]
Anwar, T.; Al Jumaily, A. EMG signal based knee joint torque estimation. In Proceedings of the 2016 International Conference on Systems in Medicine and Biology (ICSMB), Kharagpur, India, 4–7 January 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 182–185. [Google Scholar]
Wu, G.; Zhang, J.; Li, G.; Wang, L.; Yu, Q.; Guo, J. Identification method of nonlinear maneuver model for unmanned surface vehicle from sea trial data based on support vector machine. J. Mech. Sci. Technol. 2022, 36, 4257–4267. [Google Scholar] [CrossRef]
Zhang, L.; Li, Z.; Hu, Y.; Smith, C.; Farewik, E.M.G.; Wang, R. Ankle joint torque estimation using an EMG-driven neuromusculoskeletal model and an artificial neural network model. IEEE Trans. Autom. Sci. Eng. 2020, 18, 564–573. [Google Scholar] [CrossRef]
Peng, L.; Hou, Z.G.; Wang, W. A dynamic EMG-torque model of elbow based on neural networks. In Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy, 25–29 August 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 2852–2855. [Google Scholar]
Zhang, Y.; Zhang, X.; Lu, Z.; Jiang, Z.; Zhang, T. A novel wrist joint torque prediction method based on EMG and LSTM. In Proceedings of the 2020 10th Institute of Electrical and Electronics Engineers International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), Xi’an, China, 10–13 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 242–245. [Google Scholar]
Dai, X.; Andani, H.T.; Alizadeh, A.; Abed, A.M.; Smaisim, G.F.; Hadrawi, S.K.; Karimi, M.; Shamsborhan, M.; Toghraie, D. Using Gaussian Process Regression (GPR) models with the Matérn covariance function to predict the dynamic viscosity and torque of SiO2/Ethylene glycol nanofluid: A machine learning approach. Eng. Appl. Artif. Intell. 2023, 122, 106107. [Google Scholar] [CrossRef]
Nguyen-Tuong, D.; Seeger, M.; Peters, J. Computed torque control with nonparametric regression models. In Proceedings of the 2008 American Control Conference, Seattle, WA, USA, 11–13 June 2008; IEEE: Piscataway, NJ, USA, 2008; pp. 212–217. [Google Scholar]
Pei, X.; Zhou, Y.; Wang, N. A Gaussian process regression based on variable parameters fuzzy dominance genetic algorithm for B-TFPMM torque estimation. Neurocomputing 2019, 335, 153–169. [Google Scholar] [CrossRef]
Siu, H.C.; Sloboda, J.; McKindles, R.J.; Stirling, L.A. A neural network estimation of ankle torques from electromyography and accelerometry. IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 1624–1633. [Google Scholar] [CrossRef]
Guo, Q.; Zhang, Y.; Celler, B.G.; Su, S.W. Neural adaptive backstepping control of a robotic manipulator with prescribed performance constraint. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 3572–3583. [Google Scholar] [CrossRef]
Guo, Q.; Chen, Z. Neural adaptive control of single-rod electrohydraulic system with lumped uncertainty. Mech. Syst. Signal Proc. 2021, 146, 106869. [Google Scholar] [CrossRef]
Zhang, L.; Soselia, D.; Wang, R.; Gutierrez-Farewik, E.M. Lower-limb joint torque prediction using LSTM neural networks and transfer learning. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 600–609. [Google Scholar] [CrossRef] [PubMed]
Truong, M.T.N.; Ali, A.E.A.; Owaki, D.; Hayashibe, M. EMG-Based Estimation of Lower Limb Joint Angles and Moments Using Long Short-Term Memory Network. Sensors 2023, 23, 3331. [Google Scholar] [CrossRef] [PubMed]
Yang, J.; Yin, Y. Dependent-Gaussian-process-based learning of joint torques using wearable smart shoes for exoskeleton. Sensors 2020, 20, 3685. [Google Scholar] [CrossRef] [PubMed]
Ullauri, J.B.; Peternel, L.; Ugurlu, B.; Yamada, Y.; Morimoto, J. On the EMG-based torque estimation for humans coupled with a force-controlled elbow exoskeleton. In Proceedings of the 2015 International Conference on Advanced Robotics (ICAR), Istanbul, Turkey, 27–31 July 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 302–307. [Google Scholar]
Wang, M.; Chen, Z.; Zhan, H.; Zhang, J.; Wu, X.; Jiang, D.; Guo, Q. Lower limb joint torque estimation by neural network and Sparse Gaussian Process with RIO Kernel. In Proceedings of the 2008 8th International Conference on Advanced Robotics and Mechatronics, Sanya, China, 8–10 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 809–814. [Google Scholar]
Moreira, L.; Figueiredo, J.; Fonseca, P.; Vilas-Boas, J.P.; Santos, C.P. Lower limb kinematic, kinetic, and EMG data from young healthy humans during walking at controlled speeds. Sci. Data 2021, 8, 103. [Google Scholar] [CrossRef] [PubMed]
Li, G.; Li, J.; Ju, Z.; Sun, Y.; Kong, J. A novel feature extraction method for machine learning based on surface electromyography from healthy brain. Neural Comput. Appl. 2019, 31, 9013–9022. [Google Scholar] [CrossRef]
Toledo-Perez, D.; Rodriguez-Resendiz, J.; Gomez-Loenzo, R.A. A study of computing zero crossing methods and an improved proposal for EMG signals. IEEE Access 2020, 8, 8783–8790. [Google Scholar] [CrossRef]
Bhattacharya, A.; Sarkar, A.; Basak, P. Time domain multi-feature extraction and classification of human hand movements using surface EMG. In Proceedings of the 2017 4th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 6–7 January 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–5. [Google Scholar]
Thongpanja, S.; Phinyomark, A.; Phukpattaranont, P.; Limsakul, C. Mean and median frequency of EMG signal to determine muscle force based on time-dependent power spectrum. Elektron. Elektrotechnika 2013, 19, 51–56. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. LSTM can solve hard long time lag problems. In Proceedings of the 9th International Conference on Neural Information Processing Systems, Denver, CO, USA, 3–5 December 1996; Volume 9. [Google Scholar]
Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to forget: Continual prediction with LSTM. Neural Comput. 2000, 12, 2451–2471. [Google Scholar] [CrossRef]
Cao, D.; Zhao, J.; Hu, W.; Zhang, Y.; Liao, Q.; Chen, Z.; Blaabjerg, F. Robust deep Gaussian process-based probabilistic electrical load forecasting against anomalous events. IEEE Trans. Ind. Inform. 2021, 18, 1142–1153. [Google Scholar] [CrossRef]
Qiu, X.; Meyerson, E.; Miikkulainen, R. Quantifying point-prediction uncertainty in neural networks via residual estimation with an i/o kernel. arXiv 2019, arXiv:1906.00588. [Google Scholar]
Titsias, M. Variational learning of inducing variables in sparse Gaussian processes. In Proceedings of the Artificial Intelligence and Statistics, PMLR, Clearwater Beach, FL, USA, 16–18 April 2009; pp. 567–574. [Google Scholar]
Rasmussen, C.E.; Williams, C.K. Gaussian Processes for Machine Learning; Springer: Berlin/Heidelberg, Germany, 2006; Volume 1. [Google Scholar]
Koriyama, T.; Kobayashi, T. Statistical parametric speech synthesis using deep Gaussian processes. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 948–959. [Google Scholar] [CrossRef]

Figure 1. The overall workflow of this study. First, raw sEMG signals, marker trajectories, and ground reaction forces are collected; next, after being processed, the inputs (sEMG features and joint angles) and the output (joint torques) are obtained; then, parameters in the model are trained using data from the training set; finally, the performance of the models is evaluated using data from the testing set.

Figure 2. Pico EMG, Cometa Systems.

Figure 3. The locations where EMG sensors and reflective markers were placed on the subject’s body.

Figure 4. Qualisys 3D motion capture system.

Figure 5. The structure of a typical LSTM unit.

Figure 6. The structure of the LSTM network model.

Figure 7. The predicted torque using LSTM network model.

Figure 8. The predicted torque using GPR model.

Table 1. The age, height, and weight of the subjects.

Subject	Age (Years)	Height (cm)	Weight (kg)
Subject 1	23.0	168.5	67.70
Subject 2	23.0	172.0	57.20
Subject 3	23.0	175.0	62.50
Subject 4	22.0	174.5	61.75
Average	22.75	172.5	62.29

Table 2. The NRMSE, R, and

R^{2}

of the joint torque prediction using LSTM model.

Table 2. The NRMSE, R, and

R^{2}

of the joint torque prediction using LSTM model.

Subject	Hip			Knee			Ankle
Subject	NRMSE (%)	$R$	$R^{2}$	NRMSE (%)	$R$	$R^{2}$	NRMSE (%)	$R$	$R^{2}$
Subject 1	11.7438	0.9208	0.8299	12.3561	0.9102	0.8207	5.4290	0.9696	0.9318
Subject 2	12.3223	0.9416	0.8583	14.3956	0.8973	0.7877	7.0349	0.9517	0.9010
Subject 3	14.7159	0.9201	0.7949	14.1828	0.8360	0.7949	7.6790	0.9837	0.9646
Subject 4	11.2691	0.9469	0.8512	16.8513	0.8615	0.6206	6.1971	0.9793	0.9469
Average	12.5128	0.9324	0.8336	14.4465	0.8763	0.7560	6.5850	0.9711	0.9361

Table 3. The NRMSE, R, and

R^{2}

of the joint torque prediction using GPR model.

Table 3. The NRMSE, R, and

R^{2}

of the joint torque prediction using GPR model.

Subject	Hip			Knee			Ankle
Subject	NRMSE (%)	R	$R^{2}$	NRMSE (%)	R	$R^{2}$	NRMSE (%)	R	$R^{2}$
subject 1	8.6833	0.9661	0.9070	12.0694	0.9137	0.8290	5.3030	0.9558	0.8976
subject 2	14.3114	0.9165	0.8088	15.9334	0.8870	0.7400	7.7864	0.9443	0.8175
subject 3	16.1742	0.9245	0.7522	11.4735	0.9328	0.7999	6.2121	0.9801	0.9590
subject 4	10.1146	0.9491	0.8802	17.9601	0.8583	0.5690	6.6049	0.9743	0.9334
average	12.3216	0.9391	0.8371	14.3591	0.8980	0.7335	6.4766	0.9636	0.9018

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, M.; Chen, Z.; Zhan, H.; Zhang, J.; Wu, X.; Jiang, D.; Guo, Q. Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression. Sensors 2023, 23, 9576. https://doi.org/10.3390/s23239576

AMA Style

Wang M, Chen Z, Zhan H, Zhang J, Wu X, Jiang D, Guo Q. Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression. Sensors. 2023; 23(23):9576. https://doi.org/10.3390/s23239576

Chicago/Turabian Style

Wang, Mengsi, Zhenlei Chen, Haoran Zhan, Jiyu Zhang, Xinglong Wu, Dan Jiang, and Qing Guo. 2023. "Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression" Sensors 23, no. 23: 9576. https://doi.org/10.3390/s23239576

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression^†

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Collection

2.1.1. EMG Data Collection

2.1.2. Force and Motion Data Collection

2.2. Data Processing

2.2.1. sEMG Feature Extraction

2.2.2. Inverse Kinematics

2.2.3. Inverse Dynamics

2.3. LSTM Neural Network Model

2.4. GPR Model

2.5. Evaluation Protocol

3. Results

3.1. The Result of the LSTM Model

3.2. The Result of the GPR Model

4. Discussion

5. Conclusions and Future Work

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression †

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Collection

2.1.1. EMG Data Collection

2.1.2. Force and Motion Data Collection

2.2. Data Processing

2.2.1. sEMG Feature Extraction

2.2.2. Inverse Kinematics

2.2.3. Inverse Dynamics

2.3. LSTM Neural Network Model

2.4. GPR Model

2.5. Evaluation Protocol

3. Results

3.1. The Result of the LSTM Model

3.2. The Result of the GPR Model

4. Discussion

5. Conclusions and Future Work

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Lower Limb Joint Torque Prediction Using Long Short-Term Memory Network and Gaussian Process Regression^†