Article

Fast Wearable Sensor–Based Foot–Ground Contact Phase Classification Using a Convolutional Neural Network with Sliding-Window Label Overlapping

School of Mechanical Engineering, Soongsil University, Seoul 06978, Korea
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(17), 4996; https://doi.org/10.3390/s20174996
Submission received: 3 August 2020 / Revised: 27 August 2020 / Accepted: 31 August 2020 / Published: 3 September 2020
(This article belongs to the Collection Sensors for Gait, Human Movement Analysis, and Health Monitoring)

Abstract

Classification of the foot–ground contact phases, as well as the swing phase, is essential in biomechanics domains where lower-limb motion analysis is required; such analysis is used for lower-limb rehabilitation, walking gait analysis and improvement, and exoskeleton motion capture. In this study, sliding-window label overlapping of time-series wearable motion data during training dataset acquisition is proposed to accurately detect the foot–ground contact phases, which comprise three sub-phases, as well as the swing phase, at a frequency of 100 Hz with a convolutional neural network (CNN) architecture. We succeeded in developing a real-time CNN model with a training and test accuracy of 99.8% or higher and confirmed that its validation accuracy was close to 85%.

1. Introduction

Human motion recognition (HMR) is a technology domain that recognizes and distinguishes different types of human activities using sensor data [1]. It is widely used in rehabilitation and medical applications, such as gait-based classification and rehabilitation evaluation of patients with hip osteoarthritis, neurological disorders such as stroke, and Parkinson’s disease [2,3,4,5,6,7,8,9,10,11]. With the development of wearable sensor technology, it has also been used in training assistance, such as exercise coaching through motion tracking and feedback and speed and position tracking in sports training [12,13,14,15,16,17], as well as in sudden fall prevention [18]. This paper describes the development of a foot–ground contact phase classification (FGCC) algorithm, as FGCC is one of the most fundamental and elemental processes in lower-limb motion analysis.
The major sensors used in HMR research can be categorized as cameras, force sensors, and inertial motion sensors. According to Farooq et al. [19], 20 human motions were classified with 74.4% accuracy using an RGB-Depth camera. However, in order to obtain an acceptable quality of three-dimensional (3D) point cloud data (PCD) of the entire human body, which is necessary for algorithm training, the camera must be accurately aligned with the coronal or frontal plane of the human body, and noise such as outliers in the depth map must be eliminated as well. Abellanas et al. [11] and Kim et al. [20] achieved 99.8% and 93.1% accuracy in foot–ground contact detection with force plates and force sensitive resistors (FSRs), respectively. However, as their methods were based on measuring the physical contact between the foot and the ground, lower-limb motion could not be analyzed simultaneously. Qiu et al. [7] and Al-Amri et al. [9] proposed methods for the independent detection of walking, squatting, and jumping using wearable inertial sensors. Although a wearable inertial sensor is very easy to use and has a virtually unlimited measurement workspace [21], acceptable detection accuracy has not been consistently obtained owing to sensor drift and initial calibration issues [22].
In HMR algorithms, both rule-based approaches, which predict classes from threshold ranges of feature data extracted through sensor data analysis, and various neural network paradigms, such as convolutional neural networks (CNNs) and region-based CNNs (R-CNNs), have been utilized. In the studies by Kim et al. [23] and Teufl et al. [24], rule-based classifiers with 99% accuracy were developed after examining the major features of foot–ground contact phase classification through data-driven analysis. Shin et al. [1] developed a human activity classifier based on inertial and altitude sensor data with a long short-term memory (LSTM) architecture; the model was able to classify six static gestures with 99.92% classification accuracy. Similarly, Hsu et al. [25] applied principal component analysis (PCA) and a support vector machine (SVM) to the classification of 10 routine activities and 11 dynamic activities, achieving classification accuracies of 98.23% and 99.55%, respectively. However, they discussed the limitation that the accuracy was greatly affected by the individual datasets and by the number of routine and dynamic activities. Janidarmian et al. [26] classified activities using 293 different machine learning algorithms. Their work used PCA to identify features in data from 70 activities in real environments, allowing for the fact that a wearable acceleration sensor’s attachment position, posture, and the learning algorithm affect the performance of the recognition model. As a result, it was suggested that the human activity of all subjects could be recognized with an average accuracy of 96.44% through K-fold evaluation, and that human activity recognition could be performed with an average accuracy of 79.92% for each subject. To improve on this, Almaslukh et al. [27] proposed a method that recognizes human activity in real time without being affected by the sensor attachment location. Using the RealWorld Human Activity Recognition (HAR) public dataset [28], hyperparameter tuning was performed for optimal learning on a CNN, and eight dynamic activities were classified with 84–88% accuracy. In the study conducted by Um et al. [29], 50 upper-limb resistance exercise movements were recognized with 92.1% accuracy by a CNN, which is unrelated to the sequence of time, instead of by a recurrent neural network suited to time-series data. The dataset they used was the time-series data of an inertial sensor provided by the PUSH Sensor Company; the time-series data were converted into images through the sliding-window method to train the classification model on the CNN. However, as 99% of the exercises ended within 3.92 s, the input image window was set to 3.92 s for all exercises, which limited real-time recognition owing to an inability to distinguish the various phases that make up one action, such as the foot–ground contact phases and the swing phase.
In this study, based on lower-limb wearable inertial sensors and a CNN model, an FGCC algorithm is developed that can recognize the four phases of heel strike (HS), full contact (FC), heel off (HO), and swing (SW) in real time. In order to distinguish multiple phases in real time over very short time intervals based on time-series data of lower-limb motion collected by inertial sensors, it is most important to secure a labeled time-series motion dataset and convert it into neural network (NN) input images. Therefore, in this study, a sliding-window-based label overlapping (SLO) method is proposed to secure an effective labeled time-series motion dataset. The most significant contribution of the proposed method is that it makes it possible to obtain a dataset capable of training a real-time FGCC algorithm with high recognition precision based on an NN structure, without modifying the existing time-series motion data acquisition method. The 13,837 raw time-series datapoints collected directly in this study were expanded to 575,880 through data augmentation and then divided into 60% training and 40% test sets; the performance of the proposed method was verified through actual validation experiments.
In this paper, Section 2 defines the research objective; the experimental equipment, data collection, and data labeling are also explained. Section 3 consists of a description of the data preprocessing and application of the SLO method. In Section 4, the model design, selection of optimal parameters using the Taguchi method and validation are described. Finally, in Section 5, the paper concludes with a discussion of the results, limitations of this study, and future research possibilities.

2. Foot–Ground Contact Phases and Labeling Method

In biomechanics and ergonomics, the walking phases are generally divided into a swing and a stance phase, according to the contact between the foot and the ground. As shown in Figure 1, the stance phase, defined as the foot–ground contact phase in this study, can be subdivided into the following four sub-phases: heel strike, full contact, heel off, and toe off [23,30]. As the goal of this study is to accurately and individually detect these multiple sub-phases only with wearable inertial motion sensors on the lower-limb part, an additional measurement device for labeling the sampled inertial motion sensor data according to the sub-phase should be considered in the training dataset acquisition process.
In this study, FSR-arrayed insoles were fabricated, as shown in Figure 2. Considering the foot pressure distributions in each sub-phase, one FSR sensor was attached to each of the following three parts: the distal phalangeal tuberosity of the first toe, the metatarsophalangeal joint, and the calcaneus [31]. A single-board computer equipped with Bluetooth modules was also assembled onto the FSR-arrayed insole, as shown in Figure 2, so that the lower-limb motion data acquisition and the data labeling process could be achieved simultaneously without any restrictions on the subjects’ walking range. Figure 3 shows how to determine the individual foot–ground contact phase according to the 3-ch FSR measurement result. (Refer to Appendix A for the pseudocode of the four sub-phase labeling process.)
To examine the feasibility of real-time and individual detection of these four foot–ground contact phases, and to identify how many times each sub-phase is detected in one stance phase while walking at a normal pace, a feasibility study, as shown in Figure 4, was performed. To this end, a motion capture system was built to track the walking trajectories and speeds with six OptiTrack Prime 13 vision cameras. The number of detections of each sub-phase at various walking speeds and in various directions was successfully recorded in real time. As a result, as shown in Figure 5, it was confirmed that the walking speed range was 0.2–1.5 m/s and the walking area was within 3 m × 2 m.
Figure 6 shows the results of the feasibility study in terms of the maximum, minimum, and average number of detections per sub-phase. An average of 1.93 toe off phases were detected during one cycle, a very short period corresponding to just 1.59% of a single walk. Because we are going to use a sliding window to extract the wearable motion sensor data, it is expected that the average and minimum number of detections for each phase have a very significant correlation with the sliding-window capture width and the FGCC accuracy. This is why we checked the number of detections of each sub-phase per walk prior to determining the classes. We discuss the effects of the correlation between these two factors in more detail in Section 3.

3. Data Acquisition and Preprocessing


3.1. Training Dataset Acquisition

To obtain labeled walking motion data for the lower limbs, wearable experimental equipment was designed with five wireless inertial measurement unit (IMU) sensors [32] attached to the segments of the lower limbs and a wireless FSR-arrayed insole unit for sub-phase labeling, as shown in Figure 7. An operating console for integrated data collection and preprocessing was installed near the subjects. The IMU sensors, measuring 3-axis orientation, 3-axis angular velocity, and 3-axis acceleration at 100 Hz, were fixed on each foot, each shank, and the waist using rubber straps.
Owing to the nature of wearable sensors, each sensor attached to a lower-limb segment initially has a different position and orientation. However, because IMU sensor output is expressed with respect to its own sensor-fixed coordinate frame, we created a common reference coordinate frame using initial sensor calibration gestures. In this study, the standing–stooping calibration motion resulting from our preceding research [23], as shown in Figure 8, was applied.
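The following NumPy sketch illustrates the general idea of expressing each sensor's readings in a common frame defined by a static calibration pose; it is a simplified stand-in for the two-gesture standing–stooping procedure of [23], and the function and variable names are ours, not from the original implementation.

```python
import numpy as np

def align_to_reference(R_cal, R_t, v_t):
    """Express an IMU sample relative to the orientation captured during a
    static calibration pose.

    R_cal : 3x3 rotation of the sensor at calibration time (sensor -> world)
    R_t   : 3x3 rotation of the sensor at time t (sensor -> world)
    v_t   : 3-vector measured in the sensor frame at time t (e.g., acceleration)
    """
    R_rel = R_cal.T @ R_t      # orientation relative to the calibration pose
    v_ref = R_rel @ v_t        # measurement expressed in the common reference frame
    return R_rel, v_ref
```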
In the operating console, the wearable motion sensor data were integrated with the label from the FSR-arrayed insole as shown in Figure 9. A single integrated message is composed of a timestamp, labels (left FSR, right FSR), and IMU sensor feature data, in that order.

3.2. Data Augmentation

In this study, three subjects, as shown in Figure 10, participated in flat-ground walking experiments at speeds of 0.24–1.37 m/s to collect a labeled walking motion dataset of the lower limbs. Around 14,000 raw labeled datapoints were successfully obtained.
In order to significantly improve the generalization accuracy of the trained models without actually collecting new datasets, white noise was added to the entire raw dataset, as in Equation (1):
N[n] = S[n] ± {max(|S[n]|) − |mean(S[n])|} × 0.1    (1)
where S[n] denotes an n × 1 feature data vector and N[n] denotes the augmented (noisy) result. As a result, the raw data were increased about 25-fold to a total of 359,924 datapoints.
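A minimal NumPy sketch of this augmentation step is given below. The function and parameter names are ours, and the ± term of Equation (1) is realized here as a random sign per sample; the authors' exact noise model may differ.

```python
import numpy as np

def augment_with_noise(S, scale=0.1, copies=25, rng=None):
    """Noise-based augmentation sketch for one feature column S (n x 1),
    following Equation (1): noise amplitude = (max(|S|) - |mean(S)|) * scale."""
    rng = np.random.default_rng() if rng is None else rng
    amplitude = (np.max(np.abs(S)) - np.abs(np.mean(S))) * scale
    noisy_copies = [S + rng.choice([-1.0, 1.0], size=S.shape) * amplitude
                    for _ in range(copies)]
    # Return the original data followed by the noisy copies
    return np.concatenate([S, *noisy_copies])
```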

3.3. Standardization

As shown in Figure 11, because the augmented raw feature dataset still comprises data of various scales, standardization, the process of rescaling one or more features so that they have a mean of 0 and a standard deviation of 1, had to be performed. Suppose this standardization process is not performed before training our models. If the spread of a specific feature is small compared with that of the other features, that feature may be incorrectly evaluated as one that does not contribute to improving classification accuracy, owing to its relatively low sensitivity within the corresponding class.
This is the reason why we performed standardization according to Equations (2) and (3) on the augmented raw feature dataset. Basically, as standardization assumes that the data has a Gaussian distribution, this process is more effective if the distribution of the feature data is Gaussian. It was confirmed that the distribution of our data does not follow an exact Gaussian distribution, but shows a very similar trend.
σ_j = sqrt( (1/N) · Σ_{i=1..N} (x_{i,j} − E(x_j))² )    (2)

x̃_{i,j} = (x_{i,j} − x̄_j) / σ_j    (3)

where N and j denote the total number of feature datapoints and the feature index, respectively; x_{i,j} and E(x_j) = x̄_j represent a feature datapoint and the mean of the j-th feature x_j, respectively; and σ_j and x̃_{i,j} are the standard deviation of x_j and the standardization result, respectively.
The standardization results through Equations (2) and (3) are shown in Figure 12. It could be confirmed that the relative differences in scale of specific feature data within each label were significantly reduced. In addition, it is expected that the distribution pattern of features between labels will show a distinct difference, which will be a positive factor for multiclass classifier learning. (Refer to Appendix B for mean and standard deviation values for all subjects and Appendix C for mean and standard deviation plots for each subject.)
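A minimal NumPy sketch of Equations (2) and (3), applied column-wise to an (N × F) feature matrix, is shown below; a single global mean and standard deviation per feature is assumed here, and the helper name is ours.

```python
import numpy as np

def standardize(X, stats=None):
    """Rescale each feature column to zero mean and unit standard deviation,
    as in Equations (2) and (3). Pass the training-set `stats` when
    standardizing test or validation data so the same scaling is reused."""
    if stats is None:
        mean = X.mean(axis=0)                             # E(x_j)
        sigma = np.sqrt(((X - mean) ** 2).mean(axis=0))   # Equation (2)
    else:
        mean, sigma = stats
    return (X - mean) / sigma, (mean, sigma)              # Equation (3)
```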

3.4. Sliding-Window Label Overlapping Method

In our preceding studies, it was confirmed that the beta angle, also considered the pitch angle, has significant sensitivity in foot–ground contact detection [23] and shows regular changing patterns over time during walking. As shown in Figure 13, the overall patterns of the beta angles of the three subjects show quite similar tendencies, and the difference in values according to the label is very significant. Therefore, if a certain period of these feature data with distinct differences according to labels is extracted and converted into an image, class classification may be possible with an NN architecture such as a CNN.
Based on this, only the features that could contribute to the FGCC were carefully selected, and the corresponding feature data plot over time includes only the right foot and shank motion data, as shown on the left side of Figure 14. Figure 14 shows the entire process of the SLO method. The width of the sliding window can be considered the desired time span of the time-series feature data to be extracted. If the sliding-window width and the sampling frequency are set to 14 and 100 Hz, respectively, a finite-horizon sliding window containing 14 samples of labeled feature data shifts right every 10 ms; hence the name sliding-window label overlapping. It is important to note that the feature data extracted with SLO into a 22 (height) × 14 (width) window may contain several different labels of the FGCC. As mentioned earlier, because label overlapping inevitably occurs due to the nature of the time-series data obtained in the stance phase, one sliding window may include one to three labels. However, because it is very rare for three different labels to appear at the same time, a sliding window including three different labels is regarded as an outlier. That is, most sliding windows including the TO phase are outliers, because the TO phase is detected only 1.93 times on average during one walking step and lies in the phase adjacent to HO and SW. Therefore, TO is integrated into HO, which has a similar tendency to SW in the phase adjacent to TO. The pseudocode in Appendix D describes in more detail how to assign a label to the label-overlapped sliding window and how to convert the window into an image. The mat2gray function in MATLAB is used to convert the NN input data to grayscale images in this study [33].
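As a rough NumPy sketch of the windowing and label-assignment steps just described (and detailed in Appendix D), assuming the labeled time series is stored as a (T × 22) feature array and a length-T label vector; the min–max rescaling stands in for MATLAB's mat2gray, and the helper names are ours.

```python
import numpy as np

def slo_windows(features, labels, width=14, ratio=0.5):
    """Slide a window over the time series and assign one label per window:
    the label shared by the trailing `ratio` fraction of the window
    (sliding-window label overlapping). Windows whose trailing labels
    disagree are skipped, mirroring Algorithm A2."""
    n_tail = max(1, int(round(width * ratio)))   # e.g., 4, 7, or 10 samples for 30/50/70%
    images, window_labels = [], []
    for start in range(features.shape[0] - width + 1):
        window = features[start:start + width]                 # width x 22
        tail = labels[start + width - n_tail:start + width]
        if np.all(tail == tail[0]):                             # trailing labels agree
            span = window.max() - window.min()
            img = (window - window.min()) / (span + 1e-12)      # mat2gray-like [0, 1]
            images.append(img.T)                                # 22 (height) x 14 (width)
            window_labels.append(int(tail[0]))
    return np.stack(images), np.array(window_labels)
```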

3.5. Validation Dataset Acquisition

In addition to the training dataset, data from a new subject were collected to examine the validity of the trained model, as shown in Figure 15. The data were collected on a 49.6 m long flat walkway at speeds of 1.21–1.37 m/s; the collected data were preprocessed using the same method described earlier, except for the data augmentation (refer to Appendix B for the standard deviation and mean values of each label), to generate a total of 1506 validation images.

4. CNN Model for Real-Time FGCC

The structure of the CNN in this study was designed as shown in Figure 16. It consists of three consecutive convolutional layers and a fully connected layer at the end. Each convolutional layer internally performs convolution and pooling repeatedly to produce various feature maps from the input image. The initial hyperparameters are presented in Table 1, and the learning rate is set to 0.001 as the default value. The ReduceLROnPlateau callback was used to lower the learning rate when the loss did not improve, so that local minima could be escaped. In addition, to prevent overfitting, the EarlyStopping callback was used to stop training when the test loss no longer improved.
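A minimal Keras sketch consistent with Figure 16 and Table 1 (three convolution and average-pooling blocks, a fully connected head, drop-out 0.3, stride 1, and ReLU) is given below. The filter count, the dense-layer width, and the choice of the Adam optimizer are assumptions of ours, as they are not specified in the text.

```python
import tensorflow as tf
from tensorflow.keras import layers, models, callbacks

def build_fgcc_cnn(input_shape=(22, 14, 1), n_classes=4,
                   filter_size=(3, 3), n_filters=16):
    """CNN sketch for FGCC: three conv + average-pooling blocks and a dense head.
    n_filters and the dense width are placeholders, not values from the paper."""
    model = models.Sequential([layers.Input(shape=input_shape)])
    for _ in range(3):
        model.add(layers.Conv2D(n_filters, filter_size, strides=1,
                                padding="same", activation="relu"))
        model.add(layers.AveragePooling2D(pool_size=(2, 2), padding="same"))
    model.add(layers.Flatten())
    model.add(layers.Dropout(0.3))
    model.add(layers.Dense(64, activation="relu"))
    model.add(layers.Dense(n_classes, activation="softmax"))
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
                  loss="sparse_categorical_crossentropy", metrics=["accuracy"])
    return model

# Callbacks mentioned in the text; patience values are assumptions.
cbs = [callbacks.ReduceLROnPlateau(monitor="val_loss", factor=0.5, patience=5),
       callbacks.EarlyStopping(monitor="val_loss", patience=20,
                               restore_best_weights=True)]
# model.fit(x_train, y_train, validation_data=(x_test, y_test),
#           batch_size=4000, epochs=10000, callbacks=cbs)
```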

Sensitivity Analysis

As CNNs were originally developed for image recognition [34], it is important to select the optimal hyperparameters that affect the learning accuracy for the images used. One of the input images in this study is shown in Figure 17; certain features exhibit a regular gray gradient over time.
Based on observations of several input images, it is reasonable to expect that the filter in the convolution layer will act as an important factor in extracting the features of the image. It is also expected that the label overlapping ratio, which determines how many past datapoints are used to encode the current label, will play an important role in improving FGCC accuracy. This is why we performed a sensitivity analysis of these two major parameters to find their optimal combination.
In this study, level average analysis using the Taguchi method was applied to examine the individual sensitivity of the parameters, namely the width and height of the convolution filter and the label overlapping ratio; every combination was evaluated in terms of the training, test, and validation accuracies, which are so-called larger-the-better indices. The three 3-level parameters to be examined are shown in Table 2, and an L9(3^3) orthogonal array is shown in Table 3 together with the training, test, and validation accuracies for each parameter combination.
Of the entire dataset of about 360,000 images, 60% were used for training and 40% for testing, and the 1506 validation datapoints were predicted with each model obtained from the orthogonal array combinations. After training along the orthogonal array, the level average analysis, shown in Figure 18, confirmed that the SLO ratio was the parameter with the greatest effect on learning and validation accuracy, and that all three parameters showed the highest accuracy at their first level.
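As a small worked example of the level average analysis, the sketch below groups the validation accuracies of Table 3 by factor level and averages them; the same grouping-and-averaging is applied to the training and test accuracies.

```python
import numpy as np

# L9(3^3) design from Table 3: factor levels for SLO ratio, filter width,
# filter height, and the validation accuracy of each run [%].
design = np.array([[1, 1, 1], [1, 2, 2], [1, 3, 3],
                   [2, 1, 2], [2, 2, 3], [2, 3, 1],
                   [3, 1, 3], [3, 2, 1], [3, 3, 2]])
val_acc = np.array([84.80, 79.40, 79.29, 78.23, 76.96,
                    77.55, 78.77, 81.41, 74.92])

for col, name in enumerate(["SLO ratio", "filter width", "filter height"]):
    # Level average: mean response over the runs at each level of this factor
    means = [val_acc[design[:, col] == lvl].mean() for lvl in (1, 2, 3)]
    print(name, [round(m, 2) for m in means])
# For every factor, level 1 gives the highest mean validation accuracy,
# consistent with the first combination being selected.
```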
As a result of the level average analysis, the best combination of the parameter levels was confirmed as being the first combination in training with a batch size of 4000 and 10,000 epochs. The feature map created by the filter for each layer at this combination is presented in Appendix E.
Figure 19a shows the loss of the model: the final loss for the training set was 0.000954 and that for the test set was 0.005478. In addition, Figure 19b shows the accuracy of the model, with a final value of 0.9997 for the training set and 0.9984 for the test set.
In addition, the validation data were predicted with the trained model, and the four phases were classified with an average accuracy of 84.80%. Detailed accuracy results by label are shown in Table 4.

5. Results and Discussion

In this study, the SLO method was proposed to accurately detect the foot–ground contact phases, composed of three sub-phases, as well as the swing phase, using only wearable motion sensors attached to the foot and shank. We succeeded in developing a CNN model with a training and test accuracy of 99.8% or more and confirmed that its validation accuracy was close to 85%.
In particular, whereas many previous studies did not consider overlapping labels in sliding-window-based time-series data capture, this study shows that FGCC via a CNN at a rate of 100 Hz can be realized with the proposed SLO method. Studies without label overlapping have significant disadvantages in terms of real-time monitoring and reliability, as they can only be used in limited situations. In this study, to overcome these shortcomings, a sliding-window method was applied, which opens wider fields of application and research.
However, more diverse studies are needed to verify the data augmentation method utilized in this study. Although applying noise of the kind generated by the sensors was sufficiently useful, there was a limitation in that disturbances or deformations generated while walking could not be reproduced. In future studies, it is necessary to investigate various methods for improving classification accuracy in the real world through sensor fusion of EMG [35], IMU, and other signals, as well as data augmentation.

Author Contributions

This work was carried out in collaboration among all authors. D.L. provided supervision and guidance in the sensor data calibration, data acquisition, data processing, and model learning. H.J., S.L.K., and S.K. carried out the sensor data acquisition, data mining, and learning simulation. H.J. and S.K. also validated the performance of the resultant CNN model. All authors participated in the original draft preparation. D.L. helped H.J. improve the quality of the work. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded and conducted under the Competency Development Program for Industry Specialists of the Korean Ministry of Trade, Industry and Energy (MOTIE), operated by Korea Institute for Advancement of Technology (KIAT). (No. P0002397, HRD program for Industrial Convergence of Wearable Smart Devices).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Algorithm A1 Pseudocode of the four sub-phase labeling process
Interrupt Service Routine (every 10 ms)
  FSR0 = (read FSR0 > 10 kgf)
  FSR1 = (read FSR1 > 10 kgf)
  FSR2 = (read FSR2 > 10 kgf)
  if (FSR0 = False and FSR1 = False and FSR2 = True)
    Output = Heel Strike
  else if (FSR0 = False and FSR1 = True and FSR2 = True) or (FSR0 = True and FSR1 = False and FSR2 = True)
    Output = Full Contact
  else if (FSR0 = True and FSR1 = True and FSR2 = False)
    Output = Heel Off
  else if (FSR0 = True and FSR1 = False and FSR2 = False)
    Output = Toe Off
  end

Appendix B

Table A1. Means and standard deviations of every feature of each label (values shown as mean (±SD)). Columns: orientation (Yaw α, Pitch β, Roll γ), acceleration (ax, ay, az), angular velocity (wx, wy, wz), and rate of Euler angle (α̇, β̇).

Label 2
Segment | Yaw (α) | Pitch (β) | Roll (γ) | ax | ay | az | wx | wy | wz | α̇ | β̇
Foot | −0.1551 (±0.4065) | −0.0377 (±0.3778) | −0.1357 (±1.0435) | −0.0044 (±0.6404) | 0.0074 (±0.2723) | 0.0367 (±0.2364) | 0.0494 (±0.4883) | 0.0514 (±0.6035) | −0.2059 (±0.5320) | −0.00019 (±0.6639) | 0.1616 (±0.3891)
Shank | 0.3824 (±0.5310) | 0.2539 (±0.4975) | −0.0946 (±1.0713) | 0.4032 (±0.6129) | −0.3422 (±0.4613) | 0.0198 (±0.3701) | 0.3205 (±0.4281) | 0.3953 (±0.4091) | −0.2978 (±0.4173) | 0.1115 (±0.4251) | 0.4286 (±0.2765)

Label 3
Segment | Yaw (α) | Pitch (β) | Roll (γ) | ax | ay | az | wx | wy | wz | α̇ | β̇
Foot | −0.3302 (±0.4323) | −0.2600 (±0.3925) | 0.0593 (±0.8412) | −0.0069 (±0.8265) | −0.0069 (±0.6012) | 0.2146 (±0.5781) | 0.0421 (±0.5644) | −0.0461 (±0.4672) | −0.0828 (±0.6505) | 0.0063 (±0.8427) | 0.1086 (±0.4537)
Shank | −0.2768 (±0.5474) | −0.4322 (±0.4090) | 0.2467 (±0.8486) | −0.2761 (±0.8104) | 0.2586 (±0.8352) | 0.0909 (±0.7499) | 0.3244 (±0.5664) | 0.2431 (±0.3592) | −0.1093 (±1.0977) | −0.0776 (±0.9521) | 0.2585 (±0.3217)

Label 4
Segment | Yaw (α) | Pitch (β) | Roll (γ) | ax | ay | az | wx | wy | wz | α̇ | β̇
Foot | −0.5414 (±0.6849) | −0.9097 (±0.6816) | 0.0868 (±1.0554) | 0.0475 (±1.4317) | −0.0921 (±1.5240) | −0.4572 (±1.5838) | 0.1292 (±0.9869) | 0.0885 (±0.8901) | 0.4093 (±1.0212) | −0.0021 (±0.9594) | 0.0802 (±1.0908)
Shank | −1.0112 (±0.6641) | −1.0129 (±0.5170) | −0.1106 (±0.9957) | −0.6221 (±0.8928) | 0.5041 (±1.0838) | −0.1153 (±1.8441) | −0.0473 (±1.1490) | −0.1010 (±0.8285) | 0.6609 (±1.1596) | 0.3421 (±1.0326) | −0.1097 (±0.9633)

Label 5
Segment | Yaw (α) | Pitch (β) | Roll (γ) | ax | ay | az | wx | wy | wz | α̇ | β̇
Foot | 1.0371 (±1.3766) | 1.1429 (±1.1608) | 0.0018 (±1.0502) | −0.0273 (±1.0757) | 0.0821 (±1.2921) | 0.0955 (±1.1537) | −0.2158 (±1.6274) | −0.0773 (±1.6817) | −0.0241 (±1.5070) | −0.0058 (±1.4361) | −0.3793 (±1.6018)
Shank | 0.7715 (±1.1886) | 1.1079 (±1.0357) | −0.0841 (±1.0382) | 0.4077 (±1.2366) | −0.3574 (±1.2485) | −0.0321 (±0.6482) | −0.7134 (±1.3111) | −0.6467 (±1.6110) | −0.0981 (±0.9490) | −0.3254 (±1.3263) | −0.6997 (±1.5509)

Appendix C

Box plots of all features of every subject who participated in this study after the standardization process.
Figure A1. (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure A2. (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure A3. (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure A4. (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure A5. Pitch angles of foot and shank of subject 4.

Appendix D

Algorithm A2 Pseudocode for assigning the representative label to a single window composed of multiple labels
function SLO(entire labeled dataset M (n × 23; column 1 = label, columns 2–23 = 22 features),
       SLO_width = 14, SLO_pitch = 1, SLO_ratio = N)
  for i = 1 : SLO_pitch : (size(M,1) − SLO_width + 1)
    slo_image(:,:,i) = transpose(M(i : i+SLO_width−1, 2:23))   % 22 × 14 feature window
    slo_label(1,:,i) = transpose(M(i : i+SLO_width−1, 1))      % 1 × 14 label sequence
  end
  for i = 1 : 1 : size(slo_image, 3)
    if all(slo_label(1, 14−(N−1):14, i) == 5)
      save mat2gray(slo_image(:,:,i)) to the folder "L_5"
    else if all(slo_label(1, 14−(N−1):14, i) == 4)
      save mat2gray(slo_image(:,:,i)) to the folder "L_4"
    else if all(slo_label(1, 14−(N−1):14, i) == 3)
      save mat2gray(slo_image(:,:,i)) to the folder "L_3"
    else if all(slo_label(1, 14−(N−1):14, i) == 2)
      save mat2gray(slo_image(:,:,i)) to the folder "L_2"
    end
  end
end

Appendix E

Figure A6. Resultant feature map obtained with the 1st hyper parameter combination in Table 3.

References

1. Shin, S.; Cha, J. Human activity recognition system using multimodal sensor and deep learning based on LSTM. Trans. Korean Soc. Mech. Eng. 2018, 42, 111–121.
2. Horst, F.; Lapuschkin, S.; Samek, W.; Müller, K.-R.; Schöllhorn, W.I. What is Unique in Individual Gait Patterns? Understanding and Interpreting Deep Learning in Gait Analysis. arXiv 2018, arXiv:1808.04308.
3. Zügner, R.; Tranberg, R.; Lisovskaja, V.; Kärrholm, J. Different reliability of instrumented gait analysis between patients with unilateral hip osteoarthritis, unilateral hip prosthesis and healthy controls. BMC Musculoskelet. Disord. 2018, 19, 224.
4. Hsu, W.-C.; Sugiarto, T.; Lin, Y.-J.; Yang, F.-C.; Lin, Z.-Y.; Sun, C.-T.; Hsu, C.-L.; Chou, K.-N. Multiple-Wearable-Sensor-Based Gait Classification and Analysis in Patients with Neurological Disorders. Sensors 2018, 18, 3397.
5. Caramia, C.; Torricelli, D.; Schmid, M.; Munoz-Gonzalez, A.; Gonzalez-Vargas, J.; Grandas, F.; Pons, J.L. IMU-Based Classification of Parkinson’s Disease from Gait: A Sensitivity Analysis on Sensor Location and Feature Selection. IEEE J. Biomed. Health Inform. 2018, 22, 1765–1774.
6. Zhao, H.; Wang, Z.; Qiu, S.; Shen, Y.; Wang, J. IMU-based gait analysis for rehabilitation assessment of patients with gait disorders. In Proceedings of the 2017 4th International Conference on Systems and Informatics (ICSAI), Hangzhou, China, 11–13 November 2017; pp. 622–626.
7. Qiu, S.; Liu, L.; Zhao, H.; Wang, Z.; Jiang, Y. MEMS Inertial Sensors Based Gait Analysis for Rehabilitation Assessment via Multi-Sensor Fusion. Micromachines 2018, 9, 442.
8. Yang, C.-C.; Hsu, Y.-L. A Review of Accelerometry-Based Wearable Motion Detectors for Physical Activity Monitoring. Sensors 2010, 10, 7772–7788.
9. Al-Amri, M.; Nicholas, K.; Button, K.; Sparkes, V.; Sheeran, L.; Davies, J. Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity. Sensors 2018, 18, 719.
10. Lim, C.K.; Luo, Z.; Chen, I.-M.; Yeo, S.H. Wearable wireless sensing system for capturing human arm motion. Sens. Actuators A Phys. 2011, 166, 125–132.
11. Abellanas, A.; Frizera, A.; Ceres, R.; Gallego, J.A. Estimation of gait parameters by measuring upper limb–walker interaction forces. Sens. Actuators A Phys. 2010, 162, 276–283.
12. Ofli, F.; Kurillo, G.; Obdrzalek, S.; Bajcsy, R.; Jimison, H.B.; Pavel, M. Design and Evaluation of an Interactive Exercise Coaching System for Older Adults: Lessons Learned. IEEE J. Biomed. Health Inf. 2016, 20, 201–212.
13. Yuan, Q.; Chen, I.-M. Localization and velocity tracking of human via 3 IMU sensors. Sens. Actuators A Phys. 2014, 212, 25–33.
14. Yuan, Q.; Chen, I.-M. Human velocity and dynamic behavior tracking method for inertial capture system. Sens. Actuators A Phys. 2012, 183, 123–131.
15. Zhang, J.; Cao, Y.; Qiao, M.; Ai, L.; Sun, K.; Mi, Q.; Zang, S.; Zuo, Y.; Yuan, X.; Wang, Q. Human motion monitoring in sports using wearable graphene-coated fiber sensors. Sens. Actuators A Phys. 2018, 274, 132–140.
16. Vu, C.C.; Kim, J. Human motion recognition using SWCNT textile sensor and fuzzy inference system based smart wearable. Sens. Actuators A Phys. 2018, 283, 263–272.
17. King, K.; Yoon, S.W.; Perkins, N.C.; Najafi, K. Wireless MEMS inertial sensor system for golf swing dynamics. Sens. Actuators A Phys. 2008, 141, 619–630.
18. Martínez-Villaseñor, L.; Ponce, H.; Espinosa-Loera, R.A. Multimodal Database for Human Activity Recognition and Fall Detection. Proceedings 2018, 2, 1237.
19. Farooq, A.; Won, C.S. A Survey of Human Action Recognition Approaches that use an RGB-D Sensor. IEIE Trans. Smart Process. Comput. 2015, 4, 281–290.
20. Kim, H.; Kang, Y.; Valencia, D.R.; Kim, D. An Integrated System for Gait Analysis Using FSRs and an IMU. In Proceedings of the 2018 Second IEEE International Conference on Robotic Computing (IRC), Laguna Hills, CA, USA, 31 January–2 February 2018; pp. 347–351.
21. Wu, D.; Wang, Z.; Chen, Y.; Zhao, H. Mixed-kernel based weighted extreme learning machine for inertial sensor based human activity recognition with imbalanced dataset. Neurocomputing 2016, 190, 35–49.
22. Woodman, O.J. An introduction to inertial navigation. Citado 2007, 2, 19.
23. Kim, M.; Lee, D. Development of an IMU-based foot–ground contact detection (FGCD) algorithm. Ergonomics 2017, 60, 384–403.
24. Teufl, W.; Lorenz, M.; Miezal, M.; Taetz, B.; Fröhlich, M.; Bleser, G. Towards Inertial Sensor Based Mobile Gait Analysis: Event-Detection and Spatio-Temporal Parameters. Sensors 2018, 19, 38.
25. Hsu, Y.-L.; Yang, S.-C.; Chang, H.-C.; Lai, H.-C. Human Daily and Sport Activity Recognition Using a Wearable Inertial Sensor Network. IEEE Access 2018, 6, 31715–31728.
26. Janidarmian, M.; Roshan Fekr, A.; Radecka, K.; Zilic, Z. A Comprehensive Analysis on Wearable Acceleration Sensors in Human Activity Recognition. Sensors 2017, 17, 529.
27. Almaslukh, B.; Artoli, A.; Al-Muhtadi, J. A Robust Deep Learning Approach for Position-Independent Smartphone-Based Human Activity Recognition. Sensors 2018, 18, 3726.
28. Sztyler, T.; Stuckenschmidt, H. On-body localization of wearable devices: An investigation of position-aware activity recognition. In Proceedings of the 2016 IEEE International Conference on Pervasive Computing and Communications (PerCom), Sydney, NSW, Australia, 14–19 March 2016; pp. 1–9.
29. Um, T.T.; Babakeshizadeh, V.; Kulic, D. Exercise motion classification from large-scale wearable sensor data using convolutional neural networks. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 2385–2390.
30. Tiberio, D. The Effect of Excessive Subtalar Joint Pronation on Patellofemoral Mechanics: A Theoretical Model. J. Orthop. Sports. Phys. 1987, 9, 160–165.
31. Available online: https://sites.google.com/a/mdex.co.kr/mdex-sensor-info-2017/online-shop-eng/ra12p (accessed on 1 September 2020).
32. Paulich, M.; Schepers, M.; Rudigkeit, N.; Bellusci, G. Xsens MTw Awinda: Miniature Wireless Inertial-Magnetic Motion Tracker for Highly Accurate 3D Kinematic Applications; Xsens: Enschede, The Netherlands, 2018.
33. MathWorks Home Page. Available online: https://www.mathworks.com/help/images/ref/mat2gray.html?s_tid=srchtitle (accessed on 1 September 2020).
34. LeCun, Y.; Boser, B.; Denker, J.S.; Henderson, D.; Howard, R.E.; Hubbard, W.; Jackel, L.D. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989, 1, 541–551.
35. Toledo-Pérez, D.C.; Martínez-Prado, M.A.; Gómez-Loenzo, R.A.; Paredes-García, W.J. A study of movement classification of the lower limb based on up to 4-EMG channels. Electronics 2019, 8, 259.
Figure 1. Foot–ground contact phase definition: swing, heel strike, full contact, heel off, and toe off.
Figure 2. FSR-arrayed insole for dataset labeling of the four sub-phases in the stance phase.
Figure 3. Four sub-phase labeling criteria according to the 3-ch FSR measurement result.
Figure 4. Experiment environment for data acquisition of phase detection in a motion capture area.
Figure 5. Walking trajectories (a) and speeds (b) of every subject measured with six OptiTrack Prime 13 cameras.
Figure 6. Results of the feasibility study in terms of numbers of the maximum, minimum, and average detections per sub-phase.
Figure 7. Configuration of the designed wearable experimental equipment for labeled lower-limb walking motion dataset acquisition.
Figure 8. Procedure of standing–stooping calibration motion for creating a common sensor-fixed reference coordinate frame.
Figure 9. Low-level communication protocol: timestamp, labels, and inertial motion sensor data.
Figure 10. Experimental environment and subject information.
Figure 11. Box plot results of the augmented raw feature dataset with mean, standard deviation for each label, (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure 12. Box plot results of the feature dataset after standardization with mean, standard deviation for each label, (a) label 2, (b) label 3, (c) label 4, (d) label 5.
Figure 13. Pitch angles of foot and shank of three subjects.
Figure 14. Sliding-window label overlapping method.
Figure 15. Environment and additional subject for acquisition of the validation dataset.
Figure 16. Four label classification CNN architecture.
Figure 17. Sample input image.
Figure 18. Results of the level average analysis of three major hyper parameters in terms of training, test, and validation accuracy.
Figure 19. Results of the CNN model training: (a) model loss, (b) model accuracy.
Table 1. Input parameters and hyper parameters of CNN model.

SLO Width | Image Shape | Pooling Method | Layer No. | Filter Size | Drop-out Rate | Stride Width | Activation Function
14 | 22 × 14 | Average-Pooling | 3 | m × n | 0.3 | 1 | ReLU
Table 2. 3-level parameter table including SLO ratio and size of the convolution filter.

Level | SLO Ratio [%] | Filter Width | Filter Height
1 | 30 (4) | 3 | 3
2 | 50 (7) | 5 | 5
3 | 70 (10) | 7 | 7
Table 3. Orthogonal array of L9(3^3) with training, test and validation accuracies by combination.

No. | SLO Ratio | Filter Width | Filter Height | Train Acc [%] | Test Acc [%] | Val Acc [%]
1 | 1 | 1 | 1 | 99.97 | 99.84 | 84.80
2 | 1 | 2 | 2 | 100 | 99.88 | 79.40
3 | 1 | 3 | 3 | 99.99 | 99.86 | 79.29
4 | 2 | 1 | 2 | 99.90 | 99.48 | 78.23
5 | 2 | 2 | 3 | 99.87 | 99.33 | 76.96
6 | 2 | 3 | 1 | 99.86 | 99.35 | 77.55
7 | 3 | 1 | 3 | 99.99 | 99.49 | 78.77
8 | 3 | 2 | 1 | 99.99 | 99.51 | 81.41
9 | 3 | 3 | 2 | 99.90 | 99.32 | 74.92
Table 4. Result of experimental validation set data [%].

Phase | SW | HS | FC | HO | Total
Accuracy | 82.27 | 81.61 | 82.12 | 93.18 | 84.80
