Next Article in Journal
Laser Velocimetry for the In Situ Sensing of Deep-Sea Hydrothermal Flow Velocity
Next Article in Special Issue
Salient Arithmetic Data Extraction from Brain Activity via an Improved Deep Network
Previous Article in Journal
On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization
Previous Article in Special Issue
Deep Learning for Detecting Multi-Level Driver Fatigue Using Physiological Signals: A Comprehensive Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Instrumented Gait Classification Using Meaningful Features in Patients with Impaired Coordination

by
Zeus T. Dominguez-Vega
1,
Mariano Bernaldo de Quiros
1,
Jan Willem J. Elting
1,
Deborah A. Sival
2 and
Natasha M. Maurits
1,*
1
Department of Neurology, University Medical Center Groningen, University of Groningen, 9713 GZ Groningen, The Netherlands
2
Department of Paediatrics, Beatrix Children’s Hospital, University Medical Center Groningen, University of Groningen, 9713 GZ Groningen, The Netherlands
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(20), 8410; https://doi.org/10.3390/s23208410
Submission received: 22 August 2023 / Revised: 29 September 2023 / Accepted: 9 October 2023 / Published: 12 October 2023
(This article belongs to the Special Issue Biomedical Signal Processing and Health Monitoring Based on Sensors)

Abstract

:
Early onset ataxia (EOA) and developmental coordination disorder (DCD) both affect cerebellar functioning in children, making the clinical distinction challenging. We here aim to derive meaningful features from quantitative SARA-gait data (i.e., the gait test of the scale for the assessment and rating of ataxia (SARA)) to classify EOA and DCD patients and typically developing (CTRL) children with better explainability than previous classification approaches. We collected data from 18 EOA, 14 DCD and 29 CTRL children, while executing both SARA gait tests. Inertial measurement units were used to acquire movement data, and a gait model was employed to derive meaningful features. We used a random forest classifier on 36 extracted features, leave-one-out-cross-validation and a synthetic oversampling technique to distinguish between the three groups. Classification accuracy, probabilities of classification and feature relevance were obtained. The mean classification accuracy was 62.9% for EOA, 85.5% for DCD and 94.5% for CTRL participants. Overall, the random forest algorithm correctly classified 82.0% of the participants, which was slightly better than clinical assessment (73.0%). The classification resulted in a mean precision of 0.78, mean recall of 0.70 and mean F1 score of 0.74. The most relevant features were related to the range of the hip flexion–extension angle for gait, and to movement variability for tandem gait. Our results suggest that classification, employing features representing different aspects of movement during gait and tandem gait, may provide an insightful tool for the differential diagnoses of EOA, DCD and typically developing children.

1. Introduction

Movement coordination is an ability that allows smooth and efficient goal-directed movements involving several body parts. Due to its complexity, a coordinated gait requires the interaction between different body parts, various muscles to maintain balance, sensory inputs and proprioception [1]. The cerebellum is an essential part of the coordination and planning of complex movements. It is well recognized that the cerebellum regulates postural equilibrium and muscle tone by interacting with the brainstem, basal ganglia, and cerebral cortex. The cerebellum also contributes to the cognitive aspects of postural control such as the maintenance of postural verticality and anticipatory postural adjustment [2]. In particular, gait is an ability that starts to develop during the first year of life but maturity is only reached around 11 years of age [3]. The maturity and development of coordinated gait movements are affected by movement and developmental disorders such as dystonia, early onset ataxia (EOA) and developmental coordination disorder (DCD). In this study, we focus on the distinction between EOA, DCD and typically developing (CTRL) children, based on gait.
Ataxia presenting before 25 years of age is denominated as early onset ataxia (EOA). EOA involves lack of coordination, impaired voluntary and goal-directed movements and loss of balance control [4]. Distinguishing clinical features in EOA patients include dysdiadochokinesia, dysmetria, overshoot, impaired gait and posture, intention tremor, oculomotor dysfunction and speech abnormalities [5]. The scale for the assessment and rating of ataxia (SARA) is one of the clinical tools commonly used to assess ataxia. In patients with movement disorders, the SARA was designed to identify and quantify ataxic characteristics [6].
Developmental coordination disorder (DCD) is defined as an impairment in the coordination and execution of motor function which affects the child’s academic or social development, while no underlying neurological diagnosis or intellectual deficits, such as cerebral palsy, muscular dystrophy, visual impairment or intellectual disability, are present [7,8]. Mostly, the features consistent with DCD are identified by the parents when the children present delays in gross or fine motor milestones [8]. Some of the most common symptoms in DCD patients are abnormal coordination, having motor coordination below expectations for their age as well as delays in early motor milestones, such as walking and crawling.
Patients with EOA and DCD may thus present with an overlap in clinical characteristics, hampering the clinical distinction between both [5]. Reliable phenotypic recognition of EOA among other developmental disorders with coordination impairment, such as DCD, in relation to typically developing children is important for selecting the correct diagnostic algorithm, predicting familial recurrence risk, treating the child and/or, in case of typical development of an immature motor system, consolidation of the parents [5]. However, in the context of ambiguous clinical descriptions and in the absence of clear clinical criterion standards, the clinical distinction between mildly initiating EOA features and DCD may be challenging [5]. In the latter study, 5 out of 21 EOA and DCD patients were inhomogeneously phenotyped by three paediatric neurologists. The phenotypic interobserver agreements for the EOA, DCD and central hypotonia (sub)groups (Gwet’s agreement coefficient) were: EOA = 0.801 (p < 0.001; substantial); DCD = 0.327 (p = 0.037; fair); central hypotonia = 0.415 (p = 0.005; moderate).
Several earlier studies compared the gait of typically developing children with DCD [9,10,11] or with ataxia patients [12,13,14]. However, to our knowledge, few studies have aimed to achieve the clinically more relevant goal of distinguishing between EOA, DCD and the immature motor behaviour of typically developing children, in a single study [15,16]. The reason for this may be that DCD is typically diagnosed by rehabilitation doctors, whereas EOA is typically diagnosed by paediatric neurologists. Since both patient groups are thus examined and treated by different specialists, a direct comparison between the groups may be hampered. In a recent study, DCD and typically developing children were asked to walk barefoot on a motor driven treadmill for 2 min, while their gait was assessed using a 3D motion capture system. The authors found that children with DCD exhibited more complexity in their shank movements and also greater variability in thigh and shank movements compared to age- and gender-matched typically developing participants [9]. In another study, DCD and typically developing children were asked to walk up and down a flat 10-m-long pathway for 1 min, while the movement of their feet and trunk was recorded using motion analysis. The gait pattern of children with DCD was characterised by higher variability and an increased range of movement compared to their healthy peers [17]. In a previous study from our own group, we aimed to distinguish between patients with EOA or DCD and typically developing children [15]. Using inertial measurement units (IMUs), we collected data while the participants walked independently, and derived time and frequency domain-based, as well as statistical, information. Employing this information, we improved the classification performance compared to three expert evaluators. Although we achieved good accuracy, explainability of this classification model was low, i.e., we could not directly relate the distinguishing features to the clinical construct of impaired coordination. Knowing which interpretable features contribute most to classification is informative for doctors assessing these patients, as they may expose new, unstudied parameters distinguishing between the different groups. We have shown before, for upper limb SARA tasks, that informing clinicians of such distinguishing parameters, can help improve phenotypic assessment of EOA and DCD [5].
In the present study, we therefore aimed to provide meaningful features as derived from the quantitative data of the two SARA gait tests to classify EOA and DCD patients and typically developing CTRL children. Our goal was to obtain features with better explainability on the one hand, while achieving similar or even better accuracy, on the other hand. To do so, we developed our features on the clinical premise that impaired gait coordination is characterised by irregularities in movement and loss of smoothness. We extracted gait features from IMU data collected during both SARA gait tests, accordingly.

2. Materials and Methods

The raw data used in this study consist of a data subset recorded for a larger project on the quantification of coordination impairment employing the SARA [15,16,18,19]. We followed the research and integrity codes of the University Medical Center Groningen (UMCG) and the principles of the declaration of Helsinki (2013). The SARA test battery is performed as part of clinical routine, therefore the Medical Ethical Committee of the UMCG provided a waiver for ethical approval of the study. It was also recognized that the attachment of IMUs to the participant’s body is non-invasive. Finally, parents and children 18 years or older were asked to sign informed consent and informed assent was given by minors 12 years or older.

2.1. Participants

We recruited participants between 2014 and 2019. EOA and DCD patients were asked to participate during routine visits to the outpatient clinic of the UMCG. All DCD patients received an independent neurological examination at the outpatient clinic to ensure that no underlying neurological or intellectual deficits were present. Both EOA and DCD patients fulfilled the official inclusion criteria [7,20]. Healthy participants were siblings of the included patients. They were declared to be healthy by their parents and had no other neurological or orthopaedic diagnosis that could theoretically interfere with coordinated motor performance. Intending to balance age, we tried to match the age of the participants in the three groups. None of the included children received medication with known negative side effects on motor coordination. Participants were excluded when they were unable to execute the SARA lower limb gait tests independently.

2.2. Clinical Assessment

We recorded gait and tandem gait SARA test performances using IMUs. During the gait test, participants were asked to walk independently for six meters, turn and walk back the same distance. The tandem gait test consisted of ten independently executed consecutive tandem steps.

2.3. Data Acquisition

Before the performance of both SARA gait tests, the participants were fitted with six lightweight IMUs (51 mm × 34 mm × 14 mm, 23.6 g, Shimmer3, Shimmer, Dublin, Ireland): one IMU was placed on the sternum, another one on the lower back close to the L3 vertebra, two were placed bilaterally halfway each upper leg over the quadriceps and two on the lateral side of the shanks, just above the malleolus. This set-up was chosen to enable various analyses including joint kinematics analysis during SARA motor test performance. For the goal of this study, only the data from a subset of IMUs were used.
Before attachment, the six Shimmer IMUs were calibrated and programmed to record inertial data into embedded SD cards. We configured each IMU to acquire 3D acceleration (±4 G), 3D angular velocity (±500 dps) and 3D magnetic data (±1.9 Ga), at a sampling frequency of 256 Hz. Additionally, we videotaped the start of recording by pressing a button on the multi-charger (Shimmer sensing, Dublin, Ireland) which resets the internal timer of each device so all devices, as well as full SARA gait test execution, are time-synchronised. This allowed later identification and delineation of each of the gait sequences, i.e., forward walking and turning.

2.4. Signal Preprocessing

We obtained six files per patient (one per Shimmer device), containing 3D inertial data. Subsequently, we created separate MATLAB (version R2020b, Mathworks, Natick, MA, USA) files for each test (gait and tandem gait) and each of the straight walking sections, before and after turning. Then, we synchronised the data from all devices, using a MATLAB script that employed the starting and ending time of each test to create a global timing for all devices. After synchronisation, we sorted the data from the sensors such that the x-axis corresponded to the vertical direction, the y-axis to the mediolateral direction and the z-axis to the anterior–posterior direction.

2.4.1. Gait Data Segmentation

After preprocessing, we plotted the mediolateral angular velocity (gyroscope) signals from the shimmers attached to each (right and left) shank. These signals are most representative of gait [21]. By visual inspection, we then selected only those signals that had a similar pattern to that reported by Salarian et al. [21] for gait, for further analysis.
Based on the algorithm described by Salarian et al. [21], we created a MATLAB script to segment individual steps from each shank gait signal (left and right). First, we used a decimation technique to reduce the noise of the signal by increasing two bits of resolution [22]. Then, we applied a zero-phase 0.5–5 Hz band-pass fourth-order Butterworth filter. This allowed identification of all positive peaks in the signal, which represent the mid-swing events. Subsequently, we identified the adjacent valleys to each peak, where the valley before the peak represents the moment of terminal contact (toe-off) and the valley after the peak represents the moment of initial contact (heel strike). Similar to Salarian et al. [21], we searched for these peaks in time intervals of 0.5 s before and after mid-swing to automatically identify the toe-off and heel strike events, respectively. We only used the gait cycle data for which all three events could be identified. We identified the events for each shank (left and right) separately, but used the information from both shanks to extract temporal and spatial features (Section 2.5).

2.4.2. Tandem Gait Data Segmentation

Similar to gait segmentation we first plotted the angular velocity recorded from the Shimmer IMUs attached to the shanks (left and right). By visual inspection, we identified the signals that were correctly acquired for further analysis.
For signal segmentation, we followed the same technique as we did for gait (Section 2.4.1). Once each mid-swing event and the corresponding toe-off and heel-strike events were identified, we segmented each gait cycle from toe-off to toe-off instead of from heel-strike to heel-strike as we did in gait. We did this to have more usable tandem gait cycles per patient.

2.5. Feature Extraction

The extraction of gait and tandem gait features employed expert knowledge from clinicians and knowledge about gait characteristics as described in the literature. The latter were derived from IMU-based gait models [23]. We aimed to derive gait features that reflected the gait characteristics clinicians look for during the evaluation of the SARA protocol, which include gait symmetry and regularity, as well as variability of gait performance. The use of such features instead of more general time or frequency domain-based or statistical features will allow for better explainability of the final model. This approach resulted in 36 extracted features. Details of the feature extraction are provided in Appendix A.

2.6. Classification

For further processing, we combined the 36 extracted features (24 for gait and 12 for tandem gait) in a table, where each column represented one of the 36 features. One row of this table contained feature values from one complete gait cycle of gait and one tandem gait cycle, randomly chosen, thereby building a ‘combined movement’ per patient, for classification. Each patient was represented by 10 rows in this table.
We decided to use a random forest classifier since it performed well in a previous study with a similar application and allows the most relevant features in the model to be identified, thereby contributing to model explainability [18]. For the classifier, we made an implementation pipeline using Python version 3.7 and scikit-learn 0.32. Random forest (RF) is an ensemble learning algorithm that combines several randomized decision trees and aggregates their predictions by averaging. It also returns estimates of feature importance, that we also report here. The latter characteristic of the method provides some insight into the most relevant features for classification which is of importance for the clinical interpretability of the classifier. Similar to a previous study from our group [18], we used 300 trees to reduce the computational cost, and the default Gini index threshold hyperparameter in scikit-learn for our random forest classification, which was repeated 100 times. The average and standard deviation of the classification metrics were used to obtain an estimate of classification performance.
To deal with the imbalanced number of participants in our dataset (EOA = 18, DCD = 13 and CTRL = 29), we used the ADASYN algorithm, which synthetically oversamples the minority classes (EOA and DCD), to obtain a balanced dataset [24]. In addition, aiming for a robust estimation (avoiding overfitting) and to overcome the relatively small number of samples we implemented leave-one-out cross validation. However, instead of leaving one instance (one row in the feature matrix) out, we took one participant (10 rows) out to be used as a test set, while keeping the remaining participants as a training set. Note that, for each LOOCV iteration, ADASYN was only applied to this training set. We classified each instance from each participant individually and used a majority vote strategy to obtain the classification per participant, per iteration. In case of a tie, the class was randomly determined from the two tied classes. Using this information over all 100 iterations, we also calculated and plotted the probability for each participant of being classified in one of the three classes. The entire procedure is outlined in the flowchart in Figure 1.
Even though accuracy is one of the most used metrics to evaluate classification performance, the interpretation of the results could be misleading if there is imbalance in the number of participants per group (class). In such cases, precision and recall could be used as additional metrics to evaluate the performance of the classifier. In our case, as we have an imbalanced dataset (EOA = 18, DCD = 13 and CTRL = 29) we, therefore, decided to use these metrics to evaluate classifier performance in addition to accuracy. Precision (also known as positive predictive value) summarizes the number of positive predictions that actually belong to the positive class. Precision is simply the ratio of correct positive predictions out of all positive predictions made. In an imbalanced classification problem with more than two classes, precision is calculated as the sum of true positives across all classes divided by the sum of true positives and false positives across all classes. Precision thus quantifies the ratio of correct predictions across both positive classes (EOA and DCD).
P r e c i s i o n = T r u e   P o s i t i v e T r u e   P o s i t i v e + F a l s e   P o s i t i v e
Recall summarizes how well the positive class was predicted. Recall quantifies the number of positive class predictions in relation to all positive examples in the dataset. Unlike precision that only comments on the correct positive predictions out of all positive predictions, recall indicates missed positive predictions. In an imbalanced classification problem with more than two classes, recall is calculated as the sum of true positives across all classes divided by the sum of true positives and false negatives across all classes.
R e c a l l = T r u e   P o s i t i v e T r u e   P o s i t i v e + F a l s e   N e g a t i v e
Precision and recall can be combined into a single score that seeks to balance both concerns, called the F-score or the F-measure. The F1-measure, which weights precision and recall equally by taking their harmonic mean, is the variant most often used when learning from imbalanced data [25].
F 1 = 2 × P r e c i s i o n × R e c a l l P r e c i s i o n + R e c a l l

3. Results

3.1. Participants

Data from 79 participants of the three groups (EOA, DCD and CTRL) were collected while performing the SARA gait tests. Data from 60 participants could be included for analysis after application of the exclusion criteria: 18 EOA, 13 DCD and 29 CTRL. Age was not normally distributed, and age ranges varied between groups (EOA: 6–25 years; DCD: 6–16 years; CTRL: 5–25 years). Yet, age did not significantly differ between groups (Kruskal–Wallis test, χ2(2) = 3.2404, p = 0.198). Table 1 provides a summary of the participant characteristics. Details on individual participants are provided in Table A2 in Appendix B.

3.2. Classification

The classifier had a mean accuracy of 82.0% (SD 3.0%) on new data. The confusion matrix (Table 2) was obtained over 100 iterations and the values presented correspond to mean and standard deviation. On a group level, the accuracy was for EOA: mean = 62.9%, SD = 6.3%, for DCD: mean = 85.5%, SD = 5.9% and for CTRL: mean = 94.5%, SD = 2.2%. We also determined the misclassification: of the EOA patients, 16.6 (SD 4.9)% was classified as CTRL and 21.9 (SD 6.2)% was classified as DCD; of the DCD patients, 3.8 (SD 4.7)% was classified as EOA and 13.7 (SD = 3.8)% was classified as CTRL; and finally, of the CTRL participants 4.3 (SD 1.5)% was classified as EOA and 1.2 (SD 1.9)% was classified as DCD.

3.3. Feature Importance

Feature importance or the relevance of the feature for the classifier was calculated as the probability of each feature reaching the final node in the ensemble, as described by the decrease of node impurity. According to this metric, there is a combination of features that made it possible to classify between classes. The most relevant features obtained over 100 iterations were those related to variability (the dynamic time warping distance to the mean from the signal of the thigh during tandem gait; the dynamic time warping distance to the mean from the signal of the shank during tandem gait) and range of movement (the range of the hip flexion–extension angle during gait) with mean feature importance of 6.3%, 5.4% and 5.2%. Here, the dynamic time warping distance allows the variability in the acquired signal across gait cycles to be quantified. The first 20 out of 36 features represented 66.3% of the relative importance, as visualized in Figure 2. For detailed information on the features, please see Appendix A.

3.4. Classification Performance

When considering the combination of EOA and DCD as a positive class, the mean precision of the classifier is 0.78 (SD 0.05), with a mean recall of 0.70 (SD 0.05) and mean F1 of 0.74 (SD 0.04). For EOA as a positive class, the mean precision is 0.87 (SD 0.05), with a mean recall of 0.61 (SD 0.06) and mean F1 of 0.72 (SD 0.05). For DCD as a positive class, the mean precision is 0.72 (SD 0.07), with a mean recall of 0.82 (SD 0.06) and mean F1 of 0.77 (SD 0.05). For CTRL as a positive class, the mean precision is 0.85 (SD 0.03), with a mean recall of 0.95 (SD 0.02) and a mean F1 of 0.90 (SD 0.02). These and additional measures of classification performance are summarized in Table 3.
Finally, we calculated the probability for each participant of being classified in any of the three groups. Over the 100 iterations, Figure 3 shows, for all participants, the probability of being classified as EOA (orange), DCD (blue) or CTRL (green). The upper left corner represents a high probability of being classified in the CTRL group. From Figure 3, we can see that 28 out of 29 healthy participants are localized in this corner (probability of being classified as a CTRL participant is >0.5), with one close to the EOA group with a probability of 0.1 of being classified as CTRL. The lower right corner represents a high probability of being classified as a DCD patient. Eleven out of thirteen DCD patients are localised in this corner (probability of being classified as a DCD patient is >0.5) while two DCD patients are close to the CTRL group with a probability of being classified as a CRTL > 0.6. Finally, the lower-left corner represents a high probability of being identified as an EOA patient. Nine of the EOA patients are localised in this corner (probability of being classified as an EOA patient is >0.5), while seven are spread in this probability plot with relatively high probabilities of being classified as a DCD patient or a CTRL participant. Two of the EOA patients are close to the CTRL group, with probabilities of 1.0 of being classified as a CTRL participant.
As can be concluded from Table A2 in Appendix B, in total 10 participants were misclassified; seven EOA patients (five classified as DCD and two as control), two DCD participants (both classified as control) and one control participant that was classified as EOA. Table A2 illustrates that the misclassified participants did not differ in age or sex from the other participants. The gait of the two DCD participants that were misclassified as controls was phenotypically assessed as (almost) normal (three evaluators scored the SARA gait as 0/0/0 and 0/1/0), probably explaining their misclassification. The SARA gait scores—to the extent that we have them available—for the EOA patients that were misclassified are similar to those of correctly classified EOA patients. One EOA patient, that was misclassified as a control, had one of the highest SARA gait scores in this group (3) and walked very broad-based. This is an aspect of movement that we cannot capture with IMUs and may explain why this particular patient was misclassified. The one control participant that was classified as an EOA patient had a normal gait and there is no explanation why this particular participant was misclassified. It should be noted that all participants had to be able to walk independently, leading to relatively mildly affected patients (maximum SARA gait score of 3), in both patient groups, making classification more challenging.

4. Discussion

We used IMU data obtained during the execution of both SARA gait tests to extract meaningful features commonly used for gait analysis (e.g., step length and stride velocity) or for representing gait pattern variability (e.g., deviations from the mean). We employed these features for the classification of children with EOA or DCD and typically developing children. We slightly improved classification performance (0.82) when compared to clinical assessment (0.73) [15], but more importantly, we identified the most contributing, meaningful, features, thereby increasing model explainability. We found that features that represent variability in gait are most relevant for classification using a random forest classifier, compared to temporal and spatial features, which contributed less.
The most relevant features used by our random forest classifier were related with the range of the hip flexion–extension angle during gait or with variability of movement during tandem gait. This illustrates that the accurate classification of EOA, DCD and CTRL children depends on the incorporation of features representing variability, from both gait and tandem gait tests. This fits with our a priori hypothesis that abnormal gait is characterised by irregularities in movement and loss of smoothness, and also corresponds with the clinical concept of ataxic gait as visualized in Figure 4.
As far as we know, there is limited literature on the automatic classification of EOA, DCD and CTRL children based on quantified movement data. Nevertheless, there are some studies aiming to distinguish between these groups. For example, in our previous study using information from gait only, we classified 80.0%, 85.7% and 70.0% of the CTRL, DCD and EOA children, respectively. Overall, that classifier correctly classified 78.4% of the participants [15]. However, in that study we only used statistical features, that provide limited insight in the classification model. In the current study, we correctly classified 94.5%, 85.5% and 61.4% of the CTRL, DCD and EOA children, respectively. Overall, the random forest classifier correctly classified 82.0% of the participants. Not only did we achieve good classification performance, even slightly better than in our previous study, we also now derived knowledge-based features—related to how gait is assessed clinically, incorporating stride length and gait velocity, for example—instead of statistical features, thereby providing meaningful features that enhance the explainability of the classification model.
In the literature, there are some other studies aiming to distinguish between DCD patients and healthy participants using 3D motion capture systems [9,11,16]. In a study aiming to examine the complexity and variability of gait, Rosengren et al. [9] found that DCD patients performed movements with higher complexity, meaning that they had higher variability as well as asymmetry in their movement patterns compared to healthy children. In our study, in line with these findings, we found that the most relevant features used by the classifier to identify between groups are those related to the variability of gait (i.e., distance to the mean trajectory for thigh and shank sensors during tandem gait execution and the range of the hip flexion–extension angle during gait).
In another study, Woodruff et al. [11], developed an index of walking performance to compare the gait patterns of children with DCD and healthy children. They combined four variables (percentage of cycle at opposite toe off, percentage of cycle of single stance, percentage of cycle at toe off and step length as a percentage of the gait cycle) to create a single index. DCD children showed abnormal gait patterns having larger variance in this index than healthy children. This is in line with our results, again suggesting that variability-based features are more meaningful for DCD distinction from healthy participants. To enhance the interpretability of our results, for the four most relevant features for our classification, we investigated if there was a significant difference between groups, by means of the Kruskal–Wallis H-test. These four features represented variability of movement, two measuring the distance of each gait cycle to the mean (TG_DIS_DTW_MEAN_THIGH, TG_DIS_DTW_MEAN_SHANK), one measuring the range of the angles during each gait cycle (G_RHFE) and one measuring the local curvature from the 3D angular velocity (G_CURV_SHANK). We calculated the median value per most relevant feature and group (see Table 4 and the Appendix A for an explanation of the features).
For the first three features, the medians are higher for the EOA group, lower for the CTRL group and values for the DCD group are in between, which is expected according to clinical experience. A possible explanation of the increased variability of gait parameters in DCD patients compared to CTRL children could be that DCD patients present increased coactivation, as suggested by Raynor [27], meaning that DCD patients have not developed the same level of muscular organization compared to their healthy peers. Since motor incoordination in DCD patients is clinically associated with cerebellar dysfunction, patients may present with more complex movements, i.e., unskilled movements similar to those observed in typically developing children in the earlier stages of childhood [28].
To the best of our knowledge, this is the first classification study using meaningful features based on the clinical construct of motor incoordination by combining information from the SARA gait and tandem gait tests. Although we obtained a good and insightful classification of EOA, DCD and CTRL children, there are some limitations to the study. First, we used a relatively small dataset (60 participants). However, it should be noted in that respect that, since EOA is a rare disorder, and we wanted to make sure that our labels were as accurate as possible, we carefully identified children with a formal diagnosis of having ataxia, validated by DNA analysis. This limited the number of children with ataxia eligible for our study. Another limitation is the use of inertial measurement units; we are aware that the use of even smaller and lighter sensors could further reduce any experienced restraint during the execution of the gait and tandem gait SARA tests. Hence, participants may not have walked as freely as during regular clinical examinations. However, we avoided pulling the Velcro bands too tight and the Shimmer IMUs only weigh 23.6 g each. Furthermore, we did not take the potential influence of height, cognitive processing, or other disease-independent participant characteristics on gait (variability) features into account. Yet, our results show that features that reflect gait variability during the straight walking part of both SARA gait tasks are most important for classifying the three participant groups. Hence, the potential noise added to these features due to disease-independent participant characteristics apparently does not hamper significant classification.
In the near future, we aim to include more data from the other SARA subtests, so that we can integrate all information for an improved classification of EOA, DCD and typically developing children. In this study, we aimed to distinguish between EOA, DCD and CTRL participants using meaningful information, deriving features similar to what clinicians look at when phenotypically evaluating patients. We expect that techniques such as deep learning may be helpful in further distinguishing hidden patterns, and can probably provide clinicians with new motor characteristics to consider. However, deep learning techniques typically require ample data, which should then first be collected.
We further hope that this study motivates other researchers to use wearable sensors for assessment of ataxia and other coordination disorders. Using this technology not only in the clinical environment but also during daily activities could potentially improve the early identification of ataxic signs.

5. Conclusions

Combining information from the SARA gait and tandem gait tests and deriving meaningful features for the classification of EOA, DCD and typically developing children resulted in a slightly improved classification performance compared to clinical assessment and increased model explainability. Features representing variability of movement during gait and tandem gait were found to be more relevant for classification between groups compared to temporal and spatial features. These clinically meaningful biomarkers obtained from inertial sensors may contribute to the evaluation of patients with EOA or DCD during gait and thus to the classification of patients with coordination disorders.

Author Contributions

Conceptualization, Z.T.D.-V., D.A.S. and N.M.M.; data curation, Z.T.D.-V.; formal analysis, M.B.d.Q.; investigation, Z.T.D.-V.; methodology, Z.T.D.-V., J.W.J.E. and N.M.M.; software, Z.T.D.-V.; writing—original draft, Z.T.D.-V. and N.M.M.; writing—review and editing, Z.T.D.-V., J.W.J.E., D.A.S. and N.M.M. All authors have read and agreed to the published version of the manuscript.

Funding

The Mexican National Council for Science and Technology (CONACYT) provided Z.T.D.V. with a scholarship to financially support this research. Funding sources were not involved in the study design, collection, analysis and interpretation of data, writing of the report, or in the decision to submit the article for publication.

Institutional Review Board Statement

Medical–ethical committee of the University Medical Center Groningen.

Informed Consent Statement

Written informed consent has been obtained from the participant(s) to publish this paper.

Data Availability Statement

The data presented in this study are not available on request, as participants or their legal representatives did not provide consent.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

DCDdevelopmental coordination disorder
EOAearly onset ataxia
CTRLtypically developing controls
SARAscale for the assessment and rating of ataxia

Appendix A

Appendix A.1. Features for Gait

Using the gait events derived from the two shank signals (left and right), we extracted temporal and spatial features similar to Salarian et al. [22]. To identify complete gait cycles, we determined whether the gait events fulfilled the requirements in Equation (A1):
I C R k < T C L k < I C L k < T C R k < I C R k + 1
Here, I C R is the initial contact (heel strike) from the right shank, T C R is the terminal contact (toe-off) of the right shank, I C L is the initial contact of the left shank, T C L is the terminal contact of the left shank and k runs over the number of gait cycles. We here assume that the participant started to walk with the right foot, but this could be easily adapted if the participant started with the left foot. Once complete gait cycles were identified, we extracted temporal and spatial features similar to Salarian et al. [22], which are thought to give information about gait quality. We here reiterate the equations that were used by these authors and as implemented by us in a MATLAB script to obtain temporal gait features:
Gait   cycle   time   G C T k = I C R k + 1 I C R k
Stance   S T k = T C R k I C R ( k ) G C T ( k ) × 100
Initial   double   support   I D S k = T C L k I C R k G C T ( k ) × 100
Terminal   double   support   T D S k = T C R k I C L k G C T ( k ) × 100
Double   support   D S k = I D S k + 1 + T D S k
Limp   L i m p k = I D S k + 1 T D S k
After obtaining the temporal features, we calculated the spatial features as described by Aminian et al. [23] and adopted their terminology. We here summarize these features. The authors developed a model for human gait based on the double pendulum model. In this model, the lower limbs of the human body are considered as segments of a double pendulum, and in general, the four segments (thighs and shanks) are represented as rigid elements of a triangle. Subsequently, we calculated the angles between joints and associated distances using trigonometry (for further details see Aminian et al. [23]).
Figure A1. Double pendulum model of the lower limb human body according to Aminian et al. [23]. We derived spatial and temporal features for our gait analysis based on this model. Reprinted from J. Biomech. 35(5), K. Aminian, B. Najafi, C. Büla, P.-F. Leyvraz, Ph. Robert, Spatio-temporal parameters of gait measured by an ambulatory system using miniature gyroscopes, 689–699, Copyright (2002), with permission from Elsevier.
Figure A1. Double pendulum model of the lower limb human body according to Aminian et al. [23]. We derived spatial and temporal features for our gait analysis based on this model. Reprinted from J. Biomech. 35(5), K. Aminian, B. Najafi, C. Büla, P.-F. Leyvraz, Ph. Robert, Spatio-temporal parameters of gait measured by an ambulatory system using miniature gyroscopes, 689–699, Copyright (2002), with permission from Elsevier.
Sensors 23 08410 g0a1
We first calculated the rotation angles of the thighs (hip flexion–extension) and shanks (right and left) by integrating the data from the gyroscopes (angular velocity) attached to them, as proposed by Aminian et al. [23]:
Hip   flexion extension   angle   α k = h s t o ω T ( t ) d t
Shank   angle   β k = h s t o ω S ( t ) d t
Here, ω T is the angular velocity of the sensor attached to the thigh, ω S is the angular velocity of the sensor attached to the shank, h s is the time of the heel strike event and t o is the time of the toe-off event.
For our analysis, we defined four features using the information from the angles:
Range   of   hip   flexion extension   angle   R H F E = max α m i n ( α )
Range   of   shank   rotation   R S R = max β m i n ( β )
The distance d 1 + d 2 corresponds to the part of the stride length related to the swing phase of the right shank and thigh and is calculated as:
d 1 k + d 2 k = L 2 + M 1 ( k ) 2 + L 2 + M 2 ( k ) 2 L 2 + M 1 k × L 2 + M 2 k × c o s ( β k )
where L 2 is the shank length, M 1 k = s i n φ ( k ) s i n β ( k ) × d ( k ) , M 2 k = s i n γ ( k ) s i n β ( k ) × d k , φ k = π 2 α k + β ( k ) 2 , d k = l 2 k × 2 × ( c o s β ( k ) ) and γ k = π β ( k ) 2 and k runs over the number of detected gait events (heel-strike and toe-off).
The distance d 3 corresponds to the part of the stride length related to the stance phase and is calculated as:
d 3 k = L 1 + M 2 ( k ) 2 + L 1 + M 1 ( k ) 2 L 1 + M 2 k × L 1 + M 1 k × c o s ( α k )
where L 1 is the thigh (hip–knee) length.
Stride length thus equals the sum of the distances d 1 + d 2 + d 3 , taking into account a complete gait cycle from toe-off, through heel strike and to the next toe-off from the same foot.
Stride   length   S L k = d 1 ( k ) + d 2 ( k ) + d 3 ( k )
Stride velocity is defined by stride length divided by gait cycle time.
Stride   velocity   S V k = S L ( k ) G C T ( k )
Finally, we obtained features that describe the quality of movement of patients in space. These features were extracted using the information from the signals obtained from the shimmers attached to the thighs and shanks (right and left). We first performed principal component analysis of the 3D angular velocity. Similar to previous studies [19], we here hypothesize that the movement of healthy participants is mostly in one direction, while the movement of patients with gait abnormalities has components in different directions reflecting a lack of balance and coordination. We therefore used the first principal component (PC1_shank and PC1_thigh) to describe the movement of participants along the main axis of movement, similar to an approach we took earlier in classifying EOA and DCD patients and healthy participants, using upper limb SARA tests [19]. As we hypothesized that the movement of participants during gait would be mostly in one direction, PC1_shank and PC1_thigh should represent most of the variance. We also used the sum of the variances of the first two principal components (PC1 + PC2_shank and PC1 + PC2_thigh) as an additional feature reflecting (lack of) balance. Using 3D interpolation on the angular velocity, we first made sure that each gait cycle had the same number of data points (we used 300 points), to be able to compare between gait cycles form the same side (right/left) and patient. Then, we obtained the mean gait cycle over the identified gait cycles per participant and side (right and left) and calculated the Euclidean distance of each gait cycle to the mean (DIS_EUC_MEAN_SHANK and DIS_EUC_MEAN_THIGH) as well as the standard deviation (DIS_EUC_STD_SHANK and DIS_EUC_STD_ THIGH). We also calculated the dynamic time warping distance (DIS_DTW_MEAN_SHANK and DIS_DTW_MEAN_THIGH), from each gait cycle to the mean aiming to quantify the variability of movement between individual gait cycles. The mean local curvature of each gait cycle was also determined (CURV_SHANK and CURV_THIGH), using the 3D angular velocity signal and the expression:
c u r v a t u r e = 1 r
Here, local curvature is defined as the inverse of the radius ( r ) of a circle fitting through three consecutive points of the 3D angular velocity signal.
In summary, for the gait we extracted the following 24 features: G_GCT, G_ST, G_IDS, G_TDS, G_DS, G_LIMP, G_SL, G_D12, G_D3, G_SV, G_RSR, G_RHFE, G_PCA1_SHANK, G_PC1+PCA2_SHANK, G_PCA1_THIGH, G_PC1+PCA2_THIGH, G_DIS_EUC_MEAN_SHANK, G_DIS_EUC_STD_SHANK, G_DIS_DTW_MEAN_SHANK, G_DIS_EUC_MEAN_THIGH, G_DIS_EUC_STD_THIGH, G_DIS_DTW_MEAN_THIGH, G_CURV_SHANK, G_CURV_THIGH.

Appendix A.2. Features for Tandem Gait

We here obtained features that describe the quality of movement. We used the gait events identified in Section Appendix A.1, particularly the information of the toe-off events; we segmented the signal from one toe-off event to the next, which we defined as a tandem gait cycle. Then, similar to the gait analysis, we used principal component analysis (PCA) to determine the main direction of movement for each segment, assuming that healthy participants without balance problems would perform the movement mostly in one direction (forward), while patients with balance problems would perform the movement in a more variable direction. Therefore, the first extracted feature was the variance explained by the first principal component (TG_PCA1_SHANK and TG_PCA1_THIGH) to assess variability in the direction of movement. We also used the variance explained by the first and second principal components as a measure of the extent to which the direction of the movement was variable (TG_PC1+PCA2_SHANK and TG_PC1+PCA2_THIGH). We also implemented 3D interpolation for the tandem gait cycles to have the same number of data points (300 points), and compared between tandem gait cycles, obtaining the mean tandem gait cycle over all the tandem gait cycles per patient and side (right and left) and calculating the Euclidean distance of each tandem gait cycle to the mean (TG_DIS_EUC_MEAN_SHANK and TG_DIS_EUC_MEAN_THIGH), the standard deviation (TG_DIS_EUC_STD_SHANK and TG_DIS_EUC_STD_THIGH), the dynamic time warping distance (TG_DIS_DTW_MEAN_SHANK and TG_DIS_DTW_MEAN_THIGH) and mean local curvature (TG_CURV_SHANK and TG_CURV_THIGH) using the 3D angular velocity segmented signal and Equation (A16).
In summary, for the tandem gait we extracted the following 12 features: TG_PCA1_SHANK, TG_PC1+PCA2_SHANK, TG_PCA1_THIGH, TG_PC1+PCA2_THIGH, TG_DIS_EUC_MEAN_SHANK, TG_DIS_EUC_STD_SHANK, TG_DIS_DTW_MEAN_SHANK, TG_DIS_EUC_MEAN_THIGH, TG_DIS_EUC_STD_THIGH, TG_DIS_DTW_MEAN_THIGH, TG_CURV_SHANK, TG_CURV_THIGH.
Table A1. Features and explanation.
Table A1. Features and explanation.
FeatureDescriptionInterpretationType
1G_GCTGait cycle timeTime taken during a single gait cycle (from heel-strike to heel-strike of the same foot)Temporal
2G_STGait stanceTime taken between heel-strike and toe-off of the same footTemporal
3G_IDSGait initial double supportTime taken between heel-strike of one foot and toe-off of the other foot. Temporal
4G_TDSGait terminal double supportTime taken between heel-strike and toe-off of the contralateral footTemporal
5G_DSGait double supportTotal time of double supportTemporal
6G_LIMPGait limpTime difference between initial and terminal double supportTemporal
7G_SLGait stride lengthDistance between successive points of initial contact of the same footSpatial
8G_D12Gait distance d1+d2Distance travelled during the swing phasesSpatial
9G_D3Gait distance d3Distance travelled during the stance phaseSpatial
10G_SVGait stride velocityVelocity during a single gait cycleSpatial
11G_RSRGait range of shank rotationRange of angle of the right or left shank estimated by integration of the angular velocityVariability
12G_RHFEGait range of hip flexion–extension angleRange of right or left hip flexion–extension angle estimated by integration of the angular velocityVariability
13G_PCA1_SHANKGait pc1 shankFirst principal component of the 3D angular velocity from the sensor attached to each shankVariability
14G_PC1+PCA2_SHANKGait pc1+pc2 shankSum of the first and second principal components of the 3D angular velocity from the sensor attached to each shankVariability
15G_PCA1_THIGHGait pc1 thighFirst principal component of the 3D angular velocity from the sensor attached to each thighVariability
16G_PC1+PCA2_THIGHGait pc1+pc2 shankSum of the first and second principal components of the 3D angular velocity from the sensor attached to each thighVariability
17G_DIS_EUC_MEAN_SHANKGait Euclidian distance shankMean Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shank Variability
18G_DIS_EUC_STD_SHANKGait Euclidian distance shankStandard deviation of the Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shankVariability
19G_DIS_DTW_MEAN_SHANKGait DTW distance shankMean dynamic time warping distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shank. This metric is used for measuring the similarity between the mean 3D angular velocity and each individual 3D angular velocity signals taking the shape of the signal into account.Variability
20G_DIS_EUC_MEAN_THIGHGait Euclidian distance thighMean Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the thigh.Variability
21G_DIS_EUC_STD_THIGHGait standard deviation of the Euclidian distance thighStandard deviation of the Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the thighVariability
22G_DIS_DTW_MEAN_THIGHGait mean of the Euclidian distance tightMean dynamic time warping distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the thigh.Variability
23G_CURV_SHANKGait local curvature on the shankLocal curvature of the 3D angular velocity from the signal of each sensor attached to the shank, during gait testVariability
24G_CURV_THIGHGait local curvature on the thighLocal curvature of the 3D angular velocity from the signal of each sensor attached to the thigh, during gait testSpatial
25TG_PCA1_SHANKTandem gait pc1 in the shankFirst principal component of the 3D angular velocity from the sensor attached to each shank, during tandem gait testVariability
26TG_PC1+PCA2_SHANKTandem gait pc1+pc2 in the shankSum of the first and second principal components of the 3D angular velocity from the sensor attached to each shank, during tandem gait testVariability
27TG_PCA1_THIGHTandem gait pc1 in the thighFirst principal component of the 3D angular velocity from the sensor attached to each thigh, during tandem gait testVariability
28TG_PC1+PCA2_THIGHTandem gait pc1+pc2 in the thighSum of the first and second principal components of the 3D angular velocity from the sensor attached to each thigh, during tandem gait testVariability
29TG_DIS_EUC_MEAN_SHANKTandem gait mean of the Euclidean distance in the shankMean Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shankVariability
30TG_DIS_EUC_STD_SHANKTandem gait standard deviation of the Euclidean distance in the shankStandard deviation of the Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shankVariability
31TG_DIS_DTW_MEAN_SHANKTandem gait mean DTW distance in the shankMean dynamic time warping distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shankVariability
32TG_DIS_EUC_MEAN_THIGHTandem gait mean Euclidian distance in the thighMean Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the thighVariability
33TG_DIS_EUC_STD_THIGHTandem gait standard deviation of the Euclidian distance in the thighStandard deviation of the Euclidian distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the shankVariability
34TG_DIS_DTW_MEAN_THIGHTandem gait mean DTW distance in the thighMean dynamic time warping distance between the 3D angular velocity for each gait cycle and the mean 3D angular velocity per patient and side from the sensor attached to the thighVariability
35TG_CURV_SHANKTandem gait curvature in the shankLocal curvature of the 3D angular velocity from the signal of each sensor attached to the shank, during the tandem gait testSpatial
36TG_CURV_THIGHTandem gait curvature in the thighLocal curvature of the 3D angular velocity from the signal of each sensor attached to the thigh, during the tandem gait testSpatial

Appendix B

Table A2. Individual participant characteristics. Predictions are the modes over the 100 iterations of modelling and lead to the probabilities presented in this table. Age is provided in years. Cases where the prediction deviates from the diagnosis/label are indicated in bold. CTRL: control, DCD: developmental coordination disorder, EOA: early onset ataxia, F: female, M: male, NA: not available and SARA: scale for the assessment and rating of ataxia.
Table A2. Individual participant characteristics. Predictions are the modes over the 100 iterations of modelling and lead to the probabilities presented in this table. Age is provided in years. Cases where the prediction deviates from the diagnosis/label are indicated in bold. CTRL: control, DCD: developmental coordination disorder, EOA: early onset ataxia, F: female, M: male, NA: not available and SARA: scale for the assessment and rating of ataxia.
DiagnosisPredictionProbabilitiesAgeSexSARA Gait Scores
EOACTRLDCD
1CTRLCTRL01010MNA
2CTRLCTRL01010MNA
3CTRLCTRL0.20.70.18MNA
4CTRLCTRL0107MNA
5CTRLCTRL01012MNA
6 CTRLCTRL01010FNA
7 CTRLCTRL01013MNA
8 CTRLCTRL01017MNA
9 CTRLCTRL00.90.113FNA
10 CTRLCTRL00.90.116FNA
11 CTRLCTRL01014FNA
12 CTRLCTRL00.80.210FNA
13 CTRLCTRL0108FNA
14 CTRLCTRL01020FNA
15 CTRLCTRL0109FNA
16 CTRLCTRL00.80.212FNA
17 CTRLCTRL00.90.114FNA
18 CTRLCTRL01012FNA
19 CTRLEOA0.90.1015FNA
20 CTRLCTRL01024FNA
21 CTRLCTRL01022MNA
22 CTRLCTRL01023FNA
23 CTRLCTRL00.90.125FNA
24 CTRLCTRL0.20.809MNA
25 CTRLCTRL00.60.412FNA
26 CTRLCTRL0.10.80.19FNA
27 CTRLCTRL0.30.50.27MNA
28 CTRLCTRL0.20.805MNA
29 CTRLCTRL0.20.809FNA
1 DCDDCD00110M1/0/0
2 DCDDCD0.100.913F1.5/1/1
3 DCDCTRL00.60.412F0/1/0
4 DCDDCD0.10.10.88F2/2/2
5 DCDDCD00.30.78M1/0/1
6 DCDCTRL00.70.38F0/0/0
7 DCDDCD00.10.915MNA
8 DCDDCD0.30.10.66MNA
9 DCDDCD0.20.10.711MNA
10 DCDDCD0018MNA
11 DCDDCD00112MNA
12 DCDDCD0.30.10.69MNA
13 DCDDCD0.300.716FNA
1 EOADCD0.30.20.58M2.25/2/2.25
2 EOACTRL01011M2/1.75/2
3 EOAEOA0.50.40.118M1.5/1/0
4 EOAEOA10010F2/2/3
5 EOAEOA0.900.112M3/2/3
6 EOADCD0.10.40.519M2/2/2.75
7 EOACTRL01025F3
8 EOAEOA10011F2
9 EOAEOA0.900.117F1.5
10 EOAEOA0.90.1013M2
11 EOAEOA1006FNA
12 EOAEOA0.90.1021M2/2/2
13 EOAEOA10015FNA
14 EOAEOA10011M2/1.5/2
15 EOADCD0.400.620M2/1.25/2
16 EOAEOA0.40.30.39MNA
17 EOADCD0.300.78MNA
18 EOADCD0.10.10.87FNA

References

  1. Thach, W.T.; Bastian, A.J. Role of the cerebellum in the control and adaptation of gait in health and disease. Prog. Brain Res. 2004, 143, 353–366. [Google Scholar]
  2. Shirai, S.; Yabe, I.; Takahashi-Iwata, I.; Matsushima, M.; Ito, Y.M.; Takakusaki, K.; Sasaki, H. The Responsiveness of Triaxial Accelerometer Measurement of Gait Ataxia Is Higher than That of the Scale for the Assessment and Rating of Ataxia in the Early Stages of Spinocerebellar Degeneration. Cerebellum 2019, 18, 721–730. [Google Scholar] [CrossRef] [PubMed]
  3. Hillman, S.J.; Stansfield, B.W.; Richardson, A.M.; Robb, J.E. Development of temporal and distance parameters of gait in normal children. Gait Posture 2009, 29, 81–85. [Google Scholar] [CrossRef] [PubMed]
  4. Harding, A.E. Classification of the hereditary ataxias and paraplegias. Lancet 1983, 321, 1151–1155. [Google Scholar] [CrossRef] [PubMed]
  5. Lawerman, T.F.; Brandsma, R.; Maurits, N.M.; Martinez-Manzanera, O.; Verschuuren-Bemelmans, C.C.; Lunsing, R.J.; Brouwer, O.F.; Kremer, H.P.; Sival, D.A. Paediatric motor phenotypes in early-onset ataxia, developmental coordination disorder, and central hypotonia. Dev. Med. Child Neurol. 2020, 62, 75–82. [Google Scholar] [CrossRef]
  6. Schmitz-Hübsch, T.; du Montcel, S.T.; Baliko, L.; Berciano, J.; Boesch, S.; Depondt, C.; Giunti, P.; Globas, C.; Infante, J.; Kang, J.S.; et al. Scale for the assessment and rating of ataxia: Development of a new clinical scale. Neurology 2006, 67, 1717. [Google Scholar] [CrossRef]
  7. Diagnostic and Statistical Manual of Mental Disorders, 5th ed.; American Psychiatric Association: Washington, DC, USA, 2013.
  8. Harris, S.R.; Mickelson, E.C.R.; Zwicker, J.G. Diagnosis and management of developmental coordination disorder. Can. Med. Assoc. J. 2015, 187, 659–665. [Google Scholar] [CrossRef]
  9. Rosengren, K.S.; Deconinck, F.J.; Diberardino, L.A., 3rd; Polk, J.D.; Spencer-Smith, J.; De Clercq, D.; Lenoir, M. Differences in gait complexity and variability between children with and without Developmental Coordination Disorder. Gait Posture 2009, 29, 225–229. [Google Scholar] [CrossRef]
  10. Smith, M.; Ward, E.; Williams, C.M.; Banwell, H.A. Differences in walking and running gait in children with and without developmental coordination disorder: A systematic review and meta-analysis. Gait Posture 2021, 83, 177–184. [Google Scholar] [CrossRef]
  11. Woodruff, S.J.; Bothwell-Myers, C.; Tingley, M.; Albert, W.J. Gait Pattern Classification of Children with Developmental Coordination Disorder. Adapt. Phys. Act. Q. 2002, 19, 378–391. [Google Scholar] [CrossRef]
  12. Palliyath, S.; Hallett, M.; Thomas, S.L.; Lebiedowska, M.K. Gait in patients with cerebellar ataxia. Mov. Disord. 1998, 13, 958–964. [Google Scholar] [CrossRef] [PubMed]
  13. Serrao, M.; Pierelli, F.; Ranavolo, A.; Draicchio, F.; Conte, C.; Don, R.; Di Fabio, R.; LeRose, M.; Padua, L.; Sandrini, G.; et al. Gait Pattern in Inherited Cerebellar Ataxias. Cerebellum 2012, 11, 194–211. [Google Scholar] [CrossRef] [PubMed]
  14. Stolze, H. Typical features of cerebellar ataxic gait. J. Neurol. Neurosurg. Psychiatry 2002, 73, 310–312. [Google Scholar] [CrossRef] [PubMed]
  15. Mannini, A.; Martinez-Manzanera, O.; Lawerman, T.F.; Trojaniello, D.; Croce, U.D.; Sival, D.A.; Maurits, N.M.; Sabatini, A.M. Automatic classification of gait in children with early-onset ataxia or developmental coordination disorder and controls using inertial sensors. Gait Posture 2017, 52, 287–292. [Google Scholar] [CrossRef]
  16. Aguilar, V.S.; Manzanera, O.M.; Sival, D.A.; Maurits, N.M.; Roerdink, J.B.T.M. Distinguishing Patients With a Coordination Disorder From Healthy Controls Using Local Features of Movement Trajectories During the Finger-to-Nose Test. IEEE Trans. Biomed. Eng. 2019, 66, 1714–1722. [Google Scholar] [CrossRef]
  17. Wilmut, K.; Du, W.; Barnett, A.L. Gait patterns in children with Developmental Coordination Disorder. Exp. Brain Res. 2016, 234, 1747–1755. [Google Scholar] [CrossRef]
  18. Martinez-Manzanera, O.; Lawerman, T.F.; Blok, H.J.; Lunsing, R.J.; Brandsma, R.; Sival, D.A.; Maurits, N.M. Instrumented finger-to-nose test classification in children with ataxia or developmental coordination disorder and controls. Clin. Biomech. 2018, 60, 51–59. [Google Scholar] [CrossRef]
  19. Dominguez-Vega, Z.T.; Dubber, D.; Elting, J.W.J.; Sival, D.A.; Maurits, N.M. Instrumented classification of patients with early onset ataxia or developmental coordination disorder and healthy control children combining information from three upper limb SARA tests. Eur. J. Paediatr. Neurol. 2021, 34, 74–83. [Google Scholar] [CrossRef]
  20. Harding, A.E.A. Clinical features and classification of inherited ataxias. Adv. Neurol. 1993, 61, 1–14. [Google Scholar]
  21. Salarian, A.; Russmann, H.; Vingerhoets, F.J.; Dehollain, C.; Blanc, Y.; Burkhard, P.R.; Aminian, K. Gait Assessment in Parkinson’s Disease: Toward an Ambulatory System for Long-Term Monitoring. IEEE Trans. Biomed. Eng. 2004, 51, 1434–1443. [Google Scholar] [CrossRef]
  22. AVR121: Enhancing ADC Resolution by Oversampling; Atmel Corporation: San Jose, Ca, USA, 2005.
  23. Aminian, K.; Najafi, B.; Büla, C.; Leyvraz, P.-F.; Robert, P. Spatio-temporal parameters of gait measured by an ambulatory system using miniature gyroscopes. J. Biomech. 2002, 35, 689–699. [Google Scholar] [CrossRef] [PubMed]
  24. He, H.; Bai, Y.; Garcia, E.A.; Li, S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–8 June 2008; IEEE: Piscataway, NJ, USA, 2008. [Google Scholar]
  25. He, H.; Ma, Y. Imbalanced Learning: Foundations, Algorithms, and Applications; Wiley: Hoboken, NJ, USA, 2013. [Google Scholar]
  26. Oosterhuis, H.J.G.H. Klinische Neurologie: Een Beknopt Leerboek, 13th ed.; Bohn Stafleu Van Loghum: Houten, The Netherlands, 1997. [Google Scholar]
  27. Raynor, A.J. Strength, power, and coactivation in children with developmental coordination disorder. Dev. Med. Child Neurol. 2007, 43, 676–684. [Google Scholar] [CrossRef]
  28. Mariën, P.; Wackenier, P.; De Surgeloose, D.; De Deyn, P.P.; Verhoeven, J. Developmental Coordination Disorder: Disruption of the Cerebello-Cerebral Network evidenced by SPECT. Cerebellum 2010, 9, 405–410. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowchart of the entire analysis procedure, including signal segmentation, feature extraction and model training and validation.
Figure 1. Flowchart of the entire analysis procedure, including signal segmentation, feature extraction and model training and validation.
Sensors 23 08410 g001
Figure 2. Average relative feature importance of the 20 most relevant features for classification. The feature importance according to the decrease of node impurity is normalised across all features. For details and explanation of all features, see Appendix A.
Figure 2. Average relative feature importance of the 20 most relevant features for classification. The feature importance according to the decrease of node impurity is normalised across all features. For details and explanation of all features, see Appendix A.
Sensors 23 08410 g002
Figure 3. Mean probability over 100 iterations, each dot represents a participant from one of the three groups, EOA, DCD and healthy participants. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control. The relative position of the participant indicates the estimated probability of being identified as EOA, DCD or CTRL. Dots near the (0,0) coordinate were identified as belonging to the EOA group, dots close to the (1,0) coordinate were identified as DCD patients and finally, dots in the (0,1) coordinate were classified as CTRL. The area outside these coordinates was defined as the intersection area, where each dot in this region is defined by its relative probability to have characteristics from the three groups (i.e., a dot at position (0.4,0.2) indicates that this participant has 0.4 probability of being classified as a DCD patient, 0.2 probability of being classified as a healthy participant and (1 − 0.4 − 0.2 = 0.4) probability of being classified as an EOA patient). We added jitter to the probability values to improve the visualisation of each participant and its relative position on the plot.
Figure 3. Mean probability over 100 iterations, each dot represents a participant from one of the three groups, EOA, DCD and healthy participants. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control. The relative position of the participant indicates the estimated probability of being identified as EOA, DCD or CTRL. Dots near the (0,0) coordinate were identified as belonging to the EOA group, dots close to the (1,0) coordinate were identified as DCD patients and finally, dots in the (0,1) coordinate were classified as CTRL. The area outside these coordinates was defined as the intersection area, where each dot in this region is defined by its relative probability to have characteristics from the three groups (i.e., a dot at position (0.4,0.2) indicates that this participant has 0.4 probability of being classified as a DCD patient, 0.2 probability of being classified as a healthy participant and (1 − 0.4 − 0.2 = 0.4) probability of being classified as an EOA patient). We added jitter to the probability values to improve the visualisation of each participant and its relative position on the plot.
Sensors 23 08410 g003
Figure 4. Typical clinically observed gait patterns in healthy persons and ataxia patients. (a) Gait (top) and tandem gait (bottom) of a healthy participant, (b) gait (top) and tandem gait (bottom) of an ataxic patient. Figure adapted with permission from Oosterhuis [26]. Copyright 1997, Oosterhuis.
Figure 4. Typical clinically observed gait patterns in healthy persons and ataxia patients. (a) Gait (top) and tandem gait (bottom) of a healthy participant, (b) gait (top) and tandem gait (bottom) of an ataxic patient. Figure adapted with permission from Oosterhuis [26]. Copyright 1997, Oosterhuis.
Sensors 23 08410 g004
Table 1. Participant characteristics for each of the three groups. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: typically developing, IQR: interquartile range.
Table 1. Participant characteristics for each of the three groups. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: typically developing, IQR: interquartile range.
EOA (n = 18)DCD (n = 13)CTRL (n = 29)
Age (median, IQR)11.5 (8.5)10.0 (4.5)12.0 (6.0)
Range (years)6–256–165–25
Sex (male/female)11/78/511/18
Table 2. Confusion matrix averaged over 100 iterations and subsequently expressed in percentages. Each row displays the mean percentage of participants predicted to belong to each of the three classes. Overall mean accuracy is 82%. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control. Note that the percentages do not add up to exactly 100% due to the LOOCV and the 100 iterations.
Table 2. Confusion matrix averaged over 100 iterations and subsequently expressed in percentages. Each row displays the mean percentage of participants predicted to belong to each of the three classes. Overall mean accuracy is 82%. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control. Note that the percentages do not add up to exactly 100% due to the LOOCV and the 100 iterations.
Prediction
ActualEOADCDCTRL
EOA61.4% (6.3%)21.9% (6.2%)16.6% (4.9%)
DCD3.8% (4.7%)82.5% (5.9%)13.7% (3.8%)
CTRL4.3% (1.5%)1.2% (1.9%)94.5% (2.2%)
Table 3. Measures of classification performance for EOA, CTRL and DCD participants as a positive class. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control.
Table 3. Measures of classification performance for EOA, CTRL and DCD participants as a positive class. EOA: early onset ataxia, DCD: developmental coordination disorder, CTRL: healthy control.
ActualBalanced AccuracySensitivity (Recall)SpecificityPrecisionF1-Score
EOA0.79 (0.03)0.61 (0.06)0.96 (0.02)0.87 (0.05)0.72 (0.05)
DCD0.87 (0.03)0.82 (0.06)0.91 (0.03)0.72 (0.07)0.77 (0.05)
CTRL0.90 (0.02)0.95 (0.02)0.85 (0.03)0.85 (0.03)0.90 (0.02)
Table 4. Median values for the four most relevant gait characteristics and result of the Kruskal–Wallis test for each of the characteristics. EOA: early onset ataxia, DCD: developmental coordination disorder and CTRL: healthy control.
Table 4. Median values for the four most relevant gait characteristics and result of the Kruskal–Wallis test for each of the characteristics. EOA: early onset ataxia, DCD: developmental coordination disorder and CTRL: healthy control.
FeatureEOADCDCTRLKruskal–Wallis Test
TG_DIS_DTW_MEAN_THIGH578.42532.75212.43χ2(2) = 100.11, p = 1.819 × 10−22
G_RHFE203.64183.91173.80χ2(2) = 45.51, p = 1.306 × 10−10
TG_DIS_DTW_MEAN_SHANK631.38597.52200.56χ2(2) = 103.68, p = 30.57 × 10−23
G_CURV_SHANK0.00120.00170.0013χ2(2) = 25.531, p = 2.857 × 10−6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dominguez-Vega, Z.T.; de Quiros, M.B.; Elting, J.W.J.; Sival, D.A.; Maurits, N.M. Instrumented Gait Classification Using Meaningful Features in Patients with Impaired Coordination. Sensors 2023, 23, 8410. https://doi.org/10.3390/s23208410

AMA Style

Dominguez-Vega ZT, de Quiros MB, Elting JWJ, Sival DA, Maurits NM. Instrumented Gait Classification Using Meaningful Features in Patients with Impaired Coordination. Sensors. 2023; 23(20):8410. https://doi.org/10.3390/s23208410

Chicago/Turabian Style

Dominguez-Vega, Zeus T., Mariano Bernaldo de Quiros, Jan Willem J. Elting, Deborah A. Sival, and Natasha M. Maurits. 2023. "Instrumented Gait Classification Using Meaningful Features in Patients with Impaired Coordination" Sensors 23, no. 20: 8410. https://doi.org/10.3390/s23208410

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop