Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures

Li, Sheng; Zuo, Xiang; Li, Zhengying; Wang, Honghai; Sun, Lizhi

doi:10.3390/s20082179

Open AccessArticle

Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures

¹

National Engineering Laboratory for Fiber Optic Sensing Technology, Wuhan University of Technology, Wuhan 430070, China

²

School of Information Engineering, Wuhan University of Technology, Wuhan 430070, China

³

Department of Civil and Environmental Engineering, University of California, Irvine, CA 92697-2175, USA

^*

Author to whom correspondence should be addressed.

Sensors 2020, 20(8), 2179; https://doi.org/10.3390/s20082179

Submission received: 23 March 2020 / Revised: 8 April 2020 / Accepted: 11 April 2020 / Published: 12 April 2020

(This article belongs to the Special Issue Optical Sensors for Structural Health Monitoring)

Download

Browse Figures

Versions Notes

Abstract

:

Quantifying structural status and locating structural anomalies are critical to tracking and safeguarding the safety of long-distance underground structures. Given the dynamic and distributed monitoring capabilities of an ultra-weak fiber Bragg grating (FBG) array, this paper proposes a method combining the stacked denoising autoencoder (SDAE) network and the improved dynamic time wrapping (DTW) algorithm to quantify the similarity of vibration responses. To obtain the dimensionality reduction features that were conducive to distance measurement, the silhouette coefficient was adopted to evaluate the training efficacy of the SDAE network under different hyperparameter settings. To measure the distance based on the improved DTW algorithm, the one nearest neighbor (1-NN) classifier was utilized to search the best constraint bandwidth. Moreover, the study proposed that the performance of different distance metrics used to quantify similarity can be evaluated through the 1-NN classifier. Based on two one-dimensional time-series datasets from the University of California, Riverside (UCR) archives, the detailed implementation process for similarity measure was illustrated. In terms of feature extraction and distance measure of UCR datasets, the proposed integrated approach of similarity measure showed improved performance over other existing algorithms. Finally, the field-vibration responses of the track bed in the subway detected by the ultra-weak FBG array were collected to determine the similarity characteristics of structural vibration among different monitoring zones. The quantitative results indicated that the proposed method can effectively quantify and distinguish the vibration similarity related to the physical location of structures.

Keywords:

1. Introduction

Over the past decades, with the rapid development of rail transit infrastructure in China, the operation safety and security of subway systems have attracted much attention. According to the recent research progress of distributed optical fiber-sensing technology [1,2,3,4,5,6,7], the requirement for time- and space-continuous monitoring for the geotechnical underground structures [8] has gradually become feasible. Comparisons between various commonly used sensors for underground structure monitoring were reported in [9,10], which revealed that the ultra-weak fiber optic Bragg grating (FBG) array [11] can be used for both static and dynamic measurements [12,13,14]. In the field of dynamic measurement, it was reported that the distributed vibration detected by the ultra-weak FBG array can be applied to track train and identify incursion [10,15]. Moreover, the change of the structural vibration responses usually reflects the evolution of the structure state to a certain extent. A wide range of research reports concerning the vibration-based structural condition assessment can be found in [16,17,18,19]. Compared with ground transportation, the daily operation of underground trains is of obvious regularity. For example, the speed of trains in each travel zone always follows the operation schedule, and the number of passengers does not change suddenly within a certain period due to commuting habits. Moreover, the temperature and humidity fields of underground infrastructure are relatively stable due to the management measures of tunnel ventilation. Therefore, it can be assumed that the structural vibration responses corresponding to the excitation of multiple passing trains in a certain structural state should be stable and similar. With the support of distributed vibration monitoring adapted to the long-distance underground structures, it is possible to quantify the structural status by measuring the similarity of structural vibration responses for a specified monitoring area under different stages and this is the research motivation of the paper.

The vibration responses of subway tunnel structures can be regarded as a collection of typical one-dimensional time-series signals. The similarity measure between time series can often be converted to measure the distance between vectors. The Euclidean distance (ED) [20] and its variants based on common L_p-norm [21] are the most straightforward methods for similarity measures of such one-dimensional time-series. However, there is a slight difference in the length of duration in the vibration responses excited by each train passing through the monitoring area, making the ED and its variants unable to directly perform the similarity measure for unequal-length sequences. Even when dealing with equal-length vibration signals, these methods are susceptible to noise and time misalignment and are unable to deal with local time-shifting. Dynamic time warping (DTW) [22] is an option to overcome time-shifting, which allows a time series to be either stretched or compressed to provide a better match with another time series. Therefore, it can be used to handle similarity measures between inconsistent length sequences. Another group of similarity measures suitable for processing unequal-length time series is developed based on the concept of the edit distance for strings [23]. Compared with DTW which only considers the constrain bandwidth, the similarity measure based on the edit distance requires tuning more parameters [24,25,26] to find the most similar set of matching patterns. It is reported [15] that the data amount is huge for vibration responses detected by ultra-weak FBG of each monitoring area under the excitation of passing trains. This often results in high time complexity and is expensive in terms of processing and storage costs to directly use the above methods to perform a similarity measure on the raw format of high-dimensional vibration responses of underground structures. Furthermore, it is difficult to completely avoid random outlier interference during data collection and transmission. Therefore, the results of the similarity measure based on any algorithm may significantly deviate from expectations if the raw signals are not carefully wrangled.

Feature extraction should be the most intuitive idea to solve the above problems. It can improve the effectiveness and efficiency of the similarity measure by maintaining the characteristics of the original signal in a smaller dimensionality. Compared with principal component analysis (PCA) [27], linear discriminant analysis (LDA) [28] and other linear feature extraction methods, manifold learning [29], restricted Boltzmann machine (RBM) [30], autoencoder (AE) [31], as typical representatives of non-linear feature extraction methods, can retain much richer sample features of high-dimensional vibration signals. High computational complexity is the bottleneck of manifold learning based on local domain classification and its feature extraction process is sensitive to noise [32]. Therefore, this method is not suitable for extracting the characteristics of the vibration responses of underground structures that cannot avoid noise interference. RBM and its derivative deep belief network [33] use the probability distribution rather than the real-valued sequence to express the characteristics of the hidden layer. These two methods for dimensionality reduction are not suitable for the similarity measure of real-valued sequences. The training of AE resembles that of the RBM. However, models of AE can be easier to train than that of RBM with contrastive divergence and are thus preferred in contexts where RBM training is less effective [34]. Adding a denoising process makes AE models substantially more robust to input variations or distortion, causing the deep network formed by a stacked denoising autoencoder (SDAE) with higher accuracy than that of the stacked autoencoder (SAE) [35,36]. Thus, the SDAE network is used to achieve feature extraction before the similarity measure in this paper. In the following second section, the implementation process of the proposed similarity measure is introduced in combination with typical one-dimensional datasets in the public UCR (University of California, Riverside) time-series data archives [37]. This part also illustrates the metrics used to evaluate the effectiveness of feature extraction and similarity measure. After that, based on the ultra-weak FBG vibration response of the actual underground structure, the feasibility and significance of the proposed similarity measure method in engineering are discussed.

2. Methodology and Implementation of Signal Similarity Measure

2.1. Overview of the Procedure for Signal Similarity Measure

As shown in Figure 1, the proposed method and performance test constituted the processing flow for the similarity measurement of one-dimensional time-series signals. The method in the left part of Figure 1 contained dimensionality reduction of the original sequence through feature extraction based on the SDAE network and distance measurement for the extracted feature sequences of equal length through the improved DTW algorithm. The silhouette coefficients and one nearest neighbor (1-NN) classifier in the right part of Figure 1 were used to evaluate the performances of feature extraction and distance measure, respectively.

For the general time series involving the similarity measure, the need for dataset partitioning and the purpose of each divided dataset are shown in Figure 2, which primarily included two parts, namely, the unsupervised learning through the SDAE network and the supervised learning through the 1-NN classifier. Since the cost of collecting and processing the distributed high-dimensional vibration responses is often expensive or even prohibitive, two datasets with data labels (CinCECGTorso and SemgHandMovementCh2 [38]) were selected from the UCR time series archives to help explain the implementation process. The two selected datasets have moderate sample sizes and relatively long sequence lengths, ensuring the operation feasibility of dimensionality reduction based on the SDAE network under acceptable computing overhead. Moreover, both the selected datasets and the vibration of interest face some negatives in common, such as redundant information and outliers, which should be overcome when measuring the similarity of sequences, although their appearance and type are varied. The default training set and test set ratio of each dataset in UCR databases are different. For each of the selected raw datasets used for the subsequent research, we first merged the training and test sets, then shuffled the samples, and finally set a uniform split ratio to form datasets A and B. Table 1 shows the final processing results, in which the ratio of datasets A to B was three. Note that other partitioning ratios were also acceptable as long as the dataset to be partitioned had a sufficient sample size to ensure corresponding algorithm training. The Python packages Tensorflow and Keras, as well as their libraries [39], were utilized to establish the SDAE network and calculate different distance metrics, in which the operation of the 1-NN classifier that can choose different distance measurement methods referred to in the work by Regan [40].

2.2. Feature Extraction Based on Stacked Denoising Autoencoder (SDAE) Network

As shown in the left part of Figure 2, feature extraction was primarily based on unsupervised training to shorten the sequence length in the second column of Table 1. Here, labels were just a supplement to fine-tune in the second training stage of the SDAE network, which was generally stacked by multiple three-layer DAE models. Figure 3 shows the structure of a typical SDAE network, which was formed by stacking three sub-DAE networks. Because the noise was actively added to the input data, hidden layers in such networks can retain more robust sample features during the learning process [41]. Here, greedy layer-wise training [42] that can boost the network learning efficiency was a preferred solution to conduct the pre-training process. In the first stage of feature extraction, the initial features of the input sample can be forcibly extracted through the unsupervised learning network. To obtain a better feature extraction effect, labels of the input sample were used to establish a classification output layer to perform a supervised training. Thereafter, a feature extraction model based on the SDAE network can be obtained through training the dataset A in Table 1. When a new sample dataset B was fed into the trained model, the feature representation of the last hidden layer can be regarded as reduced-dimensional features of the original input.

2.3. Dimensionality Reduction Evaluation with Silhouette Coefficients

Through the above processing, a sequence having a length shorter than that of the original sequence listed in the second column in Table 1 can be obtained. It was straightforward that the effect and rationality of dimensionality reduction needed to be evaluated, indicating that the hyperparameter settings of the SDAE network should be assessed. Ideally, the feature vector generated due to dimensionality reduction should be able to represent the category information of the original sample to the greatest extent, namely, good feature extraction results should make the dimensionality reduction sequences belong to the same category closer, and the distance between the dimensionality reduction sequences belong to different categories farther. Silhouette coefficients [43] described as Equation (1) provide a single value measuring both the above two traits.

s_{i} = \frac{b_{i} - a_{i}}{\max (a_{i}, b_{i})}

(1)

where

s_{i}

is the silhouette coefficient for observation

i

,

a_{i}

is the mean distance between

i

and all observations of the same class, and

b_{i}

is the mean distance between

i

and all observations from the different classes. Silhouette coefficients range between −1 and 1, with 1 indicating dense, well-separated different categories. Therefore, the mean silhouette coefficient for all observations can be used to evaluate the impact of the selection of various key hyperparameters on the performance of feature extraction based on the SDAE network for the two UCR datasets in Table 1. The operation based on grid search [44] combined with cross-validation [45] can guarantee to find the most accurate set of hyperparameter settings within a specified range, but it required iterating through all possible parameter combinations, which was very time-consuming in the face of large datasets and multiple parameters of interest. Another feasible option was to optimize the hyperparameter set step by step. Considering the characteristics of the SDAE network, the key hyperparameters that are usually concerned are the number of network layers, the number of hidden layer nodes, and the noise level [41]. As shown in Figure 4, the number of hidden layers of the SDAE network can be firstly determined by the mean silhouette coefficient. Here, in the training process, the adaptive moment estimation optimizer [46] was used to search the right learning rate automatically, and the maximum number of training epochs can be controlled based on the early stopping [47] technique. Other hyperparameters under the different number of hidden layers were derived through trial-and-error under the control of maximum number of training epochs and minimum reconstruction errors. As shown in Figure 4, when the number of hidden layers of the datasets CinCECGTorso and SemgHandMovementCh2 were set to two and three, respectively, the node number of the last hidden layer that reflected the dimensionality reduction effect can be further analyzed. Here, the hyperparameter configurations determined through trial and error in the previous step were used as the initial settings for the next tuning step and the key hyperparameter determined in the previous step remained constant in the subsequent tuning step. As shown in Figure 5, the number of nodes in the last hidden layer was expressed as a percentage of the original sequence length. After the number of hidden layers and the number of nodes in the last hidden layer were determined in turn, the reasonable value of the denoising coefficient [48] in the input layer of the SDAE network can be discussed. Figure 6 gave the relationship between different denoising coefficient and corresponding silhouette coefficient based on the tuning strategy of hyperparameters mentioned above. Hence, based on the hyperparameters determined by the maximum mean silhouette coefficients, the network structures of the SDAE for the two selected datasets used to obtain the dimensionality reduction sequences can be established.

After the above step-by-step tuning of hyperparameters, the optimal mean silhouette coefficients of the datasets CinCECGTorso and SemgHandMovementCh2 with the feature extraction dimensions reduced to 20% of the original sequence length were 0.455 and 0.432, respectively. Figure 7 further shows that the capability of SDAE-based feature extraction was significantly better than that of other methods, although all the silhouette coefficient results did not exceed 0.5. Here, various comparison methods maintained a unified feature dimension reduction scale. Therefore, the SDAE network training process used for feature extraction for dataset B listed in Table 1 in this section was the premise for the subsequent similarity measure of reducing dimensionality sequences.

2.4. Distance Measure Based on Improved Dynamic Time-Warping (DTW) Algorithm

Since the features were extracted as dimensionality reduction sequences of equal length, the impact of high time complexity and low calculation efficiency can be effectively avoided when measuring distance based on the DTW algorithm. Although the reported window-based constraint methods have some positive effects on avoiding the DTW’s matching path from falling into the suboptimum under certain circumstances, improvements against the influences of undesired warping [49] still deserve attention. Based on the DTW with a constraint of Sakoe–Chubaband [50] (hereinafter abbreviated as SDTW), warping offset distance (

d_{WOD}

) was defined in the proposed improved DTW algorithm to further mitigate the effects of undesired warping. The defined

d_{WOD}

was the area between the optimal matching path and the diagonal path under the SDTW algorithm. As shown in Figure 8, these two paths were derived from the distance matrix

D

of two equal-length sequences after feature extraction, and the

d_{WOD}

described in Equation (2) can be shown as the cumulative sum of the differences between each point on the optimal matching path and each corresponding point to the unbiased state. By aligning the feature points of two sequences processed by the SDAE network, this method not only ensured that the matching path can recognize the slight warping of the time axis but also realized the constraint on the length of the matching path. Detailed definition of the distance matrix of DTW and the searching method of the optimal matching sequence based on dynamic programming can be found in [51]:

d_{WOD} = \sum_{i = 1}^{m} | w_{i} - d i a (i) |

(2)

where

w_{i}

and

d i a (i)

represent the i-th point in the optimal matching path and the i-th point in the diagonal of the distance matrix

D

, respectively. The sum of

d_{WOD}

and the distance based on the SDTW (

d_{SDTW}

) was used as the distance metric of the improved DTW algorithm in Equation (3) and therefore

d_{similarity}

was regarded as the result of the similarity measure:

d_{similarity} = d_{SDTW} + d_{WOD}

(3)

2.5. Similarity Measure Evaluation with One Nearest Neighbor (1-NN) Classifier

The bandwidth

r

defines the constraint range of the matching path in the distance matrix and suppresses the influence of undesired convergence in the matching path [52]. Because there was a correlation between the defined warping offset distance and the SDTW algorithm, as well as the SDTW-based distance and the constraint bandwidth

r

, different

r

not only affected the optimal matching path of the SDTW but also led to the change of

d_{similarity}

. The

r

determined the efficacy of the proposed similarity measurement method. It was reported that the 1-NN classifier on labeled data was a feasible way to evaluate the efficacy of the selected distance metric and its classification accuracy directly reflected the effectiveness of the similarity measure [53]. Moreover, the 1-NN classifier can be used to search for a proper

r

and the idea was to train a labeled dataset with different bandwidth constraints based on two distance metrics

d_{SDTW}

,

d_{WOD}

, respectively. Then, two sets of classification error rates

E_{SDTW} (r)

and

E_{WOD} (r)

at different

r

through the 1-NN classifier model can be derived. We defined

E_{SUM}

as the sum of

E_{SDTW} (r)

and

E_{WOD} (r)

and the constraint bandwidth

r

that minimized

E_{SUM}

was considered to be the appropriate choice for calculating

d_{similarity}

.

Figure 9 depicted the possibly typical variation of

E_{SUM}

at different

r

. For cases Ⅰ and Ⅳ, it was easy to determine the appropriate

r

based on the minimum

E_{SUM}

. For case Ⅱ, it can be considered that the constraint bandwidth did not affect the distance measured by the SDTW algorithm, and the first

r

corresponding to the minimum can be seen as the candidate. For the situation in case Ⅲ that multiple candidate values within the convergence region corresponded to the same minimum value

E_{SUM}

, the median of these candidate values was selected as

r

. Here, the general rules for determining and adjusting the preset range for

r

can refer to [52].

According to the data-processing procedure in the right part of Figure 2, the dataset B listed in Table 1 was further divided into sub-training and sub-test sets after the dimension reduction through the SDAE network. The dataset information used for the supervised learning of the 1-NN classifier was given in Table 2. Here, the sample size of the test set was made significantly larger than that of the training set according to the ratio commonly adopted in the dataset sheet of UCR archives [38]. First, the best

r

was searched based on the training results of the 1-NN classifier under two distance metrics. Figure 10 showed the variation of the classification error rate

E_{SUM}

of two datasets with respect to

r

after the dimensionality reduction in Table 2, and

r

for datasets of CinCECGTorso and SemgHandMovementCh2 should choose 2 and 3, respectively. Next, the defined

d_{similarity}

under the specified

r

was used as the distance metric of the 1-NN classifier to perform supervised training on the sub-training set. Also, other distance metrics can be applied in the 1-NN classifier to train the sub-training set. Furthermore, the performance evaluation of the similarity measure can be transformed into a comparison of the classification error rate of the 1-NN classifier under different distance measures. The generalization capacities of the 1-NN classifier with different distance metrics were compared in Figure 11 for the sub-test set by the classification error rate. The bar distribution reflected that the distance based on the improved DTW had lower classification error rates for the two sub-test datasets than that of the other distance measure functions, which also meant that the proposed distance metric was more suitable for similarity evaluation.

3. Similarity Measurement for On-Site Vibration Monitoring

3.1. Vibration Sequences Acquisition and Preparation

As shown in Figure 12, the regular moving loads caused by subway trains can be regarded as a vibration source. Owing to such excitation, the surface waves propagate omni-directionally on the ground. Because the surface wave couples to the track bed and subway rail, a distributed sensing optic fiber mounted beside the rail along the on-site monitoring area can detect the vibration and can be used to establish the vibration database for each monitoring zone. Figure 13 showed part of the actual engineering scenario in the subway tunnel. The monitoring area of interest covered three underground stations with a total length of nearly three kilometers. According to the spatial resolution of the sensing optic fiber and the on-spot layout of the tunnel structure, more than 500 vibration regions along the track bed can be distinguished based on the interrogated address of the light interference [54]. Here, the repeatability of the demodulator was revealed in [55] and the layout of the monitoring system can refer to [15]. When a train passed, the real-time vibration response triggered in each monitoring zone was fully transmitted back to the platform monitoring center at a sampling rate of 1 kHz and processed by the demodulator and servers. Therefore, the database of vibration response caused by passing train can be established for each monitoring zone and the location code of the monitoring area can be used as a unique label of each vibration sequence database.

Figure 14 demonstrates the typical vibration responses of a track bed area due to a passing subway train. The triggered vibration responses of each monitoring area automatically recorded due to the passing of the train were basically within 12 s [15]. The characteristics of the vibration response were mainly composed of pulses with a duration of about 9 s caused by the axle weight. To meet the requirement that the node number of the input samples in the SDAE network must be consistent, the main vibration characteristics caused by the action of the train axle in each sample were retained. The sampling points at both ends of the vibration response were then truncated to match the minimum sequence length of the vibration response. Finally, min-max normalization [56] was used to normalize all the vibration amplitudes to the range of 0~1, which can boost a better learning efficiency for the SDAE network.

3.2. Result Analysis and Discussion

Considering the processing power of current experimental hardware that was composed of a graphics processing unit (GPU) core (GTX 1080 Ti) with twelve 2.20 GHz processors (Intel Xeon E5-2650 v4), we collected the distributed vibration responses caused by 100 passing trains within 2 h in the subway, aiming at three randomly selected monitoring zones labeled #130, #135 and #145 to perform similarity measurement through training the SDAE network and searching the optimal constraint bandwidth. The three selected zones belong to the common track bed in the same traveling area and the ultra-weak FBG sensors used to detect vibration were installed with the consistent craft [15]. First of all, all samples were truncated into the sequence with 10,000 dimensionalities and processed by the min-max normalization. Based on the step-by-step parameter tuning, the most appropriate SDAE network structure assessed by the silhouette coefficient is shown in Figure 15, which set the denoising coefficient to 0.2, contained 5 hidden layers and reduced the input length from 10,000 to 600. The constraint bandwidth was then set to 13 by the 1-NN classifier training. For each set of candidate hyperparameters, it took approximately 2–2.5 h to perform the task on feature extraction and bandwidth search of the 100 groups of datasets of the three monitoring zones.

After completing the two-step training of feature extraction of the SDAE network and distance measurement of the improved DTW algorithm, the established model can be applied to calculate the similarity of new samples. In another subway operation period, three groups of vibration responses of the #130, #135, and #145 monitoring zones caused by passing trains were collected and used to verify the proposed method of measurement similarity.

Figure 16 shows similarity measures between two vibration sequences related to the monitoring zone and the passing train. The left side of the dotted boundary in the bar graph displays the similarities between each pair of vibration responses of the same monitoring zone under different passing trains, while the right side of the boundary shows the similarities of different monitoring areas passed by the same train. Here, the subscripts A, B, and C of the monitoring area numbers in each bar column indicate the different passing trains, and two related monitoring area labels for measuring distance are connected by the symbol ‘&’. Obviously, the threshold of the two comparison types represented by the distance derived from the improved DTW algorithm can be identified, and it was about 800 με. The distance unit here depended on the vibration signal denoted by the strain-induced phase variation between two ultra-weak FBGs [10]. Because the results in the left part of Figure 16 are all below the threshold, it is quantitatively revealed that the similarity of the vibration response at the same physical location in the underground structure is significantly higher than the measurement results between different locations. Moreover, by using the mean distance based on the improved DTW algorithm, the similarities for monitoring zones #130, #135 and #145 can be determined as 700 με, 656 με and 756 με, respectively. The quantitative outcomes based on distance measure not only indicated that the similarity of the structural vibration changed with the location of the underground structure but also revealed that the proposed method can effectively quantify and distinguish such similarity difference. Hence, the condition change of the surrounding structure and environment can be tracked by similarities of structural vibration detected by distributed ultra-weak FBG array.

4. Conclusions

This study proposed a similarity measure method to quantify the distributed vibration responses of underground structures, which involved feature extraction by the SDAE network and distance measurement by the improved DTW algorithm. Combining two datasets of one-dimensional time series from UCR archives, the detailed implementation processes for the similarity measure were introduced, and the advantages of feature extraction and distance measure in the proposed method were revealed according to algorithm comparisons. Considering the current processing capabilities of the experimental hardware, the size of the field dataset used to train the SDAE network was limited, but the subsequent experimental outcomes on distance measure still agreed well with the expected cognition. The prediction results of similarity based on the modeling of 100 groups of vibration sequences in three monitoring zones on the subway site demonstrated that the vibration similarity of the same monitoring zone was significantly higher than that from different ones. Moreover, the similarities of distributed vibration closely related to the physical location of the underground structure can be distinguished effectively by the improved DTW distance, demonstrating that the proposed method assisted with the distributed vibration detected by the ultra-weak FBG array is promising for quantifying structural status and locating structural anomalies.

Author Contributions

Conceptualization, S.L.; Data curation, X.Z.; Funding acquisition, S.L.; Investigation, S.L.; Methodology, X.Z.; Project administration, H.W.; Resources, Z.L.; Software, H.W.; Supervision, L.S.; Validation, Z.L.; Writing—original draft, X.Z.; Writing—review and editing, S.L. and L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China grant numbers 61875155 and 61735013.

Acknowledgments

The research work reported in this paper was supported by the National Engineering Laboratory for Fiber Optic Sensing Technology, Wuhan University of Technology, and the Smart Nanocomposites Laboratory, University of California, Irvine.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hong, C.; Zhang, Y.; Li, G.; Zhang, M.; Liu, Z. Recent progress of using Brillouin distributed fiber optic sensors for geotechnical health monitoring. Sens. Actuators A Phys. 2017, 258, 131–145. [Google Scholar] [CrossRef]
Lu, P.; Lalam, N.; Badar, M.; Liu, B.; Chorpening, B.T.; Buric, M.P.; Ohodnicki, P.R. Distributed optical fiber sensing: Review and perspective. Appl. Phys. Rev. 2019, 6, 041302. [Google Scholar] [CrossRef]
Thévenaz, L. Review and progress in distributed fiber sensing. In Proceedings of the Optical Fiber Sensors, OFS 2006, Cancun, Mexico, 23 October 2006. [Google Scholar]
Yang, M.; Li, C.; Mei, Z.; Tang, J.; Guo, H.; Jiang, D. Thousand of fiber grating sensor array based on draw tower: A new platform for fiber-optic sensing. In Proceedings of the Optical Fiber Sensors, OFS 2018, Lausanne, Switzerland, 24–28 September 2018. [Google Scholar]
Muanenda, Y. Recent advances in distributed acoustic sensing based on phase-sensitive optical time domain reflectometry. J. Sens. 2018, 2018, 3897873. [Google Scholar] [CrossRef]
Liu, X.; Jin, B.; Bai, Q.; Wang, Y.; Wang, D.; Wang, Y. Distributed fiber-optic sensors for vibration detection. Sensors 2016, 16, 1164. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Lai, J.; Qiu, J.; Fan, H.; Zhang, Q.; Hu, Z.; Wang, J.; Chen, J. Fiber Bragg grating sensors-based in situ monitoring and safety assessment of loess tunnel. J. Sens. 2016, 2016, 8658290. [Google Scholar] [CrossRef] [Green Version]
Pamukcu, S.; Cheng, L.; Pervizpour, M. Chapter 1—Introduction and overview of underground sensing for sustainable response. In Underground Sensing. Monitoring and Hazard Detection for Environment and Infrastructure, 1st ed.; Pamukcu, S., Cheng, L., Eds.; Academic Press: Cambridge, MA, USA, 2018; pp. 1–42. [Google Scholar]
Gong, H.; Kizil, M.S.; Chen, Z.; Amanzadeh, M.; Yang, B.; Aminossadati, S.M. Advances in fibre optic based geotechnical monitoring systems for underground excavations. Int. J. Min. Sci. Technol. 2017, 29, 229–238. [Google Scholar] [CrossRef]
Gan, W.; Li, S.; Li, Z.; Sun, L. Identification of ground intrusion in underground structures based on distributed structural vibration detected by ultra-weak FBG sensing technology. Sensors 2019, 19, 2160. [Google Scholar] [CrossRef] [Green Version]
Yang, M.; Bai, W.; Guo, H.; Wen, H.; Yu, H.; Jiang, D. Huge capacity fiber-optic sensing network based on ultra-weak draw tower gratings. Photonic Sens. 2016, 6, 26–41. [Google Scholar] [CrossRef] [Green Version]
Zhou, L.; Li, Z.; Xiang, N.; Bao, X. High-speed demodulation of weak fiber Bragg gratings based on microwave photonics and chromatic dispersion. Opt. Lett. 2018, 43, 2430–2433. [Google Scholar] [CrossRef]
Li, Z.; Tong, Y.; Fu, X.; Wang, J.; Guo, Q.; Yu, H.; Bao, X. Simultaneous distributed static and dynamic sensing based on ultra-short fiber Bragg gratings. Opt. Express 2018, 26, 17437–17446. [Google Scholar] [CrossRef]
Bai, W.; Yang, M.; Hu, C.; Dai, J.; Zhong, X.; Huang, S.; Wang, G. Ultra-weak fiber Bragg grating sensing network coated with sensitive material for multi-parameter measurements. Sensors 2017, 17, 1509. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Nan, Q.; Li, S.; Yao, Y.; Li, Z.; Wang, H.; Wang, L.; Sun, L. A novel monitoring approach for train tracking and incursion detection in underground structures based on ultra-weak FBG sensing array. Sensors 2019, 19, 2666. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Moughty, J.J.; Casas, J.R. A state of the art review of modal-based damage detection in bridges: Development, challenges, and solutions. Appl. Sci. 2017, 7, 510. [Google Scholar] [CrossRef] [Green Version]
Amezquita-Sanchez, J.P.; Adeli, H. Signal processing techniques for vibration-based health monitoring of smart structures. Arch. Comput. Method Eng. 2016, 23, 1–15. [Google Scholar] [CrossRef]
Abdeljaber, O.; Avci, O.; Kiranyaz, M.S.; Gabbouj, M.; Inman, D.J. Real-time vibration-based structural damage detection using one-dimensional convolutional neural networks. J. Sound Vib. 2017, 388, 154–170. [Google Scholar] [CrossRef]
Wang, T.; Han, Q.; Chu, F.; Feng, Z. Vibration based condition monitoring and fault diagnosis of wind turbine planetary gearbox: A review. Mech. Syst. Signal Process. 2019, 126, 662–685. [Google Scholar] [CrossRef]
Faloutsos, C.; Ranganathan, M.; Manolopoulos, Y. Fast subsequence matching in time-series databases. In Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, USA, 24–27 May 1994. [Google Scholar]
Yi, B.K.; Faloutsos, C. Fast time sequence indexing for arbitrary L_p norms. In Proceedings of the 26th International Conference on Very Large Data Bases, VLDB 2000, Cairo, Egypt, 10–14 September 2000. [Google Scholar]
Liu, L.; Li, W.; Jia, H. Method of time series similarity measurement based on dynamic time warping. Comput. Mater. Contin. 2018, 57, 97–106. [Google Scholar] [CrossRef]
Riesen, K.; Schmidt, R. Online signature verification based on string edit distance. Int. J. Doc. Anal. Recognit. 2019, 22, 41–54. [Google Scholar] [CrossRef]
Gold, O.; Sharir, M. Dynamic time warping and geometric edit distance: Breaking the quadratic barrier. ACM Trans. Algorithms 2018, 14, 50. [Google Scholar] [CrossRef]
Yu, M.; Wang, J.; Li, G.; Zhang, Y.; Deng, D.; Feng, J. A unified framework for string similarity search with edit-distance constraint. VLDB J. 2017, 26, 249–274. [Google Scholar] [CrossRef]
Chen, X.; Huo, H.; Huan, J.; Vitter, J.S. An efficient algorithm for graph edit distance computation. Knowl. Based Syst. 2019, 163, 762–775. [Google Scholar] [CrossRef]
Shlens, J. A Tutorial on Principal Component Analysis. Available online: https://arxiv.org/abs/1404.1100 (accessed on 17 March 2020).
Izenman, A.J. Chapter 8–Linear Discriminant Analysis. In Modern Multivariate Statistical Techniques, 2nd ed.; Springer: New York, NY, USA, 2013; pp. 237–280. [Google Scholar]
Yan, J.; Sun, H.; Chen, H.; Junejo, N.U.R.; Cheng, E. Resonance-based time-frequency manifold for feature extraction of ship-radiated noise. Sensors 2018, 18, 936. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Liu, J.; Chen, F.; Wang, D. Data compression based on stacked RBM-AE model for wireless sensor networks. Sensors 2018, 18, 4273. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pathirage, C.S.N.; Li, J.; Li, L.; Hao, H.; Liu, W.; Ni, P. Structural damage identification based on autoencoder neural networks and deep learning. Eng. Struct. 2018, 172, 13–28. [Google Scholar] [CrossRef]
Seyyedsalehi, S.Z.; Seyyedsalehi, S.A. Simultaneous learning of nonlinear manifolds based on the bottleneck neural network. Neural Process. Lett. 2014, 40, 191–209. [Google Scholar] [CrossRef]
Yang, H.; Shen, S.; Yao, X.; Sheng, M.; Wang, C. Competitive deep-belief networks for underwater acoustic target recognition. Sensors 2018, 18, 952. [Google Scholar] [CrossRef] [Green Version]
Hearty, J. Advanced Machine Learning with Python; Packt Publishing: Birmingham, UK, 2016; pp. 57–61. [Google Scholar]
Fan, Z.; Bi, D.; He, L.; Shiping, M.; Gao, S.; Li, C. Low-level structure feature extraction for image processing via stacked sparse denoising autoencoder. Neurocomputing 2017, 243, 12–20. [Google Scholar] [CrossRef]
Xia, M.; Li, T.; Liu, L.; Xu, L.; de Silva, C.W. Intelligent fault diagnosis approach with unsupervised feature learning by stacked denoising autoencoder. IET Sci. Meas. Technol. 2017, 11, 687–695. [Google Scholar] [CrossRef]
Dau, H.A.; Bagnall, A.; Kamgar, K.; Yeh, C.M.; Zhu, Y.; Gharghabi, S.; Ratanamahatana, C.A.; Keogh, E. The UCR time series archive. IEEE/CAA J. Autom. Sin. 2019, 6, 1293–1305. [Google Scholar] [CrossRef]
Chen, Y.; Keogh, E.; Hu, B.; Begum, N.; Bagnall, A.; Mueen, A.; Batista, G. The UCR Time Series Classification Archive. Available online: https://www.cs.ucr.edu/~eamonn/time_series_data/ (accessed on 10 February 2020).
Keras. Available online: http://github.com/keras-team/keras (accessed on 4 April 2020).
Regan, M. K Nearest Neighbors with Dynamic Time Warping. Available online: https://github.com/markdregan/K-Nearest-Neighbors-with-Dynamic-Time-Warping (accessed on 4 April 2020).
Vincent, P.; Larochelle, H.; Lajoie, I.; Bengio, Y.; Manzagol, P. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 2010, 11, 3371–3408. [Google Scholar]
Bengio, Y.; Lamblin, P.; Popovici, D.; Larochelle, H. Greedy layer-wise training of deep networks. In Proceedings of the 20th Annual Conference on Neural Information Processing Systems, NIPS 2006, Vancouver, BC, Canada, 4–7 December 2006. [Google Scholar]
Aranganayagi, S.; Thangavel, K. Clustering categorical data using silhouette coefficient as a relocating measure. In Proceedings of the International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2007, Sivakasi, Tamil Nadu, India, 13–15 December 2007. [Google Scholar]
Hackeling, G. Mastering Machine Learning with Scikit-Learn, 1st ed.; Packt Publishing: Birmingham, UK, 2014; pp. 84–86. [Google Scholar]
Browne, M.W. Cross-validation methods. J. Math. Psychol. 2000, 44, 108–132. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. Available online: https://arxiv.org/abs/1412.6980 (accessed on 18 March 2020).
Caruana, R.; Lawrence, S.; Giles, L. Overfitting in neural nets: Backpropagation, conjugate gradient, and early stopping. In Proceedings of the 14th Annual Neural Information Processing Systems Conference (NIPS 2000), Denver, CO, USA, 27 November–2 December 2000. [Google Scholar]
Poole, B.; Sohl-Dickstein, J.; Ganguli, S. Analyzing Noise in Autoencoder and Deep Networks. Available online: https://arxiv.org/abs/1406.1831 (accessed on 18 March 2020).
Chen, Q.; Hu, G.; Gu, F.; Xiang, P. Learning optimal warping window size of DTW for time series classification. In Proceedings of the 2012 11th International Conference on Information Science, Signal Processing and their Application, ISSPA 2012, Montreal, QC, Canada, 2–5 July 2012. [Google Scholar]
Sakoe, H.; Chiba, S. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 1978, 26, 43–49. [Google Scholar] [CrossRef] [Green Version]
Senin, P. Dynamic Time Warping Algorithm Review. Available online: http://seninp.github.io/assets/pubs/senin_dtw_litreview_2008.pdf (accessed on 18 March 2020).
Dau, H.A.; Silva, D.F.; Petitjean, F.; Forestier, G.; Bagnall, A.; Mueen, A.; Keogh, E. Optimizing dynamic time warping’s window width for time series data mining applications. Data Min. Knowl. Discov. 2018, 32, 1074–1120. [Google Scholar] [CrossRef] [Green Version]
Ding, H.; Trajcevski, G.; Scheuermann, P.; Wang, X.; Keogh, E. Querying and mining of time series data: Experimental comparison of representations and distance measures. Proc. VLDB Endow. 2008, 1, 1542–1552. [Google Scholar] [CrossRef] [Green Version]
Luo, Z.; Wen, H.; Guo, H.; Yang, M. A time- and wavelength-division multiplexing sensor network with ultra-weak fiber Bragg gratings. Opt. Express 2013, 21, 22799–22807. [Google Scholar] [CrossRef] [Green Version]
Tong, Y.; Li, Z.; Wang, J.; Wang, H.; Yu, H. High-speed Mach-Zehnder-OTDR distributed optical fiber vibration sensor using medium-coherence laser. Photonic Sens. 2018, 8, 203–212. [Google Scholar] [CrossRef] [Green Version]
Patro, S.G.K.; Sahu, K.K. Normalization: A Preprocessing Stage. Available online: https://arxiv.org/abs/1503.06462 (accessed on 18 March 2020).

Figure 1. Similarity measurement and evaluation process.

Figure 2. Functions of the dataset used for similarity measurement during implementation.

Figure 3. Schematic diagram of the stacked denoising autoencoder (SDAE) network stacking process.

Figure 4. Mean silhouette coefficients at different number of hidden layers.

Figure 5. Mean silhouette coefficients at different reduction dimensionality.

Figure 6. Mean silhouette coefficients at different denoising coefficients.

Figure 7. Comparison of different dimensionality reduction methods.

Figure 8. Warping offset distance expressed by the diagram of the DTW distance matrix.

Figure 9. Typical variation of the sum of classification error rates at different constrains.

Figure 10. Variation of the defined error of two datasets with different constraint bandwidths.

Figure 11. Performance comparison of 1-NN classifier under different distance metrics.

Figure 12. The processing sketch from ultra-weak fiber Bragg grating (FBG) array to distributed vibration.

Figure 13. Field layout of ultra-weak FBG sensing cable used for detecting distributed vibration.

Figure 14. Typical vibration response of a monitoring zone caused by a passing train.

Figure 15. Hidden layers schematic of the SDAE network used for processing vibration responses.

Figure 16. Similarity comparison based on the improved DTW distance.

Table 1. Experimental datasets for validation method.

Name of Datasets	Sequence Length	Size of the Dataset A	Size of the Dataset B	Number of Labels
CinCECGTorso	1639	1050	350	4
SemgHandMovementCh2	1500	675	225	6

Table 2. Datasets for searching the constraint bandwidth and evaluating the distance metrics.

Name of Datasets	Sequence Length	Size of the Sub-Training Set	Size of the Sub-Test Set	Number of Labels
CinCECGTorso	164	50	300	4
SemgHandMovementCh2	150	25	200	6

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, S.; Zuo, X.; Li, Z.; Wang, H.; Sun, L. Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures. Sensors 2020, 20, 2179. https://doi.org/10.3390/s20082179

AMA Style

Li S, Zuo X, Li Z, Wang H, Sun L. Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures. Sensors. 2020; 20(8):2179. https://doi.org/10.3390/s20082179

Chicago/Turabian Style

Li, Sheng, Xiang Zuo, Zhengying Li, Honghai Wang, and Lizhi Sun. 2020. "Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures" Sensors 20, no. 8: 2179. https://doi.org/10.3390/s20082179

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Combining SDAE Network with Improved DTW Algorithm for Similarity Measure of Ultra-Weak FBG Vibration Responses in Underground Structures

Abstract

1. Introduction

2. Methodology and Implementation of Signal Similarity Measure

2.1. Overview of the Procedure for Signal Similarity Measure

2.2. Feature Extraction Based on Stacked Denoising Autoencoder (SDAE) Network

2.3. Dimensionality Reduction Evaluation with Silhouette Coefficients

2.4. Distance Measure Based on Improved Dynamic Time-Warping (DTW) Algorithm

2.5. Similarity Measure Evaluation with One Nearest Neighbor (1-NN) Classifier

3. Similarity Measurement for On-Site Vibration Monitoring

3.1. Vibration Sequences Acquisition and Preparation

3.2. Result Analysis and Discussion

4. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI