A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM

Jiang, Chenhui; Zhou, Qifeng; Lei, Jiayan; Wang, Xinhong

doi:10.3390/app122010394

Open AccessArticle

A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM

¹

Department of Automation, Xiamen University, Xiamen 361005, China

²

Department of Civil Engineering, Xiamen University, Xiamen 361005, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(20), 10394; https://doi.org/10.3390/app122010394

Submission received: 31 August 2022 / Revised: 30 September 2022 / Accepted: 9 October 2022 / Published: 15 October 2022

(This article belongs to the Special Issue Machine Learning in Vibration and Acoustics)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Deep learning has been applied to structural damage detection and achieved great success in recent years, such as the popular structural damage detection methods based on structural vibration response and convolutional neural networks (CNN). However, due to the limited number of vibration response samples that can be acquired in practice for damage detection, the CNN-based models may not be fully trained; thus, their performance for identifying different damage severity as well as the damage locations may be reduced. To solve this issue, in this paper, we follow the strategy of "divide-and-conquer" and propose a two-stage structural damage detection method. Specifically, in the first stage, a 1D-CNN model is constructed to extract the damage features automatically and identify the damage locations. In the second stage, a support vector machine (SVM) model and wavelet packet decomposition technique are combined to further quantify the damage. Experiments are conducted on an eight-level steel frame structure, and the accuracy of the experimental results is greater than 99%, which demonstrates the superiority of the proposed method compared to the state-of-the-art approaches.

Keywords:

structural damage detection; convolutional neural networks; support vector machine; multi-level damage classification

1. Introduction

Structural damage is inevitable and more likely to happen when a variety of mechanical or environmental elements are present. Structural damage can decrease the life of a structure and threaten people’s safety. Establishing a structural health monitoring (SHM) system, which is also crucial for enhancing structural reliability and safety and lowering maintenance costs, is an efficient solution to solve this issue [1]. SHM is a multidisciplinary research field that includes experimental testing, system identification, data collecting and management, and long-term environmental data measurement [2,3]. The most important part of SHM is structural damage detection (SDD), which is a methodical, automated process for detecting damage, locating it, and determining its severity [4].

SDD begins with visual inspection, but visual inspection has numerous limitations. First, due to the generally high scale of civil construction, routine inspections are time-consuming and laborious. Second, the inspector’s specific knowledge is required for visual inspection. Third, load-bearing structures are frequently hidden behind flooring, ceilings, and other decorative materials, making visual inspection impossible [5]. With the continued advancement in the fields of structural health monitoring and structural damage detection, numerous strategies have been employed to identify, locate, and quantify structural damage to overcome the limitations of visual inspection [6,7,8].

Damage to a structure alters the structure’s mass and stiffness distribution, causing changes in the inherent frequency and vibration pattern [9]. Many approaches based on structural vibration response, which extract features from the vibration response and then determine the corresponding damage state, have been presented [10]. Some traditional machine learning algorithms have been applied to this field, with support vector machine (SVM) being one of the most classic. Lei et al. [11] proposed a method based on vibration statistical indicators and SVM, employing variance, regression coefficients, and cross-correlation function amplitude as feature vectors for SVM, and achieved good results on an eight-level steel frame structure. An enhanced Hilbert–Huang transform (HHT) and SVM-based structural damage detection technique is proposed by Diao et al. [12]. The structural vibration function’s Hilbert spectral energy is obtained by decomposing the vibration signal using improved empirical modal decomposition. Then, the structural damage feature vector is constructed, and the damage location and severity are detected using SVM. The method’s efficacy is evaluated on an experimental model of an offshore platform.

However, feature extractors based on traditional machine learning techniques require significant domain expertise, whereas deep-learning-based methods can automatically extract and select features from data, thus avoiding the need for the manual design of feature extraction methods and reducing workload [13]. In the field of structural health monitoring based on vibration response, one-dimensional convolutional neural networks (1D-CNN) are very popular deep learning methods; 1D-CNN takes time series directly as input and conducts one-dimensional convolution on the time axis. Ma et al. [14] used 1D-CNN to detect damage in a steel beam numerical model. According to their results, 1D-CNN based on acceleration signals could detect 94.1% of the damage. Wang et al. [15] used the time-frequency graph of the damaged signal after HHT transform and the marginal spectrum of the signal as the input of CNN and optimized the parameters of CNN with particle swarm optimization (PSO) to improve the model performance. Xiao et al. [16] proposed an improved denoising auto-encoder-based neural network and optimized it by using gray relation analysis. It is capable of automatically extracting high-level features from the original signal by multi-layer extraction and can achieve high accuracy in noisy environments. In addition, numerous works have demonstrated that 1D-CNN outperforms traditional machine learning methods in structure damage detection.

In applications of SHM, different users have different requirements. For example, a house owner only needs to know the location of damage to his house to contact the maintenance staff, while the maintenance staff needs to know more details about the damage, including the location and severity of the damage to make better repairs. Therefore, it makes sense to design a multi-stage structural damage detection method to save computational costs.

To enhance both the model performance for detecting the damage location and the damage severity, in this paper, we propose a two- stage structural damage detection method in which the selection of a classifier for each stage is very important. We choose 1D-CNN to detect the damage location because 1D-CNN has been widely adopted in the field of SDD with good results. It was also discovered in [17] that 1D-CNN is more accurate for identifying the damage location than the damage severity. This is because the variety of structural vibration responses on different damage locations is much larger than that of the damage severity. In addition, there is a very small difference between the vibration response corresponding to different damage severity at the same location. In this case, a CNN-based model may need more complex structures and more training data. Therefore, after detecting the damage location, we can choose another method that may produce better results with fewer samples and classes to detect the damage severity instead of detecting the damage location and severity at the same time.

SVM is a strong classification machine for small-scale sample learning problems. It is a sparse kernel decision machine that avoids computing posterior probabilities when building its learning model. SVM has been extensively used for classification, regression, novelty detection tasks, and feature reduction [18]. Compared with 1D-CNN, SVM is more suitable for damage severity identification, and the computational cost and required training samples of SVM are less than those of 1D-CNN.

In summary, the novelty and the main contributions of this paper are as follows:

We propose a new two-stage structural damage detection method which follows the strategy of "divide-and-conquer" to solve the problem of insufficient training data and enhance the model performance for multi-level structural damage detection.
Our method fully combines the advantages of 1D-CNN and SVM, reducing computational costs and eliminating the need to rely on expertise to design complex feature extraction methods.
We verify the proposed model on an eight-level steel frame structure. The experimental results show that the proposed method outperforms the state-of-the-art methods in terms of both damage location detection and damage severity detection.

2. Methods

The framework of the proposed two-stage structural damage detection method based on 1D-CNN and SVM is shown in Figure 1. After data preprocessing, the samples are identified using a two-stage approach. In the first stage, the samples are classified according to the damage location using 1D-CNN. In the second stage, the frequency domain features of the samples are extracted using wavelet packet decomposition. Then, the feature vectors are learned using support vector machine, and the damage severity of the samples can be obtained.

2.1. 1D-CNN

In this work, 1D-CNN is adopted as the classifier for the first stage. Using 1D-CNN to extract features from time series is a natural way. Compared with the traditional approaches, 1D-CNN directly takes a one-dimension time series as input, without the need to understand the physical meaning contained in the time series. Therefore, we can construct a 1D-CNN model to automatically extract rich features from structural vibration response and then classify the structural damage locations. The basic 1D-CNN model includes three parts: convolutional layer, pooling layer, and fully connected layer. The 1D-CNN in practical applications contains more components to improve the model’s performance. The 1D-CNN model constructed in this work includes convolutional layers, pooling layers, global average pooling layers, dropout layers, fully connected layers, and softmax output layers.

2.1.1. Convolutional Layer

The convolutional layer uses a time window sliding along the time axis direction of the time series to obtain a set of subsequences and then multiplies each subsequence with the kernel element-by-element to obtain the convolution result, as shown in Equation (1). The convolutional layer has three characteristics: sparse weights, parameter sharing, and equal variation [19]. These properties significantly reduce the model’s memory cost and improve the model’s ability to extract data features automatically.

y (k) = \sum_{i = 0}^{N} h (k - i) u (i),

(1)

where h represents the subsequence, u (i) represents the kernel, y represents the output signal, k represents the index of the subsequence, and N represents the length of the kernel.

ReLU [20] is chosen as the activation function in the convolutional layer, which has less computational overhead and faster computation than activation functions such as Sigmoid. The formula of ReLU is as in Equation (2). When the input x of ReLU is non-negative, the output result is x. When x is less than 0, the output result is 0.

R e L U (x) = \{\begin{matrix} x, & x \geq 0 \\ 0, & x < 0 \end{matrix}

(2)

2.1.2. Pooling Layer

The pooling layer can reduce the output feature dimension of the convolutional layer by downsampling [21]. In this paper, the max pooling method is used to reduce the feature dimension, which takes the maximum value in the neighborhood as the representation of the neighborhood. In this paper, we utilize global average pooling to compress the high-dimensional feature vector output from the convolutional layer to one dimension vector as the input of the subsequent model. Global average pooling aggregates the feature information of each dimension, which is more robust to the noise in the feature vector [22].

2.1.3. Droput Layer

Dropout is an effective tool for solving the overfitting problem [23]. Dropout operation is applied to the output of the global average pooling layer. Dropout is based on randomly masking some units during training and enabling them during validation, which can effectively improve the performance of CNN.

2.1.4. Full Connected Layer

After several layers of convolution and pooling, all information needs to be integrated from the hidden feature space using the fully connected layer to complete the damage detecting task.

2.2. SVM

SVM is a very effective machine learning technique widely used in classification, regression, anomaly detection, and other learning tasks [24,25]. Given the training samples and labels

x_{i} \in R^{n}, y_{i} \in {- 1, + 1}, i = 1, \dots, m

, SVM can solve the following optimization problem:

\begin{matrix} \begin{matrix} min_{w, b, ξ_{i}} & \frac{1}{2} {∥ w ∥}^{2} + C \sum_{i = 1}^{m} ξ_{i} \\ s . t . & y_{i} (w^{T} x_{i} + b) ⩾ 1 - ξ_{i}, \\ ξ_{i} ⩾ 0, i = 1, 2, \dots, m, \end{matrix} \end{matrix}

(3)

where w represents the weight and y is the sample label,

ξ_{i}

is the relaxation variable, C is the penalty coefficient, m is the number of samples, and b is the bias. SVM finds a linear partitioned hyperplane with maximum margin in a high-dimensional space. By solving this optimization problem for

w

, b, and

ξ_{i}

, the optimal hyperplane which can be used to classify samples is obtained. This optimization problem can be solved with the help of the Lagrange multiplier method or quadratic programming. However, this optimization problem is usually complicated and needs to be solved with the help of the dual problem. The dual form is given by Equation (4).

\begin{matrix} \begin{matrix} max_{α} & \sum_{i = 1}^{m} α_{i} - \frac{1}{2} \sum_{i = 1}^{m} \sum_{j = 1}^{m} α_{i} α_{j} y_{i} y_{j} x_{i}^{T} x_{j} \\ s . t . & \sum_{i = 1}^{m} α_{i} y_{i} = 0, \\ 0 \leq α_{i} \leq C i = 1, 2, \dots, m, \end{matrix} \end{matrix}

(4)

where

α_{i}

represents the Lagrange multiplier. After solving the dual problem, the solution of the original problem can be obtained:

w = \sum_{i = 1}^{m} α_{i} y_{i} x_{i},

(5)

b = y_{j} - \sum_{i = 1}^{m} α_{i} y_{i} x_{i}^{T} x_{j}

(6)

SVM can be extended to a nonlinear classifier by introducing kernel functions. There are many commonly used kernel functions such as linear, polynomial, sigmoid, and radial basis function (RBF) [26]. Among them, the RBF function is the most widely used kernel function [27] because it has a solid ability to distinguish the non-linearly separable data. The formula of the RBF function is shown in Equation (7).

K (x_{i}, x_{j}) = exp (- \frac{{∥x_{i} - x_{j}∥}^{2}}{2 δ^{2}}), δ > 0,

(7)

where

δ

is the hyperparameter.

2.3. Wavelet Packet Decomposition

Wavelet packet decomposition combines wavelet transform and multi-resolution approximation. More detailed features are extracted as the signal is subdivided step-by-step [28]. After performing a wavelet packet decomposition of level N,

2^{N}

different waveform signals

D_{N j}, (j = 1, 2, \dots, 2^{N} - 1)

with low to high frequencies are generated. The energy of each band signal can be calculated by Equation (8).

E_{N j} = \int {|D_{N j} (t)|}^{2} d t = \sum_{k = 1}^{n} {|d_{j k}|}^{2},

(8)

where

d_{j k}

is the amplitude of the k point of the reconstructed signal

D_{N j}

, and n is the number of discrete points. The energy of each frequency band can be calculated from Equation (8), and the feature vector

S_{N}

can be constructed from these energy values as in Equation (9).

S_{N} = [E_{N 0}, E_{N 1}, \dots, E_{N j}, \dots, E_{N (2^{N} - 1)}]

(9)

S_{N}

can be normalized by the min-max normalization method to obtain the new feature vector

S_{N}^{^{'}}

.

S_{N}^{^{'}} = [E_{N 0}^{^{'}}, E_{N 1}^{^{'}}, \dots, E_{N j}^{^{'}}, \dots, E_{N (2^{N} - 1)}^{^{'}}]

(10)

Figure 2 shows an example of decomposition with 4-layer wavelet packets from the experimental data in this work.

3. Experiments

3.1. Dataset

In this paper, the vibration response signals were collected through an eight-layer steel frame structure; each layer was 35 cm long and 25 cm wide, with a height of 20 cm between the two layers. Anchor bolts were used to fasten the bottom of the frame to the ground, while double-row bolts were used to join the beam to the columns. The diagram of the frame is shown in Figure 3. White noise excitation was applied at the third layer of the frame, and eight acceleration sensors were installed at each level along the direction of external excitation to record the acceleration response. The white noise generator model was RIGOL DG-1022, an electromagnetic exciter was used as the actuator, and the type of white noise was pre-defined (Ex1-Ex10). The model structure’s steel material had an elasticity modulus of

E = 2.0 \times 10^{11}

Pa and a density of

ρ

= 7.8 × 10

^{3}

kg/m

^{3}

. Each column member was made of a

200 \times 30 \times 3

mm steel plate in a undamaged condition. The damage to the steel frame structure was achieved by reducing the stiffness of the steel plates (i.e., replacing the current plate with thinner ones:

200 \times 30 \times 2.5

mm). In the vibration experiments, the duration of each recording was 32 s, the sampling frequency was 128 Hz, and a record contained 4096 data points for one sensor. Ten damage states were set up for the experiments, and the damage locations and severity for the ten damage cases are shown in Table 1.

In Table 1, UD is the undamaged state, D1–D6 are the cases of single-layer damage, and D7-D9 are the cases of two-layer damage. Ten different white noise excitations (Ex1-Ex10) were applied for each damage case. For each noise effect, one record was gathered. A total of 100 data were collected under 10 kinds of damage and 10 kinds of noise. Each record contains the vibration response of eight sensors, with a sampling time of 32 s and a sampling frequency of 128 Hz, for a total of 32,768 data points. Figure 4 shows the structural vibration response for the UD and D1 cases.

3.2. Data Preprocessing

A data preprocessing operation was needed to change the original data into a more suitable form that meets the requirements of the model. In this work, preprocessing contains four parts: (1) eliminate offset; (2) min–max normalization; (3) data slicing; and (4) splitting the training and validation sets.

3.2.1. Offset Elimination

During vibration testing, sensors or acquisition devices are likely to offset due to their performance problems or environmental disturbances (e.g., temperature, power supply, etc.). The offset will directly affect the accuracy of signal analysis and should be eliminated. In this paper, the elimination of offset was achieved by subtracting the mean value from the samples. For the time series

X = {x_{1}, x_{2}, \dots, x_{n}}

, the formula for eliminating the offset is as follows:

\hat{X} = X - \frac{1}{n} \sum_{i = 1}^{n} x_{i}

(11)

3.2.2. Data Normalization

From Figure 4, it can be found that there are differences in the vibration magnitudes of different samples. To eliminate the differences in the magnitudes of different samples and improve the classification performance [29], a min–max normalization method [30] expressed by Equation (12) was used to normalize all sample magnitudes to the range of 0 to 1.

\hat{X} = \frac{X - min (X)}{max (X) - min (X)}

(12)

3.2.3. Data Slicing

In order to make full use of the data, this paper adopts a slicing approach to enhance the data information. For example, one original time series contained vibration responses recorded by eight sensors, and each vibration response contained 4096 data points. The slice length

N s

and the sliding step s were chosen to divide the sample into multiple slices, and each slice had the same class label as the original time series. Thus, by varying

N s

and s, different numbers of training samples can be obtained, and this method is particularly effective in the case of insufficient data samples. In this work, we set Ns = 1024, s = 100, and the data was expanded from 100 to 3000 using data slicing. Figure 5 shows the detailed method of data slicing.

3.2.4. Dataset Splitting

To validate the performance of the proposed model, the dataset is usually partitioned into training and test sets. During the model’s training, only data from the training set is used to train it, and when testing the model’s performance, the model is cross-validated using data from the test set that the model has never seen before. The process of k-fold cross-validation is one of the widespread cross-validation methods. The original sample is randomly partitioned into k equal sized subsamples in k-fold cross-validation, a single subsample from the k subsamples is retained as test data for evaluating model effectiveness, and the remaining

k - 1

subsamples are used as training data. The cross-validation process is then repeated k times, with each of the k subsamples supplied as test data exactly once. In this paper, k was set to five.

3.3. Baselines

We compared our model with the following baseline models:

SVM: The feature vector was obtained by four-layer wavelet packet decomposition, and then SVM was used to identify both the damage location and the damage severity.
1D-CNN: Using 1D-CNN to identify both damage location and damage severity, the structure of 1D-CNN was the same as the 1D-CNN used in the method proposed in this paper.
1D-CNN and1D-CNN: After identifying the damage location using a 1D-CNN, the damage severity was identified using another 1D-CNN. The structure of the two 1D-CNNs were consistent with the 1D-CNN in the method proposed in this paper.

3.4. CNN Configurations

The structure and details of the specific parameters of the 1D-CNN used in this paper are shown in Table 2.

3.5. Experimental Results

The proposed model was trained on a server equipped with an Intel Xeon Silver 420 (10) @ 2.194GHz CPU and an Nvidia Tesla V100 (32GB) GPU. The model was developed using the Python (version 3.7.13) programming language with the Python modules Keras (version 2.2.4) and Pycaret (version 2.3.10). We used Keras to build the 1D-CNN model and Pycaret to build the SVM model.

We adopted five-fold cross-validation to train and validate the models. The experimental results of the proposed method compared with other baseline methods are given in Table 3. From Table 3, we can see that the model performance of only using SVM combined with four-layer wavelet packet decomposition was the worst, with an average accuracy of 75.7%. This might be because the complexity of the problem to detect the damage locations and damage severity simultaneously exceeds/ed the learning ability of the model, and the informative features for classification were not well extracted. We can also find that the method using 1D-CNN worked better than the method using only SVM, which verifies the powerful feature extraction ability of 1D-CNN in this task.

Moreover, the two-stage approach 1D-CNN and 1D-CNN worked better than the other directly identification approach, which verifies the effectiveness of the idea of “divide-and-conquer”. In addition, the method using 1D-CNN and SVM achieved the best results, slightly higher than that of 1D-CNN and 1D-CNN. The specific experimental results of the two-stage methods are shown in Table 4. Since the 1D-CNN was used to identify the damage locations in the first stage in both two methods, their accuracies of detecting damage locations were very close, while in the second stage, the SVM was able to maintain a stable accuracy of 100%, which is better than that of 1D-CNN, indicating that SVM can achieve better results on the case of the small number of samples. In addition, deep learning-based methods need sufficient training samples; otherwise, they fall into an overfitting state and lower the generalization performance of the model.

Figure 6a,b show the confusion matrix of the 1D-CNN and 1D-CNN&SVM methods. It can be seen that when only 1D-CNN was used to identify the damage location and damage severity, the error was mainly concentrated on the case of different damage severity at the same damage location, such as three samples of D1 were identified as D2 and eight samples of D5 were identified as D4. The remaining six D9 samples were incorrectly identified as D6 (note that D9 represents the existence of damage in the fifth and seventh layers, and D6 represents the existence of damage in the seventh layer only), and the model only identified one of the damage locations. In contrast, in the case of using 1D-CNN and SVM, there were no erroneous samples within the confusion matrix, and the two-stage method based on 1D-CNN and SVM improved the performance of the model.

3.6. Further Comparison and Results Visualization

In this work, the training and testing speeds of each method were evaluated under the environment as mentioned in Section 3.5 and shown in Table 5. From Table 5, we can see that training the SVM model took much less time than that of the 1D-CNN model because training 1D-CNN requires more parameters and epochs. The time required for testing the SVM model and testing 1D-CNN model were both small, but the SVM model was still 33% faster than 1D-CNN. The training time required for the two-stage model 1D-CNN and SVM was 38.5 s, which was 10% faster than that of 1D-CNN and 1D-CNN which needed 43.2 s, and the testing time required for both methods was 1.2 s and 1.5 s, respectively.

We further adopted t-distributed stochastic neighbor embedding (T-SNE) to visualize the classification results with the proposed model. T-SNE is a nonlinear dimensionality reduction method that can reduce high-dimensional data to two or three dimensions for visualization [31]. The T-SNE results of the test samples before and after classification are shown in Figure 7a,b, respectively. Different colors represent different types of damage, and it can be found that the sample distribution was chaotic and nearly indistinguishable before classification; however, after classification, samples of different damage types were clustered together separately, indicating that the method proposed in this paper has a strong classification ability for structural vibration response samples.

4. Discussion

The health of buildings is significant for the safety of human life. Damage detection by structural damage response is a relatively popular method in SHM. With the development of machine learning and deep learning, more and more methods are being applied to this field. Machine learning requires fewer training samples and low computational cost but requires manually designed complex feature extraction methods. Deep learning can automatically extract feature, but has a high computational cost and requires more training samples.

We propose a two-stage structural damage response method based on 1D-CNN and SVM by analyzing the vibration damage response of an eight-layer steel framework. The 1D-CNN is used in the first stage to detect the damage location, and the SVM is used in the second stage to detect the damage severity, which fully combines the advantages of 1D-CNN and SVM to achieve better damage detection with less computational cost and simpler feature extraction methods and is meaningful for the application of SHM.

In this paper, we compare our proposed method with several other methods. From Table 3, it can be found that, in general, the two-stage damage detection methods work better than the single-stage damage detection method. Among the single-stage damage detection methods, 1D-CNN is much better than SVM. On the one hand, it is because 1D-CNN has an extremely strong feature extraction ability, and on the other hand, it is because we have not designed a feature extraction method for SVM that combines expertise, but directly uses wavelet packet decomposition as the feature extraction method, resulting in a large difference between the two effects. In Lei et al. [11], the authors achieved extremely high accuracy by using variance, regression coefficients, and cross-correlation function amplitude as features, followed by SVM for classification. This suggests that better results can be achieved with SVM if the designed feature extraction method is good enough, but this requires strong expertise, and the designed feature extraction method may not be applicable to other structures. In contrast, 1D-CNN only uses a simpler structure, which is good at automatically extracting features and achieving a high accuracy rate and is applicable to a variety of structures.

Among the two-stage methods, the proposed method in this paper is slightly better than 1D-CNN and 1D-CNN. It can be seen from Table 4 that the two methods are close to each other in detecting the damage location because the first stage of both methods is same. The main difference is in the second stage of damage severity detection. To further probe the reason, we compared the confusion matrix of 1D-CNN with 1D-CNN and SVM. From Figure 6a, we can find that the samples incorrectly identified by 1D-CNN were all samples with the same damage location and different damage severity. On the one hand, this is because the change of damage response caused by the change of damage severity was small, and on the other hand, it is because the number of samples with different damage severity at the same damage location was small, which was not enough for 1D-CNN to learn sufficient information. While SVM uses the wavelet packet decomposition of the damage response as the feature vector, which is a classification problem with high-dimensional small samples, and is well suited to be solved by using SVM, 1D-CNN and SVM have indeed achieved better results.

There are not many studies on multi-level damage detection. Shao et al. [32] proposed a multilevel damage classification method based on Lamb wave and transfer learning. They divided the damage detection into three levels, which detected the existence, location, and size of damage. In future work, we will consider adding a stage to detect the existence of damage. Their method used 1D-CNN for damage detection at all three levels, and although their method achieved high accuracy, it also takes more time to train. They also realized this problem, so they used the transfer learning method to share part of the structure and weights of the 1D-CNN in all three levels, which makes the training faster and saves more time. If they consider using SVM to detect the size of the damage, they may be able to save more time while ensuring the accuracy is not reduced.

Our study also has many limitations. In real-world applications, the location and severity of damage are continuous, whereas the experiments in this paper have only limited classes of damage locations and severity and use a classification model rather than a regression model, which makes the method in this paper unable to predict the type of damage outside the dataset.

In future work, we intend to replace the classification model by employing a regression model and designing more types of damage locations and damage severity so that the model can accurately discriminate between types of damage outside the dataset.

5. Conclusions

In this paper, a two-stage structural damage detection method based on 1D-CNN and SVM is proposed. It is still challenging to detect the damage of a structure accurately based on the vibration response of the structure. To solve the problem that traditional machine learning methods need to design feature extraction methods by manually combining expert knowledge, this paper uses 1D-CNN to automatically extract rich features from vibration responses. To solve the problem that the number of samples with different severity of damage at the same damage location is small and 1D-CNN cannot distinguish these samples well, this paper uses SVM combined with wavelet packet decomposition to achieve the accurate identification of these samples. Experiments were conducted on an eight-layer steel frame, and damage responses were collected for ten damage cases. After preprocessing operations such as offset elimination, normalization, and slicing, the damage locations corresponding to the samples were first determined by 1D-CNN, and then the damage severity corresponding to the samples was determined by SVM. In the comparison experiments with other methods, the method proposed in this paper achieved the best results, taking into account the operation speed and recognition effect.

However, the method in this paper still has some limitations, as it can only determine predefined damage cases and cannot determine continuous damage locations or damage severity. In future research, we intend to use deep learning-based regression methods to predict continuous damage and combine expert knowledge to improve detection performance. In addition, the data are provided by other labs, and we do not have permission to share the data. The code is released at https://github.com/jch12138/two-stage-structure-damage-detection (accessed on 30 August 2022).

Author Contributions

Data curation, J.L. and X.W.; methodology, C.J. and Q.Z.; software, C.J.; supervision, Q.Z. and J.L.; visualization, X.W.; writing—original draft, C.J.; writing—review & editing, Q.Z. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by China Natural Science Foundation under grant No. 62171391. Shaorong Fang and Tianfu Wu from Information and Network Center of Xiamen University are acknowledged for the help with the GPU computing.

Institutional Review Board Statement

Not applicable, this research not involving humans or animals.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

Avci, O.; Abdeljaber, O.; Kiranyaz, S.; Hussein, M.; Gabbouj, M.; Inman, D.J. A review of vibration-based damage detection in civil structures: From traditional methods to machine learning and deep learning applications. Mech. Syst. Signal Process. 2021, 147, 107077. [Google Scholar] [CrossRef]
Wahab, M.M.A.; Roeck, G.D. Damage detection in bridges using modal curvatures: Application to a real damage scenario. J. Sound Vib. 1999, 226, 217–235. [Google Scholar] [CrossRef]
Park, S.; Yun, C.; Roh, Y. Jong-jae leepzt-based active damage detection techniques for steel bridge component smart mater. Structure 2006, 15, 957–966. [Google Scholar]
Farrar, C.R.; Doebling, S.W.; Nix, D.A. Vibration–based structural damage identification. Philos. Trans. R. Soc. London. Series Math. Phys. Eng. Sci. 2001, 359, 131–149. [Google Scholar] [CrossRef]
Zhang, Y.; Miyamori, Y.; Mikami, S.; Saito, T. Vibration-based structural state identification by a 1-dimensional convolutional neural network. Comput. -Aided Civ. Infrastruct. Eng. 2019, 34, 822–839. [Google Scholar] [CrossRef]
Chesne, S.; Deraemaeker, A. Damage localization using transmissibility functions: A critical review. Mech. Syst. Signal Process. 2013, 38, 569–584. [Google Scholar] [CrossRef] [Green Version]
Amezquita-Sanchez, J.P.; Adeli, H. Signal processing techniques for vibration-based health monitoring of smart structures. Arch. Comput. Methods Eng. 2016, 23, 1–15. [Google Scholar] [CrossRef]
Meruane, V.; Heylen, W. An hybrid real genetic algorithm to detect structural damage using modal properties. Mech. Syst. Signal Process. 2011, 25, 1559–1573. [Google Scholar] [CrossRef] [Green Version]
Adeli, H.; Jiang, X. Intelligent Infrastructure: Neural Networks, Wavelets, and Chaos Theory for Intelligent Transportation Systems and Smart Structures; CRC Press: Boca Raton, FL, USA, 2008. [Google Scholar]
Wu, R.; Jahanshahi, M.R. Data fusion approaches for structural health monitoring and system identification: Past, present, and future. Struct. Health Monit. 2020, 19, 552–586. [Google Scholar] [CrossRef]
Lei, J.; Cui, Y.; Shi, W. Structural damage identification method based on vibration statistical indicators and support vector machine. Adv. Struct. Eng. 2022, 25, 1310–1322. [Google Scholar] [CrossRef]
Diao, Y.; Jia, D.; Liu, G.; Sun, Z.; Xu, J. Structural damage identification using modified hilbert–huang transform and support vector machine. J. Civ. Struct. Health Monit. 2021, 11, 1155–1174. [Google Scholar] [CrossRef]
Zhang, X.; Han, P.; Xu, L.; Zhang, F.; Wang, Y.; Gao, L. Research on bearing fault diagnosis of wind turbine gearbox based on 1dcnn-pso-svm. IEEE Access 2020, 8, 192248–192258. [Google Scholar] [CrossRef]
Lin, Y.; Nie, Z.; Ma, H. Structural damage detection with automatic feature-extraction through deep learning. Comput. -Aided Civ. Infrastruct. Eng. 2017, 32, 1025–1046. [Google Scholar] [CrossRef]
Wang, X.; Shahzad, M.M.; MomanShahzad, M. A novel structural damage identification scheme based on deep learning framework. Structures 2021, 29, 1537–1549. [Google Scholar] [CrossRef]
Xiao, H.; Wang, W.; Dong, L.; Ogai, H. A novel bridge damage diagnosis algorithm based on deep learning with gray relational analysis for intelligent bridge monitoring system. IEEJ Trans. Electr. Electron. Eng. 2021, 16, 730–742. [Google Scholar] [CrossRef]
Huang, L.; He, H.X.; Wang, W. Intelligent recognition of bridge damage based on convolutional neural networks and recursive graphs. J. Basic Sci. Eng. 2020. [Google Scholar] [CrossRef]
Awad, M.; Khanna, R. Support vector machines for classification. In Efficient Learning Machines; Springer: Berlin/Heidelberg, Germany, 2015; pp. 39–66. [Google Scholar]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
Nair, V.; Hinton, G.E. Rectified Linear Units Improve Restricted Boltzmann Machines. Lcml. 2010. Available online: http://www.csri.utoronto.ca/~hinton/absps/reluICML.pdf (accessed on 30 August 2022).
Tabian, I.; Fu, H.; Khodaei, Z.S. A convolutional neural network for impact detection and characterization of complex composite structures. Sensors 2019, 19, 4933. [Google Scholar] [CrossRef] [Green Version]
Lin, M.; Chen, Q.; Yan, S. Network in network. arXiv 2013, arXiv:1312.4400. [Google Scholar]
Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
Boser, B.E.; Guyon, I.M.; Vapnik, V.N. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PA, USA, 27–29 July 1992; pp. 144–152. [Google Scholar]
Wolpert, D.H.; Macready, W.G. No Free Lunch Theorems for Search; Technical Report, Technical Report SFI-TR-95-02-010; Santa Fe Institute: Santa Fe, NM, USA, 1995. [Google Scholar]
Chang, C. Libsvm: A Library for Support Vector Machines. 2001. Available online: http://www.csie.ntu.edu.tw/cjlin/libsvm (accessed on 30 August 2022).
Keerthi, S.S.; Lin, C. Asymptotic behaviors of support vector machines with gaussian kernel. Neural Comput. 2003, 15, 1667–1689. [Google Scholar] [CrossRef] [Green Version]
Kim, H.; Melhem, H. Damage detection of structures by wavelet analysis. Eng. Struct. 2004, 26, 347–362. [Google Scholar] [CrossRef]
Jahan, A.; Edwards, K.L. A state-of-the-art survey on the influence of normalization techniques in ranking: Improving the materials selection process in engineering design. Mater. Des. (1980–2015) 2015, 65, 335–342. [Google Scholar] [CrossRef]
Patro, S.; Sahu, K.K. Normalization: A preprocessing stage. arXiv 2015, arXiv:1503.06462. [Google Scholar] [CrossRef]
der Maaten, L.V.; Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
Shao, W.; Sun, H.; Wang, Y.; Qing, X. A multi-level damage classification technique of aircraft plate structures using lamb wave-based deep transfer learning network. Smart Mater. Struct. 2022, 31, 075019. [Google Scholar] [CrossRef]

Figure 1. An overview of our proposed framework.

Figure 2. The feature vectors obtained by wavelet packet decomposition under the different damage cases. The 4 damage case are represented by D1–D4.

Figure 3. Eight-layer steel frame diagram.

Figure 4. Examples of two kinds of damage data.

Figure 5. The illustration of data slicing.

Figure 6. Confusion matrixes of 1D-CNN and 1D-CNN and SVM methods.

Figure 7. Visualization results of the test samples before and after classification.

Table 1. Description of different damage cases.

Case	Location	Decreased Stiffness (%)
UD	-	0
D1	3	8.3
D2	3	16.7
D3	5	8.3
D4	5	16.7
D5	7	8.3
D6	7	16.7
D7	3 & 5	8.3 (both layers)
D8	3 & 7	8.3 (both layers)
D9	5 & 7	8.3 (both layers)

Table 2. Configuration of the 1D-CNN used in this paper.

Layer	Output Shape	Parameter	Activation	Variables
Input	1024 × 8	None	None	0
Convolution 1-D	1021 × 8	Kernel number: 4; Kernel size: 8 × 8;	ReLU	264
Convolution 1-D	1014 × 16	Kernel number: 8; Kernel size: 16 × 8;	ReLU	1040
Max Pooling 1-D	507 × 16	Kernel number: 2;	None	0
Convolution 1-D	500 × 16	Kernel number: 8; Kernel size: 16 × 8;	ReLU	2064
Global Average Pooling 1-D	16	None	None	0
Dropout	16	None	None	0
Dense	7	None	Softmax	119
Total parameters				3487

Table 3. The 5-fold cross-validation results of the 4 methods on the test set.

	SVM	1D-CNN	1D-CNN&1D-CNN	1D-CNN&SVM
Fold 1	0.75	0.9718	0.9833	0.9966
Fold 2	0.6964	0.9364	0.9921	1.0
Fold 3	0.7143	0.9833	0.9845	0.9983
Fold 4	0.8036	0.9718	0.9743	1.0
Fold 5	0.8214	0.9645	0.9874	0.9989

Table 4. Comparison of 1D-CNN and 1D-CNN with 1D-CNN and SVM in two-stage classification performance.

	1D-CNN&1D-CNN		1D-CNN&SVM
	Location	Severity	Location	Severity
Fold 1	0.9989	0.9743	0.9984	1.0
Fold 2	1.0	0.9734	1.0	1.0
Fold 3	0.9968	0.9876	0.9991	1.0
Fold 4	0.9937	0.9804	1.0	1.0
Fold 5	1.0	0.9856	0.9994	1.0

Table 5. The time required for training and testing of the 4 methods.

	SVM	1D-CNN	1D-CNN&1D-CNN	1D-CNN&SVM
Train	9.1 s	31.5 s	43.2 s	38.5 s
Test	0.6 s	0.9 s	1.5 s	1.2 s

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, C.; Zhou, Q.; Lei, J.; Wang, X. A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM. Appl. Sci. 2022, 12, 10394. https://doi.org/10.3390/app122010394

AMA Style

Jiang C, Zhou Q, Lei J, Wang X. A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM. Applied Sciences. 2022; 12(20):10394. https://doi.org/10.3390/app122010394

Chicago/Turabian Style

Jiang, Chenhui, Qifeng Zhou, Jiayan Lei, and Xinhong Wang. 2022. "A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM" Applied Sciences 12, no. 20: 10394. https://doi.org/10.3390/app122010394

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Two-Stage Structural Damage Detection Method Based on 1D-CNN and SVM

Abstract

1. Introduction

2. Methods

2.1. 1D-CNN

2.1.1. Convolutional Layer

2.1.2. Pooling Layer

2.1.3. Droput Layer

2.1.4. Full Connected Layer

2.2. SVM

2.3. Wavelet Packet Decomposition

3. Experiments

3.1. Dataset

3.2. Data Preprocessing

3.2.1. Offset Elimination

3.2.2. Data Normalization

3.2.3. Data Slicing

3.2.4. Dataset Splitting

3.3. Baselines

3.4. CNN Configurations

3.5. Experimental Results

3.6. Further Comparison and Results Visualization

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI