A Combination of Dilated Self-Attention Capsule Networks and Bidirectional Long- and Short-Term Memory Networks for Vibration Signal Denoising
Abstract
:1. Introduction
2. Preliminaries
2.1. Signal Denoising
2.2. Dilated Convolution
2.3. Self-Attention
2.4. Spatial Features Captured by CapsNet
2.5. BiLSTM
3. DACapsNet–BiLSTM Network for Signal Denoising
3.1. Data Preprocessing
3.2. DACapsNet–BiLSTM Model
3.2.1. Framework
3.2.2. Proposed Model
Dilated Self-Attention Convolution Network
Algorithm 1. Dynamic Routing Algorithm [21] |
1 procedure ROUNTING() 2 Initialize the coupling coefficients: 3 for iterations do 4 for all capsule in layer : 5 for all capsule in layer : 6 for all capsule in layer : 7 for all : 8 return |
Temporal Features Captured by BiLSTM
3.3. ReLu Activation Function
3.4. Performance Evaluation
4. Experimental Results
4.1. Experimental Environment
4.2. Pre-Processing
4.3. Evaluation Indicators
4.4. Simulation Experiment
4.5. Experimental Analysis and Engineering Applications
Locomotive Bearing Vibration Signal Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Shen, S.; Lu, H.; Sadoughi, M.; Hu, C.; Kenny, S. A physics-informed deep learning approach for bearing fault detection. Eng. Appl. Artif. Intell. 2021, 103, 104295. [Google Scholar] [CrossRef]
- Gao, S.; Ren, Y.; Zhang, Y.; Li, T. Fault diagnosis of rolling bearings based on improved energy entropy and fault location of triangulation of amplitude attenuation outer raceway. Measurement 2021, 185, 109974. [Google Scholar] [CrossRef]
- Jang, Y.; Sim, J.; Yang, J.R.; Kwon, N. The Optimal Selection of Mother Wavelet Function and Decomposition Level for Denoising of DCG Signal. Sensors 2021, 21, 1851. [Google Scholar] [CrossRef] [PubMed]
- Chen, W.; Li, J.; Wang, Q.; Han, K. Fault feature extraction and diagnosis of rolling bearings based on wavelet thresholding denoising with ceemdan energy entropy and pso-lssvm. Measurement 2020, 172, 108901. [Google Scholar] [CrossRef]
- Chen, X.; Yang, Y.; Cui, Z.; Shen, J. Wavelet denoising for the vibration signals of wind turbines based on variational mode decomposition and multiscale permutation entropy. IEEE Access 2020, 8, 40347–40356. [Google Scholar] [CrossRef]
- Geng, H.; Peng, Y.; Ye, L.; Guo, Y. Fault identification of rolling bearing with variable speed based on generalized broadband mode decomposition and distance evaluation technique. Digit. Signal Processing 2022, 129, 103662. [Google Scholar] [CrossRef]
- Gao, Z.; Liu, Y.; Wang, Q.; Wang, J.; Luo, Y. Ensemble empirical mode decomposition energy moment entropy and enhanced long short-term memory for early fault prediction of bearing. Measurement 2022, 188, 110417. [Google Scholar] [CrossRef]
- Bhavsar, K.; Vakharia, V.; Chaudhari, R.; Vora, J.; Pimenov, D.Y.; Giasin, K. A Comparatove Study to Predict Bearing Degradation Using Discrete Wavelet Transform (DWT), Tabular Generative Adversarial Networks (TGAN) and Machine Learning Models. Machines 2022, 10, 176. [Google Scholar] [CrossRef]
- Sharma, S.; Tiwari, S.K.; Singh, S. The rotary machine fault detection by hybrid method based on local mean decomposition and fluctuation based dispersion entropy. J. Sound Vib. 2021, 43, 700–705. [Google Scholar] [CrossRef]
- Liu, B.; Li, Y. Ecg signal denoising based on similar segments cooperative filtering. Biomed. Signal Processing Control 2021, 68, 102751. [Google Scholar] [CrossRef]
- Chandra, M.; Goel, P.; Ankita, A.; Kar, A. Design and analysis of improved high-speed adaptive filter architectures for ecg signal denoising. Biomed. Signal Process. Control 2021, 63, 102221. [Google Scholar] [CrossRef]
- Smith, J. The local mean decomposition and its application to EEG perception data. J. R. Soc. Interface 2005, 2, 443–454. [Google Scholar] [CrossRef] [PubMed]
- Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Processing 2014, 62, 531–544. [Google Scholar] [CrossRef]
- Wang, C.; Xu, Z. An intelligent fault diagnosis model based on deep neural network for few-shot fault diagnosis. Neurocomputing 2021, 456, 550–562. [Google Scholar] [CrossRef]
- Wang, K.; Guo, Y.; Zhao, W.; Zhou, Q.; Guo, P. Gas path fault detection and isolation for aero-engine based on LSTM-DAE approach under multiple-model architecture. Measurement 2022, 202, 111875. [Google Scholar] [CrossRef]
- Han, H.; Wang, H.; Liu, Z.; Wang, J. Intelligent vibration signal denoising method based on non-local fully convolutional neural network for rolling bearings. ISA Trans. 2021, 122, 13–23. [Google Scholar] [CrossRef]
- Jain, V.; Seung, H.S. Natural Image Denoising with Convolutional Networks. In Proceedings of the International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 8–10 December 2008; pp. 769–776. [Google Scholar]
- Lou, S.; Deng, J.; Lyu, S. Chaotic signal denoising based on simplified convolutional denoising auto-encoder. Chaos Solitons Fractals 2022, 161, 112333. [Google Scholar] [CrossRef]
- Fan, G.; Li, J.; Hao, H. Vibration signal denoising for structural health monitoring by residual convolutional neural networks. Measurement 2020, 157, 107651. [Google Scholar] [CrossRef]
- Yu, F.; Koltun, V. Multi-Scale Context Aggregation by Dilated Convolutions. In Proceedings of the 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, 4–6 May 2016. [Google Scholar] [CrossRef]
- Kumar, P.; Hati, A.S. Dilated convolutional neural network based model for bearing faults and broken rotor bar detection in squirrel cage induction motors. Expert Syst. Appl. 2022, 191, 116290. [Google Scholar] [CrossRef]
- Chu, C.; Ge, Y.; Qian, Q.; Hua, B.; Guo, J. A novel multi-scale convolution model based on multi-dilation rates and multi-attention mechanism for mechanical fault diagnosis. Digit. Signal Process. 2021, 122, 103355. [Google Scholar] [CrossRef]
- Wu, H.; Zhang, B.; Liu, N. Self-adaptive denoising net: Self-supervised learning for seismic migration artifacts and random noise attenuation. J. Pet. Sci. Eng. 2022, 214, 110431. [Google Scholar] [CrossRef]
- Wang, H.; Liu, Z.; Peng, D.; Cheng, Z. Attention-guided joint learning CNN with noise robustness for bearing fault diagnosis and vibration signal denoising. ISA Trans. 2021, 128, 470–484. [Google Scholar] [CrossRef]
- Fu, E.; Zhang, Y.; Yang, F.; Wang, S. Temporal self-attention-based Conv-LSTM network for multivariate time series prediction. Neurocomputing 2022, 501, 162–173. [Google Scholar] [CrossRef]
- Zhong, H.; Lv, Y.; Yuan, R.; Yang, D. Bearing fault diagnosis using transfer learning and self-attention ensemble lightweight convolutional neural network. Neurocomputing 2020, 501, 765–777. [Google Scholar] [CrossRef]
- Sabour, S.; Frosst, N.; Hinton, G.E. Dynamic routing between capsules. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 3859–3869. [Google Scholar] [CrossRef]
- Zheng, L.; Zhou, B.; Siu, W.O.; Cao, Y.; Wang, H.; Li, Y.; Chan, K.W. Spatio-temporal wind speed prediction of multiple wind farms using capsule network. Renew. Energy 2021, 175, 718–730. [Google Scholar] [CrossRef]
- Dong, J.; Wang, N.; Fang, H.; Wu, R.; Zheng, C.; Ma, D.; Hu, H. Automatic damage segmentation in pavement videos by fusing similar feature extraction siamese network (SFE-SNet) and pavement damage segmentation capsule network (PDS-CapsNet). Autom. Constr. 2022, 143, 104537. [Google Scholar] [CrossRef]
- Karim, F.; Shah, M.A.; Khattak, H.A.; Ameer, Z.; Shoaib, U.; Rauf, H.T.; AI-Turjman, F. Towards an effective model for lung disease classification: Using Dense Capsule Nets for early classification of lung diseases. Appl. Soft Comput. 2022, 124, 109077. [Google Scholar] [CrossRef]
- Lin, Z.; Jia, J.; Huang, F.; Gao, W. A coarse-to-fine capsule network for fine-grained image categorization. Neurocomputing 2021, 456, 200–219. [Google Scholar] [CrossRef]
- Ma, X.; Zhong, H.; Li, Y.; Ma, J.; Wang, Y. Forecasting transportation network speed using deep capsule networks with nested lstm models. IEEE Trans. Intell. Transp. Syst. 2020, 22, 4813–4824. [Google Scholar] [CrossRef]
- Ye, X.; Yan, J.; Wang, Y.; Wang, J.; Geng, Y. A novel U-Net and capsule network for few-shot high-voltage circuit breaker mechanical fault diagnosis. Measurement 2022, 199, 111527. [Google Scholar] [CrossRef]
- Liu, J.; Zhang, C.; Jiang, X. Imbalanced fault diagnosis of rolling bearing using improved MsR-GAN and feature enhancement-driven CapsNet. Mech. Syst. Signal Process. 2022, 168, 108664. [Google Scholar] [CrossRef]
- Long, J.; Qin, Y.; Yang, Z.; Huang, Y.; Li, C. Discriminative feature learning using a multiscale convolutional capsule network from attitude data for fault diagnosis of industrial robots. Mech. Syst. Signal Process. 2022, 182, 109569. [Google Scholar] [CrossRef]
- Han, T.; Ma, R.; Zheng, J. Combination bidirectional long short-term memory and capsule network for rotating machinery fault diagnosis. Measurement 2021, 176, 109208. [Google Scholar] [CrossRef]
- Graves, A.; Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 2005, 18, 602–610. [Google Scholar] [CrossRef]
- Han, L.; Deng, Y.; Chen, H.; Wei, G.; Sheng, K.; Shi, J. A robust VRF fault diagnosis method based on ensemble BiLSTM with attention mechanism:Considering uncertainties and generalization. Energy Build. 2022, 269, 112243. [Google Scholar] [CrossRef]
- Cui, X.; Chen, Z.; Yin, F. Multi-objective based multi-channel speech enhancement with bilstm network. Appl. Acoust. 2021, 177, 107927. [Google Scholar] [CrossRef]
- Shi, J.; Peng, D.; Peng, Z.; Zhang, Z.; Goebel, K.; Wu, D. Planetary gearbox fault diagnosis using bidirectional-convolutional lstm networks. Mech. Syst. Signal Process. 2022, 162, 107996. [Google Scholar] [CrossRef]
- Wang, Y.; Cheng, L. A combination of residual and long-short-term memory networks for bearing fault diagnosis based on time-series model analysis. Meas. Sci. Technol. 2021, 32, 015904. [Google Scholar] [CrossRef]
- Rafiee, J.; Rafiee, M.A.; Tse, P.W. Application of mother wavelet functions for automatic gear and bearing fault diagnosis. Expert Syst. Appl. 2010, 37, 4568–4579. [Google Scholar] [CrossRef]
0.3 | 700 |
Parameters | Output Size | |||||
---|---|---|---|---|---|---|
Kernel Size | Channel | Rate | Stride | Capsule Dimension | ||
Input | / | / | / | / | / | 1 × 2000 |
Reshape | / | / | / | / | / | 50 × 40 × 1 |
Conv 1 | 3 × 3 | 32 | 1 | 1 | / | 50 × 40 × 32 |
Conv 2 | 3 × 3 | 32 | 2 | 1 | / | 50 × 40 × 32 |
Conv 3 | 3 × 3 | 32 | 5 | 1 | / | 50 × 40 × 32 |
Improved Self-Attention1 | / | / | / | / | / | 50 × 40 × 32 |
Conv 4 | 3 × 3 | 32 | 1 | 1 | / | 50 × 40 × 32 |
Conv 5 | 3 × 3 | 32 | 2 | 1 | / | 50 × 40× 32 |
Conv 6 | 3 × 3 | 32 | 5 | 1 | / | 50 × 40× 32 |
Improved Self-Attention2 | / | / | / | / | / | 50 × 40× 32 |
Primary capsule | 5 × 5 | 32 | / | 2 | 8 | 17 × 12 × 32 × 8 |
Digital capsule | / | / | / | / | 16 | 10 × 16 |
FC | / | / | / | / | / | 1 × 2000 |
BiLSTM | / | / | / | / | / | 1 × 2000 |
Output | / | / | / | / | / | 1 × 2000 |
Noise SNR (dB) | Evaluation Metrics | Original Signal | Traditional Method | Deep Learning Methods | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WT | EMD | CNN | CapsNet | LSTM | Vanilla LSTM | Stacked LSTM | BiLSTM | BLC–CNN | ResNet–LSTM | DACapsNet–BiLSTM | |||
−10 | SNR | −10.0008 | 0.1748 | −2.3971 | 0.2665 | 0.2783 | −0.0088 | 0.0021 | 0.0983 | 0.1024 | 0.3062 | 0.2794 | 0.3261 |
MSE | 0.5873 | 0.0564 | 0.1020 | 0.0552 | 0.0542 | 0.0615 | 0.0601 | 0.0562 | 0.0545 | 0.0537 | 0.0541 | 0.0532 | |
MAE | 0.6116 | 0.1681 | 0.2410 | 0.1672 | 0.1655 | 0.1748 | 0.1732 | 0.1679 | 0.1659 | 0.1649 | 0.1652 | 0.1646 | |
−5 | SNR | −4.9975 | 1.3244 | 0.0161 | 1.0340 | 1.5813 | 1.4953 | 1.4836 | 1.8125 | 2.2774 | 2.7766 | 1.9263 | 3.2113 |
MSE | 0.1855 | 0.0432 | 0.0585 | 0.0482 | 0.0370 | 0.0395 | 0.0398 | 0.0364 | 0.0351 | 0.0335 | 0.0361 | 0.0329 | |
MAE | 0.3436 | 0.1513 | 0.1645 | 0.1610 | 0.1439 | 0.1479 | 0.1481 | 0.1432 | 0.1427 | 0.1421 | 0.1438 | 0.1412 | |
0 | SNR | −0.0008 | 2.8428 | 2.9396 | 3.3862 | 3.6590 | 3.4550 | 3.5283 | 3.8124 | 4.0892 | 4.8326 | 3.8714 | 5.2637 |
MSE | 0.0587 | 0.0305 | 0.0298 | 0.0286 | 0.0220 | 0.0256 | 0.0248 | 0.0201 | 0.0167 | 0.0163 | 0.0208 | 0.0162 | |
MAE | 0.1933 | 0.1311 | 0.1301 | 0.1293 | 0.1184 | 0.1252 | 0.1236 | 0.1173 | 0.1134 | 0.1102 | 0.1161 | 0.1094 | |
5 | SNR | 4.9978 | 6.9163 | 8.9529 | 9.4165 | 9.6525 | 9.4891 | 9.4893 | 9.7962 | 11.4734 | 11.9966 | 10.1771 | 12.6432 |
MSE | 0.0185 | 0.0119 | 0.0074 | 0.0068 | 0.0054 | 0.0058 | 0.0058 | 0.0053 | 0.0040 | 0.0037 | 0.0046 | 0.0032 | |
MAE | 0.1087 | 0.0870 | 0.0688 | 0.0673 | 0.0649 | 0.0663 | 0.0663 | 0.0642 | 0.0621 | 0.0619 | 0.0641 | 0.0619 | |
10 | SNR | 9.9906 | 11.3013 | 12.0532 | 13.4246 | 13.6537 | 13.4909 | 13.5238 | 13.5976 | 13.6255 | 14.4083 | 14.0801 | 14.4862 |
MSE | 0.0058 | 0.0039 | 0.0036 | 0.0033 | 0.0030 | 0.0032 | 0.0032 | 0.0031 | 0.0031 | 0.0029 | 0.0030 | 0.0029 | |
MAE | 0.0611 | 0.0530 | 0.0477 | 0.0470 | 0.0467 | 0.0469 | 0.0468 | 0.0466 | 0.0467 | 0.0462 | 0.0464 | 0.0461 |
Model | Inner Diameter | Outer Diameter | Roller Diameter | Number of Rollers | (Degree) |
---|---|---|---|---|---|
552732QT | 160 (mm) | 290 (mm) | 34 (mm) | 17 | 0 |
Noise SNR (dB) | Evaluation Metrics | Original Signal | Traditional Method | Deep Learning Methods | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WT | EMD | CNN | CapsNet | LSTM | BiLSTM | Vanilla LSTM | Stacked LSTM | BLC–CNN | ResNet–LSTM | DACapsNet–BiLSTM | |||
−10 | SNR | −9.9987 | 0.0384 | −1.6465 | 0.0946 | 0.2464 | −0.5444 | 0.1475 | 0.1511 | 0.2387 | 0.2639 | 0.2511 | 0.3092 |
MSE | 3.7104 | 0.3678 | 0.5422 | 0.3421 | 0.3311 | 0.3689 | 0.3358 | 0.3328 | 0.3298 | 0.3291 | 0.3302 | 0.3283 | |
MAE | 1.5370 | 0.4074 | 0.5373 | 0.3933 | 0.3898 | 0.4052 | 0.3917 | 0.3913 | 0.3902 | 0.3899 | 0.3901 | 0.3892 | |
−5 | SNR | −4.9999 | 1.1645 | 1.9972 | 2.2323 | 2.3944 | 2.1180 | 2.2965 | 2.2864 | 2.4362 | 2.5466 | 2.4133 | 3.0625 |
MSE | 1.1737 | 0.2838 | 0.2343 | 0.2324 | 0.2256 | 0.2330 | 0.2303 | 0.2311 | 0.2247 | 0.2246 | 0.2252 | 0.2238 | |
MAE | 0.8644 | 0.3978 | 0.3862 | 0.3840 | 0.3799 | 0.3851 | 0.3796 | 0.3804 | 0.3741 | 0.3713 | 0.3762 | 0.3635 | |
0 | SNR | −0.0012 | 2.5660 | 2.6498 | 3.2902 | 3.4103 | 3.3522 | 3.8473 | 3.6176 | 3.8735 | 4.3613 | 3.7358 | 5.1271 |
MSE | 0.3712 | 0.2055 | 0.1958 | 0.1797 | 0.1743 | 0.1749 | 0.1724 | 0.1733 | 0.1698 | 0.1659 | 0.1711 | 0.1515 | |
MAE | 0.4862 | 0.3266 | 0.3249 | 0.3228 | 0.3195 | 0.3206 | 0.3182 | 0.3197 | 0.3171 | 0.3169 | 0.3183 | 0.3162 | |
5 | SNR | 5.0004 | 7.3964 | 8.5021 | 9.4298 | 9.6120 | 9.4749 | 10.0173 | 10.0972 | 10.5764 | 11.4395 | 10.4284 | 12.2110 |
MSE | 0.1173 | 0.0675 | 0.0524 | 0.0454 | 0.0411 | 0.0448 | 0.0378 | 0.0369 | 0.0355 | 0.0342 | 0.0388 | 0.0328 | |
MAE | 0.2733 | 0.2031 | 0.1826 | 0.1779 | 0.1728 | 0.1753 | 0.1703 | 0.1701 | 0.1693 | 0.1686 | 0.1697 | 0.1682 | |
10 | SNR | 9.9976 | 12.6933 | 12.9967 | 13.0822 | 13.2107 | 13.5083 | 14.0711 | 14.0773 | 14.0791 | 14.2370 | 14.2237 | 14.3027 |
MSE | 0.0371 | 0.0199 | 0.0186 | 0.0178 | 0.0173 | 0.0160 | 0.0147 | 0.0147 | 0.0146 | 0.0145 | 0.0146 | 0.0145 | |
MAE | 0.1537 | 0.1101 | 0.1088 | 0.1072 | 0.1065 | 0.1061 | 0.1053 | 0.1052 | 0.1052 | 0.1049 | 0.1051 | 0.1047 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Y.; Cao, G.; Han, J. A Combination of Dilated Self-Attention Capsule Networks and Bidirectional Long- and Short-Term Memory Networks for Vibration Signal Denoising. Machines 2022, 10, 840. https://doi.org/10.3390/machines10100840
Wang Y, Cao G, Han J. A Combination of Dilated Self-Attention Capsule Networks and Bidirectional Long- and Short-Term Memory Networks for Vibration Signal Denoising. Machines. 2022; 10(10):840. https://doi.org/10.3390/machines10100840
Chicago/Turabian StyleWang, Youming, Gongqing Cao, and Jiali Han. 2022. "A Combination of Dilated Self-Attention Capsule Networks and Bidirectional Long- and Short-Term Memory Networks for Vibration Signal Denoising" Machines 10, no. 10: 840. https://doi.org/10.3390/machines10100840