YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Zhao, Shilong; Li, Gang; Zhou, Mingle; Li, Min

doi:10.1007/s10586-023-04079-7

YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Published: 30 June 2023

Volume 27, pages 2329–2344, (2024)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Shilong Zhao¹,
Gang Li¹,
Mingle Zhou¹ &
…
Min Li¹

571 Accesses
2 Citations
Explore all metrics

Abstract

This paper proposes a real-time industrial defect detection method based on context enhancement and attention to address the problem that current general-purpose target detectors can hardly achieve high detection accuracy and fast detection speed simultaneously. First, a modified MonileNetV3 is used as the backbone network to reduce the number of parameters and improve the model detection speed. A lightweight TRANS module is proposed at the end of the backbone network to combine more layers of features provided by global contextual information for complex background small target detection. Secondly, a cross-layer multi-scale feature fusion network is designed to fully fuse the fine-grained and semantic feature information extracted by the backbone and enhance the spatial location information between neighboring feature layers. Finally, a cascaded Two-channel Efficient Space attention module is used to fully extract texture and semantic features from the defective regions, allowing the model to focus more on the wrong locations and improve the feature representation capability of the network. The NEU-DET steel and PCB datasets are used to test the effectiveness of the proposed model. The experimental results show that compared to the original YOLOv5s algorithm, the mAP metrics are improved by 5.9% and 0.6%, F1 is improved by 4.82% and 0.93%, respectively, and the parameters are reduced by 33.77 M, enabling fast detection of industrial surface defects and meeting the needs of the entire industry.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 10

CSC-YOLO: An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates

Article 23 April 2024

AENet: attention enhancement network for industrial defect detection in complex and sensitive scenarios

Article 03 February 2024

Dual-branch information extraction and local attention anchor-free network for defect detection

Article Open access 13 May 2024

Data availibility

The data used to support the findings of this study are available from the corresponding author upon request.

References

Zhang, Z., Zhou, M., Wan, H., Li, M., Li, G., Han, D.: IDD-Net: Industrial defect detection method based on deep-learning. Eng. Appl. Artif. Intell. 123, 106390 (2023)
Article Google Scholar
Learning, D. Deep learning. High-dimensional Fuzzy Clustering, (2020)
Gong, Y., Srivastava, G.: Multi-target trajectory tracking in multi-frame video images of basketball sports based on deep learning. EAI Endors. Trans. Scalable Info. Syst. 10, e9–e9 (2023)
Google Scholar
Pan, K., Zhao, Y., Wang, T., Yao, S.: MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification. Signal Image Video Process. 17, 3091 (2023)
Girshick, R.: Fast R-CNN. Computer Science, (2015)
Ren S., He K., Girshick R., Sun J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Machine Intell. 39, 6 (2017)
Xuelong W., Ying G., Junyu D., Xukun Q., Lin Q., Hui M., Jun L.: Surface defects detection of paper dish based on Mask R-CNN. International Workshop on Pattern Recognition, SPIE, Washington (2018)
Joseph R., Santosh Kumar D., Ross B.G., Ali F.: You only look once: unified, real-time object detection. IEEE, New Jersey (2015)
Wei L., Dragomir A., Dumitru E., Christian S., Scott E.R., Cheng-Yang F., Alexander C.B.: SSD: single shot multibox detector. Springer, Cham (2015)
Li, G., Shao, R., Wan, H., Zhou, M., Li, M.: A model for surface defect detection of industrial products based on attention augmentation. Comput. Intell. Neurosci. (2022). https://doi.org/10.1155/2022/9577096
Zhang, Z.K., Zhou, M.L., Shao, R., Li, M., Li, G.: A defect detection model for industrial products based on attention and knowledge distillation. Comput. Intell. Neurosci. 2022, 6174255 (2022). https://doi.org/10.1155/2022/6174255
Article Google Scholar
Luo, H., Wang, P., Chen, H., Kowelo, V.: Small object detection network based on feature information enhancement. Comput. Intell. Neurosci. (2022). https://doi.org/10.1155/2022/6394823
Guo, Z., Wang, C., Yang, G., Huang, Z., Li, G.: MSFT-YOLO: improved YOLOv5 based on transformer for detecting defects of steel surface. Sensors 22(9), 3467 (2022). https://doi.org/10.3390/s22093467
Xu, S., Wang, X., Lv, W., Chang, Q., Cui, C., Deng, K., Wang, G., Dang, Q., Wei, S., Du, Y., et al.: PP-YOLOE: an evolved version of YOLO. Preprint at http://arxiv.org/abs/2203.16250 (2022)
Dlamini, S., Kuo, C., Chao, S.: Developing a surface mount technology defect detection system for mounted devices on printed circuit boards using a MobileNetV2 with feature pyramid network. Eng. Appl. Artif. Intell. 121, 105875 (2023)
Article Google Scholar
Jiang, X., Cai, W., Ding, Y., Wang, X., Yang, Z., Di, X., Gao, W.: Camouflaged object detection based on ternary cascade perception. Remote Sens. 15, 1188 (2023)
Article Google Scholar
Bin H.: Multi-scale feature fusion network with attention for single image dehazing. Pattern Recognit. Image Anal. 31, 31 (2021)
Lin, T., Dollár, P., Girshick, R., He, K., Hariharan, B. & Belongie, S.: Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117-2125. IEEE, New Jersey (2017)
Tan M., Pang R., Le Q.V.: EfficientDet: scalable and efficient object detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, New Jersey (2020)
Shu L., Lu Q., Haifang Q., Jianping S., Jiaya J.: Path aggregation network for instance segmentation, (2018)
Golnaz G., Tsung-Yi L., Ruoming P., Quoc V.L.: NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection. IEEE, New Jersey (2019)
Guo M.H., Xu T.X., Liu J.J., Liu Z.N., Jiang P.T., Mu TJ, Zhang S.H., Martin R.R., Cheng M.M., Hu S.M.: Attention mechanisms in computer vision: a survey. Comput. Visual Media 8, 3 (2022)
Wang Q., Wu B., Zhu P., Li P., Zuo W., Hu Q.: ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, New Jersey (2020)
Hou, Q., Zhou, D., Feng, J.: IEEE Comp Soc, “coordinate attention for efficient mobile network design,’’ presented at the,: IEEE/CVF conference on computer vision and pattern recognition. CVPR 2021(2021), 13708–13717 (2021). https://doi.org/10.1109/CVPR46437.2021.01350
Article Google Scholar
Jan G., Krzysztof G.: Awareness of self attention. Avant J. Philos. Interdiscip. Vanguard 7, 3 (2016)
Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Houlsby, N.: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, (2020)
Zhu, X., Su, W., Lu, L., Li, B., Wang, X., Dai, J.: Deformable DETR: deformable transformers for end-to-end object detection. International Conference on Learning Representations, (2021)
Zhiqiang, W., Jun, L.: A review of object detection based on convolutional neural network. 2017 36th Chinese Control Conference (CCC), pp. 11104-11109. (2017)
Howard, A., Sandler, M., Chu, G., Chen, L., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., Vasudevan, V., et al.: Searching for mobilenetv3. Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314-1324. IEEE, New Jersey (2019)
Liao, Y., Lu, S., Yang, Z., Liu, W.: Depthwise grouped convolution for object detection. Machine Vision Appl. (2021). https://doi.org/10.1007/s00138-021-01243-0
Liang, F., et al.: Efficient neural network using pointwise convolution kernels with linear phase constraint. Neurocomputing 423, 572–579 (2021). https://doi.org/10.1016/j.neucom.2020.10.067
Article Google Scholar
Zheng, Z., Wang, P., Ren, D., Liu, W., Ye, R., Hu, Q., Zuo, W.: Enhancing geometric factors in model learning and inference for object detection and instance segmentation. IEEE Trans. Cybern. 52, 8574–8586 (2021)
Article Google Scholar
Zhang, Y. F., Ren, W., Zhang, Z., Jia, Z., Wang, L., Tan, T.: Focal and Efficient IOU Loss for Accurate Bounding Box Regression, (2021)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778. IEEE, New Jersey (2016)
Gouider, C., Seddik, H.: YOLOv4 enhancement with efficient channel recalibration approach in CSPdarknet53. 2022 IEEE Information Technologies Smart Industrial Systems (ITSIS), pp. 1-6. IEEE, New Jersey (2022)
Tan, M., Le, Q.: Efficientnetv2: Smaller models and faster training. Preprint http://arxiv.org/abs/2104.00298 (2021)
Zhang, X., Zhou, X., Lin, M., Sun, J.: Hufflenet: an extremely efficient convolutional neural network for mobile devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6848-6856. IEEE, New Jersey (2018)
Wang, C., Bochkovskiy, A., Liao, H.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Preprint at http://arxiv.org/abs/2207.02696 (2022)

Download references

Acknowledgements

The authors would like to thank all the anonymous reviewers for their insightful comments and constructive suggestions.

Funding

This work was supported by the Taishan Scholars Program (NO. tsqn202103097) and the Key R and D plan of Shandong Province (Soft Science Project)(2022RZB02012).

Author information

Authors and Affiliations

Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250353, China
Shilong Zhao, Gang Li, Mingle Zhou & Min Li

Authors

Shilong Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Gang Li
View author publications
You can also search for this author in PubMed Google Scholar
Mingle Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Min Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Project administration, GL; data curation; writing—original draft, SZ; writing—review and editing, ML; funding acquisition, MZ.

Corresponding author

Correspondence to Gang Li.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest regarding this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhao, S., Li, G., Zhou, M. et al. YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention. Cluster Comput 27, 2329–2344 (2024). https://doi.org/10.1007/s10586-023-04079-7

Download citation

Received: 28 December 2022
Revised: 22 May 2023
Accepted: 02 June 2023
Published: 30 June 2023
Issue Date: June 2024
DOI: https://doi.org/10.1007/s10586-023-04079-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Abstract

Access this article

Similar content being viewed by others

CSC-YOLO: An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates

AENet: attention enhancement network for industrial defect detection in complex and sensitive scenarios

Dual-branch information extraction and local attention anchor-free network for defect detection

Data availibility

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Abstract

Access this article

Similar content being viewed by others

CSC-YOLO: An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates

AENet: attention enhancement network for industrial defect detection in complex and sensitive scenarios

Dual-branch information extraction and local attention anchor-free network for defect detection

Data availibility

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation