Abstract
Fire is one of the most harmful hazards that affect daily life. The existing fire detection methods have the problems of large computation, slow detection speed, and low detection accuracy to varying degrees, and do not achieve a better trade-off between model complexity, accuracy, and detection speed. In this paper, a multiscale fire image detection method combining Convolutional Neural Network(CNN) and Transformer is proposed. In the shallow layer of the model, the CNN-based multiscale feature extraction module is used to obtain rich fire image information. In the deep layers of the model, the powerful global learning ability of the Transformer is used to carry out overall perception and macroscopic understanding of images. The experimental results show that the best detection accuracy of the model can reach 94.62%, and the fastest detection speed can reach 158.12FPS, F1 score is stable at around 94%, which is fully capable of real-time and accurate detection of fire. Compared with the existing detection methods, this method has higher detection accuracy under similar model complexity and detection speed. With similar detection accuracy, our method has a faster detection speed. The proposed method achieves a better balance between model complexity, detection speed, and accuracy.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author on reasonable request. Specific sources of the datasets used in this article:
Dataset-1 in https://mivia.unisa.it/datasets/video-analysis-datasets/fire-detection-dataset/.
Dataset-2 in https://cvpr.kmu.ac.kr/.
Dataset-3 mainly sourced in https://www.kaggle.com/datasets/phylake1337/fire-datasethttps://www.kaggle.com/datasets/atulyakumar98/test-dataset and https://www.kaggle.com/datasets/studioai/fire1500.
Our dataset has been uploaded to Kaggle https://www.kaggle.com/datasets/firedetect/fire-dataset-10000 and will be open-sourced in due course.
References
National Fire and Rescue Administration (2023), Retrieved August 2023 from: https://www.119.gov.cn/qmxfxw/xfyw/2023/36210.shtml
Chen TH, Wu PH, Chiou YC (2004) An early fire-detection method based on image processing. In: 2004 International Conference on Image Processing (ICIP), pp 1707–1710
Marbach G, Loepfe M, Brupbacher T (2006) An image processing technique for fire detection in video images. Fire Saf 41(4):285–289
Celik T, Demirel H (2009) Fire detection in video sequences using a generic color model. Fire Saf 44(2):147–158
Celik T (2010) Fast and efficient method for fire detection using image processing. ETRI J 32(6):881–890
Celik T, Ozkaramanli H, Demirel H (2007) Fire pixel classification using fuzzy logic and statistical color model. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp 1205-1208
Rinsurongkawong S, Ekpanyapong M, Dailey MN (2012) Fire detection for early fire alarm based on optical flow video processing. In: 2012 9th International Conference on electrical engineering/electronics, computer, telecommunications and information technology, pp 1–4
Piri J, Mohapatra P, Pradhan MR et al (2021) A binary multi-objective chimp optimizer with dual archive for feature selection in the healthcare domain. IEEE Access 10:1756–1774
Das H, Chakraborty S, Acharya B et al. (2020) Optimal selection of features using teaching-learning-based optimization algorithm for classification. Appl Intell Decis Making Machine Learn 213–227
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1–9
Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Commun ACM 60(6):84–90
Muhammad K, Ahmad J, Mehmood I, Rho S, Baik SW (2018) Convolutional neural networks based fire detection in surveillance videos. IEEE Access 6:18174–18183
Muhammad K, Ahmad J, Baik SW (2018) Early Fire detection using convolutional neural networks during surveillance for effective disaster management. Neurocomputing 288:30–42
Wu H, Xiao B, Codella N et al (2021) Cvt: introducing convolutions to vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp 22–31
Töreyin BU, Dedeoğlu Y, Güdükbay U, Cetin AE (2006) Computer vision-based method for real-time fire and flame detection. Pattern Recognit Lett 27(1):49–58
Ha C, Hwang U, Jeon G, Cho J, Jeong J (2012) Vision-based fire detection algorithm using optical flow. In: 2012 Sixth International Conference on Complex, Intelligent, and Software Intensive Systems, pp 526–530
Chunyu Y, Jun F, Jinjun W, Yongming Z (2010) Video fire smoke detection using motion and color features. Fire Technol 46:651–663
Qiu T, Yan Y, Lu G (2011) An auto adaptive edge-detection algorithm for flame and Fire image processing. IEEE Trans Instrum Meas 61(5):1486–1493
Ko BC, Cheong KH, Nam JY (2009) Fire detection based on vision sensor and support vector machines. Fire Saf J 44(3):322–329
Frizzi S, Kaabi R, Bouchouicha M, Ginoux JM, Moreau E, Fnaiech F (2016) Convolutional neural network for video fire and smoke detection. In: IECON 2016-42nd Annual Conference of the IEEE Industrial Electronics Society, pp 877–882
Mao W, Wang W, Dou Z, Li Y (2018) Fire recognition based on multi-channel convolutional neural network. Fire Technol 54:531–554
Muhammad K, Ahmad J, Lv Z, Bellavista P, Yang P, Baik SW (2018) Efficient deep CNN-based Fire detection and localization in video surveillance applications. IEEE Trans Syst Man Cybern: Syst 49(7):1419–1434
Muhammad K, Khan S, Elhoseny M, Ahmed SH, Baik SW (2019) Efficient fire detection for uncertain surveillance environment. IEEE Trans Industr Inf 15(5):3113–3122
Li S, Yan Q, Liu P (2020) An efficient fire detection method based on multiscale feature extraction, implicit deep supervision and channel attention mechanism. IEEE Trans Image Process 29:8467–8475
Jeon M, Choi HS, Lee J, Kang M (2021) Multi-scale prediction for fire detection using convolutional neural network. Fire Technol 57(5):2533–2551
Ghosh R, Kumar A (2022) A hybrid deep learning model by combining convolutional neural network and recurrent neural network to detect forest fire. Multimed Tools Appl 81(27):38643–38660
Jianxin Z, Siwen G, Guolan Z, Lin T (2021) Fire detection model based on multi-scale feature fusion. J Zhengzhou Univ (Engineering Science) 42(5):13–18
Abdusalomov A, Baratov N, Kutlimuratov A, Whangbo TK (2021) An improvement of the fire detection and classification method using YOLOv3 for surveillance systems. Sensors 21(19):6519
Xu H, Li B, Zhong F (2022) Light-YOLOv5: a Lightweight Algorithm for Improved YOLOv5 in Complex Fire scenarios. Appl Sci 12(23):12312
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv:2010.11929
Li Y, Zhang W, Liu Y, Jing R, Liu C (2022) An efficient fire and smoke detection algorithm based on an end-to-end structured network. Eng Appl Artif Intell 116:105492
Mingliang Y, Huiying Z, Jianjun L (2022) Forest fire detection algorithm based on an improved swin transformer. J Cent South Univ Forestry Technol 42(08):101–110
Cao J, Li Y, Sun M et al (2022) Do-conv: depthwise over-parameterized convolutional layer. IEEE Trans Image Process 31:3726–3736
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 7132–7141
Foggia P, Saggese A, Vento M (2015) Real-time fire detection for video-surveillance applications using a combination of experts based on color, shape, and motion. IEEE Trans Circ Syst Video Technol 25(9):1545–1556
Ko BC, Ham SJ, Nam JY (2011) Modeling and formalization of fuzzy finite automata for detection of irregular Fire flames. IEEE Trans Circ Syst Video Technol 21(12):1903–1912
Gomes P, Santana P, Barata J (2014) A vision-based approach to fire detection. Int J Adv Robot Syst 11(9):1–12
Zhang D, Han S, Zhao J et al (2009) Image based forest fire detection using dynamic characteristics with artificial neural networks. International Joint Conference on Artificial Intelligence. IEEE, pp 290–293
Sharma J, Granmo OC, Goodwin M, Fidje JT (2017) Deep convolutional neural networks for fire detection in images. In: Proceedings of Engineering Applications of Neural Networks: 18th International Conference (EANN), pp 183–193
Park M, Tran DQ, Jung D, Park S (2020) Wildfire-detection method using DenseNet and cycleGAN Data Augmentation-based remote Camera Imagery. Remote Sens 12:3715
Wang W, Xie E, Li X, Fan DP, Song K, Liang, et al (2021) Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp 568–578
Graham B, El-Nouby A, Touvron H, Stock P, Joulin A, Jégou H, Douze M (2021) Levit: a vision transformer in convnet’s clothing for faster inference. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp 12259–12269
Acknowledgements
This research was supported by the Major scientific and technological project of Hubei Province, China [grant number 2021AAA007].
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wu, S., Sheng, B., Fu, G. et al. Multiscale fire image detection method based on CNN and Transformer. Multimed Tools Appl 83, 49787–49811 (2024). https://doi.org/10.1007/s11042-023-17482-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17482-4