A small attentional YOLO model for landslide detection from satellite remote sensing images

Cheng, Libo; Li, Jia; Duan, Ping; Wang, Mingguo

doi:10.1007/s10346-021-01694-6

A small attentional YOLO model for landslide detection from satellite remote sensing images

Original paper
Published: 22 May 2021

Volume 18, pages 2751–2765, (2021)
Cite this article

Landslides Aims and scope Submit manuscript

Libo Cheng¹,
Jia Li¹,
Ping Duan¹ &
…
Mingguo Wang²

3625 Accesses
66 Citations
Explore all metrics

Abstract

The use of high-spatial-resolution remote sensing image technology on mobile and embedded equipment is an important and effective way for emergency rescue and evaluation decision-makers to quickly and accurately detect landslide areas. Deep learning-based landslide detection models include one-stage and two-stage models. The two-stage landslide detection models are slower. The one-stage landslide detection models are faster but less accurate. Both types of detection models have many parameters. This research aims to improve the speed, accuracy, and parameters of landslide detection models. A you only look once-small attention (YOLO-SA) landslide detection model is proposed. YOLO-SA is an improved version of the one-stage detection model YOLOv4. First, the group convolution (Gconv) and ghost bottleneck (G-bneck) residual modules are used to replace the convolution components and residual module consisting of standard convolution. The purpose is to reduce the parameters of the model. Then, on this basis, an attention mechanism is added to improve the detection accuracy of the model. Finally, the position of the attention mechanism is adjusted to determine the framework of YOLO-SA. Qiaojia and Ludian counties in Yunnan Province, China, are used as the study area to acquire three-channel (red, green, blue) historical landslide optical remote sensing images from Google Earth, with a total of 1818 images, for training the model. YOLO-SA is compared with 11 advanced models, including Faster-RCNN, 3 types of EfficientDet, 2 types of Centernet, SSD-efficient, and 4 types of YOLOv4 models. The results show that the number of YOLO-SA parameters is reduced to 1.472 mb compared to EfficientDet-D0; the accuracy is improved to 94.08% compared to Centernet-hourglass; and the speed is up to 42 f/s. In addition, the effectiveness of the YOLO-SA model for potential landslide detection is verified, with an F1 score of 90.65%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An improved fire detection approach based on YOLO-v8 for smart cities

Article Open access 28 July 2023

CBAM: Convolutional Block Attention Module

SCA-YOLO: a new small object detection model for UAV images

Article 25 May 2023

Availability of data and material

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request.

Code availability

The code used during the current study is available from the corresponding author on reasonable request.

References

Amatya P, Kirschbaum D, Stanley T (2019) Use of very high-resolution optical data for landslide mapping and susceptibility analysis along the Karnali Highway, Nepal. Remote Sens 11:2284. https://doi.org/10.3390/rs11192284
Article Google Scholar
Bochkovskiy A, Wang CY, Liao HYM (2020) YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
Cao Y, Xu J, Lin S, Wei F, Hu H (2019) GCNet: non-local networks meet squeeze-excitation networks and beyond. arXiv preprint arXiv:1904.11492
Cheng G, Guo L, Zhao T, Han J, Li H, Fang J (2013) Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA. Int J Remote Sens 34:45–59. https://doi.org/10.1080/01431161.2012.705443
Article Google Scholar
Cheng G, Li R, Lang C, Han J (2021) Task-wise attention guided part complementary learning for few-shot image classification. Sci China Inform Sci 64:120104
Article Google Scholar
Di Napoli M et al (2020) Machine learning ensemble modelling as a tool to improve landslide susceptibility mapping reliability. Landslides 17:1897–1914. https://doi.org/10.1007/s10346-020-01392-9
Article Google Scholar
Du S, Zang P, Zang B, Xu H (2021) Weak and occluded vehicle detection in complex infrared environment based on improved YOLOv4. IEEE Access 9:25671–25680. https://doi.org/10.1109/ACCESS.2021.3057723
Article Google Scholar
Fu CY, Liu W, Ranga A, Tyagi A, Berg AC (2017) DSSD: deconvolutional single shot detector. arXiv preprint arXiv:170106659
Galli M, Ardizzone F, Cardinali M, Guzzetti F, Reichenbach PJG (2008) Comparing landslide inventory maps. Geomorphology 94:268–289. https://doi.org/10.1016/j.geomorph.2006.09.023
Article Google Scholar
Gu T, Li J, Wang M, Duan P (2021) Landslide susceptibility assessment in Zhenxiong County of China based on geographically weighted logistic regression model. Geocarto Int:1–23. https://doi.org/10.1080/10106049.2021.1903571
He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal 37:1904–1916
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, p 770-778. https://doi.org/10.1109/CVPR.2016.90
He K, Gkioxari G, Dollár P, Girshick R Mask R-CNN (2017) In: arXiv preprint arXiv:1703.06870
Hong et al (2019) Improved faster R-CNN with multiscale feature fusion and homography augmentation for vehicle detection in remote sensing images. IEEE Geosci Remote Sens Lett 16:1761–1765. https://doi.org/10.1109/LGRS.2019.2909541
Article Google Scholar
Howard AG et al. (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:170404861
Howard A et al. (2019) Searching for MobileNetV3. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1314-1324
Hu J, Shen L, Albanie S, Sun G, Wu E (2018) Squeeze-and-excitation networks. IEEE Trans Pattern Anal Mach Intell:7132–7141
Ji S, Yu D, Shen C, Li W, Xu Q (2020) Landslide detection from an open satellite imagery and digital elevation model dataset using attention boosted convolutional neural networks. Landslides 17:1337–1352. https://doi.org/10.1007/s10346-020-01353-2
Article Google Scholar
Li T-T, Pei X-J, Huang R-Q, Jin L-D (2016) The formation and evolution of the Qiaojia pull-apart basin, North Xiaojiang Fault Zone, Southwest China. J Mt Sci Engl 13:1096–1106. https://doi.org/10.1007/s11629-015-3778-1
Article Google Scholar
Li X, Wang W, Hu X, Yang J (2019) Selective Kernel Networks. Proceedings of the IEEE conference on computer vision and pattern recognition, p 510-519
Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. IEEE Trans Pattern Anal:2999–3007
Liu W, Anguelov D, Erhan D (2016) SSD: Single shot multibox detector. arXiv preprint arXiv:151202325: 21-37
Loshchilov I, Hutter F (2016) SGDR: stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983
Ma H, Liu Y, Ren Y, Yu J (2020) Detection of collapsed buildings in post-earthquake remote sensing images based on the improved YOLOv3. Remote Sens 12:44. https://doi.org/10.3390/rs12010044
Article Google Scholar
Maxwell AE, Pourmohammadi P, Poyner JD (2020) Mapping the topographic features of mining-related valley fills using mask R-CNN deep learning and digital elevation data. Remote Sens 12:547. https://doi.org/10.3390/rs12010044
Article Google Scholar
Messeri A, Morabito M, Messeri G, Brandani G, Petralli M, Natali F, Grifoni D, Crisci A, Gensini G, Orlandini S (2015) Weather-related flood and landslide damage: a risk index for Italian Regions. PLoS One 10:e0144468
Article Google Scholar
Polyak BT, Juditsky AB (1992) Acceleration of stochastic approximation by averaging. SIAM J Control Optim 30:838–855. https://doi.org/10.1137/0330046
Article Google Scholar
Ramachandran P, Zoph B, Le QV (2017) Swish: a self-gated activation function. arXiv preprint arXiv:1710.05941
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6517-6525
Redmon J, Farhadi A (2018) YOLOv3: an incremental improvement. arXiv preprint arXiv:180402767
Redmon J, Divvala S, Girshick R, Farhadi A (2015) You only look once: unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, p 779-788
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal 39:1137–1149
Article Google Scholar
Rezatofighi H, Tsoi N, Gwak JY, Sadeghian A, Savarese S (2019) Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Roy AG, Navab N, Wachinger C (2018) Concurrent spatial and channel squeeze & excitation in fully convolutional networks. International conference on medical image computing and computer-assisted intervention. Springer, Cham, pp 421–429
Google Scholar
Srivastava RK, Greff K, Schmidhuber J (2015) Training very deep networks. arXiv preprint arXiv:150500387
Tan M, Pang R, Le QV (2019) EfficientDet: scalable and efficient object detection. arXiv preprint arXiv:191109070
Wang X, Girshick R, Gupta A, He K (2017) Non-local neural networks. Proceedings of the IEEE conference on computer vision and pattern recognition, p 7794-7803
Wang H, Liu S, Xu W, Yan L, Qu X, Xie W-C (2020) Numerical investigation on the sliding process and deposit feature of an earthquake-induced landslide: a case study. Landslides 17:2671–2682. https://doi.org/10.1007/s10346-020-01446-y
Article Google Scholar
Xu Z, Chen Y, Yang F, Chu T, Zhou H (2020) A postearthquake multiple scene recognition model based on classical SSD method and transfer learning. ISPRS Int J Geo-Inf 9:238. https://doi.org/10.3390/ijgi9040238
Article Google Scholar
Yang Y, Deng H (2020) GC-YOLOv3: you only look once with global context block. Electronics 9:1235. https://doi.org/10.3390/electronics9081235
Article Google Scholar
Yang Y, Yang J, Xu C, Xu C, Song C (2019) Local-scale landslide susceptibility mapping using the B-GeoSVC model. Landslides 16:1301–1312. https://doi.org/10.1007/s10346-019-01174-y
Article Google Scholar
Zhang X, Zou Y, Wei S (2017) Dilated convolution neural network with LeakyReLU for environmental sound classification. In: Digit Signal Process, pp 1-5
Zhang H et al. (2020) ResNeSt: split-attention networks. arXiv preprint arXiv:2004.08955
Zheng Z, Wang P, Liu W, Li J, Ren D (2020) Distance-IoU loss: faster and better learning for bounding box regression. In: Distance-IoU loss: faster and better learning for bounding box regression, pp 12993-13000
Zhou X, Wang D, Krhenbühl P (2019) Objects as points. arXiv preprint arXiv:191109070

Download references

Acknowledgements

We thank all the authors for their great contribution to this study. We thank Zhoujie Luo and Peng Lai for their help in the experimental data collection and labeling process. Thanks for the valuable landslide data provided by the Yunnan Geological Disaster Department.

Funding

This research was funded by the National Natural Science Foundation of China (No. 41961061) and Yunnan Fundamental Research Projects (grant NO. 202001AT070057).

Author information

Authors and Affiliations

Faculty of Geography, Yunnan Normal University, Kunming, 650500, Yunnan, China
Libo Cheng, Jia Li & Ping Duan
Yunnan Institute of Geological Sciences, Kunming, 650501, Yunnan, China
Mingguo Wang

Authors

Libo Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Jia Li
View author publications
You can also search for this author in PubMed Google Scholar
Ping Duan
View author publications
You can also search for this author in PubMed Google Scholar
Mingguo Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: [Jia Li], [Ping Duan]; methodology: [Libo Cheng], [Jia Li]; formal analysis and investigation: [Libo Cheng]; writing - original draft preparation: [Libo Cheng]; writing - review and editing: [Libo Cheng], [Jia Li]; funding acquisition: [Jia Li]; resources: [Mingguo Wang], [Ping Duan]; supervision: [Jia Li], [Ping Duan]; accuracy evaluation: [Mingguo Wang], [Ping Duan].

Corresponding authors

Correspondence to Jia Li or Ping Duan.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

All authors have read and agreed to the published version of the manuscript.

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cheng, L., Li, J., Duan, P. et al. A small attentional YOLO model for landslide detection from satellite remote sensing images. Landslides 18, 2751–2765 (2021). https://doi.org/10.1007/s10346-021-01694-6

Download citation

Received: 11 December 2020
Accepted: 07 May 2021
Published: 22 May 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s10346-021-01694-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A small attentional YOLO model for landslide detection from satellite remote sensing images

Abstract

Access this article

Similar content being viewed by others

An improved fire detection approach based on YOLO-v8 for smart cities

CBAM: Convolutional Block Attention Module

SCA-YOLO: a new small object detection model for UAV images

Availability of data and material

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval

Consent to participate

Consent for publication

Competing interests

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A small attentional YOLO model for landslide detection from satellite remote sensing images

Abstract

Access this article

Similar content being viewed by others

An improved fire detection approach based on YOLO-v8 for smart cities

CBAM: Convolutional Block Attention Module

SCA-YOLO: a new small object detection model for UAV images

Availability of data and material

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval

Consent to participate

Consent for publication

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation