Abstract
Synthetic aperture radar (SAR) is instrumental in ship monitoring owing to its all-weather capabilities and high resolution. In SAR images, ship targets frequently display blurred or mixed boundaries with the background, and instances of occlusion or partial occlusion may occur. Additionally, multi-scale transformations and small-target ships pose challenges for ship detection. To tackle these challenges, we propose a novel SAR ship detection model, FESAR. Firstly, in addressing multi-scale transformations in ship detection, we propose the Fused Convolution Enhancement Module (FCEM). This network incorporates distinct convolutional branches designed to capture local and global features, which are subsequently fused and enhanced. Secondly, a Spatial Relationship Analysis Module (SRAM) with a spatial-mixing layer is designed to analyze the local spatial relationship between the ship target and the background, effectively combining local information to discern feature distinctions between the ship target and the background. Finally, a new backbone network, SPD-YOLO, is designed to perform deep downsampling for the comprehensive extraction of semantic information related to ships. To validate the model’s performance, an extensive series of experiments was conducted on the public datasets HRSID, LS-SSDD-v1.0, and SSDD. The results demonstrate the outstanding performance of the proposed FESAR model compared to numerous state-of-the-art (SOTA) models. Relative to the baseline model, FESAR exhibits an improvement in mAP by 2.6% on the HRSID dataset, 5.5% on LS-SSDD-v1.0, and 0.2% on the SSDD dataset. In comparison with numerous SAR ship detection models, FESAR demonstrates superior comprehensive performance.
Similar content being viewed by others
Availability of data and materials
The source code of the project used in this experiment is open source. From the open source website https://github.com. The datasets SSDD, LS-SSDD-v1.0 and HRSID used in the paper are both publicly available online datasets, and the datasets are also cited in the references.
References
Zhang, T., Zhang, X.: High-speed ship detection in SAR images based on a grid convolutional neural network. Remote Sens. 11(10), 1206 (2019)
Zhang, T., Zhang, X.: Injection of traditional hand-crafted features into modern CNN-based models for SAR ship classification: what, why, where, and how. Remote Sens. 13(11), 2091 (2021)
Zhang, T., Zhang, X.: A mask attention interaction and scale enhancement network for SAR ship instance segmentation. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)
Ai, J., Mao, Y., Luo, Q., et al.: Robust CFAR ship detector based on bilateral-trimmed statistics of complex ocean scenes in SAR imagery: a closed-form solution. IEEE Trans. Aerosp. Electron. Syst. 57(3), 1872–1890 (2021)
Hou, B., Chen, X., Jiao, L.: Multilayer CFAR detection of ship targets in very high resolution SAR images. IEEE Geosci. Remote Sens. Lett. 12, 811–815 (2014)
Renga, A., Graziano, M.D., Moccia, A.: Segmentation of marine SAR images by sublook analysis and application to sea traffic monitoring. IEEE Trans. Geosci. Remote Sens. 57, 1463–1477 (2018)
He, K., Gkioxari, G., Dollár, P., et al.: Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017).
Tiwari, V., Singhal, A., Dhankhar, N.: Detecting COVID-19 opacity in X-ray images using YOLO and RetinaNet ensemble. In: 2022 IEEE Delhi Section Conference (DELCON), pp. 1–5. IEEE (2022)
Zheng, W., Tang, W., Jiang, L., et al.: SE-SSD: self-ensembling single-stage object detector from point cloud. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14494–14503 (2021)
Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 10781–10790 (2020)
Cui, Z., Wang, X., Liu, N., et al.: Ship detection in large-scale SAR images via spatial shuffle-group enhance attention. IEEE Trans. Geosci. Remote Sens. 59(1), 379–391 (2020)
Zhou, K., Zhang, M., Wang, H., et al.: Ship detection in SAR images based on multi-scale feature extraction and adaptive feature fusion. Remote Sens. 14(3), 755 (2022)
Yu, J., Zhou, G., Zhou, S., et al.: A fast and lightweight detection network for multi-scale SAR ship detection under complex backgrounds. Remote Sens. 14(1), 31 (2021)
Xiong, B., Sun, Z., Wang, J., et al.: A lightweight model for ship detection and recognition in complex-scene SAR images. Remote Sens. 14(23), 6053 (2022)
Zhang, T., Zhang, X.: ShipDeNet-20: an only 20 convolution layers and< 1-MB lightweight SAR ship detector. IEEE Geosci. Remote Sens. Lett. 18(7), 1234–1238 (2020)
Wang, Z., Wang, B., Xu, N.: SAR ship detection in complex background based on multi-feature fusion and non-local channel attention mechanism. Int. J. Remote Sens. 42(19), 7519–7550 (2021)
Li, X., Li, D., Liu, H., et al.: A-BFPN: an attention-guided balanced feature pyramid network for SAR ship detection. Remote Sens. 14(15), 3829 (2022)
Ren, X., Bai, Y., Liu, G., et al.: YOLO-Lite: an efficient lightweight network for SAR ship detection. Remote Sens. 15(15), 3771 (2023)
Yang, X., Zhang, X., Wang, N., et al.: A robust one-stage detector for multiscale ship detection with complex background in massive SAR images. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2021)
Tang, G., Zhuge, Y., Claramunt, C., et al.: N-Yolo: a SAR ship detection using noise-classifying and complete-target extraction. Remote Sens. 13(5), 871 (2021)
Xia, R., Chen, J., Huang, Z., et al.: CRTransSar: a visual transformer based on contextual joint representation learning for SAR ship detection. Remote Sens. 14(6), 1488 (2022)
Chen, Z., Liu, C., Filaretov, V.F., et al.: Multi-scale ship detection algorithm based on YOLOv7 for complex scene SAR images. Remote Sens. 15(8), 2071 (2023)
Li, X., Zhong, Z., Wu, J., et al.: Expectation-maximization attention networks for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9167–9176 (2019)
Gong, Y., Zhang, Z., Wen, J., et al.: Small ship detection of SAR images based on optimized feature pyramid and sample augmentation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. (2023).
Chen, J., Kao, S., He, H., et al.: Run, Don’t walk: chasing higher FLOPS for faster neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12021–12031 (2023)
Tolstikhin, I.O., Houlsby, N., Kolesnikov, A., et al.: Mlp-mixer: An all-mlp architecture for vision. Adv. Neural. Inf. Process. Syst. 34, 24261–24272 (2021)
Sunkara, R., Luo, T.: No more strided convolutions or pooling: a new CNN building block for low-resolution images and small objects. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 443–459. Springer Nature Switzerland, Cham (2022)
Zhang, T., Zhang, X., Ke, X., et al.: LS-SSDD-v1.0: a deep learning dataset dedicated to small ship detection from large-scale Sentinel-1 SAR images. Remote Sens. 12(18), 2997 (2020)
Wei, S., Zeng, X., Qu, Q., et al.: HRSID: a high-resolution SAR images dataset for ship detection and instance segmentation. IEEE Access 8, 120234–120254 (2020)
Zhang, T., Zhang, X., Li, J., et al.: SAR ship detection dataset (SSDD): Official release and comprehensive data analysis. Remote Sens. 13(18), 3690 (2021)
Bai, L., Yao, C., Ye, Z., et al.: Feature enhancement pyramid and shallow feature reconstruction network for SAR Ship detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 16, 1042–1056 (2023)
Guo, H., Yang, X., Wang, N., et al.: A CenterNet++ model for ship detection in SAR images. Pattern Recogn. 112, 107787 (2021)
Wan, H., Chen, J., Huang, Z., et al.: AFSar: an anchor-free SAR target detection algorithm based on multiscale enhancement representation learning. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2021)
Zhang, T., Zhang, X., Liu, C., et al.: Balance learning for ship detection from synthetic aperture radar remote sensing imagery. ISPRS J. Photogramm. Remote Sens. 182, 190–207 (2021)
Zhang, Y., Chen, C., Hu, R., et al.: ESarDet: an efficient SAR ship detection method based on context information and large effective receptive field. Remote Sens. 15(12), 3018 (2023)
Zhang, T., Zhang, X., Ke, X.: Quad-FPN: a novel quad feature pyramid network for SAR ship detection. Remote Sens. 13(14), 2771 (2021)
Zhang, T., Zhang, X., Shi, J., et al.: Depthwise separable convolution neural network for high-speed SAR ship detection. Remote Sens. 11(21), 2483 (2019)
Cao, M., Lei, J., Xie, W., et al.: SAR-Net: multi-scale direction-aware SAR network via global information fusion. Preprint at http://arxiv.org/abs/2312.16943 (2023)
Li, Q., Xiao, D., Shi, F.: A decoupled head and coordinate attention detection method for ship targets in SAR images. IEEE Access 10, 128562–128578 (2022)
Zhou, Z., Chen, J., Huang, Z., et al.: HRLE-SARDet: a lightweight SAR target detection algorithm based on hybrid representation learning enhancement. IEEE Trans. Geosci. Remote Sens. 61, 1–22 (2023)
Zhang, Y., Han, D.: Swin-PAFF: a SAR ship detection network with contextual cross-information fusion. Comput. Mater. Contin. 77(2) (2023)
Funding
This study was not supported by any funding agency or project grant.
Author information
Authors and Affiliations
Contributions
Chongchong Liu contributed to conceptualization, methodology, software, writing, data curation, and visualization. Chunman Yan contributed to methodology, project administration, and supervision
Corresponding author
Ethics declarations
Conflict of interest
We declare that there is no interest of a financial or personal nature associated with this study. The design, conduct, data analysis, and interpretation of the results of this study are free from any potential interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, C., Yan, C. FESAR: SAR ship detection model based on local spatial relationship capture and fused convolutional enhancement. Machine Vision and Applications 35, 34 (2024). https://doi.org/10.1007/s00138-024-01516-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00138-024-01516-4