Abstract
Images are easily polluted by noise in the process of acquisition and transmission, which will affect people's understanding and utilization of knowledge and information in images. Therefore, image denoising, as a classic problem, has received extensive attention from researchers. At present, many image denoising methods based on deep learning have been proposed and achieved good performance. However, most existing methods are insufficient in acquiring and utilizing crucial information in the image when removing noise under complex image denoising tasks such as blind denoising and real-world denoising, resulting in the loss of fine details in the reconstructed image. To overcome this shortcoming, in this paper, we propose a novel image denoising algorithm combining attention mechanism and residual UNet network, named Att-ResUNet. Specifically, we propose a novel UNet-based image denoising framework, which employs residual enhancement blocks and skip connections to form global–local residuals, which can fuse multi-scale global context and local features to more thoroughly capture and remove hidden noise in the image. A channel attention mechanism is introduced, which can better focus on the crucial information in the image and improve the denoising performance. In addition, we use adaptive average pooling for down-sampling, which can preserve more image structure information, reduce the loss of edge details, and adopt a residual learning strategy to enhance the learning and expressive capabilities of the denoising model. Extensive experiments on several publicly available standard datasets demonstrate the superiority of our method over 15 state-of-the-art methods and achieve excellent denoising performance. Compared with mainstream methods, our method outperforms current state-of-the-art methods by up to 0.76 dB and 1.10 dB on PSNR evaluation metrics on BSD68 and Set12 datasets, respectively. Notably, our method achieves an average PSNR value of 37.88 dB on the CC dataset in real-world denoising experiments, a significant improvement of 2.14 dB over the most advanced methods.
Similar content being viewed by others
References
He W, Zhang H, Shen H, Zhang L (2018) Hyperspectral image denoising using local low-rank matrix recovery and global spatial–spectral total variation. IEEE J Sel Top Appl Earth Observ Remote Sens 11(3):713–729
Shi Q, Tang X, Yang T, Liu R, Zhang L (2021) Hyperspectral image denoising using a 3-D attention denoising network. IEEE Trans Geosci Remote Sens 59(12):10348–10363
Pan E, Ma Y, Mei X, Fan F, Huang J, Ma J (2022) Sqad: spatial-spectral quasi-attention recurrent network for hyperspectral image denoising. IEEE Trans Geosci Remote Sens 60:1–14
Zhao W, Lu H (2017) Medical image fusion and denoising with alternating sequential filter and adaptive fractional order total variation. IEEE Trans Instrum Meas 66(9):2283–2294
Chen M, Pu YF, Bai YC (2021) Low-dose CT image denoising using residual convolutional network with fractional TV loss. Neurocomputing 452:510–520
Geng M, Meng X, Zhu L, Jiang Z, Gao M, Huang Z, Lu Y (2022) Triplet cross-fusion learning for unpaired image denoising in optical coherence tomography. IEEE Trans Med Imaging 41(11):3357–3372
Buades A, Coll B, Morel JM (2005) A review of image denoising algorithms, with a new one. Multiscale Model Simul 4(2):490–530
Thakur RS, Yadav RN, Gupta L (2019) State-of-art analysis of image denoising methods using convolutional neural networks. IET Image Proc 13(13):2367–2380
Tian C, Fei L, Zheng W, Xu Y, Zuo W, Lin CW (2020) Deep learning on image denoising: an overview. Neural Netw 131:251–275
Buades A, Coll, B, Morel, JM (2005) A non-local algorithm for image denoising. In: 2005 IEEE Computer society conference on computer vision and pattern recognition (CVPR'05), vol 2, IEEE, pp 60–65
Dabov K, Foi A, Katkovnik V, Egiazarian K (2007) Image denoising by sparse 3-d transform-domain collaborative filtering. IEEE Trans Image Process 16(8):2080–2095
Gu S, Zhang L, Zuo W, Feng X (2014) Weighted nuclear norm minimization with application to image denoising. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2862–2869
Aharon M, Elad M, Bruckstein A (2006) K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322
Vardan P, Yaniv R, Jeremias S, Michael E (2018) Theoretical foundations of deep learning via sparse representations: a multilayer sparse model and its connection to convolutional neural networks. IEEE Signal Process Mag 35(4):72–89
Jain V, Murray JF, Roth F, Turaga S, Zhigulin V, Briggman KL, Seung H S (2007) Supervised learning of image restoration with convolutional networks. In: 2007 IEEE 11th International Conference on Computer Vision, IEEE, pp 1–8
Burger HC, Schuler CJ, Harmeling S (2012) Image denoising: Can plain neural networks compete with bm3d? In: 2012 IEEE conference on computer vision and pattern recognition, IEEE, pp 2392–2399
Schmidt U, Roth S (2014) Shrinkage fields for effective image restoration. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2774–2781
Mao X, Shen C, Yang YB (2016) Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. Advances in neural information processing systems. https://doi.org/10.48550/arXiv.1603.09056
Chen Y, Pock T (2016) Trainable nonlinear reaction diffusion: a flexible framework for fast and effective image restoration. IEEE Trans Pattern Anal Mach Intell 39(6):1256–1272
Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process 26(7):3142–3155
Zhang K, Zuo W, Gu S, Zhang L (2017) Learning deep CNN denoiser prior for image restoration. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3929–3938
Zhang K, Zuo W, Zhang L (2018) FFDNet: toward a fast and flexible solution for CNN-based image denoising. IEEE Trans Image Process 27(9):4608–4622
Tian C, Xu Y, Fei L, Wang J, Wen J, Luo N (2019) Enhanced cnn for image denoising. CAAI Trans Intell Technol 4(1):17–23
Tian C, Xu Y, Li Z, Zuo W, Fei L, Liu H (2020) Attention-guided CNN for image denoising. Neural Netw 124:117–129
Quan Y, Chen Y, Shao Y, Teng H, Xu Y, Ji H (2021) Image denoising using complex-valued deep CNN. Pattern Recognit 111:107639
Tian C, Xu Y, Zuo W, Du B, Lin CW, Zhang D (2021) Designing and training of a dual CNN for image denoising. Knowl Based Syst 226:106949
Zhang Q, Xiao J, Tian C, Chun-Wei Lin J, Zhang S (2022) A robust deformed convolutional neural network (CNN) for image denoising. CAAI Trans Intell Technol. https://doi.org/10.1049/cit2.12110
Tian C, Zheng M, Zuo W, Zhang B, Zhang Y, Zhang D (2023) Multi-stage image denoising with the wavelet transform. Pattern Recognit 134:109050
Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, Springer, pp 234–241
Li C, Tan Y, Chen W, Luo X, Gao Y, Jia X, Wang Z (2020) Attention Unet++: a nested attention-aware U-Net for liver CT image segmentation. In: 2020 IEEE international conference on image processing, IEEE, pp 345–349
Amer A, Ye X, Zolgharni M, Janan F (2020) ResDUnet: residual dilated UNet for left ventricle segmentation from echocardiographic images. In: 2020 42nd Annual international conference of the IEEE engineering in medicine & biology society (EMBC), IEEE, pp 2019–2022
Han Z, Jian M, Wang GG (2022) ConvUNeXt: an efficient convolution neural network for medical image segmentation. Knowl Based Syst 253:109512
Lin A, Chen B, Xu J, Zhang Z, Lu G, Zhang D (2022) Ds-transunet: dual swin transformer u-net for medical image segmentation. IEEE Trans Instrum Meas 71:1–15
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. Las Vegas, pp 770–778
Zhang Y, Li J, Wei S, Zhou F, Li D (2021) Heartbeats classification using hybrid time-frequency analysis and transfer learning based on ResNet. IEEE J Biomed Health Inform 25(11):4175–4184
Zhang Z, Liu Q, Wang Y (2018) Road extraction by deep residual u-net. IEEE Geosci Remote Sens Lett 15(5):749–753
Sun T, Ding S, Guo L (2022) Low-degree term first in ResNet, its variants and the whole neural network family. Neural Netw 148:155–165
Dentamaro V, Giglio P, Impedovo D, Moretti L, Pirlo G (2022) AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath. Pattern Recogn 127:108656
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Roy SK, Dubey SR, Chatterjee S, Chaudhuri BB (2020) FuSENet: fused squeeze-and-excitation network for spectral-spatial hyperspectral image classification. IET Image Proc 14(8):1653–1661
Li Y, Liu Y, Cui WG, Guo YZ, Huang H, Hu ZY (2020) Epileptic seizure detection in EEG signals using a unified temporal-spectral squeeze-and-excitation network. IEEE Trans Neural Syst Rehabil Eng 28(4):782–794
Li G, Fang Q, Zha L, Gao X, Zheng N (2022) HAM: hybrid attention module in deep convolutional neural networks for image classification. Pattern Recognit 129:108785
Cheng J, Tian S, Yu L, Gao C, Kang X, Ma X, Lu H (2022) ResGANet: residual group attention network for medical image classification and segmentation. Med Image Anal 76:102313
Martin D, Fowlkes C, Tal D, & Malik J (2001) A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proceedings Eighth IEEE International Conference on Computer Vision, IEEE, vol. 2, pp 416-423
Ma K, Duanmu Z, Wu Q, Wang Z, Yong H, Li H, Zhang L (2016) Waterloo exploration database: new challenges for image quality assessment models. IEEE Trans Image Process 26(2):1004–1016
Agustsson E & Timofte R (2017) Ntire 2017 challenge on single image super-resolution: dataset and study. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 126–135
Xu J, Li H, Liang Z, Zhang D, & Zhang L (2018) Real-world noisy image denoising: a new benchmark. arXiv preprint arXiv:1804.02603
Roth S, Black MJ (2005) Fields of experts: A framework for learning image priors. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), Vol 2, Citeseer, pp 860–867
Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2009) Non-local sparse models for image restoration. In: 2009 IEEE 12th international conference on computer vision, IEEE, pp 2272–2279
Franzen R (1999) Kodak lossless true color image suite, vol 4, http://r0k.us/graphics/kodak
Zhang L, Wu X, Buades A, Li X (2011) Color demosaicking by local directional interpolation and nonlocal adaptive thresholding. J Electron Imaging 20(2):023016
Nam S, Hwang Y, Matsushita Y, & Kim S J (2016) A holistic approach to cross-channel image noise modeling and its application to image denoising. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1683–1691
Huynh-Thu Q, Ghanbari M (2008) Scope of validity of psnr in image/video quality assessment. Electron Lett 44(13):800–801
Hore A, Ziou D (2010) Image quality metrics: PSNR vs. SSIM. In: 2010 20th international conference on pattern recognition, IEEE, pp 2366–2369
D Zoran, Weiss Y (2011) From learning models of natural image patches to whole image restoration. In: 2011 International conference on computer vision, pp 479–486
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant Nos. 62276265, 61976216, and 61672522.
Author information
Authors and Affiliations
Contributions
Prof. Shifei Ding helped in supervision. Dr. Qidong Wang helped in conceptualization and methodology. Dr. Lili Guo helped in supervision. Dr. Jian Zhang worked in software and writing. Dr. Ling Ding helped in supervision. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ding, S., Wang, Q., Guo, L. et al. A novel image denoising algorithm combining attention mechanism and residual UNet network. Knowl Inf Syst 66, 581–611 (2024). https://doi.org/10.1007/s10115-023-01965-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-023-01965-9