Skip to main content
Log in

A novel image denoising algorithm combining attention mechanism and residual UNet network

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Images are easily polluted by noise in the process of acquisition and transmission, which will affect people's understanding and utilization of knowledge and information in images. Therefore, image denoising, as a classic problem, has received extensive attention from researchers. At present, many image denoising methods based on deep learning have been proposed and achieved good performance. However, most existing methods are insufficient in acquiring and utilizing crucial information in the image when removing noise under complex image denoising tasks such as blind denoising and real-world denoising, resulting in the loss of fine details in the reconstructed image. To overcome this shortcoming, in this paper, we propose a novel image denoising algorithm combining attention mechanism and residual UNet network, named Att-ResUNet. Specifically, we propose a novel UNet-based image denoising framework, which employs residual enhancement blocks and skip connections to form global–local residuals, which can fuse multi-scale global context and local features to more thoroughly capture and remove hidden noise in the image. A channel attention mechanism is introduced, which can better focus on the crucial information in the image and improve the denoising performance. In addition, we use adaptive average pooling for down-sampling, which can preserve more image structure information, reduce the loss of edge details, and adopt a residual learning strategy to enhance the learning and expressive capabilities of the denoising model. Extensive experiments on several publicly available standard datasets demonstrate the superiority of our method over 15 state-of-the-art methods and achieve excellent denoising performance. Compared with mainstream methods, our method outperforms current state-of-the-art methods by up to 0.76 dB and 1.10 dB on PSNR evaluation metrics on BSD68 and Set12 datasets, respectively. Notably, our method achieves an average PSNR value of 37.88 dB on the CC dataset in real-world denoising experiments, a significant improvement of 2.14 dB over the most advanced methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. He W, Zhang H, Shen H, Zhang L (2018) Hyperspectral image denoising using local low-rank matrix recovery and global spatial–spectral total variation. IEEE J Sel Top Appl Earth Observ Remote Sens 11(3):713–729

    Article  Google Scholar 

  2. Shi Q, Tang X, Yang T, Liu R, Zhang L (2021) Hyperspectral image denoising using a 3-D attention denoising network. IEEE Trans Geosci Remote Sens 59(12):10348–10363

    Article  Google Scholar 

  3. Pan E, Ma Y, Mei X, Fan F, Huang J, Ma J (2022) Sqad: spatial-spectral quasi-attention recurrent network for hyperspectral image denoising. IEEE Trans Geosci Remote Sens 60:1–14

    Google Scholar 

  4. Zhao W, Lu H (2017) Medical image fusion and denoising with alternating sequential filter and adaptive fractional order total variation. IEEE Trans Instrum Meas 66(9):2283–2294

    Article  Google Scholar 

  5. Chen M, Pu YF, Bai YC (2021) Low-dose CT image denoising using residual convolutional network with fractional TV loss. Neurocomputing 452:510–520

    Article  Google Scholar 

  6. Geng M, Meng X, Zhu L, Jiang Z, Gao M, Huang Z, Lu Y (2022) Triplet cross-fusion learning for unpaired image denoising in optical coherence tomography. IEEE Trans Med Imaging 41(11):3357–3372

    Article  Google Scholar 

  7. Buades A, Coll B, Morel JM (2005) A review of image denoising algorithms, with a new one. Multiscale Model Simul 4(2):490–530

    Article  MathSciNet  Google Scholar 

  8. Thakur RS, Yadav RN, Gupta L (2019) State-of-art analysis of image denoising methods using convolutional neural networks. IET Image Proc 13(13):2367–2380

    Article  Google Scholar 

  9. Tian C, Fei L, Zheng W, Xu Y, Zuo W, Lin CW (2020) Deep learning on image denoising: an overview. Neural Netw 131:251–275

    Article  Google Scholar 

  10. Buades A, Coll, B, Morel, JM (2005) A non-local algorithm for image denoising. In: 2005 IEEE Computer society conference on computer vision and pattern recognition (CVPR'05), vol 2, IEEE, pp 60–65

  11. Dabov K, Foi A, Katkovnik V, Egiazarian K (2007) Image denoising by sparse 3-d transform-domain collaborative filtering. IEEE Trans Image Process 16(8):2080–2095

    Article  MathSciNet  Google Scholar 

  12. Gu S, Zhang L, Zuo W, Feng X (2014) Weighted nuclear norm minimization with application to image denoising. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2862–2869

  13. Aharon M, Elad M, Bruckstein A (2006) K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322

    Article  Google Scholar 

  14. Vardan P, Yaniv R, Jeremias S, Michael E (2018) Theoretical foundations of deep learning via sparse representations: a multilayer sparse model and its connection to convolutional neural networks. IEEE Signal Process Mag 35(4):72–89

    Article  Google Scholar 

  15. Jain V, Murray JF, Roth F, Turaga S, Zhigulin V, Briggman KL, Seung H S (2007) Supervised learning of image restoration with convolutional networks. In: 2007 IEEE 11th International Conference on Computer Vision, IEEE, pp 1–8

  16. Burger HC, Schuler CJ, Harmeling S (2012) Image denoising: Can plain neural networks compete with bm3d? In: 2012 IEEE conference on computer vision and pattern recognition, IEEE, pp 2392–2399

  17. Schmidt U, Roth S (2014) Shrinkage fields for effective image restoration. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2774–2781

  18. Mao X, Shen C, Yang YB (2016) Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. Advances in neural information processing systems. https://doi.org/10.48550/arXiv.1603.09056

  19. Chen Y, Pock T (2016) Trainable nonlinear reaction diffusion: a flexible framework for fast and effective image restoration. IEEE Trans Pattern Anal Mach Intell 39(6):1256–1272

    Article  Google Scholar 

  20. Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process 26(7):3142–3155

    Article  MathSciNet  Google Scholar 

  21. Zhang K, Zuo W, Gu S, Zhang L (2017) Learning deep CNN denoiser prior for image restoration. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3929–3938

  22. Zhang K, Zuo W, Zhang L (2018) FFDNet: toward a fast and flexible solution for CNN-based image denoising. IEEE Trans Image Process 27(9):4608–4622

    Article  MathSciNet  Google Scholar 

  23. Tian C, Xu Y, Fei L, Wang J, Wen J, Luo N (2019) Enhanced cnn for image denoising. CAAI Trans Intell Technol 4(1):17–23

    Article  Google Scholar 

  24. Tian C, Xu Y, Li Z, Zuo W, Fei L, Liu H (2020) Attention-guided CNN for image denoising. Neural Netw 124:117–129

    Article  Google Scholar 

  25. Quan Y, Chen Y, Shao Y, Teng H, Xu Y, Ji H (2021) Image denoising using complex-valued deep CNN. Pattern Recognit 111:107639

    Article  Google Scholar 

  26. Tian C, Xu Y, Zuo W, Du B, Lin CW, Zhang D (2021) Designing and training of a dual CNN for image denoising. Knowl Based Syst 226:106949

    Article  Google Scholar 

  27. Zhang Q, Xiao J, Tian C, Chun-Wei Lin J, Zhang S (2022) A robust deformed convolutional neural network (CNN) for image denoising. CAAI Trans Intell Technol. https://doi.org/10.1049/cit2.12110

    Article  Google Scholar 

  28. Tian C, Zheng M, Zuo W, Zhang B, Zhang Y, Zhang D (2023) Multi-stage image denoising with the wavelet transform. Pattern Recognit 134:109050

    Article  Google Scholar 

  29. Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, Springer, pp 234–241

  30. Li C, Tan Y, Chen W, Luo X, Gao Y, Jia X, Wang Z (2020) Attention Unet++: a nested attention-aware U-Net for liver CT image segmentation. In: 2020 IEEE international conference on image processing, IEEE, pp 345–349

  31. Amer A, Ye X, Zolgharni M, Janan F (2020) ResDUnet: residual dilated UNet for left ventricle segmentation from echocardiographic images. In: 2020 42nd Annual international conference of the IEEE engineering in medicine & biology society (EMBC), IEEE, pp 2019–2022

  32. Han Z, Jian M, Wang GG (2022) ConvUNeXt: an efficient convolution neural network for medical image segmentation. Knowl Based Syst 253:109512

    Article  Google Scholar 

  33. Lin A, Chen B, Xu J, Zhang Z, Lu G, Zhang D (2022) Ds-transunet: dual swin transformer u-net for medical image segmentation. IEEE Trans Instrum Meas 71:1–15

    Google Scholar 

  34. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. Las Vegas, pp 770–778

  35. Zhang Y, Li J, Wei S, Zhou F, Li D (2021) Heartbeats classification using hybrid time-frequency analysis and transfer learning based on ResNet. IEEE J Biomed Health Inform 25(11):4175–4184

    Article  Google Scholar 

  36. Zhang Z, Liu Q, Wang Y (2018) Road extraction by deep residual u-net. IEEE Geosci Remote Sens Lett 15(5):749–753

    Article  Google Scholar 

  37. Sun T, Ding S, Guo L (2022) Low-degree term first in ResNet, its variants and the whole neural network family. Neural Netw 148:155–165

    Article  Google Scholar 

  38. Dentamaro V, Giglio P, Impedovo D, Moretti L, Pirlo G (2022) AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath. Pattern Recogn 127:108656

    Article  Google Scholar 

  39. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  40. Roy SK, Dubey SR, Chatterjee S, Chaudhuri BB (2020) FuSENet: fused squeeze-and-excitation network for spectral-spatial hyperspectral image classification. IET Image Proc 14(8):1653–1661

    Article  Google Scholar 

  41. Li Y, Liu Y, Cui WG, Guo YZ, Huang H, Hu ZY (2020) Epileptic seizure detection in EEG signals using a unified temporal-spectral squeeze-and-excitation network. IEEE Trans Neural Syst Rehabil Eng 28(4):782–794

    Article  Google Scholar 

  42. Li G, Fang Q, Zha L, Gao X, Zheng N (2022) HAM: hybrid attention module in deep convolutional neural networks for image classification. Pattern Recognit 129:108785

    Article  Google Scholar 

  43. Cheng J, Tian S, Yu L, Gao C, Kang X, Ma X, Lu H (2022) ResGANet: residual group attention network for medical image classification and segmentation. Med Image Anal 76:102313

    Article  Google Scholar 

  44. Martin D, Fowlkes C, Tal D, & Malik J (2001) A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proceedings Eighth IEEE International Conference on Computer Vision, IEEE, vol. 2, pp 416-423

  45. Ma K, Duanmu Z, Wu Q, Wang Z, Yong H, Li H, Zhang L (2016) Waterloo exploration database: new challenges for image quality assessment models. IEEE Trans Image Process 26(2):1004–1016

    Article  MathSciNet  Google Scholar 

  46. Agustsson E & Timofte R (2017) Ntire 2017 challenge on single image super-resolution: dataset and study. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 126–135

  47. Xu J, Li H, Liang Z, Zhang D, & Zhang L (2018) Real-world noisy image denoising: a new benchmark. arXiv preprint arXiv:1804.02603

  48. Roth S, Black MJ (2005) Fields of experts: A framework for learning image priors. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), Vol 2, Citeseer, pp 860–867

  49. Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2009) Non-local sparse models for image restoration. In: 2009 IEEE 12th international conference on computer vision, IEEE, pp 2272–2279

  50. Franzen R (1999) Kodak lossless true color image suite, vol 4, http://r0k.us/graphics/kodak

  51. Zhang L, Wu X, Buades A, Li X (2011) Color demosaicking by local directional interpolation and nonlocal adaptive thresholding. J Electron Imaging 20(2):023016

    Article  Google Scholar 

  52. Nam S, Hwang Y, Matsushita Y, & Kim S J (2016) A holistic approach to cross-channel image noise modeling and its application to image denoising. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1683–1691

  53. Huynh-Thu Q, Ghanbari M (2008) Scope of validity of psnr in image/video quality assessment. Electron Lett 44(13):800–801

    Article  Google Scholar 

  54. Hore A, Ziou D (2010) Image quality metrics: PSNR vs. SSIM. In: 2010 20th international conference on pattern recognition, IEEE, pp 2366–2369

  55. D Zoran, Weiss Y (2011) From learning models of natural image patches to whole image restoration. In: 2011 International conference on computer vision, pp 479–486

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grant Nos. 62276265, 61976216, and 61672522.

Author information

Authors and Affiliations

Authors

Contributions

Prof. Shifei Ding helped in supervision. Dr. Qidong Wang helped in conceptualization and methodology. Dr. Lili Guo helped in supervision. Dr. Jian Zhang worked in software and writing. Dr. Ling Ding helped in supervision. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Lili Guo or Ling Ding.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ding, S., Wang, Q., Guo, L. et al. A novel image denoising algorithm combining attention mechanism and residual UNet network. Knowl Inf Syst 66, 581–611 (2024). https://doi.org/10.1007/s10115-023-01965-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-023-01965-9

Keywords

Navigation