Elsevier

Neurocomputing

Volume 492, 1 July 2022, Pages 343-352
Neurocomputing

CT image quality enhancement via a dual-channel neural network with jointing denoising and super-resolution

https://doi.org/10.1016/j.neucom.2022.04.040Get rights and content

Abstract

In recent years, computed tomography (CT) has been widely used in various clinical diagnosis. Given potential health risks bring by the X-ray radiation, the major objective of the current research is to achieve high-quality CT imaging while reducing X-ray radiation. However, most existing studies on low-dose CT image super-resolution reconstruction do not focus on the interaction between the denoising task and the super-resolution task. In this paper, we propose a dual-channel joint learning framework to accurately reconstruct high-resolution CT images from low-resolution CT images. Unlike the previous cascaded models which directly combine the denoising network and the super-resolution network, our method can process the denoising reconstruction and the super-resolution reconstruction in parallel. Additionally, we design a filter gate module that can filter features from the denoising branch and highlight important features which can benefit the super-resolution task. We evaluate the performance of our method in medical image enhancement by testing on the 2016 Low-Dose CT Grand Challenge dataset and the piglet dataset. The experimental results show that the proposed network is superior to other state-of-the-art methods in terms of both peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). We also demonstrate that our method can better remove noise and recover details. Furthermore, the method achieves competitive results not only for super-resolution reconstruction of low-dose CT, but also for super-resolution reconstruction of sparse-view CT.

Introduction

Computed tomography (CT) is a disease detection technique that scans a specific area of the human body by using X-rays, gamma rays, or other types of beams [1]. CT has the advantages of being simple to operate and producing high-resolution images. Therefore, it is widely used in a variety of clinical diagnoses [2]. However, the excessive radiation exposure will bring the potential cancer risk to the patient [3]. To ensure the health of patients, efforts have been made to reduce the radiation dose. Nevertheless, decreasing the radiation dose to obtain low-dose CT (LDCT) images may degrade the quality of images, with noise and more artifacts, which will affect the final diagnosis results [4]. Thus, how to improve the quality of LDCT images is a research priority [5], [6].

Over the past decade, researchers focused on developing new iterative algorithms to enhance the image quality of LDCT [7], [8]. Iterative reconstruction algorithms significantly improve image quality, but they require a high computational cost, which limits their practical applications. Additionally, the methods based on dictionary learning and sparse representation are also widely explored [9], [10]. Considering the enormous potential of neural networks in the field of image processing, researchers began applying deep learning techniques to dual-energy CT imaging [11], [12] and LDCT denoising [6], [13], [5]. For example, a convolutional encoder-decoder with residual learning [5] was used for restoring normal dose CT images. Recently, Yin et al. [14] proposed a domain progressive 3D residual convolution network, which significantly improves the image reconstruction performance by combining the network processing in sinogram domain and image domain. However, current CT reconstruction methods were primarily concerned with denoising, neglecting the resolution constraints of the image.

Single-image SR technology is a classic image enhancement technology, which refers to the process of recovering a high-resolution (HR) image from a low-resolution (LR) image [15]. SR of medical images improves the quality of digital imaging systems, therefore, physicians can make a more precise diagnosis of the disease using the enhanced SR images. Generally, there are three types of SR reconstruction methods [16]: (1) interpolation-based methods [17], [18], (2) model-based reconstruction methods [19], [20], and (3) learning-based methods [21], [22], [23]. Those methods based on interpolation mainly include nearest-neighbor interpolation, bilinear interpolation, and bicubic interpolation [24], which are simple to implement and have been widely used in the SR restoration of LR images. These methods can effectively increase the resolution of the LR image but have poor visual quality. When dealing with real images with complex structures, such as CT images, due to the fact that traditional interpolations do not take structural information into account, they had a limited effect and may even produce artifacts. The model-based methods explicitly model the degradation process of the image and regularize the reconstruction according to the characteristics of the data. In comparison to the interpolation-based reconstruction methods, they can recover more detailed information from LR images. The learning-based methods learn the nonlinear mapping from paired LR and HR images in order to recover missing high-frequency details. Yang et al. [22] suggested a novel approach for single image SR based on sparse representations in terms of the coupled dictionaries that were jointly trained from HR and LR image patch pairs, and achieved encouraging results on the SR reconstruction of the face image. Considering the fact that images frequently contain many repetitive image structures, adaptive regularization and adaptive sparse domain selection were combined by [21] to achieve excellent results in terms of visual quality and PSNR.

With the rapid development of artificial intelligence and the continuous improvement of computer hardware performance in recent years, researchers started to restore images by using deep learning. Dong et al. [25] noticed that traditional SR methods based on the sparse coding can be interpreted as a deep convolutional neural network, and achieved advanced reconstruction results. In addition, many studies have been conducted on the application of deep learning to medical imaging [26], [27], [28]. In [26], a novel convolutional neural network was designed to learn residual-based transformations from LR to HR images. Subsequently, a modified U-Net was used in [27] to learn an end-to-end mapping between LR and HR images. Recently, to accurately improve the quality of CT images, Jiang et al. [28] designed a novel semi-supervised adversarial generative network, and proposed a new loss function to enforce the mappings between the discriminator and generator.

Many researchers worked on the noise removal of LDCT images and SR reconstruction of CT images, but most of these methods only focus on the single task of denoising or SR reconstruction, without considering the relationship between the two tasks. In particular, denoising task is necessary and beneficial for SR task [29]. To reduce CT scan radiation while maintaining the quality of the CT image, the generative adversarial network was used as a building block to establish a nonlinear end-to-end mapping from the noisy LR input to the denoised HR output [30]. Later, Chi et al. [31] first utilized a dense-inception network integrating the dense skip connection and the inception structure to estimate the noise level, followed by a modified residual-dense network to reconstruct HR images. Recently, to improve the quality of LDCT images, Yim et al. [32] linearly combined the denoising autoencoder and the SR convolutional neural network, where two networks were trained separately for the denoising and the SR. However, the methods mentioned above have the following limitations: (1) The cascaded models [32] that denoising the image first and then applying an SR algorithm have a drawback: along with noise, the denoising step always loses some of the high-frequency content of the image [29]. (2) The methods [30], [31] ignore the potential relationship between the denoising and the SR tasks. They use a single branch network to achieve denoising and SR tasks, which limits the quality of image reconstruction [33].

To deal with these drawbacks, we explore how to improve the interaction of denoising and SR tasks on the SR reconstruction of LDCT images. First, without noise removal, super-resolving the noisy input directly will magnify the undesired noise [33]. In order to remove the noise, the simple solution is to perform denoising first, followed by resolution improvement. But in this way, denoising is a pre-processing procedure that frequently results in the loss of high-frequency material, impairing subsequent SR performance [33]. For this reason, we achieve the interaction between the denoising and the SR by processing the image denoising and the SR reconstruction parallelly. In summary, our network consists of two parallel network branches, one for denoising and the other for SR reconstruction. The denoising branch will provide additional details to guide the SR reconstruction of the image. Extensive experiments demonstrate that our network can retain a large amount of detailed information of the image and obtain satisfactory image reconstruction results.

In summary, the main contributions of this article are as follows:

  • (1)

    We propose an Encoder-Decoder network for joint learning of image SR and denoising to achieve SR reconstruction of LDCT images. The suggested model learns the potential connection between the dual tasks through the interaction of the denoising task and the SR task. It provides a novel framework for SR reconstruction of noisy images.

  • (2)

    We design an filter gate module for controlling the interaction of denoising and SR reconstruction features. Rather than directly passing denoised features to the SR task, this module filters features from the denoising branch. It can learn implicitly to suppress irrelevant features from the denoising branch and highlight important features useful for the SR branch.

  • (3)

    Our joint learning network effectively solves the reconstruction of image details on the SR reconstruction of noisy images. Experiments show that our network achieves competitive results not only in LDCT images but also in sparse-view CT images.

The structure of this paper is as follows. We introduce the proposed dual-channel learning framework and the optimization settings of model in Section 2. Section 3 describes the datasets of experiments and training details of the model. The results of qualitative and quantitative comparisons with other methods are also discussed in this section. Finally, the conclusion and future works are presented in Section 4.

Section snippets

Proposed Approach

In this section, we first briefly review the Encoder-Decoder architecture for medical images. Then, we present the dual-channel learning framework and introduce the filter gate module in detail. Finally, the optimization settings will be described briefly.

Experiment and Results

In this section, we describe the datasets used to train and evaluate, as well as the experimental setup which includes hyperparameter selection and data preparation. Additionally, we compare the results of our method with the state-of-the-art methods, and demonstrate the effectiveness of our model.

Conclusion

In this work, we propose a novel dual-channel SR learning framework for the SR reconstruction of LDCT images. In the proposed network, the denoising task and the SR task can be implemented in parallel. The DN branch can remove the artifacts of the images and restore the complex structural features of the images. The introduction of the FG module can highlight the denoising features that have an effect on the SR task and further improve the SR reconstruction performance. Dual-task interactions

CRediT authorship contribution statement

Hongyu Hou: Data curation, Writing - original draft. Qunchao Jin: Visualization, Investigation. Guixu Zhang: Supervision, Validation. Zhi Li: Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This work was supported by the Natural Science Foundation of China (Grant No. 62001167, 61731009, 61961160734), East China Normal University through startup funding.

Hongyu Hou received the B.S. degree in materials science and engineering from Shanghai University of Engineering Science, Shanghai, China, in 2019. He is currently pursuing the M.S. degree in computer science and technology with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His current research interests include CT imaging, image processing, and deep learning.

References (47)

  • A.B. de Gonzalez et al.

    Risk of cancer from diagnostic X-rays: estimates for the UK and 14 other countries

    The lancet

    (2004)
  • T. Lyu et al.

    Estimating dual-energy CT imaging from single-energy CT data with material decomposition convolutional neural network

    Medical image analysis

    (2021)
  • P. Gu et al.

    Low-Dose Computed Tomography Image Super-Resolution Reconstruction via Random Forests

    Sensors

    (2019)
  • R.D. MacDougall et al.

    ”Improving low-dose pediatric abdominal CT by using Convolutional neural networks,” Radiology

    Artificial Intelligence

    (2019)
  • M.K. Kalra et al.

    Strategies for CT radiation dose optimization

    Radiology

    (2004)
  • H. Chen et al.

    Low-dose CT with a residual encoder-decoder convolutional neural network

    IEEE transactions on medical imaging

    (2017)
  • M. Li et al.

    SACNN: self-attention convolutional neural network for low-dose CT denoising with self-supervised perceptual loss network

    IEEE transactions on medical imaging

    (2020)
  • Q. Xu et al.

    Low-dose X-ray CT reconstruction via dictionary learning

    IEEE transactions on medical imaging

    (2012)
  • B. De Man et al.

    Distance-driven projection and backprojection in three dimensions

    Physics in Medicine & Biology

    (2004)
  • X. Li et al.

    Recovering quantitative remote sensing products contaminated by thick clouds and shadows using multitemporal dictionary learning

    IEEE Transactions on Geoscience and Remote Sensing

    (2014)
  • D. Wu et al.

    An effective approach for underwater sonar image denoising based on sparse representation

  • W. Zhao, T. Lv, R. Lee, Y. Chen, and L. Xing, ”Obtaining dual-energy computed tomography (CT) information from a...
  • K.H. Jin et al.

    Deep convolutional neural network for inverse problems in imaging

    IEEE Transactions on Image Processing

    (2017)
  • X. Yin et al.

    Domain progressive 3D residual convolution network to improve low-dose CT imaging

    IEEE transactions on medical imaging

    (2019)
  • W. Yang et al.

    Deep learning for single image super-resolution: A brief review

    IEEE Transactions on Multimedia

    (2019)
  • T. Dai et al.

    Second-order attention network for single image super-resolution

  • H. Hou et al.

    Cubic splines for image interpolation and digital filtering

    IEEE Transactions on acoustics, speech, and signal processing

    (1978)
  • J. Sun et al.

    Image super-resolution using gradient profile prior

  • R. Zhang et al.

    Model-based iterative reconstruction for dual-energy x-ray ct using a joint quadratic likelihood model

    IEEE transactions on medical imaging

    (2013)
  • Z. Yu et al.

    Fast model-based x-ray ct reconstruction using spatially nonhomogeneous icd optimization

    IEEE Transactions on image processing

    (2010)
  • W. Dong et al.

    Image deblurring and super-resolution by adaptive sparse domain selection and adaptive regularization

    IEEE Transactions on image processing

    (2011)
  • J. Yang et al.

    Image super-resolution via sparse representation

    IEEE transactions on image processing

    (2010)
  • C. Jiang et al.

    Super-resolution ct image reconstruction based on dictionary learning and sparse representation

    Scientific reports

    (2018)
  • Cited by (13)

    • CT Image Denoising and Deblurring with Deep Learning: Current Status and Perspectives

      2024, IEEE Transactions on Radiation and Plasma Medical Sciences
    View all citing articles on Scopus

    Hongyu Hou received the B.S. degree in materials science and engineering from Shanghai University of Engineering Science, Shanghai, China, in 2019. He is currently pursuing the M.S. degree in computer science and technology with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His current research interests include CT imaging, image processing, and deep learning.

    Qunchao Jin received the B.S. degree in electrical engineering and automation from Nanjing Tech University, Jiangsu, China, in 2020. He is currently pursuing the M.Eng degree in computer technology with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His current research interests include image processing, and deep learning.

    Guixu Zhang received the Ph.D. degree from the Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou, China, in 1998. He is currently a Professor with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His research interests include hyperspectral remote sensing, image processing, and artificial intelligence.

    Zhi Li received the B.S. degree and the M.S. degree from China University of Petroleum, in 2007 and 2010, respectively. He also received the M.S. degree in applied science from Saint Mary’s University, Canada, in 2012. After being awarded the HK Ph.D. Fellowship, he went to Hong Kong Baptist University, where he received the Ph.D. degree in 2016. Then he worked as a Postdoctoral Researcher at Michigan State University, USA, from 2016 to 2019. He is currently an associate researcher with the Department of Computer Science and Technology, East China Normal University, Shanghai, China.

    View full text