CT image quality enhancement via a dual-channel neural network with jointing denoising and super-resolution

doi:10.1016/j.neucom.2022.04.040

Neurocomputing

Volume 492, 1 July 2022, Pages 343-352

https://doi.org/10.1016/j.neucom.2022.04.040 Get rights and content

Abstract

In recent years, computed tomography (CT) has been widely used in various clinical diagnosis. Given potential health risks bring by the X-ray radiation, the major objective of the current research is to achieve high-quality CT imaging while reducing X-ray radiation. However, most existing studies on low-dose CT image super-resolution reconstruction do not focus on the interaction between the denoising task and the super-resolution task. In this paper, we propose a dual-channel joint learning framework to accurately reconstruct high-resolution CT images from low-resolution CT images. Unlike the previous cascaded models which directly combine the denoising network and the super-resolution network, our method can process the denoising reconstruction and the super-resolution reconstruction in parallel. Additionally, we design a filter gate module that can filter features from the denoising branch and highlight important features which can benefit the super-resolution task. We evaluate the performance of our method in medical image enhancement by testing on the 2016 Low-Dose CT Grand Challenge dataset and the piglet dataset. The experimental results show that the proposed network is superior to other state-of-the-art methods in terms of both peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). We also demonstrate that our method can better remove noise and recover details. Furthermore, the method achieves competitive results not only for super-resolution reconstruction of low-dose CT, but also for super-resolution reconstruction of sparse-view CT.

Introduction

Computed tomography (CT) is a disease detection technique that scans a specific area of the human body by using X-rays, gamma rays, or other types of beams [1]. CT has the advantages of being simple to operate and producing high-resolution images. Therefore, it is widely used in a variety of clinical diagnoses [2]. However, the excessive radiation exposure will bring the potential cancer risk to the patient [3]. To ensure the health of patients, efforts have been made to reduce the radiation dose. Nevertheless, decreasing the radiation dose to obtain low-dose CT (LDCT) images may degrade the quality of images, with noise and more artifacts, which will affect the final diagnosis results [4]. Thus, how to improve the quality of LDCT images is a research priority [5], [6].

Over the past decade, researchers focused on developing new iterative algorithms to enhance the image quality of LDCT [7], [8]. Iterative reconstruction algorithms significantly improve image quality, but they require a high computational cost, which limits their practical applications. Additionally, the methods based on dictionary learning and sparse representation are also widely explored [9], [10]. Considering the enormous potential of neural networks in the field of image processing, researchers began applying deep learning techniques to dual-energy CT imaging [11], [12] and LDCT denoising [6], [13], [5]. For example, a convolutional encoder-decoder with residual learning [5] was used for restoring normal dose CT images. Recently, Yin et al. [14] proposed a domain progressive 3D residual convolution network, which significantly improves the image reconstruction performance by combining the network processing in sinogram domain and image domain. However, current CT reconstruction methods were primarily concerned with denoising, neglecting the resolution constraints of the image.

Single-image SR technology is a classic image enhancement technology, which refers to the process of recovering a high-resolution (HR) image from a low-resolution (LR) image [15]. SR of medical images improves the quality of digital imaging systems, therefore, physicians can make a more precise diagnosis of the disease using the enhanced SR images. Generally, there are three types of SR reconstruction methods [16]: (1) interpolation-based methods [17], [18], (2) model-based reconstruction methods [19], [20], and (3) learning-based methods [21], [22], [23]. Those methods based on interpolation mainly include nearest-neighbor interpolation, bilinear interpolation, and bicubic interpolation [24], which are simple to implement and have been widely used in the SR restoration of LR images. These methods can effectively increase the resolution of the LR image but have poor visual quality. When dealing with real images with complex structures, such as CT images, due to the fact that traditional interpolations do not take structural information into account, they had a limited effect and may even produce artifacts. The model-based methods explicitly model the degradation process of the image and regularize the reconstruction according to the characteristics of the data. In comparison to the interpolation-based reconstruction methods, they can recover more detailed information from LR images. The learning-based methods learn the nonlinear mapping from paired LR and HR images in order to recover missing high-frequency details. Yang et al. [22] suggested a novel approach for single image SR based on sparse representations in terms of the coupled dictionaries that were jointly trained from HR and LR image patch pairs, and achieved encouraging results on the SR reconstruction of the face image. Considering the fact that images frequently contain many repetitive image structures, adaptive regularization and adaptive sparse domain selection were combined by [21] to achieve excellent results in terms of visual quality and PSNR.

With the rapid development of artificial intelligence and the continuous improvement of computer hardware performance in recent years, researchers started to restore images by using deep learning. Dong et al. [25] noticed that traditional SR methods based on the sparse coding can be interpreted as a deep convolutional neural network, and achieved advanced reconstruction results. In addition, many studies have been conducted on the application of deep learning to medical imaging [26], [27], [28]. In [26], a novel convolutional neural network was designed to learn residual-based transformations from LR to HR images. Subsequently, a modified U-Net was used in [27] to learn an end-to-end mapping between LR and HR images. Recently, to accurately improve the quality of CT images, Jiang et al. [28] designed a novel semi-supervised adversarial generative network, and proposed a new loss function to enforce the mappings between the discriminator and generator.

Many researchers worked on the noise removal of LDCT images and SR reconstruction of CT images, but most of these methods only focus on the single task of denoising or SR reconstruction, without considering the relationship between the two tasks. In particular, denoising task is necessary and beneficial for SR task [29]. To reduce CT scan radiation while maintaining the quality of the CT image, the generative adversarial network was used as a building block to establish a nonlinear end-to-end mapping from the noisy LR input to the denoised HR output [30]. Later, Chi et al. [31] first utilized a dense-inception network integrating the dense skip connection and the inception structure to estimate the noise level, followed by a modified residual-dense network to reconstruct HR images. Recently, to improve the quality of LDCT images, Yim et al. [32] linearly combined the denoising autoencoder and the SR convolutional neural network, where two networks were trained separately for the denoising and the SR. However, the methods mentioned above have the following limitations: (1) The cascaded models [32] that denoising the image first and then applying an SR algorithm have a drawback: along with noise, the denoising step always loses some of the high-frequency content of the image [29]. (2) The methods [30], [31] ignore the potential relationship between the denoising and the SR tasks. They use a single branch network to achieve denoising and SR tasks, which limits the quality of image reconstruction [33].

To deal with these drawbacks, we explore how to improve the interaction of denoising and SR tasks on the SR reconstruction of LDCT images. First, without noise removal, super-resolving the noisy input directly will magnify the undesired noise [33]. In order to remove the noise, the simple solution is to perform denoising first, followed by resolution improvement. But in this way, denoising is a pre-processing procedure that frequently results in the loss of high-frequency material, impairing subsequent SR performance [33]. For this reason, we achieve the interaction between the denoising and the SR by processing the image denoising and the SR reconstruction parallelly. In summary, our network consists of two parallel network branches, one for denoising and the other for SR reconstruction. The denoising branch will provide additional details to guide the SR reconstruction of the image. Extensive experiments demonstrate that our network can retain a large amount of detailed information of the image and obtain satisfactory image reconstruction results.

In summary, the main contributions of this article are as follows:

(1)
We propose an Encoder-Decoder network for joint learning of image SR and denoising to achieve SR reconstruction of LDCT images. The suggested model learns the potential connection between the dual tasks through the interaction of the denoising task and the SR task. It provides a novel framework for SR reconstruction of noisy images.
(2)
We design an filter gate module for controlling the interaction of denoising and SR reconstruction features. Rather than directly passing denoised features to the SR task, this module filters features from the denoising branch. It can learn implicitly to suppress irrelevant features from the denoising branch and highlight important features useful for the SR branch.
(3)
Our joint learning network effectively solves the reconstruction of image details on the SR reconstruction of noisy images. Experiments show that our network achieves competitive results not only in LDCT images but also in sparse-view CT images.

The structure of this paper is as follows. We introduce the proposed dual-channel learning framework and the optimization settings of model in Section 2. Section 3 describes the datasets of experiments and training details of the model. The results of qualitative and quantitative comparisons with other methods are also discussed in this section. Finally, the conclusion and future works are presented in Section 4.

Section snippets

Proposed Approach

In this section, we first briefly review the Encoder-Decoder architecture for medical images. Then, we present the dual-channel learning framework and introduce the filter gate module in detail. Finally, the optimization settings will be described briefly.

Experiment and Results

In this section, we describe the datasets used to train and evaluate, as well as the experimental setup which includes hyperparameter selection and data preparation. Additionally, we compare the results of our method with the state-of-the-art methods, and demonstrate the effectiveness of our model.

Conclusion

In this work, we propose a novel dual-channel SR learning framework for the SR reconstruction of LDCT images. In the proposed network, the denoising task and the SR task can be implemented in parallel. The DN branch can remove the artifacts of the images and restore the complex structural features of the images. The introduction of the FG module can highlight the denoising features that have an effect on the SR task and further improve the SR reconstruction performance. Dual-task interactions

CRediT authorship contribution statement

Hongyu Hou: Data curation, Writing - original draft. Qunchao Jin: Visualization, Investigation. Guixu Zhang: Supervision, Validation. Zhi Li: Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This work was supported by the Natural Science Foundation of China (Grant No. 62001167, 61731009, 61961160734), East China Normal University through startup funding.

Hongyu Hou received the B.S. degree in materials science and engineering from Shanghai University of Engineering Science, Shanghai, China, in 2019. He is currently pursuing the M.S. degree in computer science and technology with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His current research interests include CT imaging, image processing, and deep learning.

References (47)

A.B. de Gonzalez et al.
Risk of cancer from diagnostic X-rays: estimates for the UK and 14 other countries
The lancet
(2004)
T. Lyu et al.
Estimating dual-energy CT imaging from single-energy CT data with material decomposition convolutional neural network
Medical image analysis
(2021)
P. Gu et al.
Low-Dose Computed Tomography Image Super-Resolution Reconstruction via Random Forests
Sensors
(2019)
R.D. MacDougall et al.
”Improving low-dose pediatric abdominal CT by using Convolutional neural networks,” Radiology
Artificial Intelligence
(2019)
M.K. Kalra et al.
Strategies for CT radiation dose optimization
Radiology
(2004)
H. Chen et al.
Low-dose CT with a residual encoder-decoder convolutional neural network
IEEE transactions on medical imaging
(2017)
M. Li et al.
SACNN: self-attention convolutional neural network for low-dose CT denoising with self-supervised perceptual loss network
IEEE transactions on medical imaging
(2020)
Q. Xu et al.
Low-dose X-ray CT reconstruction via dictionary learning
IEEE transactions on medical imaging
(2012)
B. De Man et al.
Distance-driven projection and backprojection in three dimensions
Physics in Medicine & Biology
(2004)
X. Li et al.
Recovering quantitative remote sensing products contaminated by thick clouds and shadows using multitemporal dictionary learning
IEEE Transactions on Geoscience and Remote Sensing
(2014)

D. Wu et al.

An effective approach for underwater sonar image denoising based on sparse representation

W. Zhao, T. Lv, R. Lee, Y. Chen, and L. Xing, ”Obtaining dual-energy computed tomography (CT) information from a...

K.H. Jin et al.

Deep convolutional neural network for inverse problems in imaging

IEEE Transactions on Image Processing

(2017)

X. Yin et al.

Domain progressive 3D residual convolution network to improve low-dose CT imaging

IEEE transactions on medical imaging

(2019)

W. Yang et al.

Deep learning for single image super-resolution: A brief review

IEEE Transactions on Multimedia

(2019)

T. Dai et al.

Second-order attention network for single image super-resolution

H. Hou et al.

Cubic splines for image interpolation and digital filtering

IEEE Transactions on acoustics, speech, and signal processing

(1978)

J. Sun et al.

Image super-resolution using gradient profile prior

R. Zhang et al.

Model-based iterative reconstruction for dual-energy x-ray ct using a joint quadratic likelihood model

IEEE transactions on medical imaging

(2013)

Z. Yu et al.

Fast model-based x-ray ct reconstruction using spatially nonhomogeneous icd optimization

IEEE Transactions on image processing

(2010)

W. Dong et al.

Image deblurring and super-resolution by adaptive sparse domain selection and adaptive regularization

IEEE Transactions on image processing

(2011)

J. Yang et al.

Image super-resolution via sparse representation

IEEE transactions on image processing

(2010)

C. Jiang et al.

Super-resolution ct image reconstruction based on dictionary learning and sparse representation

Scientific reports

(2018)

Cited by (13)

Noise aware content-noise complementary GAN with local and global discrimination for low-dose CT denoising
2024, Neurocomputing
In response to rising concerns over radiation exposure in computed tomography (CT) imaging, effective denoising methods for low-dose CT (LDCT) images are crucial. In recent years, the use of deep learning techniques especially generative adversarial networks (GANs) significantly enhanced the efficiency of LDCT denoising methods, surpassing traditional methods. However, GAN-based denoising methods often face challenges in preserving structural consistency and fine details. This study introduces a novel GAN framework with three accretions to enhance the effectiveness of LDCT denoising. Firstly, our generator is designed to leverage a complementary learning scheme between image noise and image content via two distinct paths. One path focuses on exploring the anatomical information of the image, while the second path is dedicated to learning the noise pattern. This complementary learning scheme provides stable noise cancellation while preserving the maximum structural information of the image. Subsequently, we propose a novel noise-conscious mean absolute error loss to address the challenge posed by the non-stationary characteristic of CT noise. In contrast to conventional MAE loss, this loss attentively prioritizes the different parts of the image based on the local distribution of noise in that region. We also incorporate a gradient-domain loss into the loss function, which inspires the generator to preserve precise image details through explicit guidance. Finally, this study adopted a U-Net-based design for the discriminator to better regularize the model by discriminating between the clean image and the denoised image at both global and local levels. The merit of this discriminator is that it can better adapt to the non-stationary environment of GAN training and guide the generator to produce denoised images that are locally and globally consistent. Our thorough experiments using abdominal CT and lung CT datasets demonstrate the superior performance of our approach compared to state-of-the-art methods.
Soil CT image quality enhancement via an improved super-resolution reconstruction method based on GAN
2023, Computers and Electronics in Agriculture
Computed tomography (CT) is an effective instrument to characterize the internal structure of soil. However, the resolution of soil CT images is often limited by the physical properties of the scanner and the imaging protocol, which can lead to difficulties in accurately characterizing fine-scale features of soil. At present, the super-resolution reconstruction results of CT images often have the problems of blurred feature boundaries and low image quality, causing inaccuracies in the analysis of soil structure. Therefore, this study developed an improved super-resolution reconstruction method based on generative adversarial networks (SRLGAN) to accurately reconstruct high-resolution CT images from low-resolution CT images and assist in digital soil descriptions. SRLGAN utilized a lightweight CNN super-resolution reconstruction module as the generator model, which can improve the reconstruction image quality while reducing the number of trainable parameters and inference time. Meanwhile, a closed-loop loss function was employed to reduce input image information loss and improve method generalization. Compared with traditional reconstruction methods and deep learning methods, the proposed SRLGAN method showed superior performance with a higher peak signal-to-noise ratio (PSNR) of 45.869 dB and a structural similarity index (SSIM) of 0.992. Particularly, the PSNR was 9.95% and 4.49% higher than the best-performing traditional method (Bicubic) and deep learning method (SRGAN), respectively. Furthermore, compared to the SRGAN, trainable parameters and inference time of the SRLGAN have decreased by 77.84% and 57.86%, respectively, which indicates the highest degree of lightweighting. This study demonstrated that SRLGAN can generate high-quality, high-resolution soil CT images that can be used for subsequent segmentation and 3D structural analysis, offering an intelligent approach to understanding the internal soil structure.
Sonar image garbage detection via global despeckling and dynamic attention graph optimization
2023, Neurocomputing
Sonar is widely used in marine water cleaning tasks, so sonar images have become an effective tool for garbage detection and underwater scene analysis. However, it is an extremely difficult task to achieve fully supervised denoising and garbage detection for sonar images. This is because sonar images are weakly annotated samples that are susceptible to noise interference and have no reference clean image. To this end, we propose a sonar image garbage instance segmentation model via global despeckling and dynamic attention graph optimization(GD-DAGO). Specifically, a self-supervised blind spot network denoising structure is presented in this paper. The proposed denoising model overcomes the defects of information loss in the traditional blind spot network structure and performs global awareness speckle suppression for the noise characteristics of sonar images themselves. In addition, a novel dynamic attention structure is employed to improve the target region estimation in the instance segmentation module and does not require supervision beyond image-level category labeling. Finally, in order to enhance the cooperative ability between the two tasks, we adopt a local perceptual loss strategy based on mask proposals guided by the downstream task, so that the whole model takes more into account the characteristics of sonar images and better serves the sonar garbage detection task. Experimental results on ARACATI 2017 and marine-debris-fls-datasets (MDFD) show that the proposed algorithm achieves a performance gain of 0.4218 and 4.2% in terms of denoising effect (ENL) and detection accuracy ( ${AP}_{25}$ ), respectively, compared with suboptimal algorithms.
Computed tomography simulation projection acquisition method of artistic relics based on voxel model
2024, Multimedia Tools and Applications
CT Image Denoising and Deblurring with Deep Learning: Current Status and Perspectives
2024, IEEE Transactions on Radiation and Plasma Medical Sciences
Super-Resolution Reconstruction of CT Images Based on Generative Adversarial Networks
2024, Lecture Notes in Electrical Engineering

View all citing articles on Scopus

Qunchao Jin received the B.S. degree in electrical engineering and automation from Nanjing Tech University, Jiangsu, China, in 2020. He is currently pursuing the M.Eng degree in computer technology with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His current research interests include image processing, and deep learning.

Guixu Zhang received the Ph.D. degree from the Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou, China, in 1998. He is currently a Professor with the Department of Computer Science and Technology, East China Normal University, Shanghai, China. His research interests include hyperspectral remote sensing, image processing, and artificial intelligence.

Zhi Li received the B.S. degree and the M.S. degree from China University of Petroleum, in 2007 and 2010, respectively. He also received the M.S. degree in applied science from Saint Mary’s University, Canada, in 2012. After being awarded the HK Ph.D. Fellowship, he went to Hong Kong Baptist University, where he received the Ph.D. degree in 2016. Then he worked as a Postdoctoral Researcher at Michigan State University, USA, from 2016 to 2019. He is currently an associate researcher with the Department of Computer Science and Technology, East China Normal University, Shanghai, China.

View full text

CT image quality enhancement via a dual-channel neural network with jointing denoising and super-resolution

Abstract

Introduction

Section snippets

Proposed Approach

Experiment and Results

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgment

The lancet

Medical image analysis

Low-Dose Computed Tomography Image Super-Resolution Reconstruction via Random Forests

Sensors

”Improving low-dose pediatric abdominal CT by using Convolutional neural networks,” Radiology

Artificial Intelligence

Strategies for CT radiation dose optimization

Radiology

Low-dose CT with a residual encoder-decoder convolutional neural network

IEEE transactions on medical imaging

SACNN: self-attention convolutional neural network for low-dose CT denoising with self-supervised perceptual loss network

IEEE transactions on medical imaging

Low-dose X-ray CT reconstruction via dictionary learning

IEEE transactions on medical imaging

Distance-driven projection and backprojection in three dimensions

Physics in Medicine & Biology

Recovering quantitative remote sensing products contaminated by thick clouds and shadows using multitemporal dictionary learning

IEEE Transactions on Geoscience and Remote Sensing

An effective approach for underwater sonar image denoising based on sparse representation

Deep convolutional neural network for inverse problems in imaging

IEEE Transactions on Image Processing

Domain progressive 3D residual convolution network to improve low-dose CT imaging

IEEE transactions on medical imaging

Deep learning for single image super-resolution: A brief review

IEEE Transactions on Multimedia

Second-order attention network for single image super-resolution

Cubic splines for image interpolation and digital filtering

IEEE Transactions on acoustics, speech, and signal processing

Image super-resolution using gradient profile prior

Model-based iterative reconstruction for dual-energy x-ray ct using a joint quadratic likelihood model

IEEE transactions on medical imaging

Fast model-based x-ray ct reconstruction using spatially nonhomogeneous icd optimization

IEEE Transactions on image processing

Image deblurring and super-resolution by adaptive sparse domain selection and adaptive regularization

IEEE Transactions on image processing

Image super-resolution via sparse representation

IEEE transactions on image processing

Super-resolution ct image reconstruction based on dictionary learning and sparse representation

Scientific reports