Communication

Restoration of Atmospheric Turbulence-Degraded Short-Exposure Image Based on Convolution Neural Network

Jiuming Cheng, Wenyue Zhu, Jianyu Li, Gang Xu, Xiaowei Chen and Cao Yao

1 Key Laboratory of Atmospheric Optics, Anhui Institute of Optics and Fine Mechanics, HFIPS, Chinese Academy of Sciences, Hefei 230031, China
2 Science Island Branch of Graduate School, University of Science and Technology of China, Hefei 230026, China
3 Advanced Laser Technology Laboratory of Anhui Province, Hefei 230037, China
* Author to whom correspondence should be addressed.
Photonics 2023, 10(6), 666; https://doi.org/10.3390/photonics10060666
Submission received: 24 March 2023 / Revised: 3 June 2023 / Accepted: 6 June 2023 / Published: 8 June 2023
(This article belongs to the Special Issue Computational Optical Imaging and Its Applications)

Abstract

Ground-based remote observation systems are vulnerable to atmospheric turbulence, which can lead to image degradation. While some methods can mitigate this turbulence distortion, many have issues such as long processing times and unstable restoration effects. Furthermore, the physics of turbulence is often not fully integrated into the image reconstruction algorithms, making their theoretical foundations weak. In this paper, we propose a method for atmospheric turbulence mitigation using optical flow and convolutional neural networks (CNN). We first employ robust principal component analysis (RPCA) to extract a reference frame from the images. With the help of optical flow and the reference frame, the tilt can be effectively corrected. After correcting the tilt, the turbulence mitigation problem can be simplified as a deblurring problem. Then, we use a trained CNN to remove blur. By utilizing (i) a dataset that conforms to the turbulence physical model to ensure the restoration effect of the CNN and (ii) the efficient parallel computing of the CNN to reduce computation time, we can achieve better results compared to existing methods. Experimental results based on actual observed turbulence images demonstrate the effectiveness of our method. In the future, with further improvements to the algorithm and updates to GPU technology, we expect even better performance.

1. Introduction

The imaging performance of a ground-based remote observation system is degraded by atmospheric turbulence. The causes of turbulence are complex and include atmospheric temperature, pressure, humidity, wind speed and other factors [1]. When light waves pass through turbulence, wavefront phase distortions accumulate along the propagation path. This random phase distortion translates into a corresponding point spread function (PSF). Two driving factors shape the turbulent PSF: the random tilt, which causes image pixel offsets, and the higher-order distortion, which leads to image blurring [2,3].
Traditional turbulence image restoration methods generally consist of three steps. The first step removes the pixel offset caused by tilt, most commonly with the optical flow method, as in Xie et al. [4], Hardie et al. [5] and Gilles et al. [6]. In addition, because the random phase distortion has zero mean, clear regions occasionally appear in the image [7]. The next step therefore compares the sharpness of pixel blocks within a certain shooting interval and selects the clearest blocks for fusion. This approach, known as lucky image fusion, is used by Anantrasirichai et al. [8], Aubailly et al. [9] and Zhu et al. [10]. However, it requires a large amount of measured data to be filtered and fused, and the fusion quality depends strongly on the quality of those data. Finally, most studies use a blind deconvolution algorithm [8,9,10,11,12,13,14,15] to remove the residual blur after lucky image fusion. However, the PSF priors defined in these blind deconvolution schemes are too generic and are not tailored to turbulence characteristics. These priors have little connection with the statistical behavior of turbulence, especially the 5/3-power kernel proved by Fried [16].
Recently, deep learning methods have been gradually applied to turbulence mitigation. Chen et al. [17] proposed a U-Net-like deep stacked autoencoder neural network model. Fazlali et al. [18] proposed an end-to-end convolutional autoencoder that mixes several registered blurry frames to generate a high-quality output image of the scene. However, these methods usually rely on a simplified assumption that the turbulence blur is spatially invariant, which does not extend to general scene reconstruction. Hoffmire et al. [19] proposed the block matching and CNN (BM-CNN) method. Zhiyuan et al. [20] introduced a physics-inspired turbulence restoration model (TurbNet). Although these methods performed well on simulated datasets, they were not tested or evaluated on measured data.
To tackle the challenges, this paper makes two contributions:
  • We tune a turbulence simulator based on a physical model to generate a large-scale dataset. The highly realistic and diverse dataset provides strong support for the training of our neural network.
  • Recognizing the limitations of lucky image fusion and blind deconvolution algorithms, we introduce a convolutional neural network to replace these two steps. Our algorithm reduces the computation time while preserving the effectiveness of image restoration.

2. Method

2.1. Problem Setting and Motivation

The degradation process of atmospheric turbulence on images can be approximated by the following equation:
$$\tilde{I} = T \circ B(I) + n \tag{1}$$
where $\tilde{I}$ is an image degraded by turbulence and $I$ is the corresponding clear image. The operator $T$ is a mapping representing the geometric pixel displacement (known as the tilt), and $B$ is a convolution matrix representing the spatially varying blur. The operator $\circ$ denotes functional composition, and the variable $n$ denotes additive noise.
Because the operators $T$ and $B$ both appear in Equation (1), the problem is difficult. If only $B$ were present, the problem would reduce to deblurring. A common approach is to remove the tilt using a reference frame and the optical flow method, so that Equation (1) simplifies to
$$\tilde{I} = B(I) + n \tag{2}$$
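For concreteness, the following sketch simulates the degradation model of Equations (1) and (2) on a single frame: a smooth random displacement field stands in for the tilt $T$, a single Gaussian kernel stands in for the spatially varying blur $B$, and Gaussian noise stands in for $n$. The field strength, kernel size, and noise level are illustrative assumptions, not the parameters used later in this paper.

```python
# Minimal sketch of the degradation model in Equations (1)-(2), using NumPy/SciPy.
# Tilt field, blur kernel, and noise level are illustrative assumptions.
import numpy as np
from scipy.ndimage import map_coordinates, convolve

def degrade(clean, tilt_std=1.0, blur_size=7, noise_std=0.01, rng=None):
    """Apply tilt T (random pixel displacement), blur B, and additive noise n."""
    rng = np.random.default_rng() if rng is None else rng
    h, w = clean.shape

    # T: a smooth random displacement field stands in for the geometric tilt
    dy = convolve(rng.normal(0, tilt_std, (h, w)), np.ones((9, 9)) / 81.0)
    dx = convolve(rng.normal(0, tilt_std, (h, w)), np.ones((9, 9)) / 81.0)
    yy, xx = np.mgrid[0:h, 0:w].astype(np.float64)
    tilted = map_coordinates(clean, [yy + dy, xx + dx], order=1, mode="reflect")

    # B: a single Gaussian kernel stands in for the spatially varying blur
    ax = np.arange(blur_size) - blur_size // 2
    g = np.exp(-(ax ** 2) / (2 * (blur_size / 4.0) ** 2))
    kernel = np.outer(g, g)
    kernel /= kernel.sum()
    blurred = convolve(tilted, kernel, mode="reflect")

    # n: additive Gaussian noise
    return blurred + rng.normal(0, noise_std, (h, w))
```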
Removing the blur caused by turbulence is an ill-posed inverse problem. From a Bayesian perspective, the solution $I_{MAP}$ can be obtained by solving the Maximum a Posteriori (MAP) estimation problem
$$I_{MAP} = \arg\min_{I} \; -\log p(\tilde{I}\,|\,I) - \log p(I) \tag{3}$$
where $\log p(\tilde{I}\,|\,I)$ represents the log-likelihood of the observation $\tilde{I}$, and $\log p(I)$ is the prior of the clean image $I$, which is independent of the degraded image $\tilde{I}$. More formally, Equation (3) can be reformulated as
$$I_{MAP} = \arg\min_{I} \; \frac{1}{2\sigma^{2}}\big\|\tilde{I} - B(I)\big\|_{2}^{2} + \lambda R(I) \tag{4}$$
where the solution minimizes an energy function composed of a data term $\frac{1}{2\sigma^{2}}\|\tilde{I} - B(I)\|_{2}^{2}$ and a regularization (prior) term $\lambda R(I)$ with regularization parameter $\lambda$.
Generally, the methods for solving Equation (4) fall into two main categories: model-based and learning-based. Model-based methods often require multiple iterations, resulting in long computation times; we therefore choose the learning-based approach. Learning-based methods typically train a truncated unfolding inference by optimizing a loss function over a training set of $N$ degraded-clean image pairs $\{(\tilde{I}_i, I_i)\}_{i=1}^{N}$. In particular, they are usually modeled as the following bi-level optimization problem:
$$\min_{\Theta} \sum_{i=1}^{N} \mathcal{L}\big(I_{MAP}^{i}, I_i\big) \quad \text{s.t.} \quad I_{MAP}^{i} = \arg\min_{I} \; \frac{1}{2\sigma^{2}}\big\|\tilde{I}_i - B(I)\big\|_{2}^{2} + \lambda R(I) \tag{5}$$
where $\Theta$ denotes the trainable parameters and $\mathcal{L}(I_{MAP}^{i}, I_i)$ measures the loss of the estimated clean image $I_{MAP}^{i}$ with respect to the ground-truth image $I_i$. Learning-based methods define $I_{MAP}^{i} = f(\tilde{I}_i, \Theta)$ and iteratively optimize the parameters $\Theta$ through the upper problem of Equation (5), such that the function $f$ progressively approaches the solution of the lower problem of Equation (5).
According to the above description, we use the optical flow method to remove the pixel offset caused by tilt. Afterwards, we remove the blurring effect by convolutional neural networks (CNN). Our pipeline diagram is shown in Figure 1. The detailed algorithm will be presented in the following subsection.

2.2. Construction of Reference Frame and Optical Flow Registration

Using the optical flow method to register images requires the reference frame. Referring to the method of sparse decomposition for background modeling [21,22], we use robust principal component analysis (RPCA) to perform matrix decomposition to obtain the low-rank component and construct a reference frame for registration. The RPCA low-rank decomposition can be defined as follows:
$$\underset{L,\,S}{\operatorname{minimize}} \; \|L\|_{*} + \lambda\|S\|_{1} \quad \text{s.t.} \quad L + S = G \tag{6}$$
where $L$ is the low-rank component, $S$ is the sparse component, $G$ is the observed image, and $\lambda$ is a constant providing a trade-off between the sparse and low-rank components. A decomposed result is illustrated in Figure 2. We average the low-rank components of multiple images to obtain the final reference frame. After obtaining the reference frame through RPCA, we apply the coarse-to-fine optical flow method [23] for registration, as shown in Figure 3.
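As an illustration of Equation (6), the following is a minimal sketch of RPCA by principal component pursuit solved with an inexact augmented Lagrangian scheme. The choice of solver, the penalty initialization, the stopping tolerance, and the column-stacked frame layout used to build the reference frame are implementation assumptions, not a description of the exact routine used in this work.

```python
# Compact sketch of RPCA by principal component pursuit (inexact ALM) for
# Equation (6). Solver details and stopping criteria are assumptions.
import numpy as np

def shrink(X, tau):
    """Soft-thresholding (proximal operator of the l1 norm)."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)

def rpca(G, max_iter=500, tol=1e-7):
    """Decompose G into low-rank L and sparse S: min ||L||_* + lam*||S||_1 s.t. L + S = G."""
    m, n = G.shape
    lam = 1.0 / np.sqrt(max(m, n))
    mu = 0.25 * m * n / (np.abs(G).sum() + 1e-12)
    L = np.zeros_like(G); S = np.zeros_like(G); Y = np.zeros_like(G)
    for _ in range(max_iter):
        # L-step: singular value thresholding of (G - S + Y/mu)
        U, sig, Vt = np.linalg.svd(G - S + Y / mu, full_matrices=False)
        L = (U * shrink(sig, 1.0 / mu)) @ Vt
        # S-step: elementwise soft-thresholding
        S = shrink(G - L + Y / mu, lam / mu)
        # Dual update and convergence check on the constraint L + S = G
        R = G - L - S
        Y += mu * R
        if np.linalg.norm(R) / (np.linalg.norm(G) + 1e-12) < tol:
            break
    return L, S

# To build the reference frame, each column of G can hold one vectorized frame
# (an assumed data layout); the low-rank columns are then averaged.
```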

2.3. Data Sets Generation

After obtaining tilt-free images, we use a CNN to remove the blur. A CNN is a neural network specialized for processing data with a grid-like structure. Whereas a blind deconvolution algorithm relies on hand-specified prior knowledge, the prior knowledge of a CNN is provided by its training set. Therefore, the training data must conform to the physical rules of atmospheric turbulence. However, in practice it is challenging to collect images that are affected and unaffected by the atmosphere at the same time. A better solution is numerical simulation.
This paper uses the Zernike polynomial method [24,25] to simulate turbulence-affected images. For an observation system with aperture $D$, the atmospheric phase disturbance $\varphi$ can be represented by an expansion over Zernike polynomials:
$$\varphi\!\left(\tfrac{D\mathbf{r}}{2}\right) = \sum_{i=1}^{N} a_i Z_i(\mathbf{r}) \tag{7}$$
where $\mathbf{r}$ is the polar coordinate vector of the unit circle, $a_i$ is the coefficient of the Zernike polynomial, and $Z_i$ is the Zernike polynomial. The Zernike polynomials defined within the unit circle are as follows:
$$Z_{\mathrm{even}\,i} = \sqrt{2(n+1)}\, R_n^m(r)\cos(m\theta), \quad m \neq 0$$
$$Z_{\mathrm{odd}\,i} = \sqrt{2(n+1)}\, R_n^m(r)\sin(m\theta), \quad m \neq 0 \tag{8}$$
$$Z_i = \sqrt{n+1}\, R_n^0(r), \quad m = 0$$
$$R_n^m(r) = \sum_{s=0}^{(n-m)/2} \frac{(-1)^s (n-s)!}{s!\,\left[(n+m)/2 - s\right]!\,\left[(n-m)/2 - s\right]!}\; r^{n-2s} \tag{9}$$
where $r$ is the polar radius, $\theta$ is the polar angle, $i$ is the sequence number of the Zernike polynomial, $n$ is the radial frequency coefficient, and $m$ is the angular frequency coefficient of the Zernike polynomial. In addition, $n$ and $m$ satisfy $n \geq m \geq 0$ with $n - m$ even.
Each term $Z_i$ of the Zernike polynomial is fixed, but the coefficient $a_i$ is variable. We need to determine the relationships among the coefficients that conform to the statistics of atmospheric turbulence. From the energy perspective, Noll [24] gives the covariance between any two Zernike polynomial coefficients $a_i(n_i, m_i)$ and $a_j(n_j, m_j)$:
$$\langle a_i, a_j \rangle = 2.2802 \left(\frac{D}{r_0}\right)^{5/3} \sqrt{(n_i+1)(n_j+1)}\; \delta_{m_i m_j}\, (-1)^{(n_i+n_j-m_i-m_j)/2}\; \frac{\Gamma\!\left[(n_i+n_j-5/3)/2\right]}{\Gamma\!\left[(n_i-n_j+17/3)/2\right]\,\Gamma\!\left[(n_j-n_i+17/3)/2\right]\,\Gamma\!\left[(n_i+n_j+23/3)/2\right]} \tag{10}$$
$$\delta_{m_i m_j} = \begin{cases} 1, & m_i = m_j \\ 0, & m_i \neq m_j \end{cases} \tag{11}$$
where $\Gamma(\cdot)$ is the gamma function, $D$ is the aperture of the observation system, $r_0$ is the atmospheric coherence length, $a_i$ and $a_j$ are the coefficients of the $i$-th and $j$-th Zernike polynomials, $m_i$ and $m_j$ are the angular frequency coefficients, $n_i$ and $n_j$ are the radial frequency coefficients, and $\delta_{m_i m_j}$ is the Kronecker delta. Using Equation (10), we calculate the covariance matrix of the first 36 Zernike polynomial coefficients, as shown in Figure 4a. The procedure for generating point spread functions (PSFs) that conform to atmospheric turbulence characteristics is summarized in Algorithm 1, and the generation process is illustrated in Figure 4b.
Algorithm 1. Using Zernike polynomials to generate a PSF conforming to turbulence characteristics
Input: None
Output: PSF matrix
STEP 1: Select the first 36 Zernike polynomials and use Equation (10) to calculate the covariance matrix $C$ of their coefficients.
STEP 2: By singular value decomposition, obtain $C = V S V^{T}$, where $S$ is the eigenvalue matrix and $V$ is the unitary matrix.
STEP 3: Generate a random vector $\beta$ following a normal distribution, and calculate $\alpha' = V\beta$.
STEP 4: Let $\alpha = (a_1, a_2, \ldots, a_{36}) = \alpha' (D/r_0)^{5/6}$.
STEP 5: Since only the blurring caused by turbulence is simulated, select the Zernike polynomial coefficients $a_4, a_5, \ldots, a_{36}$ corresponding to higher-order aberrations, and calculate $\varphi$ using Equation (7).
STEP 6: $\mathrm{PSF} = \left|\mathcal{F}\!\left\{e^{2\pi i\,\varphi(\mathbf{r}/2)}\right\}\right|^{2}$
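A sketch of Algorithm 1 in NumPy/SciPy is given below. The grid size matches the 64 × 64 Zernike phase screens listed in Table 1, while the Noll-index conversion, the $\sqrt{S}$ scaling applied when sampling (so that the coefficients actually follow the covariance $C$, with the $D/r_0$ dependence folded into $C$ rather than applied as a separate $(D/r_0)^{5/6}$ rescaling in STEP 4), and the zero-padding factor of the Fourier transform are implementation assumptions of this sketch.

```python
# Sketch of Algorithm 1: sample correlated Zernike coefficients and form a PSF.
# Noll-index conversion, sqrt(S) sampling, and FFT padding are assumptions.
import numpy as np
from math import factorial
from scipy.special import gamma

def noll_to_nm(j):
    """Convert Noll index j (1-based) to radial order n and signed azimuthal order m."""
    n = int(np.sqrt(2 * j - 1) + 0.5) - 1
    if n % 2:
        m = 2 * ((2 * (j + 1) - n * (n + 1)) // 4) - 1
    else:
        m = 2 * ((2 * j + 1 - n * (n + 1)) // 4)
    return n, m * (-1) ** (j % 2)

def zernike(j, rho, theta):
    """Zernike polynomial Z_j on the unit disk, Noll normalization (Equations (8)-(9))."""
    n, m = noll_to_nm(j)
    am = abs(m)
    R = np.zeros_like(rho)
    for s in range((n - am) // 2 + 1):
        c = (-1) ** s * factorial(n - s) / (
            factorial(s) * factorial((n + am) // 2 - s) * factorial((n - am) // 2 - s))
        R += c * rho ** (n - 2 * s)
    if m == 0:
        return np.sqrt(n + 1) * R
    ang = np.cos(am * theta) if m > 0 else np.sin(am * theta)
    return np.sqrt(2 * (n + 1)) * R * ang

def noll_covariance(i, j, D_over_r0):
    """Covariance of coefficients a_i and a_j from Equation (10)."""
    ni, mi = noll_to_nm(i)
    nj, mj = noll_to_nm(j)
    if mi != mj:                       # Kronecker delta of Equation (11)
        return 0.0
    num = (2.2802 * D_over_r0 ** (5 / 3) * np.sqrt((ni + 1) * (nj + 1))
           * (-1) ** ((ni + nj - abs(mi) - abs(mj)) // 2)
           * gamma((ni + nj - 5 / 3) / 2))
    den = (gamma((ni - nj + 17 / 3) / 2) * gamma((nj - ni + 17 / 3) / 2)
           * gamma((ni + nj + 23 / 3) / 2))
    return num / den

def sample_psf(D_over_r0, n_modes=36, grid=64, pad=4, rng=None):
    """STEPs 1-6: sample correlated coefficients and form one random PSF."""
    rng = np.random.default_rng() if rng is None else rng
    C = np.array([[noll_covariance(i, j, D_over_r0)                  # STEP 1
                   for j in range(1, n_modes + 1)] for i in range(1, n_modes + 1)])
    V, S, _ = np.linalg.svd(C)                                       # STEP 2: C = V S V^T
    a = V @ (np.sqrt(S) * rng.standard_normal(n_modes))              # STEPs 3-4 (sqrt(S): assumption)
    x = np.linspace(-1, 1, grid)
    xx, yy = np.meshgrid(x, x)
    rho, theta = np.hypot(xx, yy), np.arctan2(yy, xx)
    pupil = (rho <= 1).astype(float)
    phi = sum(a[j - 1] * zernike(j, rho, theta)                      # STEP 5: higher-order modes only
              for j in range(4, n_modes + 1))
    field = pupil * np.exp(2j * np.pi * phi)
    psf = np.abs(np.fft.fftshift(np.fft.fft2(field, s=(pad * grid, pad * grid)))) ** 2  # STEP 6
    return psf / psf.sum()
```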
We note that within the isoplanatic angle [26], the wavefront distortion caused by turbulence along the atmospheric path is essentially consistent. In other words, image blocks within the isoplanatic angle can be convolved with the same PSF. However, the field of view of an actual observation system is much larger than the isoplanatic angle. Therefore, different blocks of an image correspond to different PSFs, as shown in Figure 5.
Through the above methods, we can simulate and generate a large-scale dataset. The specific parameters of the simulation are given in Table 1. In this paper, the ratio D/ r 0 of observation aperture D to atmospheric coherence length r 0 [27] is used to quantify the effect of turbulence on imaging. Figure 6 shows the simulation results under different conditions.
This paper uses the ImageNet2012 data set to generate 1,491,648 blurred images under D / r 0 = 2.4312 , D / r 0 = 4.2130 , and D / r 0 = 16.7721 turbulence intensities by the above method. These images are divided into training sets, validation sets, and test sets. The training set contains 1,044,096 images, the validation set contains 149,184 images, and the test set contains 298,368 images. The training sets are used for the training of deep learning network parameters. The validation sets are used for the preliminary evaluation of the model performance during the network training process. The final ability of the network model after training is evaluated by the test sets.

2.4. Feature Extraction Technology

For a CNN, the foundation is convolutional feature extraction, implemented by convolutional (Conv) layers, as shown in Figure 7a. Let $X$ be the input feature map and $K$ the convolution kernel. The output feature map $Y$ of a convolutional layer is given by
$$Y_{i,j} = \sum_{m}\sum_{n} X_{i-m,\,j-n}\, K_{m,n} \tag{12}$$
After the Conv layer, the feature map usually passes through an activation function, either the Leaky Rectified Linear Unit (LeakyReLU) or the Rectified Linear Unit (ReLU). A CNN needs an activation function to introduce nonlinearity so that it can approximate arbitrary nonlinear functions. The ReLU and LeakyReLU functions are
$$Y_{i,j} = \mathrm{ReLU}(X_{i,j}) = \begin{cases} X_{i,j}, & X_{i,j} > 0 \\ 0, & X_{i,j} \leq 0 \end{cases} \tag{13}$$
$$Y_{i,j} = \mathrm{LeakyReLU}(X_{i,j}) = \begin{cases} X_{i,j}, & X_{i,j} > 0 \\ k \cdot X_{i,j}, & X_{i,j} \leq 0 \end{cases} \tag{14}$$
where $k$ is a constant real number.
The Batch Normalization (BN) layer is also an important layer. Its function is to map the feature map to a distribution with a mean of 0 and a variance of 1. The mathematical expression is as follows:
$$Y_{i,j} = \frac{X_{i,j} - \mathrm{E}[X]}{\sqrt{\mathrm{Var}[X] + \epsilon}} \times \gamma + \beta \tag{15}$$
where $\mathrm{E}[\cdot]$ and $\mathrm{Var}[\cdot]$ are the mean and variance functions, $\epsilon$ is a small constant added to prevent the denominator from becoming zero, and $\gamma$ and $\beta$ are the parameters of the affine transform applied to the normalized value.
After the above operations, the feature maps need to be upsampled, usually with transposed convolutional (ConvTrans) layers. Transposed convolution dilates the feature map before convolving it, as shown in Figure 7b. It can be expressed as
$$Y_{i,j} = \sum_{m}\sum_{n} D(X)_{m,n}\, K_{i-m,\,j-n} \tag{16}$$
where $D$ is the dilation (expansion) operation.
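As a small illustration of the four operations above, the PyTorch snippet below applies a Conv layer (Equation (12)), a LeakyReLU activation (Equation (14)), batch normalization (Equation (15)), and a transposed convolution (Equation (16)) to a dummy grayscale image; the channel counts here are arbitrary examples rather than the model of Section 2.5.

```python
# Illustrative PyTorch usage of the four operations; channel counts are examples only.
import torch
import torch.nn as nn

x = torch.randn(1, 1, 512, 512)                                      # one grayscale image

conv = nn.Conv2d(1, 64, kernel_size=4, stride=2, padding=1)          # Equation (12)
act = nn.LeakyReLU(0.2)                                               # Equation (14), k = 0.2
bn = nn.BatchNorm2d(64)                                               # Equation (15)
up = nn.ConvTranspose2d(64, 1, kernel_size=4, stride=2, padding=1)   # Equation (16)

y = bn(act(conv(x)))       # downsampling path: 1x512x512 -> 64x256x256
z = up(torch.relu(y))      # upsampling path:   64x256x256 -> 1x512x512
print(y.shape, z.shape)    # torch.Size([1, 64, 256, 256]) torch.Size([1, 1, 512, 512])
```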

2.5. Structure of Convolution Neural Network

The structure of the deep learning model used in this paper is shown in Figure 8. The model includes 8 Conv layers, 7 LeakyReLU layers, 14 BN layers, 8 ConvTrans layers, 8 ReLU layers, 45 layers in total.
The black arrows in Figure 8 indicate skip connections. Their function is to splice the front and rear feature maps, improving the information available during upsampling and the reconstruction of image details by the network. The blue plus sign in Figure 8 is the residual structure. Its function is to add the corresponding channels of two feature maps to form a new one, which facilitates the training of the deep network.
The input of the CNN in this paper is a single-channel two-dimensional image with a size of 512 × 512 pixels. After a Conv layer with a 4 × 4 kernel and a stride of 2, 64 feature maps of 256 × 256 pixels are obtained. The second layer is a LeakyReLU layer with a negative slope of 0.2; the number and size of the feature maps are unchanged. The third layer is the same as the first layer; after convolution, 128 feature maps of 128 × 128 pixels are obtained. The fourth layer is a BN layer, which normalizes each feature map to zero mean and unit variance. After five more repetitions of the LeakyReLU, Conv and BN layers, 512 feature maps of 4 × 4 pixels are obtained. After the LeakyReLU and Conv layers of the 20th and 21st layers, the number of feature maps remains unchanged and the size becomes 2 × 2; the 22nd layer is a ReLU layer. The 23rd layer is a ConvTrans layer with a 4 × 4 kernel and a stride of 2; after this layer, 512 feature maps of 4 × 4 pixels are obtained. The 24th layer is a BN layer. Before entering the 25th layer, the network concatenates the output feature maps of the 19th layer with those of the 24th layer to obtain 1024 feature maps of 4 × 4 pixels. After seven repetitions of the ReLU, ConvTrans and BN layers, the number of feature maps is compressed to 1 and the size is expanded to 512 × 512 pixels. Before the final output, the input image is added to the network output feature map to obtain the deblurred image.
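A schematic PyTorch sketch of this encoder-decoder is given below: eight stride-2 Conv layers with LeakyReLU and BN, eight ConvTrans layers with ReLU and BN, skip connections by channel concatenation, and a global residual connection from input to output. The class name is ours, and the channel widths at intermediate depths are our assumptions where the text does not state them explicitly.

```python
# Schematic sketch of the network in Figure 8; intermediate channel widths are assumed.
import torch
import torch.nn as nn

class TurbDeblurNet(nn.Module):
    def __init__(self):
        super().__init__()
        enc_ch = [1, 64, 128, 256, 512, 512, 512, 512, 512]        # 8 encoder stages
        self.enc = nn.ModuleList()
        for i in range(8):
            block = [nn.Conv2d(enc_ch[i], enc_ch[i + 1], 4, stride=2, padding=1)]
            if 0 < i < 7:                       # first and last Conv have no BN (Table 2)
                block.append(nn.BatchNorm2d(enc_ch[i + 1]))
            self.enc.append(nn.Sequential(*block))
        self.act_down = nn.LeakyReLU(0.2)

        dec_in = [512, 1024, 1024, 1024, 1024, 512, 256, 128]      # after concatenation
        dec_out = [512, 512, 512, 512, 256, 128, 64, 1]
        self.dec = nn.ModuleList()
        for i in range(8):
            self.dec.append(nn.Sequential(
                nn.ConvTranspose2d(dec_in[i], dec_out[i], 4, stride=2, padding=1),
                nn.BatchNorm2d(dec_out[i])))
        self.act_up = nn.ReLU()

    def forward(self, x):
        skips, h = [], x
        for i, blk in enumerate(self.enc):       # 512 -> 2 pixels, LeakyReLU before Conv
            h = blk(self.act_down(h)) if i > 0 else blk(h)
            skips.append(h)
        for i, blk in enumerate(self.dec):       # 2 -> 512 pixels, ReLU before ConvTrans
            h = blk(self.act_up(h))
            if i < 7:
                h = torch.cat([h, skips[6 - i]], dim=1)   # skip connection by concatenation
        return x + h                                      # global residual connection

# Example: a batch of 512 x 512 grayscale images in, deblurred images of the same size out.
# net = TurbDeblurNet(); out = net(torch.randn(2, 1, 512, 512))
```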

2.6. Model Hyperparameter Details

The hyperparameters for each layer of our model are listed in Table 2. The network expects a 512 × 512 grayscale image as input. The batch size loaded onto the network is 48, and the network is trained for 300 epochs. The initial learning rate is set to 0.0005 and is halved after every 20 epochs of training. We use the Adaptive Moment Estimation (Adam) optimizer. The loss function of the network is computed from the network-generated image and the corresponding clear image as follows:
$$\mathrm{Loss}(y, z) = \frac{1}{N}\sum_{i=1}^{N} |y_i - z_i| \tag{17}$$
where $y$ and $z$ are the network-generated and clear images, $N$ is the number of image pixels, and $y_i$ and $z_i$ are the pixel values of the network-generated image and the clear image.
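A minimal training-loop sketch with these hyperparameters (Adam, initial learning rate 5e-4 halved every 20 epochs, batch size 48, 300 epochs, L1 loss of Equation (17)) might look as follows. The dataset object and the TurbDeblurNet model are assumed placeholders for the simulated blurred/clean image pairs of Section 2.3 and the network of Section 2.5.

```python
# Minimal training-loop sketch; the dataset and model objects are assumed placeholders.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader

def train(model, train_set, epochs=300, batch_size=48, device="cuda"):
    loader = DataLoader(train_set, batch_size=batch_size, shuffle=True)
    model = model.to(device)
    criterion = nn.L1Loss()                                              # Equation (17)
    optimizer = torch.optim.Adam(model.parameters(), lr=5e-4)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=20, gamma=0.5)
    for epoch in range(epochs):
        for blurred, clean in loader:                                    # simulated image pairs
            blurred, clean = blurred.to(device), clean.to(device)
            optimizer.zero_grad()
            loss = criterion(model(blurred), clean)
            loss.backward()
            optimizer.step()
        scheduler.step()                                                 # halve lr every 20 epochs
    return model
```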

3. Experimental Results and Analysis

3.1. Training Results

We use the image training sets of the $D/r_0 = 2.4312$, $D/r_0 = 4.2130$ and $D/r_0 = 16.7721$ turbulence intensities to train the CNN. The training loss curves are shown in Figure 9. As the number of training epochs increases, the loss functions of the three training sets gradually decrease and finally converge to 0.02065, 0.02403 and 0.03211, respectively. At the same time, the loss functions of the validation sets also gradually decrease and converge to 0.02170, 0.02422 and 0.03221, respectively. The test sets are used to evaluate the restoration performance of the model, as shown in Figure 10.
To verify the effectiveness of the neural network algorithm, the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) are used as objective evaluation indicators to evaluate the differences between the generated image and the original clear image. The PSNR reflects the difference between the corresponding pixels of the original clear image and the restored image. The higher the PSNR, the sharper the reconstructed image and the better the effect. PSNR is calculated as follows:
$$f_{\mathrm{PSNR}} = 20 \lg \frac{2^{k} - 1}{\sqrt{f_{\mathrm{MSE}}}} \tag{18}$$
where $f_{\mathrm{MSE}}$ is the mean squared error and $k$ is the bit depth of the image. SSIM reflects the similarity between the original clear image and the restored image; the closer the value is to 1, the closer the reconstructed image is to the clear image. SSIM is calculated as follows:
$$f_{\mathrm{SSIM}}(I, I_g) = \frac{\left(2\mu_I \mu_{I_g} + c_1\right)\left(2\delta_{I, I_g} + c_2\right)}{\left(\mu_I^2 + \mu_{I_g}^2 + c_1\right)\left(\delta_I^2 + \delta_{I_g}^2 + c_2\right)} \tag{19}$$
where $I$ is the original clear image, $I_g$ is the reconstructed image generated by the neural network, $\mu$ is the mean value of the image, $\delta^2$ is the variance of the image, $\delta_{I,I_g}$ is the covariance of the two images, and $c_1$ and $c_2$ are constants set to 6.5 and 58.5. In the experiment, we take the average PSNR and SSIM over the test sets; the results are listed in Table 3. Under all three turbulence intensities, the PSNR and SSIM of the images reconstructed by the neural network are improved compared with the blurred images. For the three turbulence intensities from low to high, the PSNR of the images increased by 6.32 dB, 5.23 dB and 3.11 dB, and the SSIM increased by 22.99%, 20.89% and 15.58%, respectively. The reconstructed images are thus improved in both overall distribution and detail compared with the blurred images.
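For reference, a hedged sketch of the two metrics for 8-bit grayscale images is given below; the single-window (whole-image) form of SSIM is an implementation simplification, and the constants follow the values given above.

```python
# Sketch of Equations (18)-(19) for 8-bit grayscale images; whole-image SSIM window assumed.
import numpy as np

def psnr(clean, restored, k=8):
    """Peak signal-to-noise ratio, Equation (18)."""
    mse = np.mean((clean.astype(np.float64) - restored.astype(np.float64)) ** 2)
    return 20 * np.log10((2 ** k - 1) / np.sqrt(mse))

def ssim_global(clean, restored, c1=6.5, c2=58.5):
    """Single-window SSIM, Equation (19), computed over the whole image."""
    x, y = clean.astype(np.float64), restored.astype(np.float64)
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2))
```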

3.2. Comparison of Actual Restoration Effects

In the winter of 2022, we chose sunny weather with high visibility for the experiment. We observed ground targets at a distance of 7000 m and obtained 12,521 short-exposure images; the observation parameters are listed in Table 4. After removing the pixel offset in the observed images with the optical flow method, we used the trained CNN to restore the registered images. Advanced algorithms currently used for turbulence image restoration include Mao et al. [2] (Optical Flow + Lucky Fusion + Deconv), Chen et al. [17] (CNN), Fazlali et al. [18] (Lucky Fusion + CNN), Hoffmire et al. [19] (Lucky Fusion + CNN) and Zhiyuan et al. [20] (CNN). We compare our restored images with the results of these algorithms in Figure 11.
As can be seen from Figure 11, our algorithm improves both the overall definition and the local details of the short-exposure turbulence-degraded images. Both PSNR and SSIM require the original clear image in their calculation, but in actual observation, images unaffected by the atmosphere cannot be obtained. Therefore, we introduce the Tenengrad function [28], a commonly used image sharpness metric: the larger its value, the sharper the image. The Tenengrad function is calculated as follows:
$$f_{\mathrm{Ten}} = \sum \left(f_x^{2} + f_y^{2}\right) \tag{20}$$
$$f_x = I_g * \begin{bmatrix} -1 & 0 & 1 \\ -2 & 0 & 2 \\ -1 & 0 & 1 \end{bmatrix}, \qquad f_y = I_g * \begin{bmatrix} -1 & -2 & -1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{bmatrix} \tag{21}$$
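A short sketch of Equations (20) and (21) is given below; summing the squared Sobel gradients over all pixels is the usual convention and is assumed here.

```python
# Sketch of the Tenengrad sharpness measure, Equations (20)-(21).
import numpy as np
from scipy.ndimage import convolve

def tenengrad(img):
    sx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)   # Equation (21)
    sy = sx.T
    fx = convolve(img.astype(np.float64), sx, mode="reflect")
    fy = convolve(img.astype(np.float64), sy, mode="reflect")
    return np.sum(fx ** 2 + fy ** 2)                                   # Equation (20)
```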
To demonstrate the correlation between Tenengrad and PSNR, we plotted a scatter plot of Tenengrad versus PSNR using simulated restored images, as shown in Figure 12. Figure 12 shows a significant positive correlation between Tenengrad and PSNR: the larger the Tenengrad of a restored image, the larger its PSNR is likely to be, and the closer it is to the original clear image.
We calculated the Tenengrad values for all images in Figure 11, as shown in Figure 13. Overall, our algorithm achieves slightly higher Tenengrad values than the other algorithms. For the tower-body images, whose content is relatively simple, our algorithm improves the image significantly.

4. Discussion

Presently, there are many algorithms for restoring turbulence-degraded images, and most of them can achieve good results. However, many conditions limit their application; the two main limitations are the demands on computation time and data volume. We measured the computation time required by the abovementioned algorithms to restore 100 images, as shown in Figure 14. Our algorithm has the shortest running time, because we replaced the most time-consuming steps, lucky image fusion and blind deconvolution, with a CNN. At the same time, the CNN in our algorithm has smaller convolution kernels than the other CNNs, and its number of convolutional layers is not particularly large.
Although the CNN has clear advantages in computation time, it is not yet stable enough in image restoration. The basic mechanism of CNN image restoration is to extract image features through different convolutional kernels; when the gradient changes in an image are insignificant, the features extracted by the CNN are also affected. Partitioning the dataset and training separately on images with less significant gradient changes (i.e., more strongly blurred images) may be a better approach. In other words, reducing the complexity of the dataset can potentially improve the accuracy of deep learning [29,30].

5. Conclusions

Restoring images degraded by the atmosphere is known to be challenging because it is an ill-posed problem. Traditional lucky image fusion and blind deconvolution require significant computation time, while existing CNN restoration algorithms suffer from unstable restoration performance and poor generality. We propose a turbulence mitigation algorithm combining the optical flow method and a CNN to address these problems. We first use RPCA to extract reference frames from the observed images. Then, the optical flow and the reference frame are used to register the observed images. We use the Zernike polynomial method to simulate turbulence-degraded datasets; this training set conforms to the turbulence physical model, ensuring that the CNN can correctly learn the statistics of turbulence. Finally, the trained CNN removes the blur from the registered images.
The experimental results demonstrate that our method can effectively alleviate the degradation caused by turbulence. In addition, compared with other methods, our algorithm requires less computation time to restore high-quality images. With further optimization of the algorithm parameters and updates to hardware technology, this algorithm will find wider application.

Author Contributions

Conceptualization, J.C. and W.Z.; methodology, J.C.; software, J.C.; validation, J.C., G.X. and C.Y.; formal analysis, J.C. and C.Y.; investigation, J.C.; resources, G.X. and X.C.; data curation, J.C.; writing—original draft preparation, J.C.; writing—review and editing, W.Z. and J.L.; visualization, J.C.; supervision, W.Z. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tatarskii, V.I. Wave Propagation in a Turbulent Medium. Science 1961, 134, 324–325.
  2. Mao, Z.; Chimitt, N.; Chan, S.H. Image Reconstruction of Static and Dynamic Scenes Through Anisoplanatic Turbulence. IEEE Trans. Comput. Imaging 2020, 6, 1415–1428.
  3. Paxman, R.G.; Rogne, T.J.; Sickmiller, B.A.; Lemaster, D.A.; Miller, J.J.; Vollweiler, C.G. Spatial stabilization of deep-turbulence-induced anisoplanatic blur. Opt. Express 2016, 24, 29109–29125.
  4. Xie, Y.; Zhang, W.; Tao, D.; Hu, W.; Qu, Y.; Wang, H. Removing turbulence effect via hybrid total variation and deformation-guided kernel regression. IEEE Trans. Image Process. 2016, 25, 4943–4958.
  5. Hardie, R.C.; Rucci, M.A.; Dapore, A.J.; Karch, B.K. Block matching and Wiener filtering approach to optical turbulence mitigation and its application to simulated and real imagery with quantitative error analysis. Opt. Eng. 2017, 56, 071503.
  6. Gilles, J.; Osher, S. Wavelet burst accumulation for turbulence mitigation. J. Electron. Imaging 2016, 25, 033003.
  7. Fried, D.L. Probability of getting a lucky short-exposure image through turbulence. J. Opt. Soc. Am. 1978, 68, 1651–1658.
  8. Anantrasirichai, N.; Achim, A.; Kingsbury, N.G.; Bull, D.R. Atmospheric turbulence mitigation using complex wavelet-based fusion. IEEE Trans. Image Process. 2013, 22, 2398–2408.
  9. Aubailly, M.; Vorontsov, M.A.; Carhart, G.W.; Valley, M.T. Automated video enhancement from a stream of atmospherically-distorted images: The lucky-region fusion approach. Proc. SPIE 2009, 74, 74630C.
  10. Zhu, X.; Milanfar, P. Removing atmospheric turbulence via space-invariant deconvolution. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 157–170.
  11. Lau, C.P.; Lai, Y.H.; Lui, L.M. Restoration of atmospheric turbulence-distorted images via RPCA and quasiconformal maps. Inverse Probl. 2019, 35.
  12. He, R.; Wang, Z.; Fan, Y.; Feng, D. Atmospheric turbulence mitigation based on turbulence extraction. In Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 20–25 March 2016; pp. 1442–1446.
  13. Zhang, Z.; Yang, X. Reconstruction of distorted underwater images using robust registration. Opt. Express 2019, 27, 9996–10008.
  14. Vorontsov, S.V.; Strakhov, V.N.; Jefferies, S.M.; Borelli, K.J. Deconvolution of astronomical images using SOR with adaptive relaxation. Opt. Express 2011, 19, 13509–13524.
  15. Shan, Q.; Jia, J.; Agarwala, A. High-quality motion deblurring from a single image. ACM Trans. Graph. 2008, 27, 1–10.
  16. Fried, D.L. Optical resolution through a randomly inhomogeneous medium for very long and very short exposures. J. Opt. Soc. Am. 1966, 56, 1372–1379.
  17. Chen, G.; Gao, Z.; Wang, Q.; Luo, Q. U-net like deep autoencoders for deblurring atmospheric turbulence. J. Electron. Imaging 2019, 28, 053024–053034.
  18. Fazlali, H.R.; Shirani, S.; Bradford, M.; Kirubarajan, T. Atmospheric Turbulence Removal in Long-Range Imaging Using a Data-Driven-Based Approach. Int. J. Comput. Vis. 2022, 130, 1031–1049.
  19. Hoffmire, M.A.; Hardie, R.C.; Rucci, M.A.; Van Hook, R.L.; Karch, B.K. Deep learning for anisoplanatic optical turbulence mitigation in long-range imaging. Opt. Eng. 2021, 60, 033103–033113.
  20. Mao, Z.; Jaiswal, A.; Wang, Z.; Chan, S.H. Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and A New Physics-Inspired Transformer Model. Eur. Conf. Comput. Vis. 2022, 60, 033103–033124.
  21. Candés, E.J.; Li, X.; Ma, Y.; Wright, J. Robust principal component analysis? J. ACM 2011, 58, 1–37.
  22. Liu, X.; Zhao, G.; Yao, J.; Qi, C. Background subtraction based on low-rank and structured sparse decomposition. IEEE Trans. Image Process. 2015, 24, 2502–2514.
  23. Amiaz, T.; Lubetzky, E.; Kiryati, N. Coarse to over-fine optical flow estimation. Pattern Recognit. 2007, 40, 2496–2503.
  24. Noll, R.J. Zernike polynomials and atmospheric turbulence. J. Opt. Soc. Am. 1976, 66, 207–211.
  25. Chimitt, N.; Chan, S.H. Simulating Anisoplanatic Turbulence by Sampling Correlated Zernike Coefficients. In Proceedings of the 2020 IEEE International Conference on Computational Photography (ICCP), St. Louis, MO, USA, 24–26 April 2020; pp. 1–12.
  26. Fried, D.L. Limiting resolution looking down through the atmosphere. J. Opt. Soc. Am. 1966, 56, 1380–1384.
  27. Roggemann, M.; Welsh, B.; Hunt, B. Imaging Through Turbulence; Laser & Optical Science & Technology; CRC Press: Boca Raton, FL, USA, 1996.
  28. Groen, F.; Young, I.; Ligthard, G. A comparison of different focus functions for use in autofocus algorithms. Cytometry 1985, 6, 81–91.
  29. Bolon-Canedo, V.; Remeseiro, B. Feature selection in image analysis: A survey. Artif. Intell. Rev. 2020, 53, 2905–2931.
  30. Kabir, H.; Garg, N. Machine learning enabled orthogonal camera goniometry for accurate and robust contact angle measurements. Sci. Rep. 2023, 13, 1497.
Figure 1. Block diagram of the proposed restoration scheme.
Figure 2. Low-rank decomposition of the observed image.
Figure 3. Coarse-to-fine optical flow method for image registration.
Figure 4. Zernike polynomial coefficient covariance matrix and generated PSF diagram. (a) Covariance matrix; (b) generated PSF diagram.
Figure 5. Convolutional results of different blocks and PSFs in an image.
Figure 6. Simulation results under different conditions of D/r0: (a) clean image; (b) D/r0 = 2.4312; (c) D/r0 = 4.2130; (d) D/r0 = 16.7721.
Figure 7. The operational logic of Conv and ConvTrans layers in CNNs: (a) Conv layer; (b) ConvTrans layer.
Figure 8. CNN structure.
Figure 9. Loss function curves of three turbulence intensities.
Figure 10. Image restoration results under three turbulence intensities: (a–c) turbulence-blurred images; (d–f) neural network restored images; (g–i) original clear images.
Figure 11. Comparison of restoration effects of actual observation images: (a,i) original observation images; (b,j) optical flow registration images [4]; (c,k) our CNN; (d,l) Mao et al. [2] (Optical Flow + Lucky Fusion + Deconv); (e,m) Chen et al. [17] (CNN); (f,n) Fazlali et al. [18] (Lucky Fusion + CNN); (g,o) Hoffmire et al. [19] (Lucky Fusion + CNN); (h,p) Zhiyuan et al. [20] (CNN).
Figure 12. Scatter plot of Tenengrad and PSNR.
Figure 13. Tenengrad values of all restored images [2,17,18,19,20].
Figure 14. Running time for all algorithms to restore 100 images [2,17,18,19,20].
Table 1. Simulator parameters.

Parameter                                  Value
Path length                                L = 7 km
Aperture diameter                          D = 0.305 m
Focal length                               d = 2.438 m
Wavelength                                 λ = 525 nm
Zernike phase size                         64 × 64 pixels
Image size                                 512 × 512 pixels
Nyquist spacing (object plane) Lλ/(2D)     δ_o = 6.0245 mm
Nyquist spacing (focal plane) dλ/(2D)      δ_f = 2.0983 μm
Table 2. Model hyperparameters.

Type        Layer numbers                                            Kernel   Stride   Padding   Negative slope k
Conv        1, 3, 6, 9, 12, 15, 18, 21                               4 × 4    2 × 2    1         -
ConvTrans   23, 26, 29, 32, 35, 38, 41, 44                           4 × 4    2 × 2    1         -
LeakyReLU   2, 5, 8, 11, 14, 17, 20                                  -        -        -         0.2
ReLU        22, 25, 28, 31, 34, 37, 40, 43                           -        -        -         0
BN          4, 7, 10, 13, 16, 19, 24, 27, 30, 33, 36, 39, 42, 45     -        -        -         -
Table 3. Mean results of PSNR and SSIM under three turbulence intensities.

Images                                      Mean PSNR/dB (↑)   Mean SSIM/% (↑)
Blurred images of D/r0 = 2.4312             22.94              64.65
Reconstructed images of D/r0 = 2.4312       29.26              87.64
Blurred images of D/r0 = 4.2130             22.91              63.78
Reconstructed images of D/r0 = 4.2130       28.14              84.67
Blurred images of D/r0 = 16.7721            22.45              59.50
Reconstructed images of D/r0 = 16.7721      25.56              75.08

↑: the PSNR and SSIM of the images reconstructed by the neural network are improved compared with the blurred images.
Table 4. Environmental parameters during observation.

Parameter                       Value
Path length                     L = 7 km
Aperture diameter               D = 0.305 m
Focal length                    d = 2.438 m
Atmospheric coherence length    r_0 = 0.0229 m
Exposure time                   t = 2 ms
