Automatic Noise Estimation from the Multiresolution Support

Jean‐Luc Starck; Fionn Murtagh

doi:10.1086/316124

1. INTRODUCTION

Noise is a necessary evil in astronomical image processing. If we can reliably estimate noise, through knowledge of instrument properties or otherwise, subsequent analyses may be very much better behaved. In fact, major problems would disappear if this were the case; for example, image restoration or sharpening could become simpler.

In practice, rule‐of‐thumb calculation of noise is often carried out. For instance, limited convex regions of what is considered as background are sampled, and the noise is determined in these regions. One perspective on the theme of this paper is that we take such artisanal methods and put them on a firm footing. Advantages of doing so include objectivity of treatment, better quality noise estimates because of far greater thoroughness, and automation of an otherwise interactive procedure.

In many instances, knowledge of the instrument leads to information about the detector's noise properties (Snyder, Hammoud, & White 1993; Tekalp & Pavlović 1991). In this paper we address the issue of estimating noise when such detector information is not available to us. This may be because the image calibration and preprocessing are difficult to model. It may be because we have to analyze images of unknown detector provenance. Or it may be to have methods available for noise determination which serve to check on supposed noise properties of the detector. Consider, for example, the case of the ISOCAM infrared camera on board Infrared Space Observatory (ISO) (see § 4.4 below). Noise must be estimated separately for each of the 32 × 32 pixels. Consider, too, a survey, or the setting up of instrument pipeline processing, in which the volume of data necessitates automated approaches.

We start with a consideration of the Gaussian case and then incorporate the Poisson case. Such noise distributions are the most important for CCD, digitized photographic, and other common astronomical image detectors.

In various studies of image and spectral filtering and image compression (see Starck, Bijaoui, & Murtagh 1995; Starck et al. 1996), the basis of the methodology lay in determining noise and separating signal from noise. "Signal" is what we term the scientifically interesting part of the data. Signal is often very compressible, whereas noise by definition is not compressible. Effective separation of signal and noise is evidently of great importance in the physical sciences.

For common noise distributions, noise is classically specified by its standard deviation. There are different ways to estimate the standard deviation of Gaussian noise in an image (Lee 1981; Mastin 1985; Meer, Jolian, & Rosenfeld 1990; Bracho & Sanderson 1985; Vorhees & Poggio 1987; Lee & Hoppel 1989). Olsen (1993) carried out an evaluation of six methods and showed that the best was the average method, which is also the simplest. This method (Olsen 1993) consists of filtering the data I with the average filter and subtracting from I the filtered image. Then a measure of the noise at each pixel is computed.

To keep image edges from contributing to the estimate, the noise measure is disregarded if the magnitude of the intensity gradient is larger than T. This threshold value may be found from the cumulative histogram of the magnitude of the intensity gradient. An alternative to this thresholding is to apply a k‐σ clipping.

We show in this paper how the noise variance can be estimated in a robust way from a recently introduced data structure called the multiresolution support (Starck et al. 1995). The effect of an error in the variance estimation is also studied in two different applications. The first is source detection, and the second is cosmic‐ray removal. In § 5, the case of a noise composed of both Poisson and Gaussian noise is analyzed. We propose an algorithm to find the Gaussian part of the noise.

2. MULTIRESOLUTION SUPPORT

2.1. Definition

We will say that a multiresolution support (Starck et al. 1995) of an image describes in a logical or Boolean way if an image I contains information at a given scale j and at a given position (x,y). If M^(I)(j,x,y) = 1 (or = true), then I contains information at scale j and at the position (x,y). M depends on several parameters:

The input image;
The algorithm used for the multiresolution decomposition;
The noise;
All constraints that we want the support to satisfy in addition.

Such a support results from the data and the treatment (noise estimation, etc.), and from our knowledge of the objects contained in the data (size of objects, linearity, etc.). In the most general case, a priori information is not available to us.

The multiresolution support of an image is computed in several steps:

1.
The wavelet transform of the image is computed.
2.
Booleanization of each scale leads to the multiresolution support.
3.
A priori knowledge can be introduced by modifying the support.

The last step depends on the knowledge we have of our images. For instance, if we know there is no interesting object smaller or larger than a given size in our image, we can suppress, in the support, anything that is due to that kind of object. This can often be done conveniently by the use of mathematical morphology. In the most general setting, we naturally have no information to add to the multiresolution support.

2.2. Multiresolution Support from the Wavelet Transform

The wavelet transform of an image by an algorithm such as the à trous method (Shensa 1992) produces, at each scale j, a set {w_j}. This has the same number of pixels as the image, and thus this wavelet transform is a redundant one. The original image c₀ can be expressed as the sum of all the wavelet scales and the smoothed array c_p:

A pixel at position (x,y) can also be expressed as the sum all the wavelet coefficients at this position, plus the smoothed array:

The multiresolution support will be obtained by detecting at each scale the significant coefficients. The multiresolution support is defined by

Given stationary Gaussian noise, in order to define if w_j is significant it suffices to compare w_j(x,y) to kσ_j. Often k is chosen as 3. If w_j(x,y) is small, it is not significant and could be due to noise. If w_j(x,y) is large, it is significant. That is,

So we need to estimate, in the case of Gaussian noise models, the noise standard deviation at each scale.

2.3. Estimation of Noise Standard Deviation at Each Scale

The appropriate value of σ_j in the succession of wavelet planes is assessed from the standard deviation of the noise σ_I in the original image and from study of the noise in the wavelet space. This study consists of simulating an image containing Gaussian noise with a standard deviation equal to 1 and taking the wavelet transform of this image. Then we compute the standard deviation σ^e_j at each scale. We obtain a curve σ^e_j as a function of j, which gives the behavior of the noise in the wavelet space. (Note that if we had used an orthogonal wavelet transform, this curve would be linear.) As a result of the properties of the wavelet transform, we have σ_j = σ_Iσ^e_j. The standard deviation of the noise at a scale j of the image is equal to the standard deviation of the noise of the image multiplied by the standard deviation of the noise of the scale j of the wavelet transform.

2.4. Algorithm for Determining the Multiresolution Support

The algorithm to create the multiresolution support is as follows:

1.
We compute the wavelet transform of the image.
2.
We estimate the noise standard deviation at each scale. We deduce the statistically significant level at each scale.
3.
Booleanization of each scale leads to the multiresolution support.
4.
Modification using a priori knowledge (if desired).

Step 4 is optional. A typical use of a priori knowledge is the suppression of isolated pixels in the multiresolution support in the case in which the image is obtained with a point‐spread function (PSF) of more than 1 pixel. Then we can be sure that isolated pixels are residual noise that has been detected as significant coefficients. If we use a pyramidal wavelet transform method (Starck et al. 1996) or a nonlinear multiresolution transform, the same method can be used.

In order to visualize the support, we can create an image S defined by

Figure 1 shows such a multiresolution support visualization of an image of galaxy NGC 2997.

**Fig. 1.—** Multiresolution support representation of a spiral galaxy

3. NOISE VARIANCE ESTIMATION

The Gaussian noise σ_I can be estimated automatically in an image I. This estimation is very important because all the noise standard deviations σ_j in the scales j are derived from σ_I. Thus an error associated with σ_I will introduce an error on all σ_j. This measure of σ_I can be refined by the use of the multiresolution support. Indeed, if we consider the set of pixels S in the image that are due only to the noise and take the standard deviation of them, we should find the same value σ_I. This set is easily obtained from the multiresolution support. We say that a pixel (x,y) belongs to the noise if M(j,x,y) = 0 for all j (i.e., there is no significant coefficient at any scale). The new estimation of σ_I is then computed by the following iterative algorithm:

1.
Estimate the standard deviation of the noise in I. We have σ⁰_I.
2.
Compute the wavelet transform (à trous algorithm) of the image I with p scales. We have
where w_j are the wavelet scales and c_p is the low‐frequency part of I. The noise in c_p is negligible.
3.
Set n to 0.
4.
Compute the multiresolution support M that is derived from the wavelet coefficients and from σ⁽ⁿ⁾_I.
5.
Select the pixels that belong to the set S: if M(j,x,y) = 0 for all j in 1,...,p, then the pixel (x,y)∈S.
6.
For all the selected pixels (x,y), compute the values I(x,y) - c_p(x,y) and compute the standard deviation σ^{(n + 1)}_I of these values. (We compute the difference between I and c_p in order not to include the background in the noise estimation.)
7.
n = n + l.
8.
If |σ⁽ⁿ⁾_I - σ^{(n - 1)}_I|/σ⁽ⁿ⁾_I>, then go to step 4.This method converges in a few iterations and allows the noise estimation to be improved.

The approach is in fact very physical. It consists of detecting the set Script N of pixels that does not contain any significant signal (only the background plus noise). A pixel (x,y) is dominated by the noise if all wavelet coefficients at this position are not significant. The background affects only the last scale of the wavelet transform. We subtract this last scale from the original image, and we compute the standard deviation of the set Script N in this background‐free image. Wavelet coefficients larger than 3 σ_j are considered as significant, but a small fraction of them will be due to the noise. This introduces a small systematic bias in the final solution, which is easily corrected by dividing the standard deviation by a given constant value, found experimentally as equal to 0.974. Therefore, we downgrade the empirical variance in this way. The method is robust, and whatever the initial estimation of noise, it converges quickly to a good estimate.

4. APPLICATIONS

4.1. Example 1

We simulate an image containing around 100 objects (stars and galaxies) and a background. Then we derive a set of images by adding a Gaussian noise with different standard deviation. The original image had a mean value of 1431 ADU (analog‐to‐digital units), minimum and maximum values of 1236 and 18,365, and a σ (standard deviation) of 546. The added Gaussian values, using the original image's mean value, had standard deviations ranging from 2 (an almost imperceptible change) to 5000 (so that noise totally dominates).

Results presented in Table 1 show the robustness of the standard deviation estimation from the multiresolution support. The error percentages are defined as

Note how, with imperceptible change (σ small, close to 2), most methods perform badly relative to this admittedly tiny value of σ. Broadly speaking, for sufficiently large noise, the various methods do well. We see, however, that the multiresolution support does remarkably well, for all values of the noise.

4.2. Example 2

Figure 2 shows a simulated image mostly of galaxies, as used by Freudling & Caulet (1993) and Caulet & Freudling (1993). We added Gaussian noise, which leads to varying signal‐to‐noise ratios (SNR) (defined as 10×log ₁₀[σ(input image)/σ(added noise)]). Figure 3 shows an image with SNR = −0.002. This image was produced by adding noise of standard deviation arbitrarily chosen to be equal to the overall standard deviation of the original image. For varying SNRs, the following three methods were assessed. First, a median‐smoothed version of the image (which provides a crude estimate of the image background) is subtracted, and the noise is estimated from this difference image. This noise estimate is made more robust by 3 σ clipping. Second, the multiresolution support approach described in this article was used, in which the specification of the multiresolution support was iteratively refined. Third, the noise statistics in a running block or square window are used, and the averages of these values are returned.

**Fig. 2.—** Simulated image of galaxies

**Fig. 3.—** Simulated image of galaxies with Gaussian noise (σ = 582.5) added.

Table 2 gives results obtained. The first column gives the noise as a standard deviation. The second column expresses this as SNR. The third, fourth, and fifth columns give automatic noise estimates using the median/3 σ clipping method, the iterative multiresolution support method, and the block method. The excellent estimation capability of the iterative multiresolution support method can be seen: the fourth column values are very close to the first column values.

4.3. The Most Appropriate Noise Variance Estimation Procedure for Object Detection

The goal of this section is to estimate to what extent an error in noise evaluation is significant for the detection of objects in astronomical images. The methodology used is to take an image with objects of known positions, to add noise, and then to compare the true object list with that found by an object detection program.

Starting from a simulated image containing 116 objects, their positions being known, we add a Gaussian noise with a standard deviation σ_N. Using an astronomical package for source detection, in which the noise standard deviation is a free parameter, it is possible to measure the impact of an error on the standard deviation in the results. Introducing an error between −20% and +20%, we compare the results of the detection using two criteria:

Percentage of real detected objects:
Percentage of false detections:
Since extended sources are sometimes detected as several smaller objects, we can have also multiple detections for one object.

A positive error is equivalent to an increase in σ_N, which implies detection with a higher threshold. Fewer objects are detected, but they are more likely to be true detections. Conversely, a negative error means decreasing the threshold, thus detecting more objects but making more false detections.

We define a parameter, called ratio, which links together ρ(o) and ρ(e):

Figure 4 shows values of this ratio for various values of standard deviations (σ) of added noise. From Figure 4, it seems that an error between −10% and +10% is acceptable. In this case, most methods are satisfactory. This result is important because it means that for a large survey, in which the amount of data is very great, we do not need a high‐precision estimate of the standard deviation of the noise. In such a situation, the computation time may well be a more important issue than the quality. Obviously, in this case, the multiresolution support, which requires somewhat greater computation time, should not be used.

**Fig. 4.—** Influence on ratio (see eq. [6], § 4.3) for an added Gaussian σ_N (ADU) noise.

4.4. Impact of Noise Variance Estimation for Deglitching

So far we have targeted small‐valued nuisance data. Now we turn our attention to very large‐valued nuisance or artifact data.

The ISOCAM infrared camera is one of the four instruments on board ISO, which was launched successfully on 1995 November 17. It operates in the 2.5–17 μm range and was developed by the French Service d'Astrophysique of CEA Saclay. For one observation, the ISOCAM instrument delivers a set of frames, so several measurements are available for a sky position. This is important because cosmic‐ray hits seriously affect the detector. Figure 5 (top) shows an example of what a detector pixel sees. Data averaging without taking into account these glitches (cosmic‐ray hits) would lead to a large error. Figure 5 (middle) shows the result using a deglitching algorithm (Starck et al. 1997). A parameter of this method is the standard deviation of the noise. As can be seen in Figure 5 (middle), many weak glitches have not been eliminated because of overestimation of noise variance. The effect of the error is a large number of false source detections in the final calibrated image. In Figure 5 (bottom), the same deglitching algorithm was used, but we used as input the noise variance found by the multiresolution method. The result is better, and all false detections have disappeared in the calibrated image.

**Fig. 5.—** *Top*: Original data—all peaks are due to cosmic rays; *middle*: deglitched data using a standard variance estimation; *bottom*: deglitched data using the multiresolution support method for the variance estimation. Pixels (at 2.1 s intervals) vs. ADU.

5. CASE OF POISSON AND GAUSSIAN NOISE

5.1. Anscombe Generalized Transform

If the noise in the data I is Poisson, the transform

acts as if the data arose from a Gaussian white noise model (Anscombe 1948), with σ = l, under the assumption that the mean value of I is large. The arrival of photons, and their expression by electron counts, on CCD detectors may be modeled by a Poisson distribution. In addition, there is additive Gaussian readout noise. The Anscombe transformation has been extended to take this combined noise into account. The generalization of the variance stabilizing Anscombe formula is derived as (Murtagh, Starck, & Bijaoui 1995)

where α is the gain, σ is the standard deviation, and g is the mean of the readout noise.

5.2. Automatic Estimation of Standard Deviation of Readout Noise

If the readout noise standard deviation is not well known, we can use the variance stabilization transform to estimate it. This is done by using the fact that we know that the standard deviation in t(I) should be equal to 1. The algorithm is as follows:

1.
Set n to 0, S_min to 0, and S_max to σ(I).
2.
Set r_n to (S_min + S_max)/2.
3.
Compute the transform of I with a readout noise standard deviation equal to r_n.
4.
Estimate the standard deviation of the noise σ_S in t(I) by the method described in § 3.
5.
If σ_S<1, then S_min = r_n; otherwise S_max = r_n.
6.
If S_max - S_min>, then n = n + l and go to step 2.
7.
r_n = (S_min + S_max)/2 is a good estimation of the readout noise standard deviation.

5.3. Simulation

Starting from the same image as in § 4.1, we simulate a set of images by adding Poisson noise and Gaussian noise. Using the algorithm described above, the Gaussian component was sought. The gain was taken as equal to one in all simulations. Results of these simulations are presented in Table 3. As we can see, the standard deviation of the readout noise can be found with good accuracy (less than 1%) when the readout noise is greater than 45. For very low readout noise, the error can become relatively high.

6. CONCLUSION

This work is complementary to any approach that seeks to noise‐filter (or "de‐noise") image data or to compress image data. It is a contribution also to a general methodology for handling of noise. Since noise is so fundamental in astronomy, the need for such a methodology is clear. The methods described here are very efficient. Our examples and applications have also shown the effectiveness and power of the innovative methodology described, which is based on the multiresolution support data structure and wavelet transforms.

Automatic Noise Estimation from the Multiresolution Support

Article metrics

Permissions

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION