Block thresholding and wavelet estimation for nonequispaced samples

https://doi.org/10.1016/S0378-3758(02)00238-0Get rights and content

Abstract

For samples with the design points occurring as a Poisson process or having a uniform distribution, the wavelet method of block thresholding can be applied directly to the data as though it was equispaced without sacrificing adaptivity or optimality. When the underlying true function is in certain Besov and Hölder classes, the resulting estimator achieves the minimax rate of convergence. Simulation results are examined.

Introduction

Wavelets have been shown to be very successful in nonparametric function estimation. Specifically, they excel in the areas of spatial adaptivity, optimality, and low computational cost. Typically, wavelet analysis is performed through the use of term-by-term thresholding of wavelet coefficients, such as the VisuShrink method of Donoho and Johnstone (1994). There, a noisy signal is transformed into empirical wavelet coefficients by the discrete wavelet transform, these coefficients are denoised by comparison with a specified thresholding rule, and the underlying function is estimated by applying the inverse discrete wavelet transform to these denoised coefficients. This method is adaptive and is within a logarithmic factor of the optimal minimax convergence rate over large classes of Besov functions.

Hall et al. (1999) proposed a method of wavelet analysis whereby the optimal minimax convergence rate is attained without the logarithm penalty found in the term-by-term methods. Using block thresholding, where the empirical wavelet coefficients are thresholded in blocks rather than individually, they achieved this optimal rate and maintained adaptivity over a large class of Hölder functions. By looking at coefficients in blocks rather than individually, more precise comparisons between the coefficients and the threshold is allowed, resulting in improved rates. Cai (1999) has extended this idea. Using a James–Stein thresholding rule, he has shown that this block-thresholded wavelet estimator attains the optimal convergence rate from both a global and local estimation perspective. A specific block length and threshold value are established that attain these rates. Additionally, this estimate is easy to implement and has a low computational cost of O(n).

The above methods have been developed for data that is equispaced. Little emphasis has been placed on sample data that is not equispaced, however. Cai and Brown (1998) investigated wavelet methods on samples with fixed, nonequispaced designs via an approximation approach. They showed that applying the methods devised for equispaced data directly to nonequispaced data can lead to suboptimal estimators. They then proposed a method that was adaptive and near optimal. Hall and Turlach (1997) used interpolation methods to deal with samples with random design. Unfortunately, these methods are much more complex from a computational standpoint than their equispaced counterparts.

Cai and Brown (1999) have also examined convergence rates when the unknown function is in a Hölder class with exponent α and the positions of the sample points are distributed as independent uniform random variables. Using term-by-term thresholding, they showed that the equispaced wavelet method can be directly applied to the nonequispaced data without a loss in the rate of convergence, i.e., to within a logarithmic factor of the optimal convergence rate of n−2α/(2α+1). This method maintains the computational efficiency and simplicity of the equispaced algorithm.

In this paper, it is known that these results of Cai and Brown (1999) for uniform design can be improved upon for Hölder classes and extended to many Besov classes through the use of block thresholding. Additionally, when the sample points occur as a Poisson process, the same optimal rates are attained. Adaptivity is kept, and the computational cost remains low since the equispaced algorithm is used. In this paper, the rates given are upper bounds on the rate of convergence. For a discussion on the lower bound of the rate of convergence, see Antos et al. (2000).

In Section 2 of this paper, the method of thresholding the wavelet coefficient is stated, and the theorems on convergence rates are put forth. Section 3 discusses simulation results, Section 4 defines the functions spaces of interest, and Section 5 contains the proofs.

Section snippets

Wavelets

φ and ψ will represent the father and mother wavelets, respectively. Both are assumed to be compactly supported. Let φjk and ψjk be the translations and dilations of φ and ψ:φjk(x)=2j/2φ(2jx−k),ψjk(x)=2j/2ψ(2jx−k).In this paper, wavelets periodized to the interval [0,1] will be used. The setj0kp:k=1,…,2j0}∪{ψjkp:j⩾j0,k=1,…,2j}is an orthonormal basis of L2[0,1], whereφjkp(x)=l=−∞φjk(x−l),ψjkp(x)=l=−∞ψjk(x−l),for x in [0,1]. These periodized wavelets will be used for the rest of the paper

Simulation

To compare the efficiency of the block estimator under the different sampling designs, it was run on equispaced, uniform, and Poisson process samples and the results were compared. Also, the block estimator was compared to the VisuShrink estimator (VU) with universal term thresholding on uniform distributed sample points.

The test functions used are those specified in Donoho and Johnstone (1994). These eight functions represent varying degrees of spatial inhomogeneity. Sample sizes ranged from n

Function spaces

In order to define the function space Fp,qα(M), two other spaces are needed. The first, and more general, of the two is the Besov space Bp,qα, where 0<p,q⩽∞ and α>0. A function f is said to be in this space if its Besov norm is finite||f||Bp,qα<∞,where, for 0<α⩽1||f||Bp,qα=||f||Lp+01h1hα||f(·+h)−f(·)||Lpqdh1/q,q<∞,sup0<h||f(·+h)−f(·)||Lp/hα,q=∞.For α>1,α=⌊α⌋+s,0<s⩽1,||f||Bp,qα=m=0⌊α⌋||f(m)||Bp,qs.The Hölder space Λα is a special case of a Besov space. Specifically, Λα=B∞,∞α. This paper

Proof

Before the main theorems are proved, some preliminaries are necessary. First, a result about the order statistics of the uniformly spaced sample points is stated.

Lemma 1

Let xi be independent, uniform (0,1) random variables. Let x(1)<x(2)<⋯<x(n) be their order statistics. Then x(k) is a Beta (k,nk+1) random variable with mean k/(n+1) and variance ((n+1)kk2)/(n+1)2(n+2).

The following result from Cai (1999) will also be needed.

Lemma 2

Suppose θjkΘp,qα(M) andyjkjk+1nσεjk,j⩾j0,k=1,2,…,2j,where zjk are iid

References (10)

  • A. Antos et al.

    Lower bounds on the rate of convergence of nonparametric regression estimates

    J. Statist. Plann. Inference

    (2000)
  • T. Cai

    Adaptive wavelet estimationa block thresholding and oracle inequality approach

    Ann. Statist.

    (1999)
  • T. Cai et al.

    Wavelet shrinkage for nonequispaced samples

    Ann. Statist.

    (1998)
  • T. Cai et al.

    Wavelet estimation for samples with random uniform design

    Statist. Probab. Lett.

    (1999)
  • Chicken, E., 2001. Nonparametric regression and density estimation in Besov spaces via wavelets. Ph.D. Thesis, Purdue...
There are more references available in the full text version of this article.

Cited by (0)

View full text