Feature extraction using wavelet and fractal

https://doi.org/10.1016/S0167-8655(01)00003-4

Abstract

In this paper, we investigate the utility of several emerging techniques for feature extraction. A novel method of feature extraction is proposed, which utilizes the central projection transformation (CPT) to describe the shape, the wavelet transformation to aid in boundary identification, and fractal features to enhance image discrimination. It reduces the dimensionality of a two-dimensional pattern by way of a central projection approach and thereafter performs Daubechies' wavelet transform on the derived one-dimensional pattern to generate a set of wavelet transform sub-patterns, namely, curves that are non-self-intersecting. The divider dimensions are computed from these curves with a modified box-counting approach. These divider dimensions constitute a new feature vector for the original two-dimensional pattern, defined over the curves' fractal dimensions. We have conducted several experiments in which a set of printed Chinese characters, English letters of varying fonts and other images were classified. Classification based on the Euclidean distance between the different feature vectors yielded satisfactory results.

Introduction

Feature extraction is the heart of a pattern recognition system. In pattern recognition, features are utilized to distinguish one class of pattern from another. The goal of feature extraction is to find a transformation from an n-dimensional observation space X to a smaller m-dimensional feature space, spanned by feature vectors X_i^T = (x_i1, x_i2, …, x_in), i = 1, …, m, that retains most of the information needed for pattern classification. The coordinate axes that define the feature space are called features. There are two main reasons for performing feature extraction. First, the computational complexity of pattern classification is reduced by dealing with the data in a lower-dimensional space. Second, for a given number of training samples, one can generally obtain more accurate estimates of the class-conditional density functions and thus formulate a more reliable decision rule. Whether or not this decision rule actually performs better than one applied in the observation space depends on how much information was lost in the feature transformation. In some cases, it is possible to derive features that sacrifice none of the information needed for classification.

One may think that the best description of a pattern is the pattern itself. So why do we need a feature extraction step? In other words, why not use the patterns themselves as features? Classification may be seen as a function from the pattern space X to the set of all possible classes. If X has a high dimension, which will be the case if bitmaps are used as features, applying this function may require a lot of CPU time. By extracting features from the pattern we decrease the dimensionality and therefore speed up the classification. The second important reason is that normalization is faster and more accurate in feature space than in pattern space.

Features can be extracted directly from the bitmaps, but this is not the only possible source. Other commonly used sources are described below (Jain et al., 1996; Sazaklis, 1997):

1. Extraction from the boundary curves of the pattern. A boundary-following algorithm can be run on the bitmap in order to obtain the coordinates [x(t), y(t)] of the contour of the pattern. This curve is closed and therefore the functions x(t) and y(t) are periodic. Features extracted from the boundary can easily be normalized with respect to translation, scaling and rotation. In addition, they have low sensitivity to noise. Of course, methods based on these features cannot be used if noisy gaps “open” the characters, or if the characters are not connected.
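As a concrete sketch of the boundary-following step just described, the snippet below traces the ordered contour of a connected component using Moore-neighbour tracing. The function name and the simplified stopping rule (stop as soon as the start pixel is revisited) are our own choices, not the paper's.

```python
import numpy as np

# Moore neighbourhood in a fixed circular order: (dy, dx)
NEIGHBOURS = [(0, -1), (-1, -1), (-1, 0), (-1, 1),
              (0, 1), (1, 1), (1, 0), (1, -1)]

def trace_boundary(bitmap):
    """Return the ordered closed contour [(y, x), ...] of the first
    connected component found in a binary bitmap (Moore-neighbour tracing)."""
    H, W = bitmap.shape
    start = None
    for y in range(H):              # raster scan for the first set pixel
        for x in range(W):
            if bitmap[y, x]:
                start = (y, x)
                break
        if start is not None:
            break
    if start is None:
        return []
    contour = [start]
    current = start
    prev_bg = (start[0], start[1] - 1)   # we entered `start` from the west
    while True:
        # index of the direction pointing back at the last background pixel
        b = NEIGHBOURS.index((prev_bg[0] - current[0], prev_bg[1] - current[1]))
        for k in range(1, 9):            # sweep the ring, starting past backtrack
            d = (b + k) % 8
            ny = current[0] + NEIGHBOURS[d][0]
            nx = current[1] + NEIGHBOURS[d][1]
            if 0 <= ny < H and 0 <= nx < W and bitmap[ny, nx]:
                # the neighbour examined just before this hit is background
                prev_bg = (current[0] + NEIGHBOURS[(d - 1) % 8][0],
                           current[1] + NEIGHBOURS[(d - 1) % 8][1])
                current = (ny, nx)
                break
        else:
            return contour               # isolated pixel, nothing to follow
        if current == start:             # back at the start: contour is closed
            return contour
        contour.append(current)
```

From the returned contour one obtains the periodic coordinate functions x(t) and y(t) directly as the second and first components of the pixel sequence.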

2. Extraction from the HV-projections. From the bitmap b(x,y), 0 ⩽ x ⩽ W−1, 0 ⩽ y ⩽ H−1, the following two functions are computed: h(y) = Σ_{i=0}^{W−1} b(i,y) and v(x) = Σ_{j=0}^{H−1} b(x,j). h and v are called the horizontal and vertical projections of b, respectively. Features computed from h and v cannot be normalized to rotation, nor can the shapes of the characters be reconstructed from h(y) and v(x).
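The two sums above are plain row and column totals of the bitmap; a minimal sketch (the function name is ours):

```python
import numpy as np

def hv_projections(b):
    """Horizontal and vertical projections of a binary bitmap b[y, x]:
    h(y) = sum_i b(i, y) over columns i, v(x) = sum_j b(x, j) over rows j."""
    h = b.sum(axis=1)   # row sums: horizontal projection h(y)
    v = b.sum(axis=0)   # column sums: vertical projection v(x)
    return h, v
```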

3. Pattern profiles. A symbol can be described in terms of its four profiles: left L(y), right R(y), top T(x), and bottom B(x). The profiles give a measure of the variations of the shape on each side of the character. For the bitmap b(x,y), 0 ⩽ x ⩽ W−1, 0 ⩽ y ⩽ H−1, the left profile L(y) may be defined by: L(y) = x if b(x,y) is the first set pixel on row y, and L(y) = W if there is no set pixel on row y. Fig. 1 shows the left and right profiles of the character `R'. The profile functions are very sensitive to noise, especially if it is present all over the bitmap, and profiles cannot capture information about the interior of the pattern.
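The left and right profiles of that definition can be sketched as follows (the function name and the choice of −1 as the "empty row" sentinel for the right profile are our assumptions; the left profile follows the definition above exactly):

```python
import numpy as np

def left_right_profiles(b):
    """Left profile L(y): column index of the first set pixel on row y,
    or W if the row is empty. Right profile R(y): index of the last set
    pixel on row y, or -1 (our sentinel) if the row is empty."""
    H, W = b.shape
    L = np.full(H, W)    # default: no set pixel on the row
    R = np.full(H, -1)
    for y in range(H):
        xs = np.flatnonzero(b[y])   # columns of the set pixels on row y
        if xs.size:
            L[y], R[y] = xs[0], xs[-1]
    return L, R
```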

4. Direct extraction from the bitmap. No special remark needs to be made about this source; everything depends on the features that are extracted.

5. Extraction from gray-level rasters. The four previous schemes are used in bitmaps. Recently some methods have been described that use gray-level rasters instead. Variations in the gray-levels are used either directly as features or for generating functions from which features are computed.

6. Structural features. Structural features aim at capturing the essential shape information of the characters, generally from their skeletons, and sometimes from their contours. The features include: loops, junctions, crossing and end points, concavity and convexity, arcs and strokes.

Pattern recognition requires the extraction of features from regions of the image and the processing of these features with a pattern classification algorithm. Many of the features used in our applications tend to be local in nature, which means their calculation requires a connected region of the image over which an average or other statistic is computed. In this paper, we present a novel approach to feature extraction in pattern recognition that utilizes a central projection transformation and combines wavelet and fractal theories. In particular, this approach reduces the dimensionality of a two-dimensional pattern by way of a central projection method and thereafter performs Daubechies' wavelet transform on the derived one-dimensional pattern to generate a set of wavelet transform sub-patterns, namely, curves that are non-self-intersecting. The divider dimensions are readily computed from the resulting non-self-intersecting curves. These divider dimensions constitute a new feature vector for the original two-dimensional pattern, defined using the curves' fractal dimensions. Once these feature vectors have been computed, we compare them to the training set by calculating the Euclidean distance between the different feature vectors; the smallest distance is taken as the match. Fig. 2 illustrates the overall approach.
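The matching step described above, assigning each pattern to the training vector at the smallest Euclidean distance, can be sketched as follows (the function name `classify` and the dictionary layout of the prototypes are our assumptions):

```python
import numpy as np

def classify(feature, prototypes):
    """Nearest-prototype classification: return the label of the training
    vector closest to `feature` in the Euclidean sense."""
    labels = list(prototypes)
    dists = [np.linalg.norm(feature - prototypes[k]) for k in labels]
    return labels[int(np.argmin(dists))]
```

For example, with one prototype feature vector per character class, an unknown character is assigned to whichever class prototype minimizes the distance.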


Dimensionality reduction based on central projection

Projection is a very basic and common operation that is used in pattern recognition and image processing. Projection refers to the mapping of a two-dimensional region of an image into a waveform whose values are the sums of the values of the image points. It is obtained by determining the number of black pixels that fall onto a projection axis. Projection profiles represent a global feature of a character (Jain et al., 1996). They play a very important role in character recognition.
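As an illustrative sketch of projecting around a centre, the snippet below casts rays from the pattern's centroid at sampled angles and counts the set pixels met along each ray, turning the two-dimensional pattern into a one-dimensional function of the angle. This is one plausible reading of the CPT; the ray discretisation and the function name `central_projection` are our assumptions, not the paper's exact construction.

```python
import numpy as np

def central_projection(b, n_angles=64):
    """From the centroid of the set pixels of bitmap b[y, x], walk outwards
    along n_angles sampled directions and count the set pixels met on each
    ray; the counts form a 1-D pattern f(theta)."""
    ys, xs = np.nonzero(b)
    cy, cx = ys.mean(), xs.mean()          # centroid of the black pixels
    H, W = b.shape
    rmax = int(np.hypot(H, W))             # longest ray we could ever need
    f = np.zeros(n_angles)
    for k in range(n_angles):
        t = 2 * np.pi * k / n_angles
        dy, dx = np.sin(t), np.cos(t)
        for r in range(rmax):              # walk outwards pixel by pixel
            y = int(round(cy + r * dy))
            x = int(round(cx + r * dx))
            if not (0 <= y < H and 0 <= x < W):
                break
            f[k] += b[y, x]                # nearby r may revisit a pixel;
    return f                               # acceptable for a sketch
```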

Multiresolution analysis and wavelet decomposition

In the preceding section, we have shown how to effectively transform two-dimensional patterns into one-dimensional ones with the central projection transformation. In the present section, we shall apply the wavelet transform and multiresolution analysis to the derived one-dimensional patterns to produce wavelet transform sub-patterns. The wavelet transform also aids in boundary identification. Multiresolution analysis (MRA) was first published in 1989 by Mallat, and the advanced research and
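A minimal sketch of one decomposition level with the Daubechies-4 filters named above (the periodic boundary handling and the function name `dwt_level` are our choices; the paper's decomposition depth and boundary treatment are not reproduced here):

```python
import numpy as np

# Daubechies-4 analysis filters
s3 = np.sqrt(3.0)
h = np.array([1 + s3, 3 + s3, 3 - s3, 1 - s3]) / (4 * np.sqrt(2.0))  # low-pass
g = np.array([h[3], -h[2], h[1], -h[0]])                             # high-pass

def dwt_level(x):
    """One level of the periodised Daubechies-4 transform: split the 1-D
    pattern x (even length) into approximation and detail sub-patterns."""
    x = np.asarray(x, dtype=float)
    N = len(x)
    # each output sample n draws on x[2n .. 2n+3], wrapped periodically
    idx = (2 * np.arange(N // 2)[:, None] + np.arange(4)) % N
    approx = (x[idx] * h).sum(axis=1)
    detail = (x[idx] * g).sum(axis=1)
    return approx, detail
```

Because the periodised transform is orthogonal, the energy of the input is preserved across the two sub-patterns, which makes repeated application to the approximation band straightforward.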

Computing divider dimension of one-dimensional patterns

The fractal dimension is a useful means of quantifying the complexity of the feature details present in an image. In this section, we shall discuss the problem of computing the divider dimension of these curves and thereafter use the computed divider dimensions to construct a feature vector for the original two-dimensional pattern for pattern recognition.

To date, there is no commonly accepted definition of a fractal, but it is clear that a fractal differs in many ways from Euclidean
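As a hedged sketch of the dimension-estimation idea, the snippet below applies plain box counting to a sampled planar curve: count the occupied boxes N(eps) at several scales and fit the slope of log N against log(1/eps). This is a stand-in for, not a reproduction of, the paper's modified box-counting/divider-dimension procedure; the function name and scale list are ours.

```python
import numpy as np

def box_counting_dimension(points, eps_list):
    """Estimate the box-counting dimension of a curve given as an (n, 2)
    array of sample points. N(eps) is the number of grid cells of side eps
    that contain at least one point; the dimension is the slope of
    log N(eps) versus log(1/eps)."""
    counts = []
    for eps in eps_list:
        # quantise each point to its grid cell and count distinct cells
        boxes = {tuple(q) for q in np.floor(points / eps).astype(int)}
        counts.append(len(boxes))
    slope, _ = np.polyfit(np.log(1.0 / np.asarray(eps_list)),
                          np.log(counts), 1)
    return slope
```

A straight line segment, for instance, should come out with a dimension close to 1, while more convoluted curves yield larger values.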

Experiments

This section presents the procedure as well as the results of our experiments that aim at recognizing a set of two-dimensional patterns.

Conclusions

In this paper, the notion of feature extraction with wavelet and fractal theories is presented as a powerful technique in pattern recognition. We have investigated the utility of several emerging techniques to extract features. This novel method of feature extraction utilizes the CPT to describe the shape, the wavelet transformation to aid in boundary identification, and fractal features to enhance image discrimination. Its essential advantage is that it can be used to

Acknowledgements

This work was supported by research grants received from the Research Grant Council (RGC) of Hong Kong and a Faculty Research Grant (FRG) from Hong Kong Baptist University.

References (13)

  • C.K. Chui, Wavelets – A Mathematical Tool for Signal Analysis (1997)
  • T. Crimmins, A complete set of Fourier descriptors for two-dimensional shapes, IEEE Trans. Syst., Man Cybernet. (1982)
  • I. Daubechies et al., Introduction to the special issue on wavelet transforms and multiresolution signal analysis, IEEE Trans. Inform. Theory (1992)
  • G.A. Edgar, Measure, Topology and Fractal Geometry (1990)
  • K.L. Falconer, The Geometry of Fractal Sets (1985)
  • K. Falconer, Fractal Geometry: Mathematical Foundations and Applications (1990)
