Terrain classification of polarimetric synthetic aperture radar imagery based on polarimetric features and ensemble learning

Abstract. An evolutionary classification system for terrain classification of polarimetric synthetic aperture radar (PolSAR) imagery based on ensemble learning with polarimetric and texture features is proposed. Polarimetric measurements cannot produce sufficient identification information for PolSAR terrain classification in some complex areas. To address this issue, texture features have been successfully used in image segmentation. The system classification feature has been adopted using a combination of Pauli features and the last principal component of Gabor texture-feature dimensionality reduction. The resulting feature combination assigned through experimental analysis is very suitable for describing structural and spatial information. To obtain a good integration effect, the basic classifier should be as precise as possible and the differences among the features should be as distinct as possible. We therefore examine and construct an ensemble-weighted voting classifier, including two support vector machine models that are constructed using kernel functions of the radial basis and sigmoid, extreme learning machine, k-nearest neighbor, and discriminant analysis classifier, which can avoid redundancy and bias because of different theoretical backgrounds. An experiment was performed to estimate the proposed algorithm’s performance. The results verified that the algorithm can obtain better accuracy than the four classifiers mentioned in this paper.

In recent years, PolSAR research in remote sensing has become widespread because a greater amount of information on scattering objects is found in PolSAR data than in single polarization and double polarization data. 15,16PolSAR data depict the state changes of microwave polarization, which is produced by the dielectric constant and terrain structure.Therefore, terrain features can be obtained from the PolSAR data. 1 Moreover, terrain classification of PolSAR imagery is considered a typical PolSAR application, which employs features and classifiers to separate different terrain types.
Currently, PolSAR image classification has been successfully applied to many practical problems.Related information processing techniques are being continuously developed, such as *Address all correspondence to: Chuanbo Huang, E-mail: huangchuanbocq@126.comimage filtering, feature extraction, and classification algorithms.PolSAR can measure each observation terrain using a full scattering matrix.Its feature distribution complexity often results in scattered signals of different terrains with similar features, which increases the difficulty of discriminating information extraction.Many algorithms have been proposed 4,5 for polarimetric data analyses, such as eigenvalue analysis, polarization decomposition, and so on.
Feature extraction and the classifier structure are two important factors that affect the final classification result.Pixel-based PolSAR terrain classification specifies the category of each pixel in the feature space.These features indicate the pixel characteristics and can be Pauli features, grayscales, textures, and so on.They must be extracted and used as input for the terrain discrimination.Different objects have different surface features.Therefore, the PolSAR image textural features have location discontinuities in the properties of adjacent image regions.Thus, they can be considered features for classification.Because the texture features contain rich identification information, they are important for classification. 17everal approaches exist for extracting the texture features, such as the gray-level co-occurrence matrix (GLCM), wavelets, Gabor filters, and local binary patterns (LBPs).Furthermore, textural features have been utilized for object detection and terrain classification. 18,19An efficient use of the Gabor filter was demonstrated by Leone and Distante 20 for terrain detection in a textured scene.The LBPs are utilized to evaluate the similarity of neighboring image regions. 21onetheless, GLCM remains a more popular method. 22More effective terrain classification performance can be achieved by combining different textures, and the resulting discrimination performance is significantly improved.
Many classification methods exist, such as the classical maximum likelihood, 23 neural networks, 24 and the decision tree methods. 25In synthetic aperture radar (SAR) image classification and segmentation, the statistical distribution model has been commonly used.Among these models, the Markov random field (MRF) 26 is an efficient approach for image classification.However, in addressing nonstationary SAR images, MRF has difficulty obtaining improved results.Meanwhile, the support vector machine (SVM) [27][28][29] is a supervised method based on the pixel level.As a discriminative model, SVM directly employs training sample modeling based on the structural risk minimization criterion.It can thereby handle the linear inseparability problem by introducing the kernel function.Moreover, it does not have local minima because it solves the convex quadratic programming problem.Nonetheless, it does not consider spatial information.
The classifier of the naive Bayes method is a probabilistic classifier based on the independence assumption of Bayes' theorem.Meanwhile, the k-nearest neighbor (KNN) algorithm, proposed by Cover and Hart, 30 has high precision and is not sensitive to outliers.In addition, discriminant analysis is a renowned statistical recognition method that has been successfully applied to category prediction. 31Recently, neural networks have been introduced for image classification on account of their ability to approximate nonlinear functions in the input space. 32,33oreover, the extreme learning machine (ELM) 34 is a feedforward neural network with a single hidden layer.It has strong generalization performance and a very fast classification speed.
With consideration of the above approaches, we herein present and discuss terrain classification of PolSAR imagery based on ensemble learning with Pauli and Gabor features.The main contributions of this paper are the following.Polarimetric and texture features are combined for PolSAR image classification.We utilize an efficient method for PolSAR image feature representation.Our method adopts an ensemble learning strategy and selects five base classifiers to form an ensemble classifier.These classifiers are not only more accurate but also distinct from each other.The weight of each base classifier is obtained by a differential evolution (DE) algorithm.
The remainder of this paper is organized as follows.Pixel feature representation is introduced in Sec. 2. In Sec. 3, the classification system construction is presented.We provide and analyze the simulation results of the experiment in Sec. 4. In Sec. 5, our conclusions and future work are discussed.
2 Pixel Feature Representation

Texture Features
The image texture is determined based on the differences in spatial distributions in the image, specifically the frequency and intensity of each pixel.In this study, we examine the use of enhanced discriminatory information to improve classification accuracy.We therefore introduce texture features for classification.The Gabor transform has been widely applied in image processing.The texture features are derived by convolving the image and Gabor filter banks.A set of two-dimensional (2-D) Gabor filters is employed to convolve the image.A 2-D Gabor function is formed by a sinusoidal plane wave with a certain frequency and direction.It is modulated by a 2-D Gaussian function. 35In the spatial domain, the Gabor filter is formulated as E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 1 ; 1 1 6 ; 6 2 1 gðx; yjλ; θ; σ; ψ; γÞ where γ represents the spatial aspect ratio and specifies the elliptical support of the Gabor function.In addition, ψ represents the phase shift in degrees; σ represents the Gaussian standard deviation, which specifies the receptive field size; θ denotes the normal direction of the Gabor function parallel stripes; and λ is the cosine factor wavelength.Moreover, y 0 ¼ y cosðθÞ − x sinðθÞ, x 0 ¼ x cosðθÞ þ y sinðθÞ, and f ¼ 1∕λ represent the cosine factor frequency in the spatial domain.
The ratio σ∕λ specifies the bandwidth of the spatial frequency of simple cells.The bandwidth, b, (in octaves) of the half-response spatial frequency and ratio σ∕λ are given as follows: 36 E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 2 ; 1 1 6 ; 4 8 5 where ψ ¼ 90 deg and ψ ¼ 0 deg, respectively, return the imaginary part and real part of the Gabor filter.The real part is equivalent to an even symmetric filter; therefore, we use the real part of the Gabor filter.Orientation parameter θ is, respectively, assigned as 0, 30, 60, 90, 120, and 150 deg.The frequency values can be described as follows: E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 4 ; 1 1 6 ; 3 4 0 E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 5 ; 1 1 6 ; 2 9 5 where i ∈ f1; 2; : : : ; log 2 ðN c ∕8Þg, and N c is the number of image columns.Note that 0.25 ≤ F H ðiÞ < 0.5 and 0 < F L ðiÞ < 0.25.Bandwidth b of the Gabor filter is set to one octave in this study.
The Gaussian smoothing function is used to perform feature extraction.It is defined as follows: E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 6 ; 1 1 6 ; 1 9 5 gðx; yÞ Note that we apply a bicubic interpolation method to restore the image size.Finally, we obtain the 96-dimensional Gabor feature.
Principal component analysis (PCA) is an important method of identifying data patterns.PCA can reduce the dimensions without losing excessive information.It is thus often applied to image compression.We therefore employed PCA to reduce the Gabor feature dimensions to five.Figures 1(a)-1(f) show the original grayscale imagery, and the first, third, second, fourth, and fifth principal component imagery, respectively, which were obtained by the PCA reduction of the Gabor feature dimensions.

Polarimetric Features
The classification results of polarimetric feature extraction by the Pauli decomposition method are more stable and effective in polarimetric decomposition methods.Therefore, Pauli features were utilized to extract information on the terrain scattering mechanism.Analysis of PolSAR image data is generally based on a matrix.The complex scattering matrix corresponding to the scattering terrain can be measured by PolSAR with different polarizations. 7Four combinations exist for reception and transmission polarizations, namely, horizontal-horizontal (HH), horizontal-vertical (HV), vertical-vertical (VV), and vertical-horizontal (VH).Scattering matrix S characterizes the relations between the polarization states of the microwaves scattered and those received by the terrains.The general form of the scattering process can be given by E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 7 ; 1 1 6 ; 2 9 0 where E r H;V and E t H;V , respectively, denote the received and transmitted electric fields corresponding to the polarization types.The matrix elements are obtained by S HV ¼ jS HV j expðiϕ HV Þ.
In Eq. ( 7), the parameters jS HH j, jS HV j, jS VV j and ∅ HV , ∅ VV are, respectively, the amplitudes and the phases.PolSAR can obtain the information of the scattering terrains from the five parameters. 37The formula eigenvalue analysis forms are generally provided based on coherency matrix T 3 .The coherency matrix is represented by the Pauli feature vector as follows: E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 8 ; 1 1 6 ; 1 7 1 The Pauli feature is used in this study.The Pauli feature vector is given by E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 0 9 ; 1 1 6 ; 1 2 6 With consideration of the correlations between these descriptors and their ability to represent actual terrain imagery, we chose a reasonable combination of features through experiments for terrain classification.

Classification System Construction
The primary goal of constructing the ensemble classifier is to increase the classifier capability by combining the superior performance of the various base learners.In general, to obtain effective integration, the base classifiers should be as precise and distinct as possible.This requirement was shown by Krogh and Vedelsby. 38In this study, to construct an effective classifier, we explore and construct an ensemble-weighted voting classifier, including two SVMs, which are formed by the two kernel functions of radial basis function (RBF) and sigmoid, as well as ELM, KNN, and the discriminant analysis classifier.These classifiers have lower computational complexity and different theoretical backgrounds and can avoid redundancies and biases.
SVM is a supervised learning model with an associated learning method that analyzes the classification data.The SVM algorithm first establishes a model by training samples.Using this model, the new test samples are classified.It is a nonprobabilistic classifier.Intuitively, a good classification result of the SVM classifier is realized by the optimal separating hyperplane.In general, the larger the margin is, the lower the classification error rate is.SVM can efficiently find nonlinear solutions using the "kernel trick."Then different types of SVM classifiers are formed by various kernel functions, including RBF, sigmoid, linear, and polynomial functions.
An effective learning method, ELM, is a feedforward neural network with one hidden layer.In this method, the weights of the hidden layer nodes are stochastically selected.The weights of the output layer are slated by the least-squares method.ELM can be derived from randomly generated hidden neurons and independent training data.It offers significant advantages over conventional neural network learning algorithms, including ease of implementation, a fast learning speed, and minimal need for human intervention.The pseudocode for ELM is given in Table 1.A more detailed introduction to ELM can be found in Ref. 34.
Meanwhile, KNN is an instance-based learning algorithm.The output is a member of the class in the KNN classification.A sample is classified according to the majority of votes of its adjacent samples, with the sample being specified in the category of its KNN samples.In addition, discriminant analysis is a method for learning identification features.It has been applied to predict the category to which a subject belongs.There are two basic steps in discriminant analysis.The first step is to estimate the weight factors, which can be used to characterize the attributes of the known samples and compute the measurements of their trends.In the second step, information is applied to create the decision-making rule that ensures some threshold for forecasting.
We designed the presented classification model based on the requirements of PolSAR terrain classification.We identified the favorable weights of all base classifiers using the approaches presented in Ref. 39.The main steps are given in Table 2.
The scheme of the classification method is illustrated in Fig. 3.The implementing procedure of this methodology is described as follows.Step 1: Extract textural features based on the Gabor transform and obtain Pauli features from the PolSAR image data.The most suitable texture feature is selected through experiments.The combination of the selected texture features and Pauli features is used for the classification systems.
II. Set the generation iterator G ¼ 0.
While the stopping criterion is not satisfied III. Randomly select the indices r 1 , r 2 , and r 3 , which are different from each other and from index i.
IV. Calculate a mutated vector V i;G using I.
V. Obtain random number j rand .

End for
VII. Calculate the fitness of X i;G and T i;G using a ten-fold crossover operation, and update X i;Gþ1 of the next generation (G þ 1) using V.

VIII. Update generation iterator
End while N×m .
Step 3: Optimize the weights of each individual classifier to obtain an ensemble classifier.
Search the optimal weights by DE.The algorithmic steps of optimizing the weights are given in Table 2. Details are provided in Ref. 39.
Step 4: Provide the prediction category label, L n , of the weighted voting for each sample, n.The operation is as follows: E Q -T A R G E T ; t e m p : i n t r a l i n k -; e 0 1 0 ; 1 1 6 ; 3 3 0 where g ji is the binary variable.If the i'th base classifier classifies sample n in the j'th category, then g ji ¼ 1; otherwise, g ji ¼ 0. In addition, ω i is the weight of the i'th base classifier of the ensemble classifier.

Experiments and Discussion
To verify the performance of the proposed PolSAR image classification scheme, we adopted airborne PolSAR data of San Francisco Bay.The PolSAR data were presented with no header in the STK-MLC format with 900 rows × 1024 columns.They provided coverage of three classes: urban areas, ocean, and vegetation [Fig.4(a)].First, we selected five groups of 20 × 20 pixels from each category and composed the initial training sample set.Each class had 2000 training samples.The building unit block size is shown in Fig. 4(b).We determined the testing accuracy rate to more accurately compare the various methods.Owing to the lack of a real and reliable terrain map, we selected 112,500 pixels that could be specified as the labels of test samples.The image-marked test samples are shown in Fig. 4(c).In the DE algorithm, the choice of DE parameters can significantly influence the optimal performance.For simplicity, we set factor F ¼ 0.5, crossover rate ¼ 0.9, population

Experiment 1
To achieve feature selection, we compared the effectiveness in different composite approaches with polarimetric and Gabor features using the classifier based on discriminant analysis.This classifier is a highly efficient algorithm for building image classifiers.We used the "linear" discriminant type in the experiment.We randomly selected 100 samples from the initial training sample set as the training sample set.3, where textural features change the classification accuracies on the three terrain areas.
The single principal component feature of the Gabor feature reduced in dimension by PCA is important in distinguishing the different terrain pixels.To elucidate this importance, we considered different composition schemes of Pauli features and five principal component features of the Gabor feature reduced in dimension by PCA for each pixel.Figure 5 shows the terrain classification results using different composite approaches with Pauli features and Gabor features when the dimensionality of the Gabor feature is reduced to 5. The classification effects with different combinations of features when the Gabor feature is reduced in dimension are shown in Table 3.
Based on the data in Table 3, it is possible to assess the distinction ability of each composition feature.The table shows that the composition of Pauli features and the final principal component feature of the Gabor feature reduced in dimension by PCA has the best classification results (accuracy: 97.5867%).In addition, a regular pattern shows that the classification accuracies are all the same for the composite approach with Pauli features and the final principal component feature of the Gabor feature reduced in dimension.

Experiment 2
We compared the performances of eight classifiers, including the five basic classifiers, namely, ELM, KNN, discriminant analysis, and the two SVM classifiers, and the three ensemble classifiers, specifically AdaBoostM2, random forest, and the proposed method.In view of the previous experimental results, we adopted the combination of Pauli features and the fifth principal component feature of the Gabor feature reduced to five dimensions by PCA.Each pixel was represented by the four-dimensional feature vector, which was comprised of the three-dimensional Pauli feature vector and one-dimensional Gabor feature.
We randomly selected 300 samples from the initial training sample set as the training sample set.According to the experimental results (Table 4), k ¼ 5, and the discriminant analysis classifier is designated as a linear discriminant type.The AdaBoostM2 algorithm randomly selected an ensemble of 400 trees and used the default tree options.Random forest chose 500 as n (the number of trees).We applied fivefold cross-validation on the training set to adjust the SVM parameters.
The classification maps of the eight classifying methods are shown in Fig. 6.To obtain an improved predictive accuracy measure, we compared these methods using the average of the five estimates for the ELM classifier, random forest classifier, and the proposed classification method.We employed the best estimate value for the KNN classifier, discriminant analysis classifier, and AdaBoostM2 algorithm when k ¼ 5; the discriminant analysis classifier was linear, and the AdaBoostM2 algorithm randomly selected an ensemble of 400 trees.Table 4 shows the precision values of the eight classifiers.Note: (1) k denotes the number of nearest neighbors.(2) m specifies the discriminant type: m ¼ 1, "linear;" m ¼ 2, "quadratic;" m ¼ 3, "diagQuadratic;" m ¼ 4, "diagLinear;" m ¼ 5, "pseudoQuadratic."(3) n denotes the number of trees arbitrarily chosen for AdaBoostM2.(4) Accuracy (h times) denotes the accuracy of the first h experiments (h ¼ 1, 2, 3, 4, 5).
Figure 6 and Table 4 show that the eight classifiers can better distinguish the uniform regions corresponding to the primary scattering class, such as the urban areas, vegetation, and ocean.The common limitation of these classifiers is that they cannot effectively distinguish vegetation regions from urban areas.In Table 4, the overall precision of the eight classifiers is compared using the training and testing areas (Fig. 4).For the individual classifier, the classification accuracies of the ELM and discriminant analysis classifiers are higher.Generally speaking, the  proposed ensemble method outperforms the AdaBoostM2 algorithm with a high accuracy, and it has the best prediction accuracy (98.2305%) of all eight methods.Hence, by combining general machine learning and artificial intelligence classification methods, the terrain classification accuracy of the full PolSAR image can be improved to a certain extent.

Conclusion
In this paper, an evolutionary classification system for terrain classification of PolSAR imagery based on ensemble learning with polarimetric and texture features was proposed.For supervised terrain classification of PolSAR imagery, our classification approach is based on a weighted voting ensemble using the composite of Pauli features and the fifth principal component feature of the Gabor feature reduced in dimension by PCA.The terrain classification accuracy of PolSAR imagery heavily relies on the image features.Our approach thus leverages the texture features in addition to the original polarimetric features.
The Gabor transform method is a standard technique for extracting texture features from remote sensing images.To identify the discrepancies between neighboring pixels, we analyzed different Gabor transform features that are reduced to different dimensions by PCA in different composite approaches with polarimetric features.The most effective combination of features was selected for PolSAR imagery terrain classification.
The proposed method can effectively classify terrain in PolSAR imagery.This is because the five base classifiers for combining the ensemble classifier are distinct from each other and have strong complementarity.Moreover, we can obtain each base classifier weight by a DE algorithm to automatically stop the computation.This capability provides considerable practicability for PolSAR image processing.
Experimental results confirmed that our approach consistently outperformed the existing approaches.Nevertheless, our method is limited in that the terrain classification accuracy may decrease for very complex scenes.The basic reason for this limitation is that the classification method employs a simple selection method of training samples that may be insufficient for providing a reasonably accurate classification for some multiple scenes.Consequently, in future work, we intend to explore a more appropriate algorithm for selecting training samples that can more precisely depict multiple scenes of PolSAR imagery.In addition, we will examine the combination of other known advanced information types to obtain a more accurate classification result.

Fig. 1
Fig. 1 Original grayscale image, obtained from the red/green/blue (RGB) composites of Pauli decomposition, and five principal component images: (a) original grayscale image; (b) first principal component image; (c) third principal component image; (d) second principal component image; (e) fourth principal component image; and (f) fifth principal component image.

Fig. 5
Fig. 5 Classification accuracy maps obtained by the discriminant analysis classifier using different features when the Gabor feature was reduced to five dimensions by PCA.(a) Pauli features; (b) Pauli features and the first principal component feature of the Gabor feature reduced in dimension by PCA; (c) Pauli features and the third principal component feature of the Gabor feature reduced in dimension by PCA; (d) Pauli features and the second principal component feature of the Gabor feature reduced in dimension by PCA; (e) Pauli features and the fourth principal component feature of the Gabor feature reduced in dimension by PCA; (f) Pauli features and the fifth principal component feature of the Gabor feature reduced in dimension by PCA.("Linear" specifies the discriminant type.)

Fig. 4
Fig. 4 Pseudocolor synthetic map of (a) Pauli decomposition, (b) training sample map, and (c) test sample map corresponding to the experimental dataset.

Table 2
Algorithmic steps for DE optimization.
Input: Parameters in DE: crossover rate (CR), variation coefficients F , and population scale N.

Table 1
Algorithmic steps for ELM.Number L of hidden layer nodes, training dataset D ¼ fðx i ; y i Þjx i ∈ R n ; y i ∈ R Randomly assign the weight vector connecting the i'th hidden node and the input nodes ω i .III. Randomly assign the threshold of the i'th hidden node b i .

Table 3
Classification accuracies of the discriminant analysis classifier in different compositions of features when the Gabor feature is reduced to different dimensions.[Accuracy (%); "linear" specifies the discriminant type.]