Abstract

Manifold learning based dimensionality reduction algorithms have been payed much attention in plant leaf recognition as the algorithms can select a subset of effective and efficient discriminative features in the leaf images. In this paper, a dimensionality reduction method based on local discriminative tangent space alignment (LDTSA) is introduced for plant leaf recognition based on leaf images. The proposed method can embrace part optimization and whole alignment and encapsulate the geometric and discriminative information into a local patch. The experiments on two plant leaf databases, ICL and Swedish plant leaf datasets, demonstrate the effectiveness and feasibility of the proposed method.

1. Introduction

Plant recognition based on leaf images plays an important role in agricultural informatization, ecological protection, and automatic plant recognition system. One of the most important steps in the image based plant recognition is to validly extract classifying features. Currently, the commonly employed classifying features for plant recognition based on leaf image could be categorized into color, shape, and texture features [13]. Plant leaf classification is a challenging problem due to its high dimensionality data, complexity, and irregular shape of plant leaf images [46]. Traditional dimensionality reduction methods typically have a smaller data space from linear combinations of the original data. The most common example is principal component analysis (PCA), which seeks a low-dimensional linear subspace spanned by the eigenvectors which correspond to the largest eigenvalues of the covariance matrix of all the samples. However, for plant leaf images, the assumption of global linearity is a severe constraint because the leaf images are quite sensitive to seasonality, location, and illumination conditions. Thus, it is not reasonable to believe that the leaf image data could be linearly separable from each other.

Manifold learning has been utilized in many applications such as pattern recognition, visualization, and classification tasks. In the last ten years, many manifold learning nonlinear algorithms have been introduced with an assumption that the processed data lies on or close to some low-dimensional manifolds which are embedded in a high-dimensional unorganized Euclidean space. In these manifold learning algorithms, the most representative ones are isometric feature mapping (ISOMAP) in [7], locally linear embedding (LLE) in [8], Laplacian eigenmaps (LE) in [9], Hessian-based locally linear embedding (HLLE) in [10], maximum variance unfolding (MVU) in [11], local tangent space alignment (LTSA) in [12], local spline embedding (LSE) in [13], and local discriminative tangent space alignment (LDTSA) in [14]. One of the most important advantages of manifold learning [714] compared with conventional methods is how the data are treated mathematically. Manifold learning methods allow the data to be related nonlinearly, which leads to the fact that manifold learning methods can much more accurately capture the proper structures among the data, thus allowing for accurate recognition. For every manifold learning algorithm, it tries to preserve a different geometrical property of the underlying manifold. Local methods such as LLE, HLLE, LE, LTSA, and LSE try to preserve the neighborhood structure in the data, while global methods like ISOMAP aim to preserve the metrics at all scales. Thanks to their nonlinear nature, geometric intuition, and computational feasibility, these nonlinear methods have promising results on some artificial and real-world datasets. In [15] a framework, which is called patch alignment, was proposed and it consists of two stages: part optimization and whole alignment. In this paper, we take an alternative view of the framework to introduce an efficient method based on local discriminative tangent space alignment (LDTSA) for plant leaf recognition. Compared with current plant leaf recognition methods, the proposed one can avoid the small sample size problem, preserves the discriminative capability, and detects the intrinsic structure from the plant leaf image data.

The paper is organized as follows: Section 2 briefly describes the dimensionality reduction algorithm based on local discriminative tangent space alignment and its procedures. Experiments on plant leaf database are offered in Section 3 and the paper is ended with some conclusions in Section 4.

2. Local Discriminative Tangent Space Alignment Algorithm

Suppose original labeled data points , including all the samples , . The objective of a dimensionality reduction algorithm is to compute the corresponding low-dimensional representations of , , , where . For the linear dimensionality reduction, it is necessary to find projection matrix , such that . For the nonlinear dimensionality reduction, it is usually difficult to provide an explicit mapping to transform data from a high-dimensional space to a low-dimensional subspace. For classification task, in part optimization stage we always hope to project the high-dimensional data into a low-dimensional feature space, in which the projection is characterized by within-class compactness and between-class separability [15]. Assume that there is an interaction force between any pairwise points in the ambient space; the mutual force can be distinguished as within-class attraction or between-class repulsion between any pairwise points from the same or different class, respectively (see Figure 1) [16].

In the reduced subspace, in order to achieve within-class attraction for data point , the following objective function is defined aswhere is the number of the nearest neighbors with respect to from data points in the same class as .

In order to achieve between-class separability purpose for the data point , the following objective function is defined as

LTSA uses tangent coordinates to indicate the local geometry. Assume that there is an affine projection matrix, which projects tangent coordinates to the low-dimensional coordinates in a local patch which contains neighbors from both the same and different classes. To obtain the optimal tangent coordinates, we have the following objective function on each patch:where denotes the centralization matrix; ; is identity matrix; and is the tangent coordinates corresponding to an orthonormal basis matrix of the tangent space.

Since the patch formed by the local neighborhood can be considered approximately linear, we write the part discriminator by using the linear manipulation as follows:where and are scaling factors to unify different data points of the within-class distance and the between-class distance and they are selected based on experiments.

Then objective function (4) can be rewritten by patch alignment:where where denotes the matrix of right singular vectors of corresponding to its largest values; and is a coefficient vector.

In (5), the first two parts only involve the data points belonging to within-class neighbors and they share the same selection matrix . The third part concerns the between-class neighbors and uses selection matrix .

Then (5) can be reformed to the following:where and are selection matrix and

In summary, the main procedure of the proposed algorithm for the plant leaf image classification task can be described as follows.

Step 1. Select representative labeled plant leaf image samples to which the following dimensionality reduction will be done.

Step 2. For each point , find its within-class neighborhood set with elements and between-class neighborhood set with elements.

Step 3. Generate two vectors and , with their elements from and , respectively.

Step 4. Construct with within-class neighbors, then centralize the neighbors and compute the top- eigenvector from the autocorrection matrix, and then construct and record the selection matrix .

Step 5. Construct with between-class neighbors, and record the selection matrix .

Step 6. Generate global with patch alignment through respective selection matrix.

Step 7. Perform eigendecomposition on and the eigenvectors form projection matrix .

Step 8. Final reduced dimensionality results are .

3. Experiment Results

3.1. Experiment Results on ICL Dataset

ICL leaf database has 17032 plant leaf images of 220 species and image number of each class is unequal [17]. In order to verify the effectiveness of the proposed method in this paper, we construct one leaf image subset from the ICL leaf dataset, which has 15 species with 11 samples per species, and all classes are carefully chosen so that the shapes could be distinguished easily by human eyes or the shapes are similar but still can be identified [16]. Some typical example images are demonstrated in Figures 2(a), 2(b), and 2(c). Preprocessing is performed to crop all leaf images from two databases. The illuminations keep the same condition and the backgrounds are white, and the size of each cropped leaf image in experiments is 64 × 64 pixels, with gray level of 256 gray levels per pixel in preprocessing step, as demonstrated in Figure 2(c) of one species.

Then, every image is represented in a 4096-dimensional vector in the image space. By prereducing by PCA, 98 percent image energy is kept and all principal components are selected corresponding to the nonzero eigenvalues for each method. The 1-NN classifier is employed to classify leaf images for its simplicity. The distance measure is Euclidean distance.

The leaf image dataset is randomly separated into two subsets: one part is for training (sizes are 30, 45, 60, 75, 105, and 120) and the other is for testing purpose. The training sets are used to obtain the low-dimensional subspace with a projection matrix. The testing sets are utilized to test the final classification accuracy. Each time the test is repeated 20 times and the accuracy rate is calculated each time, as follows:where is the right sample number detected and is the total sample number tested.

Table 1 shows the average classification rates and standard deviations of three algorithms in our experiments on the selected datasets which are locality sensitive discriminant analysis (LSDA), linear local tangent space alignment (LLTSA), and the proposed LDTSA. It can be seen that the proposed method obtains better accuracy.

3.2. Experiment Results on Swedish Dataset

Swedish leaf dataset [18] has 1125 images from 15 different plant species, with 75 leaf images per species. The preprocess of the leaf image is the same as ICL dataset [16]. For each method, random subsets with 20, 40, and 60 images per species are selected for training, the rest for testing. Such experiment with a specific number is independently performed 20 times, and then the best average classification results are recoded. Table 2 shows the maximal average classification accuracy with different size of training sets and test sets. It could be found that the proposed method outperforms the other algorithms in all the cases.

4. Conclusions

Plant recognition based on leaf images has been an important and difficult research topic, especially for leaves with different and complicated shapes. Although there are many existing algorithms for plant leaf recognition, the recognition rates are still low due to the complexity of plant leaf. Manifold learning based dimensionality reduction algorithms are promising alternatives to traditional plant leaf recognition methods. A dimensionality reduction method based on local discriminative tangent space alignment (LDTSA) is proposed for plant leaf recognition task in this paper, and it embraces part optimization and whole alignment and encapsulates the geometric and discriminative information into a local patch. The experiment performed on two plant leaf databases shows the effectiveness and feasibility of the proposed method in this paper.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The paper was supported by Tianjin Research Program of Application Foundation and Advanced Technology 14JCYBJC42500 and Tianjin City High School Science & Technology Fund Planning Project 20140802. It was partly funded by the young academic team construction projects of the “twelve five” integrated investment planning in Tianjin University of Science and Technology and the 2015 key projects of Tianjin Science and Technology Support Program no. 15ZCZDGX00200. This work was also supported by the National Natural Science Foundation of China under Grant no. 61502338 and the Open Fund of Guangdong Provincial Key Laboratory of Petrochemical Equipment Fault Diagnosis no. GDUPTKLAB201334.