Multi-band PCA based ear recognition technique

Zarachoff, Matthew Martin; Sheikh-Akbari, Akbar; Monekosso, Dorothy

doi:10.1007/s11042-022-12905-0

Multi-band PCA based ear recognition technique

Open access
Published: 17 June 2022

Volume 82, pages 2077–2099, (2023)
Cite this article

Download PDF

You have full access to this open access article

Multimedia Tools and Applications Aims and scope Submit manuscript

Multi-band PCA based ear recognition technique

Download PDF

Matthew Martin Zarachoff ORCID: orcid.org/0000-0002-9868-8389¹,
Akbar Sheikh-Akbari¹ &
Dorothy Monekosso¹

1085 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

Principal Component Analysis (PCA) has been successfully applied to many applications, including ear recognition. This paper presents a Two Dimensional Multi-Band PCA (2D-MBPCA) method, inspired by PCA based techniques for multispectral and hyperspectral images, which have demonstrated significantly higher performance to that of standard PCA. The proposed method divides the input image into a number of images based on the intensity of the pixels. Three different methods are used to calculate the pixel intensity boundaries, called: equal size, histogram, and greedy hill climbing based techniques. Conventional PCA is then applied on the resulting images to extract their eigenvectors, which are used as features. The optimal number of bands was determined using the intersection of number of features and total eigenvector energy. Experimental results on two benchmark ear image datasets demonstrate that the proposed 2D-MBPCA technique significantly outperforms single image PCA by up to 56.41% and the eigenfaces technique by up to 29.62% with respect to matching accuracy on images from two benchmark datasets. Furthermore, it gives very competitive results to those of learning based techniques at a fraction of their computational cost and without a need for training.

Ear Recognition Using Block-Based Principal Component Analysis and Decision Fusion

Image Band-Distributive PCA Based Face Recognition Technique

PCA Based Face Recognition on Curvelet Compressive Measurements

1 Introduction

Ear recognition, a field within biometrics, concerns itself with the use of images of the ears to identify individuals. Much like fingerprints, ears are unique to an individual; even identical twins can have distinguishable ears [23]. Similar to images of the face, ear images can be captured from a distance, making them a useful biometric for security, surveillance, and other related purposes. Researchers have explored this topic extensively over the last two decades, investigating techniques for extracting features from ear images and their subsequent comparison [10, 28]. Successful feature extraction techniques in ear recognition and other biometrics include Principal Component Analysis- (PCA) [29,30,31,32, 37], wavelet-based [5, 13, 18, 25], Support Vector Machine (SVM) [4, 26, 27] and neural network-based and other [1, 2, 7, 9, 11, 15, 22, 24, 27, 33, 39, 40] methods. Amongst these techniques, PCA has been used for both feature extraction in the form of eigenvectors and dimensionality reduction. Several PCA based image classification and dimensionality reduction methods, including techniques for 3D and hyperspectral images have been reported in the literature [14, 16, 34, 35]. Research has shown that extended PCA based methods achieve greater performance to that of standard PCA in terms of computation costs, dimensionality reduction capability, memory usage, and classification accuracy. The application of single image PCA based ear recognition has been reported in the literature [32, 38]. However, the classification accuracy of these PCA based techniques is lower than that of the learning based techniques. Yet learning based techniques are computationally expensive, data dependent, and require extensive training data, which may not always be available. Consequently, there is demand for robust, low computation cost ear recognition techniques that are less data dependent while offering acceptable accuracy. Recent research on the application of extensions of PCA for hyperspectral image classification [35] has shown the potential of PCA based methods to deliver significantly higher classification and recognition accuracy at a much lower computational cost.

This paper investigates robust, low computation cost PCA based algorithms for ear recognition, built upon the successes of PCA based techniques in the field of hyperspectral image processing. To accomplish this, this paper examines methods of generating a hyperspectral-like image from an input grayscale image to increase matching accuracy of PCA based ear recognition techniques. This investigation has resulted in the development of a multi-band, single image PCA based ear recognition technique, called Two Dimensional Multi-Band PCA (2D-MBPCA). Initial experimental results for the proposed Two Dimensional Multi-Band PCA (2D-MBPCA) ear recognition algorithm were published in [36]. The published algorithm uses either the hill climbing optimization method, or equally splits the full gray scale image between the target numbers of the images. It then performs the standard PCA method on the resulting set of images, extracting their principal components as their features, which are used for recognition. The performance of the proposed algorithm was assessed using images of two benchmark datasets. Results show that the proposed 2D-MBPCA algorithm significantly outperforms PCA based matching algorithms. This paper presents an ear recognition technique, which unlike other PCA-based methods, does not require input images to be projected into a common eigenspace. Instead, the input image is divided into multiple images based on its pixels’ values, representing the data in a novel fashion that can then be subjected to PCA. In [36], the authors presented two methods called equal size and greedy hill climbing based partitioning techniques for generating multiple images from the input image. Further results, including a new partitioning method, called histogram based boundary calculation, which utilizes the histogram of a set of training images to determine the pixel value boundaries, are presented. The proposed technique then applies the standard PCA method on each resulting set of images of the input image to extract their principal components, which are used as features. These features are then used for recognition. In order to maximize the performance of the proposed technique, the intersection of the number of features and total eigenvector energy, which is empirically consistent with the matching performance of the proposed technique, is used as the optimal number of images to be generated from the input image. Experimental results on the images of two benchmark ear image datasets demonstrate that the proposed 2D-MBPCA technique greatly outperforms traditional PCA applied to single images and the well-known ‘eigenfaces’ technique [31], as well as providing very competitive results with those of the learning based techniques. Moreover, the computational burden of the proposed 2D-MBPCA algorithm has been evaluated and compared with other PCA based and learning based ear recognition techniques, demonstrating that 2D-MBPCA generates competitive results to learning based algorithms at a fraction of their computational cost. The proposed 2D-MBPCA can be used for other biometric techniques, such as iris recognition, which has been successfully demonstrated by Ghaffari et al. in [12], where the authors demonstrated significantly higher recognition performance to those of traditional techniques. In addition, the proposed algorithm can be applied to other biometric applications, including face recognition. Furthermore, the proposed algorithm has significant potential to be incorporated with other statistical or learning based recognition techniques and has the potential to improve their performance. The main contributions of this paper are: a) Development of a Two Dimensional Multi-Band PCA (2D-MBPCA) ear recognition technique; b) Generation of a hyperspectral like image cube from a single gray image; c) Use of three different methods called equal size, greedy hill climbing and histogram-based boundary calculation algorithm for generating multiple images from the input image; d) Using the intersection of the number of features and total eigenvector energy to determine the optimal number of images to be generated from the input image and experimentally verifying it.

The rest of the paper is organized as follows: Section 2 introduces the proposed 2D-MBPCA method and its image partitioning algorithms, Section 3 details the benchmark ear image datasets and presents the experimental results, and Section 4 concludes the paper.

2 2D multi-band PCA technique

In this section, a 2D Multi-Band PCA (2D-MBPCA) ear recognition technique is presented. The proposed method is inspired by the application of PCA on hyperspectral images, which has been shown to produce high accuracy results for classification. 2D-MBPCA divides the input ear image into multiple bands, mimicking a hyperspectral image, and then applies standard PCA on the resulting bands. The resulting eigenvectors are then used as features for matching. Consequently, 2D-MBPCA translates the success of PCA based hyperspectral image classification to the single image ear recognition domain. The proposed 2D-MBPCA method includes the following four components: A) Pre-Processing; B) Multiple-Image Generation; C) standard PCA; and D) Matching. Figure 1 shows the block diagram of the proposed algorithm.

2.1 Pre-processing

Let E be the set of all ear images, where each image in E is of size x × y. It is assumed that the input image e ∈ E is an 8-bit, grayscale image. First, each pixel value p∈ e is converted to a new value p’ as shown in (1):

$$ p' = p/255 $$

(1)

Histogram equalization is then applied on the resulting image to increase its contrast. To do so, the Probability Mass Function (PMF) P_X of the image is first calculated:

$$ P_{X} (x_{k} )=P(X=x_{k} ) \text{ for } k=0,1,...,255 $$

(2)

where X = x₀,x₁,...,x₂₅₅ represent the pixel values and P_X(x_k) is the probability of coefficients in bin k. The resulting PDF is then used to calculate the Cumulative Distribution Function (CDF) C_X of the image:

$$ C_{X} (k)=P(X \le x_{k}) \text{ for } k=0,1,...,255 $$

(3)

where C_X(k) is the cumulative probability of X ≤ x_k. Finally, each pixel value within the image is mapped to a new value using its resulting CDF. After histogram equalization, each of the resulting images in E is ready to be converted into multiple images.

2.2 Multiple-image generation

The proposed 2D-MBPCA method can use any method to generate multiple images from the histogram equalized input image. In this investigation three methods called: equal size, histogram based, and greedy hill climbing based boundary calculation methods were used to determine the boundaries. These methods are detailed in subsequent subsections. The multiple image generation can be formulated as follows: Assume e is the input histogram equalized image and let N be the number of desired images to be generated from the input image e. The proposed algorithm uses N-1 boundaries to split the input image pixels into N target images according to the pixel values. Let B = {b₁,b₂,...,b_N− 1} be the boundary values. Then the input image e pixels are divided into N target images as follows:

1.
Generate N images of the same size of e and set their pixels to zero. Let these images be: F = [f₁,f₂,...,f_N].
2.
Assign each pixel p in e of value in the range [0,b₁),[b₁,b₂),...,[b_N− 1,1] to image f₁,f₂,...,f_N, respectively.

The input image e has now been partitioned into F images, where F can be considered to be, in a sense, a multispectral image and each f ∈ F captures its own intensity band. Figure 2 shows an example of multiple image generation, where the input image has been divided into four images using the equal size boundary calculation, yielding the following bands: [0, 0.25), [0.25, 0.5), [0.5, 0.75), and [0.75, 1] for image f₁, f₂, f₃, and f₄, respectively.

In this research, three different methods to calculate boundaries for image partitioning called: equal size, histogram based, and greedy hill climbing based methods are introduced. Equal size boundary calculation divides the input image into equal bands based on the selected number of bands. The histogram based technique takes a training subset of the dataset and filters its histogram to determine the boundaries. The greedy hill climbing technique attempts all possible positions to add a boundary. Once a single boundary is found, it then iterates to add further boundaries until the matching accuracy is maximized. These methods are detailed in the following subsections.

2.2.1 Equal size boundary calculation

Let N be the number of target images for the input image e to be divided into. The pixel value boundaries B = {b₁,b₂,...,b_N− 1} are then calculated using (8):

$$ b_{n}=n/N \text{ for } n=1,2, ..., (N-1) $$

(4)

2.2.2 Histogram based boundary calculation

The proposed histogram based boundary calculation method first divides the pre-processed input image dataset into training and test images. The proposed algorithm then calculates the histogram of all images within the training set using 256 equal sized bins. The resulting histogram is then smoothed using a 1D Gaussian low pass filter with coefficients [0.25 0.5 0.25]. The minima of the histogram are then used as the boundaries for the 2D-MBPCA algorithm. This process is repeated several times to determine the set of boundaries that maximize the accuracy of the training set, and the resulting boundaries are finally used for testing. An example of such a histogram is shown in Fig. 3 and the block diagram for the histogram technique is shown in Fig. 4.

2.2.3 Greedy hill climbing based boundary calculation

The greedy hill climbing algorithm calculates the boundaries by iteratively running 2D-MBPCA on a training set of images. The proposed algorithm initializes with a set of training images called input_images, an empty set of boundaries called bnds, and a measure of the overall best matching percent found called top_percent. It then attempts to find the first optimal boundary to add to the set of boundaries found. This is accomplished by using a variable curr_bnd that represents the potential boundary to be added to the set of optimal boundaries (bnds), which is initialized to a pre-selected step size κ. Two other variables, best_percent and opt_bnd, which contain the best matching accuracy and its associated boundary point found in this iteration, are initialized to zero. While curr_bnd is less than one, a temporary set of boundaries temp_bnds is created by copying bnds, concatenating opt_bnd, and sorting the result. 2D-MBPCA is performed on the input_images using temp_bnds, with the resulting correct match percentage stored as matching_percent. If matching_percent is higher than the best_percent found so far in this iteration, best_percent is set to matching_percent and curr_bnd is saved as opt_bnd. The variable curr_bnd is then incremented by κ. When curr_bnd finally exceeds one, the algorithm now has a single boundary opt_bnd that may be permanently added to the boundary set bnds. If the overall matching percentage has increased in this iteration by adding opt_bnd (i.e, if best_percent is greater than top_percent), top_percent becomes best_percent, opt_bnd is permanently added to bnds, bnds is sorted, and a new iteration begins. If adding opt_bnd does not increase the overall matching percentage, the algorithm returns the already calculated set bnds as the best set of boundaries found and the boundaries are then used on the test set of images. A block diagram of the proposed greedy hill climbing technique can be seen in Fig. 5.

A κ value of 0.05 was chosen as a compromise between performance, overfitting, and computation time for the results presented in this paper. Although this greedy hill climbing approach is not guaranteed to find the global optimum for boundary values, it produces sufficient results while simultaneously reducing computation time when compared to a brute force method.

2.3 Principal component analysis

Assume F is a set of images of the same size and F = f₁,f₂,...,f_N. For each image f ∈ F, a mean adjusted image f’ is created as follows:

$$ f' = f - \overline{f} $$

(5)

where $\overline {f}$ is the mean value of the pixels in image f. Every image f’ is then converted to a column wise vector, allowing F to be represented as a two dimensional matrix S. PCA is then performed using Singular Value Decomposition (SVD) on matrix S creating the following decomposition:

$$ S = U {\Sigma} V^{T} $$

(6)

where U is a unity matrix and the columns of V are the orthonormal eigenvectors of the covariance matrix of S and Σ is a diagonal matrix of their respective eigenvalues. The eigenvectors form a basis for an eigenspace for each set of images F. The resulting principal components in V are finally used for matching.

2.4 Matching

Let M = m₁,m₂,...,m_N− 1 be the set of principal components of a query image q and let r be an image in the dataset of images R with principal components L = [l₁,l₂,...,l_N− 1]. Each Euclidean distance d_n ∈ D = [d₁,d₂,...,d_N− 1] between q and r can be calculated using (6):

$$ d_{n}=\sqrt{{\Sigma}_{n}(m_{n}-l_{n})^{2} )} $$

(7)

After the calculation of the Euclidean distances between the principal components, they are averaged into an average distance metric, as written in (7):

$$ AvD={\Sigma} D/(N-1) $$

(8)

The best match for query image q in the image dataset R is the image for which AvD is minimized.

3 Experimental results

To assess the performance of the proposed 2D-MBPCA technique and compare its performance against standard Principal Component Analysis (PCA) method for ear recognition, experimental results were generated using two benchmark ear image datasets called the Indian Institute of Technology

Delhi II (IITD II) [17] and the University of Science and Technology Beijing I (USTB I) [8], which are widely used in the literature [3, 4, 25, 27, 36]. These two datasets were selected due to their widespread use and because they have been pre-aligned. The IITD II dataset consists of 793 images of the right ear of 221 participants. Each participant was photographed between three and six times, with each image being of size 180 × 50 pixels and in 8-bit grayscale. The images of IITD II dataset are tightly cropped, of equal size, and are manually centered and aligned. The USTB I dataset consists of 180 images of the right ear of 60 participants, each of whom were photographed three times. The images in this dataset are 8-bit grayscale of size 150 × 80. The images in USTB I are tightly cropped; however, they exhibit some slight rotation and shearing. Multiple example images from these two datasets are illustrated in Fig. 6, where Fig. 6a-b and c-d show images from the IITD II and USTB I datasets, respectively.

The proposed 2D-MBPCA technique using the three different boundary selection algorithms described in Section 2.2, standard single-image PCA, and the eigenfaces methods were applied to the images of the two datasets. The algorithms were all assessed via Rank-1 and Rank-5 criteria [10]. For each subject, two images were randomly selected to serve as database images and a third image was randomly selected to be a query image. Given a particular query image, if it was correctly matched with an image of the same subject in the database, it was marked as a Rank-1 image. Similarly, if the subject was found within the closest five images to the query image, it was marked as a Rank-5 image. The percentage of Rank-1 and Rank-5 images in the dataset are then listed as Rank-1 and Rank-5 accuracies. This process was repeated for two additional permutations, with the Rank-1 and Rank-5 accuracies averaged across all trials. It should be mentioned that 10% of the image datasets were randomly selected and used for calculating the boundary values in the histogram based boundary calculation experiment. The same 10% was also selected for tuning the κ parameter for greedy hill climbing based boundary calculation. The remaining 90% of each image dataset was then used to generate experimental results.

3.1 Experimental results for the standard PCA method

To create results for the standard PCA method, it was applied to each image individually. The resulting eigenvectors were then used for matching. The results for both the IITD II and USTB I image datasets are presented in Table 1. From Table 1, it can be seen that the PCA performance on the IITD II is higher than its performance on the USTB I. This could be explained by the fact that some images within the USTB I dataset are slightly rotated.

Table 1 Rank-1 and Rank-5 matching accuracy (%) for Standard PCA

Multi-band PCA based ear recognition technique

Abstract

Similar content being viewed by others

Ear Recognition Using Block-Based Principal Component Analysis and Decision Fusion

Image Band-Distributive PCA Based Face Recognition Technique

PCA Based Face Recognition on Curvelet Compressive Measurements

1 Introduction

2 2D multi-band PCA technique

2.1 Pre-processing

2.2 Multiple-image generation

2.2.1 Equal size boundary calculation

2.2.2 Histogram based boundary calculation

2.2.3 Greedy hill climbing based boundary calculation

2.3 Principal component analysis

2.4 Matching

3 Experimental results

3.1 Experimental results for the standard PCA method

3.2 Experimental results for independent component analysis

3.3 Experimental results for the eigenfaces method

3.4 Experimental results for the 2DPCA method

3.5 Experimental results for the (2D)2 PCA method

3.6 Experimental results for the proposed 2D-MBPCA using equal size boundaries

3.7 Experimental results for the proposed 2D-MBPCA using histogram based boundaries

3.8 Experimental Results for the Proposed 2D-MBPCA Using Greedy Hill Climbing Based Boundaries

3.9 Justification of the achieved performance

3.10 Execution time

4 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

3.5 Experimental results for the (2D)² PCA method