Wheat grain classification by using dense SIFT features with SVM classifier

doi:10.1016/j.compag.2016.01.033

Computers and Electronics in Agriculture

Volume 122, March 2016, Pages 185-190

https://doi.org/10.1016/j.compag.2016.01.033 Get rights and content

Highlights

•
We put an automated system to classify the wheat grains with a high accuracy rate.
•
We used the performance of DSIFT evaluated by SVM classifier.
•
The proposed method provides an overall 88.33% accuracy rate.

Abstract

The demand for identification of cereal products with computer vision based applications has grown significantly over the last decade due to economic developments and reducing the labor force. With this regard, we have proposed an automated system that is capable to classify the wheat grains with the high accuracy rate. For this purpose, the performance of Dense Scale Invariant Features (DSIFT) is evaluated by concentrating on Support Vector Machine (SVM) classifier. First of all, the concept of k-means clustering is operated on DSIFT features and then images are represented with histograms of features by constituting the Bag of Words (BoW) of the visual words. By conducting an experimental study on a special dataset, we can make a commitment that the proposed method provides the satisfactory results by achieving an overall 88.33% accuracy rate.

Introduction

Wheat (Triticum spp.) is a major food source in the world and it is commonly grown in most of the countries. It has wide adaptability to various environments including irrigated and dry land conditions; this explains that why it prevails in food production of the world. Healthy wheat production mainly requires certified pure grain use and in production process grains shouldn’t be mixed with different genotypes. The identification of varieties requires some knowledge of the appearance of grains and is assisted by information regarding its grain appearance. Confusion in grain appearance is a frequently encountered problem, especially in Turkey, where the number of wheat genotypes is very large. Commercially, wheat can be categorized with two groups as grain hardness (soft, medium-hard and hard) and appearance (color, degree of damage by insects or fungi, shriveling, shape of embryo). This separation can be branch out by considering growing habit (spring or winter). Also each subclasses can be ordered by their grades depending upon the price of a wheat stock as applying premiums or penalties by taking such properties (rain, heat, frost, insect and mould damage) and the cleanliness (dockage and foreign material) of the wheat lot into account. Grading factors can also be varied with respect to quality of grain such as protein content and sedimentation test weight in wheat trading as emphasized in Peña (2002).

Classification of wheat grains can be made in two ways as manually or automatically. In manual way, the type and the quality of wheat grain is specified based on an expert judgement. However, the judgement of the expert is inaccurate for some cases when the difference between the variety and the quality of wheat species is very close to each other. Thereby, the decision of the expert could result in financial loss, bankrupt or loss of confidence on behalf of manufacturer. In another way, using image processing and pattern recognition algorithms, called expert systems, with a purpose for wheat classification are slightly more accurate than manual way. In this way, the shape, the texture, the length and the color features are considered and combined to construct the feature vector, which represents the image with reduced dimension by discriminative characteristics. Later, the obtained feature set is put forward as input into a machine learning algorithm, i.e., K-Nearest Neighbor (K-NN), Decision Tree (DT) or Artificial Neural Network (ANN), to obtain a concise decision about its label. It should be noted that using expert systems depending upon the pattern recognition methodologies is more effective, fairer, cheaper and faster when compared with expert judgement.

Going through the previous studies on wheat classification, we found a few studies as counted something on the fingers of two hands. In Zayas et al. (1989) developed a structural prototype for discrimination of wheat and non-wheat components in a grain sample, by using a multivariate discriminant analysis technique. In referred paper, the execution time for discrimination between wheat and non-wheat was elapsed as about 10 s. Also, priorly, the same authors carried out another two experiments as discrimination between Arthur and Arkan wheats (Zayas et al., 1985) and discrimination between wheat classes (hard red winter, soft red winter and hard red spring) and varieties by benefiting from image processing methods (Zayas et al., 1986) with maximum 99.52 (1 out of 209 grain was not identified correctly) and 83% accuracy rates in the test stage, respectively. In another work, Majumdar and Jayas had reported four different approaches for classification commercial cereal grains by employing morphology models (Majumdar and Jayas, 2000a), color models (Majumdar and Jayas, 2000b), texture models (Majumdar and Jayas, 2000c) and hybrid one as combination of morphology, color and texture models (Majumdar and Jayas, 2000d). Moreover, the discrimination of wheat class and variety by digital image analysis of whole grain samples (Neuman et al., 1987), the performance of pattern recognition methods for the separation of cereal grains (corn, soybeans, rice, sorghum grain, barley and wheat) (Lai et al., 1986), the identification of Australian wheat varieties (Myers and Edsall, 1989), identification of Canada Western Red Spring (CWRS) wheat (Sapirstein and Kohler, 1999) and analyzing the shape Indian wheat varieties (Shouche et al., 2001) by image analyses techniques, particularly with morphological operations (Zapotoczny et al., 2008) had been presented and realized by concentrating on some computer programs.

Recently, a crowded set of machine learning techniques have been experimented to determine some wheat types. In a work (Guevara-Hernandez and Gomez-Gil, 2011), a machine vision system was developed for classification of wheat and barley grains based on the 21 morphological features, 6 of them were color features, 72 of them were texture features, totally 99 features. To reduce high dimension of feature set and eliminate the ones which gain the less contribution, the sequential forward feature selection has utilized after zero mean normalization, weighting and sorting processes with respect to larger Fisher discriminant ratio. The experimented approach achieves an accuracy rate that is higher than 99% when conducting on two classes. In another work (Ronge and Sardeshmukh, 2014), the widely known four Indian wheat seed varieties (Lokvan Gujrat, Lokvan MP, MP sure, Khapali) are classified by extracting 131 textural features as 32 gray level textural features, 31 Local Binary Pattern (LBP) features, 31 Local Similarity Patterns (LSP) features, 15 Local Similarity Number (LSN) features, 10 Gray Level Co-occurrence matrix (GLCM) features and 12 Gray Level Run-length Matrix (GLRM) features. By observing the results, the average accuracy values are 66.68% and 39% in case of intra class classification with ANN and K-NN classifiers, respectively. Again, the Multi-Layer Perceptron (MLP) Neural Network based classification system (Pazoki and Pazoki, 2013) has been developed to distinguish the six classes of rain fed wheat grain cultivars with 21 statistical features. The average accuracies returned from the system have been reported as 86.48% and 87.22% as before and after applying the utility additive (UTA) algorithm to ignore the less promised features. Moreover, six varieties (Demir, Gün, İkizce, Mızrak, Seval, Tosunbey) of bread wheat are classified by using the common vector approach (CVA), which is a subspace based feature extraction method (Gulmezoglu and Gulmezoglu, 2015). By using the CVA, firstly, a common vector which represents common or invariant properties of each class is computed and then a given test image is assigned to its label based on minimum distance criteria. The average accuracy for 500 test images has been reported as 36.7. Also, the impact of four machine learning algorithms (One-R, J48, IBK and Apriori) (Romero et al., 2013) for the prediction of wheat yield from several phenotypic plant traits has been examined by using the machine learning software WEKA (Hall Mark, 2003). Authors emphasized that among the aforementioned algorithms, the best overall accuracy obtained from Apriori, which is noted as 90% when executed to predict drum wheats for three cities. In given study, the measured yield components have considered as features in case of prediction.

Despite the good performance of proposed methods, the limitations of them become apparent when considering big datasets with high sample size and dimension, which are not carried out in related works. In another word, training a pattern recognition system with the low number of samples does not give stable and precise results in terms of accuracy and effectiveness in reality. We consider this detail and present a solution to close the gap with a different wheat classification approach. With this aim, we have proposed an algorithm based on the dense SIFT features, which is a popular feature extraction method utilized in tasks of object recognition (Loncomilla and Ruiz-del-Solar, 2005), object tracking (Zhou et al., 2009) and image retrieval (Ledwich and Williams, 2004). The reason for choosing DSIFT instead of SIFT, is attributed to its good results by obtaining descriptors from every locations, when compared with performance generated from specific locations as performed in SIFT algorithm. Moreover, the study on comparison of feature detectors and descriptors for object class matching (Hietanen et al., 2015) emphasizes that the performance DSIFT is superior to some feature selection methods. Also, SVM is selected as an optimized classifier that generates an optimal decision boundary between classes. The obtained features are concentrated on SVM classifier. The experimental results show that the proposed method gives satisfactory, realistic and convincing accuracy rates when making trials on 40 classes as consists of 160 samples per each class.

The rest of paper is organized as follows. Section 2 introduces the materials and procedure for wheat classification. The related work is presented in Section 3. In Section 4, the performance of proposed method on special dataset is discussed with objective evaluation measures. Finally, a conclusion is touched and future work is discussed.

Section snippets

Material and methods

The classification procedure of wheat objects is conducted on most of the state of art feature extraction methods. The related work is summarized with Fig. 1. By inspecting Fig. 1, some stages that are required for a particular classification problem are considered as dense SIFT features that are extracted in (i), (ii) the k-means clustering is operated on DSIFT features, (iii) the BoW model from the histogram clustering features are acquired and finally SVM classifier is utilized on BoW model

Proposed method

In this study, it is aimed at developing a classification model by evaluating the performance of DSIFT features. Typically, a classification algorithm consists of three steps as feature extraction, training (model construction) and test stages. Similarly, the proposed algorithm mainly involves three stages, which are obtaining the BoW model with visual words, a decision model construction with SVM and specifying a test process.

By analyzing Fig. 3, the taken images are put forward to the DSIFT

Dataset

To validate the distinctiveness property of the proposed method, a variety of experiments are conducted on a special dataset that includes the wheat images of our country. The referred dataset involves 40 wheat grain species (classes) (Adana 99, Ahmetağa, Aldane, Alparslan, Altınbaşak, Ayyıldız, Bayraktar, Bereket, Canik, Ceyhan, Çetinel, Çukurova 86, Demir, Doğankent, Ekiz, Eser, Fuatbey, Gökhan, Göksu, Gün, İkizce, Karatoprak, Kenanbey, Kınacı, Kirik, Lütfübey, Osmaniyem, Özcan, Palandöken,

Conclusion

In this paper, we have reported a new approach to identify the wheat grains by using a smart decision system. The implemented system promises to make the automated classification of wheat objects with an effortless way and without losing the time. Moreover, the experimental results obtained on special dataset; show that the utilized features are discriminative and robust when the contribution is considered in term of accuracy rate. The misclassified samples could be attributed to lack of

References (39)

A.K. Jain
Data clustering: 50 years beyond K-means
Pattern Recogn. Lett.
(2010)
M. Neuman et al.
Discrimination of wheat class and variety by digital image analysis of whole grain samples
J. Cereal Sci.
(1987)
K. Özkan et al.
A new classification scheme of plastic wastes based upon recycling labels
Waste Manage.
(2015)
J.R. Romero et al.
Using classification algorithms for predicting durum wheat yield in the province of Buenos Aires
Comput. Electron. Agric.
(2013)
S. Shouche et al.
Shape analysis of grains of Indian wheat varieties
Comput. Electron. Agric.
(2001)
P. Zapotoczny et al.
Application of image analysis for the varietal classification of Barley: morphological features
J. Cereal Sci.
(2008)
H. Zhou et al.
Object tracking using SIFT features and mean shift
Comput. Vis. Image Underst.
(2009)
M.F. Caropreso et al.
Beyond the bag of words: a text representation for sentence selection
H. Cevikalp et al.
Comparisons of features for automatic eye and mouth localization
F. Guevara-Hernandez et al.
A machine vision system for classification of wheat and barley grain kernels
Spanish J. Agric. Res.
(2011)

Gulmezoglu, M.B., Gulmezoglu, N., 2015. Classification of bread wheat varieties and their yield characters with the...

E.F. Hall Mark

The WEKA data mining software: an update

SIGKDD Explor.

(2003)

A. Hietanen et al.

A comparison of feature detectors and descriptors for object class matching

Neurocomputing

(2015)

T. Joachims

A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization

T. Kanungo et al.

An efficient k-means clustering algorithm: analysis and implementation

IEEE Trans. Pattern Anal. Mach. Intell.

(2002)

F. Lai et al.

Application of pattern recognition techniques in the analysis of cereal grains

Cereal Chem.

(1986)

S. Lazebnik et al.

Beyond bags of features: spatial pyramid matching for recognizing natural scene categories

Ledwich, L., Williams, S., 2004. Reduced SIFT features for image retrieval and indoor localisation. In: Proceedings of...

P. Loncomilla et al.

Improving SIFT-based object recognition for robot applications

Cited by (78)

A methodical analysis of deep learning techniques for detecting Indian lentils
2024, Journal of Agriculture and Food Research
Automatic identification of Indian lentils requires the selection of a suitable classification model among various CNN models. The CNN models are the most effective for image classification. However, more than one CNN model occasionally performs closer to each other. Choosing the best classification model requires considerable effort in this situation. Automated identification of lentils would allow for faster selection processes and reduced time compared to manual assessment. In this study, 18 CNN models were tested to identify the lentils. The performance of the CNN models was investigated and a two-phase statistical analysis was conducted to select the best CNN model for identifying Indian lentils. The 18 CNN models are Alexnet, Darknet19, Darknet53, Densenet, EfficientNetB0, Google net, InceptionResnetV2, InceptionV3, MobilenetV2, NasnetLarge, NasnetMobile, Resnet18, Resnet50, Resnet101, Squeezenet, Vgg16, Vgg19, Xception. The two-phase statistical analysis was Duncan's multiple range test, and the Wilcoxon signed-rank test was performed. Again, nine measures, such as accuracy, sensitivity, specificity, precision, FPR, F1 Score, MCC, Kappa, and computational time, were considered for statistical analysis. After the execution of 18 CNN models and two-phase statistical analysis, it was revealed that EfficientNetB0 is superior among the CNN models for the identification of lentils.
Self-Supervised Domain-Adaptive learning for Self-Labeling unknown rice grains during actual rice transportation process
2024, Computers and Electronics in Agriculture
Industrial rice agriculture has transitioned towards intelligent computing, with an increasing likelihood of incorporating unmanned decision-making systems. However, one lingering issue that persists is the continued reliance on annotations for activity description. Applying transfer learning and domain-adaptive learning to non-labeled target data with the assistance of source data can be explicitly, but depends on mitigating domain discrepancy instead of principally generating predictive. This compromise both target robustness and comprehensibility. Our aim to enable a model to rely effectively on explicit source knowledge and self-labeling to reliably self-annotate. Our proposed self-supervised domain adaptation method provides debiasing by utilizing both former and present states, which reduces inconsistency when generating imbalanced pseudo-labels and implicitly utilizes source knowledge to consistently increase target-specific knowledge. Comprehensive results show improved performance by 7.5% for public data benchmarks and a gain of 2.4% for a rice transportation scenario. Ablation studies reveal improvements of up to 8.2% in Top-1 accuracy and 2.5% in Top-5 accuracy compared to an existing debiasing method, and highlights substantial parameter and FLOPs saving of up to 4.2 and 14 times, respectively. This underscores the feasibility of our system over self-label-dependent learning.
Machine learning approach for the classification of wheat grains
2023, Smart Agricultural Technology
Citation Excerpt :
This section summarizes recent state-of-the-art techniques developed for the classification of wheats and different kind of seeds. Olgun et al. [26] proposed a ML based classification system to classify 40 different kinds of wheat species. The SVM classifier was designed following the extraction of Dense Scale-Invariant Feature Transform (DSIFT).
Wheat is known to be one of the most important agricultural crops throughout the world. Due to its mass storage in warehouses, tonnes of wheat grains rotten every year that eventually affects its market price. This paper presents an end-to-end automatic system that utilizes computer vision techniques for the quality grading of wheat grains. The main purpose of this work is to determine most discriminatory features and a suitable classifier that may classify the given wheat sample into two classes ‘fresh’ and ‘rotten’. At first, shadow removal, segmentation, and separation of each grain is performed as parts of the pre-processing step. The pre-processing is followed by the features extraction step where 7 color and 16 texture handcrafted features are determined for each grain. The four binary classification models, namely, Support Vector Machine (SVM), K-Nearest Neighbour (KNN), Multi-Layer Perceptron (MLP), and Naïve Bayes (NB) are then built using 10-fold cross validation approach. The classifiers are compared on the basis of performance metrics- accuracy, error rate, recall, specificity, precision, and F₁-score. The comparative analysis depicts that based on color features, the SVM classifier outperforms other classifiers by achieving the accuracy of 93%. In contrast, based on texture features, the NB classifier achieved accuracy at 65%; highest among all classifiers. Experimental results encourage the utility of SVM classifier modelled on color features in automatic quality grading of wheat grains.
Classification of seven Iranian wheat varieties using texture features
2022, Expert Systems with Applications
Citation Excerpt :
Based on this conversion, it obtains the optimal boundary between possible outputs. In other words, SVM performs very complex conversions to determine how to separate the data based on the defined tags or outputs (Olgun et al., 2016). Today, artificial neural network (ANN) inspired by the human brain is utilized as one of the most widely used methods of modeling, classification, and optimization (Jagtap & Kokare, 2016).
Wheat is a major crop in Iran and plays an essential role in satisfying human nutritional needs. Depending on the geographical location as well as the climatic conditions of each region, different varieties of this crop can be cultivated. The purity of the seeds used for field cultivation has a substantial impact on both field efficiency and flour quality as the final product. To increase the seed supplied for cultivation in seed supply centers, the use of new non-destructive automatic methods for this purpose is indispensable. In this study, 7 varieties of wheat in the East Azerbaijan Province were investigated by employing a machine vision system. By extracting texture features using Gray Level Co-occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRM), and Local Binary Pattern (LBP) algorithms and classifying using principal component analysis (PCA) as the unsupervised method, along with the Support Vector Machine (SVM) and Artificial Neural Network (ANN) as supervised methods, the system accuracy was surveyed. Results indicated that modeling using a combination of all extracted features (125 features) yielded the best results by the ANN model, which the Correct Classification Rate (CCR) was obtained 100% and 95.04% for training and testing datasets, respectively. The testing dataset CCR was improved using 20 selected features by the chi-square test feature selection method up to 98.10%, which was better than the modeling results with all extracted features. Results also revealed that by using image processing and texture features extraction methods, it was possible to identify the classification of different wheat varieties with over 95% accuracy.
Durum Wheat Classification Using Feature Selection, Bayesian Optimization and Support Vector
2024, 2024 4th International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies, ICAECT 2024
Analysis of selected deep features with CNN-SVM-based for bread wheat seed classification
2024, European Food Research and Technology

View all citing articles on Scopus

¹: Tel.: +90 222 3242991/4862.

²: Tel.: +90 222 2393750/3275.

View full text

Original papersWheat grain classification by using dense SIFT features with SVM classifier

Highlights

Abstract

Introduction

Section snippets

Material and methods

Proposed method

Dataset

Conclusion

Pattern Recogn. Lett.

J. Cereal Sci.

Waste Manage.

Comput. Electron. Agric.

Comput. Electron. Agric.

J. Cereal Sci.

Comput. Vis. Image Underst.

Beyond the bag of words: a text representation for sentence selection

Comparisons of features for automatic eye and mouth localization

A machine vision system for classification of wheat and barley grain kernels

Spanish J. Agric. Res.