Computer vision-based plants phenotyping: A comprehensive survey

Summary
The increasing demand for food production due to the growing population is raising the need for more food-productive environments for plants. The genetic behavior of plant traits differs across growing environments. However, monitoring individual plant component traits manually is tedious and, at scale, infeasible. Plant breeders therefore need computer vision-based plant monitoring systems to analyze the productivity and environmental suitability of different plants. Such systems enable quantitative analysis, geometric analysis, and yield rate analysis of plants. Plant breeders have used many data collection methods according to their needs. This review discusses most of them along with their challenges and limitations. Furthermore, the traditional approaches to segmentation and classification in plant phenotyping are discussed. The data limitation problems and the solutions currently adopted in computer vision are highlighted; these solutions mitigate the problem but do not fully resolve it. The available datasets and current issues are also highlighted. The presented study covers plant phenotyping problems, suggested solutions, and current challenges from data collection to classification.


Satellite imaging-based plant phenotyping
The increasing demand for food pushes governments to support their agricultural systems. Satellites and the images they collect are used for biomass estimation and other crop yield estimates [18]. Soil pH, soil moisture, and other soil measurements can be estimated from satellite imagery [19]. Furthermore, large areas can be monitored regularly and consistently, and historical analyses can be performed. Geographical information systems (GISs) often use these images to obtain more precise results. Many institutions provide satellite imagery; the most common include NASA, Geocento [20], and Google Earth Engine. Similarly, many countries use satellite imagery to monitor forests and the effects of climate change on them [21].
From the preceding discussion of computer vision applications in plant phenotyping, we conclude that computer vision assists biologists with plant phenotyping solutions for monitoring and improvement. However, the image collection methods and image processing-assisted solutions face different problems, which are discussed in this review together with the solutions proposed for them. Finally, the system diagram of the presented study is shown in Figure 1, which illustrates all the basic steps discussed in the manuscript. [24][25][26][27][28][29] The abbreviations used in this survey are described.

Comparison with existing surveys
This survey covers most data acquisition techniques with their pros and cons, whereas existing surveys are specific to particular technologies or methods. Furthermore, the presented survey covers data collection methods, segmentation, and classification techniques. It also discusses the challenges and problems currently faced by computer vision-assisted methods. A short comparison with previous surveys is shown in Table 1.

Contributions of this survey
The existing surveys, although valuable, focus on particular plant phenotyping techniques and methods. However, no single report covers the major modalities of plant phenotyping together with their challenges and solutions. Therefore, we present a review of plant phenotyping covering the major and traditional methods, including:
1. a detailed discussion of data acquisition methods for different species and the resulting challenges that need to be solved;
2. an insightful discussion of the challenges to be met in segmentation, leaf counting, and classification; and
3. a detailed discussion of previously applied ML and deep learning (DL) approaches for segmentation and classification, with their shortcomings and strengths.
The remainder of this review contains the following sections: computer vision-based plant phenotyping, pre-processing, segmentation methods with their different approaches, feature extraction methods from multiple aspects, and classification methods using ML and DL with their corresponding results. The last sections cover current challenges and the availability of a few well-known datasets.

COMPUTER VISION-BASED PLANT PHENOTYPING
Plant phenotyping experiments are designed using computer vision methods for quantitative measurement. They analyze gene-environment interactions, plant-growing infrastructure, substrate handling, and other monitoring installations. To establish standard protocols for plant monitoring, imaging sensors must collect and process imaging data precisely. Quantitative phenotype measurements also need metadata for the evaluation of results. Based on this evaluation, the growing environment can be controlled in the field, in a greenhouse, or in plant growth chambers. Many plant imaging techniques can be used for these purposes, and some of the available datasets are discussed in the coming sections.

Image collection tools and challenges
Plant imaging data can be categorized into two major types: below-ground and above-ground imaging. These types are discussed individually with their current challenges, modes of use, limitations, desired outcomes, and the targeted species with the relevant plant components.

Plant above-ground organ phenotyping
Plants grow from the soil and appear above the ground, whereas their roots remain below ground. For above-ground organs, many image collection methods are in use, such as thermal, hyperspectral, visible light, laser, fluorescence, near-infrared, and 3D imaging. Each technology has certain limitations and challenges in data collection, such as occlusion, lighting effects, and the variation in how different plants grow in different environments. A few methods presented by previous studies, with the species and organs they target and the quantitative metrics they report, are shown in Figure 2.
Among above-ground image collection modalities, thermal imaging is used for biotic status information, surface temperature, and water stress analysis. It is limited by variations in the atmospheric conditions around plants. Some species, such as grapevine [24], barley [33], maize [34], wheat [35], and rice [24], are analyzed using thermal imaging as the basic image acquisition step. Studies propose metrics that inspect plants and measure the temperature of their structures. However, differentiating between soil and plant temperature is still challenging, even with thermal sensors. Likewise, hyperspectral imaging techniques are used to obtain leaf water and leaf area measurements, leaf growth, and other health indicators across multiple growth stages. It is limited by its cost, complexity, and major interpretation problems.
Hyperspectral imaging data are used to analyze species such as Arabidopsis [36], rice, triticale [37], and wheat, yielding the canopy status. A few water-related health indicators are also analyzed. However, pigment concatenation, plant stress, and water-controlled environments remain challenges with this imaging tool. Sunlight influences the image collection angle for canopy structural analysis, so the angle must be changed frequently, even within a single acquisition round.
Visible light imaging also has many limitations; for example, it returns only physiological information about plants in a controlled environment. It also constrains the calibration needed for collecting spectral information, and sunlight and shadow may cause over-exposure that hampers further processing of the images. In visible light imaging, production analysis uses area-based yield, growth rate analysis uses morphology, and the time interval is calculated. Likewise, flowering is observed from images of various species, such as Arabidopsis thaliana, bean, barley, citrus fruits [38], maize [39], rice, and Medicago truncatula [40]. Laser-based image collection requires specific scanners with a certain range. Georeferencing requires integration with GPS (Global Positioning System). Further, specific illumination is also a problem in a controlled environment.
Similarly, fluorescence imaging requires a pre-acclimation state for the complicated analysis of biotic and abiotic measurements that are difficult to obtain using visible light imaging in a field environment. Canopy structure, leaf angle, and stem architecture can be analyzed with computer vision-based techniques applied to laser imaging. Hence, geometric, canopy, and biomass structure analyses are performed on species such as barley, wheat, sugar beet [41], triticale [37], and soybean. Fluorescence imaging captures the light emitted from different plant organs and supports red-region map analysis; with computer vision automation, the health status of leaves, photosynthesis, and photochemical activity can be analyzed in various species such as Arabidopsis, bean, barley, chicory [42], tomato, sugar, and wheat. Near-infrared imaging can also be used for night vision analysis. These sensors produce continuous and discrete data containing region-based spectral data. Time-series assessment of single-shoot and multi-shoot canopies can be performed, yielding measures of water content and seed indexes relative to leaf area. Near-infrared analysis has been performed on species including barley, maize, rice, soybean, and wheat.
From the earlier discussion of image collection methods, we conclude that many analyses are performed with above-ground imaging techniques, using different imaging tools for different species. Growth, structural analysis, time-based single- and multiple-shoot analysis, stress analysis, surface analysis, health status, photosynthesis, and many other analyses are performed, helping to control and effectively increase plant production.

Plant below-ground organ phenotyping
Other techniques such as positron emission tomography (PET), magnetic resonance imaging (MRI), and X-ray imaging are used to analyze the below-ground growing environment and structure of plants. In X-ray technology, computer-processed images are obtained by scanning an object, or specific parts of it, to produce an inner 3D view. Applied to plants, it provides volumetric data from which a soil analysis can be performed to help measure root systems. MRI assists in analyzing the below-ground organs of the plant root system for water distribution, or in estimating water outflow and other quantitative measures [32]. Water content can be analyzed with MRI using 3D morphological patterns; MRI-based analysis has been performed on species such as bean, sugar beet, Beta vulgaris, and Hordeum spontaneum [26]. Likewise, soil and water measurements can also be made using MRI images.
Table 1. Comparison with previous surveys (Li et al. [30], Chandra et al. [18], Mochida et al. [31], Li et al. [32]) on imaging techniques and problems.
Similarly, PET is used to obtain information on plants' functional processes using gamma rays. It forms images from pairs of gamma rays that are emitted by a radiotracer introduced into the plant and detected by the PET sensor. It can be used alone or fused with MRI. Using these 3D data, the functional position of water in plant organs can be analyzed.
Furthermore, the velocity of water movement can be predicted using PET imaging. Transport analysis uses the emitted signals, and sector-wise analysis can also be performed using these images. Several species, such as Beta vulgaris and Hordeum spontaneum [26], have been analyzed, with velocity, transport, and sectorality measured via intelligent computer vision methods. CT scans are used in many medical and natural imaging analyses; in plant phenotyping, they provide voxel slices of plant tissues. Grain quality and 3D structure can also be analyzed. However, these analyses have been performed for only a few species, such as wheat and rice [28]. Across all the image collection tools and technologies discussed earlier, certain species are used frequently, and the demand for and use of computer vision-based monitoring systems can be analyzed specifically through these species. After acquiring plant data with these tools and technologies, the next step is the computer vision cycle of pre-processing, segmentation, feature extraction, and classification, which is discussed in the coming sections.

Pre-processing
Pre-processing is an important step in image processing because it enhances the region of interest, ultimately leading to more precise segmentation or classification. It also helps expose tiny grains and spiked particles of plants. Data augmentation techniques such as cropping, rotation, and scale variation are also used; they increase variation so that the learned patterns are more general and robust [31]. Many noise reduction and enhancement methods proposed in the image processing domain could be used for noise removal in plant imaging. Salt-and-pepper noise, for example, is removed using a method based on a fuzzy operator and morphological operators [43]. This method uses erosion and dilation: noise is removed with a dual morphological operator, and the peak signal-to-noise ratio (PSNR) is used to validate its performance against other methods. Salt-and-pepper noise has also been removed with a non-linear filter built by hybridizing multiple methods [44]. A noise reduction approach is proposed for monochromatic images. It is a two-phase approach that first detects the noise and then removes it using adaptive filtering. The reported PSNR values show the method's robustness in removing noise from images [45]. Pre-processing methods are thus used to enhance the region of interest by removing noise, enhancing contrast, and so on. Therefore, any noise or other defects left in plant phenotyping images after collection could be removed by applying these methods.
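To make the noise-removal and PSNR discussion concrete, the following is a minimal sketch in pure Python. It uses a plain 3x3 median filter as a stand-in for the fuzzy, morphological, and adaptive filters of the cited studies, whose details are not given here; the function names `median_filter3` and `psnr` and the list-of-lists image representation are assumptions made for illustration.

```python
import math
from statistics import median
from typing import List

Image = List[List[int]]  # grayscale image as rows of values in 0..255


def median_filter3(img: Image) -> Image:
    """Apply a 3x3 median filter; border pixels use only the neighbours that exist."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            window = [
                img[rr][cc]
                for rr in range(max(0, r - 1), min(h, r + 2))
                for cc in range(max(0, c - 1), min(w, c + 2))
            ]
            # For even-sized border windows, median() averages the two middle
            # values; int() truncates the result back to an integer level.
            out[r][c] = int(median(window))
    return out


def psnr(reference: Image, test: Image, peak: int = 255) -> float:
    """PSNR in dB, 10 * log10(peak^2 / MSE); infinite when the images are identical."""
    n = len(reference) * len(reference[0])
    mse = sum(
        (a - b) ** 2
        for row_a, row_b in zip(reference, test)
        for a, b in zip(row_a, row_b)
    ) / n
    return math.inf if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    clean = [[100] * 5 for _ in range(5)]
    noisy = [row[:] for row in clean]
    noisy[2][2] = 255  # "salt" pixel
    noisy[1][3] = 0    # "pepper" pixel
    restored = median_filter3(noisy)
    print("PSNR noisy:   ", psnr(clean, noisy))
    print("PSNR restored:", psnr(clean, restored))
```

A higher PSNR against the clean reference indicates that more of the impulse noise has been removed, which is how the cited studies compare filters.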

Segmentation
Segmenting and analyzing plant parts is challenging because organs move and rotate, and their sizes and shapes vary over time. To measure or count organ areas, estimate the length and width of plant components, detect inclination and azimuth angles, examine leaf vein characteristics, track growth rate and dynamic motion, and carry out many other tasks, the organs must be isolated precisely. The segmentation of fruit crop diseases using DL techniques and other features has also been proposed previously.

Segmentation evaluation measures in plant phenotyping
There are several evaluation measures for segmented components, of which the Dice score is the most basic and essential. It measures the pixel-wise overlap between the ground-truth and predicted masks. For a plant, which contains multiple labeled objects, the Dice score is extended to average the best score of each predicted part; this extension is the best Dice (BD) score, and its symmetric form is the symmetric best Dice (SBD) score. These measures are given in Equations 1, 2, and 3.

$$\mathrm{Dice} = \frac{2\,|m_{gt} \cap m_{pr}|}{|m_{gt}| + |m_{pr}|} \qquad \text{(Equation 1)}$$

The Dice score is calculated as twice the overlapping area between the ground-truth mask $m_{gt}$ and the predicted mask $m_{pr}$, divided by the sum of the areas of $m_{gt}$ and $m_{pr}$. Because a plant has many components rather than one, the score is then computed by taking, for each leaf component, the maximum Dice score it achieves and averaging these maxima, as shown in Equation 2.

$$\mathrm{BD}(l_A, l_B) = \frac{1}{P} \sum_{x=1}^{P} \max_{1 \le y \le Q} \frac{2\,|l_A^{x} \cap l_B^{y}|}{|l_A^{x}| + |l_B^{y}|} \qquad \text{(Equation 2)}$$

In Equation 2, $l$ denotes the leaf area, $l_A$ and $l_B$ are the sets of leaf components, and $1 \le x \le P$ and $1 \le y \le Q$ are the index ranges of the leaf segments in $l_A$ and $l_B$. SBD takes the minimum of the best Dice of the predicted labeled mask $M_{pr}$ against the ground truth $M_{gt}$ and of $M_{gt}$ against $M_{pr}$, as shown in Equation 3.

$$\mathrm{SBD}(M_{pr}, M_{gt}) = \min\{\mathrm{BD}(M_{pr}, M_{gt}),\ \mathrm{BD}(M_{gt}, M_{pr})\} \qquad \text{(Equation 3)}$$

For the leaf count challenge, leaves are mostly counted after segmentation, whereas some techniques count them without segmentation; the result is reported as the difference in count (DiC), as shown in Equation 4.

$$\mathrm{DiC} = L_{pr} - L_{gt} \qquad \text{(Equation 4)}$$

The ground-truth leaf count $L_{gt}$ is subtracted from the predicted leaf count $L_{pr}$ to obtain the DiC for a given plant. All these measures are given in a collection study on computer vision-based plant phenotyping segmentation [46]. Many image processing, ML, and DL methods have been used in previous studies for plant segmentation; they are discussed in the coming sections.
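The evaluation measures in this subsection (Dice, best Dice, SBD, and DiC) can be sketched in a few lines of pure Python. This is an illustrative sketch rather than the reference implementation of any challenge: each leaf mask is assumed to be a set of (row, column) pixel coordinates, and the function names are hypothetical.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]
Mask = Set[Pixel]  # one leaf mask as a set of (row, col) coordinates


def dice(a: Mask, b: Mask) -> float:
    """Dice = 2 * |a & b| / (|a| + |b|); taken as 1.0 when both masks are empty."""
    total = len(a) + len(b)
    if total == 0:
        return 1.0
    return 2.0 * len(a & b) / total


def best_dice(leaves_a: List[Mask], leaves_b: List[Mask]) -> float:
    """For each leaf in leaves_a take its best Dice against leaves_b, then average."""
    if not leaves_a:
        return 0.0
    return sum(
        max((dice(la, lb) for lb in leaves_b), default=0.0) for la in leaves_a
    ) / len(leaves_a)


def symmetric_best_dice(pred: List[Mask], gt: List[Mask]) -> float:
    """SBD: the minimum of best Dice computed in both directions."""
    return min(best_dice(pred, gt), best_dice(gt, pred))


def difference_in_count(pred: List[Mask], gt: List[Mask]) -> int:
    """DiC: predicted leaf count minus ground-truth leaf count."""
    return len(pred) - len(gt)


if __name__ == "__main__":
    gt = [{(0, 0), (0, 1), (1, 0), (1, 1)}, {(3, 3), (3, 4)}]  # two true leaves
    pred = [{(0, 0), (0, 1), (1, 0)}]                          # one predicted leaf
    print("SBD:", symmetric_best_dice(pred, gt))
    print("DiC:", difference_in_count(pred, gt))
```

In this toy case the single predicted leaf matches the first true leaf well, but the missed second leaf pulls the ground-truth-to-prediction direction down, which is why SBD takes the minimum of the two directions.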

Threshold-based segmentation
Thresholding is the oldest and most common method in image processing for isolating an object from an image, and it is fast and easy to implement. Before 2017, it was the method most used for segmentation; a few such methods are discussed here. Using the Oxford flower color dataset and the Lab color space, Otsu thresholding-based segmentation of flowers is performed. It improves time efficiency and results compared with a previous study [47]. It reports results in terms of the mean overlap score on 13 different types of flowers [48]. The visual results of some thresholding techniques taken from the cited studies are shown in Figure 3.
Figure 3. Four samples taken from previous studies [47][48][49][50] that used thresholding methods for object segmentation.
Jujube leaf images are segmented using Canny edge detection and Otsu thresholding, with an optimization and mapping function also used to segment real-life video or images of the jujube plant. The same method is suggested for segmenting other plant types [49]. Otsu thresholding in the hue, saturation, value (HSV) and YCbCr color spaces is applied to mango plant leaves, and the results are compared using precision, recall, and F1-score. The study suggests that the Cr component is the most suitable channel for mango leaf image segmentation [50]. Another method for mango leaf disease recognition and segmentation is proposed using DL, in which shape, color, and texture features are extracted from pre-processed images [51]. Similarly, another study [52] proposed a fully connected convolutional neural network (CNN) model for mango leaf disease detection, with accuracy reaching 99.2%. A local contrast haze reduction method is applied to pre-process the images, which are then fed to the proposed model. The discussed studies apply manually calculated values to different kinds of images for segmentation, relying mainly on thresholding techniques and a range of evaluation metrics.
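As an illustration of the Otsu thresholding used by several of the studies above, the following pure-Python sketch selects the threshold that maximizes the between-class variance of a single channel. It is a generic textbook formulation, not the code of any cited study; the helper `rgb_to_cr` assumes the full-range BT.601 (JPEG) definition of the Cr channel, and both function names are hypothetical.

```python
from typing import Sequence


def rgb_to_cr(r: int, g: int, b: int) -> int:
    """Cr channel of YCbCr, assuming the full-range BT.601 (JPEG) conversion."""
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return min(255, max(0, round(cr)))


def otsu_threshold(pixels: Sequence[int], levels: int = 256) -> int:
    """Return t maximizing between-class variance; class 0 is pixels <= t."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in class 0 so far
    sum0 = 0  # sum of intensities in class 0 so far
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, so the variance is undefined
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mean0 - mean1) ** 2
        if between > best_var:  # strict '>' keeps the lowest optimal t
            best_var, best_t = between, t
    return best_t


if __name__ == "__main__":
    # Toy channel values: a dark background group and a bright object group.
    channel = [10, 12, 11, 13, 200, 205, 198, 202]
    t = otsu_threshold(channel)
    mask = [p > t for p in channel]
    print("threshold:", t)
    print("foreground mask:", mask)
```

For a leaf image, the same function would be applied to whichever channel separates the leaf best, for example the Cr values produced by `rgb_to_cr` for every pixel.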

Clustering-based segmentation
A clustering-based time series analysis is performed on the public Panicoid Phenomap-1 dataset. The stem angles of maize plants are analyzed over their growth cycle. The temporal patterns are summarized into three main groups, and the temporal variation in stem angle is reported to be regulated by genetic variation [53]. Kiwifruits in day and night images are clustered to count the fruits and segment them from the background [54]. Adjacent fruits are distinguished and counted using the lines between calyxes. Fruit calyxes are detected with 93.7% accuracy in daytime images and with 92% accuracy in nighttime images taken with flash.
To meet the needs of the perfume industry and of medicinal herb production, jasmine flowers must be counted in the field, which is labor-intensive. Researchers have therefore proposed image processing-based segmentation and counting for such challenging tasks. Density-based spatial clustering of applications with noise (DBSCAN) uses neighborhood density information to iteratively cluster and segment the jasmine flowers [55]. Some clustering-based image segmentation results are shown in Figure 4, with samples taken from the cited studies [56][57][58]. Segmentation of the defective parts of apple plants using the k-means method is proposed, which clusters the defective part in several steps. Pixels are first clustered using color features and spatial information, and the clustered parts are then merged into regions. The method reduces the per-pixel clustering time and is shown to be promising for segmenting the defective parts [57]. Predicting disease in plants is necessary to avoid yield loss; wheat production, for instance, is affected by wheat diseases. Three types of wheat disease, stripe rust, powdery mildew, and leaf rust, are segmented by converting RGB images into the Lab color space. The results show that these wheat diseases are segmented with 90% accuracy [58]. Another clustering method, based on a regularized sub-space approach, is adopted for hyperspectral images [59]. Three datasets, namely Indian Pines, University of Pavia, and Salinas, are used. These wide-area land images are segmented into trees, meadows, bricks, and so on. The method adds regularization that incorporates spatial information into the sub-space method and thereby improves its performance. It achieved an overall accuracy of 99%.
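The k-means idea behind the defective-part segmentation described above can be sketched as follows. This is a minimal Lloyd's-algorithm sketch in pure Python that clusters raw RGB pixel colors only; the cited study also uses spatial information and a merging step, which are omitted here. The function names and the choice to pass initial centers explicitly (to keep the run deterministic) are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Color = Tuple[float, float, float]


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two color vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: Sequence[Color], init_centers: Sequence[Color],
           iters: int = 20) -> Tuple[List[int], List[Color]]:
    """Lloyd's algorithm: alternate assignment and center update until stable."""
    centers = list(init_centers)
    labels = None
    for _ in range(iters):
        # Assignment step: each pixel goes to its nearest center.
        new_labels = [
            min(range(len(centers)), key=lambda k: sq_dist(p, centers[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments no longer change, so the centers are stable
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous center
                centers[k] = tuple(sum(ch) / len(members) for ch in zip(*members))
    return labels, centers


if __name__ == "__main__":
    # Toy pixels: three greenish "healthy" colors and two brownish "defect" colors.
    pixels = [(30, 160, 40), (35, 150, 45), (28, 155, 50),
              (140, 80, 30), (150, 90, 35)]
    labels, centers = kmeans(pixels, init_centers=[pixels[0], pixels[3]])
    print("labels:", labels)
    print("centers:", centers)
```

Reshaping the label list back to the image grid gives a segmentation mask; in practice a perceptual color space such as Lab, as used in the wheat disease study, tends to separate lesions from healthy tissue better than raw RGB.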

DL-based segmentation
The literature shows that DL-based segmentation has been adopted mainly since 2016. DL-based image segmentation solutions are used for biomedical, natural, and plant images. These solutions are promising because they are trained on large amounts of data, which makes them more reliable than other methods.
A study proposed a solution to the segmentation and leaf counting challenges using deep deconvolutional and convolutional methods. Segmentation is first performed with a deep deconvolutional network, and counting is then performed with a deep convolutional network. Previously published datasets are used for evaluation, and the absolute count difference is reported with a mean of 1.6 and a standard deviation of 2.30 [60]. Some DL model-based segmentation results are shown in Figure 5 [62][63][64][65][66][67][68][69]. A custom branch method proposed an instance segmentation solution for wheat disease recognition and localization. It used one public dataset and a self-collected wheat disease dataset. Two models, VGGFCNVD-16 and VGG-FCN-S, achieved mean accuracies of 97.95% and 95.12%, respectively. The study also developed a mobile app for localization [63]. Like that study, which provided solutions to the recognition and localization challenges along with a dataset, another study provided solutions and a dataset with ground-truth labels. The dataset is named Dense Leaves. A pyramid CNN approach is proposed to detect interior texture. The detected boundaries are then used to estimate the overall shape with a watershed algorithm. The reported results are promising for leaves in dense foliage [64]. To handle limited-data challenges, different data augmentation techniques are used. A deep semantic segmentation network is proposed that is trained on a combined dataset of synthetic and real augmented plant phenotyping images. The study tested on five datasets and reported results on the Leaf Segmentation Challenge (LSC). It achieved 91% accuracy on the A1 test set of the CVPPP LSC [68].
Using the same Computer Vision Problems in Plant Phenotyping (CVPPP) 2017 LSC, two methods are proposed for the counting challenge. The first counts leaves by direct regression, whereas the second predicts leaf centers with a deep convolutional network and then counts them. The approaches are named Multi-Scale Regression (MSR) and Detection + Regression (D + R), respectively. The DiC and absolute DiC (ADiC) are reported and compared with the results of previous studies [62]. U-Net, a well-known encoder-decoder architecture for semantic segmentation, is also used for Arabidopsis leaf segmentation with watershed post-processing. The method is tested on three datasets and achieved Dice coefficients of 95% and 97% using synthetic data generation. Real RGB images are converted into fluorescence images to increase the number of training and testing samples [66]. A deep semantic segmentation approach is used to locate, count, and detect plants. The approach is transferable and adaptable, and all its parameters are learned end to end from the training data. The framework is reported to detect plants of different shapes and sizes. It was initially applied to a grapes dataset [61]. Another study uses a deep semantic network trained on augmented data to segment images. It also uses the iterative linear clustering method to generate superpixel test patches. These superpixel patches assist the deep counting model in quantification. Wheat spike data are used in this study, and the number of samples is increased using data augmentation [65].
To meet the challenge of training on large data, some researchers have also proposed adversarial data generation methods. One study proposes a rosette image data generator that produces realistic synthetic data for the CVPPP 2017 dataset. It also used the Ax dataset, which contains artificially produced plant images, to improve its results [67]. A similar approach to data generation using conditional generative adversarial networks is proposed. It also uses a semantic network to report the ADiC and DiC with standard deviations. It claims that the leaf counting error is reduced by 16.67% on average by using these generated training images to address the scarcity of annotated data in plant phenotyping [69]. Unmanned aerial vehicle (UAV) images for yellow rust detection in wheat crops have been fed to a deep convolutional neural network. The model includes multiple Inception-ResNet layers and forms a wide and deep network. It achieved an accuracy of 0.85, and the study suggested that combining spatial and spectral information improves detection [70]. Symptom-based disease detection from plant images could play an important role in crop yield. However, the limited availability of annotated data reduces model performance. Synthetic generation of fluorescence images has been proposed to remove this limitation, and a U-Net model is trained to segment the diseased part of the images. Tested on an empirical fluorescence dataset, the model shows 0.73 precision and 0.79 recall [71].

Other methods-based segmentation
Before 2017, edge detection, watershed [72], and region-based techniques were used for multiple purposes, including leaf segmentation and counting for different plants. Fuzzy numerical morphological operations with edge detection are proposed for the leaves of tobacco plants. However, the method lacks edge continuity and risks misdetecting the edges of adjacent leaves [73]. Similarly, a Canny edge detection-based approach is proposed using the Oxford Flowers dataset. It also performs poorly when an image contains many edges [74]. Another method based on Canny edge detection is proposed using an orange fruit dataset. It likewise has the disadvantage of detecting too many edges in a single image. A comparative analysis of color- and edge-based segmentation methods is also performed on the same plant images [75]. Automatic segmentation of the same orange images is proposed using shape and color analysis, together with a method to measure the overall yield [76]. The watershed method is also used extensively to segment and count different plants. It is a complex algorithm but gives good results on complex images. Moreover, it works well for images with sharp contrast differences between objects [77]. For apple images, another method improves segmentation performance using the watershed method [78]. The different segmentation methods with their evaluation measures and results are shown in Table 2.
Images in RGB are converted into the L*a*b color space, which is proposed as a good way to discriminate between foreground and background. A further step separates the leaf center points, and the leaf segments are then separated by applying the split lines method. This study also uses the LSC dataset with the same subsets but reports that the tobacco class is segmented less well than the Arabidopsis plants. It also suggests using shape adjustment components to improve the segmentation results [79]. A watershed method with a stem linking algorithm is used to obtain the leaf count after segmentation. It uses the CVPPP 2014 LSC dataset with two subsets, A1 and A2, for segmentation [85]. Another method uses RGB-to-HSV color space conversion and histogram quantization-based thresholding for segmentation, which helps to obtain the leaf count. It uses the same LSC dataset with three classes, A1, A2, and A3 [80]. Another study uses the same LSC dataset with an 8:1:1 split into training, testing, and validation sets. It uses Mask R-CNN for leaf instance segmentation to segment and count the leaves. The reported evaluation measures are DiC, |DiC|, and SBD [86]. A two-step approach for leaf segmentation is applied to the LSC dataset using three subsets: A1 (Arabidopsis thaliana), A2 (an Arabidopsis thaliana variant), and A3 (tobacco). Transfer learning with a fully convolutional network (FCN) is used for segmentation. In the first step, the network is trained on the major set of plants, and in the second step it is adapted to the minor dataset by transfer learning. Precision, recall, and F-measure are reported for various numbers of images and adaptation methods [81]. A study proposes a customized CNN architecture with varying patch sizes, which first performs edge detection.
It uses time-lapse images of plants as input to the CNN. Multiple experiments are run: binary vs. four-way, varying patch sizes, and single-image vs. patch-variant approaches. CNN and random forest classifiers are finally used with a patch size of 12 [87]. Researchers have proposed many data augmentation techniques in recent years. One of them used single leaves at various angles as augmentation objects on a transparent background. This technique differs from previous approaches, which used synthetic data generation; it instead uses a proposed collage method. It is one of the stronger segmentation contributions, achieving a mean best Dice score of 86.7% on the A1-A5 datasets [88]. Another method used U-Net with a proposed architecture to segment the A1-A4 class data. It achieved very good accuracy, Jaccard index, and Dice loss but did not report the SBD, DiC, and |DiC| measures [82]. From the aforementioned studies on leaf segmentation and counting, it is observed that techniques before 2017 or 2018 were mostly based on thresholding, clustering, histogram quantization, 3D histograms, distance mapping, and other graphical methods. From 2017-2018 onward, most studies used DL methods and included more informative metrics, which improved performance on each plant phenotyping task. Some techniques also use traditional feature extraction and classification methods, which are discussed in the coming section.

Features extraction
Feature extraction techniques are used to extract useful information from images. Classifiers use the extracted features to classify the image data into separable classes. Similarly, many plant species are classified using various feature extractor and descriptor techniques. These features are hand-crafted and use techniques like gradient, intensity, texture, and other geometrical formulation methods. A multilayer perceptron method using visual hand-crafted features is proposed for classifying wheat grain into bread and durum. A total of 21 features are extracted from 12 main features to increase the diversity of distinguishing features. The model was trained on 180 grain inputs, whereas it was tested on only 20. 89 Another method using the same dataset and classes was proposed, using nine different reproduced features from the texture, color, and dimensional domains. It claims 99% accuracy with 100% correct detection of wheat grains using an adaptive neuro-fuzzy inference system (ANFIS). 90 Deep features are an increasingly popular fundamental descriptor for distinguishing objects, and fusion and selection of deep features is also performed for fruit classification. 91 Similarly, feature fusion based on deep features and selection is performed using partial least squares (PLS) regression for crop disease classification. 92 Seed image recognition is proposed using color and morphological features. This study is performed to analyze the systematic positions of the Malva alliance. The taxonomic genus organization is observed at the section and species levels. 93 Joint features are used for cucumber leaf disease detection. It proposed a robust, dimensionality-reducing approach for feature extraction. 94

Another study proposes a global and local texture feature approach using histogram-level fusion. It uses the scale-invariant feature transform (SIFT) and other features, including mean and standard deviation. These visual features are called a bag of visual features, and high-resolution remote sensing images are used for scene classification. 95 Similarly, the feature fusion method is applied to grape leaf 96 and cucumber leaf 97 diseases, including salient, deep, and canonical correlation analysis-based feature extraction methods. A linear discriminant method with stepwise selection is used to classify the species of the genus Cistus. Different colorimetric and morphological variables, including mean weight, size, shape, and colors, are used; 137 values or features are used for this purpose. 98 A new feature named venation, which consists of the leaf vein patterns responsible for food and water transport, is proposed. It uses different combinations of venation features and shows that these features perform better than outline shape. It claims that DL highlights many hidden features of leaf images. 99 Novel feature selection approaches are applied to different fruit diseases using a cascaded design, 100 whereas an entropy-ranked feature selection approach is also applied to segmented fruit disease images, which enhances the performance of classifiers. 101

Table 2. Different segmentation approaches and their results
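As a minimal sketch of the hand-crafted descriptors discussed in this section, the following computes per-channel color statistics (mean and standard deviation) and two simple shape measures for a segmented region, then concatenates them into one vector (serial feature fusion). The function names, the nested-list image format, and the particular choice of features are illustrative assumptions; none of the cited studies is reproduced here.

```python
from statistics import mean, pstdev


def color_features(pixels):
    """Per-channel mean and standard deviation of a list of (r, g, b) pixels."""
    if not pixels:
        return [0.0] * 6
    feats = []
    for ch in range(3):
        values = [p[ch] for p in pixels]
        feats += [mean(values), pstdev(values)]
    return feats


def shape_features(mask):
    """Area and bounding-box aspect ratio (width / height) of the non-zero region."""
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not coords:
        return [0.0, 0.0]
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    height = max(rows) - min(rows) + 1
    width = max(cols) - min(cols) + 1
    return [float(len(coords)), width / height]


def fused_features(image, mask):
    """Concatenate color and shape descriptors of the masked region into one vector."""
    pixels = [image[r][c]
              for r, row in enumerate(mask)
              for c, v in enumerate(row) if v]
    return color_features(pixels) + shape_features(mask)
```

The resulting vector can be passed to any classical classifier. In practice, features on very different scales (pixel area versus color statistics) would usually be normalized before fusion.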

Classification
Classification is performed to identify or recognize the actual species of a plant. Taxonomical classification, physiological state identification, and image analysis-related tasks have been performed using various ML and DL methods. ML classification relies on previously extracted or calculated features that assist the classifiers. DL methods take images directly as input instances and cover the low-, mid-, and high-level features using their structural layers. Both are discussed in the coming sections.

ML-based methods
Different classical classification methods are used in ML, among which the support vector machine (SVM) is the most widely used and has performed well compared to many other ML classification techniques. Many studies in the plant phenotyping domain address multi-class or binary classification problems to recognize the seeds or traits of plants. Different imaging techniques are used for this purpose; for example, one study uses laser scanning and airborne spectroscopy data of different plants. Data limitation is reduced to some extent by using 499 images of 31 different species, different spectroscopy and laser data features, and feature selection to obtain the important features.
The classification is performed using SVM and random forest classifiers, of which SVM performs better than random forest. 102 Image classification and regression are performed using pre-trained CNN and Auto-Keras models, and the lodging score is used as the model evaluation criterion. The classification as lodged or not lodged is performed with both strategies; the pre-trained CNN reached 93.2% accuracy, and Auto-Keras reached a slightly lower accuracy of 92.4%. 103 To supervise the quality monitoring of plants, a study used statistical modeling via the visual perception of images. The spatial structures are taken into consideration to perform complex statistical modeling. An omnidirectional and multi-scale Gaussian filtering method is proposed to describe the spatial structure of grains. The classification problem is solved by a proposed multi-kernel least squares SVM method. The food production quality of rice grains is tested by this method. 104 Many other studies use other ML classification methods, as discussed in the previous section on feature extraction. A summary of different classification methods, based on their feature extraction and classification methods, is shown in Table 3.

DL-based methods
After 2017, DL techniques became widely used for recognition, segmentation, and object detection in medical, natural, and other real-world fields. Recent studies on plant species have likewise contributed to species recognition. 115 One study uses big data of more than 54k images, containing data on 14 crops and 26 diseases. The trained DL model is tested on held-out data using the holdout validation method. It achieves a high classification accuracy of 99.34%. 106 Similarly, another big dataset of tomato diseases contains more than 14k images. CNN deep layer-based feature extraction detects or localizes the disease area. It also achieves a good accuracy of 99.18%. 107 Big data is also used with DL for other plant species to accurately identify the species in testing data captured at other scales, positions, and angles. Similarly, one study reports that its proposed model is robust to image perturbations and highly eligible for real-time deployment. It uses 25k images of plants' biotic and abiotic diseases, which cause plant stress, and achieves 90.3% classification accuracy. 108

Another study used big data of more than 87k images for CNN training and testing. It contains 58 different types of plant disease data covering 25 species. The proposed model achieved 99.53% classification accuracy. 110 Some other studies take multiple pre-trained models and also train them from scratch. One study uses a small amount of data compared to previous studies. It fine-tuned the VGG-16, VGG-19, ResNet-50, and Inception-V3 models. All models perform well, and it is suggested that they be used by training from scratch, whereas the VGG-19 model achieves the highest accuracy of 90.4%.
109 A transfer learning-based CNN model is proposed. It uses various types of plant species for their automatic recognition and is used to detect cassava disease. Class-wise classification results are presented, and an overall accuracy of 93% is achieved. 111 Apple leaf internal identification is proposed using five different categories. It uses 13k+ images to produce more promising results than previous studies. An accuracy of 97.62% is achieved with the proposed CNN architecture. The study also suggests using its proposed CNN instead of the AlexNet model, since it reduces the number of parameters and increases the convergence rate. 112 Wheat crop disease detection uses the wheat-14, 15, and 16 datasets, which total 8k+ images. Full, leaf-mask, and superpixel approaches using ResNet-50 have been used. Various classification accuracies are discussed, and the total balanced accuracy increased from 78% to 84%. 113 A study compared various fine-tuned CNN models using data on 14 species with 38 different categories. It used VGG, Dense-Net, ResNet, and Inception variants with their different layers. The best accuracy (99.75%) was achieved by Dense-Net, which is also reported to take less time and use fewer parameters. 114

Many of the cited studies on plant species mostly use big data and many species. It is also observed that using more data in DL models improves the classification results, whereas ML-based approaches do not use big data and their results are less accurate. DL makes the classification or disease identification tasks more challenging by making big data a prerequisite. However, using many public and private datasets can increase the dataset samples for DL models.
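The studies above report both plain accuracy and balanced accuracy, and the two can diverge sharply on imbalanced disease datasets. The sketch below assumes the standard definition of balanced accuracy as the mean of per-class recall; the function names and class labels are illustrative, and the exact evaluation protocol of any cited study may differ.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class counts equally regardless of size."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)
```

For example, with eight healthy and two diseased samples, a model that finds only one of the two diseased leaves still reaches 90% plain accuracy, while its balanced accuracy is 75%. This is why a high headline accuracy on a skewed dataset should be read with care.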

Challenges and problems
We discussed many new and old research trends on plant phenotyping problems and the solutions provided for them using computer vision. Many studies have been proposed to solve plant phenotyping-related problems, but many challenges remain open. Data acquisition is the most important first step in the computer vision domain. While collecting data, many problems cause inappropriate collection, such as the naturally varying environment of plants, wind, illumination, the spatially varying location of plant traits, and many more. Moreover, when data are collected using visible light or above-ground data collection methods, the complete representation and view from each aspect are not very promising. The below-ground modalities take more time to collect and analyze the soil environment, and measuring individual plants is also costly.
For plant phenotyping-based geometrical and yield production measurement, choosing an appropriate method of image collection is also a challenging task. The challenges include the range and time of measurement, the resolution power, and zooming in and out at some points. The size, geometrical, and morphological measurements will vary based on these factors. Considering the aforementioned problems, we found that a single image acquisition modality may not give promising results. In contrast, if multimodal techniques such as fluorescence, 3D imaging, and visible light imaging are adopted, results could be improved. Furthermore, it is suggested that low-cost, portable, and high-throughput methods be used to analyze the plants' traits more quickly and cheaply. It is further concluded that various image collection techniques for different species perform different kinds of plant phenotyping analyses.
However, many problems remain, including growth analysis, structural analysis, time-based single and multiple shoot analysis, stress analysis, surface analysis, health status monitoring, photosynthesis, and many others, which could be improved effectively using the latest computer vision techniques. The more accurate and more confident results of DL methods urge us to create more labeled data. Some researchers build adversarial methods based on data transformation techniques to create resampled data. Nevertheless, labeled big data creation is still needed for segmentation and classification tasks. The existing studies are species-specific and use limited data, whereas some of them used 3D image reconstruction with DL methods. The data limitation can be addressed in the future using generative adversarial network (GAN)-based methods or by manual annotation of different plant species. Consequently, this will improve the overall performance of plant phenotyping regarding leaf count, time-lapse trait analysis, and other morphological aspects of plants.
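To illustrate the simplest form of data transformation used against data limitation, the sketch below generates the eight flip and 90-degree rotation variants of an image and applies the same transform to its label mask so the annotation stays aligned. This is a swapped-in baseline, assumed here for illustration only; it is neither the collage method nor a GAN-based generator from the cited studies, and all function names are hypothetical.

```python
def flip_horizontal(img):
    """Mirror an image (list of rows) left to right."""
    return [list(reversed(row)) for row in img]


def rotate90(img):
    """Rotate an image (list of rows) by 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def augment(image, mask):
    """Return 8 (image, mask) pairs: 4 rotations, each with and without a flip.

    The same transform is applied to image and mask so labels stay aligned.
    """
    pairs = []
    for flip in (False, True):
        img = flip_horizontal(image) if flip else [list(r) for r in image]
        msk = flip_horizontal(mask) if flip else [list(r) for r in mask]
        for _ in range(4):
            pairs.append((img, msk))
            img, msk = rotate90(img), rotate90(msk)
    return pairs
```

Such geometric variants multiply the number of labeled samples without new annotation effort, but they add no new plants, backgrounds, or lighting conditions, which is why the text still calls for more labeled data or generative methods.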

Datasets
Many public and private datasets and their corresponding metadata are available online. Table 4 lists the challenges these datasets pose, which are to be met using intelligent ML methods. Furthermore, the most famous publicly used datasets of different color spaces are also shown.
Many other datasets can be found at https://www.quantitative-plant.org/dataset, which contains both annotated and non-annotated 2D and 3D datasets.

CONCLUSION
This review comprehensively studied and discussed the shifting paradigms of plant phenotyping methods using computer vision techniques. The data collection methods using various modalities have been discussed in detail, and their limitations and feasibility issues have been summarized. Furthermore, the technical methods using intelligent pre-processing, segmentation, feature extraction, and classification techniques are discussed. Different challenges regarding benchmark datasets have been discussed and addressed by previous studies, but open challenges remain. It is concluded that many DL methods enhance the confidence of the results previously achieved by researchers. Many methods have also been proposed as data limitation solutions, including manual annotation using various tools, data augmentation, and adversarial image generation methods. Moreover, more meaningful segmentation evaluation measures are used to better interpret and solve quantitative measurements in plant phenotyping. Due to environmental variations, recognizing plant categories and diseases, leaf counting, and many more tasks are still challenging. Much research can still be proposed on segmentation methods, data limitation solutions, and more meaningful quantitative measures.

Figure 1. Systematic diagram of the presented survey with basic image processing steps of recognition, including image collection methods

Figure 2. The challenges acquired on different organs of plants (above-ground) using machine learning

Figure 5. Deep learning-based image segmentation methods

Table 1. Comparison with existing surveys

Table 3. Classification results on plant species using ML and DL approaches