Image-Based Detection of Plant Diseases: From Classical Machine Learning to Deep Learning Journey

. Plant disease automation in agriculture science is the primary concern for every country, as the food demand is increasing at a fast rate due to an increase in population. Moreover, the increased use of technology today has increased the e ﬃ cacy and accuracy of detecting diseases in plants and animals. The detection process marks the beginning of a series of activities to ﬁ ght the diseases and reduce their spread. Some diseases are also transmitted between animals and human beings, making it hard to ﬁ ght them. For many years, scientists have researched how to deal with the common diseases that a ﬀ ect humans and plants. However, there are still many parts of the detection and discovery process that have not been completed. The technology used in medical procedures has not been adequate to detect all diseases on time, and that is why some diseases turn out to become pandemics because they are hard to detect on time. Our focus is to clarify the details about the diseases and how to detect them promptly with arti ﬁ cial intelligence. We discuss the use of machine learning and deep learning to detect diseases in plants automatically. Our study also focuses on how machine learning methods have been moved from conventional machine learning to deep learning in the last ﬁ ve years. Furthermore, di ﬀ erent data sets related to plant diseases are discussed in detail. The challenges and problems associated with the existing systems are also presented.


Introduction
The use of technology in the detection and analysis process increases the accuracy and reliability of these processes. For example, the people who use the latest technology to analyze the diseases that arise unexpectedly are at a higher chance of controlling them than those that do not. In the recent occurrence of coronavirus, the world relied on the latest technology to develop preventive measures that have helped reduce the rate at which the disease is transmitted. Crop diseases are a significant threat to human existence because they are likely to lead to droughts and famines. They also cause substantial losses in cases where farming is done for commercial purposes. The use of computer vision (CV) and machine learning (ML) could improve the detection and fighting of diseases. Computer vision is a form of artificial intelligence (AI) that involves using computers to understand and identify objects. It is primarily applied in testing drivers, parking, and driving of self-driven vehicles and now in medical processes to detect and analyze objects [1]. Computer vision helps increase the accuracy of disease protection in plants, making it easy to have food security.
One of the areas that CV has helped most is the detection of the severity of the diseases. Deep learning (DL), a part of the CV, is useful and promising in determining the severity of diseases in plants and animals [2]. It is also used to classify diseases and avoid the late detection of diseases [1]. Plant diseases are slightly different from those that affect human beings. Many factors make diseases similar as well. However, the diseases that can be transmitted from humans to plants and vice versa are rare. The analysis of the data related to this field helps identify how the use of the latest technology can be improved. The images of leaves and other parts of the plants can be used to detect diseases in plants [3]. The technology could be applied in analyzing images in human beings that also prove the presence of diseases and determine the extent of their destruction. This research study is aimed at analyzing the way image-based technology can be used in detecting diseases in both plants and animals.

Background
ML is the technology that allows machines to communicate with human beings and understand their needs. It also makes machines act like human beings and make the decision on behalf of humans. It is one of the areas that have grown fast over the past few years. ML helps in classifying plant diseases. The use of this technology is seen as a significant beginning and achievement in dealing with plant diseases. It has also increased productivity in the field of cultivation. Visualization techniques have also been included in this technology, and it has been improved over the last three years to the current improved levels [4]. The challenges that face the world today, related to the diseases affecting plants and humans, can be reduced if the diseases are identified before they spread to vast areas. The use of ML is widespread in the world today. Diverse methods used in ML and DL help the experts to analyze the plant diseases and know their source in time [4]. The detection of these diseases is affected mainly by several challenges that affect the effectiveness and accuracy of this technology.
The first challenge is the time complexity associated with the use of ML and DL, whereby some of the technologies used in the detection of these diseases are outdated or based on some information from the past. The other challenge is segmentation sensitivity [5]. It means that the region of interest (RoI) requires a high level of accuracy and sensitivity to acquire the required usage and accuracy. The other challenge is that there is a language barrier that affects the way the technology is applied. Another challenge is the inadequate resources that are required to support the application of this technology. Most of the ML and DL activities need many resources to use and implement. Private and government entities usually fund the institutes that use this technology to detect diseases in humans and plants, which could affect the success of the research and implementation of the technology.
The importance of plants in the world has increased over time. The discoveries about the critical roles that plants could play in medicine, energy production, and the recent concerns about the reduction of global warming have for long been a significant part of science and technology [6]. A reduction in the plant cover in the world increases the risk of higher global warming and an increase in the related challenges. The need to build a state-of-the-art convolutional system that supports the image detection technology and classification of plant diseases has led to many research programs to provide the scientists with the required knowledge [7]. Image detection could be applied when necessary to differentiate healthy leaves from those that are not healthy. The convolutional neural networks (CNNs) provide the differences among plant images that help determine the abnormalities that could exist in the plants in the natural environment [6]. The background study shows that the scanning of the images that show the healthy and unhealthy plants forms a basis for comparison by the scientists in this field.
DL can be used to detect abnormalities in both humans and plants. The pixel-wise operations are used to analyze the leaves collected from sick plants, and this is used to classify the diseases according to their impact on the plants. The visible patterns in these leaves are used to decide the diseases that affect the plants and how they can be dealt with to prevent them from spreading. Research shows that the use of DL technology is up to 98.59% accurate [8]. The field of plant pathology has contributed immensely to the control of diseases and reduced global warming. One of the essential background knowledge that guides the use of image detection technology is that the leaves of the infected plants are different from the healthy ones. The leaves are likely to have dark parts, and some may be dry along the edges. The dried parts are also likely to fold up, and this is easy to detect even with a bare eye. The use of ML is to detect these differences without human intervention.
The ML methods used to make decisions on the detection of diseases include Artificial Neural Networks (ANN), decision trees, support vector machines (SVM), and K-means, among others [9]. The computers may not work directly with the images taken in the fields. The images are converted into data that the machines and computers can interpret. It means that the technology requires the coding of the images into data that can be fed into the computer systems. Therefore, the basic knowledge in computer codes and programming forms an integral part of the background knowledge required in this field.
Apart from the aforementioned application of the knowledge in plants and detecting the diseases on time, the imagebased detection and recognition of diseases are applied in humans to detect the diseases that affect different body parts. Some human diseases have significant impacts on the tissues and organs that they affect. Adenocarcinoma of the prostate is one of the most common forms of cancer. It is detected using the image-based detection methods whereby the body is scanned for the abnormalities and the images obtained are used to determine if the patient has the disease. It is the second most diagnosed type of cancer in men, with about one in every nine men diagnosed with it in their lifetime [10]. The use of subjective tissue examination has been the primary way of detecting cancer in men suspected of it. The examination of these body tissues is largely dependent on the Gleason system [11]. The field of AI has offered many solutions to the challenge of inaccuracy in the Gleason scale and how it is used to provide solutions to people who have prostate cancer. The AI solutions are applied in analyzing the prostatectomy specimens to determine the impact of cancer on them. Before and after they have been affected by a disease like cancer, the appearance of the body organs is used to design the image-based detection process that is more accurate than the human eyes. In other words, computer systems can detect the difference in the organs more efficiently than humans. It helps to detect the disease long before they get to a fatal stage.

Wireless Communications and Mobile Computing
As for some plants like Cassava, it is possible to provide some protection to them from common diseases through image-based detection technology. Agriculture depends on some drought-resistant crops like Cassava to regulate the food supply and ensure an adequate supply of nutrients [12]. However, diseases threaten some of these crops' survival, which makes it hard for the agriculture departments to meet their targets. The background knowledge about the diseases could help avoid such instances. The use of CNN generates a reliable platform where the diseases are analyzed in detail [13]. The accuracy of this method is high and reliable. Further research by experts on the diseases that affect plants like Cassava has classified the diseases in terms of their impact on the leaves and other parts of the plants [14]. For example, many diseases affect the part of the leaves, some affect the entire leaf, and others attack the edges or the stalk. The images of the leaves can be analyzed using the imagebased detection system to determine the classification that suits the disease.
In places like sub-Saharan Africa, Cassava is one of the essential foods because it provides people with many carbohydrates. However, due to its vulnerability to viral diseases, it has not effectively sustained the nutritional value it could achieve in this region. In 2014, about 145 million tons of Cassava was harvested in Africa [13]. The food control methods applied in the world today mainly focus on increasing production. The application of the latest technology to detect and control the disease that affects production is a reliable way of reducing the challenges of food inadequacy. Most of the plants are also used as raw materials in many industries. If such plants are of low quality, it means that they will lead to the production of low-standard products.

Literature Survey on Types of Plant Diseases
Fungi usually cause diseases that affect the plants, and they typically attack the leaves. Viral and bacterial pathogens cause many others. Precision in agriculture has improved with the increased use of ML and its related features [15]. The reduced production quantity in agriculture hurts many people and animals, which requires modern technology to solve. The extraction and detection of diseases are easier when the image-based detection system is used because of its high accuracy and reduced complications and duplication of data. In some plants like tomatoes, the use of the images to determine the diseases that affect them and the extent of the damages cannot be achieved unless there is a high accuracy rate [16]. The survey on plant diseases shows that many diverse factors determine how technology-based image detection is applied. In other words, the diseases that cause visible dents and changes on the plants are the ones that can be detected using this technology as opposed to the ones that cause damages that cannot be detected from the plants' images [17]. The analysis in this research shows that plant diseases are usually detected when they start showing an impact on the physical appearance of the plants.
The main challenge affecting the field of agriculture is the reduction in production and poor-quality production in plants. The challenge is a result of the poor detection and management of the diseases that affect the plants. The challenge is also extended to affect human beings in several ways. The reduced plant cover due to plant diseases means that global warming, famine, and reduced air purification ensue. Hyperspectral imaging has become a reliable way of detecting crop diseases on time [18]. It is hard to determine the factors that lead to the diseases unless they are detected on time. In other words, if a disease is detected on time, it is easy to relate it to the possible factors that lead to its occurrence. For example, scientists could determine if there was a change in weather or climate that could have led to the occurrence of the disease.
Further research by [6] shows an inadequate database that could be used to provide background knowledge for comparing the images taken. The other challenge is that the symptoms and characteristics of the diseases are diverse and could be similar to a certain degree [19]. For example, many diseases could lead to the wilting of leaves. The challenge is yet to be resolved because more and new images are uploaded progressively by experts.
The other challenge is the lack of suitable instruments for use in the work of image detection. Most of the experts in the field do not have the equipment they require to analyze the images they get from the field, and this makes it hard for them to acquire accurate data and identify the diseases [20]. The other one is that there is a low rate of implementation in some areas due to the regulations put in place to ensure the credibility and reliability of the data from these analyses. For example, after the 4th and 6th International Conference on Machine Learning and Soft Computing, there have been many regulations that may derail the use of ML in some parts [21]. The rules discourage some of the results from the ML functions from being applied in practice because they do not meet the required parameters.
The technology has been in existence for several years now. However, there are still many issues that have not been clarified about its application. The other challenge is related to this fact. Some of the important images that could help determine if disease exists have not been captured. The other one is that the future perspectives of the research are not clear, and this is because of the increased diversity in the diseases that affect both humans and animals [19]. The application of image-based detection is also affected by the increased diversity in the way the diseases appear. Some of the diseases that used to affect the plants a few years ago have evolved into new forms, and they have different impacts and outcomes. It is difficult for the images to be used alone to conclude the diseases and choose a solution. Some of the solutions used in the past have also become ineffective, reducing the effectiveness of the technology.
The aforementioned challenges show that there are many possible ways in which the image-based detection could be applied, but the challenges reduce its usability. The first solution is to provide adequate data that can be used to identify the diseases accurately without confusing the ones that are closely related. The changes in weather, global warming, and other impacts have led to many diseases that have not been documented. The solution is to increase scientists' coverage and promote a better way of collecting information 3 Wireless Communications and Mobile Computing [22]. The other solution would be to offer training to the scientists in this field to ensure that they are equipped to collect the data. Yet another solution is to create better ways of capturing the data collected about the diseases. The challenge of inadequate information about the diseases can be solved if there is an improved data-captioning process that involves fine details of the images taken and the differences that define them [6]. The images should be analyzed keenly to determine the ones that are affected or infected.
Another solution would be to focus on using the latest technology that is reliable and valid. The confusion that comes with the inadequate database for use in detecting the diseases results from inferior technology and low storage abilities of the existing systems. Most of the images are not stored correctly, which affects the accessibility of the information. It could be solved by the use of modern methods of storing information. For example, the use of cloud computing could help increase the accuracy of storage and accessibility. The other solution is training the people in charge of the research and analysis of the information. A trained DL algorithm increases the accuracy of the technology [23]. The other solution would be to understand the phenotypes used in detecting diseases [24]. The phenotype used in the detection of diseases is usually a result of climate and weather [23,25]. The other solution would be to update the systems to ensure the data captured is up to date. The high level of uncertainty in the detection of diseases affects the way the technology is implemented. For example, the use of Bayesian DL is associated with several uncertainties [26]. It means that this method is unreliable if used alone.
A combination of several methods could reduce the inefficiencies of the processes. For example, the application of DL meta-architectures provides a solution to some of the errors that are experienced when using other methods of identifying diseases [27]. The other solution would be the application of deep convolutional generative adversarial networks that help in the identification and analysis of the images [28]. The contribution of the adversarial networks increases the accuracy of the detection process. The use of CNN methods could also be effective in dealing with the inaccuracy and slow identification processes of the diseases [29]. The methods have been used to detect the diseases that affect rice and have many benefits [28]. Image-based detection requires many resources, and the authorities should ensure they are available to ensure the activities are smooth.

Plant Disease Image Data Sets
The data sets used in the research include the descriptions of the leaves before and after the diseases affect them. The data comprises tables and images of the leaves that are taken in the fields. The data is analyzed and classified in a way that is easy for the readers to understand. For example, Wallelign et al. [9] show the leaves used to determine the soybean plants affected by the diseases. The data set shows healthy leaves and the ones that had dents due to the attack by septorial leaf blight, others by frogeye leaf spot, and those affected by downy mildew as shown in Figure 1.
The images in Figure 1 show that there were visible differences between the leaves affected by the disease and those that had not. The data set was clear and easy to understand. Another form of data was the table that showed the number of leaves that were classified under each disease. The data set is clear and indicates the total number of leaves analyzed and classified into four categories. The other set of data is by [4], which shows the graphical representation of the captured information in the tables.
The data set used in the research can be shown in tables, texts, graphs, and other forms. However, an essential aspect of all of them is the ease of analysis and ease of understanding. Some of the data are also grouped as per the required levels. For example, data can be grouped in terms of the diseases they represent, the time they were collected, or the analysis method [30]. The data captured in some of the research outcomes also show the use of technology and its effectiveness. For instance, the data sets captured using technology allow for a controlled environment; the data sets show the type of control used and its effectiveness. For example, computer-aided diagnosis (CAD) systems were captured in the data analyzed by [31]. The data sets are provided to help understand the usage of this technology and its impact on the quality of the research. The same data shows the classification techniques used and gives reasons for the choice.
The other feature is that the data sets for leaves' analysis are based on the primary data collected in the fields. The reliability of the data is high because it is based on the observable features of the leaves. The data sets are also divided into sections that are easy to understand. For example, the work of [32] shows the divisions of the work in terms of the diseases like Rice Blast (RB), Bacterial leaf Blight (BLB), and Sheath Blight (SB). The use of a PlantVillage data set was also applied in the research by [32]. The data set consists of 54,306 images of 14 different crops representing 26 plant diseases. The images that were included in the data set included leaves having different colors. Figure 2 shows some samples of the PlantVillage data set. The colors indicate the parts of the leaves affected by the diseases that were under investigation. The authors also used the augmented data set proposed by Geetharamani and Pandian [33]. The ImageNet data set was also used in the research, and this led to high-quality research outcomes because of the synergy of combining various methods [32].
The data sets used in the research studies are dependent on the type of information contained. For example, the research by [14] focused on the effects of uncontrolled pests in China and the impacts on the total food produced. The research shows that poor control of pests in China leads to a loss of about 30% of the total foods produced [35]. The data sets in the research are large and show the different production levels and how the pests affect them. The use of a public data set also made it possible to understand the different ways in which the research was done and verified using data that are available in the public domain [36]. The PlantVillage data set was also used in [37] where 14 different types of leaves of cucumber plant are analyzed for seven different diseases. The data sets were mainly combined to provide a good presentation of the data that was collected. The use of the PlantVillage 4 Wireless Communications and Mobile Computing data set by [38] was used to show how the collected information was helpful in understanding plant diseases. The data sets used in most references were from the data collected by different researchers and combined in one set. The reliability of the research could be compromised if there is no control over what is to be included in the data sets [39].
The nature of the research also determines how the data sets are used. For example, the collection of leaves and combining them in a table usually involves using the PlantVillage data set. The use of coffee leaf data sets in the research by [40] is backed by the need to show the diseases that affect the coffee leaves and tomatoes and how they can be detected using

Wireless Communications and Mobile Computing
image-based detection methods [36]. In the research by [41], the authors used an information-rich color data set and large numerical data sets to display the data collected. The machine learning process also employs the training data set to predict and analyze unseen data. The data sets also include the expected data, making it easy to rely on the data in the research and determine if the data is valid. The use of data sets related to the nature of the research helps achieve the goals of the data collection and analysis [42]. The aforementioned data sets are affected by some challenges and limitations that reduce their applicability.
Another well-known data set used by the research community is Northern Leaf Blight (NLB), which contains infected maize leaves [1]. Some sample images from the NLB data set are shown in Figure 3. NLB consists of 1787 images having 7669 lesions. The images were obtained from maize plants in the field while using a handheld camera. The images in NLB were captured in uncontrolled conditions as opposed to the PlantVillage data set.

Challenges and Problems with the Data Sets.
The explored data sets have several challenges and problems that affect their application. One of the challenges is how the data sets are organized to show the data collected from different fields. For example, when using the PlantVillage data set, the data collected from different fields could be hard to pres-ent in the same way. Most of the data may be diverse. The missing uniformity could lead to a challenge in how the data is shown in the tables and other exhibits [43]. The other problem is that the data sets may be hard to convert into graphical representations. The information in the data sets could make more sense if the information is shown in graphical formats [44]. The data could also make more sense to the readers when presented in graphs. However, some of the data sets cannot be transformed into graphs directly and may require redoing the graphs entirely. For the data sets that include tables, it is possible to convert the data into graphs without having to draw manually. The other challenge is that some of the data sets do not summarize the data contained [45]. For example, when data is presented in the PlantVillage, a summary of the total value is given at the end of the table. However, in some data sets, this information is missing.
The other challenge with the data sets is that they may not capture all the information about the factors that lead to the values shown in them. For example, the data presented in work by [27] does not show the factors that affect the classification and detection process. Such missing details could affect the way the data is applied in making decisions about the diseases [46]. The data sets could also be complex for people to use if they do not have some prior training and education related to the statistical analysis of the data. The data sets also contain some details that may be inconsistent with the  Wireless Communications and Mobile Computing research. For example, the data sets could contain information about the classification of the diseases and fail to capture the prevention measures that could be used to reduce the chances of having the diseases affecting the plants [47]. In some instances, the data sets are unclear, which could confuse the people who interpret them. Some of the data could show information about the diseases that affect coffee or rice and fail to contain the exact impact on production [20]. The exact impact on the quantity and quality of the product should be captured in such research. The other challenge is that some of the data sets may not allow the use of mobile gadgets [13]. Their complications make it hard to analyze their contents using mobile devices because of a higher screen resolution requirement.

Feature Extraction for Disease Identification
The images of the plants have three key features, namely, color, shape, and texture. Compared to color and texture, the shape feature cannot help find the plant's diseases [48]. Hlaing and Zaw [48] classified tomato plant disease using a combination of texture and color features. They used the Scale Invariant Feature Transform (SIFT) to find the texture information, containing details about the shape, location, and scale. Similarly, they gathered the color details from the RGB channel. Dandawate and Kokare [49] developed an approach for the automatic detection of diseases in soybean plants. They converted the image from RGB to HSV (Hue Saturation Value) color space. Color and cluster-based methods were employed for segmentation. The SIFT method was used to detect the type of plant, based on the shape of the leaf.
Pydipati et al. [50] identified the citrus disease using color texture features along with discriminant analysis. They also employed the color cooccurrence method (CCM) to determine if hue, saturation, and intensity (HSI) color features and the statistical classification algorithms could help identify the diseased leaves. Their method achieved an accuracy of more than 0.95.
Al-bayati and Üstündağ [51] extracted only the area of the leaf affected by the disease. Furthermore, they used feature fusion which helped in feature reduction.
Image-based detection requires many resources, and the authorities should ensure they are available so that the activities are smooth. DL in general and CNN in particular have been developed to analyze multidimensional data such as images. The underlying model is based on the multilayer ANN. Nevertheless, a convolutional layer performs kernel operations over various areas of the provided image. The obtained representation is independent of the operations such as translation or rotation. These kinds of features have been proved to work better as compared to the traditional features earlier used in the detection of plant diseases [2].

Comparison of Performance and Result Discussion
Our current research work covers state-of-the-art plant disease recognition using AI in the last five years. We summarize a series of observations that emerge from this work in the following paragraphs:   [22,[52][53][54][55][56]. Due to the small database size problem, a large portion of the data set is used for the training phase in most of the deep learning methods. However, very few exceptions are there, for example [57][58][59]. Furthermore, the available database images are collected in very constrained environmental conditions. We believe that images must be gathered in real-world conditions to make the algorithms more practical. Efficient image acquisition of leaf images is the need of the hour. If these images are captured in real-time scenarios, such databases would be warmly welcomed in the research community. In most of the recently reported works, the images captured with smart mobile devices are gaining popularity. Some single-click image systems are also introduced, but much more is supposed to be done by the researchers to automate plant disease identification algorithms. The transition of image-capturing systems to smart devices may help overcome serious issues related to database size.
(ii) Issues with Available Feature Extraction Methods.
Performing the tasks of preprocessing, feature extraction, and segmentation plays a key role in developing a machine learning-based algorithm.
Selecting the most suitable method for preprocessing and segmentation further depends on the nature of the data set. Among many techniques, one that is most suitable for a specific acquisition usually serves the purpose. We observe variability span in the reported algorithms so far under different modules. We observe somehow similar observations for various feature extraction techniques. In a nutshell, the standardization of the reported methods is yet to be fixed and achieved.
(iii) Difficulties in Classification Module. Plant disease automation and detection is an active area of research for a long time. Considering very few images for training and testing, highly acceptable results are reported by researchers. Many classifiers are explored by researchers in this domain. This study concluded that backpropagation neural network, SVM, and discriminant analysis (particularly linear) perform much better than others. These are then followed by Naïve Bayes, random forest, knearest neighbor, and multilayer perceptron. However, state-of-the-art results are much improved with recently introduced optimized deep neural networks. More proper utilization of deep convolutional neural networks can help in improving the results for large data sets.
(iv) Limitations of Available Systems. We argue that image analysis methods are comparatively better than the techniques that visually rate the severity of a particular disease. However, systems which are designed using these imaging techniques are not perfect. The performance of a system highly depends on the quality of the training data. In plant disease automation, it is the training images and certain extracted features, which significantly affect the performance of a system. A system trained with good quality data is trained well. However, most of the existing systems have a specific set of requirements needed to be fulfilled for a system to perform accurately. If some of these constraints are not fulfilled, the system may give inaccurate results, ultimately leading to wrong disease detection. For example, most of the DL-based methods particularly and conventional machine learning methods generally are faced with the problem of overfitting. Researchers must think of adaptive systems which are designed with more flexible requirements. Additionally, some generalized methods should be adapted which work in heterogeneous environments. For improving efficiency, in-depth knowledge of the methods and proper usage of the tools are also necessary.  Table 1.

Conclusion and Future Works
In this paper, we have discussed how ML in general and DL in particular have helped identify the diseases in plants. If the diseases are not correctly identified, they affect the crop yield and ultimately result in long-term issues, such as global warming and even famine. The proposed work summarizes multiple studies regarding plant disease automation and identification through different ML methods. The proposed manuscript also shows well-acceptance of a range of CV methods in this domain, making it a wide area of research to be explored in the near future. Some points are summarized in the following paragraphs, which may help to improve and enhance the current state-of-the-art and give researchers some potential ideas to further explore the field in the future.
(i) Disease Stage Identification. Disease stage identification is one of the main areas to be explored regarding plant disease identification. Each disease has several stages. Most of the researchers only focused their work on the type of disease identification, but none of these works target a particular disease stage identification. Additionally, such systems must possess the capability to suggest some measures depending particularly on specific disease stages. Identification of disease forecasting will help agriculturists to take proper actions and precautions to reduce damage percentage.
(ii) Quantification of a Disease. Another exciting area to be explored is the quantification of a particular disease. Although there has been much work done in this field, very few researchers have identified the extent of the damage caused by the disease. They can help a lot since remedial actions could be taken according to the severity of the disease. Such kind of quantification will detect the infected proportion of a particular culture with some disease. This research perspective is vital as the number of pesticides can be controlled. Normally, the farmers apply chemicals to cure diseases without any prior analysis or quantification. Such kind of practice is extremely harmful to human health. Developing an efficient image processing application will help to determine if specific chemicals are needed or not.
(iii) Mobile and Online Applications. Literature reported several solutions regarding applications of disease identification. However, few of the portals and mobile applications are publicly available and 10 Wireless Communications and Mobile Computing accessible online. Some of these applications are Assess Software and Leaf Doctor, which are publicly available for use. However, these applications work on leave images with a flat and only black background. Therefore, such kinds of online systems and applications are highly needed for plant diseases' recognition. The availability of these types of software will help farmers to identify a particular kind of disease. Such kinds of software may be used to get analysis reports which can be sent to a disease expert for getting some suggestions.
(iv) Exploring Transfer Learning to Increase Data Size.
Similarly, noting the current trends of the developments in CV, which are moving very fast towards DL methods, for plant disease detection is not very satisfactory. Given the difficulty of the data, particularly the training stage, the best option to be explored is transfer learning. For investigating knowledge transfer, a heterogeneous domain strategy may be adapted. Considering automatic plant disease identification, the keywords which may be explored are LSTMs, optical flow frames, temporal pooling, and 3D convolution. The last point that must be remembered is that better and carefully engineered methods are needed for further exploring this area. For example, the case of data augmentation may be further investigated.

Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.