Plant-Leaf Disease Prediction Using Deep Learning
https://doi.org/10.22214/ijraset.2023.49338
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue III Mar 2023- Available at www.ijraset.com
Abstract: Brown spot, mosaic, grey spot, and rust all significantly reduce apple yield; rust is treated here as a sign of foliar
disease. The occurrence of apple leaf diseases is the primary factor limiting apple output and causes significant economic losses
every year, so research on identifying apple leaf diseases is important. Plants are frequently attacked by pests, bacterial
diseases, and other microorganisms, and the signs of attack are usually identified by inspecting the leaves, stem, or fruit.
Powdery mildew and leaf blight are two common plant diseases that can cause severe harm if not treated quickly. In agriculture,
image processing is frequently used for classification, detection, grading, and quality control. Detecting and identifying plant
diseases is crucial, especially when trying to produce fruit of the highest quality. This research addresses the real-time
identification of apple leaf diseases with a deep learning approach based on enhanced convolutional neural networks (CNNs). Data
augmentation and image annotation tools are used to create the foliar disease dataset, which is made up of complex images
captured in the field and in laboratories.
Overall, plant diseases can be identified on a large scale by training machine learning models on the large, publicly available
datasets. The project explains how to identify plant leaf diseases, how they affect plant yield, and which pesticides should be
used to treat them. Research on automatic plant disease detection is crucial for monitoring large plant fields and identifying
disease symptoms as soon as they appear on plant leaves. In this paper, we demonstrate how to identify plant diseases from
photos of their leaves.
Keywords: Rust, CNN, Scab, Foliar Disease, Plant-Leaf
I. INTRODUCTION
Technology improves the precision and reliability of detection and analysis. For instance, those who use modern technology to
study suddenly emerging diseases have a better chance of controlling them than those who do not. India is a developing nation
where agriculture supports around 70% of the people. Farmers can choose from a wide variety of suitable crops and select the
right insecticides for their plants. Plant disease causes a considerable decrease in both the quality and quantity of agricultural
products. Plant disease studies refer to research on visually discernible patterns in plants. Traditionally, a specialist in the
field would monitor and analyze plant diseases manually. Image-processing techniques can instead be used to detect plant
diseases. Signs of disease frequently appear on the fruit, stem, and leaves. The apple leaf is considered here for disease
identification because it exhibits disease signs. This article introduces the image-processing method used to detect plant
diseases. Agricultural diseases pose a serious hazard to human life because they can trigger famines. Where farming is done
commercially, they also cause significant losses. Computer vision (CV) and machine learning (ML) can aid the diagnosis and
treatment of these diseases. Computer vision, a branch of artificial intelligence (AI), uses computers to recognize and interpret
objects in images. It is currently used in medical procedures to detect and evaluate objects of interest, and in self-driving
automobiles for tasks such as driver assessment, parking, and driving.
Using computer vision to improve the accuracy of plant disease protection helps support food security. Pests and diseases damage
crops or plant parts, reducing food production and worsening food insecurity. In addition, little is known about disease and pest
management or control in many less-developed nations. Harmful pathogens, poor disease control, and dramatic climate change are
among the main causes of decreased food production.
In recent years, disease identification methods based on servers and mobile devices have come into use. Automatic disease
recognition on such devices is made possible by several features, including high-resolution cameras, high-performance processors,
and numerous built-in accessories.
The accuracy of the results has been improved by contemporary methods such as machine learning and deep learning algorithms.
Numerous studies on the detection and diagnosis of plant diseases have used machine learning techniques such as random forests,
artificial neural networks, support vector machines (SVM), fuzzy logic, and the K-means method, as well as deep learning models
such as convolutional neural networks.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 121
B. Existing System
Several steps must be carried out to determine whether a leaf is healthy or diseased: preprocessing, feature extraction,
classifier training, and classification. Preprocessing resizes all of the images to a single, uniform size. The next step extracts
features from the preprocessed image using the Histogram of Oriented Gradients (HOG).
HOG is a feature descriptor for object detection. It uses the distribution of intensity gradients to represent both the appearance
of the object and the contours in the image.
HOG feature extraction operates on the local cells generated from the image, which is noted as one of its drawbacks; changes made
elsewhere in the image do not affect a given cell. Three feature descriptors were used in this case.
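The cell-based gradient histogram that HOG relies on can be illustrated with a minimal sketch. This is not the implementation used in the existing system: the function name `cell_orientation_histogram`, the choice of 9 orientation bins, and the tiny 4x4 example cell are assumptions made for illustration, and a full HOG descriptor would additionally interpolate votes between bins and normalize histograms over blocks of cells.

```python
import math

def cell_orientation_histogram(cell, bins=9):
    """Histogram of unsigned gradient orientations (0-180 degrees) for one cell.

    `cell` is a list of rows of grayscale intensities. Border pixels are
    skipped so that the central differences below are always defined.
    """
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    for y in range(1, len(cell) - 1):
        for x in range(1, len(cell[0]) - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]  # horizontal gradient
            gy = cell[y + 1][x] - cell[y - 1][x]  # vertical gradient
            magnitude = math.hypot(gx, gy)
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # Each pixel votes for its orientation bin, weighted by magnitude.
            hist[int(angle // bin_width) % bins] += magnitude
    return hist

# Hypothetical 4x4 cell containing a vertical edge: intensity changes only along x.
cell = [[0, 0, 10, 10] for _ in range(4)]
hist = cell_orientation_histogram(cell)
print(hist)
```

Because the example cell changes only in the horizontal direction, all of the gradient magnitude falls into the first orientation bin, which is how the descriptor encodes the presence of a vertical contour.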
Dataset
Fig. 1 shows sample images from the dataset with symptoms on apple tree leaves: (A) healthy leaf, (B) apple scab, (C) rust, and
(D) multiple diseases. The dataset contains about 3,000 images, divided into a training set and a test set.
Fig 1 Dataset
The first step is to take the leaf image as input. The next step is data preprocessing.
A. Data Preprocessing
Data preprocessing is a crucial step in data analysis, particularly in machine learning and data mining. It is the process of
cleaning, transforming, and preparing raw data so that it is ready for analysis. Raw data collected from various sources may
contain errors, missing values, outliers, and other inconsistencies that must be addressed before analysis. Preprocessing is
iterative: some steps may be repeated several times to refine the data and improve the accuracy of the analysis. Effective
preprocessing is critical to the accuracy, efficiency, and relevance of the results.
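For image data, two typical preprocessing steps are resizing every image to one uniform size and rescaling pixel values. The sketch below shows both on a tiny grayscale image stored as nested lists. It is an illustration under assumptions, not the pipeline used in this work: the helper names `resize_nearest` and `normalize`, the nearest-neighbour method, and the scaling to the range 0 to 1 are all chosen here for simplicity.

```python
def resize_nearest(image, out_h, out_w):
    """Resize a grayscale image (list of rows) by nearest-neighbour sampling."""
    in_h, in_w = len(image), len(image[0])
    return [
        [image[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
        for r in range(out_h)
    ]

def normalize(image, max_value=255):
    """Scale integer pixel values from 0..max_value to floats in 0..1."""
    return [[pixel / max_value for pixel in row] for row in image]

# Hypothetical 4x4 input image with 8-bit intensities.
raw = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [51, 51, 102, 102],
    [51, 51, 102, 102],
]
small = normalize(resize_nearest(raw, 2, 2))
print(small)
```

In practice an image library would perform the resizing with better interpolation; the point here is only that every image leaves preprocessing with the same shape and value range.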
B. Model Preparation
The trained model is stored as an H5 file. An H5 model is a trained deep learning model saved in the Hierarchical Data Format
version 5 (HDF5) file format.
When a deep learning model is trained, it produces a set of weights that capture the learned patterns and relationships between the
input and output data. These weights can be stored in an H5 model file, which can later be loaded to make predictions on new data.
The H5 model file contains the information needed to recreate the trained model, including the model architecture, the values of
the weights, and the training configuration. The file can be easily shared with others, and HDF5 files can be read from many
programming languages and platforms.
HDF5 is a binary file format for storing large and complex datasets, including scientific data, image data, and deep learning
models. The format is designed to store and retrieve large amounts of data efficiently, which makes it a popular choice for
storing deep learning models. To use an H5 model file, it must first be loaded into memory with a library or framework that
supports the format. Keras, including the version bundled with TensorFlow, has built-in support for saving and loading H5 model
files; other frameworks such as PyTorch do not load Keras H5 models directly and require a conversion step. Once the file is
loaded, the model can be used to make predictions on new data.
C. Training Data
We train on the data with a CNN model.
1) CNN: Convolutional neural networks are a chain of neural network layers that extract an image's features from a training
dataset and classify them to provide the desired result. The image data in the dataset is converted to numerical values, which
are used to train the network. The primary benefit of a CNN over its predecessors is that it learns the important features
automatically, without hand-crafted feature design. For image tasks, ConvNets are computationally efficient and more powerful
than classical machine learning techniques. The numerical values are placed into numerical arrays according to the features
identified. Depending on the input, these arrays are fed to various network nodes and passed through many iterations. CNN models
function much like a filter, removing noise and separating out the features of the images, and are used for classification in
many industries that need data to be categorized quickly and reliably.
2) LeNet: LeNet-5, often called simply LeNet, is a convolutional neural network (CNN) architecture developed by Yann LeCun, Leon
Bottou, Yoshua Bengio, and Patrick Haffner in 1998. It was one of the first successful deep learning models for image
recognition, specifically for handwritten digit recognition. LeNet-5 consists of seven layers, including two convolutional
layers, two subsampling layers, and three fully connected layers. The input to the network is a grayscale image of size 32x32
pixels. The first layer performs a convolution on the input image and is followed by a subsampling layer. This pair of operations
is then repeated, resulting in a smaller feature map. The fully connected layers at the end of the network perform the
classification. LeNet-5 was trained on the MNIST database, which contains 60,000 training images and 10,000 test images of
handwritten digits, and achieved an error rate of less than 1% on the test set. The success of LeNet-5 demonstrated the potential
of CNNs for image recognition tasks and paved the way for the development of more complex deep learning models.
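The shrinking of the feature map through the convolution and subsampling stages of LeNet-5 can be traced with a short calculation. The sketch below assumes the standard LeNet-5 settings of 5x5 convolution kernels with no padding and 2x2 subsampling with stride 2; the helper name `output_size` and the layer labels are introduced here only for illustration.

```python
def output_size(size, kernel, stride=1, padding=0):
    """Spatial size after a convolution or pooling layer (square inputs)."""
    return (size + 2 * padding - kernel) // stride + 1

# (label, kernel size, stride) for the first four LeNet-5 stages.
stages = [
    ("C1 convolution 5x5", 5, 1),
    ("S2 subsampling 2x2", 2, 2),
    ("C3 convolution 5x5", 5, 1),
    ("S4 subsampling 2x2", 2, 2),
]

size = 32  # the 32x32 grayscale input
sizes = [size]
for label, kernel, stride in stages:
    size = output_size(size, kernel, stride)
    sizes.append(size)
    print(f"{label}: {size}x{size}")
```

The 32x32 input becomes 28x28, then 14x14, then 10x10, and finally 5x5 before it is passed to the fully connected layers.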
D. Classification
Plant disease detection relies on classifying the extracted features. The features associated with disease are separated out and
compared against those of healthy leaf images to classify the plant. When the leaf is healthy and no disease class is assigned,
the result is displayed as healthy; when a disease is present, which appears as dark spots in the greyscale image, the disease is
classified and displayed together with the confidence level of the classification. Classification compares two numerical arrays:
depending on the dataset provided, the leaf is labeled healthy or diseased according to the class whose array it matches.
Classification is a quick yet important step that produces accurate results and is used to identify plant diseases.
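Reporting a predicted class together with a confidence level can be sketched as follows. This is a minimal illustration, not the classifier used in this work: it assumes the network outputs one raw score per class and that a softmax converts the scores to probabilities. The class order, the function names `softmax` and `predict`, and the example scores are hypothetical; the four class names follow the dataset description.

```python
import math

# Class names taken from the dataset description; the order is an assumption.
CLASS_NAMES = ["healthy", "scab", "rust", "multiple diseases"]

def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]  # subtract peak for stability
    total = sum(exps)
    return [e / total for e in exps]

def predict(scores):
    """Return the most probable class name and its probability."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return CLASS_NAMES[best], probs[best]

# Hypothetical raw scores for one leaf image.
label, confidence = predict([0.5, 0.1, 3.0, 0.2])
print(f"{label} ({confidence:.2f})")
```

A leaf is reported as healthy when the "healthy" class receives the highest probability; otherwise the disease name and its probability are shown.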
E. Feature Extraction
Feature extraction is the process of deriving the most relevant and useful information, or features, from raw data so that the
processed data can be used to train machine learning models or perform other analytical tasks.
In machine learning, the goal is to reduce the dimensionality of the data while retaining the features that are most informative
for the task at hand. This can be achieved through techniques such as principal component analysis (PCA), independent component
analysis (ICA), and linear discriminant analysis (LDA).
Feature extraction can also be used in other fields, such as computer vision and natural language processing. In computer vision,
feature extraction involves analyzing images to identify relevant features such as edges, corners, or textures. In natural language
processing, feature extraction involves transforming text data into a numerical representation, such as a bag-of-words model, that
can be used to train a model.
The extracted features can then be used to train a machine learning model, such as a classification or regression model, to make
predictions on new data. Feature extraction is an important step in the machine learning pipeline as it helps to reduce the
dimensionality of the data, remove irrelevant information, and improve the accuracy and efficiency of the model.
1) Max Pooling: Max pooling is a common operation in convolutional neural networks (CNNs) for feature extraction from images. It
is a form of subsampling that reduces the size of the feature maps produced by the convolutional layers. The operation divides
the input feature map into rectangular regions called pooling windows, which do not overlap when the stride equals the window
size. For each pooling window, the maximum value is taken as the output for that region. The result is a smaller feature map
with reduced spatial dimensions that preserves the dominant features.
Max pooling has several benefits. Translation invariance: max pooling reduces the sensitivity of the network to small
translations of the input image, so the network can recognize the same object when its position shifts slightly. Dimensionality
reduction: max pooling shrinks the feature maps, which reduces the computation and the number of parameters needed in later
layers and helps prevent overfitting.
Robustness to noise: max pooling can suppress noise or small distortions in the input image, because only the maximum value in
each pooling window is kept.
Typical max pooling layers use a 2x2 window with a stride of 2, which halves the height and width of the feature map. Other
window sizes and strides can also be used, depending on the network architecture and task requirements.
Fig 4: Pooling
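The 2x2, stride-2 max pooling described above can be sketched in a few lines. The function name `max_pool2d` and the example feature map are assumptions for illustration; deep learning frameworks provide their own optimized pooling layers.

```python
def max_pool2d(feature_map, window=2, stride=2):
    """Max pooling over a 2D feature map given as a list of rows."""
    rows, cols = len(feature_map), len(feature_map[0])
    return [
        [
            # Keep only the largest value inside the current pooling window.
            max(
                feature_map[r + dr][c + dc]
                for dr in range(window)
                for dc in range(window)
            )
            for c in range(0, cols - window + 1, stride)
        ]
        for r in range(0, rows - window + 1, stride)
    ]

# Hypothetical 4x4 feature map.
fmap = [
    [1, 3, 2, 4],
    [5, 6, 1, 0],
    [7, 2, 9, 8],
    [1, 0, 3, 4],
]
pooled = max_pool2d(fmap)
print(pooled)
```

The 4x4 input is reduced to a 2x2 output in which each entry is the largest value of one non-overlapping 2x2 window.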
B. Output
Fig. 7 above shows the output displayed on the website. After training completed successfully, we uploaded an image as input and
checked the prediction. The model predicted the disease with good accuracy.
V. ACKNOWLEDGMENT
We would like to thank our guide, Mr. V. N. L. N. Murthy, for his help throughout our project. We also thank the administration
of Vardhaman College of Engineering for supporting us in completing the project successfully.
VI. CONCLUSION
Smallholder farmers rely on early and precise identification of crop diseases to avoid losses. A pre-trained convolutional neural
network for apple leaf disease was used, and the end result was a website, deployed online, for detecting plant diseases. The
service is free, and all that is needed to use it is a smartphone and an internet connection. Overall, this study supports the
potential use of CNNs to help smallholder farmers in their fight against plant disease.
The website for detecting apple leaf disease with a convolutional neural network was implemented successfully, with an accuracy
of 0.95, and we learned a great deal from this project.