An Approach For Prediction of Crop Yield Using Machine Learning and Big Data Techniques
An Approach For Prediction of Crop Yield Using Machine Learning and Big Data Techniques
Volume 10, Issue 03, May-June 2019, pp. 110-118, Article ID: IJCET_10_03_013
Available online at http://www.iaeme.com/ijcet/issues.asp?JType=IJCET&VType=10&IType=3
Journal Impact Factor (2019): 10.5167 (Calculated by GISI) www.jifactor.com
ISSN Print: 0976-6367 and ISSN Online: 0976–6375
© IAEME Publication
ABSTRACT
Agriculture is the primary source of livelihood which forms the backbone of our country. Current
challenges of water shortages, uncontrolled cost due to demand-supply, and weather
uncertainty necessitate farmers to be equipped with smart farming. In particular, low
yield of crops due to uncertain climatic changes, poor irrigation facilities, reduction
in soil fertility and traditional farming techniques need to be addressed. Machine
learning is one such technique employed to predict crop yield in agriculture. Various machine learning
techniques such as prediction, classification, regression and clustering are utilized to forecast crop
yield. Artificial neural networks, support vector machines, linear and logistic regression, decision
trees, Naïve Bayes are some of the algorithms used to implement prediction. However, the selection of
the appropriate algorithm from the pool of available algorithms imposes challenge to the researchers
with respect to the chosen crop. In this paper, an investigation has been performed on how various
machine learning algorithms are useful in prediction of crop yield. An approach has been proposed for
prediction of crop yield using machine learning techniques in big data computing paradigm.
Key words: ISTA, IISTA, image restoration, inverse problems, l0 norm, l1 norm, l2
data fidelity term, regularization function, total variation.
Cite this Article: Kodimalar Palanivel and Chellammal Surianarayanan, An
Approach for Prediction of Crop Yield Using Machine Learning and Big Data
Techniques, International Journal of Computer Engineering and Technology 10(3),
2019, pp. 110-118.
http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=10&IType=3
1. INTRODUCTION
Agriculture proves to be a major factor of Indian economy and it involves production of
crops. Crops may be either food crops or commercial crops. Food crops include paddy, wheat,
maize, grams, millets, etc., whereas commercial crops are sugarcane cotton, groundnut,
cashew, etc. The productivity of the crops is significantly influenced by weather conditions
[1]. Hence, accurate yield prediction is a major problem that ought to be addressed. Early
prediction of yield would facilitate the farmers to make precautionary actions to improve
productivity. Early prediction is possible through collection of previous experience of the
farmers, weather conditions and other influencing factors and; store it in a large database.
The common input parameters are rainfall, temperature, humidity, solar radiation, crop
population density, fertilizer application, irrigation, tillage, type of soil, depth, farm capacity,
and soil organic matter. By applying data mining techniques such as prediction, classification
and clustering, early decisions are possible.
regression, , artificial neural network, data mining, support vector machine etc. The expert
system such as fuzzy logic is based on logical rules to predict the yield. However, it requires
wide interaction with the experts for obtaining the rules for prediction. In addition, these rules
are based on certain set of input data [4].
Yield prediction of wheat from using both the spatial and temporal neural network models
is presented [13]. Simulation results proved that the spatial NN model is capable of predicting
the wheat yield with a high accuracy for a specific field area compared with the temporal
models. An artificial neural network (ANN) technique for predicting the production of wheat
using extensive data collected is discussed [14] The ANN model was able to predict wheat
production for various conditions and farming methods through direct and indirect technical
factors with an error margin of ±9% (±0.89 t ha-1).
3. METRICS
From the above section, it is understood that different machine learning algorithms are being
used for prediction of crop yield. Though different machine learning algorithms are available
for use, selection of a particular algorithm is based on the nature of application and accuracy
of prediction algorithm. Prediction accuracy of classifiers is validated by different metrics as
Mean Absolute Error (MAE)[8], Root Mean Squared Error(RMSE)[3],[10], [14] , Mean
Absolute Error(MAE) as discussed below.
In (0), yi denotes the expected output and yi denotes the predicted output for an ith data
and ‘i’ ranges from 1 to N. Low value of MSE indicates that the accuracy of the classifier is
good. For a perfect classifier, the value of MSE would be zero.
It is the standard deviation of the prediction errors. Initially, the difference between an actual
target value and the model output value is calculated. This difference is then squared and
averaged over all data items before the root of the mean value is computed.
Mean Absolute Error (MAE)
Mean Absolute Error(MAE) is computed as the average of absolute differences between
the target values and predicted values. This is expressed as in (3)
1 N
MAE yi yi (3)
N i 1
4. PROPOSED APPROACH
To provide an insight into how different research works have employed various machine
learning algorithms, tools and data sets for predicting the yield of different crops are tabulated
in Table 1.
S.No Research Crop Factors Method Tool Data sets
reference
1 [11] Maize Precipitation, ANN Neural net CRU TS 3.24.01
Maximum temperature, package in
Minimum temperature, R software
potential
evapotranspiration, soil
moisture, land cultivated
2 [5] Pepper, Irrigation water depth, ANN, SPSS, Agricultural production
common Solar radiation, rainfall,MLR, WEKA data from spriter-GIS
bean, corn, temperature, relative SLR, system and weather
potato, humidity regression information data from
tomato tree SMN
3 [12] Rice Precipitation, Minimum, ANN WEKA 1998-2002 Maharashtra
average and maximum government records
temperature,
Evapotranspiration,
Area,
4 [20] Rice precipitation, minimum SVM WEKA publicly available
temperature, average Indian Government
temperature, maximum records
temperature and
reference crop
evapotranspiration, area,
production and yield
5 [21] Rice Soil code, section SVM v2.1 weather observation
thickness, soil software in stations in China (i.e.,
composition entropy, the daily published
Figure 1 Approach to predict the yield of black gram yield with big data tools and techniques
5. CONCLUSION
An extensive study on the crop prediction is made. The literatures reveal diverse machine learning techniques
adopted for prediction of crop yield. Further, the performance metrics of the machine learning algorithms such as
root mean square error are studies. Along with machine learning algorithms for prediction, it is planned to study
the impact of big data techniques in the prediction of crop yield. A conceptual approach is proposed for the
same. The proposed approach is being implemented.
REFERENCES
[1] Food and Agriculture Organization of the United Nations (FAO). World Food Situation:
FAO Cereal Supply and Demand Brief. Available online:
http://www.fao.org/worldfoodsituation/csdb/en/ (accessed on 17 September 2018).
[2] AnnaChlingaryan, SalahSukkarieh, BrettWhelan, “Machine learning approaches for crop
yield prediction and nitrogen status estimation in precision agriculture: A review”,
Computers and Electronics in Agriculture, Volume 151, August 2018, Pages 61-69
[3] S. Veenadhari , Bharat Misra , CD Singh,” Machine learning approach for forecasting crop
yield based on climatic parameters “,International Conference on Computer
Communication and Informatics, 2014, Coimbatore, India
[4] M. A. Jayaram and Netra Marad,” Fuzzy Inference Systems for Crop Yield Prediction”,
Journal of Intelligent Systems, 2012,21(4),pp.363-372.
[5] Gonzalez-Sanchez, A.,; Frausto-Solis, J., Ojeda-Bustamante, W.,” Attribute selection
impact on linear and nonlinear regression models for crop yield prediction”, Sci. World
Journal, 2014.
[6] Kenya Betty J., Sitienei , Shem G. Juma, and Everline Opere,”:On the Use of Regression
Models to Predict Tea Crop Yield Responses to Climate Change: A Case of Nandi East,
Sub-County of Nandi County”, MDPI Sensors, 2017
[7] D Ramesh and B Vishnu Vardhan,”Analysis Of Crop Yield Prediction Using Data Mining
Techniques”, International Journal of Research in Engineering and Technology, 2015,
4(1)
[8] Fortin JG, Anctil F, Parent L, Bolinder MA,” Site- specific early season potato yield
forecast by neural network in Eastern Canada”, Precision Agriculture, Springer,
2011,12(6), 905-923
[9] Ghodsi, R.; Yani, R.M.; Jalali, R.; Ruzbahman, M, “Predicting wheat production in Iran
using an artificial neural networks approach”, Int. J. Acad. Res. Bus. Soc. Sci. 2012, 2, 34–
47
[10] Snehal S.Dahikar, Dr.Sandeep V.Rode, “Agricultural Crop Yield Prediction Using
Artificial Neural Network Approach”, International Journal Of Innovative Research In
Electrical, Electronics, Instrumentation And Control Engineering, 2(1),2014, 683-686
[11] Omolola M. Adisa ,Joel O. Botai, Abiodun M. Adeola , Abubeker Hassen et. al “Application of
Artificial Neural Network for Predicting Maize Production in South
Africa”, Sustainability, 2019, 11(4), 1145
[12] Niketa Gandhi, Owaiz Petkar and Leisa J. Armstrong “Rice Crop Yield Prediction Using
Artificial Neural Networks”, IEEE International Conference on Technological
Innovations in ICT For Agriculture and Rural Development, 2016, 105-110.
[13] William W. Guo and Heru Xue,Crop,” Yield Forecasting Using Artificial Neural
Networks: A Comparison between Spatial and Temporal Models”, Mathematical
Problems in Engineering, Hindawi, 2014Suhas S Athani and CH Tejeshwar, “Support
Vector Machine-Based Classification Scheme of Maize Crop” , IEEE Conference,
2017,84-88.
[14] M. Safa1, S. Samarasinghe, and M. Nejat, ”Prediction of Wheat Production Using
Artificial Neural Networks and Investigating Indirect Factors Affecting It: Case Study in
Canterbury Province, New Zealand”, J. Agr. Sci. Technology, 2015, Vol. 17, 791-803
[15] Suhas S Athani and CH Tejeshwar, “Support Vector Machine-Based Classification
Scheme of Maize Crop” , IEEE Conference, 2017,84-88.
[16] Monali Paul, Santosh K. Vishwakarma and Ashok Verma, “Analysis of Soil Behaviour
and Prediction of Crop Yield using Data Mining Approach”, IEEE International
Conference on Computational Intelligence and Communication Networks, 2015,766-771.
[17] Rohit Kumar Rajak, Ankit Pawar, Mitalee Pendke , Pooja Shinde, Suresh Rathod and
Avinash Devare, “ Crop Recommendation System to Maximize Crop Yield using
Machine Learning Technique”, International Research Journal of Engineering
and Technology, 2017, 4( 12), 950-953.
[18] Y. Karimi, S.O. Prasher , A. Madani and S. Kim,” Application of support vector machine
technology for the estimation of crop biophysical parameters using aerial hyperspectral
observations”, Canadian Biosystems Engineering, 2008, Vol.50
[19] Arun Kumar, Naveen Kumar, Vishal VatArun Kumar, Naveen Kumar, Vishal Vats,
“Efficient Crop Yield Prediction Using Machine Learning Algorithms”, International
Research Journal of Engineering and Technology, 05(06),2018.
[20] Niketa Gandhi, Leisa J Armstrong, RiPtkar, Owaiz Petkar, Amiya Kumar, “Rice crop
yield prediction in India using support vector machines”, International Joint Conference
on Computer Science and Software Engineering, 2016, IEEEexplore
[21] Su YX, Xu H, Yan LJ, “Support vector machine-based open crop model (SBOCM): Case
of rice production in China. Saudi J Biol Sci. 2017;24(3):537–547.
doi:10.1016/j.sjbs.2017.01.024
[22] Sanja Brdar, Dubravko Culibrk, Branko Marinkovic, Jovan Crnobarac and Vladimir
Crnojevic, “Support Vector Machines with Features Contribution Analysis for
Agricultural Yield Prediction”, in the Proc. of Second International Workshop on Sensing
Technologies in Agriculture, Forestry and Environment, 2011.
[23] D.S.Zingade, Omkar Buchade, Bilesh Mehta, Shubhm Ghodekar, Chandan Mehta, “Crop
prediction System using Machine Learning”, International Journal of Advance Engineering and
Research Development, vol. 4, Special Issue 5, Dec 2017, pp. 1-6
[24] Narayanan Balakrishnan, Dr. Govindarajan Muthukumarasamy, “Crop production-
ensemble machine learning model for prediction, International Journal of Computer
Science and Software Engineering, vol. 5, Issue 7, July 2016, pp. 148-153
[25] Rushika Ghadge, Juilee Kulkarni, Pooja More, Sachee Nene, Priya R L, “Prediction of
Crop Yield using Machine Learning”, International Research Journal of Engineering and
Technology, vol. 5, Issue 2, Feb-2018, pp.2237-2239
[26] Athmaja S., Hanumanthappa M, “Applications of Mobile Cloud Computing and Big data
Analytics in Agriculture Sector: A survey”, October 2016