Wheat Yield Prediction in India Using Principal Component Analysis-Multivariate Adaptive Regression Splines (PCA-MARS)
Open Access
- 17 May 2022
- journal article
- research article
- Published by MDPI AG in AgriEngineering
- Vol. 4 (2), 461-474
- https://doi.org/10.3390/agriengineering4020030
Abstract
Crop yield forecasting is becoming more essential now that food security must be assured despite the problems posed by an increasingly globalized community and environmental challenges such as climate change and natural disasters. Crop yield depends on several factors that are related in complex, non-linear ways. To study these relationships, machine learning methods are increasingly being adopted in place of conventional statistical methods. Wheat is a primary staple food crop in India, so forecasting its yield is crucial to ensuring the country’s food security. In this paper, we study the prediction of wheat yield for India overall and for the top wheat-producing states, and compare the results. To accomplish this, we use Multivariate Adaptive Regression Splines (MARS) after extracting the main features by Principal Component Analysis (PCA), considering parameters such as area under cultivation and production for the years 1962–2018. Performance is evaluated with error measures such as RMSE, MAE, and R². The best-fitted MARS model is chosen using cross-validation and user-defined parameter optimization. We find that the MARS model is well suited to India as a whole and to the top wheat-producing states. Comparing yield prediction for India overall with that for individual states, the state of Rajasthan has a better model than the other major wheat-producing states. This research emphasizes the importance of improved government decision-making as well as increased knowledge and robust forecasting among Indian farmers in various states.
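As an illustration of the quantities named in the abstract, the sketch below computes RMSE, MAE, and R² and evaluates a single pair of hinge functions, the building block of a MARS model. It is a minimal sketch and not the authors' implementation: the function names, the knot, the coefficients, and the toy numbers are all hypothetical and are not taken from the paper's data.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def hinge_model(x, intercept, knot, coef_right, coef_left):
    """One mirrored pair of MARS-style hinge terms around a single knot.

    A fitted MARS model is a sum of many such terms; the knot and
    coefficients here are hypothetical, chosen only for illustration.
    """
    return (intercept
            + coef_right * max(0.0, x - knot)
            + coef_left * max(0.0, knot - x))

# Toy observed and predicted values (made up, not wheat yield data).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]

print(rmse(observed, predicted))
print(mae(observed, predicted))
print(r2(observed, predicted))
print(hinge_model(3.0, intercept=2.0, knot=1.0, coef_right=0.5, coef_left=-0.3))
```

In a full pipeline, the inputs to the hinge terms would be principal component scores rather than raw variables, and the number of terms would be chosen by cross-validation, as the abstract describes.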