TY - JOUR
T1 - Integration of maximum crop response with machine learning regression model to timely estimate crop yield
AU - Zhou, Qiming
AU - Ismaeel, Ali
N1 - Funding Information:
The research is supported by the Natural Science Foundation of China (NSFC) General Research (Grant number 41971386) and Hong Kong Research Grant Council (RGC) General Research Fund (Grant number 12301820). The work is a part of PhD research funded by Hong Kong PhD Fellowship Scheme (HKPFS);Natural Science Foundation of China (NSFC) General Program (Grant number 41971386); Hong Kong Research Grant Council (RGC) General Research Fund (Grant number 12301820).
Publisher Copyright:
© 2021 Wuhan University. Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2021/8/6
Y1 - 2021/8/6
N2 - Timely and reliable estimation of regional crop yield is a vital component of food security assessment, especially in developing regions. The traditional crop forecasting methods need ample time and labor to collect and process field data to release official yield reports. Satellite remote sensing data is considered a cost-effective and accurate way of predicting crop yield at pixel-level. In this study, maximum Enhanced Vegetation Index (EVI) during the crop-growing season was integrated with Machine Learning Regression (MLR) models to estimate wheat and rice yields in Pakistan’s Punjab province. Five MLR models were compared using a fivefold cross-validation method for their predictive accuracy. The study results revealed that the regression model based on the Gaussian process outperformed over other models. The best performing model attained coefficient of determination (R2), Root Mean Square Error (RMSE, t/ha), and Mean Absolute Error (MAE, t/ha) of 0.75, 0.281, and 0.236 for wheat; 0.68, 0.112, and 0.091 for rice, respectively. The proposed method made it feasible to predict wheat and rice 6–8 weeks before the harvest. The early prediction of crop yield and its spatial distribution in the region can help formulate efficient agricultural policies for sustainable social, environmental, and economic progress.
AB - Timely and reliable estimation of regional crop yield is a vital component of food security assessment, especially in developing regions. The traditional crop forecasting methods need ample time and labor to collect and process field data to release official yield reports. Satellite remote sensing data is considered a cost-effective and accurate way of predicting crop yield at pixel-level. In this study, maximum Enhanced Vegetation Index (EVI) during the crop-growing season was integrated with Machine Learning Regression (MLR) models to estimate wheat and rice yields in Pakistan’s Punjab province. Five MLR models were compared using a fivefold cross-validation method for their predictive accuracy. The study results revealed that the regression model based on the Gaussian process outperformed over other models. The best performing model attained coefficient of determination (R2), Root Mean Square Error (RMSE, t/ha), and Mean Absolute Error (MAE, t/ha) of 0.75, 0.281, and 0.236 for wheat; 0.68, 0.112, and 0.091 for rice, respectively. The proposed method made it feasible to predict wheat and rice 6–8 weeks before the harvest. The early prediction of crop yield and its spatial distribution in the region can help formulate efficient agricultural policies for sustainable social, environmental, and economic progress.
KW - Machine learning
KW - crop yield
KW - remote sensing
KW - timely forecast
UR - http://www.scopus.com/inward/record.url?scp=85112634571&partnerID=8YFLogxK
U2 - 10.1080/10095020.2021.1957723
DO - 10.1080/10095020.2021.1957723
M3 - Journal article
SN - 1009-5020
VL - 24
SP - 474
EP - 483
JO - Geo-Spatial Information Science
JF - Geo-Spatial Information Science
IS - 3
ER -