Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments
cg.contact | a.kheir@cgiar.org | en_US |
cg.contributor.center | International Center for Agricultural Research in the Dry Areas - ICARDA | en_US |
cg.contributor.center | International Center for Biosaline Agriculture - ICBA | en_US |
cg.contributor.center | Agricultural Research Center, Soil, Water and Environment Research Institute - ARC - SWERI | en_US |
cg.contributor.center | Julius Kühn-Institut - JKI (Germany) | en_US |
cg.contributor.center | Agricultural Research Center, Field Crops Research Institute - ARC - FCRI | en_US |
cg.contributor.crp | Resilient Agrifood Systems - RAFS | en_US |
cg.contributor.funder | Not Applicable | en_US |
cg.contributor.initiative | Excellence in Agronomy | en_US |
cg.contributor.project-lead-institute | International Center for Agricultural Research in the Dry Areas - ICARDA | en_US |
cg.coverage.country | EG | en_US |
cg.coverage.region | Northern Africa | en_US |
cg.creator.id | Govind, Ajit: 0000-0002-0656-0004 | en_US |
cg.identifier.doi | https://dx.doi.org/10.3390/w14223647 | en_US |
cg.isijournal | ISI Journal | en_US |
cg.issn | 2073-4441 | en_US |
cg.issue | 22 | en_US |
cg.journal | WATER | en_US |
cg.subject.actionArea | Resilient Agrifood Systems | en_US |
cg.subject.agrovoc | water use | en_US |
cg.subject.impactArea | Climate adaptation and mitigation | en_US |
cg.subject.impactArea | Nutrition, health and food security | en_US |
cg.subject.impactArea | Poverty reduction, livelihoods and jobs | en_US |
cg.subject.impactArea | Gender equality, youth and social inclusion | en_US |
cg.subject.sdg | SDG 1 - No poverty | en_US |
cg.subject.sdg | SDG 2 - Zero hunger | en_US |
cg.subject.sdg | SDG 8 - Decent work and economic growth | en_US |
cg.subject.sdg | SDG 13 - Climate action | en_US |
cg.subject.sdg | SDG 15 - Life on land | en_US |
cg.volume | 14 | en_US |
dc.contributor | Govind, Ajit | en_US |
dc.contributor | Qureshi, Asad Sarwar | en_US |
dc.contributor | Feike, Til | en_US |
dc.contributor | Rizk, Mosa Sayed | en_US |
dc.contributor | Shabana, Mahmoud Mohamed Abd ElHay | en_US |
dc.contributor | Kheir, Ahmed M.S. | en_US |
dc.creator | Attia, Ahmed | en_US |
dc.date.accessioned | 2023-03-29T19:41:24Z | |
dc.date.available | 2023-03-29T19:41:24Z | |
dc.description.abstract | Crop yield prediction is critical for investigating the yield gap and potential adaptations to environmental and management factors in arid regions. Crop models (CMs) are powerful tools for predicting yield and water use, but they still have some limitations and uncertainties; therefore, combining them with machine learning algorithms (MLs) could improve predictions and reduce uncertainty. To that end, the DSSAT-CERES-maize model was calibrated in one location and validated in others across Egypt with varying agro-climatic zones. Following that, the dynamic model (CERES-Maize) was used for long-term simulation (1990–2020) of maize grain yield (GY) and evapotranspiration (ET) under a wide range of management and environmental factors. Detailed outputs from three growing seasons of field experiments in Egypt, as well as CERES-maize outputs, were used to train and test six machine learning algorithms (linear regression, ridge regression, lasso regression, K-nearest neighbors, random forest, and XGBoost), resulting in more than 1.5 million simulated yield and evapotranspiration scenarios. Seven warming years (i.e., 1991, 1998, 2002, 2005, 2010, 2013, and 2020) were chosen from a 31-year dataset to test MLs, while the remaining 23 years were used to train the models. The Ensemble model (super learner) and XGBoost outperform other models in predicting GY and ET for maize, as evidenced by R2 values greater than 0.82 and RRMSE less than 9%. The broad range of management practices, when averaged across all locations and 31 years of simulation, not only reduced the hazard impact of environmental factors but also increased GY and reduced ET. Moving beyond prediction and interpreting the outputs from Lasso and XGBoost, and using global and local SHAP values, we found that the most important features for predicting GY and ET are maximum temperatures, minimum temperature, available water content, soil organic carbon, irrigation, cultivars, soil texture, solar radiation, and planting date. Determining the most important features is critical for assisting farmers and agronomists in prioritizing such features over other factors in order to increase yield and resource efficiency values. The combination of CMs and ML algorithms is a powerful tool for predicting yield and water use in arid regions, which are particularly vulnerable to climate change and water scarcity. | en_US |
dc.format | en_US | |
dc.identifier | https://mel.cgiar.org/reporting/downloadmelspace/hash/f4251d341be9cbae5c081e1ac25d5366 | en_US |
dc.identifier.citation | Ahmed Attia, Ajit Govind, Asad Sarwar Qureshi, Til Feike, Mosa Sayed Rizk, Mahmoud Mohamed Abd ElHay Shabana, Ahmed M. S. Kheir. (12/11/2022). Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments. WATER, 14 (22). | en_US |
dc.identifier.status | Open access | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.11766/68237 | |
dc.language | en | en_US |
dc.publisher | MDPI | en_US |
dc.rights | CC-BY-4.0 | en_US |
dc.source | WATER;14,(2022) | en_US |
dc.subject | random forest | en_US |
dc.subject | dssat models | en_US |
dc.subject | xgboost | en_US |
dc.subject | super learner | en_US |
dc.subject | lasso regression | en_US |
dc.subject | hyperparameters tuning | en_US |
dc.subject | feature importance | en_US |
dc.title | Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments | en_US |
dc.type | Journal Article | en_US |
dcterms.available | 2022-11-12 | en_US |
mel.impact-factor | 3.530 | en_US |