Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments

cg.contacta.kheir@cgiar.orgen_US
cg.contributor.centerInternational Center for Agricultural Research in the Dry Areas - ICARDAen_US
cg.contributor.centerInternational Center for Biosaline Agriculture - ICBAen_US
cg.contributor.centerAgricultural Research Center, Soil, Water and Environment Research Institute - ARC - SWERIen_US
cg.contributor.centerJulius Kühn-Institut - JKI (Germany)en_US
cg.contributor.centerAgricultural Research Center, Field Crops Research Institute - ARC - FCRIen_US
cg.contributor.crpResilient Agrifood Systems - RAFSen_US
cg.contributor.funderNot Applicableen_US
cg.contributor.initiativeExcellence in Agronomyen_US
cg.contributor.project-lead-instituteInternational Center for Agricultural Research in the Dry Areas - ICARDAen_US
cg.coverage.countryEGen_US
cg.coverage.regionNorthern Africaen_US
cg.creator.idGovind, Ajit: 0000-0002-0656-0004en_US
cg.identifier.doihttps://dx.doi.org/10.3390/w14223647en_US
cg.isijournalISI Journalen_US
cg.issn2073-4441en_US
cg.issue22en_US
cg.journalWATERen_US
cg.subject.actionAreaResilient Agrifood Systemsen_US
cg.subject.agrovocwater useen_US
cg.subject.impactAreaClimate adaptation and mitigationen_US
cg.subject.impactAreaNutrition, health and food securityen_US
cg.subject.impactAreaPoverty reduction, livelihoods and jobsen_US
cg.subject.impactAreaGender equality, youth and social inclusionen_US
cg.subject.sdgSDG 1 - No povertyen_US
cg.subject.sdgSDG 2 - Zero hungeren_US
cg.subject.sdgSDG 8 - Decent work and economic growthen_US
cg.subject.sdgSDG 13 - Climate actionen_US
cg.subject.sdgSDG 15 - Life on landen_US
cg.volume14en_US
dc.contributorGovind, Ajiten_US
dc.contributorQureshi, Asad Sarwaren_US
dc.contributorFeike, Tilen_US
dc.contributorRizk, Mosa Sayeden_US
dc.contributorShabana, Mahmoud Mohamed Abd ElHayen_US
dc.contributorKheir, Ahmed M.S.en_US
dc.creatorAttia, Ahmeden_US
dc.date.accessioned2023-03-29T19:41:24Z
dc.date.available2023-03-29T19:41:24Z
dc.description.abstractCrop yield prediction is critical for investigating the yield gap and potential adaptations to environmental and management factors in arid regions. Crop models (CMs) are powerful tools for predicting yield and water use, but they still have some limitations and uncertainties; therefore, combining them with machine learning algorithms (MLs) could improve predictions and reduce uncertainty. To that end, the DSSAT-CERES-maize model was calibrated in one location and validated in others across Egypt with varying agro-climatic zones. Following that, the dynamic model (CERES-Maize) was used for long-term simulation (1990–2020) of maize grain yield (GY) and evapotranspiration (ET) under a wide range of management and environmental factors. Detailed outputs from three growing seasons of field experiments in Egypt, as well as CERES-maize outputs, were used to train and test six machine learning algorithms (linear regression, ridge regression, lasso regression, K-nearest neighbors, random forest, and XGBoost), resulting in more than 1.5 million simulated yield and evapotranspiration scenarios. Seven warming years (i.e., 1991, 1998, 2002, 2005, 2010, 2013, and 2020) were chosen from a 31-year dataset to test MLs, while the remaining 23 years were used to train the models. The Ensemble model (super learner) and XGBoost outperform other models in predicting GY and ET for maize, as evidenced by R2 values greater than 0.82 and RRMSE less than 9%. The broad range of management practices, when averaged across all locations and 31 years of simulation, not only reduced the hazard impact of environmental factors but also increased GY and reduced ET. Moving beyond prediction and interpreting the outputs from Lasso and XGBoost, and using global and local SHAP values, we found that the most important features for predicting GY and ET are maximum temperatures, minimum temperature, available water content, soil organic carbon, irrigation, cultivars, soil texture, solar radiation, and planting date. Determining the most important features is critical for assisting farmers and agronomists in prioritizing such features over other factors in order to increase yield and resource efficiency values. The combination of CMs and ML algorithms is a powerful tool for predicting yield and water use in arid regions, which are particularly vulnerable to climate change and water scarcity.en_US
dc.formatPDFen_US
dc.identifierhttps://mel.cgiar.org/reporting/downloadmelspace/hash/f4251d341be9cbae5c081e1ac25d5366en_US
dc.identifier.citationAhmed Attia, Ajit Govind, Asad Sarwar Qureshi, Til Feike, Mosa Sayed Rizk, Mahmoud Mohamed Abd ElHay Shabana, Ahmed M. S. Kheir. (12/11/2022). Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments. WATER, 14 (22).en_US
dc.identifier.statusOpen accessen_US
dc.identifier.urihttps://hdl.handle.net/20.500.11766/68237
dc.languageenen_US
dc.publisherMDPIen_US
dc.rightsCC-BY-4.0en_US
dc.sourceWATER;14,(2022)en_US
dc.subjectrandom foresten_US
dc.subjectdssat modelsen_US
dc.subjectxgboosten_US
dc.subjectsuper learneren_US
dc.subjectlasso regressionen_US
dc.subjecthyperparameters tuningen_US
dc.subjectfeature importanceen_US
dc.titleCoupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environmentsen_US
dc.typeJournal Articleen_US
dcterms.available2022-11-12en_US
mel.impact-factor3.530en_US

Files