TY - JOUR
T1 - Comparison and integration of hydrological models and machine learning models in global monthly streamflow simulation
AU - Zhang, Jiawen
AU - Kong, Dongdong
AU - Li, Jianfeng
AU - Qiu, Jianxiu
AU - Zhang, Yongqiang
AU - Gu, Xihui
AU - Guo, Meiyu
N1 - Funding Information:
This study was supported by the National Key R&D Program of China (2022YFC3002804 and 2023YFE0103900), National Natural Science Foundation of China (42071055 & 42101052), Key research project of Hubei Hydrology and Water Resources Center (Grant: HBSWKY202406), the Research Grants Council of the Hong Kong Special Administrative Region, China (CUHK12301220 and RFS2223-2H02), Open Fund of National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan 430074, China (Grant No. 2022KFJJ05), and the Natural Science Foundation of Wuhan (Grant 2024040801020275).
Publisher Copyright:
© 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
PY - 2025/4
Y1 - 2025/4
N2 - With the advancement of machine learning technology, machine learning models (MLMs) are progressively emerging as a pivotal branch in streamflow simulation. However, compared to traditional hydrological models (HMs), a comprehensive understanding of their strengths and applicability across diverse climatic regions worldwide remains elusive. This study compares the performance of four widely used lumped HMs (GR2M, XAJ, SAC, and Alpine) with four prevalent MLMs (RF, GBDT, DNN, and CNN) across 16,218 catchments worldwide. Results show that MLMs can't always surpass HMs. Although the percentage of qualified-level accuracy (Kling–Gupta efficiency, KGE ≥ 0.2) is higher in the MLMs group, the HMs have a higher percentage of good-level accuracy (KGE > 0.6). HMs outperform MLMs in Southeastern North America, Western Europe, and most regions of the Southern Hemisphere. To combine the merits of HMs and MLMs, the performance of seven multi-model weighting ensemble methods (MWEs) is evaluated. The optimal MWE is employed to unify the results of four HMs and four MLMs, further improving the simulation accuracy. The MWEs improve the simulation accuracy effectively and the Inverse Rank Prediction Combination (InvW) is the best-performing MWE, which elevates the percentage of qualified accuracy by 13 % and 6 % for the best-performing HM and MLM, respectively. Despite the improvements with InvW, correlation analyses of KGE with catchment humidity index (HI), mean temperature (Tair), and leaf area index (LAI) reveal that simulating streamflow in dry, cold, and lower vegetated areas remains challenging (RHI = 0.36, RTair = 0.1, and RLAI = 0.19). Additionally, the precision of streamflow simulations diminishes in areas heavily impacted by human activities, primarily due to hydraulic engineering and water withdrawals. Our study systematically evaluates the performance of different HMs and MLMs in different regions, proposes a framework to combine the merits of HMs and MLMs, and sheds light on the constraining factors of accurate streamflow simulation.
AB - With the advancement of machine learning technology, machine learning models (MLMs) are progressively emerging as a pivotal branch in streamflow simulation. However, compared to traditional hydrological models (HMs), a comprehensive understanding of their strengths and applicability across diverse climatic regions worldwide remains elusive. This study compares the performance of four widely used lumped HMs (GR2M, XAJ, SAC, and Alpine) with four prevalent MLMs (RF, GBDT, DNN, and CNN) across 16,218 catchments worldwide. Results show that MLMs can't always surpass HMs. Although the percentage of qualified-level accuracy (Kling–Gupta efficiency, KGE ≥ 0.2) is higher in the MLMs group, the HMs have a higher percentage of good-level accuracy (KGE > 0.6). HMs outperform MLMs in Southeastern North America, Western Europe, and most regions of the Southern Hemisphere. To combine the merits of HMs and MLMs, the performance of seven multi-model weighting ensemble methods (MWEs) is evaluated. The optimal MWE is employed to unify the results of four HMs and four MLMs, further improving the simulation accuracy. The MWEs improve the simulation accuracy effectively and the Inverse Rank Prediction Combination (InvW) is the best-performing MWE, which elevates the percentage of qualified accuracy by 13 % and 6 % for the best-performing HM and MLM, respectively. Despite the improvements with InvW, correlation analyses of KGE with catchment humidity index (HI), mean temperature (Tair), and leaf area index (LAI) reveal that simulating streamflow in dry, cold, and lower vegetated areas remains challenging (RHI = 0.36, RTair = 0.1, and RLAI = 0.19). Additionally, the precision of streamflow simulations diminishes in areas heavily impacted by human activities, primarily due to hydraulic engineering and water withdrawals. Our study systematically evaluates the performance of different HMs and MLMs in different regions, proposes a framework to combine the merits of HMs and MLMs, and sheds light on the constraining factors of accurate streamflow simulation.
KW - Hydrological model
KW - Machine learning
KW - Multi-model weighting ensemble
KW - Streamflow simulation
UR - http://www.scopus.com/inward/record.url?scp=85212403770&partnerID=8YFLogxK
U2 - 10.1016/j.jhydrol.2024.132549
DO - 10.1016/j.jhydrol.2024.132549
M3 - Journal article
AN - SCOPUS:85212403770
SN - 0022-1694
VL - 650
JO - Journal of Hydrology
JF - Journal of Hydrology
M1 - 132549
ER -