摘要

Accurate runoff forecasting plays a vital role in issuing timely flood warnings. Whereas, previous research has primarily focused on historical runoff and precipitation variability while disregarding other factors' influence. Additionally, the prediction process of most machine learning models is opaque, resulting in low interpretability of model predictions. Hence, this study develops an ensemble deep learning model to forecast runoff from three hydrological stations. Initially, time-varying filtered based empirical mode decomposition is employed to decompose the runoff series into several internal mode functions (IMFs). Subsequently, the complexity of each IMF component is evaluated by the multi-scale permutation entropy, and the IMFs are classified into high- and low-frequency portions based on entropy values. Considering the high-frequency portions still exhibit great volatility, robust local mean decomposition is adopted to perform secondary decomposition of the high-frequency portions. Then, the meteorological variables processed by the Relief algorithm and variance inflation factor features are employed as inputs, the individual subsequences of secondary and preliminary decomposition as outputs to the bidirectional gated recurrent unit and extreme learning machine models. Random forests (RF) are introduced to nonlinear ensemble the individual predicted sub-models to obtain the final prediction results. The proposed model outperforms other models in various evaluation metrics. Meanwhile, due to the opaque nature of machine learning models, shapley is employed to assess the contribution of each selected meteorological variable to the long-term trend of runoff. The proposed model could serve as an essential reference for precise flood prediction and timely warning. @@@ The volatility of a river's runoff has increased in recent years due to various factors, including extreme rainfall and storms. This has further intensified the frequency and severity of flooding disasters. To address this issue, our study has developed an ensemble deep learning model that combines data from multiple sources to accurately predict river runoff. This model not only enhances the interpretability of the data, but also provides a crucial reference for effective flood forecasting and early warning systems. Model concerns include: meteorological variables affecting runoff are feature-selected and incorporated into the model, so that the inclusion of multi-source information makes the model richer; historical data are secondarily decomposed and differentiated into high-frequency and low-frequency components to mine more informative features; high and low-frequency components are predicted separately using the applicable model and ensembled in a non-linear way, so that the prediction results are more closed to the reality; explainable artificial intelligence is applied to mine important hydrological elements that affect runoff. @@@ The proposed ensemble deep learning model achieves high prediction accuracy for runoff from multiple stationsClassify internal mode functions (IMFs) into high-frequency and low-frequency components and use secondary mode decomposition to extract more informative featuresRelief and variance inflation factor (VIF) as feature selection methods to extract key meteorological factors and interpretable analyses by shapley

  • 单位
    中国科学院