A hybrid technique to enhance the rainfall-runoff prediction of physical and data-driven model: a case study of Upper Narmada River Sub-basin, India

Kumar, Sachin; Choudhary, Mahendra Kumar; Thomas, T.

doi:10.1038/s41598-024-77655-5

Download PDF

Article
Open access
Published: 01 November 2024

A hybrid technique to enhance the rainfall-runoff prediction of physical and data-driven model: a case study of Upper Narmada River Sub-basin, India

Sachin Kumar¹,
Mahendra Kumar Choudhary¹ &
T. Thomas²

Scientific Reports volume 14, Article number: 26263 (2024) Cite this article

2721 Accesses
4 Citations
Metrics details

Subjects

Abstract

Accurate streamflow prediction is crucial for effective water resource management and planning. This study aims to enhance streamflow simulation accuracy in the data-scarce Upper Narmada River Basin (UNB) by proposing a novel hybrid approach, ANN_Hybrid, which combines a physically-based model (WEAP) with a data-driven model (ANN). The WEAP model was calibrated and validated using observed streamflow data, while the ANN model was trained and tested using meteorological variables and simulated streamflow. The ANN_Hybrid model integrates simulated flow from both WEAP and ANN to improve prediction accuracy. The results demonstrate that the ANN_Hybrid model outperforms the standalone WEAP and ANN models, with higher NSE values of 95.5% and 92.3% during training and testing periods, respectively, along with an impressive R² value of 0.96. The improved streamflow predictions can support better decision-making related to water allocation, reservoir operations, and flood and drought risk assessment. The novelty of this research lies in the development of the ANN_Hybrid model, which leverages the strengths of both physically-based and data-driven approaches to enhance streamflow simulation accuracy in data-limited regions. The proposed methodology offers a promising tool for sustainable water management strategies in the UNB and other similar catchments.

Development of a new hybrid model to enhance streamflow estimation using artificial neural network and reptile search algorithm

Article Open access 19 February 2025

Hydrological model-based streamflow reconstruction for Indian sub-continental river basins, 1951–2021

Article Open access 18 October 2023

Restructuring and serving web-accessible streamflow data from the NOAA National Water Model historic simulations

Article Open access 20 October 2023

Introduction

The hydrological processes have several interconnected elements, with runoff linking precipitation to water bodies. surface runoff refers to water movement across the soil surface and into nearby surface waters, such as streams, dams, lakes, or other reservoirs, without infiltrating into the ground. The variability of surface runoff is dependent on both temporal and spatial factors. Approximately 33% of the precipitation that reaches the field is converted into runoff, while the remaining 67% is either evaporated, infiltrated, or retained within the soil. According to the Central Water Commission (CWC), the total availability of water on the Earth is fixed, i.e., 1378.72 Mkm³ out of which 97% of water is in the oceans and 0.4 to 0.8% is trapped in the glaciers; only 2.2 to 2.6% of water is available as fresh water. However, the population of India is 17.85% of the world’s population, and it covers approximately 2% land area of the world. In contrast, it has only 4% of available water worldwide. Hence reliable streamflow prediction is significant for better water resources planning and management. The best model is one that describes a real-world system¹. According to², various hydrological models are classified as empirical, conceptual, physical-based, lumped, semi-distributed or distributed models. A runoff model comprises a series of equations that are utilized to estimate the runoff based on various parameters that define the characteristics of a watershed.

Several hydrological models are used to simulate rainfall-runoff processes, including SWAT, HEC-HMS, VIC, and WEAP, which are the usual models used by hydrological modellers across the world. For example, the potential of both surface water and groundwater to fulfil the demands of water for different users and the ecological reserve is assessed by implementing the WEAP model in a sub-basin of Steelpoort River, specifically called the Olifants River basin³. WEAP has also been considered to be a Decision Support System for water management planning. The study showed the importance of developing the Decision Support System and how the watershed system can use the developed DSS to enhance the flow dynamics in the basin management using the WEAP model⁴. The soil moisture method used in the WEAP model was employed to model the rainfall-runoff process while it modelled the hydrological system of the basin of Rio Conchos. The method showed that could be used to simulate the dynamics of the basin flow to assess the impacts of climate variation under several carbon dioxide emission scenarios⁵. WEAP model was also used to evaluate water supply and demand in the Langat watershed area. Due to the water balance in the catchment area, a new technique of inventory to match the changing availability of water due to pressure was developed. The result indicated that the analysis area was expected to face water shortage because of the increase in population and temperature rise above the current situation⁶. The use of WEAP will enhance synchronization between the hydrological input of the area and the water management system infrastructure. This system decides on the distribution of the available water resources to meet the varying water demands. Based on the findings, it can be inferred that constructing a hydroelectric dam on the Niger River would be advantageous as it would aid in regulating water flow and mitigate the issue of low water levels. Moreover, the dam’s construction will facilitate the acquisition of sufficient potable water for the prominent cities of Niamey and Tillabéry⁷.

In further study, WEAP model deployed and tested several unilateral and multilateral adaptation methods considering a socioeconomic and climatic change to analyse transboundary water resource management. WEAP has assessed agricultural and water policies in the Jordan River watershed to maintain rural livelihoods and safeguard freshwater resources⁸. WEAP model is used to forecast rainfall and temperature at a smaller spatial and temporal level to model the effect of climate change on water resources and provide results for water administrators and decision-makers. Climate change, he concluded, would have a direct impact on water sources and the demand for water in terms of uses such as urban and rural use could also be more disrupted. The capabilities of the WEAP21 water resource management model were assessed and tested when used in the Parameterization Simulation Optimisation system by the EGO method. The outcomes indicate that the WEAP21-PSO system has been successful, and EGO has the capacity and expertise to solve computationally difficult problems and yield the best solution according to⁹. WEAP was used for the quantitative predictions of the water supply and demand deficit in Chennai.” Furthermore, the impact of various potential long-term gradual water availability improvements on the water supply-demand failure was predicted using WEAP for the outcomes analysis. The data between 2009 and 2015 was used to analyze and validate the model, while the model calibration data were from 1991 to 2008. The results revealed that the system’s productivity increased with the construction of a second desalination plant, by harvesting water from the same reservoir and recycling wastewater¹⁰. The WEAP model for the Ur-River catchment in Maharashtra state was adapted, connected to IWRM, and employed as a simulation method to conduct the scenario method and assess the supply and demand of water. As findings, scenario methodology WEAP-MABIA could be performed efficiently, and many more suggested¹¹. Water source access: testing the WEAP model by analyzing hydrological elements and calculating the Pradesh catchment’s water balance was essential for the accessibility of water resources, as is the case in the Chongwe River. The Actual Evapotranspiration is increasing by 0.03 Mm³/year, Potential Evapotranspiration decreasing by 0.24 Mm³/year, the annual stream flow is increasing by 0.13 Mm³/year, and precipitation p is decreasing by 0.12 Mm³/year. The WEAP process simulation results were statistically assessed by computing the coefficient of determination and the Nash-Sutcliffe function effectiveness coefficient. The process simulation results showed a good correlation with an R² of 0.97 and an NSE of 0.64¹².

Data-driven models are inspired by the information processing capabilities of a biological nervous system with available large data sets, such as the brain. Developing soft computing techniques like artificial neural networks (ANNs), support vector systems, and the adaptive neuro-fuzzy inference system to model stream-flows using the available data has also benefited from science and computer technology advances. It has been observed that among these soft computing tools, ANNs are increasingly in demand and used for various applications, such as streamflow projections and estimates of future hydropower generation, to comprehend links between future climate and crop yield. Although the ANN approach has been used in streamflow forecasting globally, the UNB has not received significant attention for this technique. Because the ANN models can describe linear and nonlinear systems without any assumptions, unlike most traditional statistical approaches, they are increasingly being employed in numerous science and engineering fields. ANNs were first developed in the 1940s by¹³. ANNs have been successfully utilized for the rainfall-runoff process¹⁴, to predict river flow in specific hydrologic issues¹⁵, for the characterization of soil pollution, and the prediction of water-quality parameters. Additionally, ANNs are used to anticipate rainfall, runoff, and evaporation, for river-flow time-series prediction and flood-disaster prediction. A multilayer feed-forward backpropagation method is employed in various hydrological applications. It typically consists of many interconnected nodes organized into an input layer, an output layer, and one or more hidden layers. The sigmoid function was chosen as the transfer function for the network. ANNs are a subclass of machine learning that has attracted a lot of attention in the context of estimating problems. Data processing devices called ANNs imitate the functions of the human brain¹⁶. Machine learning approaches’ popularity is growing due to their use, simplicity, and effectiveness. With little data and a complex process, machine-learning techniques are a great option¹⁷. In the last few decades, ANN models have been widely used in managing watersheds, water resources, and hydrology. Three layers make up the ANN: the input layer, the hidden layer, and the output layer. The interaction between the neurons in the subsequent levels determines how much communication there is¹⁸. ANN models are black-box models¹⁹. Applying ANNs in creating models results in trustworthy and versatile learning ability, making ANNs promising for forecasting²⁰.

Although hydrological models, such as the WEAP and ANN, have been generally used in the modelling of the rainfall-runoff process, their limitation is associated with inefficient performance. Like, WEAP is used to simulate various complex water systems and account for hydrological processes. Furthermore, the use of the ANN model is due to its capability to generate data-driven models to establish nonlinear and complex patterns. However, there is limited existing literature on the development and application of the hybrid model that integrates the strengths of both the conceptual understanding of the hydrological process in WEAP and the literature-based learning abilities of the ANN model in the rainfall-runoff modelling. The use of physical hydrological models combined with machine learning techniques has emerged as a novel and effective approach to improving the accuracy and validity of rainfall-runoff simulations. This hybrid methodology combines the advantages of two different models: while physical hydrological models account for the understanding of underlying processes, machine learning is used to amend simulations by reconciling them with the observed data. Hybrid models have proven to be useful in alpine regions where both hydropower plants and glacial melt contribute to streamflow patterns²¹. combined the Soil and Water Assessment Tool (SWAT), with the Support Vector Machines (SVM) to model hydropeaking in the absence of the reservoir operation rules following the rigid schedules. The approach was very effective as the simulation error was decreased by tens of per cent. Another model combining the HBV hydrological model with a Bayesian neural network was employed by²² to improve monthly streamflow forecasting for the Yarkant River and account for the precipitation, snow, and glacier melting in the region. The HBNN method yielded very good results for the high flows²³. used the approach to hybridize the WEAP, and the GR2M with the ANN. Their approach was particularly interesting as the output from the two models was further used as an input for the new ANN, which produced excellent results and brought NSE to 0.99 values from 0.64 to 0.88 of the initial models. The availability of several models that have been hybridized using various machine learning techniques and physical hydrological models is a further confirmation of their effectiveness. The LSTM-XAJ from²⁴ was also proved to be extremely effective in flood forecasting using the multiple basins and the multi-step-ahead approach²⁵. provided an example of the further development of a nested hybrid model that accounted both for the physical processes and for the non-linear patterns in conventional data. Recent advances in the SWAT, LSTM, and RF model by²⁶ proved to be more effective and valid than the conventional models. Comparative studies of machine learning approaches and conventional hydrological models have also indicated that hybrid models should be developed. Further²⁷, indicated that LSTM surpasses the machine learning models and also the SWAT models in the glaciated region of the Tianshan Mountains. Moreover, other studies by^28,29,30 have demonstrated that AI and machine learning models can bring similar or even better results than physical process-driven models in case of some limitations of the data. All of the above findings indicate that, rather than replacing, the machine learning hydrological models could be combined with the process-driven ones.

The hybrid models are also promising, as they could be employed not only in streamflow forecasting but also in parameter regionalization and uncertainty quantification³¹. For instance³², have demonstrated that SWAT model performance could be enhanced using machine learning models and, in particular, the approach of so parameter estimation through machine learning leading to significant improvements in the model calibration. Another method developed by³³ used decision tree algorithms and the limits of acceptability (LOA), to narrow the parameter range for more accurate and less complex hydrological predictions. The emerging models go even further –³⁴ have introduced a model that hybridized the conventional process-based models with CNN and LSTM³⁵. went even further by introducing super ensemble deep learning models that are even more effective than conventional conceptual hydrological models. The effectiveness of hybrid models in complex regions and data-constrained regions with complex hydrological process were proven from different approaches by^36,37,38.

Additional to it, recent studies have emphasized the importance of spatial pattern analysis and hydrological modeling in understanding precipitation variability^39,40. employed data-driven models and clustering techniques to simulate water levels and characterize spatiotemporal properties of precipitation, respectively⁴¹. analyzed trends and variability in monthly rainfall, while⁴² investigated the synoptic conditions leading to floods and the relationship between ENSO and streamflow. These studies highlight the significance of understanding spatial and temporal patterns of precipitation and the meteorological aspects of flooding for effective water resource management. Similarly, few more studies have highlighted the growing importance of machine learning techniques, particularly hybrid and ensemble approaches, in hydrological modeling. These methods have been successfully applied to various problems, such as suspended sediment load prediction⁴³, groundwater level prediction⁴⁴, rainfall-runoff modeling^44,45, daily flow discharge prediction⁴⁶, and precipitation prediction⁴⁷. These studies underscore the significance of exploring novel methodologies to enhance the accuracy and robustness of hydrological predictions, aligning with the objectives of the present study.

Despite the extensive use of hydrological models like WEAP and ANN in rainfall-runoff modeling, their performance is often limited in data-scarce catchments^48,49. While some studies have explored combining physical and data-driven models to improve streamflow predictions⁵⁰, the potential of such hybrid approaches in data-limited regions remains largely unexplored.

To address this gap, we propose a novel hybrid technique, ANN_Hybrid, that integrates simulated flow from both a physically-based distributed hydrological model (WEAP) and a data-driven model (ANN) to enhance streamflow prediction accuracy in data-scarce catchments. The effectiveness of the proposed approach is evaluated against standalone WEAP and ANN models in the UNB, which lacks long-term hydrological observations. The main contributions of this study are: (1) demonstrating the effectiveness of combining a physically-based hydrological model (WEAP) with machine learning techniques (ANN) to enhance streamflow prediction accuracy in data-scarce catchments; (2) proposing a novel hybrid technique, ANN_Hybrid, that integrates simulated flow from both physical and data-driven models to further improve the performance of streamflow simulation; and (3) evaluating the performance of the proposed hybrid approach against individual WEAP and ANN models in the UNB. The findings provide valuable insights for developing accurate and efficient streamflow prediction models in data-limited regions, which is crucial for effective water resource management and planning.

Study area

The study area of this study is UNB. Narmada River is a west-flowing river in Central India and the 5th largest river in the sub-continent of India. It is the third-longest river which flows totally within India, after Krishna and Godavari. It flows through the rift valley between the Satpura and Vindhyan ranges. The river originated from the Amarkantak Plateau of Maikala, located in the Shahdol district of Madhya Pradesh. The source coordinates are 22°40’ North latitude and 81°45’ East longitude, with an altitude of 1057 m above the mean sea level. The map depicted in Fig. 1 displays the study area spanning from the source of the Narmada River to the Manot G/D site, which includes an area of approximately 4765 square kilometres.

The UNB was chosen for this research due to its importance in the region and the need for accurate streamflow predictions to support water resource management. The sub-basin is characterized by diverse hydrological conditions and land use patterns, making it an ideal site for testing the performance of the proposed hybrid modeling technique that integrates the physical WEAP model with data-driven approaches like ANN and a categorization approach. Furthermore, the UNB serves as a representative case study for demonstrating the applicability and effectiveness of the proposed ANN_Hybrid modeling technique. The findings from this research can be extended to other river basins facing similar challenges, highlighting the potential of hybrid modeling approaches in enhancing rainfall-runoff predictions and supporting sustainable water resource management⁵¹.

The study area is divided into five parts according to the land type shown in Table 1. A significant portion of the basin’s precipitation occurs during the southwest monsoon and accounts for about 85–95% of the total rainfall. The relative humidity in the basin varies between 92% and 27% in the morning and between 88% and 15% in the evening, depending on the season. This basin consists mainly of black soil. Black soils have high water-holding capacity, and the organic matter is generally less than 5% in black soils.

Table 1 Classification of the study area based on land use.

Full size table

Methods

Figure 2 shows the detailed flowchart of methodology used for present study.

Data used

The model’s primary input data requirements include meteorological and discharge data for catchment parameter definition, initial condition determination, and model calibration and validation. Meteorological data include precipitation, wind speed, temperature, and humidity. The data used in this study were obtained from reliable and authorized sources, ensuring their integrity and quality. The meteorological data, including precipitation, temperature, relative humidity, and wind speed, were collected from the Indian Meteorological Department (IMD). The observed streamflow data used for model calibration and validation were obtained from the Central Water Commission (CWC) of India. All data were used solely for research purposes, and no personal or sensitive information was involved in the study. The investigation has complied with the data usage policies and guidelines set by the respective organizations and have duly acknowledged the data sources.

Rainfall

The daily gridded precipitation at a resolution of 0.25°x0.25° (IMD) has been used in the study area. The Thiessen Polygon method was used to compute the daily aerial average rainfall over the study area. The daily rainfall data of 8 grid points spread over the catchment for calculating the average daily rainfall. The ___location of all rain grids over the catchment area and the weightage for those grids are shown in Table 2. Figure 3(b) show the distribution of observed monthly rainfall in UNB for calibration and validation period.

Wind speed & humidity

The humidity and wind speed characteristics of the UNB play an important role in determining local climate and influence various hydrological and ecological processes. Seasonal fluctuations are observed in the wind speed, with higher velocities during summer and comparatively slow winds during winter. The humidity levels also exhibit seasonal fluctuations, with higher values during the monsoon season and lower values during summer. These variables significantly influence the rates of evaporation, water availability, and the survival of flora and fauna. They are crucial factors when implementing hydrological studies in any river basin or sub-basin.

Temperature (maximum & minimum)

There are noticeable seasonal temperature variations in the UNB. Maximum temperatures during the summer vary from 18 °C to 45 °C, while minimum temperature falls between 5 °C and 30 °C. Figure 3(a) shows the variations of daily maximum and minimum recorded temperature (IMD) for UNB. The hydrological cycle may be significantly impacted by these high temperatures, including increased evaporation rates and water scarcity. Also, the ecological processes impacted by the low temperature include plant growth, soil moisture levels, and aquatic organisms’ patterns of survival and reproduction.

Table 2 Details of the influencing rainfall grides.

Full size table

Runoff

The runoff at the stream gauge station located at Manot (CWC) has been used to calibrate/train and validate/test the WEAP/ANN model respectively. The daily runoff from January 2000 to December 2013 was available and used in the present study. The runoff coefficient for the calibration and validation period is given in Table 3, and Fig. 4 shows monthly flow distribution in monsoon and non-monsoon months for calibration and validation period.

Table 3 Details of the rainfall and runoff data for the calibration and validation period.

Full size table

Water Evaluation and Planning (WEAP) model

The WEAP model was developed by the Stockholm Environment Institute (SEI). The system is a critical simulation tool with multiple reservoirs and purposes, which selects the optimal water distribution approach by prioritizing supply and demand aspects. The programme provides a user-friendly and adaptable method for planning and policy creation. Using an integrated method, natural inflows, and artificial components, such as water reservoirs and groundwater pumps, are both simulated by WEAP. This provides a realistic perspective on the issues that are addressed in managing current and future use of water resources. This model enables the prediction of the entire system’s effects and facilitates the analysis of diverse water development and management options (SEI, 2007). WEAP employs a simple collection of model devices and procedures to examine water managers’ various problems and uncertainties, such as the environment, watershed conditions, anticipated demand, ecosystem needs, regulatory environment, functional priorities, and infrastructure⁵².

There are nine parameters of WEAP, such as Soil-Water-Capacity (SWC); Root-Zone-Conductivity (RZC); Runoff-Resistance-Factor (RRF); Preferred-Flow-Direction (PFD); Deep-Water-Capacity (DWC); Deep-Conductivity (DC); Crop Coefficient (Kc); Initial-$\:{\text{Z}}_{1}$; and Initial-$\:{\text{Z}}_{2}$. The SWC is the adequate water-holding capacity of the upper soil layer given in mm. The SWC varies according to the type of land class. Moreover, RZC is the saturated hydraulic conductivity of the topsoil layer and varies according to the different land classes. Furthermore, RRF controls surface runoff and the increase in the RRF values will increase the resistance to flow and reduce runoff. In addition, PFD is a parameter that is used to partition the flow between the topsoil layer (surface runoff) and flow to the lower soil layer (baseflow) and PFD varies depending on the soil type in the catchment. Moreover, DWC depicts the adequate water-holding capacity of the lower soil layer and is given in mm which is uniform across all types of land classes. DC is the saturated hydraulic conductivity of the bottom soil layer and does not vary according to the land classes that are responsible for controlling the transmission of baseflow. Along with all the parameters, Initial-$\:{\text{Z}}_{1}$ relative storage and Initial-$\:{\text{Z}}_{2}$ relative storage are expressed as a percentage of the total adequate storage of the topsoil water capacity at the beginning and the total adequate storage of the bottom layer of soil respectively.

Artificial-Neural-Network (ANN) model

ANN has been successfully applied in a wide range of domains such as the classification of data; data mining; speech recognition; time series analysis etc. The ANN models are effective forecasting tools for the relationship between runoff variables and rainfall. The findings will aid in decision-making on managing and planning water resources. Additionally, they help managers and urban planners take the required steps to prepare for negative projections. As a result, it helps in preventing risks to human health and the environment that floods are expected to cause.

ANN is a set of connected inputs and output layers in which each connection has associated some weights. The mathematical computing program MATLAB was used to model the relationship between rainfall and runoff. The neural network was modelled using one hidden layer along with the input and output layers. Meteorological data such as daily rainfall, wind speed, temperature, humidity and one day before runoff were provided as input data, while hydrological data and observed runoff were fed as target data. During the learning phase, ANN learns by adjusting the weights to be able to predict the correct output according to input data. One of the three training methods in ANN, Levenberg-Marquardt (LM) was used to train the created ANN network. The training phase of ANN consists of three steps i.e., Initialize the weights for different inputs; propagate the input forward; and backpropagate the error. In the study sigmoid function was used as an activation function/transfer function which given as

$$\mathrm f\left(\mathrm{Sum}\right)=\frac1{1+\mathrm e^{-\mathrm{Sum}}}$$

(1)

$$\mathrm{Sum}=\sum\:_{\mathrm i=1}^{\mathrm n}{\mathrm I}_{\mathrm i}{\mathrm W}_{\mathrm i}+\mathrm b$$

(2)

where, I_i = input variable; Wi = Weights for connection of input layer to hidden layer; b = bias.

The ANN network used in the present study is shown in Fig. 5 and, it consists of 4 data in the input layer, namely (rainfall, temperature, one day before rainfall, and one day before runoff). The hidden layer consists of 5 sets of hidden neurons, i.e., 4, 5, 6, 7, and 8. The output layer comprises observed runoff.

The use of the ANN model in this study is justified by its proven effectiveness in capturing complex, nonlinear relationships between hydrological variables and streamflow^53,54. ANNs have been widely applied in rainfall-runoff modeling due to their ability to learn from data and adapt to varying hydrological conditions⁵⁵. Although ANNs have been used in numerous studies, their application in hybrid models, particularly in combination with physically-based models like WEAP, has been less explored. The novelty of this study lies in the development of the ANN_Hybrid model, which leverages the strengths of both the physically-based WEAP model and the data-driven ANN model to improve streamflow predictions in data-scarce regions.

The choice of the ANN model is further supported by its flexibility in incorporating various input variables, its ability to handle missing or incomplete data, and its computational efficiency compared to other ML techniques⁵⁶. Additionally, the use of a well-established ML model allows for a more focused evaluation of the proposed hybrid approach, as the emphasis is on the integration of the physical and data-driven models rather than on the development of a new ML technique. By demonstrating the effectiveness of the ANN_Hybrid model in the UNB, this study highlights the potential of combining physically-based and data-driven models for improved streamflow predictions in data-limited regions. The findings provide a foundation for future research on hybrid modeling approaches and their application in water resource management.

Performance evaluation criteria

The performance evaluation is done based on statistical indices of performance evaluation and comparing the observed and simulated runoff.

Coefficient of determination (R²)

R² have been widely used for the performance evaluation of models. The value of R² ranges from 0 to 1, where a higher value of R² indicates less error variance, and typically values greater than 0.5 are considered acceptable for any model^25,27,38,57.

$$\mathrm R^2=\left[\frac{\mathrm n\left(\sum\mathrm Y^{\mathrm{obs}}\mathrm Y^{\mathrm{sim}}\right)-\left(\sum\mathrm Y^{\mathrm{obs}}\right)\left(\sum\mathrm Y^{\mathrm{sim}}\right)}{\sqrt{\left\{\mathrm n\sum\left(\mathrm Y^{\mathrm{obs}}\right)^2-\left(\sum\mathrm Y^{\mathrm{obs}}\right)^2\right\}\left\{\mathrm n\sum\left(\mathrm Y^{\mathrm{sim}}\right)^2-\left(\sum\mathrm Y^{\mathrm{sim}}\right)^2\right\}}}\right]^2$$

(3)

Y^obs is the observed runoff, Y^sim is the simulated runoff, and n is the total number of observations in the evaluated time series.

Nash Sutcliffe Efficiency (NSE)

NSE indicates how well the plot of observed data versus simulated data fits the 1:1 line. A higher value shows a better relationship between observed and simulated values^{21,24,27,38,57}.

$$\mathrm{NSE}=\left[\frac{\sum_{\mathrm i=1}^{\mathrm n}\left(\mathrm Y{}^{\mathrm{obs}}-\mathrm Y{}^{\mathrm{sim}}\right)^2}{\sum_{\mathrm i=1}^{\mathrm n}\left(\mathrm Y{}^{\mathrm{obs}}-\mathrm Y_{\mathrm{mean}}^{\mathrm{obs}}\right)^2}\right]$$

(4)

where Y^mean is the mean of observed runoff.

Percentage Bias (%bias)

%bias refers to the percentage deviation of evaluated data. The measure of %bias determines the average inclination of the simulated data to either exceed or fall short of their observed. Positive values signify underestimation by the model, while negative values indicate overestimation^27,38,58.

$$\%\mathrm{bias}=\left[\frac{\sum_{\mathrm i=1}^{\mathrm n}\left(\mathrm Y_{\mathrm i}^{\mathrm{obs}}-\:\mathrm Y_{\mathrm i}^{\mathrm{sim}}\right)\ast\left(100\right)}{\sum\:_{\mathrm i=1}^{\mathrm n}{(\mathrm Y}_{\mathrm i}^{\mathrm{obs}})}\right]$$

(5)

RMSE-observations standard deviation ratio (RSR)

RSR ranges from a high positive value to the ideal value of zero. The performance of the model simulation increases with decreasing RSR²¹. RSR is calculated as

$$\mathrm{RSR}=\frac{\mathrm{RMSE}}{{\mathrm{STDEV}}_{\mathrm{obs}}}=\frac{\left[\sqrt{\sum_{\mathrm i=1}^{\mathrm n}\left(\mathrm Y_{\mathrm i}^{\mathrm{obs}}-\mathrm Y_{\mathrm i}^{\mathrm{sim}}\right)^2}\right]}{\left[\sqrt{\sum_{\mathrm i=1}^{\mathrm n}\left(\mathrm Y_{\mathrm i}^{\mathrm{obs}}-\mathrm Y^{\mathrm{mean}}\right)^2}\right]}$$

(6)

Ethical approval

This material has not been published in whole or in part elsewhere; The manuscript is not currently being considered for publication in another journal; All authors have been personally, and actively involved in substantive work leading to the manuscript and will hold themselves jointly and individually responsible for its content.

Results

Sensitivity analysis of WEAP

In the present study runoff data was set as output and the minimum & maximum temperature, rainfall, wind speed and relative humidity was set as input since they are the main causes which effects the runoff. Soil Moisture method in WEAP model was used for the prediction of streamflow. Where, Soil Moisture Method works in such a way that it automatically utilizes 9 key parameters related to soil characteristics, hydrological behavior, and initial conditions to simulate streamflow and other hydrological processes^59,60. These parameters include Crop Coefficient, Soil Water Capacity, Deep Water Capacity, Runoff Resistance Factor, Conductivity of Root Zone, Conductivity of Deep Zone, Preferred Flow Direction, Initial Z1 and Initial Z2^59,61. The sensitivity analysis of model was done by adjusting these parameters to identify the most sensitive parameters that have the greatest impact on model outputs⁵¹. Figure 6 shows the result of the sensitivity analysis of four sensitive parameters, i.e., SWC; RZC; RRF; and PFD based on NSE and SSE.

WEAP model calibration

The selection of appropriate parameter values is crucial for the accurate simulation of hydrological processes in the WEAP model. In this study, a sensitivity analysis was performed to identify the most influential parameters affecting the model’s performance⁵¹. The sensitive parameters identified were subjected to a manual calibration process, where their values were adjusted within physically plausible ranges to minimize the discrepancy between observed and simulated streamflow^12,62. The calibration was performed using a trial-and-error approach, to maximize the Nash-Sutcliffe Efficiency (NSE) and minimize the Sum of Squared Errors (SSE)⁶³. The final calibrated parameter values were selected based on their ability to provide the best match between observed and simulated streamflow, as evaluated by the performance metrics discussed in the “Performance Evaluation Criteria” section. This systematic approach to parameter sensitivity analysis and calibration ensures that the WEAP model is optimally configured to represent the hydrological characteristics of the UNB, thereby enhancing the reliability and accuracy of the simulated results. The model was calibrated for seven years, from 2000 to 2006. Table 4 shows the final model parameters values of the calibrated model, and Table 5 displays the performance of the model in the calibration period and performance indicates good values, i.e., NSE = 75%, % Bias = -4.4, RSR = 0.48 and R² = 0.754 which depicts good resemblance between observed and simulated runoff. Figure 7(a), (b), and (c) shows the comparison of the daily, average daily and annually observed and simulated runoff, respectively during the calibration period.

Table 4 Calibrated model parameters values.

Full size table

Table 5 Performance evaluation of the model during calibration.

Full size table

WEAP model validation

The model validation has been carried out to test the ability of the model to simulate the runoff with the independent rainfall data and for periods other than those used in the calibration process. The validation was carried out for six years, from 2008 to 2013. The calibrated model was run with the independent rainfall data from 2008 to 2013 using the calibrated model parameters to simulate runoff. The output of the simulations during the validation period was evaluated using the same statistical indices used to evaluate the model performance in Table 6. Figure 8(a), (b), and (c) shows the comparison of the daily, average daily annually observed and simulated runoff, respectively, during the validation period.

Table 6 Performance evaluation of the model during validation.

Full size table

As the result presented, during the validation of the WEAP model, the runoff was overestimated considerably because the runoff coefficient was significantly less during the validation period as compared to the calibration period, this means that the model parameters were calibrated to generate higher runoff due to this, it generated higher runoff during the validation period.

ANN model

Various combinations of input data sets and several hidden neurons in the hidden layer were used to develop different ANN models to perform rainfall-runoff modelling and simultaneously determine the values of performance evaluation metrics, such as R², NSE, %bias, and RSR for each ANN. A neural network with only one hidden layer having (2n + 1) hidden neurons, where ‘n’ is the number of input nodes, can represent any continuous function. The correct number of neurons to use in the hidden layers can be determined using different thumb rules, such as – (1) The number of nodes in the hidden layer should be between the size of the input and output layers. (2) Number of nodes in the hidden layer should be 2/3 of the total nodes in the input & output layer. (3) Number of nodes in the hidden layer should be less than two times of nodes in the input layer. These three guidelines give an idea about the number of hidden neurons in a hidden layer at the start. Still, it will ultimately come from trial and error when selecting an architecture for your neural network.

Further, many neural networks were created for different input data sets and numbers of neurons in the hidden layer and check the performance of each network, shown in Table 7. Among all input data sets, the combination of R_t; T_t; Q_t−1; & R_t−1 shows the best result. For this data set model with different numbers of hidden neurons developed i.e., 4-4-1, 4-5-1, 4-6-1, 4-7-1, & 4-8-1, out of these networks structure of 4-8-1 ANN shown in Fig. 9, gives the best simulation and shows good correlation between observed and simulated runoff. However, when the input data combination changed by adding other variables, the results showed negligible variation. The output of the simulations through ANN was evaluated using the same statistical indices used to evaluate the model performance for WEAP, shown in Table 8 for the training and testing periods respectively. Figures 10(a), (b), & (c) and 11(a), (b), & (c) shows the comparison of the daily, average daily, annually observed, and simulated runoff, during the training and testing period respectively.

Table 7 Comparisons of ANN Models with different input data sets.

Full size table

Table 8 Performance evaluation of the ANN model during the training and testing period.

Full size table

ANN-based hybrid technique

The hybrid approach is a complex technique adopted to improve the accuracy of flow forecasts. The four-time series, explored by the ANN models from the previous section, and the one-time series, explored by the WEAP model for a total of five-time series, are adopted as inputs to a novel ANN in these experiments. The time series of the ANN models, namely the (R_t, R_t−1, T_t, Q_t−1), are the input to the optimal number of neurons results. Table 9 shows the comparison of performance evaluation indices for WEAP, ANN, and ANN_Hybrid model, whereas the web plot (Fig. 12) presents a visual comparison of the performance metrics for the WEAP, ANN, and ANN_Hybrid models during the calibration and validation periods. The ANN_Hybrid model exhibits the highest values for R² and NSE and the lowest values for %Bias in both periods, indicating its superior performance compared to the standalone models. Further, the comparison of simulated daily runoff by WEAP, ANN and ANN_Hybrid models with the observed daily runoff during calibration/training and validation/testing period shown in Fig. 13. To further illustrate the performance of the models, box plots comparing the observed and simulated monthly flow distributions for monsoon months during the calibration and validation periods are presented in Fig. 14. The box plots demonstrate that the ANN_Hybrid model captures the monthly flow variability more accurately than the standalone WEAP and ANN models, particularly during the validation period, where the simulated flow distributions closely match the observed data.

Table 9 Comparison of performance of the WEAP and ANN model.

Full size table

Discussions

The hydrological simulation of rainfall-runoff processes in the UNB using the WEAP and ANN models, and the novel ANN_Hybrid approach, not only advances our understanding of water resource management in this specific region but also demonstrates the transferability of these techniques to other hydrological applications and geographic locations. The input variables used in this study, such as rainfall, temperature, humidity, wind speed, and previous day’s runoff, are commonly measured variables that influence hydrological processes in many river basins worldwide^51,62. By successfully integrating these variables into the modeling framework and improving the accuracy of streamflow predictions, our study showcases the versatility of the employed methods in capturing the complex interactions between atmospheric conditions, land surface characteristics, and runoff generation. Moreover, the ANN_Hybrid approach, which influences the strengths of both physically-based and data-driven models, offers a promising avenue for enhancing the performance of hydrological simulations in regions where data scarcity or process understanding may be limited^12,51. The transferability of this hybrid approach to other basins and its potential to incorporate additional variables, such as solar radiation or land use change, highlights its value as a tool for sustainable water resource management and climate change adaptation strategies^62,63. Thus, the novelty of our study lies not only in the specific application to the UNB but also in the demonstration of the broader applicability and impact of the employed modeling techniques for advancing hydrological research and water resource management practices in diverse settings.

The results achieved in this study have significant implications for streamflow prediction and water resource management in the UNB and similar data-scarce catchments. The proposed ANN_Hybrid model, which integrates simulated flow from both the physically-based WEAP model and the data-driven ANN model, demonstrates superior performance compared to the individual models. The hybrid approach leverages the strengths of both models, enabling more accurate and reliable streamflow simulations in the study area. Improved streamflow predictions are crucial for effective water resource planning, allocation, and management, especially in regions facing water scarcity and increasing water demands^53,64. The UNB particular, plays a vital role in water provisioning for agricultural, domestic, and industrial uses in the region¹¹. By providing more accurate streamflow estimates, the ANN_Hybrid model can support better decision-making related to water allocation, reservoir operations, and drought and flood risk assessment in the basin.

The present study demonstrates superior performance in streamflow simulation compared to similar studies conducted by^31,36,39,66. As evident from the quarter Taylor diagrams shown in Fig. 15, our proposed ANN_Hybrid model exhibits the highest Nash-Sutcliffe efficiency (NSE) values of 0.96 and 0.93 during the calibration and validation periods, respectively, along with an impressive R² value of 0.96. These metrics indicate that our model captures the observed streamflow patterns with exceptional accuracy and minimal residual variance. In comparison, the HEC-HMS-LSTM model developed by³¹ achieved lower NSE values of 0.93 and 0.90, with an R² of 0.92, suggesting that their model, while performing well, may not capture the streamflow dynamics as precisely as our ANN_Hybrid model. Similarly³⁶, GBHM-ANN-CA-CV model obtained NSE values of 0.89 and 0.88 and an R² of 0.94 during the same periods, indicating that their approach, although effective, may not provide the same level of accuracy as our proposed model. The wavelet-ANN (WANN) model by³⁹ showed NSE values of 0.91 and 0.89, with an R² of 0.91, which, while commendable, still falls short of the performance achieved by our ANN_Hybrid model. Lastly, the best-performing model by⁶⁶ reached NSE values of 0.78 and 0.91 for calibration and validation, respectively, with an R² of 0.92, suggesting that their approach may not consistently capture the streamflow dynamics across both periods as effectively as our model.

The superior performance of our ANN_Hybrid model can be attributed to several key factors. Firstly, the effective integration of physically-based and data-driven approaches leverages the strengths of both methodologies, enabling our model to capture the complex rainfall-runoff relationships more accurately than the standalone models used in the comparative studies. By combining the hydrological knowledge embedded in the WEAP model with the learning capabilities of the ANN, our proposed approach can better represent the underlying physical processes while also adapting to the unique characteristics of the study area. The methodological advancements set our study apart from the previous works and establish its credibility in the field of streamflow modeling. The superior performance of our ANN_Hybrid model, as demonstrated by the higher NSE and R² values, highlights its ability to reliably simulate streamflow in the study area. The robustness and versatility of our approach suggest that it could be applied to other watersheds with different hydrological characteristics, making it a valuable tool for water resource management and planning.

The proposed hybrid ANN model offers a practical and reliable tool for streamflow prediction in the UNB. Its superior performance and computational efficiency can support informed decision-making for water resource management, flood risk assessment, and hydraulic engineering design^39,65,67,67. The findings of this research have the potential to benefit engineers, water managers, and stakeholders in the region by providing them with an accurate and efficient means of simulating streamflow, ultimately contributing to sustainable water resource management and resilience against future challenges.

The ANN_Hybrid model’s performance relies on the quality and quantity of input data, which were obtained from limited monitoring stations that may not fully capture the spatial variability of the hydro-meteorological processes in the UNB. However, the successful application of the hybrid approach demonstrates its potential to enhance streamflow simulations in regions where long-term hydrological observations are scarce, providing a foundation for future research on hybrid modeling approaches and their application in water resource management.

Future scope: Furthermore, the methodology presented in this study can be extended to other data-limited catchments facing similar challenges in streamflow prediction. The successful application of the hybrid approach in the UNB demonstrates its potential to enhance streamflow simulations in regions where long-term hydrological observations are scarce⁴⁹. The improved predictions can ultimately contribute to more sustainable and efficient water resource management practices in these areas.

Conclusions

In this study, we proposed a novel hybrid approach, ANN_Hybrid, that combines a physically-based hydrological model (WEAP) with a data-driven model (ANN) to enhance streamflow prediction accuracy in the data-scarce UNB. The results demonstrate that the ANN_Hybrid model outperforms the standalone WEAP and ANN models in simulating streamflow, particularly during the training and testing periods. The superior performance of the ANN_Hybrid model can be attributed to its ability to leverage the strengths of both the physically-based and data-driven approaches. By integrating the hydrological knowledge embedded in the WEAP model with the learning capabilities of the ANN, the hybrid approach captures the complex rainfall-runoff relationships more accurately than the individual models. This is evidenced by the higher Nash-Sutcliffe efficiency (NSE) values achieved by the ANN_Hybrid model compared to the WEAP and ANN models during both the calibration (95.5% vs. 75% and 81%, respectively) and validation (92.3% vs. 59% and 79%, respectively) periods.

The improved streamflow predictions provided by the ANN_Hybrid model have significant implications for water resource management and planning in the UNB. The enhanced accuracy and reliability of the simulations can support better decision-making related to water allocation, reservoir operations, and flood and drought risk assessment. These improvements are particularly crucial in regions facing water scarcity and increasing water demands, such as the study area. Furthermore, the methodology presented in this study can be extended to other data-limited catchments facing similar challenges in streamflow prediction. The successful application of the hybrid approach in the UNB demonstrates its potential to enhance streamflow simulations in regions where long-term hydrological observations are scarce. By combining the strengths of physically-based models with the flexibility and learning capabilities of data-driven techniques, the proposed ANN_Hybrid approach offers a promising avenue for improving hydrological predictions in data-scarce environments.

In conclusion, this study contributes to the advancement of hydrological modeling by demonstrating the effectiveness of combining physically-based and data-driven models for improved streamflow predictions in data-scarce regions. The proposed ANN_Hybrid model offers a practical and reliable tool for supporting sustainable water management and climate change adaptation strategies in the UNB and other similar catchments. The findings provide a foundation for future research on hybrid modeling approaches and their application in water resource management, particularly in regions where data limitations and complex hydrological processes pose challenges for accurate streamflow prediction.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Moradkhani, H. &Sorooshian, S. General Review of Rainfall-Runoff Modeling: Model Calibration, Data Assimilation, and Uncertainty Analysis. in Hydrological Modelling andthe Water Cycle 1–24 (2008) (Springer Berlin Heidelberg, Berlin, Heidelberg). https://doi.org/10.1007/978-3-540-77843-1_1.
Singh, A. A concise review on introduction to hydrological models. Glob Res. Dev. J. Eng. 3, 14–19 (2018).
Google Scholar
Lévite, H., Sally, H. & Cour, J. Testing water demand management scenarios in a water-stressed basin in South Africa: application of the WEAP model. Phys. Chem. Earth. 28, 779–786 (2003).
Article ADS Google Scholar
Haddad, M., Jayousi, A., Abu Hantash, S. & Hantash, S. A. Applicability of WEAP as Water Management Decision Support System Tool on Localized Area of Watershed Scales: Tulkarem District in Palestine as Case Study. In: Eleventh International Water Technology Conference (IWTC11), Sharm El-Sheikh, Egypt, 811-825 (2007).
Ingol-Blanco, E. & McKinney, D. C. Development of a Hydrological Model for the Rio Conchos Basin. J. Hydrol. Eng. 18, 340–351 (2013).
Article Google Scholar
Ali, M. F., Saadon, A., Abd Rahman, N. F. & Khalid, K. An Assessment of Water Demand in Malaysia Using Water Evaluation and Planning System. in InCIEC 2013, 743–755 (Springer Singapore, Singapore, 2014). https://doi.org/10.1007/978-981-4585-02-6_64.
Mounir, Z. M., Ma, C. M. & Amadou, I. Application of water evaluation and planning (WEAP): a model to assess future water demands in the Niger River (in Niger Republic). Mod. Appl. Sci. 5, 38–49 (2011).
Article Google Scholar
Hoff, H., Bonzi, C., Joyce, B. & Tielbörger, K. A water resources planning tool for the Jordan River basin. Water (Switzerland). 3, 718–736 (2011).
Google Scholar
Tsoukalas, P., Dimas, & Makropoulos, C. Hydrosystem optimization on a budget: Investigating the potential of surrogate based optimization techniques. In: 14th International Conference on Environmental Science and Technology (CEST2015), Global Network on Environmental Science and Technology, University of the Aegean (2015).
Paul, N. & Elango, L. Predicting future water supply-demand gap with a new reservoir, desalination plant and waste water reuse by water evaluation and planning model for Chennai megacity, India. Groundw. Sustain. Dev. 7, 8–19 (2018).
Article Google Scholar
Agarwal, S., Patil, J. P., Goyal, V. C. & Singh, A. Assessment of Water supply–demand using water evaluation and planning (WEAP) model for Ur River Watershed, Madhya Pradesh, India. J. Inst. Eng. Ser. A. 100, 21–32 (2019).
Article Google Scholar
Tena, T. M., Nguvulu, A., Mwelwa, D. & Mwaanga, P. Assessing Water Availability and Unmet Water Demand Using the WEAP Model in the Semi-Arid Bweengwa, Kasaka and Magoye Sub-Catchments of Southern Zambia. J. Environ. Prot. (Irvine,. Calif). 12, 280–295 (2021).
Rogers, L. L. & Dowla, F. U. Optimization of ground water remediation using artificial neural networks with parallel solute transport modeling. WATER Resour. Res. 30, 457–481 (1994).
Hsu, K., -l, Gupta, H. V. & Sorooshian, S. Artificial neural network modeling of the Rainfall‐runoff process. Water Resour. Res. 31, 2517–2530 (1995).
Article ADS Google Scholar
'Sivakumar, B., Jayawardena, A. W. & Fernando, T. M. K. G. River flow forecasting: use of phase-space reconstruction and artificial neural networks approaches. J. Hydrol. 265, 225–245 (2002).
Kisi, O., Shiri, J. & Tombul, M. Modeling rainfall-runoff process using soft computing techniques. Comput. Geosci. 51, 108–117 (2013).
Article ADS Google Scholar
Chae, Y. T., Horesh, R., Hwang, Y. & Lee, Y. M. Artificial neural network model for forecasting sub-hourly electricity usage in commercial buildings. Energy Build. 111, 184–194 (2016).
Article Google Scholar
Chang, T. K., Talei, A., Quek, C. & Pauwels, V. R. N. Rainfall-Runoff modelling using a self-reliant fuzzy inference network with flexible structure. J. Hydrol. 564, 1179–1193 (2018).
Article ADS Google Scholar
Vandana, M., John, S. E., Maya, K., Sunny, S. & Padmalal, D. Environmental impact assessment (EIA) of hard rock quarrying in a tropical river basin—study from the SW India. Environ. Monit. Assess. 192, 1-18 (2020).
Sahour, H., Gholami, V. & Vazifedan, M. A comparative analysis of statistical and machine learning techniques for mapping the spatial distribution of groundwater salinity in a coastal aquifer. J. Hydrol. 591, 125321 (2020).
Chiogna, G., Marcolini, G., Liu, W., Pérez Ciria, T. & Tuo, Y. Coupling hydrological modeling and support vector regression to model hydropeaking in alpine catchments. Sci. Total Environ. 633, 220–229 (2018).
Article ADS CAS PubMed Google Scholar
Ren, W. W., Yang, T., Huang, C. S., Xu, C. Y. & Shao, Q. X. improving monthly streamflow prediction in alpine regions: integrating HBV model with bayesian neural network. Stoch. Environ. Res. Risk Assess. 32, 3381–3396 (2018).
Article Google Scholar
Farfán, J. F., Palacios, K., Ulloa, J. & Avilés, A. A hybrid neural network-based technique to improve the flow forecasting of physical and data-driven models: methodology and case studies in Andean watersheds. J. Hydrol. Reg. Stud. 27, 100652 (2020).
Article Google Scholar
Cui, Z. et al. A novel hybrid XAJ-LSTM model for multi-step-ahead flood forecasting. Hydrol. Res. 52, 1436–1454 (2021).
Article Google Scholar
Okkan, U., Ersoy, Z. B., Kumanlioglu, A., Fistikoglu, O. & A. & Embedding machine learning techniques into a conceptual model to improve monthly runoff simulation: a nested hybrid rainfall-runoff modeling. J. Hydrol. 598, 126433 (2021).
Article Google Scholar
Liang, W., Chen, Y., Fang, G. & Kaldybayev, A. Machine learning method is an alternative for the hydrological model in an alpine catchment in the Tianshan region, Central Asia. J. Hydrol. Reg. Stud. 49, 101492 (2023).
Article Google Scholar
Ji, H. et al. Adaptability of machine learning methods and hydrological models to discharge simulations in data-sparse glaciated watersheds. J. Arid Land. 13, 549–567 (2021).
Article Google Scholar
Ham, Y. G., Kim, J. H. & Luo, J. J. Deep learning for multi-year ENSO forecasts. Nature. 573, 568–572 (2019).
Article ADS CAS PubMed Google Scholar
Kim, T. et al. Can artificial intelligence and data-driven machine learning models match or even replace process-driven hydrologic models for streamflow simulation? A case study of four watersheds with different hydro-climatic regions across the CONUS. J. Hydrol. 598, 126423 (2021).
Article Google Scholar
Kim, T., Shin, J.-Y., Kim, H., Kim, S. & Heo, J.-H. The Use of Large-Scale Climate Indices in Monthly Reservoir Inflow Forecasting and Its Application on Time Series and Artificial Intelligence Models. Water 11, 374 (2019).
Parisouj, P. et al. Physics-informed data-driven model for predicting streamflow: a case study ofthe Voshmgir Basin, Iran. Appl. Sci. 12, 7464 (2022).
Senent-Aparicio, J., Jimeno-Sáez, P. & Martínez-España, R. Pérez-Sánchez, J. Novel approaches for Regionalising SWAT parameters based on machine learning clustering for estimating Streamflow in Ungauged Basins. Water Resour. Manag. 38, 423–440 (2024).
Article Google Scholar
Gupta, A., Govindaraju, R. S., Li, P. C. & Merwade, V. On constructing limits-of-acceptability in watershed hydrology using decision trees. Adv. Water Resour. 178, 104486 (2023).
Article Google Scholar
Yifru, B. A., Lim, K. J., Bae, J. H., Park, W. & Lee, S. A hybrid deep learning approach for streamflow prediction utilizing watershed memory and process-based modeling. Hydrol. Res. 55, 498–518 (2024).
Article Google Scholar
Wegayehu, E. B. & Muluneh, F. B. Comparing conceptual and super ensemble deep learning models for streamflow simulation in data-scarce catchments. J. Hydrol. Reg. Stud. 52, 101694 (2024).
Article Google Scholar
Yang, S. et al. A physical process and machine learning combined hydrological model for daily streamflow simulations of large watersheds with limited observation data. J. Hydrol. 590, 125206 (2020).
Article Google Scholar
Gharbia, S. et al. Hybrid Data-Driven models for Hydrological Simulation and Projection on the Catchment Scale. Sustain. 14, 1–23 (2022).
Google Scholar
Rahman, K. U. et al. Comparison of machine learning and process-based SWAT model in simulating streamflow in the Upper Indus Basin. Appl. Water Sci. 12, 178 (2022).
Article ADS Google Scholar
Gharbia, S. et al. Hybrid Data-Driven models for Hydrological Simulation and Projection on the Catchment Scale. Sustainability. 14, 4037 (2022).
Article Google Scholar
Sahu, R. T., Verma, S., Kumar, K., Verma, M. K. & Ahmad, I. Testing some grouping methods to achieve a low error quantile estimate for high resolution (0.25° x 0.25°) precipitation data. J. Phys. Conf. Ser. 2273, 0–16 (2022).
Article CAS Google Scholar
Verma, R. K., Verma, S., Mishra, S. K. & Pandey, A. SCS-CN-Based Improved models for Direct Surface Runoff Estimation from large rainfall events. Water Resour. Manag. 35, 2149–2175 (2021).
Article Google Scholar
Azharuddin, M., Verma, S., Verma, M. K. & Prasad, A. D. A synoptic-scale Assessment of Flood events and ENSO—Streamflow variability in Sheonath River Basin, India. Lect Notes Civ. Eng. 176, 93–104 (2022).
Article Google Scholar
Samantaray, S. et al. Suspended sediment load prediction using sparrow search algorithm-based support vector machine model. Sci. Rep. 14, 12889 (2024).
Samantaray, S. & Sahoo, A. Groundwater level prediction using an improved ELM model integrated with hybrid particle swarm optimisation and grey wolf optimisation. Groundw. Sustain. Dev. 26, 101178 (2024).
Article Google Scholar
Bhusal, A., Parajuli, U., Regami, S. & Kalra, A. Application of machine learning and process-based models for. Hydrology. 9, 1–20 (2022).
Article Google Scholar
Sahoo, A., Parida, S. S., Samantaray, S. & Satapathy, D. P. Daily flow discharge prediction using integrated methodology based on LSTM models: Case study in Brahmani-Baitarani basin. HydroResearch. 7, 272–284 (2024).
Article Google Scholar
Sahoo, A., Behera, S. & Sharma, N. Performance comparison of LS-SVM and ELM-based models for precipitation prediction in Barak valley: Acase study. in vol. 2745, 020004 (2023).
Sellami, H., La Jeunesse, I., Benabdallah, S., Baghdadi, N. & Vanclooster, M. Uncertainty analysis in model parameters regionalization: a case study involving the SWAT model in Mediterranean catchments (Southern France). Hydrol. Earth Syst. Sci. 18, 2393–2413 (2014).
Article ADS Google Scholar
Swain, J. B. & Patra, K. C. Streamflow estimation in ungauged catchments using regionalization techniques. J. Hydrol. 554, 420–433 (2017).
Article ADS Google Scholar
Humphrey, G. B., Gibbs, M. S., Dandy, G. C. & Maier, H. R. A hybrid approach to monthly streamflow forecasting: integrating hydrological model outputs into a bayesian artificial neural network. J. Hydrol. 540, 623–640 (2016).
Article ADS Google Scholar
Abera Abdi, D. & Ayenew, T. Evaluation of the WEAP model in simulating subbasin hydrology in the Central Rift Valley basin, Ethiopia. Ecol. Process. 10, 41 (2021).
Suryawanshi, R. A. & Shirke, A. J. Watershed management of subernarekha river basin using WEAP. Int. J. Recent. Trends Sci. Technol. 12, 156–163 (2014).
Google Scholar
Adnan, R. M. et al. Daily streamflow prediction using optimally pruned extreme learning machine. J. Hydrol. 577, 123981 (2019).
Article Google Scholar
Kratzert, F., Klotz, D., Brenner, C., Schulz, K. & Herrnegger, M. Rainfall-runoff modelling using long short-term memory (LSTM) networks. Hydrol. Earth Syst. Sci. 22, 6005–6022 (2018).
Article ADS Google Scholar
Zhang, X., Li, P., Li, Z., Bin, Yu, G. Q. & Li, C. Effects of precipitation and different distributions of grass strips on runoff and sediment in the loess convex hillslope. Catena. 162, 130–140 (2018).
Article Google Scholar
Kratzert, F. et al. Toward improved predictions in Ungauged basins: exploiting the power of machine learning. Water Resour. Res. 55, 11344–11354 (2019).
Article ADS Google Scholar
Mao, G. et al. Comprehensive comparison of artificial neural networks and long short-term memory networks for rainfall-runoff simulation. Phys. Chem. Earth. 123, 103026 (2021).
Article Google Scholar
Gupta, H. V., Sorooshian, S. & Yapo, P. O. Status of automatic calibration for hydrologic models: comparison with Multilevel Expert Calibration. J. Hydrol. Eng. 4, 135–143 (1999).
Article Google Scholar
Basic Parameters. https://www.weap21.org/webhelp/BasicParameters.htm. Accessed 27 September 2024.
WEAP. Water Evaluation And Planning System. https://www.weap21.org/index.asp?action=213. Accessed 27 September 2024.
Soil Moisture Method Climate. https://www.weap21.org/webhelp/two-bucket_climate.htm. Accessed 27 September 2024.
Ismail Dhaqane, A., Murshed, M. F., Mourad, K. A. & Abd Manan, T. S. B. Assessment of the Streamflow and Evapotranspiration at Wabiga Juba Basin Using a Water Evaluation and Planning (WEAP) Model. Water 15, 2594 (2023).
Tena, T. M., Nguvulu, A., Mwelwa, D. & Mwaanga, P. Assessing water availability and Unmet Water demand using the WEAP model in the Semi-arid Bweengwa, Kasaka and Magoye Sub-catchments of Southern Zambia. J. Environ. Prot. (Irvine Calif). 12, 280–295 (2021).
Article Google Scholar
Zhang, Z., Zhang, Q. & Singh, V. P. Univariate streamflow forecasting using commonly used data-driven models: literature review and case study. Hydrol. Sci. J. 63, 1091–1111 (2018).
Article Google Scholar
Senent-Aparicio, J., López-Ballesteros, A., Jimeno-Sáez, P. & Pérez-Sánchez, J. Recent precipitation trends in Peninsular Spain and implications for water infrastructure design. J. Hydrol. Reg. Stud. 45, 101308 (2023).
Eguibar, M. Á., Porta-garcía, R., Torrijo, F. J. & Garzón‐roca, J. Flood hazards in flat coastal areas of the eastern iberian peninsula: a case study in oliva (Valencia, Spain). Water (Switzerland). 13, 1–24 (2021).
Google Scholar
Guo, Y., Zhang, Y., Zhang, L. & Wang, Z. Regionalization of hydrological modeling for predicting streamflow in ungauged catchments: a comprehensive review. Wiley Interdiscip Rev. Water. 8, 1–32 (2021).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Civil Engineering, Maulana Azad National Institute of Technology, Bhopal, 462003, India
Sachin Kumar & Mahendra Kumar Choudhary
National Institute of Hydrology, Bhopal, 462003, India
T. Thomas

Authors

Sachin Kumar
View author publications
Search author on:PubMed Google Scholar
Mahendra Kumar Choudhary
View author publications
Search author on:PubMed Google Scholar
T. Thomas
View author publications
Search author on:PubMed Google Scholar

Contributions

All authors contributed to the conception and modelling of the study. Sachin Kumar: Data collection, Model analysis, Model set-up, Writing an original draft of the paper. M. K. Choudhary: Supervision, Review and T. Thomos: Supervision, Visualization, Review, Editing.

Corresponding author

Correspondence to Sachin Kumar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kumar, S., Choudhary, M.K. & Thomas, T. A hybrid technique to enhance the rainfall-runoff prediction of physical and data-driven model: a case study of Upper Narmada River Sub-basin, India. Sci Rep 14, 26263 (2024). https://doi.org/10.1038/s41598-024-77655-5

Download citation

Received: 31 July 2024
Accepted: 24 October 2024
Published: 01 November 2024
DOI: https://doi.org/10.1038/s41598-024-77655-5

Keywords

This article is cited by

Review of machine learning and WEAP models for water allocation under climate change
- Deme Betele Hirko
- Jakobus Andries Du Plessis
- Adele Bosman
Earth Science Informatics (2025)

Subjects

Abstract

Similar content being viewed by others

Development of a new hybrid model to enhance streamflow estimation using artificial neural network and reptile search algorithm

Hydrological model-based streamflow reconstruction for Indian sub-continental river basins, 1951–2021

Restructuring and serving web-accessible streamflow data from the NOAA National Water Model historic simulations

Introduction

Study area

Methods

Data used

Rainfall

Wind speed & humidity

Temperature (maximum & minimum)

Runoff

Water Evaluation and Planning (WEAP) model

Artificial-Neural-Network (ANN) model

Performance evaluation criteria

Coefficient of determination (R2)

Nash Sutcliffe Efficiency (NSE)

Percentage Bias (%bias)

RMSE-observations standard deviation ratio (RSR)

Ethical approval

Results

Sensitivity analysis of WEAP

WEAP model calibration

WEAP model validation

ANN model

ANN-based hybrid technique

Discussions

Conclusions

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Informed consent

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Review of machine learning and WEAP models for water allocation under climate change

Search

Quick links

Coefficient of determination (R²)