Abstract
Improving the accuracy of reference evapotranspiration (RET) estimation is essential for effective water resource management, irrigation planning, and climate change assessments in agricultural systems. The FAO-56 Penman-Monteith (PM-FAO56) model, a widely endorsed approach for RET estimation, often encounters limitations due to the lack of complete meteorological data. This study evaluates the performance of eight empirical models and four machine learning (ML) models, along with their hybrid counterparts, in estimating daily RET within the Gharb and Loukkos irrigated perimeters in Morocco. The ML models examined include Random Forest (RF), M5 Pruned (M5P), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), with hybrid combinations of RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM. Six input combinations were created, utilizing Tmax, Tmin, RHmean, Rs, and U2, with the PM-FAO56 model serving as the benchmark. Model performance was assessed using four statistical indicators: Kling-Gupta efficiency index (KGE), coefficient of determination (R2), mean squared error (RMSE), and relative root squared error (RRSE). Results indicate that the Valiantzas 2013 (VAL2013b) model outperformed other empirical models across all stations, achieving high KGE and R2 values (0.95–0.97) and low RMSE (0.32–0.35 mm/day) and RRSE (8.14–10.30%). The XGBoost-LightGBM and RF-LightGBM hybrid models exhibited the highest accuracy (average RMSE of 0.015–0.097 mm/day), underscoring the potential of hybrid ML models for RET estimation in subhumid and semi-arid regions, thereby enhancing water resource management and irrigation scheduling.
Similar content being viewed by others
Introduction
Reference evapotranspiration (RET) is a crucial hydrological cycle element, responsible for a significant portion of water loss from continental surfaces1,2,3. It accounts for approximately 62% of the rainfall contribution, equivalent to around 73,000 km³ per year4. RET serves as a powerful indicator for climate change studies5,6,7,8 and plays a vital role in many fields, including hydrology, agriculture, ecology, and water resource management. Notably, RET can also be instrumental in addressing natural hazards like dry spells, heat waves, and flash droughts9,10,11. While direct estimation methods like latent heat flux or real evapotranspiration from climate models offer accuracy, they often require complex modelling and high-resolution data, posing practical challenges. The intricate nature of modelling the soil-vegetation-atmosphere interaction further complicates accurate RET estimation. Therefore, understanding the processes and assessing RET is crucial for effective water resource management and planning, especially in semi-arid and dry locations where water availability is restricted. Raza et al.12 presented potential evapotranspiration (PET) and RET differentiation and categorized their empirical equations based on different meteorological factors.
The Penman-Monteith technique (PM-FAO56), a revised version of the Penman Equation 13, is widely considered as the most accurate method for estimating RET, and has been endorsed by FAO14 and the Task Committee on Standardization15. Despite its global acceptance, PM-FAO56 relies dependent on meteorological parameters such as wind speed, relative humidity, and solar radiation, which may be unavailable in certain weather stations, particularly in developing countries. Further, this limitation hinders its application in regions with limited weather data. Consequently, researchers have sought alternative methods for estimating RET through comparison of PM-FAO56 method with other empirical methods or methods development through meteorological data obtained from remote sensing.
Numerous researches have been conducted to investigate the effectiveness of different RET methods across different regions and climatic conditions. These studies compare empirical models with PM-FAO56 using different approaches16,17,18,19, including (a) temperature-based methods, (b) radiation-based methods, (c) mass transfer based methods, (d) methods combining radiation and temperature, and (e) methods integrating radiation, temperature, mass transfer, and other variables. For example, Almorox et al.18 assessed eleven temperature-based potential evapotranspiration (PET) estimation methods and determined that the Hargreaves and Samani model exhibited the most accurate performance on a global scale across diverse climatic regions. Additionally, comparisons have been made between empirical models and data obtained from lysimeters20,21. In Morocco, few scientists have investigated RET performance with existing empirical models/methods16,20,22,23. Er-raki et al.16 evaluated three empirical RET estimation methods for Tensift Basin (Morocco’s center) and Yaqui Valley (Northwest Mexico) during 2003–2004. In a semi-arid region, they suggest using Hargreaves and Samani model without calibration (as long as the wind remains low). They suggested that calibration is required for both Priestley-Taylor and Makkink parameters, particularly for dry periods. Several methods were examined by Bouhlassa and Paré22 to choose an appropriate solution to PM-FAO56 equation for 1989–2001 in Tafilalet, an arid region in southeastern Morocco. Their findings indicate that Jensen-Haise and Thornthwaite methods best-reflected evapotranspiration obtained by Penman-Monteith-FAO method. Similary, Hadria et al.23 conducted a calibration and validation analysis of five temperature-based empirical models in 22 meteorological stations across Morocco. Their results demonstrated that Dorji’s estimate outperformed the other empirical models and they introduced a new fit version called RET-Hadria, specifically designed for assessing RET in arid and semi-arid areas. Zeggaf20 compared lysimeter results with various empirical methods for Ouled Gnaou (Morocco’s semi-arid central region) during 1975, 1977, and 1978 years. They found that Priestley-Taylor method gave better results, followed by Penman-Monteith method. Besides, researchers like Liou and Kar24 and Elfarkh et al.25 have used remote sensing (RS) images and processes in geographical information system (GIS) to enhance evapotranspiration estimation.
In recent years, machine learning models have garnered considerable attention in various fields26,27,28,29,30,31,32. For instance, a model’s capacity to represent intricate nonlinear relationships has a significant impact on RET estimation. Goyal et al.33 highlighted the promising findings of ML models in various climates and environments, emphasizing their ability to improve accuracy above standard empirical models. Besides, researchers have applied various ML models, such as artificial neural networks (ANN)34,35, support vector regression (SVR)36,37, M5 model tree38,39, random forests (RF)40,41,42,43, reduced error pruning tree (REPTree)44,45, extreme gradient boosting (XGBoost)46,47,48, light gradient boosting machine (LightGBM)34,49,50 and decision trees (DT)51,52,53 to estimate daily RET uing restricted meteorological data. For instance, Granata38 conducted a comparative investigation of M5P Regression Tree, Bagging, RF and SVR with differing input combination (Tmean, RHmean, Rs, U2) for RET estimation in a humid subtropical climate region of Central Florida. They concluded that the M5P models exhibited good performance, while RF proved to be the least accurate. In China, Fan et al.40 examined limited meteorological data to investigate four empirical models and three ML models (LightGBM, M5Tree, and RF) to estimate daily RET. Their findings suggested that LightGBM outperformed the other models, with input combinations comprising Tmax, Tmin, U2, Rs, and RHmean. Similary, Fan et al.46 compared six ML models, including SVM, gradient boosting decision tree (GBDT), M5Tree, XGBoost, ELM and RF, and using meteorological data from eight Chinese stations. Their findings revealed that the GBDT and XGBoost models exhibited performance on par with the SVM and ELM models, while offering advantages in terms of simplicity, accuracy, stability, and reduced computational costs, making them recommended options for daily RET estimation. Additionally, Yong et al.34 evaluated performance of LightGBM, ANN and decision forest regression (DFR) in five Malaysian meteorological stations and reported that LightGBM and ANN have proven stable and accurate in determining daily RET. It is worth mentioning that the ML models’ performance in daily RET estimation is influenced by various factors, including the selection of input climatic variables, model structure, basic parameters, and performance criteria. Careful consideration of these factors and the correlation between individual input variables and RET is crucial for optimizing the ML models’ accuracy and efficiency. Additionally, effective tuning of ML model parameters further enhances their performance and efficiency for correct estimation33,34,35,36,38,40,44,46.
Nowadays, recent studies33,54 in RET estimation have emphasized the use of hybrid models, combining multiple ML algorithms through blending or stacking techniques, to address the challenges posed by highly complex meteorological data. Goyal et al.33 noted that standalone ML models may not achieve sufficient accuracy in such cases. For example, Elbeltagi et al.54 assessed five hybrid models (additive regression (AR) with bagging, M5tree, random subspace, REPTree, ANN, and RF) for a semi-arid area in Pakistan. They concluded that AR-M5tree model is the most appropriate hybrid model for estimating RET. Hence, this highlights the effectiveness of hybrid models in improving performance while maintaining interpretability. However, their RET estimation’s utilization is currently limited, and the available information on this subject is incomplete and fragmented. There is a need for further research and investigation to fully explore the hybrid models’ potential and effectiveness in addressing evapotranspiration estimation challenges.
In Morocco, few studies51,55,56 have focused on evaluating ML models for estimating RET. Recently, Lachgar et al.51 studied the performance of five ML models, like RF, Linear Regression (LR), SVR, k-Nearest Neighbor (k-NN), and DT for estimating RET between 2011 and 2019 in Fez. They highlighted the ML models’ ability to capture the variance in RET. In Marrakesh, El Hachimi et al.55 investigated the performance of SVM, RF, DT, k-NN, LR, XGboost for estimating RET during 2013–2020 and found that XGboost surpasses the other models, followed by RF. To our best knowledge, there is currently no existing comparative research on the use of hybrid models for estimating daily RET in Morocco. Thus, the novelty of this research lies in the comprehensive comparison of empirical models, ML models, and their hybrid models for estimating daily RET in subhumid and semi-arid climates.
The primary objectives of the present research are as follows: (i) estimating RET using eight empirical models, namely, Valiantzas 2013 (VAL2013a and VAL2013b); Dalton 1802 (Dal1802); Trabert 1896 (Trab1896) Hargreaves, 1975 (Harg1975); Irmak and Haman, 2003 (Irs2003); Hargreaves and Samani, 1985 (HargS1985); and Allen and Pruitt, 1986 (BC1986) and comparing their performance with standard FAO-PM56 method, (ii) development of ML models (RF, M5P, XGBoost and LightGBM) and their hybrid models (RF-M5P, RF-XGBoost, RF-LightGBM and XGBoost-LightGBM) using different meteorological input combinations (based on maximum and minimum air temperature (Tmax and Tmin), relative humidity (RHmean), solar radiation (Rs), and wind speed (U2)), (iii) evaluating performance of developed ML and hybrid models using different statistical indices to determine the best for RET estimation. The findings of this research will contribute to assessing the performance and suitability of selected machine learning models for reference evapotranspiration (RET) estimation under the specific climatic conditions of Morocco. This will support improved planning for water resource management and irrigation practices.
Materials and methods
Study area and data collection
Figure 1 illustrates the research region’s geographic distribution. This study region consists of two perimeters which are among Morocco’s most important irrigated perimeters. It covers an area of 6,007.14 km2, which is 57.2% for Gharb and 42.8% for Loukkos perimeter. It has a Mediterranean climate (Csa) with an oceanic impact, according to Köppen’s classification. The differences between the two perimeters are shown in Table 1.
Observed daily minimum and maximum daily air temperature (Tmin and Tmax), minimum and maximum relative humidity (RHmax and RHmin), solar radiation (Rs), mean wind speed at 2 m height (U2) and precipitation (P) were obtained from five weather stations to estimate reference evapotranspiration. These stations were strategically chosen, with three situated within the Gharb perimeter, representing three out of the five districts, and the remaining two located within the Loukkos perimeter (Figs. 1 and 2). The selection criteria considered the availability of reliable data and practical constraints, allowing for a focused analysis on representative stations within the study area. Meteorological data were provided by two Regional Offices for Agricultural Development, ORMVAG and ORMVAL, with varying data collection periods covering both perimeters. Detailed information on the studied station locations, statistical characteristics of observed meteorological data, and missing data percentage are described in Table 2.
Missing data imputation
In general, the database contains missing data due to weather station malfunctions. To address this issue, we employed either deletion or imputation. For the imputation process, we utilized the Multivariate Chain Equations Imputation Method (MICE) developed by Van Buuren and Groothuis-Oudshoorn57. This approach enabled us to assess whether imputing missing data would impact the selection of estimation methods. Otherwise, the other missing data was completely removed. The MICE method comprises three main steps. Firstly, a regression model is chosen for the variable being studied. Then, missing data values are iteratively assigned random values based on observed data. Finally, imputed values are estimated using the regression coefficients obtained for each dataset. We opted for this method due to its practicality and effectiveness in analysing precipitation data58.
Table 2 reveals that the solar radiation series for the Sidi Allal Tazi station had the highest proportion of missing data, accounting for 10.77% of the observed series. Through the MICE method, we imputed 9.4% of the solar radiation series for the Sidi Allal Tazi station. Overall, the deleted data represented between 1.37% and 4.48% of the database for each station studied.
Estimating evapotranspiration via FAO-56 Penman-Monteith and empirical models
A series of models were designed by researchers to estimate reference evapotranspiration59,60,61,62. In this research, eight empirical models, divided into four groups (combination, mass transfer, radiation, and temperature), were selected, as indicated in Table 3. Model selection focused on variables’ availability, usage extent, and simplicity. Subsequently, we compared these models with Penman-Monteith model [PM-FAO56, Eq. 114]. Table 4 summarizes the climate parameters for each empirical model.
Machine learning models and hybrid models description
Different equations and models to estimate daily RET were compared. Linear Regression (LR) was used to compare eight empirical equations with the PMFAO56 model. Additionally, we explored the impact of various meteorological variables on RET estimation using ML algorithms like Random Forest (RF), M5 Pruned (M5P), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM). Additionally, we combined these models to create hybrid models like RF-M5P, RF-XGBoost, RF-LightGBM and XGBoost-LightGBM.
Linear regression (LR)
Linear regression is a well-known approach for modelling a dependent variable’s value via one or more independent variables. In this research, The LR equation is written as follows.
\(\:{\text{R}\text{E}\text{T}\:}_{\text{P}{\text{M}}_{\text{i}}}\) represents observed values (estimated by PM-FAO 56); \(\:{\text{R}\text{E}\text{T}\:}_{\text{c}\text{a}{\text{l}}_{\text{i}}}\) represents values estimated by different empirical models.
Random forests (RF)
RF model, introduced by Breiman66, is an ensemble approach that integrates numerous decision trees to create a powerful prediction model. It is widely utilized for regression and forecasting tasks due to its ability to capture complex, non-linear interactions between features. It works by generating a collection of random binary trees through bootstrapping, where each tree is trained on a randomly sampled subset of observations from the training dataset. The remaining data, known to as “out-of-bag” (OOB) data, is used for evaluating the model’s performance. RF exhibits several advantages, including strong generalization capabilities, robustness to outliers, and the ability to tune hyperparameters easily. By aggregating the results from individual trees, RF produces a final prediction, often using methods like majority voting. This ensemble approach helps mitigate overfitting and minimize variance by training on various data samples. To further control overfitting, the minimum leaf size parameter can be adjusted, requiring a minimum number of observations to generate child nodes.
M5 pruned (M5P)
The M5P model, also known as M5 Pruned, is a decision tree algorithm introduced by Quinlan67. This model runs in two steps, providing a novel approach to regression challenges. It separates the input data into subgroups in the first phase and applies linear regression models to each subset based on their partial attribute values. This enables the model to record variable relationships and construct regression equations at each node. Furthermore, the M5P model is typically built as a tree, starting with a root node and branching out into subnodes that reflect the regression equations. It can easily handle huge datasets with high dimensionality68, making it useful for investigating complicated systems like evapotranspiration estimates. Additionally, it does not require parameter adjustment, which simplifies the process.
Extreme gradient boosting (XGBoost)
XGBoost model is an improved Gradient Boosting Machines (GBMs) version presented by Chen and Guestrin69 that expands on the notion of “boosting” weak learners. Through additive training procedures, it combines numerous weak models to generate a powerful learner. By simplifying goal functions and providing parallel calculations during training, XGBoost tries to minimize overfitting while decreasing computational costs. It offers a scalable and effective solution for both regression and classification workloads27. With features such as distributed computing, pruning strategies, and management of missing data, the approach is meant to maximize speed. Because of its efficacy, versatility, and capacity to handle big datasets, XGBoost has become a popular choice.
Light gradient boosting machine (LightGBM)
LightGBM model is a gradient-boosting architecture that improves model performance while using less memory than conventional models. LightGBM is distinguished by its novel leaf-wise development, which develops trees by focusing on individual leaves rather than sequentially expanding the branches70. This approach enables efficient tree formation and increased computing performance. LightGBM also incorporates two approaches: gradient-based one-sided sampling and exclusive feature bundling (EFB). These approaches improve model performance by allowing for more efficient feature sampling and grouping. Overall, LightGBM is an efficient technique for dealing with massive datasets and producing accurate predictions while conserving resources. One of the most essential variables impacting the accuracy of a given model is the selection of proper hyper-parameters. The hyper-parameters employed in this study, shown in Table 5, were chosen through grid search optimization, supported by prior ___domain knowledge. This approach enabled systematic evaluation of parameter values to minimize prediction error and optimize performance. The resulting tuning achieved a balance between computational efficiency and accuracy across all models.
Weighted hybridization for ML algorithms
Weighted hybridization is an approach that combines different algorithms to increase the accuracy of evapotranspiration estimate. Individual algorithms are given varied weights based on their performance, as proposed by Nourani et al.36. More information can be found in33,71. The hybrid model provides more accurate and resilient predictions by harnessing the capabilities of each algorithm and assessing their relative relevance. This method is critical for reducing the constraints of individual algorithms and increasing the dependability of evapotranspiration estimations.
To ensure consistent scale and improve modelling capabilities, the input data for the ML models were normalised. This process, described by Eq. (11), transformed the data to a range between 0 and 1.
where, xnorm represents the normalised value, x0 is the real value, and xmin and xmax are the minimum and maximum values respectively.
It should be noted that the XGBoost model does not require input variable normalization since it is not sensitive to monotonic input variable normalization.
Input combinations
To investigate the impact of different meteorological variables on RET estimation, six combinations were utilized, as outlined in Table 6. The inputs for the ML models included air temperature (Tmax and Tmin), relative humidity (RHmean), solar radiation (Rs), and wind speed (U2). Likewise, the observed meteorological data were divided into two sets: a training set comprising 70% of the data and a separate testing set for evaluating the performance of the models. This division ensured that the models thoroughly evaluated on a substantial amount of data from each ___location and were rigorously assessed on an independent dataset. All the chosen ML models and simulations were implemented using R software (version 4.2.2).
Evaluation performance
To evaluate the model’s performance in comparison to the standard Penman-Monteith model (FAO-PM56), four statistical metric parameters were employed. These metrics encompassed the Kling Gupta Efficiency index (KGE, Eq. 12), Coefficient of determination (R2, Eq. 13), Mean Squared Error (RMSE, Eq. 14), and Root relative squared error (RRSE, Eq. 15). The selection of these metrics aimed to assess the precision, accuracy, under/overestimation, and provide a means for model comparison72,73,74. To rank the models, the RMSE and RRSE values were arranged in ascending order, while the R2 and KGE values were ordered in descending order. The specific formulas for these metrics can be found in Table 7.
Proposed model development for RET estimation
Figure 3 illustrates the flowchart outlining the suggested empirical, machine learning (ML), and hybrid models for estimating reference evapotranspiration (RET). LR, RF, M5P, XGBoost, LightGBM, RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM are the models used for RET estimation. As input variables, these models use six distinct climatic combinations (Comb1 - Comb6). The performance of each model was extensively explored by assessing the statistical metrics parameters listed above.
Results and discussions
Correlation between PM-FAO 56 daily RET and meteorological variables
Figure 4 shows a correlation between meteorological variables and daily reference evapotranspiration estimated by PM-FAO 56 model at each station studied. Results shows the RET is primarily affected by solar radiation where Pearson’s coefficients were above 78%, indicating a good correlation. This suggests that models that incorporate radiation-based variables may perform better in estimating RET compared to those that rely on temperature-based or mass transfer-based variables. For all Gharb stations, Rn obtained the highest correlation, with values ranging from 0.96 to 0.97, followed by Rs and Ra. However, Rs is the most correlated to Rn and Ra for all Loukkos stations. One of these three variables is explicitly contained in all combination models and those based on radiation-based (Table 3). Temperatures occupy the second place as r values vary from 0.53 to 0.79. Moreover, maximum temperature for Loukkos stations is more correlated with reference evapotranspiration than with mean and minimal temperatures. This leads to saying that a model containing maximum temperature coupled with solar radiation could better estimate RET for these stations. Recent research by Chia et al.71 pointed out that temperature and radiation are indispensable for estimating RET in semi-arid regions. Conversely, in sub-humid regions, the RET estimation requires the inclusion of evaporation in addition to temperature and radiation. This review underscores the significance of considering distinct climatic conditions when estimating RET in various regions.
Figure 4 further indicate that air vapor pressure deficit (VPD) is moderately correlated with reference evapotranspiration ranging from 0.56 to 0.74. Moreover, mass transfer models usually use this VPD variable as input and lack other variables that could improve RET estimation. On the other hand, relative humidity is negatively correlated with reference evapotranspiration, with r values varying from − 0.63 to -0.27. In the Gharb stations, it is noteworthy that RHmax correlation coefficient is higher than RHmin, regardless of the study period. In line with previous studies33,44,71, our findings support the positive correlation of Tmean, Tmax, Tmin, and Rs with RET. Wind speed shows a slight correlation, while relative humidity (RHmean) exhibits a negative correlation with RET. The correlation coefficients for wind speed (U2) are relatively low, ranging from 0.21 to 0.40, with the exception of the MB station, which exhibits a correlation coefficient of 0.50. The consideration of wind impact in RET estimation varies among researchers, with some arguing that wind is a significant factor due to potential data inaccuracies75, while others suggest that wind has minimal influence76 except in areas with high wind conditions. These results align with the findings of other studies46,54, providing further evidence of the relationship between meteorological variables and RET.
Empirical models’ comparison for daily reference evapotranspiration estimates
The statistical results of the eight empirical models (Dal1802, Trab1896, Harg1975, Irs2003, HargS1985, BC1986, VAL2013a and VAL2013b) for estimating daily RET at the five meteorological stations within the Gharb and Loukkos perimeters are presented in Tables 8 and 9. As above mentioned, the computed statistical indicator values (KGE, R2, RMSE and RRSE) were obtained using model performance equations [Eqs. 12–15], evaluated against the PM - FAO 56 model during the training and testing phases. The models were prioritized based on their statistical errors, and the best model was identified accordingly. KGE, R2, RMSE, and RRSE were determined to be 0.291–0.974, 0.249–0.964, 0.182–1.287 mm/day and 8.572–51.913%, respectively, during training; and 0.386–0.972, 0.366–0.966, 0.172–1.302 mm/day and 8.137–47.993% during testing. As seen in the table, the empirical models exhibited minimal differences (with an RRSE gap from − 1.57 to 4.63%) between the training and testing phases.
Notably, the combination models (VAL2013a, VAL2013b) outperformed other models across all stations, with KGE R2, RMSE, and RRSE ranging 0.947–0.974, 0.942–0.966, 0.318–0.402, 8.137–10.739 respectively, during testing phase. Except for Mensara station, Trab1896 performed successfully at the training and testing phases. It was noticed that The VAL13b model performs noticeably better than the VAL13a model, owing to the distinct variable requirements of each model. This is because it includes all contributing factors affecting RET.
Notably, VAL13b incorporates additional variables, such as relative humidity and wind speed, which account for the aerodynamic effects on RET. Similary, Kisi77 evaluated seven empirical models to the PM-FAO 56 in Mediterranean climate and Valiantzas (2013b) was found to be the best model.
The analysis indicates that temperature-based models, specifically Hargreaves and Samani62 and Brutsaert and Chen78, demonstrate superior performance compared to radiation-based models such as Hargreaves61 and Irmak et al.79, as well as mass transfer models like Dalton59 and Trabert60. These findings are consistent with the conclusions reported by multiple researchers in the field17. Conversely, it is important to note that some researchers20,76,80 have suggested that radiation-based models may perform better than temperature-based models. The higher effectiveness of temperature-based models can be related to temperature’s relative stability compared to solar radiation, which changes depending on conditions like as cloud cover, meteorological conditions, and time of day. Consequently, variations in solar radiation create uncertainty in radiation-based models. Generally, temperature appears to be a more powerful element than solar radiation in promoting evapotranspiration in dry or semi-arid regions where water supply is limited72,73,74.
Specifically, the HargS1985 model demonstrates superior performance compared to the BC1986 model across all stations. This finding is congruent with Er-raki et al.16 results, who found that the HargS1985 model provides more accurate estimations in semi-arid conditions of the Tensift basin. Similarly, Almorox et al.18 reported that the HargS1985 model had the greatest overall performance after analyzing eleven temperature-based models on a worldwide scale. In contrast, other studies have reported that the Hargreaves-Samani model62 tends to underestimate reference evapotranspiration in arid regions and overestimate it in wetland environments81,82.
In term of mass transfer models, Trab1896 performed better than Dal1802 at most stations. The mass transfer models used in this study were unsuitable for estimating daily RET in this region due to substantial statistical errors, with the exception of the Menasra station. In term of radiation models, Harg1975 was generally superior to Irs2003. On the other hand, we found that RET ranking is generally like SAT station data and data supplemented by Mice imputation method (SAT mice). This means that the filling of solar radiation (Rs) gaps did not affect the choice of RET estimation method.
The scatter plots in Fig. 5 illustrate the comparison between the estimated RET values obtained from the best empirical models and the FAO56-PM values during testing at the two meteorological stations. When the data points in the scatter plot were closely matched with the diagonal 1:1 trend line, a strong fit was noticed, showing high agreement between RET estimated by PM-FAO 56 and by empirical model. When the data points deviated considerably from the trend line, it indicated a poor fit, suggesting a lack of connectivity between the two previously estimated models. Overall, the data points in the plots demonstrate a strong correlation, aligning closely to the 1:1 line for models Val2013a, Val2013b, Harg195, and Harg1985.
Comparison of standalone and hybrid ML models using various input combinations
Table 10 presents the averaged statistical performance indicators (KGE, R2, RMSE, and RRSE) values for estimating RET across five meteorological stations in the Loukkos and Gharb perimeters, categorized by model and input combination used during testing phase. Additionally, Fig. 6 displays the KGE and RRSE values for each meteorological station during training and testing phases, comparing the four ML models and four hybrid models with six different input combinations. Overall, the statistical indicators values were found to varied substantially based on input combination, model types, and phase employed.
It can be seen from Table 10 that, independent of the perimeter and input combination, KGE, R2, RMSE, and RRSE values varied from 0.557 to 0.982, 0.585–0.979, 0.015–0.108, and 6.925–35.360 respectively during testing phase. These results clearly surpass those produced by the empirical model. The RMSE values achieved by these models were smaller than those reported by other researchers using different ML models in various regions34,40,46,51. It’s worth noting that discrepancies in RMSE values between studies might be caused by various factors such as ML model type and input variable selection, time periods chosen, climate conditions, and data quality83,84,85.
Among the different input combinations evaluated across all eight models, it was observed that the ML and hybrid models displayed the poorest performance when utilizing Tmax, Tmin, and RHmean as input (combination 2). On average, the R2 values ranged from 0.585 to 0.669, while the RRSE values varied between 26.724% and 35.360%, indicating relatively lower accuracy compared to other input combinations. This can be attributed to the negative correlation observed between RHmean and RET estimated, as depicted in Fig. 4. Furthermore, the limited information provided by these variables may not fully capture the complex relationships involved in accurately estimating RET. When comparing combination 1 (Tmax, Tmin, Rs) with combination 3 (Tmax, Tmin, Rs, U2), it was observed that the inclusion of U2 improve slightly the R2 (difference < 0.12) and reduced RMSE values (difference < 0.01). However, the most significant difference was found in the RRSE values, particularly in the Gharb stations, where the improvements ranged from 6.597 to 9.862%.
In the Loukkos stations, the differences in RRSE values were between 0.910% and 1.886%. These disparities suggest that the Loukkos stations experience higher wind speeds compared to those in the Gharb (Table 2), and the correlation between U2 and RET is slightly weaker in the Loukkos (as shown in Fig. 4). Similarly, when comparing combination 1 (Tmax, Tmin, Rs) with combination 5 (Tmax, Tmin, Rs, RHmean), improvements were observed, although they were lower than in the previous case. The RRSE differences were 3.871–8.923% for Gharb and 0.499–1.537% for Loukkos.
Goyal et al.33 suggested that incorporating either U2, RHmean, or both can improve model performance. Although relative humidity is considered the least significant parameter, its addition to the combination of Tmax, Tmin, and Rs results in a decrease in RMSE values. Besides, the combination 6, which included Tmax, Tmin, RHmean, Rs, and U2 as input meteorological variables, demonstrated the best performance with R2 and RRSE values ranging from 0.955 to 0.979 and from 6.925 to 11.272%, respectively. This improved performance can be attributed to the inclusion of additional variables capture the complex interactions and dynamics involved in estimating RET accurately.
Overall, it’s worth noting that temperature data is a foundational requirement for the models presented in Tables 9 and 10. Without sufficient temperature data, model predictions become unreliable, thereby limiting accuracy of RET estimations. Alternative gridded datasets, such as those from reanalysis products like ERA5-Land, can be used effectively to run models like Penman-Monteith FAO-5686. Nouri et al.86 demonstrated that the ERA5-Land dataset provides reliable RET estimates, especially in data-limited and windy regions. Their findings revealed that while some models, like recalibrated Hargreaves-Samani and Penman-Monteith with localized wind speed, performed well, others struggled due to wind speed variation.
In term of standalone ML models, the testing phase revealed the following ranking for Gharb: LightGBM6 > XGBoost6 > RF6 > LightGBM3, with average RMSE of 0.015–0.017 mm/day. For Loukkos, the ranking was: LightGBM6 > XGBoost6 > LightGBM3 > XGBoost3 with average RMSE of 0.025–0.027 mm/day. These findings support previous studies by Fan et al.40 and Yong et al.34, where LightGBM consistently outperformed other standalone ML models with an RMSE of 0.08–0.58 mm/day and 0.041–0.315 mm/day, respectively. Further, there was a minor difference between LightGBM6 model and XGBoost6 model, with LightGBM6 having a little higher RRSE value (0.34–0.5%). It should be pointed out that while XGBoost had the highest KGE value among all models, our ranking also considered the low RMSE and RSSE values, which placed XGBoost in the second position.
In both study area, the worst performance were obtained by M5P2 < XGBoost2 < RF2 < LightGBM2, where on average KGE < 0.772, R2 < 0.661, RMSE > 0.067 mm/day and RRSE > 27.097% (Table 10). These findings contradict those of Granata38, who claimed that the M5P models performed well while the RF models were the least accurate. From Fig. 6, it can be observed that RF demonstrated strong performance during the training phase across all stations, likely due to its ensemble nature and robustness to noise.
However, LightGBM’s performance was often inferior to that of the RF and XGBoost models. This could be due to its different optimization approach and less effective capturing of complex relationships. This finding aligns with the study by Chia et al.71, which reported that LightGBM exhibited relatively weaker performance than RF and the M5 tree model during the training phase. The authors explained that LightGBM’s leaf-wise optimization required sufficient training data for effective performance71. Interestingly, the situation reversed during the testing phase, demonstrating that LightGBM performed better when properly trained. It is noteworthy to mention that for all ML model, the model’s performance difference was triggered by a difference in the training and testing datasets, such as temporal differences in meteorological data patterns during both phases.
In term of hybrid ML models, the Gharb’s performance ranking was XGBoost-LightGBM6 > RF-LightGBM6 > RF-XGBoost6 > RF-M5P6. The R2 (RRSE) values were 0.979 (7.598), 0.978 (7.813), 0.974 (8.685) and 0.973 (9.211) respectively. For Loukkos, the ranking models were XGBoost-LightGBM6 > RF-LightGBM6 > RF-XGBoost6 > RF-LightGBM3, with R2 (RRSE) values of 0.975 (6.925), 0.974 (7.038), 0.971 (7.619) and 0.969 (7.635) respectively. From Table 10,, the result shows that XGBoost-LightGBM perform the best. This might be because XGBoost is recognized for its regularization approaches and successful handling of complicated relationships, whereas LightGBM excels in efficient computation and handling huge datasets. Consequently, their hybridization could detect a wider range of patterns and improve overall performance. In contrast, the lowest performance occurred in RF-M5P2 < RF-LightGBM2 < XGBoost-LightGBM2 < RF-XGBoost2. Nonetheless, during training phase, RF-XGBoost model gave good performance, as shown in Fig. 6.
When comparing standoalone ML models with hybrid ML models, it was found that XGBoost-LightGBM was highly close to LightGBM in term of all statistical performance indicator. For instance, compared with XGBoost-LightGBM6, RRSE of LightGBM6, XGBoost6 and RF6 was in difference of 0.318% (0.096%), 0.655% (0.592%), 2.468% (1.258%), respectively for Gharb (Loukkos). This suggests that while the hybridization approach slightly improved model performance, the improvement was not significant. It’s worth noting that factors such as computational efficiency and implementation flexibility might impact the selection between standalone ML models and hybrid models. Therefore, LightGBM and XGBoost are the recommended ML model for estimating RET in our research study.
Collectively, these studies highlight the effectiveness of standalone and hybrid machine learning models for improved accuracy in RET estimation33,34,38,40,51,54,55,56,71. Similary, our findings indicate that standalone ML models exhibited better performance and accuracy compared to the empirical models for estimating daily RET at Gharb and Loukkos stations, using Tmax, Tmin, RHmean, Rs, U2 as input variables.
Research’s limitations
This research has some notable limitations that need to be addressed. The study’s period is relatively limited, spanning from 2011 to 2017. Hence, expanding the study’s period would provide a more comprehensive understanding of the ML models’ performance across diverse climatic fluctuations over a longer period. Moreover, the investigation of current study limited from subhumid to semi-arid climatic conditions only, further investigation should incorporate more climatic conditions to examine climatic variability thoroughly. Additionally, the accuracy and performance of ML and hybrid models heavily relied on the availability and quality of meteorological data, which could impact their effectiveness in areas with limited or incomplete data. To address this issue, future studies could utilise reanalysis data products, such as ERA5 ERA5-Land, and MERRA-2, which provide continuous, high-resolution data across extended temporal and spatial scales. Such data sources could prove invaluable in regions where in situ measurements are scarce. Moreover, ML models lack physical mechanisms, making it challenging to comprehend their inner workings and create accurate models without knowledge of functional specifications74. The issue of over-fitting and under-fitting during the training/testing phases of ML models, due to the dataset random division, may also affect model accuracy. Employing advanced validation techniques could help enhance model reliability and generalizability.
Conclusion
Estimating reference evapotranspiration (RET) accurately has remained a major focus across a wide range of applications, including water resource management, agricultural water requirements, irrigation scheduling, and climate change assessments. In this research, we investigated the ability of four machine learning (ML) models, and their hybrid models along with eight empirical models (grouped in mass transfer-based, temperature-based, radiation-based and combination models) to estimate daily RET in subhumid and semi-arid irrigated perimeters in Morocco. For six input combinations, RF, M5P, XGBoost, LightGBM, RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM were all thoroughly evaluated. The results showed that combination models (particularly, Valiantzas 2013 (VAL2013b)) were the best empirical models and that temperature-based models generally outperformed radiation-based models. Compared with empirical models, ML models gave more accurate RET estimation, and the hybrid XGBoost-LightGBM models provided the highest statistical indicator values (KGE, R2, RMSE, RRSE). Interestingly, the standalone ML model LightGBM also demonstrated acceptable accuracy across all stations and input combinations, indicating its potential as a promising model for RET estimation with limited data. Moreover, the XGBoost model is also an intriguing alternative ML model. Overall, models with the input variables Tmax, Tmin, RHmean, Rs, and U2 performed better for daily RET estimation.
The current research highlights the ML and hybrid models’ efficiency in estimating daily RET within two irrigated Moroccan perimeters. Ultimately, further investigations could explore additional ML algorithms, hybrid model configurations, and their relevance to long-term datasets at different time scales and various climate regions.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Satpathi, A. et al. Estimation of crop evapotranspiration using statistical and machine learning techniques with limited meteorological data: A case study in Udham Singh Nagar, India. Theor. Appl. Climatol. https://doi.org/10.1007/s00704-024-04953-3 (2024).
Vishwakarma, D. K. et al. Methods to estimate evapotranspiration in humid and subtropical climate conditions. Agric. Water Manag. 261, 107378 (2022).
Mirzania, E., Vishwakarma, D. K., Bui, Q. A. T., Band, S. S. & Dehghani, R. A novel hybrid AIG-SVR model for estimating daily reference evapotranspiration. Arab. J. Geosci. 16, 301 (2023).
Dingman, S. L. Physical Hydrology (Waveland, 2015).
Wanniarachchi, S. & Sarukkalige, R. A. Review on evapotranspiration estimation in agricultural water management: Past, present, and future. Hydrology 9, 123 (2022).
Jerin, J. N., Islam, A. R. M. T., Al Mamun, M. A., Mozahid, M. N. & Ibrahim, S. M. Climate change effects on potential evapotranspiration in Bangladesh. Arab. J. Geosci. 14 (2021).
Dinpashoh, Y., Jahanbakhsh-Asl, S., Rasouli, A. A., Foroughi, M. & Singh, V. P. Impact of climate change on potential evapotranspiration (case study: west and NW of Iran). Theor. Appl. Climatol. 136, 185–201 (2019).
Haider, S. et al. Simulation of the potential impacts of projected climate and land use change on runoff under CMIP6 scenarios. Water 15, 3421 (2023).
Nouri, M. Drought assessment using gridded data sources in data-poor areas with different aridity conditions. Water Resour. Manag. 37, 4327–4343 (2023).
Noguera, I., Domínguez-Castro, F. & Vicente-Serrano, S. M. Flash drought response to precipitation and atmospheric evaporative demand in Spain. Atmosphere (Basel). 12, 165 (2021).
Herold, N., Kala, J. & Alexander, L. V. The influence of soil moisture deficits on Australian heatwaves. Environ. Res. Lett. 11, 064003 (2016).
Raza, A. et al. Misconceptions of reference and potential evapotranspiration: A PRISMA-guided comprehensive review. Hydrology 9, 153 (2022).
Monteith, J. L. Evaporation and Environment the State and Movement of Water in Living Organisms. In Symp. 19 Soc. Exp. Bid (ed. Fogg, G.E.) (Cambridge University Press, 1965).
Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop Evapotranspiration: Guidelines for Computing Crop Water Requirements. Irrigation and Drainage Paper No 56. Food and Agriculture Organization of the United Nations (FAO), Rome, Italy. https://doi.org/10.3390/agronomy9100614 (1998).
Allen, R. G. et al. The ASCE Standardized Reference Evapotranspiration Equation (2005).
Er-Raki, S. et al. Assessment of reference evapotranspiration methods in semi-arid regions: Can weather forecast data be used as alternate of ground meteorological parameters? J. Arid Environ. 74, 1587–1596 (2010).
Hamed, M. M., Khan, N., Muhammad, M. K. I. & Shahid, S. Ranking of empirical evapotranspiration models in different climate zones of Pakistan. Land 11, 2168 (2022).
Almorox, J., Quej, V. H. & Martí, P. Global performance ranking of temperature-based approaches for evapotranspiration estimation considering Köppen climate classes. J. Hydrol. 528, 514–522 (2015).
Valipour, M. Retracted: Comparative evaluation of radiation-based methods for estimation of potential evapotranspiration. J. Hydrol. Eng. 20, 4014068 (2015).
Zeggaf, T. A., El Mourid, M., Karrou, M. & Steduto, P. Comparaison des méthodes d’estimation de l’évapotranspiration de référence dans la région du Tadla-Maroc. AL AWAMIA. 100, 73–84 (1999).
Dai, L. et al. Comparison of fourteen reference evapotranspiration models with lysimeter measurements at a site in the humid Alpine Meadow, northeastern Qinghai-Tibetan Plateau. Front. Plant. Sci. 13 (2022).
Bouhlassa, S. & Paré, S. Évapotranspiration De référence dans la région aride de tafilalet Au sud-est du Maroc. Afr. J. Environ. Assess. Manag. 11, 1–16 (2006).
Hadria, R., Benabdelouhab, T., Lionboui, H. & Salhi, A. Comparative assessment of different reference evapotranspiration models towards a fit calibration for arid and semi-arid areas. J. Arid Environ. 184, 104318 (2021).
Liou, Y. A. & Kar, S. K. Evapotranspiration estimation with remote sensing and various surface energy balance algorithms—A review. Energies 7, 2821–2849 (2014).
Elfarkh, J. et al. Evapotranspiration estimates in a traditional irrigated area in semi-arid Mediterranean. Comparison of four remote sensing-based models. Agric. Water Manag. 270, 107728 (2022).
El-Rawy, M. et al. An Integrated GIS and machine-learning technique for groundwater quality assessment and prediction in southern Saudi Arabia. Water 15, 2448 (2023).
Alshehri, F. & Rahman, A. Coupling machine and deep learning with explainable Artificial intelligence for improving prediction of groundwater quality and decision-making in Arid Region, Saudi Arabia. Water 15, 2298 (2023).
Abd El-Hamid, H. T. & Alshehri, F. Integrated remote sensing data and machine learning for drought prediction in eastern Saudi Arabia. J. Coast Conserv. 27, 48 (2023).
Nhu, V. H. et al. GIS-based gully erosion susceptibility mapping: A comparison of computational ensemble data mining models. Appl. Sci. 10. (2039). https://doi.org/10.3390/app10062039 (2020).
Prodhan, F. A., Zhang, J., Hasan, S. S., Sharma, P., Mohana, H. P. & T. P. & A review of machine learning methods for drought hazard monitoring and forecasting: Current research trends, challenges, and future research directions. Environ. Model. Softw. 149, 105327 (2022).
Pham, Q. B. et al. Groundwater level prediction using machine learning algorithms in a drought-prone area. Neural Comput. Appl. 34, 10751–10773 (2022).
Raza, A. et al. Performance evaluation of five machine learning algorithms for estimating reference evapotranspiration in an arid climate. Water 15, 3822 (2023).
Goyal, P., Kumar, S. & Sharda, R. A review of the artificial intelligence (AI) based techniques for estimating reference evapotranspiration: current trends and future perspectives. Comput. Electron. Agric. 209, 107836 (2023).
Yong, S. L. S., Ng, J. L., Huang, Y. F. & Ang, C. K. Estimation of reference crop evapotranspiration with three different machine learning models and limited meteorological variables. Agronomy 13, 1048 (2023).
Kisi, O. The potential of different ANN techniques in evapotranspiration modelling. Hydrol. Process. 22, 2449–2460 (2008).
Nourani, V., Elkiran, G. & Abdullahi, J. Multi-station artificial intelligence based ensemble modeling of reference evapotranspiration using pan evaporation measurements. J. Hydrol. 577, 123958 (2019).
Singh, A. K. et al. An integrated statistical-machine learning approach for runoff prediction. Sustainability 14, 8209 (2022).
Granata, F. Evapotranspiration evaluation models based on machine learning algorithms—A comparative study. Agric. Water Manag. 217, 303–315 (2019).
Vishwakarma, D. K. et al. Pre- and post-dam river water temperature alteration prediction using advanced machine learning models. Environ. Sci. Pollut Res. https://doi.org/10.1007/s11356-022-21596-x (2022).
Fan, J. et al. Light gradient boosting machine: An efficient soft computing model for estimating daily reference evapotranspiration with local and external meteorological data. Agric. Water Manag. 225, 105758 (2019).
Elbeltagi, A. et al. Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models. Environ. Sci. Pollut Res. 30, 43183–43202 (2023).
Achite, M. et al. Performance of machine learning techniques for meteorological drought forecasting in the Wadi Mina Basin, Algeria. Water 15, 765 (2023).
Kumar, D. et al. Multi-ahead electrical conductivity forecasting of surface water based on machine learning algorithms. Appl. Water Sci. 13, 192 (2023).
Elbeltagi, A. et al. Forecasting long-series daily reference evapotranspiration based on best subset regression and machine learning in Egypt. Water 15, 1149 (2023).
Kushwaha, N. L. et al. Data intelligence model and meta-heuristic algorithms-based pan evaporation modelling in two different agro-climatic zones: A case study from Northern India. Atmosphere (Basel) 12, 1654 (2021).
Fan, J. et al. Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric. Meteorol. 263, 225–241 (2018).
Torsoni, G. B. et al. Soybean yield prediction by machine learning and climate. Theor. Appl. Climatol. 151, 1709–1725 (2023).
Masood, A. et al. Improving PM2.5 prediction in New Delhi using a hybrid extreme learning machine coupled with snake optimization algorithm. Sci. Rep. 13, 21057 (2023).
Shahhosseini, M., Hu, G., Huber, I. & Archontoulis, S. V. Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt. Sci. Rep. 11, 1606 (2021).
González, S., García, S., Del Ser, J., Rokach, L. & Herrera, F. A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and opportunities. Inf. Fusion. 64, 205–237 (2020).
Lachgar, N., Berrajaa, A., Essabbar, M. & Saikouk, H. Machine learning approach for reference evapotranspiration estimation in the Region of Fes, Morocco. In International Conference on Digital Technologies and Applications. 105–113 (Springer, 2023).
Nagalla, R., Pothuganti, P. & Pawar, D. S. Analyzing gap acceptance behavior at unsignalized intersections using support vector machines, decision tree and random forests. Proc. Comput. Sci. 109, 474–481 (2017).
Pal, M. & Mather, P. M. An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens. Environ. 86, 554–565 (2003).
Elbeltagi, A. et al. Data intelligence and hybrid metaheuristic algorithms-based estimation of reference evapotranspiration. Appl. Water Sci. 12, 152 (2022).
El Hachimi, C., Salwa, B., Saïd, K. & Abdelghani, C. Early estimation of daily reference evapotranspiration using machine learning techniques for efficient management of irrigation water. J. Phys. Conf. Ser. 2224, 12006 (IOP Publishing, 2022).
Hachimi, C. et al. Smart weather data management based on artificial intelligence and big data analytics for precision agriculture. Agriculture 13, 95 (2022).
Van Buuren, S. & Groothuis-Oudshoorn, K. Mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011).
Acharki, S., Amharref, M., El Halimi, R. & Bernoussi, A. S. Évaluation par approche statistique de l’impact des changements climatiques sur les ressources en eau: Application Au périmètre Du Gharb (Maroc). Rev. Des. Sci. l’Eau/J. Water Sci. 32, 291–315 (2019).
Dalton, J. Experimental essays on the constitution of mixed gases; on the force of stream or vapor from water and other liquids, both in a Torricellian vacuum and in air; on evaporation; and on the expansion of gases by heat. Proc. Manch. Lit. Philos. Soc. 5, 536–602 (1802).
Trabert, W. Neue Beobachtungen über Verdampfungsgeschwindigkeiten. Meteorol. Z. 13, 261–263 (1896).
Hargreaves, G. H. Moisture availability and crop production. Trans. ASAE. 18, 980–984 (1975).
Hargreaves, G. H. & Samani, Z. A. Reference crop evapotranspiration from temperature. Appl. Eng. Agric. 1, 96–99 (1985).
Allen, R. G. & Pruitt, W. O. Rational use of the FAO Blaney-Criddle formula. J. Irrig. Drain. Eng. 112, 139–155 (1986).
Irmak, S. & Haman, D. Z. Evapotranspiration: Potential or reference? Agric. Eng. Fla. Coop. Ext. Serv. Inst. Food Agric. Sci. Univ. Fla. US ABE. 343, 1–3 (2003).
Valiantzas, J. D. Simple: ET 0 forms of Penman’s equation without wind and/or humidity data. II: comparisons with reduced set-FAO and other methodologies. J. Irrig. Drain. Eng. 139, 9–19 (2013).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Quinlan, J. R. Learning with continuous classes. In 5th Australian Joint Conference on Artificial Intelligence. Vol. 92. 343–348 (World Scientific, 1992).
Solomatine, D. P. & Xue, Y. M5 model trees and neural networks: Application to flood forecasting in the Upper Reach of the Huai River in China. J. Hydrol. Eng. 9, 491–501 (2004).
Chen, T. et al. Xgboost: Extreme Gradient Boosting. R Package Version 0.4-2. Vol. 1. 1–4 (2015).
Ke, G., Ye, Q., Chen, W., Liu, T. Y. & LightGBM. Highly Effic. Gradient Boost Decis. Tree 30 (2016).
Chia, M. Y., Huang, Y. F., Koo, C. H. & Fung, K. F. Recent advances in evapotranspiration estimation using artificial intelligence approaches with a focus on hybridization techniques—A review. Agronomy 10. https://doi.org/10.3390/agronomy10010101 (2020).
Raza, A. et al. Use of gene expression programming to predict reference evapotranspiration in different climatic conditions. Appl. Water Sci. 14, 152 (2024).
Raza, A. et al. Modelling reference evapotranspiration using principal component analysis and machine learning methods under different climatic environments. Irrig. Drain. 72, 945–970 (2023).
Wang, J. et al. Development of monthly reference evapotranspiration machine learning models and mapping of Pakistan—A comparative study. Water 14, 1666 (2022).
Grace, B. & Quick, B. A comparison of methods for the calculation of potential evapotranspiration under the windy semi-arid conditions of southern Alberta. Can. Water Resour. J. 13, 9–19 (1988).
Pandey, P. K., Dabral, P. P. & Pandey, V. Evaluation of reference evapotranspiration methods for the northeastern region of India. Int. Soil. Water Conserv. Res. 4, 52–63 (2016).
Kisi, O. Comparison of different empirical methods for estimating daily reference evapotranspiration in Mediterranean climate. J. Irrig. Drain. Eng. 140, 4013002 (2014).
Brutsaert, W. & Chen, D. Diurnal variation of surface fluxes during thorough drying (or severe drought) of natural Prairie. Water Resour. Res. 32, 2013–2019 (1996).
Irmak, S., Allen, R. G. & Whitty, E. B. Daily grass and Alfalfa-reference evapotranspiration estimates and alfalfa-to-grass evapotranspiration ratios in Florida. J. Irrig. Drain. Eng. 129, 360–370 (2003).
Arellano, M. G. & Irmak, S. Reference (potential) evapotranspiration. I: Comparison of temperature, radiation, and combination-based energy balance equations in humid, subhumid, arid, semiarid, and Mediterranean-type climates. J. Irrig. Drain. Eng. 142, 4015065 (2016).
Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop evapotranspiration-guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao Rome. 300, D05109 (1998).
Droogers, P. & Allen, R. G. Estimating reference evapotranspiration under inaccurate data conditions. Irrig. Drain. Syst. 16, 33–45 (2002).
Raza, A. et al. Comparative assessment of reference evapotranspiration estimation using conventional method and machine learning algorithms in four climatic regions. Pure Appl. Geophys. 177, 4479–4508 (2020).
Raza, A. et al. Application of non-conventional soft computing approaches for estimation of reference evapotranspiration in various climatic regions. Theor. Appl. Climatol. 139, 1459–1477 (2020).
Raza, A. et al. Comparative study of powerful predictive modeling techniques for modeling monthly reference evapotranspiration in various climatic regions. Fresenius Environ. Bull. 30, 7490–7513 (2021).
Nouri, M., Ebrahimipak, N. A. & Hosseini, S. N. Estimating reference evapotranspiration for water-limited windy areas under data scarcity. Theor. Appl. Climatol. 150, 593–611 (2022).
Pelosi, A., Terribile, F., D’Urso, G. & Chirico, G. Comparison of ERA5-Land and UERRA MESCAN-SURFEX reanalysis data with spatially interpolated weather observations for the regional assessment of reference evapotranspiration. Water 12, 1669 (2020).
Allen, R. G. et al. Conditioning point and gridded weather data under aridity conditions for calculation of reference evapotranspiration. Agric. Water Manag. 245, 106531 (2021).
Acknowledgements
The authors extend their appreciation to the Deanship of Scientific Research, King Saud University for funding through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water.
Funding
Open access funding provided by Lulea University of Technology. This research was funded by the Deanship of Scientific Research, King Saud University through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water.
Author information
Authors and Affiliations
Contributions
Conceptualization, supervision, methodology, formal analysis, writing—original draft preparation, writing—review and editing, S.A., A.R., D.K.V., M.A.; data curation, project administration, investigation, writing—review and editing, A.S.B., S.K.S., N.A-A., A.Z.D., A.A.A., M.A.M. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Acharki, S., Raza, A., Vishwakarma, D.K. et al. Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates. Sci Rep 15, 2542 (2025). https://doi.org/10.1038/s41598-024-83859-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-83859-6
Keywords
This article is cited by
-
Air temperature estimation and modeling using data driven techniques based on best subset regression model in Egypt
Scientific Reports (2025)
-
An ensemble-driven machine learning framework for enhanced water quality classification
Discover Sustainability (2025)
-
Evaluating machine learning models and feature selection for reference evapotranspiration estimation in semi-arid regions: a case study in Doukkala, Morocco
Theoretical and Applied Climatology (2025)