Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates

Acharki, Siham; Raza, Ali; Vishwakarma, Dinesh Kumar; Amharref, Mina; Bernoussi, Abdes Samed; Singh, Sudhir Kumar; Al-Ansari, Nadhir; Dewidar, Ahmed Z.; Al-Othman, Ahmed A.; Mattar, Mohamed A.

doi:10.1038/s41598-024-83859-6

Download PDF

Article
Open access
Published: 20 January 2025

Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates

Scientific Reports volume 15, Article number: 2542 (2025) Cite this article

2834 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Agroecology

Abstract

Improving the accuracy of reference evapotranspiration (RET) estimation is essential for effective water resource management, irrigation planning, and climate change assessments in agricultural systems. The FAO-56 Penman-Monteith (PM-FAO56) model, a widely endorsed approach for RET estimation, often encounters limitations due to the lack of complete meteorological data. This study evaluates the performance of eight empirical models and four machine learning (ML) models, along with their hybrid counterparts, in estimating daily RET within the Gharb and Loukkos irrigated perimeters in Morocco. The ML models examined include Random Forest (RF), M5 Pruned (M5P), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), with hybrid combinations of RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM. Six input combinations were created, utilizing T_max, T_min, RH_mean, R_s, and U₂, with the PM-FAO56 model serving as the benchmark. Model performance was assessed using four statistical indicators: Kling-Gupta efficiency index (KGE), coefficient of determination (R²), mean squared error (RMSE), and relative root squared error (RRSE). Results indicate that the Valiantzas 2013 (VAL2013b) model outperformed other empirical models across all stations, achieving high KGE and R² values (0.95–0.97) and low RMSE (0.32–0.35 mm/day) and RRSE (8.14–10.30%). The XGBoost-LightGBM and RF-LightGBM hybrid models exhibited the highest accuracy (average RMSE of 0.015–0.097 mm/day), underscoring the potential of hybrid ML models for RET estimation in subhumid and semi-arid regions, thereby enhancing water resource management and irrigation scheduling.

Predictive framework of vegetation resistance in channel flow

Article Open access 04 March 2025

Assessing climate and land use impacts on surface water yield using remote sensing and machine learning

Article Open access 27 May 2025

Development of a novel modeling framework based on weighted kernel extreme learning machine and ridge regression for streamflow forecasting

Article Open access 28 December 2024

Introduction

Reference evapotranspiration (RET) is a crucial hydrological cycle element, responsible for a significant portion of water loss from continental surfaces^1,2,3. It accounts for approximately 62% of the rainfall contribution, equivalent to around 73,000 km³ per year⁴. RET serves as a powerful indicator for climate change studies^5,6,7,8 and plays a vital role in many fields, including hydrology, agriculture, ecology, and water resource management. Notably, RET can also be instrumental in addressing natural hazards like dry spells, heat waves, and flash droughts^9,10,11. While direct estimation methods like latent heat flux or real evapotranspiration from climate models offer accuracy, they often require complex modelling and high-resolution data, posing practical challenges. The intricate nature of modelling the soil-vegetation-atmosphere interaction further complicates accurate RET estimation. Therefore, understanding the processes and assessing RET is crucial for effective water resource management and planning, especially in semi-arid and dry locations where water availability is restricted. Raza et al.¹² presented potential evapotranspiration (PET) and RET differentiation and categorized their empirical equations based on different meteorological factors.

The Penman-Monteith technique (PM-FAO56), a revised version of the Penman Equation ¹³, is widely considered as the most accurate method for estimating RET, and has been endorsed by FAO¹⁴ and the Task Committee on Standardization¹⁵. Despite its global acceptance, PM-FAO56 relies dependent on meteorological parameters such as wind speed, relative humidity, and solar radiation, which may be unavailable in certain weather stations, particularly in developing countries. Further, this limitation hinders its application in regions with limited weather data. Consequently, researchers have sought alternative methods for estimating RET through comparison of PM-FAO56 method with other empirical methods or methods development through meteorological data obtained from remote sensing.

Numerous researches have been conducted to investigate the effectiveness of different RET methods across different regions and climatic conditions. These studies compare empirical models with PM-FAO56 using different approaches^16,17,18,19, including (a) temperature-based methods, (b) radiation-based methods, (c) mass transfer based methods, (d) methods combining radiation and temperature, and (e) methods integrating radiation, temperature, mass transfer, and other variables. For example, Almorox et al.¹⁸ assessed eleven temperature-based potential evapotranspiration (PET) estimation methods and determined that the Hargreaves and Samani model exhibited the most accurate performance on a global scale across diverse climatic regions. Additionally, comparisons have been made between empirical models and data obtained from lysimeters^20,21. In Morocco, few scientists have investigated RET performance with existing empirical models/methods^16,20,22,23. Er-raki et al.¹⁶ evaluated three empirical RET estimation methods for Tensift Basin (Morocco’s center) and Yaqui Valley (Northwest Mexico) during 2003–2004. In a semi-arid region, they suggest using Hargreaves and Samani model without calibration (as long as the wind remains low). They suggested that calibration is required for both Priestley-Taylor and Makkink parameters, particularly for dry periods. Several methods were examined by Bouhlassa and Paré²² to choose an appropriate solution to PM-FAO56 equation for 1989–2001 in Tafilalet, an arid region in southeastern Morocco. Their findings indicate that Jensen-Haise and Thornthwaite methods best-reflected evapotranspiration obtained by Penman-Monteith-FAO method. Similary, Hadria et al.²³ conducted a calibration and validation analysis of five temperature-based empirical models in 22 meteorological stations across Morocco. Their results demonstrated that Dorji’s estimate outperformed the other empirical models and they introduced a new fit version called RET-Hadria, specifically designed for assessing RET in arid and semi-arid areas. Zeggaf²⁰ compared lysimeter results with various empirical methods for Ouled Gnaou (Morocco’s semi-arid central region) during 1975, 1977, and 1978 years. They found that Priestley-Taylor method gave better results, followed by Penman-Monteith method. Besides, researchers like Liou and Kar²⁴ and Elfarkh et al.²⁵ have used remote sensing (RS) images and processes in geographical information system (GIS) to enhance evapotranspiration estimation.

In recent years, machine learning models have garnered considerable attention in various fields^{26,27,28,29,30,31,32}. For instance, a model’s capacity to represent intricate nonlinear relationships has a significant impact on RET estimation. Goyal et al.³³ highlighted the promising findings of ML models in various climates and environments, emphasizing their ability to improve accuracy above standard empirical models. Besides, researchers have applied various ML models, such as artificial neural networks (ANN)^34,35, support vector regression (SVR)^36,37, M5 model tree^38,39, random forests (RF)^40,41,42,43, reduced error pruning tree (REPTree)^44,45, extreme gradient boosting (XGBoost)^46,47,48, light gradient boosting machine (LightGBM)^34,49,50 and decision trees (DT)^51,52,53 to estimate daily RET uing restricted meteorological data. For instance, Granata³⁸ conducted a comparative investigation of M5P Regression Tree, Bagging, RF and SVR with differing input combination (T_mean, RH_mean, R_s, U₂) for RET estimation in a humid subtropical climate region of Central Florida. They concluded that the M5P models exhibited good performance, while RF proved to be the least accurate. In China, Fan et al.⁴⁰ examined limited meteorological data to investigate four empirical models and three ML models (LightGBM, M5Tree, and RF) to estimate daily RET. Their findings suggested that LightGBM outperformed the other models, with input combinations comprising T_max, T_min, U₂, R_s, and RH_mean. Similary, Fan et al.⁴⁶ compared six ML models, including SVM, gradient boosting decision tree (GBDT), M5Tree, XGBoost, ELM and RF, and using meteorological data from eight Chinese stations. Their findings revealed that the GBDT and XGBoost models exhibited performance on par with the SVM and ELM models, while offering advantages in terms of simplicity, accuracy, stability, and reduced computational costs, making them recommended options for daily RET estimation. Additionally, Yong et al.³⁴ evaluated performance of LightGBM, ANN and decision forest regression (DFR) in five Malaysian meteorological stations and reported that LightGBM and ANN have proven stable and accurate in determining daily RET. It is worth mentioning that the ML models’ performance in daily RET estimation is influenced by various factors, including the selection of input climatic variables, model structure, basic parameters, and performance criteria. Careful consideration of these factors and the correlation between individual input variables and RET is crucial for optimizing the ML models’ accuracy and efficiency. Additionally, effective tuning of ML model parameters further enhances their performance and efficiency for correct estimation^{33,34,35,36,38,40,44,46}.

Nowadays, recent studies^33,54 in RET estimation have emphasized the use of hybrid models, combining multiple ML algorithms through blending or stacking techniques, to address the challenges posed by highly complex meteorological data. Goyal et al.³³ noted that standalone ML models may not achieve sufficient accuracy in such cases. For example, Elbeltagi et al.⁵⁴ assessed five hybrid models (additive regression (AR) with bagging, M5tree, random subspace, REPTree, ANN, and RF) for a semi-arid area in Pakistan. They concluded that AR-M5tree model is the most appropriate hybrid model for estimating RET. Hence, this highlights the effectiveness of hybrid models in improving performance while maintaining interpretability. However, their RET estimation’s utilization is currently limited, and the available information on this subject is incomplete and fragmented. There is a need for further research and investigation to fully explore the hybrid models’ potential and effectiveness in addressing evapotranspiration estimation challenges.

In Morocco, few studies^51,55,56 have focused on evaluating ML models for estimating RET. Recently, Lachgar et al.⁵¹ studied the performance of five ML models, like RF, Linear Regression (LR), SVR, k-Nearest Neighbor (k-NN), and DT for estimating RET between 2011 and 2019 in Fez. They highlighted the ML models’ ability to capture the variance in RET. In Marrakesh, El Hachimi et al.⁵⁵ investigated the performance of SVM, RF, DT, k-NN, LR, XGboost for estimating RET during 2013–2020 and found that XGboost surpasses the other models, followed by RF. To our best knowledge, there is currently no existing comparative research on the use of hybrid models for estimating daily RET in Morocco. Thus, the novelty of this research lies in the comprehensive comparison of empirical models, ML models, and their hybrid models for estimating daily RET in subhumid and semi-arid climates.

The primary objectives of the present research are as follows: (i) estimating RET using eight empirical models, namely, Valiantzas 2013 (VAL2013a and VAL2013b); Dalton 1802 (Dal1802); Trabert 1896 (Trab1896) Hargreaves, 1975 (Harg1975); Irmak and Haman, 2003 (Irs2003); Hargreaves and Samani, 1985 (HargS1985); and Allen and Pruitt, 1986 (BC1986) and comparing their performance with standard FAO-PM56 method, (ii) development of ML models (RF, M5P, XGBoost and LightGBM) and their hybrid models (RF-M5P, RF-XGBoost, RF-LightGBM and XGBoost-LightGBM) using different meteorological input combinations (based on maximum and minimum air temperature (T_max and T_min), relative humidity (RH_mean), solar radiation (R_s), and wind speed (U₂)), (iii) evaluating performance of developed ML and hybrid models using different statistical indices to determine the best for RET estimation. The findings of this research will contribute to assessing the performance and suitability of selected machine learning models for reference evapotranspiration (RET) estimation under the specific climatic conditions of Morocco. This will support improved planning for water resource management and irrigation practices.

Materials and methods

Study area and data collection

Figure 1 illustrates the research region’s geographic distribution. This study region consists of two perimeters which are among Morocco’s most important irrigated perimeters. It covers an area of 6,007.14 km², which is 57.2% for Gharb and 42.8% for Loukkos perimeter. It has a Mediterranean climate (Csa) with an oceanic impact, according to Köppen’s classification. The differences between the two perimeters are shown in Table 1.

Observed daily minimum and maximum daily air temperature (T_min and T_max), minimum and maximum relative humidity (RH_max and RH_min), solar radiation (R_s), mean wind speed at 2 m height (U₂) and precipitation (P) were obtained from five weather stations to estimate reference evapotranspiration. These stations were strategically chosen, with three situated within the Gharb perimeter, representing three out of the five districts, and the remaining two located within the Loukkos perimeter (Figs. 1 and 2). The selection criteria considered the availability of reliable data and practical constraints, allowing for a focused analysis on representative stations within the study area. Meteorological data were provided by two Regional Offices for Agricultural Development, ORMVAG and ORMVAL, with varying data collection periods covering both perimeters. Detailed information on the studied station locations, statistical characteristics of observed meteorological data, and missing data percentage are described in Table 2.

Table 1 Difference between two perimeters (Gharb and Loukkos).

Full size table

Table 2 Weather station’s geographical locations and meteorological data annual average values.

Full size table

Missing data imputation

In general, the database contains missing data due to weather station malfunctions. To address this issue, we employed either deletion or imputation. For the imputation process, we utilized the Multivariate Chain Equations Imputation Method (MICE) developed by Van Buuren and Groothuis-Oudshoorn⁵⁷. This approach enabled us to assess whether imputing missing data would impact the selection of estimation methods. Otherwise, the other missing data was completely removed. The MICE method comprises three main steps. Firstly, a regression model is chosen for the variable being studied. Then, missing data values are iteratively assigned random values based on observed data. Finally, imputed values are estimated using the regression coefficients obtained for each dataset. We opted for this method due to its practicality and effectiveness in analysing precipitation data⁵⁸.

Table 2 reveals that the solar radiation series for the Sidi Allal Tazi station had the highest proportion of missing data, accounting for 10.77% of the observed series. Through the MICE method, we imputed 9.4% of the solar radiation series for the Sidi Allal Tazi station. Overall, the deleted data represented between 1.37% and 4.48% of the database for each station studied.

Estimating evapotranspiration via FAO-56 Penman-Monteith and empirical models

A series of models were designed by researchers to estimate reference evapotranspiration^59,60,61,62. In this research, eight empirical models, divided into four groups (combination, mass transfer, radiation, and temperature), were selected, as indicated in Table 3. Model selection focused on variables’ availability, usage extent, and simplicity. Subsequently, we compared these models with Penman-Monteith model [PM-FAO56, Eq. 1¹⁴]. Table 4 summarizes the climate parameters for each empirical model.

Table 3 FAO-56 Penman-Monteith and empirical equations used in this research.

Full size table

Table 4 Climatic parameters required by used models.

Full size table

Machine learning models and hybrid models description

Different equations and models to estimate daily RET were compared. Linear Regression (LR) was used to compare eight empirical equations with the PMFAO56 model. Additionally, we explored the impact of various meteorological variables on RET estimation using ML algorithms like Random Forest (RF), M5 Pruned (M5P), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM). Additionally, we combined these models to create hybrid models like RF-M5P, RF-XGBoost, RF-LightGBM and XGBoost-LightGBM.

Linear regression (LR)

Linear regression is a well-known approach for modelling a dependent variable’s value via one or more independent variables. In this research, The LR equation is written as follows.

$$\:{\text{R}\text{E}\text{T}\:}_{\text{c}\text{a}{\text{l}}_{\text{i}}}=\text{a}\times\:{\text{R}\text{E}\text{T}\:}_{\text{P}{\text{M}}_{\text{i}}}+\text{b}$$

(10)

$\:{\text{R}\text{E}\text{T}\:}_{\text{P}{\text{M}}_{\text{i}}}$ represents observed values (estimated by PM-FAO 56); $\:{\text{R}\text{E}\text{T}\:}_{\text{c}\text{a}{\text{l}}_{\text{i}}}$ represents values estimated by different empirical models.

Random forests (RF)

RF model, introduced by Breiman⁶⁶, is an ensemble approach that integrates numerous decision trees to create a powerful prediction model. It is widely utilized for regression and forecasting tasks due to its ability to capture complex, non-linear interactions between features. It works by generating a collection of random binary trees through bootstrapping, where each tree is trained on a randomly sampled subset of observations from the training dataset. The remaining data, known to as “out-of-bag” (OOB) data, is used for evaluating the model’s performance. RF exhibits several advantages, including strong generalization capabilities, robustness to outliers, and the ability to tune hyperparameters easily. By aggregating the results from individual trees, RF produces a final prediction, often using methods like majority voting. This ensemble approach helps mitigate overfitting and minimize variance by training on various data samples. To further control overfitting, the minimum leaf size parameter can be adjusted, requiring a minimum number of observations to generate child nodes.

M5 pruned (M5P)

The M5P model, also known as M5 Pruned, is a decision tree algorithm introduced by Quinlan⁶⁷. This model runs in two steps, providing a novel approach to regression challenges. It separates the input data into subgroups in the first phase and applies linear regression models to each subset based on their partial attribute values. This enables the model to record variable relationships and construct regression equations at each node. Furthermore, the M5P model is typically built as a tree, starting with a root node and branching out into subnodes that reflect the regression equations. It can easily handle huge datasets with high dimensionality⁶⁸, making it useful for investigating complicated systems like evapotranspiration estimates. Additionally, it does not require parameter adjustment, which simplifies the process.

Extreme gradient boosting (XGBoost)

XGBoost model is an improved Gradient Boosting Machines (GBMs) version presented by Chen and Guestrin⁶⁹ that expands on the notion of “boosting” weak learners. Through additive training procedures, it combines numerous weak models to generate a powerful learner. By simplifying goal functions and providing parallel calculations during training, XGBoost tries to minimize overfitting while decreasing computational costs. It offers a scalable and effective solution for both regression and classification workloads²⁷. With features such as distributed computing, pruning strategies, and management of missing data, the approach is meant to maximize speed. Because of its efficacy, versatility, and capacity to handle big datasets, XGBoost has become a popular choice.

Light gradient boosting machine (LightGBM)

LightGBM model is a gradient-boosting architecture that improves model performance while using less memory than conventional models. LightGBM is distinguished by its novel leaf-wise development, which develops trees by focusing on individual leaves rather than sequentially expanding the branches⁷⁰. This approach enables efficient tree formation and increased computing performance. LightGBM also incorporates two approaches: gradient-based one-sided sampling and exclusive feature bundling (EFB). These approaches improve model performance by allowing for more efficient feature sampling and grouping. Overall, LightGBM is an efficient technique for dealing with massive datasets and producing accurate predictions while conserving resources. One of the most essential variables impacting the accuracy of a given model is the selection of proper hyper-parameters. The hyper-parameters employed in this study, shown in Table 5, were chosen through grid search optimization, supported by prior ___domain knowledge. This approach enabled systematic evaluation of parameter values to minimize prediction error and optimize performance. The resulting tuning achieved a balance between computational efficiency and accuracy across all models.

Table 5 ML algorithm hyper-parameters.

Full size table

Weighted hybridization for ML algorithms

Weighted hybridization is an approach that combines different algorithms to increase the accuracy of evapotranspiration estimate. Individual algorithms are given varied weights based on their performance, as proposed by Nourani et al.³⁶. More information can be found in^33,71. The hybrid model provides more accurate and resilient predictions by harnessing the capabilities of each algorithm and assessing their relative relevance. This method is critical for reducing the constraints of individual algorithms and increasing the dependability of evapotranspiration estimations.

To ensure consistent scale and improve modelling capabilities, the input data for the ML models were normalised. This process, described by Eq. (11), transformed the data to a range between 0 and 1.

$$\:{x}_{norm}=\frac{{x}_{0}-{x}_{min}}{{x}_{max}-{x}_{min}}$$

(11)

where, x_norm represents the normalised value, x₀ is the real value, and x_min and x_max are the minimum and maximum values respectively.

It should be noted that the XGBoost model does not require input variable normalization since it is not sensitive to monotonic input variable normalization.

Input combinations

To investigate the impact of different meteorological variables on RET estimation, six combinations were utilized, as outlined in Table 6. The inputs for the ML models included air temperature (T_max and T_min), relative humidity (RH_mean), solar radiation (R_s), and wind speed (U₂). Likewise, the observed meteorological data were divided into two sets: a training set comprising 70% of the data and a separate testing set for evaluating the performance of the models. This division ensured that the models thoroughly evaluated on a substantial amount of data from each ___location and were rigorously assessed on an independent dataset. All the chosen ML models and simulations were implemented using R software (version 4.2.2).

Table 6 ML models, hybrid models and input meteorological combinations.

Full size table

Evaluation performance

To evaluate the model’s performance in comparison to the standard Penman-Monteith model (FAO-PM56), four statistical metric parameters were employed. These metrics encompassed the Kling Gupta Efficiency index (KGE, Eq. 12), Coefficient of determination (R², Eq. 13), Mean Squared Error (RMSE, Eq. 14), and Root relative squared error (RRSE, Eq. 15). The selection of these metrics aimed to assess the precision, accuracy, under/overestimation, and provide a means for model comparison^72,73,74. To rank the models, the RMSE and RRSE values were arranged in ascending order, while the R² and KGE values were ordered in descending order. The specific formulas for these metrics can be found in Table 7.

Table 7 Statistical metrics parameters.

Full size table

Proposed model development for RET estimation

Figure 3 illustrates the flowchart outlining the suggested empirical, machine learning (ML), and hybrid models for estimating reference evapotranspiration (RET). LR, RF, M5P, XGBoost, LightGBM, RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM are the models used for RET estimation. As input variables, these models use six distinct climatic combinations (Comb1 - Comb6). The performance of each model was extensively explored by assessing the statistical metrics parameters listed above.

Results and discussions

Correlation between PM-FAO 56 daily RET and meteorological variables

Figure 4 shows a correlation between meteorological variables and daily reference evapotranspiration estimated by PM-FAO 56 model at each station studied. Results shows the RET is primarily affected by solar radiation where Pearson’s coefficients were above 78%, indicating a good correlation. This suggests that models that incorporate radiation-based variables may perform better in estimating RET compared to those that rely on temperature-based or mass transfer-based variables. For all Gharb stations, R_n obtained the highest correlation, with values ranging from 0.96 to 0.97, followed by R_s and R_a. However, R_s is the most correlated to R_n and R_a for all Loukkos stations. One of these three variables is explicitly contained in all combination models and those based on radiation-based (Table 3). Temperatures occupy the second place as r values vary from 0.53 to 0.79. Moreover, maximum temperature for Loukkos stations is more correlated with reference evapotranspiration than with mean and minimal temperatures. This leads to saying that a model containing maximum temperature coupled with solar radiation could better estimate RET for these stations. Recent research by Chia et al.⁷¹ pointed out that temperature and radiation are indispensable for estimating RET in semi-arid regions. Conversely, in sub-humid regions, the RET estimation requires the inclusion of evaporation in addition to temperature and radiation. This review underscores the significance of considering distinct climatic conditions when estimating RET in various regions.

Figure 4 further indicate that air vapor pressure deficit (VPD) is moderately correlated with reference evapotranspiration ranging from 0.56 to 0.74. Moreover, mass transfer models usually use this VPD variable as input and lack other variables that could improve RET estimation. On the other hand, relative humidity is negatively correlated with reference evapotranspiration, with r values varying from − 0.63 to -0.27. In the Gharb stations, it is noteworthy that RH_max correlation coefficient is higher than RH_min, regardless of the study period. In line with previous studies^33,44,71, our findings support the positive correlation of T_mean, T_max, T_min, and R_s with RET. Wind speed shows a slight correlation, while relative humidity (RH_mean) exhibits a negative correlation with RET. The correlation coefficients for wind speed (U₂) are relatively low, ranging from 0.21 to 0.40, with the exception of the MB station, which exhibits a correlation coefficient of 0.50. The consideration of wind impact in RET estimation varies among researchers, with some arguing that wind is a significant factor due to potential data inaccuracies⁷⁵, while others suggest that wind has minimal influence⁷⁶ except in areas with high wind conditions. These results align with the findings of other studies^46,54, providing further evidence of the relationship between meteorological variables and RET.

Empirical models’ comparison for daily reference evapotranspiration estimates

The statistical results of the eight empirical models (Dal1802, Trab1896, Harg1975, Irs2003, HargS1985, BC1986, VAL2013a and VAL2013b) for estimating daily RET at the five meteorological stations within the Gharb and Loukkos perimeters are presented in Tables 8 and 9. As above mentioned, the computed statistical indicator values (KGE, R², RMSE and RRSE) were obtained using model performance equations [Eqs. 12–15], evaluated against the PM - FAO 56 model during the training and testing phases. The models were prioritized based on their statistical errors, and the best model was identified accordingly. KGE, R², RMSE, and RRSE were determined to be 0.291–0.974, 0.249–0.964, 0.182–1.287 mm/day and 8.572–51.913%, respectively, during training; and 0.386–0.972, 0.366–0.966, 0.172–1.302 mm/day and 8.137–47.993% during testing. As seen in the table, the empirical models exhibited minimal differences (with an RRSE gap from − 1.57 to 4.63%) between the training and testing phases.

Notably, the combination models (VAL2013a, VAL2013b) outperformed other models across all stations, with KGE R², RMSE, and RRSE ranging 0.947–0.974, 0.942–0.966, 0.318–0.402, 8.137–10.739 respectively, during testing phase. Except for Mensara station, Trab1896 performed successfully at the training and testing phases. It was noticed that The VAL13b model performs noticeably better than the VAL13a model, owing to the distinct variable requirements of each model. This is because it includes all contributing factors affecting RET.

Table 8 Statistical indicators result for evaluating empirical model performance in Gharb perimeter.

Full size table

Table 9 Statistical indicators result for evaluating empirical model performance in Loukkos perimeter.

Full size table

Notably, VAL13b incorporates additional variables, such as relative humidity and wind speed, which account for the aerodynamic effects on RET. Similary, Kisi⁷⁷ evaluated seven empirical models to the PM-FAO 56 in Mediterranean climate and Valiantzas (2013b) was found to be the best model.

The analysis indicates that temperature-based models, specifically Hargreaves and Samani⁶² and Brutsaert and Chen⁷⁸, demonstrate superior performance compared to radiation-based models such as Hargreaves⁶¹ and Irmak et al.⁷⁹, as well as mass transfer models like Dalton⁵⁹ and Trabert⁶⁰. These findings are consistent with the conclusions reported by multiple researchers in the field¹⁷. Conversely, it is important to note that some researchers^20,76,80 have suggested that radiation-based models may perform better than temperature-based models. The higher effectiveness of temperature-based models can be related to temperature’s relative stability compared to solar radiation, which changes depending on conditions like as cloud cover, meteorological conditions, and time of day. Consequently, variations in solar radiation create uncertainty in radiation-based models. Generally, temperature appears to be a more powerful element than solar radiation in promoting evapotranspiration in dry or semi-arid regions where water supply is limited^72,73,74.

Specifically, the HargS1985 model demonstrates superior performance compared to the BC1986 model across all stations. This finding is congruent with Er-raki et al.¹⁶ results, who found that the HargS1985 model provides more accurate estimations in semi-arid conditions of the Tensift basin. Similarly, Almorox et al.¹⁸ reported that the HargS1985 model had the greatest overall performance after analyzing eleven temperature-based models on a worldwide scale. In contrast, other studies have reported that the Hargreaves-Samani model⁶² tends to underestimate reference evapotranspiration in arid regions and overestimate it in wetland environments^81,82.

In term of mass transfer models, Trab1896 performed better than Dal1802 at most stations. The mass transfer models used in this study were unsuitable for estimating daily RET in this region due to substantial statistical errors, with the exception of the Menasra station. In term of radiation models, Harg1975 was generally superior to Irs2003. On the other hand, we found that RET ranking is generally like SAT station data and data supplemented by Mice imputation method (SAT mice). This means that the filling of solar radiation (R_s) gaps did not affect the choice of RET estimation method.

The scatter plots in Fig. 5 illustrate the comparison between the estimated RET values obtained from the best empirical models and the FAO56-PM values during testing at the two meteorological stations. When the data points in the scatter plot were closely matched with the diagonal 1:1 trend line, a strong fit was noticed, showing high agreement between RET estimated by PM-FAO 56 and by empirical model. When the data points deviated considerably from the trend line, it indicated a poor fit, suggesting a lack of connectivity between the two previously estimated models. Overall, the data points in the plots demonstrate a strong correlation, aligning closely to the 1:1 line for models Val2013a, Val2013b, Harg195, and Harg1985.

Comparison of standalone and hybrid ML models using various input combinations

Table 10 presents the averaged statistical performance indicators (KGE, R², RMSE, and RRSE) values for estimating RET across five meteorological stations in the Loukkos and Gharb perimeters, categorized by model and input combination used during testing phase. Additionally, Fig. 6 displays the KGE and RRSE values for each meteorological station during training and testing phases, comparing the four ML models and four hybrid models with six different input combinations. Overall, the statistical indicators values were found to varied substantially based on input combination, model types, and phase employed.

It can be seen from Table 10 that, independent of the perimeter and input combination, KGE, R², RMSE, and RRSE values varied from 0.557 to 0.982, 0.585–0.979, 0.015–0.108, and 6.925–35.360 respectively during testing phase. These results clearly surpass those produced by the empirical model. The RMSE values achieved by these models were smaller than those reported by other researchers using different ML models in various regions^34,40,46,51. It’s worth noting that discrepancies in RMSE values between studies might be caused by various factors such as ML model type and input variable selection, time periods chosen, climate conditions, and data quality^83,84,85.

Among the different input combinations evaluated across all eight models, it was observed that the ML and hybrid models displayed the poorest performance when utilizing T_max, T_min, and RH_mean as input (combination 2). On average, the R² values ranged from 0.585 to 0.669, while the RRSE values varied between 26.724% and 35.360%, indicating relatively lower accuracy compared to other input combinations. This can be attributed to the negative correlation observed between RH_mean and RET estimated, as depicted in Fig. 4. Furthermore, the limited information provided by these variables may not fully capture the complex relationships involved in accurately estimating RET. When comparing combination 1 (T_max, T_min, R_s) with combination 3 (T_max, T_min, R_s, U₂), it was observed that the inclusion of U₂ improve slightly the R² (difference < 0.12) and reduced RMSE values (difference < 0.01). However, the most significant difference was found in the RRSE values, particularly in the Gharb stations, where the improvements ranged from 6.597 to 9.862%.

In the Loukkos stations, the differences in RRSE values were between 0.910% and 1.886%. These disparities suggest that the Loukkos stations experience higher wind speeds compared to those in the Gharb (Table 2), and the correlation between U₂ and RET is slightly weaker in the Loukkos (as shown in Fig. 4). Similarly, when comparing combination 1 (T_max, T_min, R_s) with combination 5 (T_max, T_min, R_s, RH_mean), improvements were observed, although they were lower than in the previous case. The RRSE differences were 3.871–8.923% for Gharb and 0.499–1.537% for Loukkos.

Goyal et al.³³ suggested that incorporating either U₂, RH_mean, or both can improve model performance. Although relative humidity is considered the least significant parameter, its addition to the combination of T_max, T_min, and R_s results in a decrease in RMSE values. Besides, the combination 6, which included T_max, T_min, RH_mean, R_s, and U₂ as input meteorological variables, demonstrated the best performance with R² and RRSE values ranging from 0.955 to 0.979 and from 6.925 to 11.272%, respectively. This improved performance can be attributed to the inclusion of additional variables capture the complex interactions and dynamics involved in estimating RET accurately.

Overall, it’s worth noting that temperature data is a foundational requirement for the models presented in Tables 9 and 10. Without sufficient temperature data, model predictions become unreliable, thereby limiting accuracy of RET estimations. Alternative gridded datasets, such as those from reanalysis products like ERA5-Land, can be used effectively to run models like Penman-Monteith FAO-56⁸⁶. Nouri et al.⁸⁶ demonstrated that the ERA5-Land dataset provides reliable RET estimates, especially in data-limited and windy regions. Their findings revealed that while some models, like recalibrated Hargreaves-Samani and Penman-Monteith with localized wind speed, performed well, others struggled due to wind speed variation.

In term of standalone ML models, the testing phase revealed the following ranking for Gharb: LightGBM6 > XGBoost6 > RF6 > LightGBM3, with average RMSE of 0.015–0.017 mm/day. For Loukkos, the ranking was: LightGBM6 > XGBoost6 > LightGBM3 > XGBoost3 with average RMSE of 0.025–0.027 mm/day. These findings support previous studies by Fan et al.⁴⁰ and Yong et al.³⁴, where LightGBM consistently outperformed other standalone ML models with an RMSE of 0.08–0.58 mm/day and 0.041–0.315 mm/day, respectively. Further, there was a minor difference between LightGBM6 model and XGBoost6 model, with LightGBM6 having a little higher RRSE value (0.34–0.5%). It should be pointed out that while XGBoost had the highest KGE value among all models, our ranking also considered the low RMSE and RSSE values, which placed XGBoost in the second position.

Table 10 Average statistical indicators values for different standalone and hybrid ML models across studied stations during testing phase with various input combinations.

Full size table

In both study area, the worst performance were obtained by M5P2 < XGBoost2 < RF2 < LightGBM2, where on average KGE < 0.772, R² < 0.661, RMSE > 0.067 mm/day and RRSE > 27.097% (Table 10). These findings contradict those of Granata³⁸, who claimed that the M5P models performed well while the RF models were the least accurate. From Fig. 6, it can be observed that RF demonstrated strong performance during the training phase across all stations, likely due to its ensemble nature and robustness to noise.

However, LightGBM’s performance was often inferior to that of the RF and XGBoost models. This could be due to its different optimization approach and less effective capturing of complex relationships. This finding aligns with the study by Chia et al.⁷¹, which reported that LightGBM exhibited relatively weaker performance than RF and the M5 tree model during the training phase. The authors explained that LightGBM’s leaf-wise optimization required sufficient training data for effective performance⁷¹. Interestingly, the situation reversed during the testing phase, demonstrating that LightGBM performed better when properly trained. It is noteworthy to mention that for all ML model, the model’s performance difference was triggered by a difference in the training and testing datasets, such as temporal differences in meteorological data patterns during both phases.

In term of hybrid ML models, the Gharb’s performance ranking was XGBoost-LightGBM6 > RF-LightGBM6 > RF-XGBoost6 > RF-M5P6. The R² (RRSE) values were 0.979 (7.598), 0.978 (7.813), 0.974 (8.685) and 0.973 (9.211) respectively. For Loukkos, the ranking models were XGBoost-LightGBM6 > RF-LightGBM6 > RF-XGBoost6 > RF-LightGBM3, with R² (RRSE) values of 0.975 (6.925), 0.974 (7.038), 0.971 (7.619) and 0.969 (7.635) respectively. From Table 10,, the result shows that XGBoost-LightGBM perform the best. This might be because XGBoost is recognized for its regularization approaches and successful handling of complicated relationships, whereas LightGBM excels in efficient computation and handling huge datasets. Consequently, their hybridization could detect a wider range of patterns and improve overall performance. In contrast, the lowest performance occurred in RF-M5P2 < RF-LightGBM2 < XGBoost-LightGBM2 < RF-XGBoost2. Nonetheless, during training phase, RF-XGBoost model gave good performance, as shown in Fig. 6.

When comparing standoalone ML models with hybrid ML models, it was found that XGBoost-LightGBM was highly close to LightGBM in term of all statistical performance indicator. For instance, compared with XGBoost-LightGBM6, RRSE of LightGBM6, XGBoost6 and RF6 was in difference of 0.318% (0.096%), 0.655% (0.592%), 2.468% (1.258%), respectively for Gharb (Loukkos). This suggests that while the hybridization approach slightly improved model performance, the improvement was not significant. It’s worth noting that factors such as computational efficiency and implementation flexibility might impact the selection between standalone ML models and hybrid models. Therefore, LightGBM and XGBoost are the recommended ML model for estimating RET in our research study.

Collectively, these studies highlight the effectiveness of standalone and hybrid machine learning models for improved accuracy in RET estimation^{33,34,38,40,51,54,55,56,71}. Similary, our findings indicate that standalone ML models exhibited better performance and accuracy compared to the empirical models for estimating daily RET at Gharb and Loukkos stations, using T_max, T_min, RH_mean, R_s, U₂ as input variables.

Research’s limitations

This research has some notable limitations that need to be addressed. The study’s period is relatively limited, spanning from 2011 to 2017. Hence, expanding the study’s period would provide a more comprehensive understanding of the ML models’ performance across diverse climatic fluctuations over a longer period. Moreover, the investigation of current study limited from subhumid to semi-arid climatic conditions only, further investigation should incorporate more climatic conditions to examine climatic variability thoroughly. Additionally, the accuracy and performance of ML and hybrid models heavily relied on the availability and quality of meteorological data, which could impact their effectiveness in areas with limited or incomplete data. To address this issue, future studies could utilise reanalysis data products, such as ERA5 ERA5-Land, and MERRA-2, which provide continuous, high-resolution data across extended temporal and spatial scales. Such data sources could prove invaluable in regions where in situ measurements are scarce. Moreover, ML models lack physical mechanisms, making it challenging to comprehend their inner workings and create accurate models without knowledge of functional specifications⁷⁴. The issue of over-fitting and under-fitting during the training/testing phases of ML models, due to the dataset random division, may also affect model accuracy. Employing advanced validation techniques could help enhance model reliability and generalizability.

Conclusion

Estimating reference evapotranspiration (RET) accurately has remained a major focus across a wide range of applications, including water resource management, agricultural water requirements, irrigation scheduling, and climate change assessments. In this research, we investigated the ability of four machine learning (ML) models, and their hybrid models along with eight empirical models (grouped in mass transfer-based, temperature-based, radiation-based and combination models) to estimate daily RET in subhumid and semi-arid irrigated perimeters in Morocco. For six input combinations, RF, M5P, XGBoost, LightGBM, RF-M5P, RF-XGBoost, RF-LightGBM, and XGBoost-LightGBM were all thoroughly evaluated. The results showed that combination models (particularly, Valiantzas 2013 (VAL2013b)) were the best empirical models and that temperature-based models generally outperformed radiation-based models. Compared with empirical models, ML models gave more accurate RET estimation, and the hybrid XGBoost-LightGBM models provided the highest statistical indicator values (KGE, R², RMSE, RRSE). Interestingly, the standalone ML model LightGBM also demonstrated acceptable accuracy across all stations and input combinations, indicating its potential as a promising model for RET estimation with limited data. Moreover, the XGBoost model is also an intriguing alternative ML model. Overall, models with the input variables T_max, T_min, RH_mean, R_s, and U₂ performed better for daily RET estimation.

The current research highlights the ML and hybrid models’ efficiency in estimating daily RET within two irrigated Moroccan perimeters. Ultimately, further investigations could explore additional ML algorithms, hybrid model configurations, and their relevance to long-term datasets at different time scales and various climate regions.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Satpathi, A. et al. Estimation of crop evapotranspiration using statistical and machine learning techniques with limited meteorological data: A case study in Udham Singh Nagar, India. Theor. Appl. Climatol. https://doi.org/10.1007/s00704-024-04953-3 (2024).
Article MATH Google Scholar
Vishwakarma, D. K. et al. Methods to estimate evapotranspiration in humid and subtropical climate conditions. Agric. Water Manag. 261, 107378 (2022).
Article MATH Google Scholar
Mirzania, E., Vishwakarma, D. K., Bui, Q. A. T., Band, S. S. & Dehghani, R. A novel hybrid AIG-SVR model for estimating daily reference evapotranspiration. Arab. J. Geosci. 16, 301 (2023).
Article Google Scholar
Dingman, S. L. Physical Hydrology (Waveland, 2015).
Wanniarachchi, S. & Sarukkalige, R. A. Review on evapotranspiration estimation in agricultural water management: Past, present, and future. Hydrology 9, 123 (2022).
Article Google Scholar
Jerin, J. N., Islam, A. R. M. T., Al Mamun, M. A., Mozahid, M. N. & Ibrahim, S. M. Climate change effects on potential evapotranspiration in Bangladesh. Arab. J. Geosci. 14 (2021).
Dinpashoh, Y., Jahanbakhsh-Asl, S., Rasouli, A. A., Foroughi, M. & Singh, V. P. Impact of climate change on potential evapotranspiration (case study: west and NW of Iran). Theor. Appl. Climatol. 136, 185–201 (2019).
Article ADS Google Scholar
Haider, S. et al. Simulation of the potential impacts of projected climate and land use change on runoff under CMIP6 scenarios. Water 15, 3421 (2023).
Article MATH Google Scholar
Nouri, M. Drought assessment using gridded data sources in data-poor areas with different aridity conditions. Water Resour. Manag. 37, 4327–4343 (2023).
Article MATH Google Scholar
Noguera, I., Domínguez-Castro, F. & Vicente-Serrano, S. M. Flash drought response to precipitation and atmospheric evaporative demand in Spain. Atmosphere (Basel). 12, 165 (2021).
Article ADS Google Scholar
Herold, N., Kala, J. & Alexander, L. V. The influence of soil moisture deficits on Australian heatwaves. Environ. Res. Lett. 11, 064003 (2016).
Article ADS MATH Google Scholar
Raza, A. et al. Misconceptions of reference and potential evapotranspiration: A PRISMA-guided comprehensive review. Hydrology 9, 153 (2022).
Article MATH Google Scholar
Monteith, J. L. Evaporation and Environment the State and Movement of Water in Living Organisms. In Symp. 19 Soc. Exp. Bid (ed. Fogg, G.E.) (Cambridge University Press, 1965).
Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop Evapotranspiration: Guidelines for Computing Crop Water Requirements. Irrigation and Drainage Paper No 56. Food and Agriculture Organization of the United Nations (FAO), Rome, Italy. https://doi.org/10.3390/agronomy9100614 (1998).
Allen, R. G. et al. The ASCE Standardized Reference Evapotranspiration Equation (2005).
Er-Raki, S. et al. Assessment of reference evapotranspiration methods in semi-arid regions: Can weather forecast data be used as alternate of ground meteorological parameters? J. Arid Environ. 74, 1587–1596 (2010).
Article ADS MATH Google Scholar
Hamed, M. M., Khan, N., Muhammad, M. K. I. & Shahid, S. Ranking of empirical evapotranspiration models in different climate zones of Pakistan. Land 11, 2168 (2022).
Article MATH Google Scholar
Almorox, J., Quej, V. H. & Martí, P. Global performance ranking of temperature-based approaches for evapotranspiration estimation considering Köppen climate classes. J. Hydrol. 528, 514–522 (2015).
Article ADS Google Scholar
Valipour, M. Retracted: Comparative evaluation of radiation-based methods for estimation of potential evapotranspiration. J. Hydrol. Eng. 20, 4014068 (2015).
Article MATH Google Scholar
Zeggaf, T. A., El Mourid, M., Karrou, M. & Steduto, P. Comparaison des méthodes d’estimation de l’évapotranspiration de référence dans la région du Tadla-Maroc. AL AWAMIA. 100, 73–84 (1999).
Google Scholar
Dai, L. et al. Comparison of fourteen reference evapotranspiration models with lysimeter measurements at a site in the humid Alpine Meadow, northeastern Qinghai-Tibetan Plateau. Front. Plant. Sci. 13 (2022).
Bouhlassa, S. & Paré, S. Évapotranspiration De référence dans la région aride de tafilalet Au sud-est du Maroc. Afr. J. Environ. Assess. Manag. 11, 1–16 (2006).
Google Scholar
Hadria, R., Benabdelouhab, T., Lionboui, H. & Salhi, A. Comparative assessment of different reference evapotranspiration models towards a fit calibration for arid and semi-arid areas. J. Arid Environ. 184, 104318 (2021).
Article ADS PubMed Google Scholar
Liou, Y. A. & Kar, S. K. Evapotranspiration estimation with remote sensing and various surface energy balance algorithms—A review. Energies 7, 2821–2849 (2014).
Article MATH Google Scholar
Elfarkh, J. et al. Evapotranspiration estimates in a traditional irrigated area in semi-arid Mediterranean. Comparison of four remote sensing-based models. Agric. Water Manag. 270, 107728 (2022).
Article MATH Google Scholar
El-Rawy, M. et al. An Integrated GIS and machine-learning technique for groundwater quality assessment and prediction in southern Saudi Arabia. Water 15, 2448 (2023).
Article CAS MATH Google Scholar
Alshehri, F. & Rahman, A. Coupling machine and deep learning with explainable Artificial intelligence for improving prediction of groundwater quality and decision-making in Arid Region, Saudi Arabia. Water 15, 2298 (2023).
Article MATH Google Scholar
Abd El-Hamid, H. T. & Alshehri, F. Integrated remote sensing data and machine learning for drought prediction in eastern Saudi Arabia. J. Coast Conserv. 27, 48 (2023).
Article MATH Google Scholar
Nhu, V. H. et al. GIS-based gully erosion susceptibility mapping: A comparison of computational ensemble data mining models. Appl. Sci. 10. (2039). https://doi.org/10.3390/app10062039 (2020).
Prodhan, F. A., Zhang, J., Hasan, S. S., Sharma, P., Mohana, H. P. & T. P. & A review of machine learning methods for drought hazard monitoring and forecasting: Current research trends, challenges, and future research directions. Environ. Model. Softw. 149, 105327 (2022).
Article Google Scholar
Pham, Q. B. et al. Groundwater level prediction using machine learning algorithms in a drought-prone area. Neural Comput. Appl. 34, 10751–10773 (2022).
Article MATH Google Scholar
Raza, A. et al. Performance evaluation of five machine learning algorithms for estimating reference evapotranspiration in an arid climate. Water 15, 3822 (2023).
Article MATH Google Scholar
Goyal, P., Kumar, S. & Sharda, R. A review of the artificial intelligence (AI) based techniques for estimating reference evapotranspiration: current trends and future perspectives. Comput. Electron. Agric. 209, 107836 (2023).
Article MATH Google Scholar
Yong, S. L. S., Ng, J. L., Huang, Y. F. & Ang, C. K. Estimation of reference crop evapotranspiration with three different machine learning models and limited meteorological variables. Agronomy 13, 1048 (2023).
Article Google Scholar
Kisi, O. The potential of different ANN techniques in evapotranspiration modelling. Hydrol. Process. 22, 2449–2460 (2008).
Article ADS MATH Google Scholar
Nourani, V., Elkiran, G. & Abdullahi, J. Multi-station artificial intelligence based ensemble modeling of reference evapotranspiration using pan evaporation measurements. J. Hydrol. 577, 123958 (2019).
Article Google Scholar
Singh, A. K. et al. An integrated statistical-machine learning approach for runoff prediction. Sustainability 14, 8209 (2022).
Article MATH Google Scholar
Granata, F. Evapotranspiration evaluation models based on machine learning algorithms—A comparative study. Agric. Water Manag. 217, 303–315 (2019).
Article MATH Google Scholar
Vishwakarma, D. K. et al. Pre- and post-dam river water temperature alteration prediction using advanced machine learning models. Environ. Sci. Pollut Res. https://doi.org/10.1007/s11356-022-21596-x (2022).
Article MATH Google Scholar
Fan, J. et al. Light gradient boosting machine: An efficient soft computing model for estimating daily reference evapotranspiration with local and external meteorological data. Agric. Water Manag. 225, 105758 (2019).
Article MATH Google Scholar
Elbeltagi, A. et al. Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models. Environ. Sci. Pollut Res. 30, 43183–43202 (2023).
Article Google Scholar
Achite, M. et al. Performance of machine learning techniques for meteorological drought forecasting in the Wadi Mina Basin, Algeria. Water 15, 765 (2023).
Article MATH Google Scholar
Kumar, D. et al. Multi-ahead electrical conductivity forecasting of surface water based on machine learning algorithms. Appl. Water Sci. 13, 192 (2023).
Article ADS CAS MATH Google Scholar
Elbeltagi, A. et al. Forecasting long-series daily reference evapotranspiration based on best subset regression and machine learning in Egypt. Water 15, 1149 (2023).
Article MATH Google Scholar
Kushwaha, N. L. et al. Data intelligence model and meta-heuristic algorithms-based pan evaporation modelling in two different agro-climatic zones: A case study from Northern India. Atmosphere (Basel) 12, 1654 (2021).
Article ADS MATH Google Scholar
Fan, J. et al. Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric. Meteorol. 263, 225–241 (2018).
Article MATH Google Scholar
Torsoni, G. B. et al. Soybean yield prediction by machine learning and climate. Theor. Appl. Climatol. 151, 1709–1725 (2023).
Article ADS MATH Google Scholar
Masood, A. et al. Improving PM2.5 prediction in New Delhi using a hybrid extreme learning machine coupled with snake optimization algorithm. Sci. Rep. 13, 21057 (2023).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Shahhosseini, M., Hu, G., Huber, I. & Archontoulis, S. V. Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt. Sci. Rep. 11, 1606 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
González, S., García, S., Del Ser, J., Rokach, L. & Herrera, F. A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and opportunities. Inf. Fusion. 64, 205–237 (2020).
Article Google Scholar
Lachgar, N., Berrajaa, A., Essabbar, M. & Saikouk, H. Machine learning approach for reference evapotranspiration estimation in the Region of Fes, Morocco. In International Conference on Digital Technologies and Applications. 105–113 (Springer, 2023).
Nagalla, R., Pothuganti, P. & Pawar, D. S. Analyzing gap acceptance behavior at unsignalized intersections using support vector machines, decision tree and random forests. Proc. Comput. Sci. 109, 474–481 (2017).
Article Google Scholar
Pal, M. & Mather, P. M. An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens. Environ. 86, 554–565 (2003).
Article ADS MATH Google Scholar
Elbeltagi, A. et al. Data intelligence and hybrid metaheuristic algorithms-based estimation of reference evapotranspiration. Appl. Water Sci. 12, 152 (2022).
Article ADS Google Scholar
El Hachimi, C., Salwa, B., Saïd, K. & Abdelghani, C. Early estimation of daily reference evapotranspiration using machine learning techniques for efficient management of irrigation water. J. Phys. Conf. Ser. 2224, 12006 (IOP Publishing, 2022).
Hachimi, C. et al. Smart weather data management based on artificial intelligence and big data analytics for precision agriculture. Agriculture 13, 95 (2022).
Article MATH Google Scholar
Van Buuren, S. & Groothuis-Oudshoorn, K. Mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011).
Article MATH Google Scholar
Acharki, S., Amharref, M., El Halimi, R. & Bernoussi, A. S. Évaluation par approche statistique de l’impact des changements climatiques sur les ressources en eau: Application Au périmètre Du Gharb (Maroc). Rev. Des. Sci. l’Eau/J. Water Sci. 32, 291–315 (2019).
Google Scholar
Dalton, J. Experimental essays on the constitution of mixed gases; on the force of stream or vapor from water and other liquids, both in a Torricellian vacuum and in air; on evaporation; and on the expansion of gases by heat. Proc. Manch. Lit. Philos. Soc. 5, 536–602 (1802).
MATH Google Scholar
Trabert, W. Neue Beobachtungen über Verdampfungsgeschwindigkeiten. Meteorol. Z. 13, 261–263 (1896).
Google Scholar
Hargreaves, G. H. Moisture availability and crop production. Trans. ASAE. 18, 980–984 (1975).
Article MATH Google Scholar
Hargreaves, G. H. & Samani, Z. A. Reference crop evapotranspiration from temperature. Appl. Eng. Agric. 1, 96–99 (1985).
Article MATH Google Scholar
Allen, R. G. & Pruitt, W. O. Rational use of the FAO Blaney-Criddle formula. J. Irrig. Drain. Eng. 112, 139–155 (1986).
Article MATH Google Scholar
Irmak, S. & Haman, D. Z. Evapotranspiration: Potential or reference? Agric. Eng. Fla. Coop. Ext. Serv. Inst. Food Agric. Sci. Univ. Fla. US ABE. 343, 1–3 (2003).
Google Scholar
Valiantzas, J. D. Simple: ET 0 forms of Penman’s equation without wind and/or humidity data. II: comparisons with reduced set-FAO and other methodologies. J. Irrig. Drain. Eng. 139, 9–19 (2013).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Quinlan, J. R. Learning with continuous classes. In 5th Australian Joint Conference on Artificial Intelligence. Vol. 92. 343–348 (World Scientific, 1992).
Solomatine, D. P. & Xue, Y. M5 model trees and neural networks: Application to flood forecasting in the Upper Reach of the Huai River in China. J. Hydrol. Eng. 9, 491–501 (2004).
Article MATH Google Scholar
Chen, T. et al. Xgboost: Extreme Gradient Boosting. R Package Version 0.4-2. Vol. 1. 1–4 (2015).
Ke, G., Ye, Q., Chen, W., Liu, T. Y. & LightGBM. Highly Effic. Gradient Boost Decis. Tree 30 (2016).
Chia, M. Y., Huang, Y. F., Koo, C. H. & Fung, K. F. Recent advances in evapotranspiration estimation using artificial intelligence approaches with a focus on hybridization techniques—A review. Agronomy 10. https://doi.org/10.3390/agronomy10010101 (2020).
Raza, A. et al. Use of gene expression programming to predict reference evapotranspiration in different climatic conditions. Appl. Water Sci. 14, 152 (2024).
Article ADS MATH Google Scholar
Raza, A. et al. Modelling reference evapotranspiration using principal component analysis and machine learning methods under different climatic environments. Irrig. Drain. 72, 945–970 (2023).
Article MATH Google Scholar
Wang, J. et al. Development of monthly reference evapotranspiration machine learning models and mapping of Pakistan—A comparative study. Water 14, 1666 (2022).
Article MATH Google Scholar
Grace, B. & Quick, B. A comparison of methods for the calculation of potential evapotranspiration under the windy semi-arid conditions of southern Alberta. Can. Water Resour. J. 13, 9–19 (1988).
Article MATH Google Scholar
Pandey, P. K., Dabral, P. P. & Pandey, V. Evaluation of reference evapotranspiration methods for the northeastern region of India. Int. Soil. Water Conserv. Res. 4, 52–63 (2016).
Article MATH Google Scholar
Kisi, O. Comparison of different empirical methods for estimating daily reference evapotranspiration in Mediterranean climate. J. Irrig. Drain. Eng. 140, 4013002 (2014).
Article MATH Google Scholar
Brutsaert, W. & Chen, D. Diurnal variation of surface fluxes during thorough drying (or severe drought) of natural Prairie. Water Resour. Res. 32, 2013–2019 (1996).
Article ADS MATH Google Scholar
Irmak, S., Allen, R. G. & Whitty, E. B. Daily grass and Alfalfa-reference evapotranspiration estimates and alfalfa-to-grass evapotranspiration ratios in Florida. J. Irrig. Drain. Eng. 129, 360–370 (2003).
Article Google Scholar
Arellano, M. G. & Irmak, S. Reference (potential) evapotranspiration. I: Comparison of temperature, radiation, and combination-based energy balance equations in humid, subhumid, arid, semiarid, and Mediterranean-type climates. J. Irrig. Drain. Eng. 142, 4015065 (2016).
Article MATH Google Scholar
Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop evapotranspiration-guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao Rome. 300, D05109 (1998).
Google Scholar
Droogers, P. & Allen, R. G. Estimating reference evapotranspiration under inaccurate data conditions. Irrig. Drain. Syst. 16, 33–45 (2002).
Article Google Scholar
Raza, A. et al. Comparative assessment of reference evapotranspiration estimation using conventional method and machine learning algorithms in four climatic regions. Pure Appl. Geophys. 177, 4479–4508 (2020).
Article ADS MATH Google Scholar
Raza, A. et al. Application of non-conventional soft computing approaches for estimation of reference evapotranspiration in various climatic regions. Theor. Appl. Climatol. 139, 1459–1477 (2020).
Article ADS MATH Google Scholar
Raza, A. et al. Comparative study of powerful predictive modeling techniques for modeling monthly reference evapotranspiration in various climatic regions. Fresenius Environ. Bull. 30, 7490–7513 (2021).
CAS MATH Google Scholar
Nouri, M., Ebrahimipak, N. A. & Hosseini, S. N. Estimating reference evapotranspiration for water-limited windy areas under data scarcity. Theor. Appl. Climatol. 150, 593–611 (2022).
Article ADS Google Scholar
Pelosi, A., Terribile, F., D’Urso, G. & Chirico, G. Comparison of ERA5-Land and UERRA MESCAN-SURFEX reanalysis data with spatially interpolated weather observations for the regional assessment of reference evapotranspiration. Water 12, 1669 (2020).
Article Google Scholar
Allen, R. G. et al. Conditioning point and gridded weather data under aridity conditions for calculation of reference evapotranspiration. Agric. Water Manag. 245, 106531 (2021).
Article MATH Google Scholar

Download references

Acknowledgements

The authors extend their appreciation to the Deanship of Scientific Research, King Saud University for funding through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water.

Funding

Open access funding provided by Lulea University of Technology. This research was funded by the Deanship of Scientific Research, King Saud University through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water.

Author information

Authors and Affiliations

Faculty of Sciences and Technologies of Tangier, Abdelmalek Essaadi University, 93000, Tetouan, Morocco
Siham Acharki, Mina Amharref & Abdes Samed Bernoussi
Center for Remote Sensing Applications (CRSA), Mohammed VI Polytechnic University (UM6P), 43150, Benguerir, Morocco
Siham Acharki
School of Agricultural Engineering, Jiangsu University, Zhenjiang, 212013, People’s Republic of China
Ali Raza
Department of Irrigation and Drainage Engineering, G.B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, 263145, India
Dinesh Kumar Vishwakarma
K. Banerjee Centre of Atmospheric and Ocean Studies, University of Allahabad, Prayagraj, Uttar Pradesh, 211002, India
Sudhir Kumar Singh
Department of Civil, Environmental, and Natural Resources Engineering, Lulea University of Technology, 97187, Lulea, Sweden
Nadhir Al-Ansari
Prince Sultan Bin Abdulaziz International Prize for Water Chair, Prince Sultan Institute for Environmental, Water and Desert Research, King Saud University, P.O. Box 2454, Riyadh 11451, Saudi Arabia
Ahmed Z. Dewidar & Mohamed A. Mattar
Department of Agricultural Engineering, College of Food and Agriculture Sciences, King Saud University, Riyadh 11451, Saudi Arabia
Ahmed Z. Dewidar, Ahmed A. Al-Othman & Mohamed A. Mattar
Agricultural Engineering Research Institute (AEnRI), Agricultural Research Centre, P.O. Box 256, Giza, Egypt
Mohamed A. Mattar

Authors

Siham Acharki
View author publications
Search author on:PubMed Google Scholar
Ali Raza
View author publications
Search author on:PubMed Google Scholar
Dinesh Kumar Vishwakarma
View author publications
Search author on:PubMed Google Scholar
Mina Amharref
View author publications
Search author on:PubMed Google Scholar
Abdes Samed Bernoussi
View author publications
Search author on:PubMed Google Scholar
Sudhir Kumar Singh
View author publications
Search author on:PubMed Google Scholar
Nadhir Al-Ansari
View author publications
Search author on:PubMed Google Scholar
Ahmed Z. Dewidar
View author publications
Search author on:PubMed Google Scholar
Ahmed A. Al-Othman
View author publications
Search author on:PubMed Google Scholar
Mohamed A. Mattar
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, supervision, methodology, formal analysis, writing—original draft preparation, writing—review and editing, S.A., A.R., D.K.V., M.A.; data curation, project administration, investigation, writing—review and editing, A.S.B., S.K.S., N.A-A., A.Z.D., A.A.A., M.A.M. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Dinesh Kumar Vishwakarma, Nadhir Al-Ansari or Mohamed A. Mattar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Acharki, S., Raza, A., Vishwakarma, D.K. et al. Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates. Sci Rep 15, 2542 (2025). https://doi.org/10.1038/s41598-024-83859-6

Download citation

Received: 24 April 2024
Accepted: 18 December 2024
Published: 20 January 2025
DOI: https://doi.org/10.1038/s41598-024-83859-6

Keywords

This article is cited by

Air temperature estimation and modeling using data driven techniques based on best subset regression model in Egypt
- Ahmed Elbeltagi
- Dinesh Kumar Vishwakarma
- Ali Salem
Scientific Reports (2025)
An ensemble-driven machine learning framework for enhanced water quality classification
- Preet Singh
- Taniya Hasija
- Ateeq Ur Rehman
Discover Sustainability (2025)
Evaluating machine learning models and feature selection for reference evapotranspiration estimation in semi-arid regions: a case study in Doukkala, Morocco
- Zaid Belarbi
- Yacine EL Younoussi
Theoretical and Applied Climatology (2025)

Subjects

Abstract

Similar content being viewed by others

Predictive framework of vegetation resistance in channel flow

Assessing climate and land use impacts on surface water yield using remote sensing and machine learning

Development of a novel modeling framework based on weighted kernel extreme learning machine and ridge regression for streamflow forecasting

Introduction

Materials and methods

Study area and data collection

Missing data imputation

Estimating evapotranspiration via FAO-56 Penman-Monteith and empirical models

Machine learning models and hybrid models description

Linear regression (LR)

Random forests (RF)

M5 pruned (M5P)

Extreme gradient boosting (XGBoost)

Light gradient boosting machine (LightGBM)

Weighted hybridization for ML algorithms

Input combinations

Evaluation performance

Proposed model development for RET estimation

Results and discussions

Correlation between PM-FAO 56 daily RET and meteorological variables

Empirical models’ comparison for daily reference evapotranspiration estimates

Comparison of standalone and hybrid ML models using various input combinations

Research’s limitations

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Air temperature estimation and modeling using data driven techniques based on best subset regression model in Egypt

An ensemble-driven machine learning framework for enhanced water quality classification

Evaluating machine learning models and feature selection for reference evapotranspiration estimation in semi-arid regions: a case study in Doukkala, Morocco

Search

Quick links