Introduction

Solar radiation is the essential energy source driving biological and physical processes at the Earth's surface1,2. Accurate assessment and prediction of solar radiation form the foundation of precision agriculture management, including quantifying crop water requirements and evaluating crop productivity3. However, instrumentation for measuring solar radiation is installed only at selected sites within specific regions, constrained by cost and technological limitations4,5,6,7,8,9. For instance, China has 752 national weather stations, but fewer than one-fifth are equipped to observe solar radiation10. Similarly, in the United States, the ratio of stations observing solar radiation to those observing air temperature is less than 1:10011. This scarcity underscores the importance of estimating solar radiation from routinely measured climatic variables12.

Numerous models and techniques have been proposed for estimating solar radiation, encompassing the Ångström–Prescott linear regression model, the Annandale method, and various machine learning (ML) models9,13. Among them, ML models, particularly artificial neural networks (ANN), have emerged as promising methods for solar radiation estimation14,15,16,17,18. Wang18 compared the accuracy of multilayer perceptron (MLP), generalized regression neural network (GRNN), and radial basis neural network (RBNN) models in estimating daily solar radiation; the results showed that the MLP and RBNN models performed better than the GRNN and the improved Bristow–Campbell model. These studies highlight the strong potential of such models for solar energy applications across various regions.

New ML methods have also been introduced for solar radiation simulation, such as the wavelet-coupled support vector machine (W-SVM) approach19, the Wavelet Transform (WT) algorithm incorporated into the SVM model20, and the extreme learning machine (ELM)21,22. These novel approaches have demonstrated superiority over traditional models in terms of efficiency, viability, and accuracy.

Nevertheless, while ML models have proven effective in estimating or predicting solar radiation, their performance may differ across stations and climatic regions. Moreover, existing studies primarily focus on solar radiation estimation using ML models, leaving the applicability of ML models for estimating all-wave net surface radiation (Rn) unexplored. Rn, the net flux density of radiation at the Earth's surface, plays a vital role in biological reactions such as photosynthesis and in physical processes such as water and heat flux cycling23,24,25; Rn is therefore generally considered one of the most important input variables in calculating crop evapotranspiration and in crop growth models. Although approximately 120 stations observe global solar radiation within China's meteorological network, even fewer are equipped with Rn monitoring instruments. Therefore, the development of applicable models for Rn estimation can furnish essential tools for investigating the Earth's surface energy distribution, with implications for eco-hydrological processes, crop growth modeling, and solar energy development26,27.

An integrated study of Rn estimation using ML models driven by meteorological variables across diverse climatic zones has not been extensively explored, which motivates the present research. This study aims to estimate Rn using four ML models (the ELM, backpropagation neural networks optimized by a genetic algorithm (GANN), random forests (RF), and the GRNN) and subsequently to compare the performances of these models in Rn estimation across different climatic zones in China.

Materials and methods

Study sites and data sets

In the present study, four meteorological stations were strategically selected to represent different climatic zones within China. These include the Urumqi station, situated in the temperate continental zone; the Lhasa station, located in the mountain plateau zone; Beijing, representing the temperate monsoon zone; and the Kunming station, indicative of the subtropical monsoon zone. The geographical distribution of these four stations is depicted in Fig. 1.

Fig. 1
figure 1

Geographical position of the meteorological stations.

The temperate continental zone, where the Urumqi station is located, is typically an arid region characterized by low annual precipitation (approximately 300 mm yr−1, as detailed in Table 1). Conversely, the mountain plateau, temperate monsoon, and subtropical monsoon zones, represented by the Lhasa, Beijing, and Kunming stations, are generally categorized as semi-arid, semi-humid, and humid regions, respectively. A summary of the meteorological variables at each station is available in Table 1.

Table 1 Annual means of the main climatic variables at each station during 2000–2014, including sunshine duration (S), air temperature (Ta), relative humidity (RH), global solar radiation (Rs) and precipitation.

The daily meteorological dataset used for this study was obtained from the National Meteorological Information Center of China (http://data.cma.cn/). This comprehensive dataset encompasses daily maximum air temperature (Tmax), minimum air temperature (Tmin), sunshine duration (S), relative humidity (RH), global solar radiation (Rs), and net radiation (Rn) for the period 2000–2014. These datasets were subject to rigorous quality control by the National Meteorological Administration using standard data-processing methods. Any gaps in the data, caused by issues such as instrument maintenance or power failures, were filled using linear interpolation, ensuring continuity and reliability in the data and facilitating more robust analyses.
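To illustrate this gap-filling step concretely, the sketch below applies time-based linear interpolation to a short synthetic gap. This is a minimal pandas illustration, not the National Meteorological Administration's actual processing pipeline, and all names and values are placeholders.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for one station's daily Rn record (real data: 2000-2014).
idx = pd.date_range("2000-01-01", "2000-12-31", freq="D")
rn = 6.0 + 8.0 * np.sin(np.pi * np.arange(idx.size) / idx.size)  # seasonal shape
df = pd.DataFrame({"Rn": rn}, index=idx)

# Knock out a short gap, e.g. instrument maintenance or a power failure.
df.loc["2000-06-10":"2000-06-14", "Rn"] = np.nan

# Linear interpolation in time restores a continuous series.
df["Rn"] = df["Rn"].interpolate(method="time")
```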

A Random Forest analysis was conducted with the "rfPermute" package28 to identify the environmental factors (Tmax, Tmin, S, RH and Rs) significantly affecting the response variable Rn. As shown in Fig. 2, the importance of the climatic variables to Rn prediction followed the order RH, Rs, S, Tmin and Tmax at all stations, with RH the most important variable. The importance of the same climatic variable to Rn also varied across the sites, as quantified by the Mean Decrease in Accuracy (%IncMSE).

Fig. 2
figure 2

The importance of the climatic variables to net radiation at the four stations (quantified using Mean Decrease in Accuracy, %IncMSE). ** and * indicate significance levels of 0.01 and 0.05, respectively.
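rfPermute is an R package; as a rough Python analogue, the sketch below computes a permutation importance on the mean squared error with scikit-learn, which parallels the %IncMSE measure used in Fig. 2. The data here are synthetic placeholders constructed so that the fourth and fifth columns (standing in for RH and Rs) dominate, merely to make the output readable.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import permutation_importance

feature_names = ["Tmax", "Tmin", "S", "RH", "Rs"]
rng = np.random.default_rng(0)
X = rng.random((1000, 5))                                   # placeholder predictors
y = 0.6 * X[:, 3] + 0.3 * X[:, 4] + 0.1 * rng.random(1000)  # synthetic Rn stand-in

rf = RandomForestRegressor(n_estimators=500, random_state=0).fit(X, y)

# Shuffling an important predictor markedly increases the MSE.
result = permutation_importance(
    rf, X, y, scoring="neg_mean_squared_error", n_repeats=30, random_state=0
)
for name, imp in sorted(zip(feature_names, result.importances_mean),
                        key=lambda t: -t[1]):
    print(f"{name}: {imp:.4f}")
```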

Artificial intelligence models for solar radiation estimation

Extreme learning machine

The Extreme Learning Machine (ELM) represents a specific form of the Feedforward Neural Network (FFNN) model, as illustrated in Fig. 3. Structurally, an FFNN model typically encompasses three principal layers: the input layer (containing the input variables), the hidden layer (comprising neurons), and the output layer (yielding the predicted Rn in this study).

Fig. 3
figure 3

The structures or flow charts of the ELM, GANN, RF and GRNN models.

Distinctively, the ELM model permits random assignment of the input weights and analytical determination of the output weights. This leads to advantages such as superior learning efficiency and enhanced generalization capability compared to traditional FFNN models. Within the ELM framework, the hidden-layer parameters are assigned randomly and remain fixed thereafter, with only the output weights computed analytically; conversely, traditional approaches iteratively adjust the parameters of every layer during training.

The net result of this architectural distinction is that the learning or training efficiency of the ELM model is typically far greater than that of conventional feed-forward network learning algorithms29,30. For an in-depth exploration of the theoretical foundations and applications of ELM, readers are directed to several seminal works in the field20,29,30,31,32,33,34.

In an FFNN model, multiple neurons facilitate the connections between the distinct layers of the network. Assume that the number of input variables is denoted as n and the number of output variables is represented as m. Correspondingly, the input layer of the FFNN model will consist of n neurons, while the output layer will comprise m neurons. Interposed between these input and output layers, the hidden layer will incorporate l neurons. This structure establishes a hierarchical framework for information processing, with each layer transforming the input data towards the desired output through the application of weights, biases, and activation functions. The connection weight matrix w between the input and hidden layers is defined as follows:

$$ {\varvec{w}} = \left( {\begin{array}{*{20}c} {w_{11} } & \ldots & {w_{1n} } \\ \vdots & \ddots & \vdots \\ {w_{l1} } & \cdots & {w_{ln} } \\ \end{array} } \right)_{l \times n} $$
(1)

where wij is the weight between the jth neuron in the input layer and the ith neuron in the hidden layer. Similarly, the connection weight matrix β between the neurons in the hidden layer and the output layer is described as:

$$ {\varvec{\beta}} = \left( {\begin{array}{*{20}c} {\beta_{11} } & \ldots & {\beta_{1m} } \\ \vdots & \ddots & \vdots \\ {\beta_{l1} } & \cdots & {\beta_{lm} } \\ \end{array} } \right)_{l \times m} $$
(2)

The bias vector b of the hidden-layer neurons is:

$$ {\varvec{b}} = \left( {\begin{array}{*{20}c} {b_{1} } \\ \vdots \\ {b_{l} } \\ \end{array} } \right)_{l \times 1} $$
(3)

The input matrix X is defined as:

$$ {\varvec{X}} = \left( {\begin{array}{*{20}c} {x_{11} } & \ldots & {x_{1Q} } \\ \vdots & \ddots & \vdots \\ {x_{n1} } & \cdots & {x_{nQ} } \\ \end{array} } \right)_{n \times Q} $$
(4)

The output matrix Y is defined as:

$$ {\varvec{Y}} = \left( {\begin{array}{*{20}c} {y_{11} } & \ldots & {y_{1Q} } \\ \vdots & \ddots & \vdots \\ {y_{m1} } & \cdots & {y_{mQ} } \\ \end{array} } \right)_{m \times Q} $$
(5)

Accordingly, the network output matrix T is constructed as follows:

$$ {\varvec{T}} = \left( {\begin{array}{*{20}c} {{\varvec{t}}_{1} } & \cdots & {{\varvec{t}}_{Q} } \\ \end{array} } \right)_{m \times Q} ,{\varvec{t}}_{j} = \left( {\begin{array}{*{20}c} {t_{1j} } \\ \vdots \\ {t_{mj} } \\ \end{array} } \right)_{m \times 1} = \left( {\begin{array}{*{20}c} {\sum\limits_{i = 1}^{l} {\beta_{i1} g({\varvec{w}}_{i} {\varvec{x}}_{j} + b_{i} )} } \\ \vdots \\ {\sum\limits_{i = 1}^{l} {\beta_{im} g({\varvec{w}}_{i} {\varvec{x}}_{j} + b_{i} )} } \\ \end{array} } \right)_{m \times 1} (j = 1,2, \cdots ,Q) $$
(6)

where \({\varvec{w}}_{i} = \, [w_{i1} ,w_{i2} , \cdots ,w_{in} ]\); \({\varvec{x}}_{j} = \, [x_{1j} ,x_{2j} , \cdots ,x_{nj} ]^{T}\). g(x) is the activation function.

Equation (6) is rewritten as:

$$ \user2{H\beta } = {\varvec{T}}^{\prime } $$
(7)

where Tʹ represents the transposition of T; H represents the hidden layer output matrix, and it is presented as:

$$ \begin{gathered} {\varvec{H}}({\varvec{w}}_{1} ,{\varvec{w}}_{2} , \cdots ,{\varvec{w}}_{l} ,b_{1} ,b_{2} , \cdots ,b_{l} ,{\varvec{x}}_{1} ,{\varvec{x}}_{2} , \cdots ,{\varvec{x}}_{Q} ) = \hfill \\ \left( {\begin{array}{*{20}c} {g({\varvec{w}}_{1} \cdot {\varvec{x}}_{1} + b_{1} )} & {g({\varvec{w}}_{2} \cdot {\varvec{x}}_{1} + b_{2} )} & \cdots & {g({\varvec{w}}_{l} \cdot {\varvec{x}}_{1} + b_{l} )} \\ \vdots & \vdots & \ddots & \vdots \\ {g({\varvec{w}}_{1} \cdot {\varvec{x}}_{Q} + b_{1} )} & {g({\varvec{w}}_{2} \cdot {\varvec{x}}_{Q} + b_{2} )} & \cdots & {g({\varvec{w}}_{l} \cdot {\varvec{x}}_{Q} + b_{l} )} \\ \end{array} } \right)_{Q \times l} \hfill \\ \end{gathered} $$
(8)

ELM can learn Q distinct samples with arbitrarily small error32,35, since it assigns random values to the hidden nodes and computes the output weights from the matrix H, even when the number of hidden neurons is smaller than the number of distinct samples21. Two theorems underpin the ELM model:

Theorem 1

For a given FFNN with l additive hidden nodes and an activation function g(x) that is infinitely differentiable in any interval, H is invertible and \(\left\| {\user2{H\beta - T}^{\prime } } \right\| = 0\)32,35.

Theorem 2

Given any small positive error ε > 0 and an activation function g(x) that is infinitely differentiable in any interval, there exists l ≤ Q such that, for Q distinct input vectors drawn from any continuous probability distribution, \(\left\| {{\varvec{H}}_{Q \times l} {\varvec{\beta}}_{l \times m} - \user2{T^{\prime}}} \right\| < \varepsilon\) holds with probability one32,35.

Based on Theorem 1, we can derive:

$$ \sum\limits_{j = 1}^{Q} {\left\| {{\varvec{t}}_{j} - {\varvec{y}}_{j} } \right\|} = 0 $$
(9)

where \({\varvec{y}}_{j} = [y_{1j} ,y_{2j} , \cdots ,y_{mj} ]^{T}\) (j = 1, 2, ···, Q).

In many cases, the number of hidden neurons is smaller than the number of training samples Q. Thus, according to Theorem 2:

$$ \sum\limits_{j = 1}^{Q} {\left\| {{\varvec{t}}_{j} - {\varvec{y}}_{j} } \right\|} < \varepsilon $$
(10)

ELM randomly assigns w and b, which remain unchanged during training. The output weight β is then obtained as the least-squares solution of:

$$ \mathop {\min }\limits_{\beta } \left\| {\user2{H\beta } - {\varvec{T}}^{\prime } } \right\| $$
(11)

and the solution is:

$$ \user2{\beta = H}^{ + } {\varvec{T}}^{\prime } $$
(12)

where H+ represents the Moore–Penrose generalized inverse of the matrix H33.
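To make Eqs. (1)–(12) concrete, the following is a minimal NumPy sketch of ELM training and prediction, assuming a sigmoid activation function g and uniform random initialization. Variable names mirror the notation above, but the code is illustrative rather than the original study's implementation.

```python
import numpy as np

def elm_train(X, T, l, seed=0):
    """Train an ELM. X: (n, Q) inputs; T: (m, Q) targets; l: hidden neurons."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    w = rng.uniform(-1.0, 1.0, size=(l, n))     # Eq. (1): random input weights
    b = rng.uniform(-1.0, 1.0, size=(l, 1))     # Eq. (3): random hidden biases
    H = (1.0 / (1.0 + np.exp(-(w @ X + b)))).T  # Eq. (8): hidden output, (Q, l)
    beta = np.linalg.pinv(H) @ T.T              # Eq. (12): beta = H+ T'
    return w, b, beta

def elm_predict(X, w, b, beta):
    H = (1.0 / (1.0 + np.exp(-(w @ X + b)))).T
    return (H @ beta).T                         # (m, Q) estimated outputs

# Toy usage: 5 inputs (Tmax, Tmin, S, RH, Rs), 1 output (Rn), 20 hidden neurons.
rng = np.random.default_rng(1)
X, T = rng.random((5, 200)), rng.random((1, 200))
w, b, beta = elm_train(X, T, l=20)
rn_hat = elm_predict(X, w, b, beta)
```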

Artificial neural networks

The process of using an ANN model is mainly divided into two phases: training and testing. The training phase includes variable selection, data segmentation, and normalization. Within the network, neurons are connected by transfer functions, forming a pathway for information flow. During training, the network propagates the inputs forward and compares the outputs to the target values; if the error exceeds the expected threshold, the error signal is propagated backward, allowing the network to adjust the weights and thresholds of the neurons in each layer and thereby reduce the error. By iteratively applying this backpropagation procedure, the ANN minimizes the error between prediction and ground truth, yielding the best attainable fit14. Many studies utilize the ANN model with the backpropagation learning algorithm (BPNN)36,37. However, the iterative search of BPNN for optimal parameters may converge to local rather than global optima. Genetic algorithms can mitigate this issue through their capacity for extensive global search38. As a type of evolutionary algorithm, the genetic algorithm simulates natural selection and genetic mechanisms to search the solution space rapidly and broadly; it is less prone to becoming trapped in local optima and can exploit its inherent parallelism for distributed computing, enhancing solving speed. Therefore, the backpropagation ANN model was further optimized by incorporating the genetic algorithm (GANN). The flowchart detailing the GANN model can be found in Fig. 3.
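As a hedged illustration of this hybrid scheme, the sketch below uses a simple real-coded genetic algorithm (binary tournament selection, arithmetic crossover, Gaussian mutation, elitism) to search the weight space of a one-hidden-layer network. Population size, mutation rate, and network size are arbitrary choices, and in a full GANN the returned weights would typically serve as the starting point for backpropagation fine-tuning.

```python
import numpy as np

def mlp_forward(theta, X, n_in, n_hid):
    """Decode a flat parameter vector into a 1-hidden-layer MLP and run it."""
    i = n_hid * n_in
    W1 = theta[:i].reshape(n_hid, n_in)
    b1 = theta[i:i + n_hid]
    W2 = theta[i + n_hid:i + 2 * n_hid]
    b2 = theta[-1]
    h = np.tanh(W1 @ X.T + b1[:, None])        # hidden layer, (n_hid, Q)
    return W2 @ h + b2                         # (Q,) network outputs

def ga_optimize(X, y, n_hid=10, pop=50, gens=100, seed=0):
    """Genetic search over the MLP weight vector (fitness = training MSE)."""
    rng = np.random.default_rng(seed)
    n_in = X.shape[1]
    dim = n_hid * n_in + 2 * n_hid + 1
    P = rng.normal(0.0, 0.5, size=(pop, dim))
    mse = lambda t: np.mean((mlp_forward(t, X, n_in, n_hid) - y) ** 2)
    for _ in range(gens):
        fit = np.array([mse(t) for t in P])
        children = [P[np.argmin(fit)].copy()]            # elitism
        while len(children) < pop:
            a, b = rng.integers(0, pop, 2), rng.integers(0, pop, 2)
            p1 = P[a[np.argmin(fit[a])]]                 # tournament selection
            p2 = P[b[np.argmin(fit[b])]]
            alpha = rng.random()
            child = alpha * p1 + (1.0 - alpha) * p2      # arithmetic crossover
            mask = rng.random(dim) < 0.1                 # mutation probability
            children.append(child + mask * rng.normal(0.0, 0.1, dim))
        P = np.array(children)
    fit = np.array([mse(t) for t in P])
    return P[np.argmin(fit)]   # best weights found; BP would fine-tune these
```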

Random forests

The RF algorithm, utilized in this study, is a supervised ensemble learning method that integrates multiple classification or regression trees. Building upon the bagging model, RF adds an extra layer of randomness by drawing bootstrap samples of the training data to grow independent decision trees, as depicted in Fig. 3. Each tree independently produces a regression prediction, and the final output of the RF model is the average of these predictions. By ensembling many trees in this way, the RF model offers more robust performance than any single classifier or a typical ensemble of classifiers39. This robustness allows RF to handle high-dimensional regression tasks and complex nonlinear problems, and the averaging across trees also reduces the risk of overfitting. For comprehensive insights into the RF model and its applications, references such as Breiman, Karimi et al., and Matin and Chelgani can be consulted39,40,41.
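A minimal scikit-learn sketch of this bagged-regression scheme follows; the placeholder arrays stand in for the normalized station predictors (Tmax, Tmin, S, RH, Rs) and target Rn, and the hyper-parameter values are illustrative rather than those of the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X_train, y_train = rng.random((3650, 5)), rng.random(3650)  # placeholder 2000-2009
X_test = rng.random((1825, 5))                              # placeholder 2010-2014

# Each tree is grown on a bootstrap sample with a random feature subset per
# split; the forest's prediction is the average of the individual trees.
rf = RandomForestRegressor(n_estimators=500, max_features="sqrt", random_state=0)
rf.fit(X_train, y_train)
rn_pred = rf.predict(X_test)    # final output = mean of per-tree regressions
```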

For an ensemble of classifiers \(h_{1} ({\varvec{X}})\),\(h_{2} ({\varvec{X}})\), …,\(h_{k} ({\varvec{X}})\), and with the training set drawn at random from the distribution of the random vector Y, X, the margin function between the two can be represented by:

$$ mg({\varvec{X}},Y) = av_{k} I(h_{k} ({\varvec{X}}) = Y) - \mathop {max}\limits_{j \ne Y} av_{k} I(h_{k} ({\varvec{X}}) = j) $$
(13)

where \(I( \cdot )\) denotes the indicator function. The margin measures the extent to which the average number of votes at X, Y for the right class exceeds the average vote for any other class. The generalization error of the RF model (PE*) is estimated as:

$$ PE^{*} = P_{{{\varvec{X}},Y}} (mg({\varvec{X}},Y) < 0) $$
(14)

When the number of decision trees is large, the following two theorems apply:

Theorem 3

As the number of decision trees increases, for almost surely all sequences \({\varvec{\theta}}_{1} ,{\varvec{\theta}}_{2} , \cdots ,{\varvec{\theta}}_{k}\), \(PE^{*}\) converges to:

$$ P_{{{\varvec{X}},Y}} \left( {P_{\theta } \left( {h({\varvec{X}},\theta ) = Y} \right) - \mathop {\max }\limits_{j \ne Y} P_{\theta } \left( {h({\varvec{X}},\theta ) = j} \right) < 0} \right) $$
(15)

This theorem shows that RF does not overfit as more trees are added; rather, its generalization error converges to a limiting value.

Theorem 4

The maximum generalization error is estimated as follows:

$$ PE^{*} \le \frac{{\overline{\rho } (1 - s^{2} )}}{{s^{2} }} $$
(16)

where \(\overline{\rho }\) represents the mean correlation between individual classifiers and s indicates the strength of the set of classifiers. Theorem 4 reveals that the upper bound of the generalization error is reduced by increasing the strength and decreasing the correlation.

Generalized regression neural network

The GRNN was first introduced by Specht42 and has been distinguished for its ability to approximate nonlinear mapping functions from training data. A GRNN typically consists of four layers: the input layer, pattern layer, summation layer, and output layer, as depicted in Fig. 343. The pattern and summation layers are unique to the GRNN, with the radial basis function serving as the activation function within the network. This structure underpins the fundamental principle of the GRNN, which is nonlinear regression. Although the GRNN shares some similarities with the common nonlinear multilayer feedforward network model (BPNN), the two models diverge in their underlying operations. A significant difference lies in the GRNN's one-pass, non-iterative estimation, in contrast to the iterative training procedure of the BPNN. Consequently, the GRNN avoids the local minima problems that commonly challenge the BPNN33,42,44. Overall, the flexibility and non-iterative nature of the GRNN position it as an effective tool for complex nonlinear modeling, with particular strengths in overcoming certain limitations of traditional neural network techniques. The basic formula of the GRNN model is given in Eq. (17).

$$ E\left( {y|{\varvec{X}}} \right) = \frac{{\int_{ - \infty }^{\infty } {yf({\varvec{X}},y)\,dy} }}{{\int_{ - \infty }^{\infty } {f({\varvec{X}},y)\,dy} }} $$
(17)

where x and y represent the input and output variables, respectively, and \(f(x,y)\) is their joint probability density function. \(E(y|{\varvec{X}})\) is the conditional mean of y given X. The predicted value is calculated from the following formula43:

$$ y_{i} = \frac{{\sum\limits_{i = 1}^{n} {h_{i} w_{ij} } }}{{\sum\limits_{i = 1}^{n} {h_{i} } }} $$
(18)

where wij is the target output associated with input vector xi and output neuron j; \(h_{i} = \exp ( - \frac{{D_{i}^{2} }}{{2\sigma^{2} }})\) is the output of the ith pattern-layer neuron; \(D_{i}^{2} = (x - u_{i} )^{T} (x - u_{i} )\) is the squared distance between the input vector x and the training vector ui; and σ is the width coefficient of the Gaussian function.
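Equations (17) and (18) translate directly into a few lines of code. The following NumPy sketch is a minimal illustration in which σ is a single shared width parameter whose value is purely illustrative.

```python
import numpy as np

def grnn_predict(U, w, X_new, sigma=0.5):
    """GRNN per Eq. (18): Gaussian-weighted average of training targets.
    U: (Q, n) training vectors u_i; w: (Q,) targets w_i; X_new: (P, n)."""
    # Pattern layer: D_i^2 and h_i = exp(-D_i^2 / (2 sigma^2)) for every pair.
    d2 = ((X_new[:, None, :] - U[None, :, :]) ** 2).sum(axis=2)   # (P, Q)
    h = np.exp(-d2 / (2.0 * sigma ** 2))
    # Summation layer: numerator sum(h_i w_i), denominator sum(h_i); Eq. (18).
    return (h @ w) / h.sum(axis=1)

# Toy usage with placeholder data.
rng = np.random.default_rng(0)
U, w = rng.random((200, 5)), rng.random(200)
print(grnn_predict(U, w, rng.random((3, 5))))
```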

Model training and testing

To estimate Rn, the models took daily Tmax, Tmin, RH, S, and Rs as their key input variables. We allocated the data spanning 2000 to 2009, two-thirds of the complete dataset, for training. The subsequent data from 2010 to 2014, the remaining third, served as the test set. For enhanced model performance, all variables were normalized to mitigate disparities arising from their inherent scales. The normalization formula is given in the following equation34:

$$ x_{{{{\rm norm}}}} = \frac{{x_{0} - x_{\min } }}{{x_{\max } - x_{\min } }} $$
(19)

where xnorm, x0, xmin, and xmax are the normalized, original, minimum, and maximum values of the variable x, respectively.
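A minimal sketch of this preprocessing step is given below, with placeholder arrays standing in for the station data. Applying the training-period minima and maxima to the test set, as done here, is a common choice to avoid information leakage, though the paper does not state which statistics were used.

```python
import numpy as np

def minmax_normalize(x, x_min=None, x_max=None):
    """Eq. (19): scale each column of x to [0, 1]."""
    x_min = x.min(axis=0) if x_min is None else x_min
    x_max = x.max(axis=0) if x_max is None else x_max
    return (x - x_min) / (x_max - x_min), x_min, x_max

# Placeholder daily records for 2000-2014 (columns: Tmax, Tmin, S, RH, Rs, Rn).
years = np.repeat(np.arange(2000, 2015), 365)
data = np.random.default_rng(0).random((years.size, 6))

# Chronological split: 2000-2009 (two-thirds) trains, 2010-2014 tests.
train, test = data[years <= 2009], data[years >= 2010]
train_norm, x_min, x_max = minmax_normalize(train)
test_norm, _, _ = minmax_normalize(test, x_min, x_max)  # reuse training stats
```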

Performance evaluation

We utilized several statistical accuracy indicators to evaluate each model's accuracy on the testing data: the root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe coefficient (NS), and relative error. The first three are defined by the following equations45:

$$ RMSE = \sqrt {\frac{1}{m}\sum\limits_{i = 1}^{m} {\left( {Y_{i} - X_{i} } \right)^{2} } } $$
(20)
$$ MAE = \frac{1}{m}\sum\limits_{i = 1}^{m} {\left| {Y_{i} - X_{i} } \right|} $$
(21)
$$ \hbox{NS} = 1 - \frac{{\sum\limits_{i = 1}^{m} {(Y_{i} - X_{i} )^{2} } }}{{\sum\limits_{i = 1}^{m} {(X_{i} - \overline{X} )^{2} } }} $$
(22)

where Xi and Yi are the ith observed and estimated values, respectively, \(\overline{X}\) is the mean of the observed values, and m is the size of the test sample.
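For reference, Eqs. (20)–(22) map directly onto the following NumPy functions (a sketch; array names are illustrative):

```python
import numpy as np

def rmse(x_obs, y_pred):                      # Eq. (20)
    return np.sqrt(np.mean((y_pred - x_obs) ** 2))

def mae(x_obs, y_pred):                       # Eq. (21)
    return np.mean(np.abs(y_pred - x_obs))

def nash_sutcliffe(x_obs, y_pred):            # Eq. (22)
    return 1.0 - np.sum((y_pred - x_obs) ** 2) / np.sum((x_obs - x_obs.mean()) ** 2)
```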

Results and discussion

The four ML models, namely ELM, GANN, RF, and GRNN, were employed to estimate Rn values across four distinct climatic zones within China. Initially, a comparative analysis was conducted to evaluate the computational time required by each model. Utilizing MATLAB R2015a software on a system equipped with an Intel Xeon E3-1230 v2 processor and 12 GB of RAM, the ELM model demonstrated a running time of approximately 1 s across the different climatic zones. In contrast, the running times for the remaining models were notably higher, registering at 296 s, 161 s, and 21 s for GANN, RF, and GRNN, respectively. ELM's efficiency can be attributed to its relatively simple learning algorithm for single-hidden-layer FFNN structures. It was anticipated that the learning speed of ELM would surpass that of traditional learning algorithms, as previously evidenced in the works of Huang et al.29,30. Consequently, ELM exhibits superiority over the other models in the speed of estimating Rn. Supporting evidence of the ELM model's faster learning speed, facilitated by the ease of random assignment of hidden node parameters, has also been documented by Huang et al.29 and Feng et al.33.

The relationships between the measured Rn values and the estimates produced by the four ML models (ELM, GANN, RF, and GRNN) for each distinct climatic zone within China are delineated in Figs. 4, 5, 6, 7. Statistical analysis reveals that the Rn estimates provided by each model exhibit significant correlations with the corresponding measured values across the various climatic zones (P < 0.001). Specifically, linear regression slopes were observed to be 0.815–0.868 for ELM, 0.815–0.870 for GANN, 0.810–0.857 for RF, and 0.812–0.845 for GRNN. Additionally, the coefficient of determination (R2) ranged from 0.838 to 0.963 for ELM, 0.836 to 0.963 for GANN, 0.809 to 0.959 for RF, and 0.812 to 0.949 for the GRNN model. The substantial agreement between the measured and estimated Rn values attests to the potential applicability of these four ML models for Rn estimation within varying climatic contexts. Nevertheless, all four models exhibited a slight tendency to underestimate Rn across the different climatic zones. Moreover, a consistent pattern emerged wherein the models demonstrated optimal performance within the temperate monsoon zone (specifically at the Beijing station), as compared to other regions. This observed regional performance disparity may be attributable to variations in the climatic conditions characterizing each individual station.

Fig. 4
figure 4

Scatter plot of the measured versus the estimated daily net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in mountain plateau zone (Lhasa station).

Fig. 5
figure 5

Scatter plot of the measured versus the estimated daily net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in temperate monsoon zone (Beijing station).

Fig. 6
figure 6

Scatter plot of the measured versus the estimated daily net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in subtropical monsoon zone (Kunming station).

Fig. 7
figure 7

Scatter plot of the measured versus the estimated daily net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in temperate continental zone (Urumqi station).

Box plots have been employed to elucidate the distribution of both observed and estimated daily Rn values generated by the four ML models, as depicted in Fig. 8. Analysis indicates that the distributional characteristics of the estimated Rn values closely mirror those of the observed Rn. However, the estimated distributions from all four models lie slightly lower than the observed distribution in each climatic zone, indicating a consistent underestimation of Rn. Moreover, a marked contrast is evident in the distribution of measured Rn across the different climatic zones. This variation can be ascribed not solely to the heterogeneous climatic conditions at each station but also to the underlying surface conditions. The influence of surface albedo on Rn is substantial, as documented by Campbell and Norman46, and the uncertainty associated with surface albedo across different underlying surfaces poses challenges to the precise determination of Rn. Such complexities may contribute to the observed error in each model's estimation of Rn within the present study.

Fig. 8
figure 8

Box plots of measured compared with the estimated daily net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in different climatic zone. (a) Mountain plateau zone. (b) Temperate monsoon zone. (c) Subtropical monsoon zone. (d) Temperate continental zone.

Figure 9 illustrates the monthly variations of both measured and estimated Rn, along with the absolute error between them, across the different climatic zones. The estimated monthly average Rn values derived from the four ML models are in substantial agreement with the measured quantities, with both exhibiting a bell-shaped pattern: lower Rn values are apparent in winter (December to February), while higher values are observed in summer (June to August). The balance between underestimation and overestimation of Rn by the different models fluctuates across the climatic zones. Within the mountain plateau zone, all ML models generally underestimate Rn throughout the year, except for RF and GRNN in May, with absolute errors ranging from – 2.01 to 0.13 MJ m−2 d−1. For the temperate monsoon zone, the four models consistently underestimate Rn throughout the year, with absolute errors spanning – 2.02 to – 0.22 MJ m−2 d−1. In the subtropical monsoon zone, the models overestimate Rn in summer while underestimating it in other seasons, with absolute errors varying between – 1.00 and 0.72 MJ m−2 d−1. Conversely, within the temperate continental zone, all models overestimate Rn in winter while underestimating it in other seasons, with absolute errors fluctuating between – 1.44 and 0.22 MJ m−2 d−1. These findings shed light on the intricate relationship between the models' estimations and the actual measurements of Rn, and emphasize the need to consider regional climatic variations when interpreting the results.

Fig. 9
figure 9

Comparison between measured and estimated monthly average net radiation (Rn) by extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in different climatic zone. (a) Mountain plateau zone. (b) Temperate monsoon zone. (c) Subtropical monsoon zone. (d) Temperate continental zone.

Table 2 presents the statistical parameters that detail the performance of the four ML models in estimating Rn for each climatic zone within the study area. Overall, the performance of the models is satisfactory, with Root Mean Square Error (RMSE) values ranging from 1.386 to 2.025 MJ m−2 d−1 for the ELM model, 1.360 to 2.122 MJ m−2 d−1 for the GANN model, 1.457 to 2.186 MJ m−2 d−1 for the RF model, and 1.487 to 2.126 MJ m−2 d−1 for the GRNN model. The ELM and GANN models slightly outperformed the RF and GRNN models across the four climatic zones, according to the RMSE statistics. Notably, the lowest RMSE values for the ELM and GANN models were recorded in the subtropical monsoon zone, at 1.386 and 1.360 MJ m−2 d−1, respectively, while the highest values were observed in the mountain plateau zone. Mean Absolute Error (MAE) values ranged from 1.010 to 1.601 MJ m−2 d−1 for the ELM model, 1.010 to 1.627 MJ m−2 d−1 for the GANN model, 1.068 to 1.693 MJ m−2 d−1 for the RF model, and 1.148 to 1.684 MJ m−2 d−1 for the GRNN model. The accuracy of the ELM and GANN models was found to be slightly superior to that of the RF and GRNN models, as indicated by the MAE statistics. The Nash–Sutcliffe (NS) efficiency values for the ELM and GANN models were also marginally higher than those for the RF and GRNN models, with values ranging from 0.768 to 0.926 for the ELM model, 0.760 to 0.925 for the GANN model, 0.755 to 0.917 for the RF model, and 0.756 to 0.894 for the GRNN model. The NS values for the ELM and GANN models were lowest in the mountain plateau zone, while the highest values were detected in the temperate monsoon zone. These statistical insights underscore the nuanced differences in the performance of the models across the various climatic conditions and contribute to a comprehensive understanding of their applicability in Rn estimation.

Table 2 Statistics performance (including root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe coefficient (NS)) of extreme learning machine (ELM), backpropagation neural networks optimized by genetic algorithm (GANN), random forests (RF) and generalized regression neural networks (GRNN) model in estimating daily net radiation for each climatic zone.

In a comparative analysis, the performances of the ELM and GANN models in predicting Rn are demonstrably superior to those of the RF and GRNN models. The ELM model, characterized by its approach of randomly selecting hidden nodes and determining output weight through specific analytical methods, diverges from traditional ANN models that require human intervention for optimal parameter determination14. This characteristic may contribute to its enhanced performance compared to conventional artificial neural networks. Prior research has corroborated that ELM can yield higher prediction accuracy for various hydrological and meteorological variables such as streamflow47, evapotranspiration48, solar radiation21, and soil temperature14. Although traditional neural network models offer flexibility in handling nonlinear relationships, their reliance on the backpropagation method to obtain network weights introduces a susceptibility to local minimum issues during the learning process49,50,51. To address the shortcomings associated with backpropagation in traditional ANN models, this study employed a genetic algorithm to enhance the neural networks. As a result, the GANN model's performance in estimating Rn was found to be superior to that of the RF and GRNN models. In summary, the ELM and GANN models exhibit higher accuracy in Rn estimates compared to the RF and GRNN models, with the ELM model particularly recommended for assessing Rn across various climatic zones in China, owing to its remarkable speed and performance. The inferior stability in the RF model's simulations may be attributable to its lack of hyper-parameter tuning and its method of randomly selecting training data to construct classifiers40. Furthermore, a study by Yamaç52 emphasized that RF models might not perform as effectively as other artificial intelligence algorithms in specific applications, such as sugar beet evapotranspiration estimation.

A salient limitation of the four ML models is the lack of transparency in their underlying physical principles. In contrast to physically-based models that articulate explicit mathematical expressions, these ML models function as data-driven black-box systems. Consequently, their adaptability can become constrained in scenarios where the requisite training data are unavailable43,53. Despite this limitation, the inherent compatibility of ML models allows them to be hybridized with other algorithms, offering the potential to enhance the precision of target variable estimation. A notable example can be found in the work of Nahvi et al.54, who applied a self-adaptive evolutionary approach to optimize the hidden module of the ELM. This improvement led to an increase in the daily soil temperature estimation accuracy.

Furthermore, in the context of Rn estimation, ML models often consider a multitude of meteorological variables as input. The selection of these variables requires careful consideration and analysis of their various combinations to determine the optimal set for each specific study area. Such a targeted approach is not only vital for Rn estimation but is also of substantial importance for data-scarce regions where variable selection becomes critical21.

Accurate estimation of daily net radiation plays a pivotal role in optimising water management and crop productivity across China's diverse agricultural landscapes. Our study demonstrates the applicability and effectiveness of four machine learning models in predicting daily net radiation across different climatic zones in China, and recommends the ELM as the preferred model owing to its accuracy and computational speed. However, all models in our study were found to slightly underestimate Rn; future work focusing on the integration of real-time data into the model architecture and the optimisation of model parameters using heuristic optimisation methods may increase model adaptability and accuracy in changing environments55,56.

Conclusions

In the investigation of estimating daily Rn across four distinct climatic zones in China, four ML models, namely ELM, GANN, RF, and GRNN, were employed. Utilizing conventional meteorological variables alongside net radiation data, the models were trained and tested to assess their efficacy. The statistical analysis revealed satisfactory estimations for daily Rn across the models. The RMSE ranged from 1.392 to 2.025 MJ m−2 d−1 for ELM, 1.360 to 2.122 MJ m−2 d−1 for GANN, 1.457 to 2.186 MJ m−2 d−1 for RF, and 1.487 to 2.126 MJ m−2 d−1 for GRNN. The MAE spanned from 1.010 to 1.601 MJ m−2 d−1, 1.010 to 1.627 MJ m−2 d−1, 1.068 to 1.603 MJ m−2 d−1, and 1.148 to 1.684 MJ m−2 d−1, respectively, while the NS ranged from 0.768 to 0.926, 0.760 to 0.925, 0.755 to 0.917, and 0.756 to 0.894 for the four models.

Comparatively, the distribution of estimated Rn closely aligned with the observed values, with minimal distributional discrepancies identified across the climatic zones. Upon an overall examination, the ELM and GANN models demonstrated superiority over the RF and GRNN models in the precision of Rn estimations. Specifically, the ELM model emerged as highly recommended for Rn estimation across different climatic zones in China, attributed to its combination of remarkable computational efficiency and performance accuracy.