Abstract
Achieving high-strength concrete (HSC) with sustainable supplementary cementitious materials (SCMs) remains a significant challenge in the construction industry. Although glass powder has shown promise as a partial cement substitute, its specific impact on HSC growth is still unclear. This study aims to evaluate the compressive strength (CS) of high strength glass-powder concrete (HSGPC) using machine learning (ML) models and enhance predictive accuracy through hybrid optimization techniques. A dataset comprising 598 points was compiled, considering cement, glass powder, aggregates, water, superplasticizer, and curing days as key input parameters. Three standalone ML models—K-Nearest Neighbors (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGB)—were trained, with RF achieving R² = 0.963 and XGB achieving R² = 0.946 on the test set. To further enhance performance, XGB was optimized using Particle Swarm Optimization (PSO), Firefly Algorithm (FA), and Grey Wolf Optimizer (GWO). Among these, XGB-GWO demonstrated the highest accuracy, with R² improving to 0.991 and MSE decreasing significantly from 83.95 to 14.42, resulting in an 82.82% error reduction. SHAP, PDP, and ICE analyses identified superplasticizer dosage, curing days, and coarse aggregate as the most influential parameters affecting compressive strength (CS). PDP and ICE validated these findings, showing reduced strength gains beyond 600 kg/m³ of cement and a decline beyond 800 kg/m³ of coarse aggregate. This study highlights the potential of ML-driven optimization for sustainable concrete design, offering an efficient, data-driven approach to optimizing material proportions for high-strength, eco-friendly concrete.
Similar content being viewed by others
Introduction
In concrete, Ordinary Portland Cement (OPC) has been largely replaced by supplemental cementitious materials (SCMs). European cement manufacturers, for example, have replaced 25% of OPC clinkers during the past 20 years with alternative SCMs1,2. Fly ash caught the interest of researchers as a new and potential cementitious material because of its pozzolanic qualities and the existence of amorphous silica and alumina. According to recent research, replacing 40–60% of the calcium hydroxide (Ca(OH)2) generated during cement hydration, which results in the formation of more calcium silicate hydrate (C-S-H) and calcium aluminate hydrate (C-A-S-H) gel, can gradually increase the strength and durability of concrete3,4,5. Nonetheless, various studies indicate that the potential for substituting cement with elevated fly ash content may be constrained by the early strength development of concrete6,7.
The researchers utilized various methods to mitigate the initial reduction in CS of concrete comprising high volumes of fly ash, including the addition of lime to enhance the solubility of fly ash and accelerate the pozzolanic reaction, as well as making fly ash particles finer to increase their reactivity6,8. The utilization of industrial by-products, including blast furnace slag and silica fume, has gained popularity due to their substantial amorphous silica content, which enhances the early strength of such concrete9. Recycled waste glass, primarily soda-lime glass, is a non-biodegradable solid that has gained increased attention recently as a sustainable alternative to Portland cement in building materials. It has become widely recognized that pulverized glass debris from bottles may include significant amounts of silica and alumina, which seem to be non-crystalline (high amorphous silica concentration)10,11.
According to the preliminary work by Shao et al.12, glass powder finely crushed to less than 38 μm may exhibit pozzolanic activity. Due to the pozzolanic reaction of the glass powder, concrete with 30% glass powder as SCM exhibited reduced CS before 28 days but enhanced strength at 90 days. Moreover, like other SCMs, glass powder with long curing period13,14 and smaller particle size is more pozzolan12,15. Dyer and Dhir16 found that the leaching of alkalis from glass particles accelerated cement hydration when glass powder was utilized as a replacement. However, the amount of free alkalis is inadequate to compensate for the hydration and early-age strength reduction caused by cement dilution. Du and Tan17 as well as Schwarz et al.18 achieved similar findings. Moreover, when 10% glass powder was used in place of cement in concrete, Schwarz et al.10 examined the durability properties and found that there was a decrease in both water absorption and chloride penetration. However, at every testing interval up to 90 days, its CS was inferior to that of conventional cement concrete. Matos and Sousa-Coutinho19 studied the durability properties of masonry by substituting the 10% and 20% glass powder for cement. The replacement of glass powder decreased chloride diffusion without influencing water sorptivity or carbonation resistance. Carsana et al.20 reported that glass powder exhibited superior long-term pozzolanic activity compared to fly ash and natural pozzolans over a duration of seven years at a concentration of 30%. The literature indicates that glass powder can substitute a maximum of 30% of the cement component in mortar or concrete. Furthermore, glass powder has been utilized as SCM in concrete in substantial amounts (up to 60% by weight of OPC) in certain prior studies16,21. Such concrete was found to have both fresh and hardened characteristics. While the strength at 28 days increased, especially at 30% replacement, the strength at younger ages (less than 7 days) decreased steadily with the amount of glass powder. Additionally, the concrete was denser and more compacted due to the microstructure in the interfacial transition zone (ITZ). As a result, a significant increase in durability qualities has been noted.
It is evident that compressive strength (CS) is the key mechanical property. CS must be determined by compression tests carried out in compliance with applicable standards under particular curing times and ambient circumstances. All the constituents such as OPC, glass powder, water, fine and coarse aggregate show strong correlations with the CS of GPC22,23. Extensive experiments requiring a large investment of time, money, and resources are necessary if researchers hope to comprehend the interaction between CS and other elements. As a result, to represent the obtained data, researchers usually turn to using straightforward empirical formulae. However, these formulas are typically inadequate. This is because the correlations between variables and CS are complex and nonlinear and cannot be adequately captured by a single universal Eqs24,25.. Machine learning (ML) approaches are increasingly being used to predict the mechanical characteristics of materials due to artificial intelligence’s remarkable nonlinear fitting capacity26,27,28,29,30.
While a few studies have used ML techniques to predict the CS of glass powder concrete, most have been limited to normal-strength concrete and lack optimization techniques for improving predictive performance. Additionally, limited studies have explored the interpretability of these models through feature importance analysis. This study addresses these gaps by integrating three advanced metaheuristic algorithms, Particle Swarm Optimization (PSO), Firefly Algorithm (FA), and Grey Wolf Optimizer (GWO) to fine-tune the Extreme Gradient Boosting (XGB) model for accurate prediction of HSGPC compressive strength. In addition to benchmarking against baseline models (KNN, RF, and XGB), the study applies SHapley Additive Explanations (SHAP), Partial Dependence (PDP), and Individual Conditional Expectation (ICE) analyses to interpret feature effects. The study also employs Taylor diagrams for visual model comparison and robustness assessment. The key objectives include: (1) applying and comparing standalone ML models (KNN, RF, XGB); (2) improving performance via nature-inspired optimization techniques; and (3) conducting a comprehensive interpretability and statistical outlier analysis to ensure model reliability and engineering relevance.
The paper is systematically structured to ensure a clear progression of the research. Section “Research Significance”, elaborates on the research gap and the scope of the current study. Section “Research Methodology”, provides a detailed overview of data collection, dataset characteristics, and model development. Section “Results and Discussion”, presents the performance evaluation of various ML models along with an in-depth interpretability analysis. Section “Future Perspectives” outlines the future perspectives and research limitations, highlighting potential advancements and challenges. Finally, Section. “Conclusion”, summarizes the key findings and contributions of this study.
Research significance
The application of ML algorithms to predict unseen data particularly in civil engineering with significant outcomes is gaining popularity. Predictions regarding the mechanical characteristics of concrete are prevalent, however acquiring requisite data using conventional approaches necessitates costly and difficult laboratory procedures.
Recent research has employed ML models to improve the predictive accuracy of concrete properties. In this regard, Neural networks, particularly backpropagation neural networks (BPNN), have successfully predicted the CS of ceramic waste slag concrete with notable accuracy31. Similarly, ensemble models like RF and boosting have shown promise. RF excelled in predicting various attributes of fly ash-included SCC, while boosting models significantly improved CS predictions using recycled coarse aggregates (RCA) and SCMs, emphasizing the critical impact of RCA replacement rates32,33. In one study conducted by Singh et al.34, the CS of high-performance concrete was predicted using ML models including MLP, SVR, and XGB, with XGB optimized by a genetic algorithm achieving the highest accuracy across multinational datasets. Another study by Parhi et al.35 demonstrated that using the Spotted Hyena Optimization algorithm to tune XGB hyperparameters significantly improved prediction accuracy of concrete strength over traditional grid and random search methods, with SHAP analysis highlighting concrete age as the most influential factor. Additionally, support vector machines (SVM) have been explored for their potential in predicting SCC concrete properties, though smaller sample sizes might constrain their effectiveness36. This body of work illustrates the effectiveness of advanced ML models in predicting concrete behaviors under varied conditions, including high temperatures where XGB outperformed other models37.
The utilization of glass powder as a replacement for OPC in concrete has been thoroughly investigated owing to its pozzolanic properties and sustainability advantages. Kovačević et al.38 developed multiple ML models, including Gaussian Process Regression (GPR), SVM, DT, and ensemble models such as RF and Boosted Trees, to forecast glass powder concrete’s CS. With an RMSE of 3.98 MPa and a R of 0.9244, their analysis showed that the GPR model with an ARD exponential covariance function had the best predicted accuracy. Gao and Ma39 compared the predictive capabilities of response surface methodology (RSM) and ANN for estimating CS in concrete containing waste glass powder and eggshell powder. ANN outperformed RSM, achieving an R² of 0.956 for CS prediction, demonstrating the superior learning capability of deep learning approaches in handling complex material interactions. Similarly, Nassar and Sohaib40 explored the use of ensemble learning, employing a stacking approach that combines AdaBoost, SVM, XGB, and DT models. Their results demonstrated that the stacking ensemble model had the greatest prediction accuracy, closely matching actual data, when used to forecast the 28-day CS of concrete containing 10–25% glass powder as a cement substitute. Xu et al.22 advanced ML-based predictions by integrating nature-inspired optimization algorithms with traditional models. They utilized four standalone ML models (BPNN, XGB, SVR, and RF), along with hybrid models optimized using the Sparrow Search Algorithm (SSA). The SSA-XGB model achieved the highest accuracy (R² = 0.9645, RMSE = 3.064 MPa), demonstrating the significant role of cement content and curing age in determining the strength of glass powder concrete. The study also performed a feature importance analysis using PDP to explore the impact of glass powder dosage, curing age, and cement content on CS. Table 1 provides a summary of previous studies, including references, employed ML models, dataset sizes, and interpretability analyses such as SHAP and PDP.
According to Table 1, the majority of research has been restricted to normal-strength concrete and has not concentrated on applying ML approaches to forecast the CS of glass powder concrete. Furthermore, extensive interpretability analyses such as PDP, ICE, and SHAP have not been comprehensively performed to evaluate the specific influence of GP and other input variables. To address this research gap, the present study employes advanced nature-inspired optimization algorithms, including PSO, FA, and GWO, to fine-tune the XGB model for accurately predicting the CS of HSGPC. Additionally, this study benchmarks the performance of the optimized XGB model against individual ML models, including DT, RF and XGB, thereby enhancing model generalization and prediction accuracy. Furthermore, the developed models are thoroughly analyzed using explainability analysis to ensure transparency in decision-making to facilitate practical applications.
Research methodology
This study follows a structured approach to forecast the CS of HSGPC using ML models and optimization techniques. The overall methodology consists of four key stages.
-
Data collection and preprocessing.
-
Model development.
-
Performance evaluation.
-
Explainability analysis.
The dataset, compiled from published experimental studies, undergoes preprocessing to ensure data quality. Three individual ML models—KNN, RF, and XGB—are trained and evaluated, followed by optimization of XGB using PSO, FA, and GWO. Performance is assessed through R², MSE, RMSE, MAE, and MAPE metrics, with a comparative analysis between models. Additionally, explainability analysis using SHAP, PDP, and ICE provides insights into feature influence and prediction trends. The overall research methodology is illustrated in Fig. 1.
Dataset collection and characteristics
The dataset utilized in this investigation was collected from experimental studies on the mechanical characteristics of HSGPC that have already been published. The details of the sources and corresponding datasets (X) are presented in Table 2. Moreover, the complete dataset utilized in this study is attached as a supplementary file (S1-S2). The compiled dataset consists of 598 data points, each representing an experimental mix with specific material proportions and corresponding CS values.
The input parameters considered in this study include seven key mix design variables, which significantly influence the CS of HSGPC. These parameters are Cement Content ‘C’ (kg/m³), Glass Powder Content ‘GP’ (kg/m³), Fine Aggregate ‘FA’ (kg/m³), Coarse Aggregate ‘CA’ (kg/m³), Water Content ‘W’ (kg/m³), Superplasticizer Dosage ‘SP’ (kg/m³), Curing Age ‘D’ (days). The dependent variable in this study is the compressive strength ‘CS’ (MPa), measured at different curing ages. High-strength concrete (HSC) is generally defined as concrete with CS exceeding 40 MPa at 28 days. Such concrete exhibits superior mechanical properties, providing it appropriate for structural applications that demand increased durability, load-bearing capability, and sustainability. Table 3 provides a statistical overview of the dataset, highlighting key descriptive statistics such as mean, minimum, quartiles (25%, 50%, 75%), maximum, standard deviation, variance, skewness and kurtosis for each input variable and the output variable.
Moreover, the histogram plots (Fig. 2) depict the frequency distributions of each input variable and compressive strength. The cement and fine aggregate distributions show a near-normal spread, while glass powder and superplasticizer exhibit skewed distributions, suggesting varying replacement levels in different mix designs. The compressive strength histogram reveals a bimodal distribution, implying the presence of distinct strength categories in the dataset. The statistical analysis of input and output parameters is further validated by pair plot analysis (Fig. 3). It visually highlights potential correlations and distribution patterns, indicating possible nonlinear relationships between compressive strength and mixed components. Notably, cement and curing age exhibit a strong positive correlation with CS, while excessive water content shows a potential negative influence.
Multicollinearity analysis
When independent variables in a regression model show a high degree of correlation, it is referred to as multicollinearity. This occurrence may skew the predicted coefficients and make the model less interpretable. In predictive modeling, multicollinearity poses a significant challenge as it can inflate variance, leading to unstable and unreliable predictions67,68. To assess multicollinearity in the dataset, Pearson correlation coefficients were computed between all independent variables, as illustrated in Fig. 4. The heatmap shows the correlation structure between the important mix design factors that affect the HSGPC’s CS. The analysis reveals that certain variables exhibit moderate to strong correlations with each other. Notably, cement content shows a positive correlation with compressive strength (0.57), indicating its fundamental role in improving concrete strength. Similarly, glass powder exhibits a moderate positive correlation (0.34) with CS, reinforcing its efficacy as a supplementary cementitious material. Conversely, coarse aggregate demonstrates a strong negative correlation (−0.71) with compressive strength, suggesting that higher coarse aggregate content may reduce strength due to weaker interfacial bonding and increased void content within the concrete matrix. SP content also displays a notable positive correlation (0.61) with CS.
Multicollinearity is considered critically high when the correlation coefficient exceeds 0.85, as it can disrupt machine learning models by introducing redundancy and instability in parameter estimation69,70. In this study, no extreme multicollinearity was detected among input features; however, moderate correlations between certain variables indicate the need for careful feature selection and engineering to improve models’ accuracy.
Data preprocessing and splitting
The dataset was partitioned into training and testing sets in a 70:30 ratio, allocating 70% for model training and 30% for testing, to ensure effective model training and evaluation. To normalize the input variables, the Scikit-learn library’s Standard Scaler71 was used. To prevent models from being biased toward variables with bigger magnitudes, standardization converts the features to have zero mean and unit variance72.
Outliers detection and analysis
Outliers are data points that deviate significantly from the overall distribution of the dataset and may arise from experimental variability, rare cases, or data recording inconsistencies73. In this study, outlier detection was performed using the Interquartile Range (IQR) method. According to this method, any value lying below \(\:Q1-1.5\times\:IQR\) or above \(\:Q3+1.5\times\:IQR\) is classified as an outlier, where Q1 and Q3 represent the 25 th and 75 th percentiles, respectively74.
To visualize and interpret the presence of outliers across all input variables and the target output (compressive strength), a horizontal boxplot was constructed, as shown in Fig. 5. The plot highlights the IQR spread and flags outliers as black circles beyond the whiskers. Among the eight features analyzed, coarse aggregate exhibited the highest proportion of outliers (18.56%), followed by superplasticizer (14.88%), and compressive strength (11.20%). Conversely, water content exhibited no outliers, indicating a more uniform distribution.
Rather than removing or replacing these values, all outliers were retained in the data set to preserve the natural variability and full range of mix design combinations present in HSGPC. Outliers in this context may represent deliberate, high-performance formulations or boundary conditions relevant to practical concrete applications.
Models’ development
This section outlines the development and evaluation of predictive models for estimating the CS of HSGPC. Various individual machine learning models were implemented (KNN, RF, XGB) to assess their predictive accuracy and efficiency. To further enhance predictive performance, a hybrid approach was developed by integrating XGB with nature-inspired optimization algorithms, namely PSO, FA, and GWO whose details and their effectiveness are discussed in the following subsections.
K-Nearest neighbor
The KNN algorithm is a non-parametric, instance-based learning method that forecast the target variable based on the similarity of input features to its nearest neighbours in the dataset75. By analysing the nearest “k” data points and calculating their average response, KNN calculates the CS of additional data points. Distance measures like the Manhattan, Minkowski, and Euclidean distances are commonly used to quantify how similar two data points are76.
In this research, the Minkowski distance was selected as the similarity metric, with the power parameter p = 2, making it equivalent to the Euclidean distance. The optimal value of ‘k’ was determined based on empirical testing, and k = 5 was chosen as it provided the best trade-off between bias and variance, preventing overfitting while maintaining accurate predictions. The input features included cement content, glass powder percentage, fine and coarse aggregate proportions, water content, SP dosage, and curing age. Figure 6 illustrating the KNN approach, highlighting the five nearest neighbours (yellow) used to predict the CS of a new concrete mix (red).
Random forest
RF is an ensemble learning technique that builds multiple decision trees and aggregates their predictions to improve accuracy and reduce overfitting77. Given the variability of concrete mix parameters, especially coarse aggregates, RF is well-suited to predict compressive strength (CS). In this study, the RF Regressor employed 100 decision trees (n_estimators = 100), each trained on a random subset of data and input variables (bootstrap aggregation or bagging), to enhance generalization and minimize variance78. Final predictions were determined by averaging outputs from all trees, improving model stability and precision. A fixed random seed (random_state = 42) ensured reproducibility of results.
Extreme gradient boosting
Extreme Gradient Boosting (XGB) is an advanced gradient boosting algorithm known for its fast execution and accuracy79 developed by Chen and Guestrin80. It constructs sequential decision trees, each correcting the errors of the preceding one. Like RF, XGB effectively handles data with high variability by iteratively focusing on regions where previous models underperformed81. In this study, the XGB regressor was configured with 100 decision trees (n_estimators = 100) and a learning rate of 0.1, striking a balance between complexity and efficiency. XGB uses gradient boosting to sequentially learn from previous prediction errors and incorporates regularization techniques (L1 and L2) to prevent overfitting. Final predictions for CS were derived by aggregating weighted outputs from all trees.
Grey Wolf optimizer
The Grey Wolf Optimizer (GWO), proposed by Mirjalili in 201482, is a nature-inspired metaheuristic known for its effectiveness, simplicity, and minimal parameter tuning83. Inspired by the hierarchical social structure and hunting strategies of grey wolves, GWO optimizes by iteratively updating three best solutions—alpha (α), beta (β), and delta (δ)—that guide the search process84,85. Other members (omega wolves, ω) adapt their positions based on these leading solutions, thus converging toward an optimal result. Due to its robustness, GWO has been applied widely in engineering, machine learning, bioinformatics, power flow control, scheduling, and cryptography86,87,88. Figure 7 visually represents the wolves’ movement patterns toward the optimal solution.
Particle swarm optimization
PSO, inspired by swarm behaviours observed in nature, is a popular metaheuristic algorithm proposed by Eberhart and Kennedy in 199589,90. It utilizes the collective intelligence of particles (candidate solutions) to effectively explore and exploit the search space. Each particle updates its position based on its personal best solution (pBest) and the global best solution (gBest) identified by the swarm, guided by inertia weight, cognitive, and social parameters91,92. PSO is well-suited to dynamic optimization problems due to its minimal constraints on objective function continuity, straightforward implementation, and computational efficiency93. Figure 8 illustrates how particles iteratively move toward the optimal solution through swarm intelligence.
Firefly algorithm
FA is a nature-inspired optimization technique simulating the bioluminescent attraction behaviours of fireflies, effectively balancing global and local searches94,95. In FA, less bright fireflies move toward brighter ones based on their brightness (fitness), intelligently guiding the search process toward optimal solutions96. The attraction intensity diminishes with distance, ensuring diverse exploration while preventing stagnation at local optima through random movements if no brighter neighbours are found. Due to its capability in addressing high-dimensional and nonlinear optimization problems, FA has been successfully applied in engineering design, machine learning, and multi-criteria decision-making96. Figure 9 visually illustrates the FA mechanism, depicting fireflies moving toward the brightest solutions.
Evaluation metrics
-
R² (Coefficient of Determination): Determines the proportion of the dependent variable’s variance that the model can explain, ranging from -∞ to 1, where 1 indicates a perfect fit.
-
RMSE (Root Mean Squared Error): Computes the root mean square deviation between the input and output data, represented as the square root of the average squared difference, with units matching the target variable.
-
NRMSE (Normalized RMSE): Normalizes RMSE by the range or mean of the target variable, providing a unitless measure of error.
-
MSE (Mean Squared Error): Computes the mean squared deviation between the input and output data.
-
MAE (Mean Absolute Error): Assesses the mean absolute discrepancies between expected and actual values, with units matching the target variable.
-
MAPE (Mean Absolute Percentage Error): Denotes the mean percentage discrepancy between forecasted and actual values, providing a unitless measure.
Results and discussion
Performance evaluation
For both training and testing phase, R2 values were used to evaluate the forecasting efficacy of the generated models, which included KNN, RF, and XGB. The findings, which are shown in Fig. 10 (a–c), demonstrate how well each model can learn and generalize the link between compressive strength and mix design parameters. The KNN model exhibited an R² value of 0.783 on the training dataset, indicating that it was able to capture a substantial portion of the data variability. However, its performance on the test dataset dropped to 0.778, signifying a moderate reduction in generalization capability.
Moreover, both RF and XGB demonstrated excellent predictive capabilities, achieving high R² values on both training and testing datasets. RF attained an R² of 0.995 (train) and 0.963 (test), while XGB recorded 0.999 (train) and 0.946 (test). Their high-test performance highlights their ability to generalize well, benefiting from ensemble learning and boosting techniques that reduce variance and improve model stability. As seen in Fig. 10 (b, c), both models align closely with actual CS values, with minimal deviations beyond the ± 25% error margin.
The models’ detailed evaluation metrics, such as MSE, RMSE, MAE, and MAPE for both training and testing datasets, are shown in Table 4. The RF model demonstrated the best predictive accuracy, with the lowest MSE (8.118 train, 56.851 test), RMSE (2.849 train, 7.540 test), and MAE (1.903 train, 5.330 test). XGB came in second regarding MSE, with slightly higher error values but still strong generalization in terms of other metrics. The KNN model, however, showed the highest error rates, particularly in testing (MSE: 343.589, RMSE: 18.536, MAE: 10.512), reflecting its lower reliability compared to the tree-based models. The MAPE values confirm this trend, with XGB achieving the lowest error percentage, followed by RF, while KNN demonstrated the highest deviation. These results reinforce the effectiveness of ensemble-based approaches (RF and XGB) over distance-based regression techniques (KNN) to forecast the CS of HSGPC.
Hybrid model optimization and performance analysis
To further optimize and enhance the test performance of XGB, the model was integrated with nature-inspired optimization algorithms, namely PSO, FA, and GWO. These metaheuristic approaches were used to adjust XGB’s hyperparameters to increase its capacity for generalization and prediction accuracy.
The optimal hyperparameters, including the number of estimators, max depth, learning rate, subsample ratio, and column sample by tree ratio, were consistently identified for each optimization method. Table 5 presents a summary of the hyperparameters optimized by PSO, FA, and GWO.
The optimized XGB models were evaluated using R² scores and multiple error metrics to assess their effectiveness in predicting compressive strength. The optimized models achieved R² values of 0.986 (XGB-PSO), 0.984 (XGB-FA), and 0.991 (XGB-GWO) on the test dataset, demonstrating enhanced predictive capability. Figure 11 (a-c) illustrate the regression plots for each optimized model, comparing predicted and actual compressive strength values. The optimized models exhibit tighter clustering along the perfect fit line, with fewer data points deviating beyond the ± 25% error margins.
Additionally, Fig. 12 (a-c) presents a comprehensive comparison of the XGB model before and after optimization, highlighting the performance of each hybrid model relative to the baseline XGB model based on testing phase. Both regression plots and error analysis visualizations provide insights into the distribution of errors across data points. The results indicate that the optimized models significantly enhanced predictive performance, improved reliability and reduced prediction deviations in the modified models.
In addition to the regression and error plots, Fig. 13 provides a comparative bar chart illustrating the reduction in error metrics (MSE, RMSE, MAE, and MAPE) after optimization. The optimized models demonstrated a substantial decline in MSE, with XGB-GWO achieving the lowest value (14.42), followed by XGB-PSO (21.63) and XGB-FA (24.78). Similarly, RMSE, MAE, and MAPE were significantly reduced across all optimized models, reaffirming the effectiveness of the applied metaheuristic techniques in enhancing the model’s predictive performance. Among the three optimization techniques, XGB-GWO demonstrated superior test performance, attaining the greatest R² score (0.991) and the least MSE (14.42). These findings are in line with previous studies97,98 as GWO balances exploration and exploitation effectively. For instance, A study by Ӧzyüksel Çiftçioğlu et al.99 developed a GWO-XGB model for UHPC incorporating nano- and micro-materials, achieving R² values of 0.984 and 0.948, respectively, for CS prediction. Similarly, Parhi et al.73 implemented a hybrid ensemble model using XGB/GWO-optimized RF, NN, and MARS to predict the strength of geopolymer concrete, reporting an R² of 0.938 on the test set. In another work, Zhang et al.98 proposed a GWO-optimized RF–XGB stacking model for landscape geopolymer concrete, achieving R² values of 0.983 and 0.981 with significantly reduced RMSE. These findings affirm that the integration of metaheuristic optimization, particularly GWO, enhances the reliability of ML models. Compared to these studies, the current work’s XGB-GWO model for HSGPC concrete yields comparable or better accuracy. Moreover, the ML models demonstrated strong generalization capability and predictive accuracy despite the presence of outliers (Fig. 5), as evidenced by the performance metrics. This supports the robustness of the models and justifies the retention of the full dataset for training and evaluation.
Taylor diagram
Figure 12(a, b) illustrates Taylor diagrams that compare the performance of machine learning models (KNN, RF, XGB) and hybrid-optimized XGB models (XGB-PSO, XGB-FA, XGB-GWO) in predicting the compressive strength of HSGPC. These diagrams simultaneously present the correlation coefficient, standard deviation, and root mean square error (RMSE), where the contour lines indicate the centered RMSE between predicted and actual values. In the training phase (Fig. 14(a)), XGB and its optimized variants exhibit near-perfect agreement with the observed standard deviation and the highest correlation coefficients (> 0.999), indicating excellent model fitting. In the testing phase (Fig. 14(b)), the XGB-GWO model achieved a highest correlation coefficient of 0.9952, with a predicted standard deviation of 39.143 MPa, closely matching the observed standard deviation of 39.446 MPa. This proximity to the reference point is also associated with the lowest RMSE contour level, aligning with the model’s recorded test RMSE of 3.80 MPa. In contrast, the KNN model showed a notably lower correlation (R = 0.882) and a standard deviation of 33.570 MPa, indicating weaker agreement and higher prediction error. These Taylor diagrams further validate and visually compare the findings of all models, reinforcing the enhanced prediction capability of hybrid-optimized XGB models especially the GWO-enhanced variant.
SHAP analysis
The SHapley Additive Explanations (SHAP), a complex method for elucidating the outputs of any machine learning model, facilitates the comprehension of complex machine learning models100. By allocating relevance values to each feature according to how it contributes to a particular prediction, SHAP enables researchers to comprehend how the trained model makes decisions. Based on game theory, the method calculates the contribution of each characteristic in the dataset to the overall prediction, in a manner like how a player’s contribution is assessed in a cooperative game78,101.
To examine the impact of cement, glass powder, fine and coarse aggregates, water, superplasticizer, and curing age on compressive strength predictions, SHAP was used to the improved XGB model in this study. Feature significance was shown using the SHAP summary plot, which displays the total effect of every variable. To evaluate the interplay between distinct characteristics and their impact on CS, the SHAP dependency plot was also produced.
SHAP summary and feature importance
The influence of input parameters on CS prediction is ranked using the SHAP feature importance ranking (Fig. 15), curing days (+ 10.49), coarse aggregate (+ 8.46), water content (+ 4.42), fine aggregate (+ 2.83), cement (+ 2.74), and glass powder (+ 1.81) are the next most influential factors, followed by SP dosage with a SHAP value of + 11.65.
The SHAP summary plot (Fig. 16) further substantiates these findings. Interestingly, coarse aggregate showed a high influence on the negative side, suggesting that excessive amounts may reduce strength, possibly due to insufficient paste-to-aggregate bonding or an increase in interfacial transition zone (ITZ) defects. Conversely, curing days significantly enhance compressive strength, reinforcing the well-established role of extended hydration in improving concrete performance. The notable impact of superplasticizer dosage indicates its role in optimizing workability and enhancing particle dispersion, leading to improved strength outcomes in HSC.
Additionally, glass powder exhibited the lowest SHAP impact, which can be attributed to its sequential replacement with cement rather than direct enhancement of mechanical strength. However, its contribution remains positive, aligning with its pozzolanic activity and potential to improve long-term strength development. The moderate influence of cement and fine aggregate highlights their essential, yet balanced, contribution to the overall mix design.
SHAP dependence analysis for individual features
To further investigate the influence of individual material components on compressive strength, SHAP dependence plots were generated, providing deeper insights into how variations in mix proportions affect strength development in high-strength glass powder concrete.
The first important parameter, cement has a robust positive association with CS, underscoring its function as the principal binder in concrete. As observed in the SHAP dependence plot (Fig. 17a), an increase in cement content generally leads to a higher CS. However, beyond a certain threshold, the effect diminishes, likely due to inefficient hydration at excessive cement levels, which can contribute to shrinkage cracks and increased brittleness. This trend highlights the necessity of maintaining an optimal cement proportion to achieve maximum strength without compromising durability.
There is a non-linear tendency in the association between CS and coarse aggregate content (Fig. 17b). Strength is increased by increasing coarse aggregate at moderate levels because it improves load distribution within the matrix. However, an excessive amount of coarse aggregate results in a little decrease in compressive strength, which may be caused by decreased matrix cohesion and impaired paste-aggregate bonding. Moreover, curing days, as shown in Fig. 17c, demonstrate a strong positive influence on compressive strength. The SHAP analysis confirms that prolonged curing significantly enhances strength development by allowing extended hydration reactions to take place. This aligns with well-established principles of concrete technology, where longer curing periods improve microstructural densification, reduce porosity, and enhance durability.
Fine aggregate and water content also exhibit moderate influence on compressive strength Fig. 17d and e). While fine aggregate contributes to the overall packing density and strength of the mix, its effect is less pronounced compared to coarse aggregate and cement. Conversely, water content is essential for workability; nevertheless, an excess of water results in enhanced porosity and diminished strength. The SHAP dependency plot indicates that sustaining an ideal water-to-cement ratio is crucial for achieving a balance between workability and strength in high-strength concrete. Superplasticizer dosage follows a distinctive trend, as seen in Fig. 17f, where an optimal dosage enhances workability and contributes to higher compressive strength. However, excessive superplasticizer can disrupt particle cohesion, leading to segregation and reduced strength development.
Glass powder, despite having the lowest SHAP influence, still plays a positive role in strength development (Fig. 17g). Unlike other mix components, its lower influence can be attributed to its sequential replacement with cement, meaning its effect is moderated by the presence of the primary binder. The SHAP dependence plot suggests that glass powder contributes to long-term strength enhancement through pozzolanic activity, improving the overall sustainability of the concrete mix without significantly altering its early-age performance.
Partial dependence plot (PDP) analysis
The Partial Dependence Plots (PDPs) further validate the feature importance and trends observed in the SHAP analysis. These are low-dimensional graphical representations of prediction functions that show how the result and relevant predictors relate to one another. Partial dependency plots validate the use of explanatory variables for prediction and enable model review102.
The results (Fig. 18) confirm that cement, curing days, fine aggregate, and superplasticizer positively contribute to compressive strength, reinforcing their role in enhancing concrete performance. Cement and curing days show a strong correlation with strength gain, while fine aggregate and superplasticizer exhibit moderate yet beneficial effects. The effect of glass powder remains stable, further supporting its supplementary role as a cement replacement without significantly impacting strength.
Conversely, coarse aggregate and water content display a negative influence at higher proportions, aligning with the SHAP findings. Excessive coarse aggregate disrupts the paste-aggregate bond, while increased water content reduces matrix density, leading to lower compressive strength. There are well established concrete engineering theories that excessive coarse aggregate content reduces concrete compressive strength primarily by altering the paste-to-aggregate ratio and weakening the Interfacial Transition Zone (ITZ). An overly high aggregate volume results in insufficient cementitious paste, leaving uncoated aggregate surfaces and internal voids that diminish concrete density and strength103. Additionally, excessive coarse aggregate enlarges and overlaps the ITZ regions, inherently porous zones with higher local water-to-cement ratios. These enlarged ITZs promote stress concentration and initiate cracking under lower stress, further compromising strength104,105. In this regard, Malkawi et al.104 found that a moderate coarse aggregate ratio (about 55–65% of total aggregate by volume) yielded the highest compressive strength in a high-strength fly ash geopolymer concrete, whereas mixes with higher coarse aggregate content showed significantly lower strength along with increased porosity. Furthermore, an excessively coarse aggregate fraction impairs workability and compaction, leading to increased entrapped air and internal voids, thereby reducing load-bearing capacity106. In both cases, the highest strength is achieved when there is enough binder paste to create a dense microstructure and a thin, well-bonded ITZ around aggregates, and any further increase in aggregate disrupts that balance103. Hence, optimal coarse aggregate content is critical for maintaining structural integrity and strength.
Individual conditional expectation
The Individual Conditional Expectation (ICE) plots provide a more detailed perspective on how each feature influences compressive strength at an individual data point level. These plots visualize the variation in predictions across different feature values while maintaining interactions with other variables107.
The ICE analysis further reinforces the trends observed in SHAP and PDP, confirming the significant roles of coarse aggregate, curing days, and superplasticizer dosage in predicting compressive strength (Fig. 19(a-g)). In the ICE plots, the black dashed line represents the average trend across all instances, offering a generalized view of how a feature impacts the output on average. The thin blue lines depict individual sample trajectories, illustrating how compressive strength predictions change as each sample’s feature value varies while keeping other variables fixed.
The ICE plot for cement (Fig. 19a) shows a predominantly increasing trend, indicating a positive effect on CS, though it stabilizes at higher values, suggesting a saturation point. The curing days plot (Fig. 19b) confirms a strong positive correlation, with strength increasing significantly within the early curing period before reaching a plateau. The fine aggregate (Fig. 19c) and glass powder (Fig. 19d) plots exhibit minimal variation, confirming their relatively lower influence on strength development. The superplasticizer plot (Fig. 19e) highlights an optimal dosage range, beyond which no significant improvement is observed. The water content plot (Fig. 19f) aligns with previous findings, where excessive water reduces strength due to increased porosity. Lastly, the coarse aggregate ICE plot (Fig. 19g) underscores its diminishing effect at higher levels, supporting earlier observations that excessive usage may disrupt the concrete matrix.
Future perspectives
The results of this research demonstrate how well ML predicts HSGPC’s CS. Nonetheless, there are still a few crucial areas for further study and advancement:
-
Incorporating detailed particle size distribution data for glass powder could refine the model’s predictive accuracy and provide deeper insights into its role in cementitious reactions.
-
Exploring deep learning architectures such as transformers, graph neural networks (GNNs), and attention-based models may further enhance predictive performance and generalization.
-
Increasing the dataset with a wider variety of mix compositions, curing conditions, and environmental exposure scenarios would improve the robustness of the models.
-
Future developments should include real-time mix optimization, predictive analytics for durability performance, and cloud-based computing integration to make the tool more interactive and practical for users.
-
Sustainability Analysis: Extending the model to evaluate the environmental impact of different mix compositions by integrating life-cycle assessment (LCA) tools could support sustainable concrete mix designs.
By addressing these future directions, ML-based predictive tools can evolve into more precise, scalable, and user-friendly systems, ultimately benefiting researchers, engineers, and the construction industry.
Research limitations
While this study successfully demonstrates the utilization of ML in predicting the CS of high-strength concrete incorporating glass powder, certain limitations must be acknowledged. Achieving HSC is not solely dependent on glass powder; rather, it results from the combined effects of various SCMs. However, due to data limitations and the inherent variability in experimental datasets, this research focuses specifically on the role of glass powder alongside superplasticizers in enhancing concrete performance. The exclusion of other SCMs, such as fly ash, silica fume, or metakaolin, may limit the model’s ability to generalize across a broader range of high-performance mix designs. Future studies should incorporate a more comprehensive dataset encompassing multiple SCMs to refine predictive accuracy and provide a more complete understanding of their synergistic effects in achieving high-strength concrete.
Conclusion
This research was conducted to optimize machine learning models for forecasting the CS of HSGPC. Predictive models like KNN, RF, and XGB were investigated in the study, and nature-inspired optimization approaches (PSO, FA, and GWO) were used to further improve them. To evaluate the model predictions and comprehend the impact of important input factors, SHAP, PDP, and ICE analyses were performed. The key findings are summarized as follows:
-
XGB and RF outperformed KNN and RF in terms of predictive accuracy among standalone models, obtaining R2 values on the test dataset of 0.946 and 0.963, respectively.
-
Based on the comparative analysis of standalone and optimized models, XGB-GWO exhibited superior predictive capability, attaining the highest R² (0.991) and the least MSE (14.42), demonstrating the effectiveness of hyperparameter tuning through metaheuristic algorithms.
-
Taylor diagram analysis provided a compact visual validation of all models by jointly comparing their correlation, standard deviation, and RMSE. The XGB-GWO model consistently plotted nearest to the reference point in both training and testing phases, confirming its superior accuracy and generalization performance over the other models.
-
SHAP analysis revealed that coarse aggregate (SHAP impact = 8.46), with excessive content leading to strength reduction due to weaker paste-aggregate bonding. Curing days (SHAP = 10.49) strongly influenced strength development, confirming that prolonged curing enhances strength.
-
Superplasticizer dosage (SHAP = 11.65) improved strength within an optimal range but plateaued at higher concentrations. Water content (SHAP = 4.42) exhibited a negative impact beyond optimal levels due to increased porosity, leading to strength reduction.
-
Dependence plots (PDP and ICE) further validated these findings, highlighting non-linear trends. Cement content exhibited a positive correlation with strength, but beyond ~ 600 kg/m³, the marginal gain decreased due to hydration inefficiencies. Coarse aggregate showed a strength decline beyond ~ 800 kg/m³, reinforcing the trade-off between particle packing and paste bonding.
-
Although the study highlights the potential of ML in concrete mix design optimization, larger datasets, additional SCMs, and more sophisticated hybrid models should be incorporated into future studies to achieve even greater gains.
Declaration of Generative AI and AI-assisted technologies in the writing process
During the preparation of this work the authors used ChatGPT only to improve readability and language of the work. After using this tool, the authors reviewed and edited the content as needed under strict human oversight and take full responsibility for the content of the publication.
Data availability
The data is available from the corresponding author upon request.
References
Du, H. & Tan, K. H. Properties of high volume glass powder concrete. Cem. Concr Compos. 75, 22–29. https://doi.org/10.1016/J.CEMCONCOMP.2016.10.010 (2017).
Wray, P. & Scrivener, K. Straight talk with Karen Scrivener on cements, CO2 and sustainable development, 91 47–50. (2012). https://infoscience.epfl.ch/handle/20.500.14299/82069 (accessed March 3, 2025).
De Weerdt, K. et al. Hydration mechanisms of ternary Portland cements containing limestone powder and fly Ash. Cem. Concr Res. 41, 279–291. https://doi.org/10.1016/J.CEMCONRES.2010.11.014 (2011).
Lothenbach, B., Scrivener, K. & Hooton, R. D. Supplementary cementitious materials. Cem. Concr Res. 41, 1244–1256. https://doi.org/10.1016/J.CEMCONRES.2010.12.001 (2011).
Park, B. & Choi, Y. C. Hydration and pore-structure characteristics of high-volume fly Ash cement pastes. Constr. Build. Mater. 278, 122390. https://doi.org/10.1016/J.CONBUILDMAT.2021.122390 (2021).
Onaizi, A. M., Lim, N. H. A. S., Huseien, G. F., Amran, M. & Ma, C. K. Effect of the addition of nano glass powder on the compressive strength of high volume fly Ash modified concrete. Mater. Today Proc. 48, 1789–1795. https://doi.org/10.1016/J.MATPR.2021.08.347 (2022).
Siad, H., Lachemi, M., Sahmaran, M., Mesbah, H. A. & Hossain, K. M. A. Use of recycled glass powder to improve the performance properties of high volume fly ash-engineered cementitious composites. Constr. Build. Mater. 163, 53–62. https://doi.org/10.1016/J.CONBUILDMAT.2017.12.067 (2018).
Solikin, M. et al. Experimental design analysis of ultra fine fly ash, lime water, and basalt fibre in mix proportion of high volume fly ash concrete, undefined, 2013•pertanika.Upm.Edu.My 21 (2013) 589–600. (2013). http://www.pertanika.upm.edu.my/resources/files/Pertanika%20PAPERS/JST%20Vol.%2021%20(2)%20Jul.%202013/24%20Page%20589-600.pdf (accessed March 3, 2025).
Rashad, A. M., Seleem, H. E. D. H. & Shaheen, A. F. Effect of silica fume and slag on compressive strength and abrasion resistance of HVFA concrete. Int. J. Concr Struct. Mater. 8, 69–81. https://doi.org/10.1007/S40069-013-0051-2/FIGURES/10 (2014).
Schwarz, N., Cam, H. & Neithalath, N. Influence of a fine glass powder on the durability characteristics of concrete and its comparison to fly Ash. Cem. Concr Compos. 30, 486–496. https://doi.org/10.1016/J.CEMCONCOMP.2008.02.001 (2008).
Samadi, M., Shah, K. W., Lim, N. H. A. S. & Huseien, G. F. Influence of Glass Silica Waste Nano Powder on the Mechanical and Microstructure Properties of Alkali-Activated Mortars, Nanomaterials 2020, Vol. 10, Page 324 10 324. (2020). https://doi.org/10.3390/NANO10020324
Shao, Y., Lefort, T., Moras, S. & Rodriguez, D. Studies on concrete containing ground waste glass. Cem. Concr Res. 30, 91–100. https://doi.org/10.1016/S0008-8846(99)00213-6 (2000).
Maraghechi, H., Maraghechi, M., Rajabipour, F. & Pantano, C. G. Pozzolanic reactivity of recycled glass powder at elevated temperatures: reaction stoichiometry, reaction products and effect of alkali activation. Cem. Concr Compos. 53, 105–114. https://doi.org/10.1016/J.CEMCONCOMP.2014.06.015 (2014).
Mirzahosseini, M. & Riding, K. A. Effect of curing temperature and glass type on the pozzolanic reactivity of glass powder. Cem. Concr Res. 58, 103–111. https://doi.org/10.1016/J.CEMCONRES.2014.01.015 (2014).
Shi, C., Wu, Y., Riefler, C. & Wang, H. Characteristics and pozzolanic reactivity of glass powders. Cem. Concr Res. 35, 987–993. https://doi.org/10.1016/J.CEMCONRES.2004.05.015 (2005).
DyerT.D. & DhirR.K. Chemical reactions of glass cullet used as cement component. J. Mater. Civ. Eng. 13, 412–417. https://doi.org/10.1061/(ASCE)0899-1561(2001)13:6(412) (2001).
Du, H. et al. Waste Glass Powder as Cement Replacement in Concrete, Journal of Advanced Concrete Technology 12 468–477. (2014). https://doi.org/10.3151/JACT.12.468
Schwarz, N., DuBois, M. & Neithalath, N. Electrical conductivity based characterization of plain and coarse glass powder modified cement pastes. Cem. Concr Compos. 29, 656–666. https://doi.org/10.1016/J.CEMCONCOMP.2007.05.005 (2007).
Matos, A. M. & Sousa-Coutinho, J. Durability of mortar using waste glass powder as cement replacement. Constr. Build. Mater. 36, 205–215. https://doi.org/10.1016/J.CONBUILDMAT.2012.04.027 (2012).
Carsana, M., Frassoni, M. & Bertolini, L. Comparison of ground waste glass with other supplementary cementitious materials. Cem. Concr Compos. 45, 39–45. https://doi.org/10.1016/J.CEMCONCOMP.2013.09.005 (2014).
H. Du, K.T.-A.M. Journal, undefined 2015, Transport properties of concrete with glass powder as supplementary cementitious material, Search.Proquest.ComH Du, KH TanACI Materials Journal, 2015•search.Proquest.Com (n.d.). https://search.proquest.com/openview/e32cdcec905a92bdb22d5b34636fe1bc/1?pq-origsite=gscholar&cbl=37076 (accessed March 3, 2025).
Miao, X., Chen, B. & Zhao, Y. Prediction of compressive strength of glass powder concrete based on artificial intelligence. J. Building Eng. 91, 109377. https://doi.org/10.1016/J.JOBE.2024.109377 (2024).
Vasudevan, G., Ganis, S. & Pillay, K. Performance of using waste glass powder in concrete as replacement of cement. Am. J. Eng. Res. (AJER). 02, 175–181 (2013). www.ajer.org (accessed March 3, 2025).
Cree, D., Green, M. & Noumowé, A. Residual strength of concrete containing recycled materials after exposure to fire: A review. Constr. Build. Mater. 45, 208–223. https://doi.org/10.1016/J.CONBUILDMAT.2013.04.005 (2013).
Choi, W. C., Do, H. & Yun Compressive behavior of reinforced concrete columns with recycled aggregate under uniaxial loading. Eng. Struct. 41, 285–293. https://doi.org/10.1016/J.ENGSTRUCT.2012.03.037 (2012).
El-Mir, A. et al. Machine learning prediction of concrete compressive strength using rebound hammer test. J. Building Eng. 64, 105538. https://doi.org/10.1016/J.JOBE.2022.105538 (2023).
Nguyen, H., Vu, T., Vo, T. P. & Thai, H. T. Efficient machine learning models for prediction of concrete strengths. Constr. Build. Mater. 266, 120950. https://doi.org/10.1016/J.CONBUILDMAT.2020.120950 (2021).
Paudel, S., Pudasaini, A., Shrestha, R. K. & Kharel, E. Compressive strength of concrete material using machine learning techniques. Clean. Eng. Technol. 15, 100661. https://doi.org/10.1016/J.CLET.2023.100661 (2023).
Kumar Dash, P., Kumar Parhi, S., Kumar Patro, S. & Panigrahi, R. Efficient machine learning algorithm with enhanced Cat swarm optimization for prediction of compressive strength of GGBS-based geopolymer concrete at elevated temperature. Constr. Build. Mater. 400, 132814. https://doi.org/10.1016/J.CONBUILDMAT.2023.132814 (2023).
Parhi, S. K. & Patro, S. K. Compressive strength prediction of PET fiber-reinforced concrete using Dolphin echolocation optimized decision tree-based machine learning algorithms. Asian J. Civil Eng. 25, 977–996. https://doi.org/10.1007/S42107-023-00826-8/METRICS (2024).
Song, H., Ahmad, A., Ostrowski, K. A. & Dudek, M. Analyzing the Compressive Strength of Ceramic Waste-Based Concrete Using Experiment and Artificial Neural Network (ANN) Approach, Materials 2021, Vol. 14, Page 4518 14 4518. (2021). https://doi.org/10.3390/MA14164518
Cui, T., Kulasegaram, S. & Li, H. Design automation of sustainable self-compacting concrete containing fly Ash via data driven performance prediction. J. Building Eng. 87, 108960. https://doi.org/10.1016/J.JOBE.2024.108960 (2024).
Huang, P., Dai, K. & Yu, X. Machine learning approach for investigating compressive strength of self-compacting concrete containing supplementary cementitious materials and recycled aggregate. J. Building Eng. 79, 107904. https://doi.org/10.1016/J.JOBE.2023.107904 (2023).
Singh, S., Patro, S. K. & Parhi, S. K. Evolutionary optimization of machine learning algorithm hyperparameters for strength prediction of high-performance concrete. Asian J. Civil Eng. 24, 3121–3143. https://doi.org/10.1007/S42107-023-00698-Y/METRICS (2023).
Parhi, S. K. & Panigrahi, S. K. Application of metaheuristic spotted hyena optimization in strength prediction of concrete. Metaheuristics-Based Mater. Optim. 229–248. https://doi.org/10.1016/B978-0-443-29162-3.00008-3 (2025).
Saha, P., Debnath, P. & Thomas, P. Prediction of fresh and hardened properties of self-compacting concrete using support vector regression approach. Neural Comput. Appl. 32, 7995–8010. https://doi.org/10.1007/S00521-019-04267-W/FIGURES/9 (2020).
Ahmad, A. et al. Comparative Study of Supervised Machine Learning Algorithms for Predicting the Compressive Strength of Concrete at High Temperature, Materials 2021, Vol. 14, Page 4222 14 4222. (2021). https://doi.org/10.3390/MA14154222
Kovačević, M., Grubeša, I. N., Hadzima-Nyarko, M. & Nyarko, E. K. Application of machine learning models for the compressive strength prediction of concrete with glass waste powder. Artif. Intell. Appl. Sustainable Constr. 123–149. https://doi.org/10.1016/B978-0-443-13191-2.00004-3 (2024).
Gao, Y. & Ma, R. Study of machine learning and conventional techniques in predicting compressive strength of concrete with eggshell and glass powder additives. Buildings 14, 2956. https://doi.org/10.3390/BUILDINGS14092956/S1 (2024).
Nassar, R. U. D. & Sohaib, O. Prediction of the compressive strength of sustainable concrete produced with powder glass using standalone and stack machine learning methods. Commun. Comput. Inform. Sci. 2145 CCIS. 147–158 https://doi.org/10.1007/978-981-97-5934-7_13 (2024).
Farooq, F., Ahmed, W., Akbar, A., Aslam, F. & Alyousef, R. Predictive modeling for sustainable high-performance concrete from industrial wastes: A comparison and optimization of models using ensemble learners. J. Clean. Prod. 292, 126032. https://doi.org/10.1016/J.JCLEPRO.2021.126032 (2021).
Nasir Amin, M., Iftikhar, B., Khan, K., Faisal Javed, M. & Mohammad AbuArab, A. Faisal rehman, prediction model for rice husk Ash concrete using AI approach: boosting and bagging algorithms. Structures 50, 745–757. https://doi.org/10.1016/J.ISTRUC.2023.02.080 (2023).
Kou, S. C. & Xing, F. The effect of recycled glass powder and reject fly Ash on the mechanical properties of Fibre-Reinforced ultrahigh performance concrete. Adv. Mater. Sci. Eng. 2012, 263243. https://doi.org/10.1155/2012/263243 (2012).
Ahmad, J. & Zhou, Z. Development of high strength self compacting concrete with waste glass and waste marble. Constr. Build. Mater. 408, 133760. https://doi.org/10.1016/J.CONBUILDMAT.2023.133760 (2023).
Chandra Sekhar, M., Kumar, M. H., Lova Raju, S. & Saikrishnamacharyulu, I. Influence of Metakaolin and glass powder on mechanical behaviour of concrete. Mater. Today Proc. https://doi.org/10.1016/J.MATPR.2023.03.713 (2023).
Mustafa, T. S., Mahmoud, A. A., Mories, E. M., El, S. A. & Beshlawy Flexural behavior of reinforced concrete slabs containing recycled glass powder and steel fibers. Structures 54, 1491–1508. https://doi.org/10.1016/J.ISTRUC.2023.05.101 (2023).
Peng, L. et al. Enhancing the corrosion resistance of recycled aggregate concrete by incorporating waste glass powder. Cem. Concr Compos. 137, 104909. https://doi.org/10.1016/J.CEMCONCOMP.2022.104909 (2023).
Ibrahim, K. I. M. Recycled waste glass powder as a partial replacement of cement in concrete containing silica fume and fly Ash. Case Stud. Constr. Mater. 15, e00630. https://doi.org/10.1016/J.CSCM.2021.E00630 (2021).
Azeez, A. T., Hassan, M. S. & Attiyah, A. A. H. Modelling the complementary role of inert slag aggregate and waste glass on concrete mechanical strengths using the response surface and design-of-experiment approach. Constr. Build. Mater. 409, 134199. https://doi.org/10.1016/J.CONBUILDMAT.2023.134199 (2023).
Tahwia, A. M., Essam, A., Tayeh, B. A. & Elrahman, M. A. Enhancing sustainability of ultra-high performance concrete utilizing high-volume waste glass powder. Case Stud. Constr. Mater. 17, e01648. https://doi.org/10.1016/J.CSCM.2022.E01648 (2022).
Alateah, A. H. et al. Modelling of strength characteristics of silica fume/glass ternary blended concrete using destructive and non-destructive testing methods. J. Mater. Res. Technol. 22, 997–1013. https://doi.org/10.1016/J.JMRT.2022.11.180 (2023).
Zidol, A. et al. Effect of glass powder on concrete sustainability. New. J. Glass Ceram. 7, 34–47. https://doi.org/10.4236/NJGC.2017.72004 (2017).
Paul, D., Bindhu, K. R., Matos, A. M. & Delgado, J. Eco-friendly concrete with waste glass powder: A sustainable and circular solution. Constr. Build. Mater. 355, 129217. https://doi.org/10.1016/J.CONBUILDMAT.2022.129217 (2022).
Soliman, N. A. & Tagnit-Hamou, A. Development of ultra-high-performance concrete using glass powder – Towards ecofriendly concrete. Constr. Build. Mater. 125, 600–612. https://doi.org/10.1016/J.CONBUILDMAT.2016.08.073 (2016).
Gupta, J., Jethoo, A. S. & Ramana, P. V. Evaluating long term properties of concrete using waste beverage glass. Mater. Today Proc. 61, 297–306. https://doi.org/10.1016/J.MATPR.2021.09.446 (2022).
Tahwia, A. M., Elmansy, A., Abdellatief, M. & Elrahman, M. A. Durability and ecological assessment of low-carbon high-strength concrete with short AR-glass fibers: effects of high-volume of solid waste materials. Constr. Build. Mater. 429, 136422. https://doi.org/10.1016/J.CONBUILDMAT.2024.136422 (2024).
Muhedin, D. A. & Ibrahim, R. K. Effect of waste glass powder as partial replacement of cement & sand in concrete. Case Stud. Constr. Mater. 19, e02512. https://doi.org/10.1016/J.CSCM.2023.E02512 (2023).
Zhan, P. et al. Strength, microstructure and Nanomechanical properties of recycled aggregate concrete containing waste glass powder and steel slag powder. J. Clean. Prod. 341, 130892. https://doi.org/10.1016/J.JCLEPRO.2022.130892 (2022).
Xu, J. et al. Design and assessment of eco-friendly ultra-high performance concrete with steel slag powder and recycled glass powder. Powder Technol. 419, 118356. https://doi.org/10.1016/J.POWTEC.2023.118356 (2023).
Lai, M. H. et al. Pre- and post-fire behaviour of glass concrete from wet packing density perspective. J. Building Eng. 86, 108758. https://doi.org/10.1016/J.JOBE.2024.108758 (2024).
Orouji, M., Zahrai, S. M. & Najaf, E. Effect of glass powder & polypropylene fibers on compressive and flexural strengths, toughness and ductility of concrete: an environmental approach. Structures 33, 4616–4628. https://doi.org/10.1016/J.ISTRUC.2021.07.048 (2021).
Nassar, R. U. D., Soroushian, P. & Sufyan-Ud-Din, M. Long-term field performance of concrete produced with powder waste glass as partial replacement of cement. Case Stud. Constr. Mater. 15, e00745. https://doi.org/10.1016/J.CSCM.2021.E00745 (2021).
Aliabdo, A. A., Abd Elmoaty, A. E. M. & Aboshama, A. Y. Utilization of waste glass powder in the production of cement and concrete. Constr. Build. Mater. 124, 866–877. https://doi.org/10.1016/J.CONBUILDMAT.2016.08.016 (2016).
Althoey, F. et al. Effect of fly Ash and waste glass powder as a fractional substitute on the performance of natural fibers reinforced concrete. Ain Shams Eng. J. 14, 102247. https://doi.org/10.1016/J.ASEJ.2023.102247 (2023).
Chu, S. H., Li, L., Shen, P. L., Lu, J. X. & Poon, C. S. Recycling of waste glass powder as paste replacement in green UHPFRC. Constr. Build. Mater. 316, 125719. https://doi.org/10.1016/J.CONBUILDMAT.2021.125719 (2022).
Saify, S. et al. Impact of waste materials (glass powder and silica fume) on features of high-strength concrete. Open. Eng. 13 https://doi.org/10.1515/ENG-2022-0479/MACHINEREADABLECITATION/ENDNOTE (2023).
Asadi Shamsabadi, E., Salehpour, M., Zandifaez, P. & Dias-da-Costa, D. Data-driven multicollinearity-aware multi-objective optimisation of green concrete mixes. J. Clean. Prod. 390, 136103. https://doi.org/10.1016/J.JCLEPRO.2023.136103 (2023).
Manan, A. et al. Machine learning prediction of recycled concrete powder with experimental validation and life cycle assessment study. Case Stud. Constr. Mater. 21, e04053. https://doi.org/10.1016/J.CSCM.2024.E04053 (2024).
Bin Inqiad, W., Siddique, M. S., Ali, M. & Najeh, T. Predicting 28-day compressive strength of fibre-reinforced self-compacting concrete (FR-SCC) using MEP and GEP. Sci. Rep. 2024. 14, 1. https://doi.org/10.1038/s41598-024-65905-5 (2024).
Liu, K. et al. Development of compressive strength prediction platform for concrete materials based on machine learning techniques. J. Building Eng. 80, 107977. https://doi.org/10.1016/J.JOBE.2023.107977 (2023).
Pedregosa, F. et al. EDOUARDDUCHESNAY, Scikit-learn: Machine Learning in Python Gaël Varoquaux Bertrand Thirion Vincent Dubourg Alexandre Passos PEDREGOSA, VAROQUAUX, GRAMFORT ET AL. Matthieu Perrot, Journal of Machine Learning Research 12 2825–2830. (2011). http://scikit-learn.sourceforge.net. (accessed March 4, 2025).
Alhalil, I. & Gullu, M. F. Predicting Main Characteristics of Reinforced Concrete Buildings Using Machine Learning, Buildings 2024, Vol. 14, Page 2967 14 2967. (2024). https://doi.org/10.3390/BUILDINGS14092967
Parhi, S. K. & Patro, S. K. Prediction of compressive strength of geopolymer concrete using a hybrid ensemble of grey Wolf optimized machine learning estimators. J. Building Eng. 71, 106521. https://doi.org/10.1016/J.JOBE.2023.106521 (2023).
Dash, P. K., Parhi, S. K., Patro, S. K. & Panigrahi, R. Influence of chemical constituents of binder and activator in predicting compressive strength of fly ash-based geopolymer concrete using firefly-optimized hybrid ensemble machine learning model. Mater. Today Commun. 37, 107485. https://doi.org/10.1016/J.MTCOMM.2023.107485 (2023).
Singh, P. et al. Development of performance-based models for green concrete using multiple linear regression and artificial neural network. Int. J. Interact. Des. Manuf. 18, 2945–2956. https://doi.org/10.1007/S12008-023-01386-6/FIGURES/9 (2024).
Saravanan, A. et al. Thermal performance prediction of a solar air heater with a C-shape finned absorber plate using RF, LR and KNN models of machine learning. Therm. Sci. Eng. Progress. 38, 101630. https://doi.org/10.1016/J.TSEP.2022.101630 (2023).
Meddage, D. P. P., Ekanayake, I. U., Weerasuriya, A. U. & Lewangamage, C. S. Tree-based regression models for predicting external wind pressure of a building with an unconventional configuration, MERCon 2021–7th International Multidisciplinary Moratuwa Engineering Research Conference, Proceedings 257–262. (2021). https://doi.org/10.1109/MERCON52712.2021.9525734
Meddage, D. P. P., Fonseka, I., Mohotti, D., Wijesooriya, K. & Lee, C. K. An explainable machine learning approach to predict the compressive strength of graphene oxide-based concrete. Constr. Build. Mater. 449, 138346. https://doi.org/10.1016/J.CONBUILDMAT.2024.138346 (2024).
Meddage, D. P. P. et al. Explainable machine learning (XML) to predict external wind pressure of a low-rise Building in urban-like settings. J. Wind Eng. Ind. Aerodyn. 226, 105027. https://doi.org/10.1016/J.JWEIA.2022.105027 (2022).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system, Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 13-17-August-2016 785–794. (2016). https://doi.org/10.1145/2939672.2939785/SUPPL_FILE/KDD2016_CHEN_BOOSTING_SYSTEM_01-ACM.MP4
Dong, W., Huang, Y., Lehane, B. & Ma, G. XGBoost algorithm-based prediction of concrete electrical resistivity for structural health monitoring. Autom. Constr. 114, 103155. https://doi.org/10.1016/J.AUTCON.2020.103155 (2020).
Mirjalili, S., Mirjalili, S. M., Lewis, A. & Optimizer, G. W. Adv. Eng. Softw. 69 46–61. https://doi.org/10.1016/J.ADVENGSOFT.2013.12.007. (2014).
Faris, H., Aljarah, I., Al-Betar, M. A. & Mirjalili, S. Grey Wolf optimizer: a review of recent variants and applications. Neural Comput. Appl. 30, 413–435. https://doi.org/10.1007/S00521-017-3272-5/TABLES/4 (2018).
Kamboj, V. K., Bath, S. K. & Dhillon, J. S. Solution of non-convex economic load dispatch problem using grey Wolf optimizer. Neural Comput. Appl. 27, 1301–1316. https://doi.org/10.1007/S00521-015-1934-8/FIGURES/10 (2016).
Liu, Y. et al. Variable fatigue loading effects on corrugated steel box girders with recycled concrete. J. Constr. Steel Res. 215, 108526. https://doi.org/10.1016/J.JCSR.2024.108526 (2024).
Nadimi-Shahraki, M. H., Taghian, S. & Mirjalili, S. An improved grey Wolf optimizer for solving engineering problems. Expert Syst. Appl. 166, 113917. https://doi.org/10.1016/J.ESWA.2020.113917 (2021).
El-Fergany, A. A. & Hasanien, H. M. Single and Multi-objective optimal power flow using grey Wolf optimizer and differential evolution algorithms. Electr. Power Compon. Syst. 43, 1548–1559. https://doi.org/10.1080/15325008.2015.1041625 (2015).
Komaki, G. M. & Kayvanfar, V. Grey Wolf optimizer algorithm for the two-stage assembly flow shop scheduling problem with release time. J. Comput. Sci. 8, 109–120. https://doi.org/10.1016/J.JOCS.2015.03.011 (2015).
Gad, A. G. Particle swarm optimization algorithm and its applications: A systematic review, archives of computational methods. Eng. 2022. 29, 5. https://doi.org/10.1007/S11831-021-09694-4 (2022).
Lu, D., Wang, G., Du, X. & Wang, Y. A nonlinear dynamic uniaxial strength criterion that considers the ultimate dynamic strength of concrete. Int. J. Impact Eng. 103, 124–137. https://doi.org/10.1016/J.IJIMPENG.2017.01.011 (2017).
Kennedy, J. & Eberhart, R. Particle swarm optimization. Proc. ICNN’95 - Int. Conf. Neural Networks. 4, 1942–1948. https://doi.org/10.1109/ICNN.1995.488968 (n.d.).
Alyami, M. et al. Innovative hybrid machine learning models for estimating the compressive strength of copper mine tailings concrete. Case Stud. Constr. Mater. 21, e03869. https://doi.org/10.1016/J.CSCM.2024.E03869 (2024).
Juneja, M. & Nagar, S. K. Particle swarm optimization algorithm and its parameters: A review. ICCCCM 2016–2nd IEEE Int. Conf. Control Comput. Communication Mater. https://doi.org/10.1109/ICCCCM.2016.7918233 (2017).
Ibrahim, I. A. & Khatib, T. A novel hybrid model for hourly global solar radiation prediction using random forests technique and firefly algorithm. Energy Convers. Manag. 138, 413–425. https://doi.org/10.1016/J.ENCONMAN.2017.02.006 (2017).
Gandomi, A. H., Yang, X. S., Talatahari, S. & Alavi, A. H. Firefly algorithm with chaos. Commun. Nonlinear Sci. Numer. Simul. 18, 89–98. https://doi.org/10.1016/J.CNSNS.2012.06.009 (2013).
Bui, D. K., Nguyen, T., Chou, J. S., Nguyen-Xuan, H. & Ngo, T. D. A modified firefly algorithm-artificial neural network expert system for predicting compressive and tensile strength of high-performance concrete. Constr. Build. Mater. 180, 320–333. https://doi.org/10.1016/J.CONBUILDMAT.2018.05.201 (2018).
Mabdeh, A. N., Ajin, R. S., Razavi-Termeh, S. V., Ahmadlou, M. & Al-Fugara, A. Enhancing the Performance of Machine Learning and Deep Learning-Based Flood Susceptibility Models by Integrating Grey Wolf Optimizer (GWO) Algorithm, Remote Sensing 2024, Vol. 16, Page 2595 16 2595. (2024). https://doi.org/10.3390/RS16142595
Zhang, J., Wang, R., Lu, Y., Huang, J. & Prediction of Compressive Strength of Geopolymer Concrete Landscape Design. : Application of the Novel Hybrid RF–GWO–XGBoost Algorithm, Buildings 2024, Vol. 14, Page 591 14 591. (2024). https://doi.org/10.3390/BUILDINGS14030591
Ӧzyüksel Çiftçioğlu, A., Kazemi, F. & Shafighfard, T. Grey Wolf optimizer integrated within boosting algorithm: application in mechanical properties prediction of ultra high-performance concrete including carbon nanotubes. Appl. Mater. Today. 42, 102601. https://doi.org/10.1016/J.APMT.2025.102601 (2025).
Wang, Z. et al. Optimizing machine learning techniques and SHapley additive explanations (SHAP) analysis for the compressive property of self-compacting concrete. Mater. Today Commun. 39, 108804. https://doi.org/10.1016/J.MTCOMM.2024.108804 (2024).
Islam, M. M. et al. Prediction of compressive strength of high-performance concrete using optimization machine learning approaches with SHAP analysis. J. Building Pathol. Rehabilitation. 9, 1–20. https://doi.org/10.1007/S41024-024-00445-Z/FIGURES/15 (2024).
Bello, S. A. et al. A deep learning approach to concrete water-cement ratio prediction. Results Mater. 15, 100300. https://doi.org/10.1016/J.RINMA.2022.100300 (2022).
Martínez-garcía, R. et al. Impact of design parameters on the ratio of compressive to split tensile strength of Self-Compacting concrete with recycled aggregate. Materials 14, 3480. https://doi.org/10.3390/MA14133480 (2021).
Malkawi, A. B. Effect of Aggregate on the Performance of Fly-Ash-Based Geopolymer Concrete, Buildings Vol. 13, Page 769 13 (2023) 769. (2023). https://doi.org/10.3390/BUILDINGS13030769
Pouhet, R. & Cyr, M. Formulation and performance of flash Metakaolin geopolymer concretes. Constr. Build. Mater. 120, 150–160. https://doi.org/10.1016/J.CONBUILDMAT.2016.05.061 (2016).
Li, B., Yu, Y., Zhang, C. & Zhang, Y. Mesoscopic fracture model of coarse aggregate interlocking concrete. Adv. Mater. Sci. Eng. 2022, 3453797. https://doi.org/10.1155/2022/3453797 (2022).
Bin Inqiad, W. et al. Predicting compressive strength of Hollow concrete prisms using machine learning techniques and explainable artificial intelligence (XAI). Heliyon 10, e36841. https://doi.org/10.1016/J.HELIYON.2024.E36841 (2024).
Acknowledgements
The authors extend their appreciation to Taif University, Saudi Arabia, for supporting this work through project number (TU-DSPP-2024-248).
Author information
Authors and Affiliations
Contributions
M.S. wrote the original draft, T.A. contributed to data curation, conceptualization, analysis, and investigation, M.Z, I.I, and S.S contributed to the review and editing of the manuscript, M.A. and M.AD. contributed in methodology and software analysis, H. A and A.H.H. contributed to project administration and analysis.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
All authors approve that the research was performed under all the ethical norms.
Consent to publish
All authors consent to publishing this paper.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mahmood, M.S., Ali, T., Inam, I. et al. Machine learning prediction and explainability analysis of high strength glass powder concrete using SHAP PDP and ICE. Sci Rep 15, 22089 (2025). https://doi.org/10.1038/s41598-025-04762-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-04762-2