Abstract
With the increasing incidence of extreme rainfall driven by global climate change, geological hazards like landslides have become more prevalent. This study proposed an efficient framework that combined machine learning and physical models to enhance computational efficiency and reliability for regional slope stability predictions under extreme rainfall. The GEOtop model was employed to simulate volumetric water content (VWC) in unsaturated soil of an area in Singapore under maximum daily and maximum 5-day antecedent rainfall conditions. The result of analyses was then incorporated into Scoops3D for factor of safety (FOS) calculations. The random forest (RF) models were trained using VWC under maximum daily rainfall and applied to predict slope stability under maximum 5-day antecedent rainfall, with outcomes compared to those of Scoops3D. Statistical results and spatial distribution maps both showed that the proposed framework achieved comparable accuracy to Scoops3D at various depths while significantly improving efficiency. The findings also highlighted the critical role of surface soil moisture (at 0.05 m) in slope stability predictions. This framework demonstrates the potential of integrating machine learning and physical models for efficient slope stability prediction, as well as supports the integration of remote sensing or field-measured surface soil moisture data for dynamic predictions in unsaturated soil conditions.
Similar content being viewed by others
Introduction
In recent times, the intensifying effects of global warming have led to notable climatic shifts, particularly seen in the increased intensity of rainfall across various regions1,2. This surge in precipitation plays a key role in the growing frequency of slope failures3,4,5, posing significant threats to infrastructure and public safety. Shallow landslide refers to the failure of the upper soil layers, typically within a few metres of the surface. These failures are often triggered by intense or prolonged rainfall, which elevates pore-water pressure, reduces matric suction, and decreases shear strength, which ultimately decreases slope stability. Shallow landslides are characterised by thin failure surfaces and localised movements, primarily involving unsaturated soils. The failure depth is generally less than 2 m, making these slopes highly sensitive to rainfall-induced changes in soil shear strength6,7. Incorporating unsaturated soil mechanics is crucial for accurately analysing slope stability under rainfall conditions8,9,10.
Rapid and reliable prediction of regional slope stability is essential to identify high-risk areas, facilitating prompt intervention and preventive measures. Methods for slope stability analysis can be broadly categorized into empirical methods, physically-based models, statistical models, and machine-learning models. The physically based techniques for estimating slope stability are generally capable of providing comprehensive and detailed insights and have been widely used by scholars worldwide over the past few decades. Although two-dimensional water balance and slope stability analyses have been extensively studied, they are typically applied to individual slopes rather than at a regional scale11,12,13,14,15. As a result, several regional models have been developed, many of which are based on assumptions that either overlook three-dimensional subsurface flow or differ from actual failure surfaces16,17,18. The commonly used physical models typically incorporate hydrological models and infinite slope, exemplified by the shallow land sliding stability model (SHALSTAB)19, stability index mapping (SINMAP)20, and transient rainfall infiltration and grid-based slope-stability (TRIGRS)21. However, SHALSTAB and SINMAP solely address the impact of lateral flow, neglecting the implications of the water infiltration process, whereas TRIGRS exclusively accounts for the vertical water flow but overlooks the lateral flow. Considering these limitations, Simoni et al.22 proposed the GEOtop-FS, which incorporates the three-dimensional (3D) Richards’ equation to take into account both lateral and vertical water flows, as well as the evapotranspiration process, comprehensively assessing the hydrological processes on slope stability. Based on the GEOtop model, Formetta & Capparelli23 demonstrated the significant effect of hydraulic conductivity anisotropy on slope stability. Cheng & Li24 further applied the GEOtop model at a regional scale, analysing the variations in slope stability along India’s National Highway 10 before and after stabilisation.
However, these widely used infinite slope models assume a translational failure surface and overlook the commonly occurring rotational failures. To compensate for this limitation, Scoops3D, a 3D slope stability model proposed by Reid et al.25, accounts for rotational slope failure surface by generating 3D columns within the framework of limit equilibrium analysis. Although studies have proved the efficacy of Scoops3D in slope stability analysis26,27, scholars still sought to explore ensemble methods of Scoops3D, given its lack of rainfall infiltration dynamics. Ip et al.28 incorporated unsaturated soil properties and variations in pore-water pressure by integrating TRIGRS and Scoops3D, demonstrating the effectiveness of this approach in slope stability mapping. Rahardjo et al.29 also deployed a combination of the two models, TRIGRS and Scoops3D, to map the slope susceptibility across different zones in Singapore. Nevertheless, the adoption of the integration of TRIGRS and Scoops3D does not take lateral flow into consideration because of the inherent characteristics of TRIGRS. Researchers have therefore recommended using GEOtop and Scoops3D together to account for rotational slides that consider both lateral and vertical flow in water balance analyses30,31. Rangarajan et al.31 verified the results from the GEOtop-Scoops3D analysis using 2D seepage and slope stability analyses in GeoStudio software (SEEP/W and SLOPE/W). The study found that the GEOtop model produced higher pore-water pressures than 2D seepage analysis, leading to a lower factor of safety in the 3D GEOtop-Scoops3D analysis compared to the 2D SEEP/W-SLOPE/W analysis.
Despite the thorough and intricate analyses offered by physically based models, they typically encounter a substantial computing burden and thus compromise on speed, especially in 3D analysis32, presenting a significant limitation in situations that require prompt response. To address this limitation, machine learning has emerged as a promising alternative. Known for its swift processing power, machine learning enables quick predictions, a crucial aspect of early warning systems33,34. Numerous studies have shown that machine learning techniques achieve practically-usable accuracy in slope stability analysis35,36,37,38,39,40, as well as demonstrating potential in 3D slope stability analysis and understanding the underlying mechanisms with interpretable models41,42,43. Therefore, the utilization of machine learning approaches not only heightens the accuracy of predictions but also facilitates real-time estimation of the FOS and understanding of the controlling factors, vital for prompt risk management and tailored mitigations.
However, it is crucial to note that the effectiveness of machine learning and deep learning, despite their rapid processing capabilities, can be diminished in scenarios involving extreme weather conditions. This limitation often stems from the lack of sufficient data representing such extreme conditions during the training phase of the model. The rarity of these events means that the models may not have been exposed to enough relevant data to accurately predict outcomes under such circumstances. This shortfall emphasizes the need for a balanced approach that perhaps combines the quick predictive power of machine learning with the depth and comprehensiveness of physically based models. Some scholars have explored the integration of data-driven methodologies with physical models, exemplified by Liu et al.44 who introduced a dual-driven approach that integrated Scoops3D and RF, utilising non-landslide points extracted from Scoops3D and a historical landslide inventory for RF model training. Their approach achieved a significant improvement, with an area under the curve (AUC) 20.1% greater than that obtained by the RF model alone. On the other hand, Xue et al.45 employed a matrix to integrate the landslide susceptibility mapping results obtained from the RF model with the slope stability outcomes derived from Scoops3D for landslide prediction, with a frequency ratio, when compared to solely using Scoops3D, was 2.24 times higher. However, as far as the authors are aware, there is no research that has integrated data-driven models with physical models in the context of risk management related to extreme weather events for the rapid and accurate prediction of slope stability.
Soil moisture has been proven to be an effective indicator for predicting slope stability, demonstrating impressive performance in various studies46,47,48,49,50. While field instruments provide precise measurements at specific points, their scope is limited. In contrast, remote sensing technologies can assess soil moisture over extensive areas. However, remote sensing methods are limited to analysing only surface soil layers, leading to crucial questions about the importance of soil moisture depth in assessing slope stability. To the best of the authors’ knowledge, no studies have focused on the variations in predicting FOS based on moisture contents in surface soil and in other soil layers. This aspect is vital in understanding and effectively predicting slope failures with the future application of remote sensing techniques on a larger regional scale, particularly in the context of the evolving global climate.
To sum up, the existing studies in slope stability prediction may still have limitations in the following aspects: (1) The existing research primarily utilised either physical models or machine learning models for slope stability analysis, with few studies integrating the strengths of both to rapidly predict unseen and infrequent scenarios during extreme rainfall events. (2) There were few studies that have accounted for the influence of soil moisture at different depths, and quantitative research is still required for the rationality of solely using surface soil moisture (e.g., retrieved from remote sensing). Thus, to bridge these research gaps, this study employed a small region in Singapore as the study area and carried out analyses as follows: (1) The GEOtop model was utilised to determine the pore-water pressure and volumetric water content under scenarios of both maximum daily and maximum 5-day antecedent rainfalls, with the outcomes imported into Scoops3D for FOS calculations. (2) The FOS outcomes under the maximum daily rainfall scenario were employed to train the machine learning model, allowing the efficient estimation of FOS in the maximum 5-day antecedent rainfall scenario and enabling a comparative analysis with prior physically-based modelling outcomes. (3) Soil moisture data incorporated at various depths were used within the machine learning models. The differences in the FOS estimates between the surface-level and multi-depth soil moistures are further discussed.
Physically-based modelling using GEOtop and Scoops3D
A zone in Singapore was selected for the water balance and slope stability analyses performed using the GEOtop51,52 and Scoops3D53 software, respectively. The selection of the zone was based on the ___location of the undisturbed soil sample which was obtained from a slope located along Dalvey Road. The slope is located in Bukit Timah Granite (BTG) formation and is within the BTG42 zonation as per Li et al.54 database. In this developed database, Li et al. divided Singapore into 97 zones, with each zone having similar soil properties. Hence, the soil properties obtained from the soil sampling at Dalvey Road were used to represent the soil for the BTG42 zone given that the zone is small and the soil is assumed to be homogenous. Laboratory tests were performed using the soil samples to acquire the soil properties for the numerical analysis which are presented in Table 1. The soil has been classified as clay of high plasticity (CH) under the Unified Soil Classification System and thus has a low saturated permeability of 3.13 × 10 –8 m/s. The grain size distribution curve is presented in Fig. 1a. The unsaturated soil properties such as the soil water characteristic curve and permeability function were defined using the van Genuchten (1980)55 and modified Mualem equations56. The van Genuchten soil water characteristic curve is presented in Fig. 1b. Most of the rainfall-induced slope failures in Singapore occurs within the shallow depths57,58. Thus, the soil between 0 and 10 m depth (i.e. layers 1 to 7) was used to analyse the relationship between soil moisture and factor of safety. The soil thickness for soil layers 1 to 7 modelled in the GEOtop analysis is presented in Fig. 2.
The GEOtop software which is a physically based hydrological model was utilised in this study to perform water balance analysis. This model analyses the subsurface flow in saturated and unsaturated soil using the three-dimensional (3D) Richards’ equation52. The water balance equation (Eq. 1) in GEOtop comprises the 3D Richards’ equation \(\:{q}_{sub}\)59,60 and evapotranspiration term S51.
where \(\:x\) is the position, t is time, \(\:{\theta\:}_{w}\) is the volumetric water content of the soil. The term S represents the evapotranspiration. The term \(\:\nabla\:\bullet\:{q}_{sub}\) refers to the divergence of water per unit volume and is calculated by the Richards’ equation in Eq. 2.
where ∇ is the gradient operator, ψ is the pressure head, z is the elevation head and the term \(\:K\) is the permeability.
The required input maps such as the Digital Elevation Model (DEM), slope angle, aspect, and groundwater table (Fig. 3) are used to define the precise topography and groundwater conditions of the zone. The initial pore-water pressure was limited to − 100 kPa due to the deep groundwater table depth of 47 m as seen in Fig. 361. The resolution of the maps used in the GEOtop analysis is 5 m.
Different soils respond differently to varying rainfall patterns, influencing slope stability in distinct ways. Studies on slope stability in Singapore have shown that regions with low-permeability soils are predominantly affected by antecedent rainfall (e.g., 5-day antecedent rainfall), while areas with high-permeability soils are more influenced by maximum daily rainfall62,63. In the studied zone, characterized by CH soil, the 5-day antecedent rainfall plays a crucial role. Two different extreme rainfall conditions were applied in the water balance analyses; maximum daily and maximum 5-day antecedent rainfall. In the machine learning model, results from physically-based models under the maximum daily rainfall condition were used to train the model and predict slope stability under the maximum 5-day rainfall condition. These predictions were then compared with the physically-based model results under the maximum 5-day rainfall condition to evaluate the model’s performance. For the maximum daily rainfall analysis, an extreme rainfall intensity of 22.2 mm/h suggested by Singapore’s Public Utilities Board (PUB) Code of Practice on Surface Water Drainage guidelines64 was applied continuously for 24 h followed by another 24 h of no rainfall. For the maximum 5-day antecedent rainfall analysis, a constant rainfall intensity of 4.94 mm/hour was applied for 5 continuous days followed by an additional 5 days with no rainfall65.
Inputs maps used in water balance analysis: (a) DEM; (b) Groundwater table; (c) Slope angle; (d) Aspect map for BTG42. (software: ArcGIS Pro; version number: 3.1; URL: https://www.esri.com/en-us/arcgis/products/arcgis-pro/overview).
The average pore-water pressure results from the water balance analyses are presented in Fig. 4. Pore-water pressure changes were only observed in the first layer (0.05 m depth) in the first hour of rainfall. Subsequently, the pore-water pressure increases from − 10 kPa to 0 kPa and − 30 kPa to 0 kPa between the 1st hour and 24th hour timesteps for the maximum daily and 5-day antecedent rainfall conditions, respectively. Moreover, the pore-water pressure at the deeper depth of 0.4 m also experienced an increase in pore-water pressure from − 62 kPa to approximately − 31 kPa for both rainfall conditions. Therefore, the pore-water pressure changes can be observed to be increasing with depth as the duration of rainfall increases.
The pore-water pressures during the 1st and 24th hour timestep are similar under both maximum daily and 5-day antecedent rainfall conditions. This suggests that despite the higher rainfall intensity in the maximum daily rainfall analysis, the corresponding pore-water pressures are not higher. This is due to the rainfall intensity (22.2 mm/hour = 6.16E−06 m/s and 4.94 mm/hour = 1.37E−06 m/s) exceeding the saturated permeability of the soil (3.13E−08 m/s). Hence, the extended duration of rainfall in the 5-day antecedent rainfall analysis resulted in further penetration of rainfall into deeper soil layers as seen in Fig. 4 where pore-water pressures continued to increase after 24th hour with rainfall infiltration in deeper depths. Nevertheless, likely due to the low permeability of the soil, rainfall infiltrations into depths below 1.5 m remained minimal over the simulation period, resulting in limited variations in pore-water pressure in these deeper layers.
The pore-water pressure and volumetric water content results for the 1st, 24th, and 48th hour timesteps of the maximum daily rainfall analysis and 1st hour, 1, 2.5, 5, 7.5, and 10 days timesteps from the maximum 5-day antecedent rainfall analysis were thereafter imported into the Scoops3D software for the slope stability analysis. The Scoops3D software utilises the 3D limit equilibrium method, Bishop’s Simplified Method to calculate the factor of safety (FOS, Eqs. 3 and 4) for each 3D column defined by the resolution of the DEM53. Therefore, to allow for a more precise slope stability analysis, a 1 m resolution DEM was used.
where \(\:{R}_{i,j}\) is the radial distance from the rotation axis to the centre of the base of each column, \(\:{c}_{i,j}\) is the cohesion, \(\:{A}_{{h}_{i,j}}\) is the horizontal area of the trial slip surface, \(\:{W}_{i,j}\) is the weight of the column, \(\:{u}_{i,j}\) is the pore-water pressure derived from the GEOtop model, \(\:{\phi\:}_{i,j}\) is the internal friction angle, \(\:{{\upalpha\:}}_{i,j}\) is the apparent dip in the sliding direction, \(\:{\in}_{i,j}\) is the actual dip of the column, \(\:{k}_{eq}\) is the horizontal pseudo-acceleration coefficient, \(\:{e}_{i,j}\) is the moment arm for the horizontal driving force. A schematic diagram illustrating the definition of the parameters used in Eqs. 3 and 4 has been provided by Reid et al.53.
Slopes with a FOS less than 1.5 were considered as slopes of concern following the recommendations by the Housing and Development Board of Singapore for slope design66. Therefore, the percentage of pixels with FOS less than 1.5 was calculated for both rainfall analyses (Fig. 5). The results show that the FOS continued to decrease after the 24th hour with the end of the maximum 5-day rainfall event having the highest percentage of pixels. These observations agree with the pore-water pressure results in Fig. 4. The higher pore-water pressures at the end of the maximum 5-day rainfall analysis as compared to the end of the 24-hour maximum daily rainfall analysis resulted in more areas having a lower FOS. At the end of the rainfall event, the maximum 5-day antecedent rainfall analysis had 0.95% of pixels with FOS less than 1.5 while the maximum daily rainfall analysis only had 0.38%. Hence, the slopes in the BTG42 zone are more susceptible to failure under low rainfall intensity but longer duration rainfall events (5-day antecedent rainfall) as compared to rainfall events with higher intensity but shorter duration (maximum daily rainfall). The computational time took to run the GEOtop and Scoops3D analyses are presented in Table 2 below.
Machine learning model using random forest regression
The outcomes of the regional FOS assessments, which are derived from physically based models utilising GEOtop for water balance evaluations and Scoops3D for slope stability analyses, serve as a reliable reference point. While computation time for one timestep in regional 3D analyses is time-consuming (Table 2), the results could identify slopes that warrant attention at specific timesteps during critical rainfall scenarios. However, to obtain FOS maps for intervals not covered by these specific timesteps, significant additional computing time is required. Consequently, analysing FOS under typical, rather than critical, rainfall conditions with physically based models demand even more computational effort. Utilising FOS maps generated under critical conditions for real-time monitoring could lead to conservative outcomes. There is a likelihood that some slopes, considered unstable under extreme rainfall conditions, may not present a significant hazard under less severe weather circumstances, particularly when current meteorological conditions are much less intense than the potential extremes that have yet to occur.
Therefore, machine learning methods are proposed as an efficient and effective alternative for real-time monitoring and early warning systems due to their rapid processing and strong nonlinear capabilities. In this study, data from maximum daily rainfall events, encompassing 24-hour rainfall intervals, and 24-hour dry intervals, was employed to train the machine learning model. Subsequently, the model was used to estimate the FOS during the maximum 5-day antecedent rainfall event and to contrast these estimations with outcomes from physically based modelling under identical rainfall conditions. The comparisons were conducted over various timesteps: 1 h, 1 day, 2.5 days, 5 days, 7.5 days, and 10 days, corresponding to times of maximum 5-day preceding rainfall and 5-day absence of rainfall. Previous studies have demonstrated that the random forest (RF) model can achieve satisfactory results in slope stability prediction67,68,69,70. The RF model was developed in this research using input variables that include slope angle, groundwater table (GWT) levels, landcover, as well as VWC results from the GEOtop analyses. To improve the model’s accuracy, the original dataset, which had a 5m resolution, was resampled to a more detailed 1m resolution prior to training the RF model. 80% of the data were designated for training the model, while the remaining 20% were reserved for evaluating the performance of the trained model. Additionally, five-fold cross-validation was implemented to fine-tune the model parameters.
Furthermore, it is widely recognised that variations in the soil moisture content layer are crucial for monitoring slope stability71,72. Yet, the significance of this factor in predicting the FOS remains unclear. The use of remotely sensed soil moisture data in slope stability estimation is often viewed with scepticism, primarily because these methods are limited to measuring moisture in the surface layer, without offering insights into the deeper, more substantive soil layers. This raises questions about the reliability of using just the moisture content of the first layer (L1 VWC) for estimating FOS and how it compares to the analyses that incorporate moisture contents from multiple layers. Considering the potential correlation between surface moisture content and the moisture contents of deeper soil layers, it is conceivable that reliable FOS estimates might be derived from the surface VWC alone. Therefore, this study extended its investigation to explore the impact of integrating soil moisture data from different depths, aiming to understand how the inclusion of information from various layers influences FOS. This exploration assessed the effect of using VWC data from all the seven layers, solely from the top layer, or from the selected combinations of layers (Fig. 2), on the accuracy and reliability of FOS estimations. This evaluation involved utilising varying sets of soil moisture inputs when training RF models, which included VWC for all seven layers, as well as subsets ranging from layers 1 to 7, down to layer 1 only. These soil moisture data, along with slope angle, GWT levels, and land cover, served as input variables, with their quantities detailed in Table 3. The study then analysed how these different data configurations influenced the FOS estimations.
Figure 6 presents the comparison between the predicted FOS derived from the RF model and the actual FOS obtained from Scoops3D, under the maximum daily rainfall conditions. The FOS was divided into four categories for slope susceptibility mapping, as recommended by Rahardjo et al.73 : very high risk (FOS ≤ 1), high risk (1 < FOS ≤ 1.25), moderate risk (1.25 < FOS ≤ 1.5), and low risk (FOS > 1.5). This comparison aims to evaluate the model’s performance with the test dataset under scenarios of maximum daily precipitation. Figure 6a features the comparison using VWC data from all seven layers (L1-7) within the RF model, establishing a baseline for comparison. Meanwhile, Fig. 6b highlights the scenario where only the VWC data from the top layer (L1) is used in the RF model, illustrating the predicted FOS under the limited condition of involving only the surface layer’s VWC. Additional analyses that involve various combinations of VWC data from differing soil layers within the RF model are elaborated upon in Supplementary Fig. A1. Metrics such as the mean absolute error (MAE), root mean square error (RMSE), and the coefficient of determination (R2) were calculated to assess model performance. Generally, there is a good trend, with the results closely aligning with the 1:1 reference line (black dotted). Notably, the model trained with VWC data from all seven layers exhibited slightly better performance compared to the model with only the first layer’s VWC, evidenced by lower MAE, RMSE, and higher R2 values. Nonetheless, the implications of these findings on downstream outcomes, like FOS maps under alternate scenarios, remain to be explored. As such, scenarios involving the maximum 5-day antecedent rainfall were also examined to discern differences in FOS maps.
Figure 7 displays the estimated FOS maps obtained from the RF model and Scoops3D under conditions of maximum 5-day antecedent rainfall, which is characterized as a period of 5 consecutive days of rainfall followed by 5 days without rainfall, at three representative timesteps: the 1st day, 5th day (marking the end of rainfall), and 10th day. The FOS maps in Fig. 7 from the RF model include results from two configurations: one incorporating VWC from all seven layers and another focusing solely on the top layer’s VWC variations. Figure 7 presents the FOS maps for three representative timesteps—1st day, 5th day (end of rainfall), and 10th day (end of no rainfall)—for a specific section of the study area, allowing for better visual comparison. The FOS map for the entire zone is provided in Supplementary Figure A2. FOS maps using all combinations of VWC data from various soil depths at each timestep are shown in Supplementary Figures B1–B6. The FOS distribution patterns from the Scoops3D and RF models are highly similar, with only minor differences. Notably, RF predictions using only the top layer’s VWC (RF FOS L1 SM) align more closely with Scoops3D than those incorporating data from all seven layers (RF FOS L1-7 SM).
Alongside visual evaluation, a detailed quantitative analysis was performed to determine the pixel distribution within various FOS categories, excluding the low-risk category pertinent to slope stability (with its percentage being the complement to 100% of the other categories). As illustrated in Fig. 8, this analysis provided a comparison between the results obtained from Scoops3D and those from RF models that accounted for VWC across all seven layers (L1-7), in addition to RF models that exclusively considered VWC from the surface layer (L1). During the rainfall event, spanning 5 days, a notable decline in FOS was recorded, leading to a marked increase percentage of pixels in all FOS categories (≤ 1.5) by the end of rainfall. Throughout the 10-day analysis, the variance in FOS between Scoops3D and RF models remains minimal for the very high-risk category with FOS ≤ 1 (represented by red lines). In the high-risk category (1 < FOS ≤ 1.25), highlighted by orange lines, the results from Scoops3D and RF (L1) are closely aligned, whereas RF (L1-7) reveals a substantially larger percentage of pixels within this risk level. At the moderate risk tier (1.25 < FOS ≤ 1.5), denoted by blue lines, findings from both RF (L1) and RF (L1-7) show an increased count of pixels in this category compared to Scoops3D, with RF (L1) presenting a further elevated number of pixels. When comparing the outcomes of the three models across various timesteps, a more significant discrepancy is noticed as rainfall progresses, yet the results converge closely after the rainfall stops.
In summary, although the comparisons typically present closely aligned results, RF models are inclined to generate more conservative FOS maps, resulting in lower FOS values compared to those produced by Scoops3D, particularly during ongoing rainfall events. Therefore, the RF model has proven to be an effective and efficient alternative for estimating regional FOS distributions, bypassing the need for more time-intensive physically based models like Scoops3D. This model leveraged the outcomes of physically based modelling under extreme conditions, encompassing a broad range of vertical soil moisture profiles and spatial distributions. It enables the prediction of real-time FOS distributions with the VWC measurements, rendering it an essential asset for early warning systems.
To understand the influence of incorporating soil moisture measurements from various depths, the performance of RF models that considered different numbers of soil layers was evaluated. This involved calculating the MAE and the RMSE in comparison to the Scoops3D results for each model configuration. To demonstrate performance variations of the different model configurations that incorporate moisture contents from different soil depths, MAE and RMSE values, derived from models including VWC from all seven layers, were set as the baseline. Changes in MAE and RMSE, expressed as percentages, were subsequently computed to highlight the impact of integrating varying moisture contents with soil depths into each model configuration. These changes were plotted to visually present how different soil depth integrations affect each model’s performance, as depicted in Fig. 9. The analysis indicated that incorporating moisture measurements from deeper soil layers did not consistently improve the model’s performance. After rainfall events, the surface soil layer was more significantly influenced by rainfall, resulting in more distinct fluctuations in VWC over time. This variability in surface soil moisture provided the model with distinctive features for identifying changes in slope stability across different time intervals. In contrast, when deeper layers such as layer 4 and layer 5 were included, which exhibited minimal response to rainfall (as shown in Fig. 4), these layers contribute limited variability due to the low permeability of the soil. This lack of variation in deeper layers can reduce the distinctiveness of the VWC signals from the surface layer, which may potentially affect the model’s ability to accurately capture and predict the slope stability variations.
Consequently, compared to the RF(L1), models RF(L1-2), RF(L1-3), and RF(L1-4), which included soil moisture from additional layers and greater depths, exhibited lower accuracy, as evidenced by higher RMSE and MAE values throughout the 10-day prediction period. This observation shows that models incorporating only the surface layer’s VWC demonstrated strong performance. Furthermore, when compared to models RF(L1-5), RF(L1-6), and RF(L1-7), the RF(L1)’s performance remained comparable, displaying smaller MAE and RMSE at most evaluated timesteps, except for a slight 5% increase in error at the 2.5-day timestep for models using only the surface layer compared to those including all seven layers. For other time intervals, the performance was on par or better, with lower MAE and RMSE, highlighting the efficiency of models that rely exclusively on the surface layer’s VWC.
Therefore, the results highlight the critical role of surface layers in FOS estimation and the potential for utilising solely surface moisture. In recent years, numerous missions have been launched to observe surface soil moisture from space, including the Soil Moisture Active Passive (SMAP) and Soil Moisture and Ocean Salinity (SMOS), which are measured in the 0–5 cm layer74,75, and the Global Land Data Assimilation System (GLDAS) which can provide soil moisture in the top 10 cm of the soil column76. The success and results of this research could pave the way for the adoption of remote sensing technologies over the traditional field-based moisture sensing methods, significantly enhancing the efficiency of data collection procedures and slope stability mapping over a large area.
However, this study presents a model designed specifically for a single zone in Singapore, where soil properties were assumed to be constant and not considered as variables. The primary aim of this work was to propose a framework (Fig. 10) for training a model that efficiently predicts FOS using surface soil moisture data. Due to the assumption of constant soil properties, the model is currently limited to the illustrated zone. To enhance its applicability and generalisability, future research should focus on expanding the analysis to multiple zones with varying soil properties, such as different soil types and diverse hydraulic and mechanical characteristics. Incorporating these soil properties as input variables would improve the model’s ability to predict FOS in regions with high soil property variability, thereby increasing its generalisability across larger areas and to other parts of the world.
Conclusions
This study introduced a methodology utilising a random forest algorithm for efficient regional slope stability prediction in response to precipitation, with a particular focus on the integration of spatiotemporal variability of soil moisture. Through the comparative analysis of the RF model trained with VWC data from various soil depths against the outcomes derived from the GEOtop and Scoops3D models, the feasibility and reliability of the approach were demonstrated.
The verification under the maximum daily rainfall condition indicated favourable results for RF (L1) and RF (L1-7), while the latter showed a slightly superior performance. Additionally, all RF models trained with combinations of VWC data from differing soil layers exhibited MAE lower than 0.15 and RMSE lower than 0.20, suggesting the potential and effectiveness of the proposed method for efficient slope stability prediction under extreme rainfall conditions. For the further comparison under the maximum 5-day antecedent rainfall condition, notable deviations were observed from the RF (L1) on the 5th day, with a greater number of pixels being classified into the moderate risk category, indicating a tendency towards conservative prediction. Nonetheless, RF (L1) demonstrated greater alignment with Scoops3D than RF (L1-7) in the high-risk category, and predictions of both were highly consistent with Scoops3D in the very high-risk tier. Moreover, RF (L1) demonstrated comparable or even superior accuracy in most scenarios compared to RF (L1-7), emphasizing the importance of surface soil moisture (0.05 m) in slope stability prediction. However, the inclusion of deeper soil moisture data did not consistently improve the performance of the RF models, with a significant accuracy decrease observed in RF (L1-2), RF (L1-3), and RF (L1-4).
In general, this framework is capable of leveraging the strengths of both physically based models and machine-learning approaches for efficient slope stability prediction, offering valuable insights for early warning and mitigation efforts in the context of increasing extreme rainfall events driven by global climate change. While this study highlights the critical role of surface soil moisture in slope stability predictions, these findings are particularly relevant to low-permeability soils. Future study could further investigate whether similar conclusions apply to other soil types. The incorporation of spatiotemporal variations of soil moisture also reinforces the potential of remote sensing-retrieved soil moisture datasets for rapid and dependable slope stability predictions and landslide susceptibility mapping at a larger scale in future studies.
Data availability
The datasets used and/or analysed during the current study will be made available from the corresponding author on reasonable request.
References
Martel, J. L., Brissette, F. P., Lucas-Picher, P., Troin, M. & Arsenault, R. Climate change and rainfall intensity–duration–frequency curves: overview of science and guidelines for adaptation. J. Hydrol. Eng. 26, 03121001 (2021).
Myhre, G. et al. Frequency of extreme precipitation increases extensively with event rareness under global warming. Sci. Rep. 9, 16063 (2019).
Lu, M., Zhang, J., Zheng, J. & Yu, Y. Assessing annual probability of rainfall-induced slope failure through a mechanics-based model. Acta Geotech. 17, 949–964 (2022).
Liu, X. & Wang, Y. Quantifying impacts of precipitation scenarios projected under climate change on annual probability of rainfall-induced landslides at a specific slope. Comput. Geotech. 167, 106063 (2024).
Pirone, M., Forte, G., Santo, A. & Urciuoli, G. Novel rainfall thresholds for shallow slip prediction based on field monitoring: case study of the Lattari mountains, Italy. J. Geotech. Geoenviron. Eng. 151, 05024016 (2025).
Bordoni, M. et al. Hydrological factors affecting rainfall-induced shallow landslides: from the field monitoring to a simplified slope stability analysis. Eng. Geol. 193, 19–37 (2015).
Li, Y. et al. Database of soil properties incorporating organic content from roots and soil organisms for regional slope stabilisation. Sci. Rep. 15, 1066 (2025).
Johari, A. & Gholampour, A. A practical approach for reliability analysis of unsaturated slope by conditional random finite element method. Comput. Geotech. 102, 79–91 (2018).
Li, Y., Satyanaga, A. & Rahardjo, H. Characteristics of unsaturated soil slope covered with capillary barrier system and deep-rooted grass under different rainfall patterns. Int. Soil. Water Conserv. Res. 9, 405–418 (2021).
Xue, Y., Miao, F., Li, J., Wu, Y. & Li, L. Probabilistic back analysis of reservoir landslide considering hydro-mechanical coupled observations. Comput. Geotech. 176, 106798 (2024).
Cho, S. E. & Lee, S. R. Instability of unsaturated soil slopes due to infiltration. Comput. Geotech. 28, 185–208 (2001).
Johari, A., Hooshmand Nejad, A. & Mousavi, S. Probabilistic model of unsaturated slope stability considering the uncertainties of soil-water characteristic curve. Sci. Iran. 0, 0–0 (2017).
Johari, A. & Talebi, A. Stochastic analysis of rainfall-induced slope instability and steady-state seepage flow using random finite-element method. Int. J. Geomech. 19, 04019085 (2019).
Li, S., Qin, C., Chian, S. C. & Zhang, W. Another look at the stability of unsaturated soil slopes considering nonuniformity and nonlinearity. Comput. Geotech. 148, 104743 (2022).
Xue, Y. et al. Slope stability analysis with a hypoplastic constitutive model: investigating a stochastic anisotropy model and a hydro-mechanical coupled simulation. Int. J. Numer. Anal. Methods Geomech. 48, 1846–1863 (2024).
Lu, N. & Godt, J. Infinite slope stability under steady unsaturated seepage conditions. Water Resour. Res. 44, (2008).
Lin, W. et al. Landslide hazard assessment of rainfall-induced landslide based on the CF-SINMAP model: a case study from Wuling mountain in Hunan Province, China. Nat. Hazards. 106, 679–700 (2021).
Placido Antonio Gatto, M., Montrasio, L. X-SLIP: A SLIP-based multi-approach algorithm to predict the spatial–temporal triggering of rainfall-induced shallow landslides over large areas. Comput. Geotech. 154, 105175 (2023).
Montgomery, D. R. & Dietrich, W. E. A physically based model for the topographic control on shallow landsliding. Water Resour. Res. 30, 1153–1171 (1994).
Pack, R. T., Tarboton, D. G. & Goodwin, C. N. The SINMAP approach to terrain stability mapping. (1998).
Baum, R. L., Savage, W. Z. & Godt, J. W. TRIGRS: A Fortran Program for Transient Rainfall Infiltration and Grid-Based Regional Slope-Stability Analysis, Version 2.0 (US Geological Survey Reston, 2008).
Simoni, S., Zanotti, F., Bertoldi, G. & Rigon, R. Modelling the probability of occurrence of shallow landslides and channelized debris flows using GEOtop-FS. Hydrol. Process. 22, 532–545 (2008).
Formetta, G. & Capparelli, G. Quantifying the three-dimensional effects of anisotropic soil horizons on hillslope hydrology and stability. J. Hydrol. 570, 329–342 (2019).
Cheng, Y. & Li, Y. Application of capillary barrier systems for slope stabilization under extreme rainfall: A case study of National Highway 10, India. Infrastructures. 9, (2024).
Reid, M. E., Christian, S. B. & Brien, D. L. Gravitational stability of three-dimensional stratovolcano edifices. J. Geophys. Res. Solid Earth. 105, 6043–6056 (2000).
Rashid, B., Iqbal, J. & Su, L. Landslide susceptibility analysis of Karakoram highway using analytical hierarchy process and scoops 3D. J. Mt. Sci. 17, 1596–1612 (2020).
Marin, R. J. & Long, M. 2D and 3D numerical modelling for preliminary assessment of long-term deterioration in Irish glacial till geotechnical slopes. Georisk Assess. Manag. Risk Eng. Syst. Geohazards. 1–27 https://doi.org/10.1080/17499518.2024.2379946 (2024).
Ip, S. C. Y., Rahardjo, H. & Satyanaga, A. Three-dimensional slope stability analysis incorporating unsaturated soil properties in Singapore. Georisk Assess. Manag. Risk Eng. Syst. Geohazards. 15, 98–112 (2021).
Rahardjo, H. et al. Slope susceptibility map for preventive measures against rainfall-induced slope failure. Urban Lifeline. 1, 5 (2023).
Li, Y. & Duan, W. Decoding vegetation’s role in landslide susceptibility mapping: an integrated review of techniques and future directions. Biogeotechnics 2, 100056 (2024).
Rangarajan, S., Rahardjo, H., Satyanaga, A. & Li, Y. Influence of 3D subsurface flow on slope stability for unsaturated soils. Eng. Geol. 339, 107665 (2024).
Wang, J. H. & Xu, W. J. Slope stability and failure dynamics of rainfall-induced landslide: algorithm and applications. Comput. Geotech. 177, 106919 (2025).
Thirugnanam, H., Ramesh, M. V. & Rangan, V. P. Enhancing the reliability of landslide early warning systems by machine learning. Landslides 17, 2231–2246 (2020).
Wang, L., Wu, C., Yang, Z. & Wang, L. Deep learning methods for time-dependent reliability analysis of reservoir slopes in spatially variable soils. Comput. Geotech. 159, 105413 (2023).
Azmoon, B., Biniyaz, A. & Liu, Z. Evaluation of deep learning against conventional limit equilibrium methods for slope stability analysis. Appl. Sci. 11, 6060 (2021).
Lin, S., Zheng, H., Han, C., Han, B. & Li, W. Evaluation and prediction of slope stability using machine learning approaches. Front. Struct. Civ. Eng. 15, 821–833 (2021).
Lin, S. et al. Comparative performance of eight ensemble learning approaches for the development of models of slope stability prediction. Acta Geotech. 17, 1477–1502 (2022).
Li, X., Nishio, M., Sugawara, K., Iwanaga, S. & Chun, P. Surrogate model development for slope stability analysis using machine learning. Sustainability 15, 10793 (2023).
Cai, Y., Yuan, Y. & Zhou, A. Predictive slope stability early warning model based on catboost. Sci. Rep. 14, 25727 (2024).
Onyelowe, K. C., Moghal, A. A. B., Ahmad, F., Rehman, A. U. & Hanandeh, S. Numerical model of debris flow susceptibility using slope stability failure machine learning prediction with metaheuristic techniques trained with different algorithms. Sci. Rep. 14, 19562 (2024).
Meng, J., Mattsson, H. & Laue, J. Three-dimensional slope stability predictions using artificial neural networks. Int. J. Numer. Anal. Methods Geomech. 45, 1988–2000 (2021).
Youssef, K., Shao, K., Moon, S. & Bouchard, L. S. Landslide susceptibility modeling by interpretable neural network. Commun. Earth Environ. 4, 162 (2023).
Lin, S. et al. A comprehensive evaluation of ensemble machine learning in geotechnical stability analysis and explainability. Int. J. Mech. Mater. Des. 20, 331–352 (2024).
Liu, S. et al. A physics-informed data-driven model for landslide susceptibility assessment in the three Gorges reservoir area. Geosci. Front. 14, 101621 (2023).
Xue, Z., Feng, W., Yi, X., Dun, J. & Wu, M. Integrating data-driven and physically based landslide susceptibility methods using matrix models to predict reservoir landslides. Adv. Space Res. 73, 1702–1720 (2024).
Zhao, B. et al. Probabilistic thresholds for landslides warning by integrating soil moisture conditions with rainfall thresholds. J. Hydrol. 574, 276–287 (2019).
Marino, P., Peres, D. J., Cancelliere, A., Greco, R. & Bogaard, T. A. Soil moisture information can improve shallow landslide forecasting using the hydrometeorological threshold approach. Landslides 17, 2041–2054 (2020).
Wicki, A. et al. Assessing the potential of soil moisture measurements for regional landslide early warning. Landslides 17, 1881–1896 (2020).
Li, L. et al. Study on the stability analysis of rainfall slope based on G-A model considering moisture content. Sci. Rep. 12, 10480 (2022).
Guo, Y., Xu, Z., Zhu, S., Luo, X. & Xiao, Y. Using distributed root soil moisture data to enhance the performance of rainfall thresholds for landslide warning. Nat. Hazards. 115, 1167–1192 (2023).
Rigon, R., Bertoldi, G. & Over, T. M. GEOtop: A distributed hydrological model with coupled water and energy budgets. J. Hydrometeorol. 7, 371–388 (2006).
Endrizzi, S., Gruber, S., Dall’Amico, M. & Rigon, R. GEOtop 2.0: simulating the combined energy and water balance at and below the land surface accounting for soil freezing, snow cover and terrain effects. Geosci. Model. Dev. 7, 2831–2857 (2014).
Reid, M. E., Christian, S. B., Brien, D. L. & Henderson, S. T. Scoops3D: Software to Analyze 3D Slope Stability throughout a Digital Landscape. 236 https://pubs.usgs.gov/publication/tm14A1 (2015). https://doi.org/10.3133/tm14A1
Li, Y., Rahardjo, H., Satyanaga, A., Rangarajan, S. & Lee, D. T.-T. Soil database development with the application of machine learning methods in soil properties prediction. Eng. Geol. 306, 106769 (2022).
van Genuchten, M. Th. A Closed-form equation for predicting the hydraulic conductivity of unsaturated soils. Soil. Sci. Soc. Am. J. 44, 892–898 (1980).
Mualem, Y. A new model for predicting the hydraulic conductivity of unsaturated porous media. Water Resour. Res. 12, 513–522 (1976).
Rahardjo, H., Li, X. W., Toll, D. G. & Leong, E. C. The effect of antecedent rainfall on slope stability. In Unsaturated Soil Concepts and their Application in Geotechnical Practice (ed Toll, D. G.) 371–399. https://doi.org/10.1007/978-94-015-9775-3_8. (Springer Netherlands, 2001).
Rahardjo, H. et al. Slope failures in Singapore due to rainfall. In Proc., 10th Australia New Zealand Conf. on Geomechanics 704–709 (2007).
Richards, L. A. Capillary conduction of liquids through Porous mediums Phys. 1, 318–333 (1931).
Freeze, R. A. Role of subsurface flow in generating surface runoff: 2. Upstream source areas. Water Resour. Res. 8, 1272–1283 (1972).
Nistor, M. M. et al. Investigation of groundwater table distribution using borehole piezometer data interpolation: case study of Singapore. Eng. Geol. 271, 105590 (2020).
Rahimi, A., Rahardjo, H. & Leong, E. C. Effect of antecedent rainfall patterns on rainfall-Induced slope failure. J. Geotech. Geoenviron. Eng. 137, 483–491 (2011).
Kim, Y. et al. Assessment of critical rainfall scenarios for slope stability analyses based on historical rainfall records in Singapore. Environ. Earth Sci. 81, 39 (2022).
Public Utilities Board. Code of practice on surface water drainage. (Seventh Edition–Dec 2018 with amendments under Addendum No. 1 (2018). Singapore: Public Utilities Board (PUB) (2021).
Rahardjo, H. et al. Spatial distribution, variation and trend of five-day antecedent rainfall in Singapore. Georisk Assess. Manag. Risk Eng. Syst. Geohazards. 14, 177–191 (2020).
HDB (Housing and Development Board). HDB Design, Building & Quality Requirements for A&AWork on HDB Premises (Housing and Development Board, 2012).
Lin, Y., Zhou, K. & Li, J. Prediction of slope stability using four supervised learning methods. IEEE Access. 6, 31169–31179 (2018).
Qi, C. & Tang, X. Slope stability prediction using integrated metaheuristic and machine learning approaches: A comparative study. Comput. Ind. Eng. 118, 112–122 (2018).
Moayedi, H., Bui, D. T., Kalantar, B. & Foong, L. K. Machine-learning-based classification approaches toward recognizing slope stability failure. Appl. Sci. 9, 4638 (2019).
Feng, L. et al. Comparative analysis of machine learning methods and a physical model for shallow landslide risk modeling. Sustainability 15, 6 (2022).
Ray, R. L., Jacobs, J. M. & De Alba, P. Impacts of unsaturated zone soil moisture and groundwater table on slope instability. J. Geotech. Geoenviron. Eng. 136, 1448–1458 (2010).
Cai, J. S. et al. Importance of variability in initial soil moisture and rainfalls on slope stability. J. Hydrol. 571, 265–278 (2019).
Rahardjo, H., Li, Y. & Satyanaga, A. The importance of unsaturated soil properties in the development of slope susceptibility map for Old Alluvium in Singapore. E3S Web Conf. 382, 06009 (2023).
Entekhabi, D. et al. The soil moisture active passive (SMAP) mission. Proc. IEEE. 98, 704–716 (2010).
Kerr, Y. H. et al. The SMOS Mission: New Tool for Monitoring Key Elements ofthe Global Water Cycle. Proc. IEEE 98, 666–687 (2010).
Rodell, M. et al. The global land data assimilation system. Bull. Am. Meteorol. Soc. 85, 381–394 (2004).
Acknowledgements
The authors would like to acknowledge the assistance of Adeline Tong Ying Li, Lai Han Ming, Tian Zuyu and staff of the Geotechnics Laboratory, School of Civil and Environmental Engineering, Nanyang Technological University, Singapore, as well as Shengjie Wei of the Southeast University-Monash University Joint Graduate School, Suzhou, PR China, during the data collection for this study.
Author information
Authors and Affiliations
Contributions
Y.L.: Conceptualization, methodology, software, formal analysis, data curation, investigation, supervision, visualization, writing - original draft, writing—review and editing. S.R.: Methodology, software, formal analysis, data curation, investigation, visualization, writing—original draft, writing—review and editing. Y.C.: Methodology, investigation, visualization, writing—original draft, writing—review and editing. H.R.: Data curation, investigation, writing—review and editing. A.S.: Data curation, investigation, writing—review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Li, Y., Rangarajan, S., Cheng, Y. et al. Random forest-based prediction of shallow slope stability considering spatiotemporal variations in unsaturated soil moisture. Sci Rep 15, 8751 (2025). https://doi.org/10.1038/s41598-025-92739-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-92739-6