Abstract
Groundwater is crucial for human society’s development in piedmont plains, yet its hydrogeochemistry often exhibits complex spatial distributions due to the interplay of nature and human factors. Ninety-two phreatic groundwater samples were collected from a typical piedmont plain in northern China and analyzed using self-organizing map combined with hydrogeochemical simulation, diagrams, and the entropy-weighted water quality index. Groundwater samples were categorized into four clusters, demonstrating a gradual hydrogeochemical facies evolution from HCO3-Ca to Cl-Mg·Ca and Cl-Na, along with an increase in NO3− content in the order of clusters IV, II, III, and I. Natural processes, including silicates weathering and reverse cation-exchange, establish the natural fundamental framework of groundwater chemistry, which is furtherly sculptured by agricultural substances input. Groundwater quality was predominantly excellent or good, with entropy-weighted water quality index (EWQI) values below 100 at over 92% of the sampling sites. Groundwater quality is relatively poorer in the upstream areas near the mountains and along the Hutuo River, where the stratum permeability is high, but improves in the downstream areas where permeability is lower. Agricultural land use and spatial variation in aquifer permeability are responsible for the observed spatial variations in groundwater chemistry. Agricultural contaminants warrant attention for the protection of groundwater quality in piedmont plains that with long-term agricultural activities, especially in the upstream areas near the mountains. This research improves the understanding of the spatial distribution and variation of groundwater chemistry in piedmont plains, and provides scientific guidance for related groundwater development and management.
Similar content being viewed by others
Introduction
As a vital part of water resources, groundwater is indispensable in water resource supply for daily life, agricultural production and industrial activities worldwide1,2,3. This is especially significant in semi-arid and arid plains where with scarce surface water resource, large population, as well as dense agricultural and industrial activities4,5,6. It is estimated that at least millions of people are suffering to diseases related to drinking water chemical quality in water-scarce regions over the world7,8. In this context, it is of great significance to perform in-depth studies on the hydrogeochemical characteristics, spatial evolution and genesis of groundwater quality to ensure safe water supply and sustainable groundwater resource utilization in drought-prone regions globally.
Hydrogeochemical compositions are the fundament determining groundwater quality suitability for diverse usages9,10. Numerous processes and factors can influence and even control groundwater hydrogeochemical composition11,12. Generally, natural factors establish the initial framework of groundwater chemical composition13,14,15. Basic physicochemical characteristics are inherited by groundwater from the recharge water16,17. Then, various hydrological processes and hydrogeochemical interactions would further sculpture the hydrogeochemical features of groundwater along its flow path18,19. Substances produced by geogenic processes would enter and be enriched in groundwater during this process, and lead it to be beneficial or harmful to water consumers’ health20,21,22. Spatial discrepancy would be exhibited in groundwater chemistry due to the different climatic conditions, surrounding overburden or rock lithologies, residence time in aquifer23,24,25. This makes groundwater hydrogeochemical features variable and complex in nature.
In addition to natural factors, with the rapid development of human community in various aspects including urbanization, industrialization, agriculture, etc., there have been great external disturbances to groundwater chemistry in many countries 26,20. These disturbances could be direct substances input into aquifers, and also indirect influence on groundwater hydrogeochemical composition through changes to aqueous hydrogeochemical environments (e.g., change of reduction-oxidation (redox) potential)28,29,30. The anthropogenic disturbances vary in both degree and space for aquifers, resulting in much more complex groundwater chemistry feature and genesis when compared with the natural status31,32,33. Comprehensive knowledge of spatial distribution of groundwater chemistry driven by natural and anthropogenic factors could provide important support to rational utilization and scientific management of the invisible water resource beneath the ground surface34,35.
To reveal and distinguish the various factors governing the spatial distribution of groundwater chemistry, multivariate analysis approaches, such as factor analysis, principal component analysis, discriminant analysis, and cluster analysis, etc., were commonly employed to interpret the available data of groundwater geochemistry36,37,38. Although these conventional linear dimensionality reduction techniques can simplify the complex measured data without losing too much information and project them on a low-dimensional space39,40,41, inaccurate and even erroneous interpretations were usually made for groundwater chemistry36. This is attributed to the nonlinearity and highly dispersiveness of groundwater physicochemical data. Thus, more capable techniques should be considered for the interpretation of complex groundwater chemistry, especially that driven by both natural and anthropogenic factors simultaneously. Self-organizing map, a typical artificial neural network, is powerful and intelligent non-linear technique to transform complex high-dimensional dataset and visualize it into a low-dimensional space42,43,44. It has been widely applied into various aspects of earth science, e.g., meteorology, geomorphology, geo-disaster, hydrology and groundwater science, etc.44,45,46,47,48. Self-organizing map is robust and suitable for solving the potential nonlinearity, noisy and irregularity of groundwater hydrogeochemical data1,49,36.
A piedmont plain is one of the most important geomorphic units for human community and regarded as the ideal area for various human activities involving urbanization, industrialization, agricultural activity, as well as groundwater exploitation for field construction51,52,53,54. It is also one of the most interactive regions between nature and human society, thus, potentially with complex groundwater hydrogeochemical characteristics and mechanisms55,56. A piedmont plain of the North China Plain, a quintessential large sedimentary plain significantly affected by human activities, serves as an example for examining the spatial distribution and variations of groundwater chemistry in semi-arid and arid piedmonts with intense anthropogenic influence. The piedmont plain of interest in the present study is in semi-arid and arid regions and was of concern due to the variation of hydrogeochemical composition of groundwater.
The specific objectives of this study are to (1) delineate the spatial distribution of groundwater chemistry within a human-impacted piedmont regions, (2) identify the hydrogeochemical variations and evolutionary trends of groundwater in space, and (3) elucidate the critical natural and anthropogenic processes that shape groundwater chemistry, leveraging the Self-organizing map methodology in conjunction with hydrogeochemical simulations and graphical analysis. This study can enhance the understanding of groundwater chemistry in water-scarce piedmont plains, which have been significantly impacted by human activities. The findings could inform the sustainable development and conservation efforts for these vulnerable water resources, not only within the focal region but also in analogous areas globally.
Study area
The study region, defined by latitudes ranging from 38°02’26"N to 38°40’23"N and longitudes from 114°15’26"E to 115°15’35"E, spans an area of approximately 4,095 km2. This area is within the piedmont plains of the North China Plain and is bounded by the Taihang mountains in the west (Fig. 1). The terrain of the region gradually slopes downwards from the northwest to the southeast, with an average elevation of approximately 72 m. This area is traversed by four significant rivers including the Hutuo river, Ci river, Sha river, and Tang River, with the Hutuo River being the most substantial in terms of size. The study region is characterized by temperate continental arid climate and boasts an average annual temperature of 12.3℃. Annual precipitation in the region varies from 400 mm to 600 mm, predominantly falling during the wet season from June to September.
Location of (a) the North China Plain, (b) the study piedmont plain, and (c) the groundwater sampling sites. The map was created via ArcGIS 10.2 (https://www.esri.com/en-us/arcgis/products/arcgis-desktop/resources) based on the ASTER GDEMV2 data from Geospatial Data Cloud site, Computer Network Information Center, Chinese Academy of Sciences (http://www.gscloud.cn).
This region is underlain by Quaternary sediments, predominantly from the extensive alluvial fans of the Hutuo River, complemented by smaller alluvial fans from neighboring rivers. The Quaternary sediments gradually vary laterally from gravel pebbles to medium-coarse sands, and then to fine sands and clays along the southeastwards river flow direction57,58. As a result, it has better sediments permeability in the upper piedmont area and along the Hutuo River (Fig. 2a)59. The predominant aquifer system in this area is formed by these Quaternary sediments and reaches a thickness of hundreds of meters60. The Quaternary groundwater flows from west to east, and evolves from phreatic to semi-confined along the flow direction. The buried depth of groundwater level ranges from ~ 10 m to >50 m, resulting ignorable evaporation effects. The aquifers system is recharged by lateral flow-in, local precipitation, and river seepage, and discharges groundwater through lateral flow-out in various environments57. As a typical part of North China Plain, the study area has undergone significant anthropogenic disturbances to the groundwater recharge and discharge patterns61. A lot of Quaternary groundwater was extracted from the aquifers35. As a result, artificial extraction had become one of the most important groundwater discharge ways. Meanwhile, the infiltration of irrigation water in the widespread agricultural lands (Fig. 2b) has become one of the major processes of the Quaternary aquifer recharge. External contaminants, particularly those originating from human activities including daily life and agricultural practices, may infiltrate the aquifers alongside irrigation water, thereby posing a potential threat to groundwater quality.
(a) Infiltration coefficient map of phreatic aquifer, and (b) the land use within the study area along with the groundwater sampling sites in various SOM clusters. The map was created via ArcGIS 10.2 (https://www.esri.com/en-us/arcgis/products/arcgis-desktop/resources) based on (a) the data from field hydrogeological survey in present research and (b) the GlobeLand30 data from National Catalogue Service For Geographic Information (https://www.webmap.cn/commres.do? method=globeIndex).
Materials and methods
Sampling strategies and analytical methods
A total of 92 groundwater samples were collected from phreatic boreholes and wells, with depth ranging from 20 m to 100 m, across the piedmont plain during the July to August 2022 (Fig. 1). Prior to sampling, stagnant water from boreholes and wells was pumped out to ensure the representativeness of the collected samples. Groundwater parameters pH and electrical conductivity (EC) were measured in-situ with the aid of a portable multiparameter device (Multi 350i/SET, Munich, Germany), and sample collection was initiated once these two parameters had stabilized during continuous in-situ monitoring. Sample polyethylene bottles were pre-rinsed with a 10% nitric acid solution, followed by ultrapure water, and finally with the target groundwater itself, with each of these steps repeated at least three times to ensure thorough cleanliness. Immediately following collection, water samples were stored at a temperature of 4 °C in portable coolers and promptly transported to the laboratory for hydrogeochemical analysis. All water sample collection and storage were conducted in accordance with the Technical Regulations for Groundwater Sampling62 in China.
Physicochemical analyses of groundwater were conducted in the Key Laboratory of Groundwater Science and Engineering, Ministry of Natural Resources, People’s Republic of China. Major cations Ca2+, Mg2+, Na+, and K+ were determined by ICP-MS (Inductively Coupled Plasma Mass Spectrometry, Agilent 7500ce, Tokyo, Japan). Concentrations of NH4+, SO42−, Cl−, NO3−, and NO2− were measured by ion chromatography (Shimadzu LC-10ADVP, Kyoto, Japan). Total dissolved solids (TDS) was determined using the gravimetric method, while HCO3− concentrations were measured by acid-base titration. The ion balance charge error (ICBE) was calculated to assess the precision and accuracy of the laboratory measurement63. All the ICBE were within ± 5%, indicating credible and reliable analysis results.
Self-organizing map (SOM)
Self-organizing map (SOM) is a mathematical algorithm for unsupervised self-organizing learning38. This method can effectively realize the visualization of high-dimensional data by projecting it on a two-dimensional space1,64,65. Consequently, SOM has been widely used for the explanation of many complex data like hydrogeochemical data40,45. The details of SOM algorithm has been described in detail by Clark, et al.44. Its procedure can be summarized into three steps: (1) Input and pre-process data; (2) Create a SOM network grid and determine the grid size and shape; and (3) Run the SOM training algorithm. In the present study, the MATLAB software (version R2021b) was used to implement the SOM algorithm described above. The output neurons number of the SOM model is determined by heuristic rules \(\sqrt {5N}\), in which N is the input samples number38,66. The K-means algorithm was used to classify the samples, and determine the optimal clustering number according to the Davies-Bouldin index (DBI) 8. The DBI can be obtained by the following equation:
Where, Ci+Cj represents the sum of the average distances from each point to their respective cluster centroid; d(Ci, Cj) denotes the distance between the centroids of cluster i and cluster j.
Statistical analysis, hydrogeochemical diagrams and simulation
Multivariate statistical analysis, such as descriptive and Spearman correlation analysis67, were conducted using SPSS software (version 22.0) to explore the statistical characteristics of groundwater chemistry and the relationships between physicochemical parameters. Spearman correlation analysis was conducted to statistically corroborate the nonlinear inter-variable associations visualized in the SOM-derived component planes, thereby providing quantitative validation of the topological relationships identified through the SOM’s similarity-driven clustering algorithm68. Hydrogeochemical diagrams, including Piper, Gibbs, and bivariate diagrams38,56,69, were generated using Origin software (version 2021) to elucidate the hydrochemical facies and hydrogeochemical processes of groundwater. Hydrogeochemical simulations were conducted using PHREEQC software (version 3.5.0) to determine the saturation indices (SI) of specific minerals in groundwater. The SI values indicate the mineral equilibrium status: oversaturation if SI>0, equilibrium if SI = 0, undersaturation if SI<070. In addition, ArcGIS software (version 10.2) was used to map the geospatial distribution of groundwater chemistry.
Entropy-weighted water quality index
Water quality evaluation can be effectively performed using the entropy-weighted water quality index (EWQI)71. In contrast to traditional methods such as the one using the Water Quality Index (WQI), the EWQI approach incorporates information on entropy to mitigate the influence of subjective human factors on the weighting of hydrogeochemical parameters during the assessment55,55. The EWQI assessment procedure is as follows:
First, construct the eigenvalue matrix X as follows.
Where m and n are the number of water samples and physicochemical parameters, respectively.
Second, the eigenvalue matrix X is transformed into a normalized eigenvalue matrix Y. The normalization process employs Eq. (4) presented below.
Where (xij)max and (xij)min are the maximum and minimum values of a single index in the X matrix, respectively.
Third, the information entropy ej is computed using the following Eq.
Where Pij represents the ratio of the parameter value for hydrogeochemical parameter j in groundwater sample i.
Fourth, calculate the EWQI value using the following Eq.
Where wj is the entropy weight of parameter j, qj denotes the quality rating scale of the jth physicochemical index, Cj is the jth physicochemical index measured value, Sj is the drinking water standard value of the jth physicochemical index given by Chinese guideline73 or World Health Organization guideline74.
The EWQI assessment algorithm was implemented using the Excel package in Microsoft Office software (version 365), following the method described above. Groundwater overall quality can be assessed based on the calculated values of the EWQI. Groundwater quality is categorized into five distinct ranks based on EWQI values: Rank 1 for ‘excellent quality’ (EWQI ≤ 50), Rank 2 for ‘good quality’ (50 < EWQI ≤ 100), Rank 3 for ‘medium quality’ (100 < EWQI ≤ 150), Rank 4 for ‘poor quality’ (150 < EWQI ≤ 200), and Rank 5 for ‘extremely poor quality’ (EWQI > 200)59.
Results and discussion
Physicochemical properties of groundwater
The physicochemical parameters of the groundwater samples from the study piedmont plain are presented statistically in Table 1. For comparative purposes, this research also introduced the desirable limits for drinking water quality as recommended by the Chinese guidelines73. If there was no recommendation from the Chinese guidelines73, the World Health Organization guideline74 was used in the comparison.
In the study piedmont plain, phreatic groundwater exhibited a pH range from 6.98 to 8.68, with an average pH of 7.64, indicating a characteristic that is nearly neutral to slightly alkaline. An overwhelming majority of the groundwater samples (98.91%) displayed pH values that fall within the acceptable range of 6.5 to 8.5, as stipulated by the Chinese guidelines73. Only one sample (1.09%) was observed with the pH of 8.68 and slightly beyond the upper limit of 8.5. The EC values of the collected groundwater samples ranged from 291.99 µS/cm to 1,872.00 µS/cm, suggesting significant variability in groundwater salinity. The majority of the groundwater samples (96.74%) exhibited EC values below 1500 µS/cm, whereas only four samples surpassed this threshold. Generally, water salinity is categorized into three classes based on EC value, i.e., fresh water with EC values below 1,500 µS/cm, brackish water with EC values ranging from 1,500 µS/cm to 3,000 µS/cm, and saline water with EC values exceeding 3,000 µS/cm56. Thus, groundwater in the study piedmont plain was dominantly fresh water with a few of brackish water type. Analysis results indicated that all groundwater samples from the study piedmont plain had TDS values ranging from 189.90 mg/L to 844.80 mg/L, which was within the recommended drinking water limit of maximum 1,000 mg/L73. This suggested groundwater was suitable for drinking purpose in terms of water salinity. Groundwater samples exhibited a range of total hardness (TH) values from 119.10 mg/L to 623.00 mg/L, averaging at 318.86 mg/L. The majority of phreatic groundwater samples possessed acceptable levels of hardness, with only about 9.78% slightly exceeding the recommended drinking water limit of 450 mg/L as specified by the Chinese guidelines73.
Based on their average concentrations, the major cations in groundwater samples are ranked as follows: Ca2+ is the most prevalent and it is successively followed by Mg2+, Na+, and K+. The concentration of Ca2+ in the groundwater samples exhibited a range from 29.38 mg/L to 189.80 mg/L, with an average value of 87.95 mg/L. Mg2+ and Na+ also showed variability in their concentrations, with Mg2+ ranging from 6.22 mg/L to 50.96 mg/L and Na+ ranging from 9.53 mg/L to 144.40 mg/L. The mean concentrations for these ions were 24.10 mg/L for Mg2+ and 20.80 mg/L for Na+. In comparison to the other major cations, K+ was found in relatively lower concentrations, with a range from 0.21 mg/L to 22.42 mg/L and an average concentration of 1.98 mg/L. The average concentrations of the major anions in the samples were ranked in the following order: HCO3− was the most abundant, followed by SO42−, with Cl− being the least prevalent. The concentration of HCO3− varied within the range of 87.97 mg/L to 488.10 mg/L, with an average value of 255.33 mg/L. The maximum value of Cl− and SO42− was 159.00 mg/L and 227.90 mg/L, respectively. Both Cl− and SO42− were within the drinking water desirable limits recommended by Chinese guideline (Table 1). Overall, phreatic groundwater in the study piedmont plain was characterized by a dominance of Ca2+ among cations and HCO3− among anions.
Nitrogen is one of main potential threats to groundwater quality in many regions worldwide including the study area75. Notably, NO3− concentration in the sampled groundwater ranged from 2.55 mg/L to 171.4 mg/L. Most of samples are with the NO3− concentration greater than the natural background level (8.7 mg/L) in the mountainous area 76. Approximately 43.48% of the groundwater samples exceeded the WHO drinking water limit for NO3−, which is set at 50 mg/L74. The concentrations of NO2− and NH4+ were measured within the ranges of 0.001 mg/L to 0.16 mg/L and 0.02 mg/L to 0.28 mg/L, respectively. Approximately 14.13% of the collected groundwater samples exceeded the desirable limit for NO2−, which is set at 0.02 mg/L. In addition, 2.17% of the samples surpassed the recommended threshold of 0.2 mg/L for NH4+. Therefore, nitrogen contamination is a significant issue within the aquifers of the piedmont plain, and existed in various forms including NO3−, NO2−, and NH4+. This contamination warrants attention due to its potential adverse impacts on groundwater quality.
Self-organizing map (SOM) and clustering analysis
The self-organization map clustering
Twelve hydrogeochemical parameters including TDS, Total hardness (TH), Ca2+, Mg2+, Na+, K+, NH4+, HCO3−, SO42−, Cl−, NO2−, and NO3− were used as the input variables for the SOM model in the present research. According to heuristic rules, 48 hexagons (i.e., neurons) were obtained in the component planes by SOM (N = 92 groundwater samples) (Fig. 3). The SOM matrices signify the projection value of hydrogeochemical variables on the two-dimensional plane, which is reflected by the gradient of colour. The neurons in dark colours represent high values (black being for the highest values), and those in light colours signify low values (yellow being for the lowest value). The SOM matrices with colour gradient are significant for illustrating the clustering characteristics of hydrogeochemical variables and water samples38. Each hydrogeochemical variable was illustrated by one SOM matrix map, where the colour gradient could visually and clearly reveal the relationship among the hydrogeochemical variables50. Similar colour gradients indicate positive correlations. The stronger the correlation, the more similarity the colour gradients are77.
As illustrated in Fig. 3, fairly similar colour gradients were observed among the hydrogeochemical variables of TH, TDS, Ca2+, Mg2+, Cl−, SO42−, HCO3− and NO3−, implying positive correlation among them. Between some of these variables, some differences in the colour gradient are noted, suggesting some degree of discrepancy. TH, TDS, Ca2+, Mg2+ and HCO3− showed higher similarities in colour gradient, indicating strong positive correlation. High similarities were also observed among the colour gradient of NO3−, Cl−, SO42− and TDS, suggesting relatively strong positive correlation among them. K+ and Na+ did not have high similarities in the colour gradient with other hydrogeochemical variables, implying relatively weak correlation. NO2− and NO3− were characterized with similar pattern of colour gradient, although the similarity was not of high degree. Same relation was observed between the colour gradients of NO2− and NH4+. To some extent, these observations suggested the potential connection through chemical processes between NO2− and NO3−, and NO2− and NH4+, respectively. Spearman correlation test was employed for all the hydrogeochemical variables with the aid of the SPSS software. The results (Fig. 4) were generally consistent with the aforementioned SOM calculation, confirming the accuracy and reliability of the established SOM models in the present research.
The water samples were also clustered to further get insights into groundwater hydrogeochemical features and formation based on the SOM map. The lowest DBI value corresponds to the optimal clustering number78. As illustrated in Fig. 5, there could be four optimal clusters for the present study. It can be seen in Fig. 3 that 1 (1.09%) groundwater samples were classified in Cluster I, 33 samples (35.87%) were classified in Cluster II, 11 samples (11.96%) were classified in Cluster III, and 47 (51.09%) were classified in Cluster IV.
Hydrogeochemical features and Spatial distribution of SOM clusters
The mean values of various physicochemical parameters for each cluster and all samples calculated by SOM are presented in Table 2. NO3− is an effective indicator of human-induced pollution of the groundwater environment18. Cl− is a conservative ion in aquifers and can be used to trace external inputting of human-derived substances in to groundwater environment 63. It can be seen from Table 2, groundwaters in Cluster I were featured by the highest concentrations of NO3− and Cl− among the four SOM clusters, and followed successively by those in Clusters III, II, and IV.
To get insights into the overall groundwater hydrogeochemical characteristics and its evolution among the SOM clusters, the Piper diagram79 was used to illustrate their hydrogeochemical facies. As illustrated in Fig. 6a, the predominant hydrogeochemical facies of the sampled groundwaters from the study piedmont plain was characterized by HCO3-Ca, accounting for 92.39% of the samples. In contrast, only a minority of the groundwaters exhibited hydrogeochemical characteristics associated with the Cl-Mg·Ca type, representing 6.52%, and the Cl-Na type, which constituted 1.09%. When considering each SOM cluster separately, the hydrogeochemical facies are HCO3-Ca for Cluster IV, Cl-Na for Cluster I, and both partly HCO3-Ca and partly Mixed Cl-Mg·Ca for Cluster II and Cluster III (Fig. 6a). The hydrogeochemical facies of groundwater depict a gradual evolving trend along the Cluster order of IV, II, III and I from fresh hydrogeochemical facies of HCO3-Ca to relatively saline hydrogeochemical facies of Cl-Mg·Ca and Cl-Na type. This is consistent with the aforementioned evolving trend of NO3− content among various clusters. In other words, the salinity levels of the groundwater in Clusters IV, II, III, and I exhibited a progressive shift from the fresh hydrogeochemical facies characterized by HCO3-Ca to more saline types, specifically the Cl-Mg·Ca and Cl-Na types, correlating with an increase in NO3− content.
(a) Piper Diagram and (b) spatial distribution of nitrate content in groundwater across various clusters identified through Self-organizing map analysis. The map was created via ArcGIS 10.2 (https://www.esri.com/en-us/arcgis/products/arcgis-desktop/resources).
The distribution of SOM clusters along with groundwater NO3− content was spatially illustrated in Fig. 6b. Groundwater of Cluster I (P2), which featured with the highest NO3− and saltiest hydrogeochemical facies, was found located in the upstream area of the plain and close to the mountains where the highest permeability is found in the vadose zone and the aquifers 57,59. Cluster III groundwaters, characterized by the second-highest nitrate concentrations (NO3−) and secondary high-salinity hydrogeochemical facies, exhibited a distinct spatial distribution pattern. These samples were predominantly located in the upstream area of the study area closing to the mountains (P3, P22 and P23), and along the upstream section of Hutuo River (P60, P92, P91, P87, P88), as well as at some scattered sites adjacent to Tang River (P38), Ci River (P65) and Hutuo River (P82) in the middle area. Groundwaters of Cluster II, which were characterized by the third highest NO3− and third saltiest hydrogeochemical facies, were situated in the middle-upper stream area of the plain, especially far away from the rivers. Whereas for Cluster IV which was featured by the lowest NO3− content and freshest hydrogeochemical facies, groundwaters were found distributed mainly in the lower elevation area of the plain, also some along the Tang River, and some sporadically in the middle-upper area. In summary, the SOM clusters depicted a spatial distribution trend in the successive order of Clusters I, III, II, and IV, and extending from the upper areas adjacent to the mountains towards the downstream regions, along with the NO3− content gradually decreasing and hydrogeochemical facies being fresher.
Hydrogeochemical regimes of groundwater
Natural processes governing groundwater chemistry
The chemical composition of groundwater is fundamentally governed by a suite of natural factors and mechanisms. The natural factors include the lithology of aquifer matrices, the characteristics of recharge waters, the duration of water residence within the aquifer, prevailing climate conditions, the extent of evaporation, etc59. There are various natural mechanisms, and the present research explored those relevant to understand the spatial distribution of groundwater chemistry in the study piedmont plain.
In general, the natural mechanisms that govern the chemistry of groundwater can be categorized into three primary processes, namely precipitation, evaporation, and rock-water interaction14,16. These three primary natural mechanisms can be effectively elucidated through the analysis of the relationship between TDS and the ratio of Na+/((Na++Ca2+), and between TDS and the ratio of Cl−/(Cl−+HCO3−)80. As depicted in Fig. 7, the data points for all SOM clusters were predominantly located within the rock dominance field of the Gibbs diagrams. This distribution suggested that the chemical composition of sampled groundwater in the study piedmont plain was primarily influenced by interactions between the aquifer matrix and groundwater, rather than by precipitation and evaporation processes. Groundwater usually flows very slowly in porous aquifers when compared to river water, even in the piedmonts like the study area81. As a result, the residence time of groundwater within the aquifer system would be relatively long, allowing ample time for the water to undergo extensive chemical reactions with the surrounding geological media. Furthermore, the study area is situated in the piedmont region of the Taihang Mountains, where the depth of groundwater table is relatively substantial. Consequently, the impact of evaporation on groundwater in this region is considered to be negligible82.
To further identify the specific minerals involved in the interactions between the aquifer matrix and groundwater, this study employed the analysis of the ratios Ca2+/Na+ versus HCO3−/Na+ and Mg2+/Na+83. As depicted in Fig. 8, the groundwater samples from the study piedmont plain were situated along a transition from the dominance of silicates to that of carbonates. This distribution suggested that the dissolution of carbonates and the weathering of silicates were the fundamental natural processes contributing to the chemical composition of groundwater solutes.
The ratios of major ions originating from specific mineral dissolutions are fixed, so the relationship between major ions can reflect the underlying hydrogeochemical processes 84. The ratio of Cl−/Na+ usually presents a value approximating 1:1 if these two ions naturally originate from halite dissolution85. As shown in Fig. 9a, the majority of groundwater samples from all clusters aligned closely with the 1:1 line for the Cl− to Na+ ratio. Additionally, a strong positive correlation was quantitatively confirmed with a correlation coefficient (R2) of 0.67, which is significant at the p < 0.01 level (Fig. 4). Thus, halite dissolution is a potential contribution for Na+ and Cl− for all clusters. Saturation indexes (SI) were further calculated to examine the potential contribution of minerals. As shown for halite in Fig. 10a, SI values were below 0 for groundwaters in all clusters; these values suggest that halite, if existing in the surrounding media, can contribute Na+ and Cl− to groundwater. While, as demonstrated in Fig. 8, the predominant minerals engaged in interactions between the aquifer matrix and groundwater in the study region were carbonates and silicates, rather than evaporites. This is attributable to the fact that the current area is situated in the piedmont region of the North China Plain where the aquifer lithology is predominantly characterized by medium-coarse sand and sand gravel59,86. Evaporates are very limited in the aquifer media and could hardly provide substantial halite mineral sources for groundwater. Consequently, the dissolution of halite was not be the dominant natural process influencing the chemical composition of groundwater within the study area.
The majority of sampled groundwaters across various SOM clusters were also observed to align along the 1:1 line of the ratio (HCO3−+SO42−) to (Ca2++Mg2+) as shown in Fig. 9b. This alignment suggested that the dissolution of carbonate and sulfate minerals could be significant sources of major ions in the groundwater of the study piedmont plain16. For carbonates, strong positive correlations were observed among the ions Ca2+, Mg2+, and HCO3−, as indicated by the SOM component planes (Fig. 3) and further supported by the Spearman correlation coefficients (Fig. 4). These findings suggest that the dissolution of carbonate minerals may be a predominant source of hydrogeochemical solutes in groundwater of the study area.
While, as shown in Fig. 9c, groundwaters except a small portion in Cluster IV exhibited a deviation from the 1:1 line of the HCO3− to (Ca2++Mg2+) and presented excessive Ca2++Mg2+ compared to HCO3−. This implied that there were some other processes contributing Ca2+ and Mg2+ to groundwaters in most sites besides carbonates dissolution. The saturation status simulation of carbonate minerals showed that only small portions of sampled groundwaters in Cluster IV and Cluster II were with the SI below 0 for aragonite, calcite and dolomite minerals (Fig. 10b, c,d). Thus, the contribution of carbonates dissolution to groundwater solutes was only dominant for some groundwater samples in Cluster IV and Cluster II, but not for the other groundwater samples, especially those in Cluster III and Cluster I.
For the sulfates, both the SOM component planes (Fig. 3) and the Spearman correlation test (Fig. 4) presented their existing strong positive correlation between Ca2+ and SO42−, implying the high possibility that sulfates dissolution contributed to the natural hydrogeochemical composition of groundwater. In addition, groundwaters in all SOM clusters had SI values of anhydrite and gypsum below 0 (Fig. 10e, f), evidencing the possible contribution of sulfates dissolution. However, excessive Ca2+ was observed when compared with the SO42− for almost all groundwaters, indicating that there were other Ca2+ sources or sulfates dissolution was not the dominant contribution (Fig. 9d). As discussed before, evaporates (including sulfates) were not the dominant minerals in the aquifer media. Thus, although sulfates dissolution can contribute Ca2+ and SO42− to groundwater in some degree, it was not the main source. This was evidenced by the end-member diagrams (Fig. 8).
Groundwater can also obtain some solutes from surrounding media through ion exchange processes. The relationship between (Ca2++Mg2+-HCO3−-SO42−) and (Na++K+-Cl−) was utilized to elucidate potential ion exchange processes occurring in the piedmont plain. As shown in Fig. 11, the majority of the groundwater samples in the study area were found to align along the line Y=-1.33X + 0.89 (R2 = 0.58), suggesting the existence of ion exchanges. Furthermore, it is observed that only a minority of groundwaters in Cluster IV were located in the lower right dominance, suggesting the occurrence of cation exchange where the aqueous Ca2+ or Mg2+ displaced the Na+ on aquifer media (Eq. 10). While, most groundwaters in Cluster II, Cluster III, and Cluster IV were situated in the upper left dominance, suggesting the occurrence of reverse cation-exchange (Eq. 11). The groundwater (P2) in Cluster I was significantly deviated from the the line Y=-1.33X + 0.89 (R2 = 0.58), demonstrating a limited contribution of ion exchange. This latter observation is due to the fact that aquifers in the area close to the mountains are usually composed of geological material with coarse lithology and deficient of fine sediments 86 for ion exchange reactions.
Anthropogenic factors influencing hydrogeochemical composition
The aforementioned natural processes have shaped the fundamental framework of groundwater chemistry within the study piedmont plain. However, anthropogenic factors often introduce disturbances to, and can even significantly alter, the natural chemical composition of groundwater. This impact is particularly notable in regions with dense populations and high levels of human activity.
The anthropogenic input of chemical solutes is the most direct and severe way in which human communities disturb the chemistry framework of groundwater. Among the externally introduced solutes, nitrate stands out as the most prevalent and contaminant of concern affecting both the surface and subsurface hydrosphere. While the natural background concentration of NO3− in groundwater is generally considered to be 10 mg/L56,87, groundwater with NO3− concentration exceeding this threshold is typically deemed to have been affected by the introduction of external nitrate sources14. Thus, nitrate is widely used as the indicator of man-induced external solutes to both surface water and groundwater. As shown in Fig. 12a, the majority of groundwater samples in Cluster IV, along with all samples from Clusters I, II, and III exhibited NO3− concentrations surpassing the natural groundwater threshold of 10 mg/L56. This suggested that the hydrogeochemical composition of groundwater at most sampling sites had been influenced by the introduction of anthropogenic nitrate contaminants. To identify the specific sources of nitrate, an analysis of the relationship between the ratios of Cl−/ Na+ and NO3−/Na+88 was done. As illustrated in Fig. 12b, with the exception of a portion of the groundwater in Cluster IV, the remaining samples were plotted in the upper right dominance of the diagram, showing a proximity to the influence of agricultural activities. This suggested that the anthropogenic nitrate contamination observed in these samples likely originated from agricultural practices.
As previously mentioned, the hydrogeochemical facies of the groundwater exhibited a progressive evolution from the fresh facies of HCO3-Ca to the relatively salty facies of Cl-Mg-Ca and Cl-Na, correlating with an increase in NO3− content (Fig. 6). Furthermore, positive correlations were identified between NO3− and TDS, Ca2+, Mg2+, HCO3−, SO42−, Cl− by the SOM component planes (Fig. 3) and the Spearman correlation test (Fig. 4). Consequently, the major inorganic salts were also brought into groundwater along with nitrate contaminants from agricultural activities.
Quantitatively, the groundwater salinity represented by TDS was shown by a fairly similar increasing trend with the nitrate content of various SOM clusters (in the cluster order of IV, II, III and I) (Table 2), indicating that more intensive agricultural practices have led to a greater influx of chemical substances into the aquifers, along with the nitrate contaminant. Chloride, a typical conservative ion in aqueous environments, exhibited a consistent upwards trend in correlation with the increase in nitrate content, progressing from Clusters IV to II, then III, and finally I (Table 2). Generally, the chloride content in aqueous environment is controlled by chloride minerals (e.g., halite) dissolution and anthropogenic input, and not significantly influenced by any other hydrogeochemical processes (such as precipitation, ion exchanges70. As previously discussed, halite dissolution does not constitute the primary hydrogeochemical process contributing Cl− to groundwater in the study area. Therefore, the significant increase of Cl− content in space was mainly caused by the anthropogenic input. This further indicated that groundwater salinity increase and hydrogeochemical facies evolution were fully attributed to chemical substances emanating from agricultural activities.
It is noted that groundwaters with higher salinity were predominantly located in the upper stream area, and those with lower salinity were mainly situated in the lower stream area of the study piedmont plain. This pattern is the inverse of the natural spatial evolution of groundwater salinity. As discussed before, this is caused by agricultural chemicals. Spatially, the degree of agricultural practice influence was strong in the upstream area close to the mountains, then the areas along Hutuo River in the upstream section, and weaker in the middle-upper stream area far away from rivers, and the weakest in the lower stream area. It can be seen that agricultural lands were widely and evenly distributed in the study area (Fig. 2). Thus, the external chemical loads from agricultural practices to the groundwater environment were with no significant discrepancy in space. The difference of external chemical substances input was attributed to the sediment permeability of vadose zone and aquifers in space89. The upstream area close to the mountains (the sampling sites of Cluster I and most sites of Cluster III) has the best permeability of sediments in both vadose and saturated zone (Fig. 2a). As a result, the external chemical substances including major inorganic salts, nitrate and others were very easy to enter the aquifers. Hutuo River is a relatively large river in the study region, thus, the sediments along its channels in the upper stream area also has relatively good permeability and are beneficial for the infiltration of external substances downwards. In contrast, the areas far from the rivers and those in the lower stream area are with relatively fine sediments and poor permeability, leading to relatively hard infiltration of external substances from the ground surface downwards. All the above observations make the effects of agricultural practices on groundwater chemistry strong for the sampling sites of Cluster I and III, but weaker for the sites of Cluster II and weakest for that of Cluster IV.
Groundwater quality and its implication
Groundwater quality in various SOM clusters was assessed using the EWQI approach19,90 to comprehensively reveal the hydrogeochemical quality status of groundwater under the regulation of both natural and anthropogenic factors. All the physicochemical parameters listed in Table 1 were considered in the EWQI assessment, and the results are shown in Fig. 13a.
The results demonstrated that the predominant groundwater samples had the EWQI value below 100 and belonged to the Rank 1 and Rank 2 categories (Fig. 13a). Approximately 6.52% and 1.09% of groundwater samples were observed with the EWQI varying from 100 to 150 (Rank 3) and between 150 and 200 (Rank 4), respectively. Generally, water with the EWQI value below 100 is considered suitable for direct potable use. Water with an EWQI value between 100 and 150 may be used for domestic purposes, but is not recommended for direct consumption. Water classified as poor (Rank 4) or extremely poor (Rank 5) should be avoided for any direct use. Thus, groundwater in most sampling sites can be exploited for domestic purposes (98.91%) and even for direct potable usage (92.39%).
Scatter plot of (a) TDS versus EWQI, and (b) the geospatial distribution of groundwater quality based on EWQI. The (b) was created via ArcGIS 10.2 (https://www.esri.com/en-us/arcgis/products/arcgis-desktop/resources).
The quality of groundwater in various clusters presented a gradual deterioration trend from Cluster IV to Cluster II, and then to Cluster III and Cluster I. Specifically, groundwaters in Cluster IV had the best water quality with 89.36% of excellent quality (Rank 1) and 10.64% of good quality (Rank 2). Groundwaters in Cluster II were also observed having relatively good water quality although with higher EWQI than that in Cluster IV. With the exception of one sample (P69), which had an EWQI value exceeding 100, all other samples of Cluster II exhibited EWQI values below 100. This indicates an exceptionally high quality of groundwater, rendering it suitable for direct consumption. Groundwaters in Cluster III had a relatively wide range for EWQI value (62.52-156.68), implying good to poor water quality. Among them, 36.36% (P22, P23, P65, and P38) and 9.09% (P82) belonged to the medium (Rank 3) and poor (Rank 4) quality category, respectively. Groundwater (P2) of Cluster I had the EWQI value of 127.55 and categorized into the medium groundwater quality classification (Rank 3).
Spatially, the majority of the study piedmont plain was distributed in zones of excellent (Rank 1) to good (Rank 2) groundwater quality based on the EWQI assessment (Fig. 13b), and can meet the requirements of groundwater for safe drinking. Groundwater was also shown to be of better overall hydrogeochemical quality in the lower stream areas than in the upper ones in the study piedmont plain. The medium and poor quality groundwaters were mainly located in the upper stream areas close to the mountains (P2, P22, and P23) and sporadic sites (P38, P65, P69, and P82) of the middle-lower parts of the study area.
Overall, groundwater sampled in the study piedmont plain had a relatively good quality in the majority of the plain and was considered safe for daily domestic water supply. While, particular attention should be paid to the upper stream regions and the sporadic sites within the middle-lower areas when groundwater there serves as a water supply source. Although groundwater quality in most parts of the piedmont plain was good and safe for usage by human community, the deterioration of groundwater quality was significant in terms of salinity and nitrate content. Special attention should be paid to the agricultural practices in this area to address their threats to groundwater quality. Otherwise, groundwater quality may be altered in an unexpected speed and degree considering the vulnerability of aquifers system in the piedmont plain against contaminants originating from anthropogenic activities. In addition, microbial bacteria are not considered in the present research, but they usually accompany human-derived contaminants. Thus, microbial parameters are recommended to be involved in the monitoring of the groundwater environment in regions like the present agricultural piedmont plain.
Conclusions
The present research employed self-organizing map (SOM) coupled for example with hydrogeochemical simulation and diagrams, as well as entropy-weighted water quality index approaches, to reveal the spatial distribution of groundwater chemistry in a typical piedmont plain of northern China that has been influenced by anthropogenic and natural forces simultaneously. The major findings are as follows:
-
(1)
Groundwater in the piedmont plain is predominantly of fresh and slightly alkaline nature. Nitrogen contaminants of NO3-, NO2-, and NH4+ were characterized with concentrations exceeding the selected drinking limits for groundwater in some sites. Four hydrogeochemical clusters were deciphered by the SOM approach, and with 1.09%, 35.87%, 11.96% and 51.09% of the sampled groundwaters from Clusters I, II, III and IV, respectively.
-
(2)
Hydrogeochemical facies of groundwater presented a gradual evolution from fresh facies of HCO3-Ca to relatively salty facies of Cl-Mg·Ca and Cl-Na in the order of Cluster IV, II, III and I. NO3- also follows the same order of clusters for its content increase in groundwater. Groundwaters with relatively high salinity and NO3- (Clusters I and III) were dominantly distributed in the upper stream area close to the mountains and along the upstream section of Hutuo River, and that with relatively low salinity and NO3- were mainly located in sites of the lower stream area, or the middle-upper stream areas far away from rivers.
-
(3)
The chemical composition of groundwater in the study piedmont plain was predominantly governed by interactions between the aquifer matrix and groundwater, specifically the weathering of silicates and reverse cation-exchange reaction. Dissolution of carbonates and cation exchange reactions also significantly contributed to the chemical composition of groundwater in the lower stream areas. Agricultural practices have extensively driven the input of external substances into the aquifers, which in turn has led to a widespread increase in both groundwater salinity and nitrate content. The spatial permeability discrepancy of sedimentary lithology resulted in the great influences in the upper stream area rather than in the lower parts of the piedmont plain.
-
(4)
Groundwater had excellent (EWQI ≤ 50) to good (50 < EWQI ≤ 100) quality in most of the study area (92.39% of the sampling sites), and can be used for direct drinking purpose based on EWQI assessment. The relatively poor-quality groundwaters were predominantly associated with Clusters III and I, and were mainly located in the upper stream areas adjacent to the mountains, as well as in sporadic sites within the middle to lower stream areas. Special attention should be paid to the significant deterioration in groundwater quality, particularly in terms of salinity and nitrate levels, which were attributed to agricultural contaminants in the study piedmont plain.
Data availability
The datasets used and/or analyzed during the current study can be found in the Supplementary Material.
References
Lee, C. M. et al. Characterizing land use effect on shallow groundwater contamination by using self-organizing map and buffer zone. Sci. Total Environ. 800, 149632. https://doi.org/10.1016/j.scitotenv.2021.149632 (2021).
Nowicki, S. et al. Water chemistry poses health risks as reliance on groundwater increases: A systematic review of hydrogeochemistry research from Ethiopia and Kenya. Sci. Total Environ. 904, 166929. https://doi.org/10.1016/j.scitotenv.2023.166929 (2023).
Yuan, Y. et al. Numerical simulation of groundwater flow and evaluation of groundwater resources emergency exploitation capacity in Yinsheng Gully of Huangshui River Watershed. J. Salt Lake Res. 33, 48–56 (2025).
Wang, X. et al. Risk assessments of heavy metals to children following non-dietary exposures and sugarcane consumption in a rural area in Southern China. Exposure and Health, 12, 1-8. https://doi.org/10.1007/s12403-018-0275-0 (2018)
Banadkooki, F. B., Xiao, Y., Malekinezhad, H. & Hosseini, M. M. Optimal allocation of regional water resources in an arid basin: insights from integrated water resources management. AQUA - Water Infrastructure, Ecosyst. Soc. 71, 910–925, (2022).
Xiao, Y. et al. Interaction regimes of surface water and groundwater in a hyper-arid endorheic watershed on Tibetan plateau: insights from multi-proxy data. J. Hydrol. 644, 132020. https://doi.org/10.1016/j.jhydrol.2024.132020 (2024).
Zhong, C., Wang, H. & Yang, Q. Hydrochemical interpretation of groundwater in Yinchuan basin using self-organizing maps and hierarchical clustering. Chemosphere 309, 136787. https://doi.org/10.1016/j.chemosphere.2022.136787 (2022).
Li, J., Shi, Z., Wang, G. & Liu, F. Evaluating spatiotemporal variations of groundwater quality in northeast Beijing by self-organizing Map. Water 12(5), 1382. https://doi.org/10.3390/w12051382 (2020).
Dhakate, R., More, S., Duvva, L. K. & Enjamuri, S. Groundwater chemistry and health hazard risk valuation of fluoride and nitrate enhanced groundwater from a semi-urban region of South India. Environ. Sci. Pollut. Res. 30, 43554–43572 (2023).
Wang, S. et al. Geochemical Characteristics and Quality Appraisal of Groundwater From Huatugou of the Qaidam Basin on the Tibetan Plateau. Front. Earth Sci. 10, 874881. https://doi.org/10.3389/feart.2022.874881 (2022).
Zhou, B., Wang, H. & Zhang, Q. Assessment of the evolution of groundwater chemistry and its controlling factors in the Huangshui river basin of northwestern china, using hydrochemistry and multivariate statistical techniques. Int. J. Environ. Res. and Public Health 18 (2021).
Aju, C. D., Reghunath, R., Achu, A. L. & Rajaneesh, A. Understanding the hydrogeochemical processes and physical parameters controlling the groundwater chemistry of a tropical river basin, South India. Environ. Sci. Pollut. Res. 29, 23561–23577, (2022).
Nguyen, T. T. et al. Clustering spatio–seasonal hydrogeochemical data using self-organizing maps for groundwater quality assessment in the Red River Delta, Vietnam. J. Hydrol. 522, 661–673, (2015).
Yang, H. et al. Geochemical characteristics, mechanisms and suitability for sustainable municipal and agricultural water supply of confined groundwater in central North China plain. Urban Clim. 49, 101459. https://doi.org/10.1016/j.uclim.2023.101459 (2023).
Zhao, W. et al. Characteristics of groundwater in Northeast Qinghai-Tibet plateau and its response to climate change and human activities: A case study of Delingha, Qaidam basin. China Geol. 4, 377–388. https://doi.org/10.31035/cg2021053 (2021).
Hao, Q. et al. Hydrogeochemical fingerprint, driving forces and spatial availability of groundwater in a coastal plain, Southeast China. Urban Climate 51, 101611. https://doi.org/10.1016/j.uclim.2023.101611 (2023).
Sarkar, A. & Shekhar, S. The controls on spatial and temporal variation of hydrochemical facies and major ion chemistry in groundwater of South West District, Delhi, India. Environ. Earth Sci. 74, 7783–7791, (2015).
Wang, F. et al. Solute geochemistry and water quality assessment of groundwater in an arid endorheic watershed on Tibetan plateau. Sustainability 14, 15593 (2022).
Hasan, M. S. U. & Rai, A. K. Groundwater quality assessment in the lower Ganga basin using entropy information theory and GIS. J. Clean. Prod. 274 https://doi.org/10.1016/j.jclepro.2020.123077 (2020).
Ramos, E., Bux, R. K., Medina, D. I., Barrios-Piña, H. & Mahlknecht, J. Spatial and multivariate statistical analyses of human health risk associated with the consumption of heavy metals in groundwater of Monterrey metropolitan area. Mexico Water 15 (2023).
De Meyer, C. M. C. et al. Arsenic, manganese and aluminum contamination in groundwater resources of Western Amazonia (Peru). Sci. Total Environ. 607–608, 1437–1450. https://doi.org/10.1016/j.scitotenv.2017.07.059 (2017).
Gevera, P. K., Cave, M., Dowling, K., Gikuma-Njuru, P. & Mouri, H. Naturally occurring potentially harmful elements in groundwater in Makueni County, South-Eastern Kenya: effects on drinking water quality and agriculture. Geosciences 10 (2020).
Zhi, C., Cao, W., Zhang, Z., Li, Z. & Ren, Y. Hydrogeochemical characteristics and processes of shallow groundwater in the yellow river delta, China. Water 13 (2021).
Wen, X., Wu, Y., Su, J., Zhang, Y. & Liu, F. Hydrochemical characteristics and salinity of groundwater in the Ejina basin, Northwestern China. Environ. Geol. 48, 665–675. https://doi.org/10.1007/s00254-005-0001-7 (2005).
Shao, Y. et al. Hydrochemical characteristics and quality evaluation of irrigation and drinking water in Bangong Co lake watershed in Northwest Tibetan plateau. Water 15 (2023).
Pastor, A., Skovsholt, L. J., Christoffersen, K. S., Wu, N. & Riis, T. Geomorphology and vegetation drive hydrochemistry changes in two Northeast Greenland streams. Hydrol. Process. 35, e14369. https://doi.org/10.1002/hyp.14369 (2021).
Li, C. et al. Study on the influence of mining activities on the quality of deep karst groundwater based on multivariate statistical analysis and hydrochemical analysis. Int. J. Environ. Res. Public. Health 19 (2022).
Wu, X., Zhang, L., Hu, B. X., Wang, Y. & Xu, Z. Isotopic and hydrochemical evidence for the salinity origin in the coastal aquifers of the Pearl river delta, Guangzhou, China. J. Contaminant Hydrol. 235 https://doi.org/10.1016/j.jconhyd.2020.103732 (2020).
Wang, H., Gu, H., Lan, S., Wang, M. & Chi, B. Human health risk assessment and sources analysis of nitrate in shallow groundwater of the Liujiang basin, China. Hum. Ecol. Risk Assess. 24, 1515–1531. https://doi.org/10.1080/10807039.2017.1416455 (2018).
He, Z., Han, D., Song, X. & Yang, S. Impact of human activities on coastal groundwater pollution in the Yang-Dai river plain, Northern China. Environ. Sci. Pollut Res. 27, 37592–37613. https://doi.org/10.1007/s11356-020-09760-7 (2020).
Molinari, A., Guadagnini, L., Marcaccio, M. & Guadagnini, A. Geostatistical multimodel approach for the assessment of the spatial distribution of natural background concentrations in large-scale groundwater bodies. WATER RESEARCH 149, 522–532. https://doi.org/10.1016/j.watres.2018.09.049 (2019).
Rezaei, A. & Hassani, H. Hydrogeochemistry study and groundwater quality assessment in the north of Isfahan, Iran. Environ. Geochem. Health 40, 583–608 (2018).
Swartjes, F. A. & Otte, P. F. A novel concept in ground water quality management: towards function specific screening values. Water Res. 119, 187–200. https://doi.org/10.1016/j.watres.2017.04.030 (2017).
Kim, K. H. et al. Geochemical pattern recognitions of deep thermal groundwater in South Korea using self-organizing map: identified pathways of geochemical reaction and mixing. J. Hydrol. 589 https://doi.org/10.1016/j.jhydrol.2020.125202 (2020).
Qin, H. et al. Integrated hydrological modeling of the North China plain and implications for sustainable water management. Hydrol. Earth Syst. Sci. 17, 3759–3778. https://doi.org/10.5194/hess-17-3759-2013 (2013).
Mao, H. et al. Deciphering spatial pattern of groundwater chemistry and nitrogen pollution in Poyang Lake Basin (eastern China) using self-organizing map and multivariate statistics. J. Cleaner Prod. 329, 29697 (2021).
Elumalai, V. et al. Geochemical evolution and the processes controlling groundwater chemistry using ionic ratios, geochemical modelling and chemometric analysis in uMhlathuze catchment, KwaZulu-Natal, South Africa. Chemosphere 312, 137179. https://doi.org/10.1016/j.chemosphere.2022.137179 (2023).
Qu, S., Shi, Z., Liang, X., Wang, G. & Han, J. Multiple factors control groundwater chemistry and quality of multi-layer groundwater system in Northwest China coalfield — Using self-organizing maps (SOM). J. Geochem. Exploration 227, 106795. https://doi.org/10.1016/j.gexplo.2021.106795 (2021).
Zhang, Y. et al. Hydrogeochemical analysis and groundwater pollution source identification based on self-organizing map at a contaminated site. J. Hydrol. 616 ,128839 (2023).
Amiri, V. & Nakagawa, K. Using a linear discriminant analysis (LDA)-based nomenclature system and self-organizing maps (SOM) for spatiotemporal assessment of groundwater quality in a coastal aquifer. J. Hydrol. 603, (2021). https://doi.org/10.1016/j.jhydrol.2021.127082
Chen, I. T., Chang, L.-C. & Chang, F.-J. Exploring the spatio-temporal interrelation between groundwater and surface water by using the self-organizing maps. J. Hydrol. 556, 131–142,(2018).
Thuy Thanh, N., Clark, S., Sisson, S. A. & Sharma, A. Identification of spatio-seasonal hydrogeochemical characteristics of the unconfined groundwater in the Red River Delta, Vietnam. Appl. Geochem. 63, 10–21, (2015). https://doi.org/10.1016/j.apgeochem.2015.07.009
Yin, Z., Luo, Q., Wu, J., Xu, S. & Wu, J. Identification of the long-term variations of groundwater and their governing factors based on hydrochemical and isotopic data in a river basin. J. Hydrol. 592, (2021).
Clark, S., Sisson, S. A. & Sharma, A. Tools for enhancing the application of self-organizing maps in water resources research and engineering. Adv. Water Resour. 143,103676 (2020).
Liu, Z., Feng, S., Zhangsong, A., Han, Y. & Cao, R. Long-term evolution of groundwater hydrochemistry and its influencing factors based on self-organizing map (SOM). Ecol. Indic. 154, 110697. https://doi.org/10.1016/j.ecolind.2023.110697 (2023).
Balkissoon, S., Fox, N., Lupo, A., Haupt, S. E. & Penny, S. G. Classification of tall tower meteorological variables and forecasting wind speeds in Columbia, Missouri. Renew. Energy. 217, 119123. https://doi.org/10.1016/j.renene.2023.119123 (2023).
Foroutan, M. & Zimbelman, J. R. Semi-automatic mapping of linear-trending bedforms using ‘Self-Organizing maps’ algorithm. Geomorphology 293, 156–166. https://doi.org/10.1016/j.geomorph.2017.05.016 (2017).
Avand, M., Moradi, H. & lasboyee, M. R. Spatial modeling of flood probability using geo-environmental variables and machine learning models, case study: Tajan watershed, Iran. Adv. Space Res. 67, 3169–3186. https://doi.org/10.1016/j.asr.2021.02.011 (2021).
Lee, K.-J. et al. The combined use of self-organizing map technique and fuzzy c-means clustering to evaluate urban groundwater quality in Seoul metropolitan city, South Korea. J. Hydrol. 569, 685–697, (2019).
Zhu, G. et al. Influence of mining activities on groundwater hydrochemistry and heavy metal migration using a self-organizing map (SOM). J. Clean. Prod. 257 https://doi.org/10.1016/j.jclepro.2020.120664 (2020).
Gu, X. et al. Hydrogeochemical characterization and quality assessment of groundwater in a long-term reclaimed water irrigation area, North China Plain. Water. 10. https://doi.org/10.3390/w10091209 (2018).
Zhang, Q. et al. Geostatistics-based Spatial variation characteristics of groundwater levels in a wastewater irrigation area, Northern China. Water Supply. 17, 1479–1489. https://doi.org/10.2166/ws.2017.052 (2017).
Liu, T., Gao, X., Zhang, X. & Li, C. Distribution and assessment of hydrogeochemical processes of F-rich groundwater using PCA model: a case study in the Yuncheng basin, China. Acta Geochim. https://doi.org/10.1007/s11631-019-00374-6 (2019).
Zhou, Y., Li, P., Chen, M., Dong, Z. & Lu, C. Groundwater quality for potable and irrigation uses and associated health risk in Southern part of Gu’an County, North China plain. Environ. Geochem. Health. 43, 813–835. https://doi.org/10.1007/s10653-020-00553-y (2021).
Hao, Q., Xiao, Y., Chen, K., Zhu, Y. & Li, J. Comprehensive Understanding of groundwater geochemistry and suitability for sustainable drinking purposes in confined aquifers of the Wuyi region, central North China plain. Water 12 https://doi.org/10.3390/w12113052 (2020).
Xiao, Y. et al. Investigating sources, driving forces and potential health risks of nitrate and fluoride in groundwater of a typical alluvial fan plain. Sci. Total Environ. 802, 149909. https://doi.org/10.1016/j.scitotenv.2021.149909 (2022).
Liu, H., Zhang, G., Guo, H., Wang, Z. & Ge, Q. Occurrences of nitrate-contaminated groundwater in the piedmont aquifers: hydrogeochemical characteristics and health risks. Environ. Geochem. Health 46, 366, (2024).
Li, J.et al Radiocarbon dating of groundwater in the North China Plain and its implication to 14 C correction. Hydrogeol. Eng. Geol. (2025).
Xiao, Y. et al. Hydrogeochemical insights into the signatures, genesis and sustainable perspective of nitrate enriched groundwater in the Piedmont of Hutuo watershed. China CATENA. 212, 106020. https://doi.org/10.1016/j.catena.2022.106020 (2022).
Han, D., Currell, M. J. & Guo, H. Controls on distributions of sulphate, fluoride, and salinity in aquitard Porewater from the North China plain: Long-term implications for groundwater quality. J. Hydrol. 603, 126828. https://doi.org/10.1016/j.jhydrol.2021.126828 (2021).
Cheng, Z., Su, C., Zheng, Z., Chen, Z. & Wei, W. Characterize groundwater vulnerability to intensive groundwater exploitation using tritium time-series and hydrochemical data in Shijiazhuang, North China plain. J. Hydrol. 603, 126953. https://doi.org/10.1016/j.jhydrol.2021.126953 (2021).
MNRC. Vol. DZ/T 0420–2022. (2022).
Adimalla, N., Qian, H. & Li, P. Entropy water quality index and probabilistic health risk assessment from geochemistry of groundwaters in hard rock terrain of Nanganur County, South India. Geochemistry 80, 125544. https://doi.org/10.1016/j.chemer.2019.125544 (2020).
Kohonen, T. Self-organized formation of topologically correct feature maps. Biol. Cybernetics. 43, 59–69. https://doi.org/10.1007/BF00337288 (1982).
Kohonen, T. Essentials of the self-organizing map. Neural Netw. 37, 52–65. https://doi.org/10.1016/j.neunet.2012.09.018 (2013).
Vesanto, J. & Alhoniemi, E. Clustering of the self-organizing map. IEEE Trans. Neural Networks 11, 586–600. https://doi.org/10.1109/72.846731 (2000).
Qu, S. et al. Deciphering spatio-seasonal patterns, driving forces, and human health risks of nitrate and fluoride enriched water bodies in the Inner Mongolia Reaches of the Yellow River Basin, China. Enviro. Sci. Pollut. Res. 30, 111423–111440, (2023).
Deng, Y., Ye, X. & Du, X. Predictive modeling and analysis of key drivers of groundwater nitrate pollution based on machine learning. J. Hydrol. 624, 129934. https://doi.org/10.1016/j.jhydrol.2023.129934 (2023).
Zhang, L., Li, P. & He, X. Interactions between surface water and groundwater in selected tributaries of the Wei river (China) revealed by hydrochemistry and stable isotopes. Hum. Ecol. Risk Assess. 28, 79–99. https://doi.org/10.1080/10807039.2021.2016054 (2022).
Xiao, Y. et al. Occurrence, Controlling Factors and Health Hazards of Fluoride-Enriched Groundwater in the Lower Flood Plain of Yellow River, Northern China. Exposure Health. 14, 345–358. https://doi.org/10.1007/s12403-021-00452-2 (2022).
Adimalla, N. Application of the entropy weighted water quality index (ewqi) and the pollution index of groundwater (pig) to assess groundwater quality for drinking purposes: a case study in a rural area of Telangana State, India. Arch. Environ. Contam. Toxicol. 80, 31–40. https://doi.org/10.1007/s00244-020-00800-4 (2021).
Zhang, Y. et al. Hydrochemistry, quality and potential health risk appraisal of nitrate enriched groundwater in the Nanchong area, Southwestern China. Sci. Total Environ. 784, 147186. https://doi.org/10.1016/j.scitotenv.2021.147186 (2021).
GAQS. Standards for Groundwater Quality (GB/T 14848 – 2017). General Administration of Quality Supervision, (2017).
WHO. Guidelines for drinking-water quality, 4th edition Incorporating the First Addendum. (2017).
Feng, B. et al. Nitrogen sources and conversion processes in shallow groundwater around a plain lake (Northwest China): evidenced by multiple isotopes and water chemistry. Chemosphere 337, 139322. https://doi.org/10.1016/j.chemosphere.2023.139322 (2023).
Zheng, W. & Wang, S. Extreme precipitation accelerates the contribution of nitrate sources from anthropogenetic activities to groundwater in a typical headwater area of the North China plain. J. Hydrol. 603, 127110. https://doi.org/10.1016/j.jhydrol.2021.127110 (2021).
Wu, C., Fang, C., Wu, X., Zhu, G. & Zhang, Y. Hydrogeochemical characterization and quality assessment of groundwater using self-organizing maps in the Hangjinqi gasfield area, Ordos basin, NW China. Geosci. Front. 12, 781–790. https://doi.org/10.1016/j.gsf.2020.09.012 (2021).
Mia, M. Y. et al. Analysis of self-organizing maps and explainable artificial intelligence to identify hydrochemical factors that drive drinking water quality in Haor region. Sci. Total Environ. 904, 166927. https://doi.org/10.1016/j.scitotenv.2023.166927 (2023).
Yang, H. et al. Lithology controls on the mixing behavior and discharge regime of thermal groundwater in the Bogexi geothermal field on Tibetan plateau. J. Hydrol. 628, 130523. https://doi.org/10.1016/j.jhydrol.2023.130523 (2024).
Gibbs, R. J. Mechanisms controlling world water chemistry. Sci. (New York N Y). 170, 1088–1090. https://doi.org/10.1126/science.170.3962.1088 (1970).
Cheng, T. et al. Parallel simulation of groundwater flow in the North China plain. J. Earth Sci. 25, 1059–1066. https://doi.org/10.1007/s12583-014-0485-8 (2014).
Mao, M., Guo, P., Zhu, X., Yang, J. & Pan, D. Hydrochemical characterization and quality assessment of groundwater in the hilly area of the Taihang Mountains in Henan Province, China. Environ Sci Pollut Res Int 28, 43853–43871 (2021).
Xiao, Y. et al. Hydrogeochemical Features and Genesis of Confined Groundwater and Health Perspectives for Sustainable Development in Urban Hengshui, North China Plain. J. Chem. 2021, 5578192, (2021). https://doi.org/10.1155/2021/5578192 (2021).
Ledesma-Ruiz, R., Pasten-Zapata, E., Parra, R., Harter, T. & Mahlknecht, J. Investigation of the geochemical evolution of groundwater under agricultural land: A case study in Northeastern Mexico. J. Hydrol. 521, 410–423. https://doi.org/10.1016/j.jhydrol.2014.12.026 (2015).
Liu, J. et al. Hydrochemical characteristics and water quality assessment of groundwater in the Yishu River basin. Acta Geophysica 68, 877–889. https://doi.org/10.1007/s11600-020-00440-1 (2020).
Sun, H. et al. Variation of nitrate sources affected by precipitation with different intensities in groundwater in the piedmont plain area of alluvial-pluvial fan. J. Environ. Manag. 367, 121885 (2024).
Yang, H. et al. Geochemical fingerprints, evolution, and driving forces of groundwater in an alpine basin on Tibetan plateau: insights from unsupervised machine learning and objective weight allocation approaches. J. Hydrology: Reg. Stud. 56, 102054. https://doi.org/10.1016/j.ejrh.2024.102054 (2024).
Wang, D., Li, P., Ji, Y. & Ren, X. Insight into the impact of biogeochemical reactions of groundwater nitrogen on chemical weathering and carbon cycling. J. Hydrol. 648, 132458. https://doi.org/10.1016/j.jhydrol.2024.132458 (2025).
Wang, S. et al. Seasonal recharge mechanism of the upper shallow groundwater in a long-term wastewater leakage and irrigation region of an alluvial aquifer. J. Hydrol. 629, 130424. https://doi.org/10.1016/j.jhydrol.2023.130424 (2024).
Amiri, V., Kamrani, S., Ahmad, A., Bhattacharya, P. & Mansoori, J. Groundwater quality evaluation using Shannon information theory and human health risk assessment in Yazd Province, central plateau of Iran. Environ. Sci. Pollut Res. 28, 1108–1130. https://doi.org/10.1007/s11356-020-10362-6 (2021).
Acknowledgements
The authors would like to thank the editors and anonymous reviewers for their critical comments and suggestions which greatly helped us to improve the present manuscript.
Funding
This research was funded by the National Key Research and Development Program of China (grant number 2024YFC3211600); the National Natural Science Foundation of China (grant number 42477059; 41702282); the Fujian Provincial Key Laboratory of Water Cycling and Eco-Geological Processes (grant number SK202305KF01); the MOE Key Laboratory of Groundwater Circulation and Environmental Evolution, China University of Geosciences (Beijing) (grant number 2023-004); the Student Research Training Program of Southwest Jiaotong University (241925).
Author information
Authors and Affiliations
Contributions
The design and conception of the study were completed by Q.H., Y.X. and H.C. The Field investigation was completed by H.Y., L.W., J.W., Y.Z. The Data Curation was completed by K.L., L.W., J.W., W.H., Y.L. and B.L. The methodology was completed by K.L., J.W., Y.Z., W.H. and Y.L. The Formal analysis was completed by Q.H., Y.X., K.L., H.Y., L.W., J.W., Y.Z., Y.L., B.L. The original draft writing was completed by Q.H. The reviewing and editing writing were by Y.X and H.C. All the authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Hao, Q., Xiao, Y., Liu, K. et al. Spatial pattern of groundwater chemistry in a typical piedmont plain of Northern China driven by natural and anthropogenic forces. Sci Rep 15, 7643 (2025). https://doi.org/10.1038/s41598-025-91659-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-91659-9