Spatial Distribution and Associated Risk Assessment of Heavy Metal Pollution in Farmland Soil Surrounding the Ganhe Industrial Park in Qinghai Province, China

: The farmland around the industrial areas in the Upper Yellow River is crucial for agricultural production but is vulnerable to contamination from the surrounding industries. This research focused on analyzing the spatial distribution and environmental risks of heavy metal pollution in the farmland around the Ganhe Industrial Park in the Qinghai–Tibet Plateau. A total of 138 surface soil samples were collected, and the concentration of seven heavy metals (Cd, As, Pb, Cr, Cu, Ni, and Zn) was analyzed using the random forest (RF) model. Pollution indicators, including the pollution index and Nemero index, were used to evaluate the pollution levels of soil heavy metals. The human health and ecological risks were estimated using the hazard index (HI) and the potential ecological risk index (RI). Cd and Zn were identiﬁed as the primary soil pollutants in the study area, with Cd being more concentrated than other heavy metals. Heavy metal contamination was most severe in the central–eastern region of the study area, with a ring-shaped distribution, which correlated with the presence of zinc smelting and chemical plants. Furthermore, the study revealed that soil heavy metal contamination posed a health threat to the local population, with children being particularly vulnerable to non-carcinogenic risks when the HI was 1.21 and to potential carcinogenic risks when the CR was 2.27 × 10 − 5 . Additionally, heavy metal pollution caused a moderate to high ecological risk in 56.4% of the samples. The results highlighted the severe impact of soil heavy metal pollution on the delicate ecosystem of the Upper Yellow River and Qinghai–Tibet Plateau. The government should take action to improve soil environment management and prevent heavy metal pollution to protect the health of the local population and the ecological environment.


Introduction
Soil heavy metal pollution is considered to be one of the most serious environmental problems that can lead to health risks for humans and animals [1][2][3][4][5][6]. Heavy metals could enter the soil through various pathways, including bedrock weathering, precipitation, dust settlement, and waste discharge [7][8][9]. Wastes discharged by anthropogenic activities, such as mining, metal processing and smelting, chemical production, factory emissions, and sewage irrigation, are major sources of heavy metal pollution in soil [10][11][12][13]. Prior research has demonstrated that the elevated concentration of heavy metals in soil poses a threat to crops, animals, and human health through direct exposure (such as ingestion, dermal absorption, and inhalation) or via food chains [14][15][16][17][18][19][20][21].
Heavy metals in soils are difficult to degrade and decontaminate [8]. The presence of heavy metals in soils is a serious concern for soil environment management and heavy metal pollution prevention. Several statistics-based methods have been developed to evaluate the soil heavy metal pollution degree, including pollution index (P i ), geo-accumulation index (I geo ), and Nemero index (P), which use classification criteria to assess pollution levels [8,[22][23][24]. These methods could consider the influence of natural geological effects and anthropogenic activities on heavy metals in soil, with the metal background value as a parameter [25,26]. However, the relationship between the concentrations and various predictor variables often reveals non-linear features, making it challenging to predict the spatial distribution and influence factors of soil heavy metals. The random forest model (RF) has been used to overcome this challenge and predict the spatial distribution and influence factors of soil heavy metals [27]. In addition to pollution evaluation, risk assessment is also essential for heavy metal contamination. The potential ecological risk assessment (RI) is commonly used to assess the risk posed to the ecological environment and human health [8,[28][29][30], while the Health Risk Assessment (HRA) is used to quantitatively assess the carcinogenic and non-carcinogenic risks of human exposure to certain contaminants [3,28].
The combination of these methods can provide a more comprehensive evaluation of the potential ecological and health risk of toxic metals in soil. However, most previous studies were carried out mainly in floodplains, ports, mining areas, or urban-rural ecotones, with few studies conducted in industrial park areas with high altitudes [1][2][3]11,12]. Highaltitude areas have different characteristics for the migration and transformation of heavy metals, including low temperature, underdeveloped soil, and sensitive ecosystem, making it necessary to study heavy metal contamination in these areas as well.
The Upper Yellow River is located in the north-eastern Qinghai-Tibet Plateau, which is an area that makes up 16.2% of the total area of the Yellow River Basin [24]. The area has an average altitude of approximately 3500 m and is known as the "third pole" of the Earth due to its sensitivity to climate change in Asia and the Northern Hemisphere [31]. The most widely distributed non-zonal vegetation types in the region are alpine meadow and alpine steppe [31]. As the largest industrial park in the Upper Yellow River, the high-pollution factories in the Ganhe Industrial Park may pose a threat to the residents of the whole county, the farmland, and even the ecosystem of the Upper Yellow River and Qinghai-Tibet Plateau, due to the release of contaminants to the water, air, and soil ( Figure 1). Therefore, this study aims to quantitatively evaluate the pollution status and potential risks of heavy metal concentrations in Ganhe Industrial Park by applying various methods, such as the random forest model, pollution index, Nemero index, potential ecological risk assessment, and health risk assessment [32,33]. The study not only investigated the relative environmental, economic, and natural conditions in the study area but also estimated human health and ecological risks by using the hazard index (HI) and potential ecological risk index (RI) to provide theoretical support for contamination management for the industrial park and even for the Upper Yellow River. The findings of the study could confirm the extent of heavy metal pollution in the study area, as well as the potential risks associated with this pollution, to maintain a sustainable and healthy agricultural system in the Upper Yellow River, which could also be used in similar regions in China and other developing countries. Land 2023, 12, x FOR PEER REVIEW 3 of 17

Study Area
The study area is located in the north-eastern Qinghai Province, which is a typical temperate continental climate region characterized by hot summers, cold winters, and low annual precipitation. The area covers approximately 200 km 2 , with coordinates ranging from 36°29′49″ N~36°36′47″ N to 101°25′44″ E~101°35′2″ E ( Figure 1). The elevation gradually declines from south to north, ranging from 3600 m to 2100 m, and the terrain belongs to the Qinghai-Tibet Plateau.
The land use types are dominated by farmland, grassland, and construction land, and the farmland in this study's area is mainly used for growing grain crops ( Figure 1). The soil types in the area include Chernozem, Kastanozem, Calcisols, and Cryosols, according to the classification system of the Food and Agriculture Organization (FAO) [34]. The residential and factory estates extend along two parallel drainages in the north-south

Study Area
The study area is located in the north-eastern Qinghai Province, which is a typical temperate continental climate region characterized by hot summers, cold winters, and low annual precipitation. The area covers approximately 200 km 2 , with coordinates ranging from 36 • 29 49 N~36 • 36 47 N to 101 • 25 44 E~101 • 35 2 E (Figure 1). The elevation gradually declines from south to north, ranging from 3600 m to 2100 m, and the terrain belongs to the Qinghai-Tibet Plateau.
The land use types are dominated by farmland, grassland, and construction land, and the farmland in this study's area is mainly used for growing grain crops ( Figure 1). The soil types in the area include Chernozem, Kastanozem, Calcisols, and Cryosols, according to the classification system of the Food and Agriculture Organization (FAO) [34]. The residential and factory estates extend along two parallel drainages in the north-south direction, interspaced by farmlands and pastures. The factories include various metals (aluminum, copper, lead, zinc, iron), smelting plants, chemical plants, inorganic salt manufacturing plants, etc. The soil in the study area is heavily polluted with heavy metals, mainly as a result of industrial contaminants released into the air, water, and soil, as well as excessive use of pesticides and fertilizers.

Sampling and Analysis
A total of 138 soil samples were collected between 29 September 2020 and 3 October 2020. Semi-random distribution method was used for selecting the sampling points, taking into account the accessibility and spatial coverage of the study area ( Figure 1). For each sampling point, a 20 m × 20 m square was built in the center, and five samples were collected from the four corners and the center of each square. The collected soil samples were mixed to obtain a composite sample for each sampling point. The soil samples were collected from a depth of 0-20 cm from the natural surface and were approximately 2 kg in weight. Rocks and debris were removed from the samples. The soil samples were airdried, crushed, and sieved through 200 mesh polyethylene for analysis. The heavy metal concentrations of Cd, As, Pb, Cr, Cu, Ni, and Zn were analyzed using inductively coupled plasma mass spectrometry (ICP-MS, Thermo Fisher X7, Bremen, Germany) instruments following digestion with an HCl-H 2 O-HF-HNO 3 mixture [35]. The analysis was conducted in a laboratory with an environment temperature of 24 • C and a humidity of 40%. The ICP-MS detection limits for all elements are calculated as three times the standard deviation of the calibration blank measurements (1:1 v/v HNO 3 : MQ-water); instrumental stability and tuning were checked using a solution of 10 µg/L of Li, Be, Bi, Ce, Co, In, Pb and U in HNO 3 (2% v/v), and 156CeO+/140Ce+<0.3%. Internal standardization was performed using 103Rh for all elements. For ICP-MS determination, certified reference materials (GSS-3a and Gss-5a), blank, and samples were measured for quality control. The GSS-3a and Gss-5a were used for the validation of the measurement. The measured results were compared with the certified reference values, and the results were within the 95% confidence level. Moreover, repeated analysis of samples from different sampling areas was adopted, and the results of duplicate samples were consistent within the margin of error.

Random Forest Model
The RF model was used to establish the non-linear relationship between the independent variables and the dependent variable [27]. The independent variables include factory distance, slope, road distance, annual rainfall, average annual temperature, and elevation of the samples, while the dependent variable is heavy metals in the soil.
The model-building process involved several steps [27]. Firstly, K bootstrap samples were selected using the self-expanding sampling method to train the regression tree. Then, m subsets were randomly extracted from M feature values, and the best subset was selected when the tree was split (m ≤ M). Each decision tree was built without pruning and allowed to grow to the maximum extent. Finally, the prediction results were obtained through voting.
In the modeling process, the research area was divided into 900 (30 × 30) rectangular grids with a grid size of 500 m, ensuring that the number of training grids was greater than 10% of the total grid. Sample points located in the same grid were eliminated if they had the same content values, resulting in only 136 sample points being used. The optimal parameters and accuracy of the training sets corresponding to verification sets are shown in Table 1, which was generated during the model evaluation process.
The sample dataset of 136 points was split into 70% for modeling and 30% for verification. A Python program was developed using ArcGIS Pro 2.5 to optimize the model parameters in order of the number of trees, the maximum tree depth, the number of random sampling variables, the minimum leaf size, and the accuracy of the training, and verification sets was measured (Table 1).

Pollution Index
The pollution index is a single-factor assessment method used to evaluate heavy metal pollution and the associated pollution risk [10,36]. It can be calculated by dividing the actual concentration of a specific heavy metal by the corresponding background values of the same metal in the soil, as indicated in the provided Equation (1) where P i represents the pollution index of the heavy metal i; C i represents the chemical analyzed value of soil heavy metal i; and S i represents the standard value of soil heavy metal quality. The standard value of soil heavy metal quality is determined by using the screening values defined in the Soil Environmental Quality Standard (GB 15618-2018) issued by the Chinese Ministry of Environmental Protection in 2018 [37]. A higher value of P i indicates more severe contamination of the soil. When P i is less than or equal to 1, the soil is considered uncontaminated. A P i value between 1 and 2 indicates that the soil is low to moderately contaminated, while a P i value between 2 and 3 indicates that the soil is moderate to heavily contaminated. Finally, if P i is greater than 3, the soil is considered heavy to extremely contaminated [8].

Nemero Index
The Nemero index is a useful tool for assessing the environmental risk of pollutants, specifically heavy metals, in soil. The index takes into account the different toxic effects of each heavy metal and calculates a comprehensive measure of the environmental risk based on their concentrations. The formula for calculating the Nemero index is as follows: is the maximum value of the pollution index of the is the average value of the individual pollution index. The quality of the soil environment is classified into 5 levels based on the Nemero index, according to

Potential Health Risk Assessment
Health Risk Appraisal (HRA) is a quantitative method used to assess the human risk of exposure to environmental pollutants, including heavy metals [39,40]. Ingestion, inhalation, and dermal absorption are the three primary pathways through which heavy metals can enter the human body [41,42]. According to the model provided by the US Environmental Protection Agency (USEPA), the chronic daily intake (CDI) of the seven heavy metals through multiple exposure pathways was calculated using the following equations: where C soil represents the average of the measured concentration of each soil heavy metal for all the samples in the study area (mg·kg −1 ). CDI ingest , CDI inhale , and CDI dermal represent the potential intake value of each heavy metal through ingest, inhale, and dermal absorption. The other parameters in Equation (3)-(5) are shown in Table 3. The hazard quotient (HQ) and hazard index (HI) are commonly used metrics in human health risk assessments for evaluating the potential chronic effects of exposure to multiple heavy metals. The HQ is calculated by dividing the estimated exposure level of a particular heavy metal over a specified time period by a reference dose (RfD) for a similar exposure period. The RfD is an estimate of the amount of a substance that a person can be exposed to on a daily basis over a lifetime without appreciable health risks [43]. HQ and HI are calculated using Equations (6) and (7), respectively: where RfD is the reference dose of heavy metals (mg·kg −1 ) and is different for each heavy metal, as shown in Table 3 [43]. HI > 1 indicates that the environment has a serious potential for non-carcinogenic risk; HI < 1 suggests that there is no significant potential for non-carcinogenic risks in the environment [44].
The Carcinogenic Risk (CR) assessment methodology is utilized to estimate the risk of carcinogenesis for humans who are exposed to the external environment via three distinct pathways [45]. The Lifetime Carcinogenic Risk (LCR) is calculated as the aggregate of the three CRs [45]. Equations (8) and (9) are used to compute both CR and LCR: where CSF refers to the carcinogenic slope factor of heavy metals and is different for each heavy metal (Table 4). According to the USEPA, for heavy metals, the minimum CR value is 1.00 × 10 −6 , and CR values between 1.00 × 10 −6 and 1.00 × 10 −4 suggest no severe carcinogenic risk to human health [6].

Potential Ecological Risks
The potential ecological risk index (RI) was a widely used method for assessing the potential ecological risk of soil heavy metal pollution. The RI considers both the concentration of heavy metals in the soil and the potential ecological impact of these metals [8,28,47]. This method takes into account various factors, such as the concentration of heavy metals, their toxicity, environmental quality standards, and ecological effects, in order to evaluate the potential impact of heavy metals on ecosystems [8]. The RI index can be calculated using the following equation: where RI is the comprehensive potential ecological risk index; E i r is the individual potential ecological hazard index value of heavy metal i; T i r is the toxicity factor value of heavy metal i [48]; C i is the measured content of heavy metal i; C i n is the reference value of heavy metal i. The toxicity factor values of Pb, Cu, Cr, Cd, Zn, Ni, and As are 5, 5, 2, 30, 1, 5, and 10, respectively [47].
Based on the classification criteria of E i r and RI proposed by Hakanson (1980), E i r < 40 means low potential ecological risk; 40 ≤ E i r < 80 indicates moderate potential risk; 80 ≤ E i r < 160 represents a considerable potential risk; 160 ≤ E i r < 320 means high potential risk; E i r ≥ 320 indicates significantly very high risk. Four categories of RI values define, which are low risk (RI < 150), moderate risk (150 ≤ RI < 300), considerable risk (300 ≤ RI < 600), and high risk (RI ≥ 600) [47].

Statistics of the Soil Heavy Metals
The statistics of the soil heavy metals concentrations are listed in Table 5. Zn had the largest mean value (147 mg·kg −1 ), followed by Cr (89 mg·kg −1 ), while Cd had the lowest mean value of 1.52 mg·kg −1 . The obvious variations occurred in the measured heavy metals concentrations, varying between 0. 16 40%) for Cr, 1 sample (0.70%) for Pb, and none for the others (As, Cu and Ni). For Cd, 13 samples even exceeded level 2 (the risk intervention values in the standard).

Spatial Distribution of RF Model
The results of the RF model pass the significance test with a level of 0.01. The accuracy of seven heavy metals was greater than 0.5, and the R 2 of Cd, Cr, Ni, and Zn was above 0.8 (Table 1). From the perspective of the precision of the validation set, the precision for the other six heavy metals except As was greater than 0.5 (the R 2 of four heavy metals, Cd, Cr, Ni, and Zn, is above 0.6). Finally, the results were visualized in ArcGIS Pro 2.5. The weights of six factors (factory distance, slope, road distance, annual rainfall, annual average temperature, and elevation) were calculated (Figures 2 and 3).
Cd, Pb, and Zn had similar spatial distribution (Figure 2), with high pollution risk in the middle-east of the study area and low risk in the west and north-east regions. The high-risk regions were mainly concentrated in the concentrated areas of factories and enterprises. Similarly, Cr and Ni had similar spatial distributions, with high pollution risk in the north of the central region and low risk in the south-east and north-west regions. Cu had a different spatial distribution, with high pollution risk mainly concentrated in the middle of the study area and medium-risk areas in the south-west. However, the results for As were poor and did not accurately reflect the spatial distribution of pollution risks due to the relatively low R 2 .
The relative importance of six independent factors (factory distance, slope, road distance, annual rainfall, annual average temperature, and elevation) was analyzed ( Figure 3). Specifically, the factory distance factor had the highest weight in the modeling process for all seven heavy metals. Precipitation and annual average temperature were also important factors in predicting the pollution risks of Cr and Ni. The slope factor was important for predicting the pollution risks of Cd, Pb, and Cu. For As, the weight of the factory distance factor is not significantly higher than the weights of the other factors, and the weights of the other five factors are relatively equal. Considering the poor RF modeling accuracy of As (Figure 2), it can be inferred that none of the six factors is the main determinant of As soil contamination.   Cd, Pb, and Zn had similar spatial distribution (Figure 2), with high pollution risk in the middle-east of the study area and low risk in the west and north-east regions. The high-risk regions were mainly concentrated in the concentrated areas of factories and enterprises. Similarly, Cr and Ni had similar spatial distributions, with high pollution risk in the north of the central region and low risk in the south-east and north-west regions. Cu had a different spatial distribution, with high pollution risk mainly concentrated in the middle of the study area and medium-risk areas in the south-west. However, the results for As were poor and did not accurately reflect the spatial distribution of pollution risks due to the relatively low R 2 .
The relative importance of six independent factors (factory distance, slope, road distance, annual rainfall, annual average temperature, and elevation) was analyzed ( Figure  3). Specifically, the factory distance factor had the highest weight in the modeling process for all seven heavy metals. Precipitation and annual average temperature were also important factors in predicting the pollution risks of Cr and Ni. The slope factor was important for predicting the pollution risks of Cd, Pb, and Cu. For As, the weight of the factory distance factor is not significantly higher than the weights of the other factors, and the weights of the other five factors are relatively equal. Considering the poor RF modeling accuracy of As (Figure 2), it can be inferred that none of the six factors is the main determinant of As soil contamination.

Single Pollution Index Method
The mean values of Pi for each heavy metal, from highest to lowest, were as follows: Cd (2.53), Zn (0.

Single Pollution Index Method
The mean values of P i for each heavy metal, from highest to lowest, were as follows: Cd (2.53), Zn (0.49), As (0.47), Cr (0.36), Cu (0.26), Pb (0.21), and Ni (0.19). Cd was the highest pollutant among all the tested metals. Based on the classification criteria for different levels of pollution, the P i values indicated that the pollution level of Cd was in the moderately to heavily contaminated class. The pollution levels of the other heavy metals (Zn, As, Cr, Cu, Pb, and Ni) were at uncontaminated levels based on the mean values of P i (Figure 4). However, when considering the maximum value, P i , Pb, and Cr were at low to moderately contaminated levels, while Cd and Zn were at heavy to extremely contaminated levels. The other heavy metals (As, Cu, and Ni) showed an unpolluted tendency.
Overall, the study area was contaminated with Cd, and there might be some contamination with Zn, Pb, and Cr, depending on the maximum values of P i . The other heavy metals were not likely to cause significant pollution in the area.

Nemero Index
The mean value of the Nemero index in the study area was 1.86, which indicated that the area was slightly polluted, but the precaution levels for the samples had varying levels of pollution, with some samples being only slightly polluted and others being seriously polluted. The results indicated that more than 56.5% of the samples were above the safety level (34 warning level samples, 14 mild pollution level samples, 7 moderate pollution level samples, and 23 severe pollution level samples).
The spatial interpolation and classification of the Nemero index values were carried out using inverse distance weighting (IDW) interpolation [29,36]. Figure 5 shows that the pollution levels in the study area ranged from the safety level to the seriously polluted level. The study identified regions with slightly polluted levels, moderately polluted levels, and seriously polluted levels, which covered areas of 34.3 km 2 , 11.2 km 2 , and 22.0 km 2 , respectively. The polluted areas were mainly concentrated in the central-eastern region, which correlated well with relevant Cd concentrations, as shown in Figure 4. The spatial interpolation and classification of the Nemero index values were carried out using inverse distance weighting (IDW) interpolation [29,36]. Figure 5 shows that the pollution levels in the study area ranged from the safety level to the seriously polluted level. The study identified regions with slightly polluted levels, moderately polluted levels, and seriously polluted levels, which covered areas of 34.3 km 2 , 11.2 km 2 , and 22.0 km 2 , respectively. The polluted areas were mainly concentrated in the central-eastern region, which correlated well with relevant Cd concentrations, as shown in Figure 4.

Pollution Feature and Possible Sources
Based on the statistical results of the samples (Table 5), it could be concluded that the

Pollution Feature and Possible Sources
Based on the statistical results of the samples (Table 5), it could be concluded that the main pollutant of heavy metal pollution in cultivated soil in the study area was Cd, followed by Zn. The coefficient of variation (CV) of Cd and Zn exceeded 100%, which suggested that the variability of these two elements was high. This high variability indicated that external factors had a significant influence on the accumulation of these metals in the soil [46].
The RF model, which was built using six independent variables, including factory distance, slope, road distance, annual rainfall, average annual temperature, and elevation, provided accurate predictions of heavy metal concentrations in the soil for each grid with a size of 500 m in the study area. The results indicated that most heavy metals, especially Cd, Pb, and Zn, had similar spatial distribution (Figure 2), with high pollution risk in the middle-east of the study area and low risk in the west and north-east regions, which was consistent with the locations of the industrial enterprises. The spatial distribution of heavily polluted areas derived from RF was comparable to that mapped from the single pollution index method and Nemero index (Figures 2, 4 and 5).
Based on the relative importance of six independent factors in the RF model, the factory distance factor was identified as having the highest weight in predicting pollution risks for all seven heavy metals, with particular significance for Cd and Zn ( Figure 3). The modeling process showed that precipitation and annual average temperature were important factors in predicting the pollution risks of Cr and Ni. On the other hand, the slope factor was found to be significant in predicting the pollution risks of Cd, Pb, and Cu.
The results obtained in this study were consistent with those derived from source analysis of heavy metals using principal component analysis, cluster analysis, and positive matrix factor analysis methods (PMF) [49,50]. Through these methods, five potential sources, including soil parent material, coal burning, agricultural and industrial activity, electroplating, and transportation, were identified. Previous research had shown that the two major pollutants, Cd and Zn, mostly originated from agricultural and industrial activities and were distributed around industrial enterprises [51], which was in line with the finding that Cd and Zn pollution had the highest weight with the factory distance factor. Relevant studies had also indicated that there was a common source between Cr and Ni in soil, and these two heavy metals both were iron-philic elements in the epigenetic geochemical process. The content of these metals in soil was likely dominated by the soil-forming process [34,35], which was similar to the RF results indicating that the two metals resolved from the soil parent material correlated with precipitation and annual average temperature. This confirms that the combination of RF, single pollution index, and Nemero index could effectively determine the pollution degree of heavy metals, providing valuable information for the formulation of soil remediation measures.

Human Health Risks
The potential risks to children and adults posed by contaminated soil were assessed through ingestion, inhalation, and dermal contact pathways ( Table 6). The HI for children was 1.21, which was nearly 10 times higher than that for adults (HI = 1.42 × 10 −1 ). The sum of the HQ values for each pathway of the seven heavy metals showed that the risk of ingestion was the highest for both adults and children, followed by dermal contact and inhalation. However, the risk of inhalation was considered negligible due to its low HQ value compared to the other two pathways. In addition to non-carcinogenic risks, the carcinogenic risk (CR) of exposure to heavy metals, including Cd, As, Cr, and Ni, was calculated (Table 6), and it was found that the potential CR risks for children were greater than those for adults. However, none of the two indices (CR and HI) were greater than 1.00 × 10 −4 , which indicated that there were no severe potential carcinogenic risks in the area.
The results suggest that while there are potential non-carcinogenic risks for children due to exposure to heavy metals, there are no severe potential health risks posed to adults. It is important to continue monitoring and regulating exposure to heavy metals to ensure that these risks remain low and do not pose a significant threat to human health.

Potential Ecological Risks
According to the results, Cd was found to be the most dangerous heavy metal in the study area, with the highest mean risk value (75.8) and maximum risk value (1090) among all the heavy metals (Cd > As > Pb > Cu > Ni > Cr > Zn). The IDW interpolation and classification criteria were used to map the potential ecological risk index values, and the spatial distribution of the ecological risk index showed that the area with low risk was the largest, followed by the area with moderate risk (Figure 6). The areas with considerable and very high-risk areas were smaller in size but still significant. all the heavy metals (Cd > As > Pb > Cu > Ni > Cr > Zn). The IDW interpolation and classification criteria were used to map the potential ecological risk index values, and the spatial distribution of the ecological risk index showed that the area with low risk was the largest, followed by the area with moderate risk (Figure 6). The areas with considerable and very high-risk areas were smaller in size but still significant. It appears that the study area has an uneven distribution of RI, with the lowest, mean, and highest values being 16.9, 85.0, and 1.11 × 10 3 , respectively. Cd is identified as the most significant contributor to the RI, accounting for 89.2%. The spatial distribution of RI is similar to the distribution of Cd, Pb, and Zn concentrations. The study area is divided into the following four categories based on RI values: low risk (RI < 150); moderate risk (150 ≤ RI < 300); considerable risk (300 ≤ RI < 600); and very high risk (RI > 600). The area and ratio of each category were 93.3 km 2 and 43.4%, 78.2 km 2 and 36.3%, 22.4 km 2 and 10.4%, and 21.3 km 2 and 9.9%.
The results concluded that heavy metals posed noticeable potential ecological risks It appears that the study area has an uneven distribution of RI, with the lowest, mean, and highest values being 16.9, 85.0, and 1.11 × 10 3 , respectively. Cd is identified as the most significant contributor to the RI, accounting for 89.2%. The spatial distribution of RI is similar to the distribution of Cd, Pb, and Zn concentrations. The study area is divided into the following four categories based on RI values: low risk (RI < 150); moderate risk (150 ≤ RI < 300); considerable risk (300 ≤ RI < 600); and very high risk (RI > 600). The area and ratio of each category were 93.3 km 2 and 43.4%, 78.2 km 2 and 36.3%, 22.4 km 2 and 10.4%, and 21.3 km 2 and 9.9%.
The results concluded that heavy metals posed noticeable potential ecological risks in the study area, and Cd was the most polluted heavy metal and, therefore, should be evaluated specifically for the relevant risks.

Conclusions
This study measured the levels of seven heavy metals (Cd, As, Pb, Cr, Cu, Ni, and Zn) in soils in the Ganhe Industrial Park in the Upper Yellow River and found that Cd and Zn were the main pollutants in the soils. A total of 65 samples (47.1%) for Cd, 13 samples (9.40%) for Zn, 2 samples (1.40%) for Cr, and 1 sample (0.70%) for Pb were proved to exceed the soil environmental quality standard values. Cd was more concentrated than the other elements, and the polluted areas were mainly concentrated in the central-eastern region, which was spatially correlated with the factories, such as zinc smelting plants and chemical plants. The pollution index and Nemero index results showed that more than 56.5% of the samples were beyond the safety level, indicating that the soil was slightly polluted.
The study also found that there were serious potential non-carcinogenic risks for children (HI = 1.21) but no severe potential health risks posed to adults (HI = 0.14). Similarly, the potential carcinogenic risk (CR) of heavy metals for children (CR = 2.27 × 10 −5 ) was greater than those for adults (CR = 1.20 × 10 −5 ). The individual index values of potential ecological risk assessment of heavy metals indicated that Cd was the main contributor to ecological risk as it recorded the highest E i r values. Overall, this study highlights the potential ecological risks posed by heavy metals in the study area and the need for specific evaluation of Cd for relevant risks. The results provide theoretical support for pollution control and environmental management in the study area and the Upper Yellow River in China.