Introduction

Soil is a precious natural resources, human survival and agricultural production are inseparable from clean soil. Accompanied by rapid development of agriculture, industrialization and urbanization, the problem of soil pollution has become increasingly prominent1,2. Global soil heavy metal (HM) pollution accidents occur frequently and the form is becoming more and more serious. Soil HM contamination has become one of the serious eco-environmental issues faced by the development of various countries3,4. Soil heavy metals (HMs) accumulated will have a negative impact on soil nutrient cycling5 and crop yield and quality6. Soil HMs can enter body through skin contact and inhalation7or by the food chain8. Generally speaking, in nature, soil HMs mainly come from parent materials of soil9, and natural HMs mainly exist in the form that plants are difficult to use, and there is a low ecological risk10. Human activities, such as agricultural fertilization, industrial activities and vehicle emissions can increase the accumulation of HMs in soil11. Anthropogenic HMs often take high biological activity, are easy to be absorbed and utilized by plants, and have high ecological risks, and aggravate HMs spatial variability in soil. It is the key to effective deal with soil HM pollution, trace its source and explore the factors that affect the spatial differentiation characteristics of soil HMs. Last 10 years, many researches and analysis have been conducted both domestically and internationally on the driving factors of soil HM pollution12,13,14,15,16,17.

In recent years, the evaluation indexes of soil HM pollution, including factor of enrichment (EF), index of potential ecological risk (RI) and index of geological accumulation (Igeo) has been widely applied in practice18. The method of contaminant source division can be divided into identification of pollution source and quantification of pollution source. Pollution source identification is usually calculated by the cluster analysis, principal component analysis19 and geographic information system20, while the quantification of pollution sources is realized by receptor models, such as positive matrix factorization, balance of chemical mass, and geographic detector, which is extensively used because they can quantitatively display the contribution of different pollution sources in many researches4,21,22,23. Multivariate statistical analysis, geostatistical models24 and some comprehensive methods have gradually improved the study of the sources and spatial differentiation of HMs in soils. Accordingly, various evaluation indexes were applied to assess soil HM contaminant, such as Igeo, EF, RI and pollution index (PI)25.

Purple soil is usually classified as regosol or entisol26. A series of red or purple rocks from the Triassic to Cretaceous period formed the purple soil26. A large amount of this type of soil is distributed in the Sichuan Basin, which is one of the greatest agricultural areas in southwest China. With the serious trouble between the land resources limitation and the population surge, more and more purple soil was used into agricultural soil, and part purple soil was developed for production of intensive agricultural because its rich mineral nutrition. Most of the existing studies focus on the background value, baseline, content27, distribution28 and migration characteristics29 of HMs in purple soil, and there are few studies on the sources, ecological risk and spatial differentiation of HMs in purple soil. In particular, there are few reports on the potential ecological risks and driving factors of HMs in purple soil. This study aims to (1) assess the concentration and pollution levels of HMs in typical purple soils; (2) identify the main sources of HMs in typical purple soils; (3) assess the main drivers of HM pollution in typical purple soils based on the GeoDetector model; and (4) assess the potential ecological risks of HMs in typical purple soils. The research results can provide important theoretical and practical reference for the source identification, risk assessment and comprehensive treatment of HM pollution in purple soil, and will lay a theoretical foundation for environmental management and regional sustainable development of purple soil in China.

Materials and methods

Research area

The research area was situated in the east of Sichuan Province, longitude 105°56′-107°19′ E, latitude 30°01′-30°52′ N, with an area of 6339 km2 and a total population of 4.7 million. The topography of the research area is east high and west low, and the zoning characteristics are obvious. The research area was situated in the humid monsoon climate area of the middle subtropics, with a warm climate, abundant heat and abundant rainfall, an annual average precipitation is 1054.46 mm-1512.45 mm, an annual mean temperature is 16 ℃. The territory of 210, 212, 305, 318 national highways and 203, 304, 18 provincial highways and county, township, village roads crisscross, the highway network extends in all directions. The Jialing River and the Qujiang River cross the border.

Sampling and analysis

Field sampling to be completed in 2022. A total of 73 topsoil samples (0–20 cm) were collected according to the specification of land quality geochemical assessment (DZ/T 0295–2016)30. Sampling sites were displayed in Fig. 1. The sampling points were uniformly arranged in a 4 km2 sampling grid, and the distance between each sampling point was required to be greater than 2 km. 3–5 multi-point collections within 100 m around the sampling point are combined into one sample, and the original weight of the combined sample is greater than 1 kg. Locate sampling points with portable GPS. 5 profile sampling points PMA, PMB, PMC, PMD and PME in the study area, with a profile depth of 1 m, and one sample is collected for each 20 cm.

Figure 1
figure 1

The location of research area and sampling sites.

Corg was measured by volumetric method, N was measured by combustion infrared method, atomic fluorescence method was used measure the content of Hg and As, Cu, Pb, Ni, Zn, Sc, Cr, P, Cd and K were measured by x-ray fluorescence spectroscopy (XRF), inductively coupled plasma mass spectrometry (ICP-MS) and inductively Coupled Plasma Optical Emission Spectrometer (ICP-OES). The quality of analysis and testing was controlled by means of inserting national first-level soil standard materials, repeatability inspection, abnormal point inspection, and blank test.

Geo-accumulation Index (Igeo)

The Igeo method can be used to compare the concentration of different soil HMs and their pollution degree31.

$${\text{I}}_{{{\text{geo}}}} = {\text{ log}}_{{2}} \left( {\frac{{C_{i} }}{{K \times B_{i} }}} \right)$$
(1)

where Igeo was soil accumulation index of HM i; Ci was measured value of soil HM i; Bi was guideline value, and background value of profile soil was selected (Table 1); k was the correction coefficient, generally 1.5. The pollution degree of Igeo can be divided into seven grades: < 0 (unpolluted), 0–1 (mild contaminated), 1–2 (moderate contaminated), 2–3 (moderate-heavy contaminated), 3–4 (heavy contaminated), 4–5 (heavy-extreme contaminated) and > 5 (extremely heavy contaminated).

Table 1 Descriptive statistical results of topsoil composition.

Enrichment Factor (EF)

EF was a useful index to distinguish between human activities or natural sources of HMs. EF was calculated32:

$${\text{EF}} = \left[ {{\text{M}}_{i} /{\text{M}}_{Sc} } \right]_{{\text{S}}} /\left[ {{\text{M}}_{i} /{\text{M}}_{Sc} } \right]_{{\text{B}}}$$
(2)

where [Mi/MSc]S was the content ratio of HM i to Sc in samples, while [Mi/ MSc]B was the ratio of purple soil background values. Sc was a trace element, and has no significant anthropogenic sources, so Sc was selected as guideline element32. Generally, according EF value the soils can be classified as six enrichment grades: < 1 (minimal), 1–2 (mild), 2–5 (moderate), 5–20 (significant), 20–40 (very high), and ≥ 40 (extremely high).

Pollution index (PI) and synthetic pollution index (SPI)

For evaluate soil HMs contamination level, PI and SPI were calculated:

$${\text{PI }} = \frac{{C_{i} }}{{S_{i} }}$$
(3)
$${\text{SPI }} = \sqrt {\frac{{\left( {\frac{{C_{i} }}{{S_{i} }}} \right)_{max} + \left( {\frac{{C_{i} }}{{S_{i} }}} \right)_{ave} }}{2}}$$
(4)

where PI was element i contamination index, SPI was overall score of each HM to the composite contamination. Si was the valuation criterion of element i, and the national control thresholds were selected as criterion (Table 1). According PI and SPI values the soils can be classified as 5 contamination categories: < 0.7 (safety), 0.7–1 (alert), 1–2 (low), 2–3 (moderate) and ≥ 3 (severe)4.

Potential ecological risk factor (ER) and Potential ecological risk index (RI)

ER was applied to assess potential ecological risk of individual soil HM33, ER was calculated:

$${\text{ER}} = {\text{TR}}_{i} \times {\text{PI}}_{i}$$
(5)

where TRi was HM (i) toxic-response factor of Hg (40), Cd (30), As (10), Pb (5), Cu (5), Cr (2) and Zn (1)33, TR of Ni was 534. PIi was HM (i)contamination index of. The ER classes can be divided into five potential ecological risk grades: < 40 (low), 40–80 (moderate), 80–160 (considerable), 160–320 (high) and ≥ 320 (very high)35.

RI was a method to evaluate soil multi-element ecological risk. RI was calculated33,35:

$${\text{RI}} = \mathop \sum \limits_{i = 1}^{n} ER = \mathop \sum \limits_{i = 1}^{n} \left( {{\text{TR}}_{i} \times {\text{PI}}_{i} { }} \right)$$
(6)

where \(\mathop \sum \limits_{i = 1}^{n} ER\) was HM (i) potential ecological risk factor, n i the number of HMs. RI classes can be divided into four potential ecological risk grades: < 150 (low), 150–300 (moderate), 300–600 (considerable) and ≥ 600 (very high)35.

Geographical detector (GD)

Factor detector was one of the four methods for GD36, applied to determine spatial differentiation of dependent variables and the ability of their corresponding variables to illustrate the effect of dependent variables, evaluated by the value of q:

$$q = 1 - \frac{{\mathop \sum \nolimits_{h = 1}^{L} N_{h} \sigma_{h}^{2} }}{{N\sigma^{2} }} = 1 - \frac{SSW}{{SST}}$$
(7)

where h = 1, …, L was the classification number of the independent variable X, Nh and N were the classification h and the number of units in the whole region, \(\sigma_{h}^{2}\) and \(\sigma^{2}\) were variance of the dependent variable Y in the classification h and the region. SST and SSW represent the total variances of all categories of the independent variable X and the sum variance in the region. The range of q was [0,1]. The larger q value, the greater impact of the independent variable X on the dependent variable Y.

Factor selection and data processing

Select soil properties (organic carbon (Corg), silt content (X1), total carbon (TC), total potassium (TK), total nitrogen (TN), clay content (X3), total phosphorus (TP), sand content (X2) and total sulfur (TS)), topographic factors (altitude (X4), slope (X5) and aspect (X6)), distance factor (distance from railways (X7), distance from highway (X8), distance from provincial highway (X9), distance from county road (X10), distance from rural road (X11), distance from village road (X12), distance from the river (X13), distance from lakes and reservoirs (X14), distance from urban area (X15), distance from the town (X16) and distance from the village (X17)), soil texture (X18) and land use type (X19) 25 factors. When using GD to analyze the impacting factors, all independent variables must be converted to type variables and the dependent variable must be a numerical variable. In this research, natural breakpoint method was applied to classify the impacting factors. Analysis of correlation and descriptive statistical analysis of the data was carried out by SPSS26.0, sampling map and spatial distribution map were drawn by ArcGIS10.8, mapping was completed by Origin2019, and GD was completed by GeoDetector software (http://www.geodetector.org/).

Results and discussion

Basic properties of topsoil in study area

HMs contents and physicochemical properties in topsoil of study area were displayed in Table 1. Mean concentrations of TC, Corg, TN, TS, TP and TK were 1.18%, 0.90%, 830 mg/kg, 189 mg/kg, 442 mg/kg and 1.94%, and the mean contents of silt grain, sand grain, clay particle were 30.8%, and 42.5%, and 26.8%.

Average concentrations of Cd, As, Cr, Hg, Cu, Ni, Zn and Pb were 0.19, 7.77, 69.5, 0.077, 27.9, 30.9, 76.8 mg/kg and 26.5 mg/kg. Average contents of Cd and Hg larger than the Sichuan province soil background values37, compared with purple soil background values37, average contents of Hg, Cd, Cr, Cu and Ni were greater than the values of background. Compared with the risk screening standard value for soil (GB15618-2018), average contents of all HMs were less the contamination risk screening value, but the maximum As, Cu, Cr, Cd and Ni contents were 1.07, 2.16, 1.01, 1.69 and1.05 times higher than their corresponding pollution risk screening value. The coefficient variation was proportional to the degree of interference from external factors such as human activities22, the coefficient variation of Hg and As were 116.5% and 94.3% in study area indicates that they may be affected by some external interference factors. In comparison with other purple soil areas except that the content of Zn was higher than that of purple soil in Sichuan Nanchong, the contents of other HMs were lower than those in purple soil in Nanchong, Sichuan38. Contamination of Hg, As, Pb, Cd and Cr were greater than their corresponding purple soil values in Sichaun Xichang39, and the contents of As, Cd, Hg, Cu and Pb were larger than respective purple soil values in Chongqing40.

Soil HMs spatial distribution features

Figure 2 shows HMs spatial distribution in purple soil of research area. As a whole, soil HMs spatial distribution was significant various from the south to north of this research area. It can be seen that the large value regions of Cu, As, Zn, Hg, Pb, Ni and Cr were mainly distributed in northeast and southwest regions. The hotspots of large Cd distributed in north and northeast areas.

Figure 2
figure 2

HMs spatial distribution in the topsoil.

Profile distribution characteristics of soil HMs

The average HMs contents in the purple soil profiles were showed in Fig. 3. The content of Cr was then increased greatly versus depth and minimum value in the surface soil in all profile sampling points. The distribution patterns of Cd and Hg in soil were opposite. For As and Pb, fluctuations changed with increasing soil depths. For Cu, Ni and Zn, at sampling sites of PMC, PMD and PME decreased at first and then increased with the increase of depth, while at sampling sites of PMA and PMB did not change significantly with depth.

Figure 3
figure 3

HMs profile distribution in the research region.

Evaluation of environmental risk

The results of Igeo, EF, PI and SPI were presented in Fig. 4. The average Igeo (Fig. 4a) values followed the descending sequence: Cd > Hg > Cr > Ni > Cu > Pb > Zn > As, mean Igeo values of other 7 HMs except Cd were < 0, indicating that purple soil in research area was unpolluted by other HMs, but soils were unpolluted to moderately polluted by Cd. Among all HMs, Hg and Cd had the greatest mean values of EF (1 < EF < 2) (Fig. 4b), meaning that Hg and Cd were mild enrichment in purple soil, the highest EF of Hg and Cd were 8.75 and 4.78. Mean values of EF for other HMs were below 1, showed the minimal enrichment. Although the largest PI (Fig. 4c) values of Cu, Cd, As, Ni, Cr, Pb, Zn and Hg were 2.16, 1.69, 1.07, 1.05, 1.01, 0.84, 0.61 and 0.40, the PI means of all HMs were below than 0.7. It means that research area soil generally has no significant pollution, but there is a phenomenon of point source pollution. The mean SPI values (0.58 ± 0.26) for all HMs, meaning that the soil has no significant pollution with all HMs.

Figure 4
figure 4

The Igeo (a), EF (b), PI (c) and SPI (c) for HMs in purple soils of study area.

Evaluation of ecological risk

The values of ER and RI were given in Fig. 5. Average ER values followed the descending order: Hg (65.2) > Cd (59.9) > As (8.26) > Cu (5.31) > Ni (5.04) > Pb (4.78) > Cr (2.15) > Zn (0.93), Hg and Cd highest mean ER values (40 < ER < 80), meaning that soil Hg and Cd potential ecological risk were moderate, other HMs potential ecological risk were low with mean ER values < 40. The samples with ER values higher than 40 of Cd, Hg and As respective accounted for 65.8%, 58.9% and 2.74%, indicating that Cd and Hg in research area has significant ecological risks, As has certain ecological risks, the ecological risk of other HMs was low. Mean value of RI was 152 and the range of RI values were 51.0 to 868, the samples with RI values higher than 150 accounted for 31.5%, indicating that demonstrating moderate ecological risk in this research. This was similar to the ecological risk estimation results of HMs in Xichang purple soils by Li39.

Figure 5
figure 5

The ER and RI for HMs in soils study area.

Multivariate statistical methods

Correlation analysis

Correlation analysis results were displayed in Fig. 6. The results displayed that there was obviously correlation among most HMs, but interestingly, there was no significant correlation among As-Zn and Hg-Zn. Among the influencing factors of soil properties (TC, Corg, TN, TP, TK, TS, Sc, X1, X2 and X3), 8 HMs were significantly correlated with 6–9 of them, indicating that soil properties had significant effects on all HMs except Ni. Among the topographic factors (X4, X5 and X6), only As, Cr, Cu, and Ni showed significant positive correlation with X4. Among the distance factors (X7, X8, X9, X10, X11, X12, X13 X14, X15, X16 and X17), only As showed a significant positive correlation with X8, but an obviously negative correlation with X7, X10 and X15, a obviously negative correlation between Ni and Cu with X8, and a obviously negative correlation between Hg with X7. It means that the distance factor has a inevitable effect on HMs in purple soil. Generally, 8 kinds of HMs are greatly affected by natural factors, but they are affected by some external sources, which was consistent with the previous research41,42.

Figure 6
figure 6

Pearson correlation coefficient of purple soil HMs and impact factors.

Principal component analysis (PCA)

The PCA results of PTEs in soils were showed in Table 2. The test value p less than 0.0001 of Bartlett's sphericity and KMO score of 0.753 meaning that the dataset was suitable for PCA. The eigenvalues of two factors (3.19 and 2.89), the cumulative contribution rate was 76.1%, which could basically explain the information contained in the purple soil HMs.

Table 2 PCA results of HMs in purple soil.

The first principal component (PC1) had strong positive loadings with Hg (0.966), As (0.803), Pb (0.790) and Cd (0.521), and explained 39.9% of the sum variance. According to the results of correlation analysis, there was a very obviously correlation (p < 0.01) between As-Hg, As-Pb, Hg-Cd, Hg-Pb and Pb–Cd, and a significant correlation (p < 0.05) between As with Cd, indicating that the four HMs probably came from the same source, which was consistent with the results of PCA. The mean values of Pb, Hg and Cd were greater than their respective Sichuan Province soil background values and purple soil background values. Although the content of As was less than the value of background, its variation coefficient was as high as 94.3%, indicating that As was disturbed by pollution from external sources that cannot be ignored. Combined with the spatial distribution map (Fig. 2), the great content areas of As, Hg, Pb and Cd are mainly concentrated in southwest and northeast of research region. The central part of study area is a densely populated and traffic-intensive area, which is affected by huge domestic sewage discharge, industrial wastes, motor vehicle exhaust emissions, pesticides and chemical fertilizers and other external pollution sources. We determine PC1 as the source of human activity pollution.

The second principal component (PC2) explained 36.2% variance contribution rate, the strong loadings on Zn (0.912), Ni (0.870), Cr (0.808) and Cu (0.647). Although the contents of Cr, Ni and Cu except Zn were slightly higher than corresponding purple soil background values in China, the four HMs were all less than the corresponding Sichuan Province soil background values (Table 1). These four HMs showed a very important positive correlations with each other (r > 0.58, p < 0.01) except Cu–Zn (r = 0.40, p < 0.01). Also, these four HMs exhibited lower EF and Igeo values. Related studies show that Cr, Zn, Ni and Cu are mainly affected by soil parent materials43. They were mainly controlled by natural sources. PC1 can be judged to be a natural source. It should be noted that Cu also has equal loading on PC1 (0.647), indicating that Cu has the possibility of multiple sources. Previous research on this area pointed out that soil parent material plays an serious role in soil Cu enrichment, and the use of chemical fertilizers affects Cu enrichment to a certain extent44.

Influencing factors affecting the spatial heterogeneity of HMs according to GD

The geographical detection results of 25 factors for 8 kinds of HMs were displayed in Fig. 7. There was certain discrepancy in the explanation power of various factors for the 8 HMs, but overall, X18, X4, TN, X3, X2 and X1 have great explanatory power for each HM content spatial distribution. The first impacting factor of As was X3 (0.450), followed by X2 (0.240), TS (0.372) and X18 (0.360), followed by X14 (0.280) and TN (0.236). The first influencing factor of Cd was Corg (0.443), followed by TC (0.422) and TN (0.417), TP (0.299), X4 (0.189) and TS (0.188) also had significant influence. The first, second and third influencing factors of Cr were X2 (0.433), X18 (0.433) and X3 (0.403). X4 (0.399), TN (0.246) and X1 (0.242) also has great influence. The main influencing factors of Cu were X4 (0.37) and TN (0.311), and X2 (0.286), Corg (0.265), X3 (0.264), TS (0.247), X18 (0.243) and TC (0.242) were also important factors impacting the spatial differentiation of Cu content. The first impacting factor of Hg was TS (0.376), followed by Corg (0.326), and TC (0.293), TN (0.274), and X18 (0.201) also has significant effects. The primary influencing factor of Ni is X4 (0.368), followed by X2 (0.287) and X18 (0.253), followed by X1 (0.184) and X3 (0.156). The first, second, and third influencing factors of Pb are X18 (0.238), X2 (0.200), and X4 (0.168). TN (0.16), X1 (0.158), and Corg (0.158) also has significant effects. The five factors that have the greatest impact on the spatial differentiation of Zn content were X1 (0.383), X18 (0.374), TK (0.334), X2 (0.326), and TP (0.229).

Figure 7
figure 7

The q value of influencing factors affecting spatial heterogeneity of soil HMs in study area.

The ranking of the impact factors on different HMs varies, revealing the heterogeneity of the mechanisms of HM changes. A large area of crops was planted in research area, and feeding, irrigation, and spraying pesticide directly impact soil physical and chemical properties, accompanied by the introduction of HMs45. Changes in soil physical and chemical properties directly affect soil HMs activity46, migration and transformation47. Correlation analysis also showed that TC, Corg, TN, TP, TK, TS, X1, X2, X3 and X18 were significantly correlated with one or more HMs, so soil physical and chemical properties were important indicators affecting the spatial distribution of HMs. Topographic factors reflect that the soil HMs spatial distribution was affected by natural factors, and the influence process is slow48. The altitude (X4) of research area has a significant impact on 8 HMs spatial distribution, especially on Cu and Ni. Many previous studies have also shown similar conclusions49,50. Interestingly, comparing the Pearson correlation analysis, it was found that there was consistency and difference in the results between Pearson correlation analysis with geographic detection analysis. Consistency, such as the significant relevance between altitude and As, Cr, Cu, and Ni, also has important impact on their spatial distribution; Differences, such as correlation analysis, showed no significant correlation between altitude and Cd, Hg, Pb, and Zn. Geographic detection analysis showed that altitude had a significant impact on the Cd, Hg, Pb, and Zn spatial distribution. It was because GD analyzed the relevance between HMs and impacting factors, containing nonlinear relationships and linear relationships, but coefficient of Pearson correlation was not obviously, meaning that there was no obviously linear relationship between HMs and impacting factors, while it does not mean there was no non-linear relationship51. The influence of slope (X5) and aspect (X6) on HMs was not obvious, which may be due to the little variation range of aspect and slope in research area. The land use type (X19) only has significant impact on the As, Cr and Hg spatial distribution, but has no significant impact on the other 5 HMs spatial distribution.

Among the 11 distance factors X7, X8, X9, X10, X11, X12, X13, X14, X15, X16 and X17, except for X14 and X16, all have obvious influence on the spatial distribution of one or more HMs in soil of research region, it was represented in the complex impact of anthropogenic effect about the spatial distribution changes of soil HMs. Anthropogenic activities have shifted natural state of soil HMs distribution characteristics, creating fresh spatial characteristics48. Many studies have proved that there was an obvious enrichment of HMs in the soil around roads52,53,54or railways55,56,57 because the particles containing HMs formed by automobile exhaust emissions and tire wear enter the soil by atmospheric migration and deposition. River system was an significant water source for agricultural irrigation. HMs enrichment in water sources was caused through industrial discharge and transportation, and then farmland soil enrichment was caused through agricultural irrigation58,59,60. The HMs carried by the “three wastes” from various industries distributed in the study area were enriched in soil by atmospheric deposition, rain water infiltration and erosion. Related researches have displayed residential regions were the most frequent areas of anthropogenic activities, and daily life of residents will generate massive domestic waste containing HMs, which will lead to changes in soil around the residential regions, and the impact will also decrease with the increase of the distance from the residential areas61,62. At the same time, the dense traffic network and frequently anthropogenic activities in towns and cities makes massive pollutants enrich into soil by means of atmospheric deposition and other diffusion ways to produce pollution63.

Conclusions

An integrated approach consists of multivariate statistical analyses, PCA model and GD model is an effective method to identify the ecological risk and sources of HMs in typical purple soil.The average Cd and Hg contents were lager than the Sichuan province soil background values. Mean Cr, Cd, Cu, Ni and Hg contents were lager than the purple soil background values. The ranking of Igeo evaluation results was Cd > Hg > Cr > Ni > Cu > Pb > Zn > As, the enrichment sequence of HMs in purple soil was Cd > Hg > Cr > Cu > Ni > Pb > Zn > As, the contamination sequence was Cd > Cu > Cr > Ni > Zn > Pb > As > Hg and the comprehensive pollution factor results show that there was no significant pollution of HMs in the soil of the research region. The sequence of potential ecological risks was Hg > Cd > As > Cu > Ni > Pb > Cr > Zn. Cd and Hg have the highest potential ecological risks and were at a higher risk level. PCA analysis show that Hg, As, Pb and Cd come from human activities, but Ni, Zn and Cr mainly come from natural sources, while Cu was affected by both natural sources and human activities. Geographical exploration analysis showed that, among the 25 influencing factors, X18, X4, TN, X3, X2 and X1 had the strongest explanatory power to explain the spatial differentiation of 8 HMs. This study provides useful information for determining the distribution characteristics, possible sources, and environmental risks of HM pollution in typical purple soil, which will help to develop targeted policies and measures to reduce HM pollution in purple soil environments.