Welcome to Geological Journal of China Universities ! Today is
Share:

Geological Journal of China Universities ›› 2024, Vol. 30 ›› Issue (04): 485-495.DOI: 10.16108/j.issn1006-7493.2023033

Previous Articles     Next Articles

Prediction of Heavy Metal Content through Fusion of Spatial and Environmental Factors in Machine Learning Models:#br# A Case Study of Naozhou Island 

JIA Lili,HU Feiyue,LI Tingting,ZHU Xin,YI Longke   

  1. Geological Investigation Institute of Guangdong Province, Guangzhou 510080, China
  • Online:2024-08-20 Published:2024-08-20

Abstract: Conventional assessment methods of soil heavy metal contamination predominantly depend on spatial interpolation analyses, conducted with data derived from a restricted number of sampling points, thereby often overlooking the influence exerted by environmental covariates such as geological backdrop, human activities, and geographical features on the heavy metal distribution. Predictions based solely on environmental covariates tend to fall short in adequately reflecting the spatial aggregation effects associated with heavy metal dispersion. As such, this paper introduces a novel approach that amalgamates spatial and environmental factors as covariates, employing three models: Random Forest (RF), Extreme Gradient Boosting (XGboost), and Deep Learning Neural Network (DNN), designed to predict the spatial distributions of heavy metals in Naozhou Island. The results underscore the substantial improvement in predictive performance achieved through the integration of spatial elements into the model. To test and enhance the robustness of the improved model, data splitting and external data validation techniques are utilized in this study, and the most optimal model for prediction is selected. Following the prediction results, correlation and clustering analyses were conducted on the heavy metal elements at the data feature level, and a LISA spatial clustering analysis at the spatial distribution level. The analyses reveal that geological factors predominantly dictate the dispersion of Cu, Ni, Cr, and Zn elements within the island, while human activities primarily govern the distribution of Pb, As, Hg. Moreover, the distribution of Cd element is ascertained to be influenced by a combination of human activities and geological background. 

Key words: Heavy metals, spatial heterogeneity, machine learning, accurate prediction