Imputation of Ammonium Nitrogen Concentration in Groundwater Based on a Machine Learning Method

Open Access

16 May 2022

journal article
research article
Published by MDPI AG in Water

Vol. 14 (10), 1595
https://doi.org/10.3390/w14101595

Abstract

Ammonium is one of the main inorganic pollutants in groundwater, mainly due to agricultural, industrial and domestic pollution. Excessive ammonium can cause human health risks and environmental consequences. Its temporal and spatial distribution is affected by factors such as meteorology, hydrology, hydrogeology and land use type. Thus, a groundwater ammonium analysis based on limited sampling points produces large uncertainties. In this study, organic matter content, groundwater depth, clay thickness, total nitrogen content (TN), cation exchange capacity (CEC), pH and land-use type were selected as potential contributing factors to establish a machine learning model for fitting the ammonium concentration. The Shapley Additive exPlanations (SHAP) method, which explains the machine learning model, was applied to identify the more significant influencing factors. Finally, the machine learning model established according to the more significant influencing factors was used to impute point data in the study area. From the results, the soil organic matter feature was found to have a substantial impact on the concentration of ammonium in the model, followed by soil pH, clay thickness and groundwater depth. The ammonium concentration generally decreased from northwest to southeast. The highest values were concentrated in the northwest and northeast. The lowest values were concentrated in the southeast, southwest and parts of the east and north. The spatial interpolation based on the machine learning imputation model established according to the influencing factors provides a reliable groundwater quality assessment and was not limited by the number and the geographical location of samplings.

Funding Information

National Natural Science Foundation of China (41672231)

This publication has 48 references indexed in Scilit:

Geospatial Based Assessment of Spatial Variation of Groundwater Nitrate Nitrogen in Shandong Intensive Farming Regions of China
Sensor Letters, 2012
MODIS NDVI time-series allow the monitoring of Eucalyptus plantation biomass
Remote Sensing of Environment, 2011
Classification and regression trees
WIREs Data Mining and Knowledge Discovery, 2011
Geographical Information Systems Principles of Ordinary Kriging Interpolator
Journal of Applied Sciences, 2010
Using Kaplan–Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recurrence-free survival of breast cancer patients
Expert Systems with Applications, 2009
Vegetable cultivation under greenhouse conditions leads to rapid accumulation of nutrients, acidification and salinity of soils and groundwater contamination in South-Eastern China
Nutrient Cycling in Agroecosystems, 2008
Axiomatic characterizations of probabilistic and cardinal-probabilistic interaction indices
Games and Economic Behavior, 2006
Nitrogen transformation and transport modeling in groundwater aquifers
Ecological Modelling, 2006
Discriminating Sources and Flowpaths of Anthropogenic Nitrogen Discharges to Florida Springs, Streams and Lakes
Environmental & Engineering Geoscience, 2005
Assessment and management of long-term nitrate pollution of ground water in agriculture-dominated watersheds
Journal of Hydrology, 2004

Cited by 2 articles