Imputation of Ammonium Nitrogen Concentration in Groundwater Based on a Machine Learning Method
Open Access
- 16 May 2022
- Vol. 14 (10), 1595
- https://doi.org/10.3390/w14101595
Abstract
Ammonium is one of the main inorganic pollutants in groundwater, mainly due to agricultural, industrial and domestic pollution. Excessive ammonium can cause human health risks and environmental consequences. Its temporal and spatial distribution is affected by factors such as meteorology, hydrology, hydrogeology and land use type. Thus, a groundwater ammonium analysis based on limited sampling points produces large uncertainties. In this study, organic matter content, groundwater depth, clay thickness, total nitrogen content (TN), cation exchange capacity (CEC), pH and land-use type were selected as potential contributing factors to establish a machine learning model for fitting the ammonium concentration. The Shapley Additive exPlanations (SHAP) method, which explains the machine learning model, was applied to identify the more significant influencing factors. Finally, the machine learning model established according to the more significant influencing factors was used to impute point data in the study area. From the results, the soil organic matter feature was found to have a substantial impact on the concentration of ammonium in the model, followed by soil pH, clay thickness and groundwater depth. The ammonium concentration generally decreased from northwest to southeast. The highest values were concentrated in the northwest and northeast. The lowest values were concentrated in the southeast, southwest and parts of the east and north. The spatial interpolation based on the machine learning imputation model established according to the influencing factors provides a reliable groundwater quality assessment and was not limited by the number and the geographical location of samplings.Funding Information
- National Natural Science Foundation of China (41672231)
This publication has 48 references indexed in Scilit:
- Geospatial Based Assessment of Spatial Variation of Groundwater Nitrate Nitrogen in Shandong Intensive Farming Regions of ChinaSensor Letters, 2012
- MODIS NDVI time-series allow the monitoring of Eucalyptus plantation biomassRemote Sensing of Environment, 2011
- Classification and regression treesWIREs Data Mining and Knowledge Discovery, 2011
- Geographical Information Systems Principles of Ordinary Kriging InterpolatorJournal of Applied Sciences, 2010
- Using Kaplan–Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recurrence-free survival of breast cancer patientsExpert Systems with Applications, 2009
- Vegetable cultivation under greenhouse conditions leads to rapid accumulation of nutrients, acidification and salinity of soils and groundwater contamination in South-Eastern ChinaNutrient Cycling in Agroecosystems, 2008
- Axiomatic characterizations of probabilistic and cardinal-probabilistic interaction indicesGames and Economic Behavior, 2006
- Nitrogen transformation and transport modeling in groundwater aquifersEcological Modelling, 2006
- Discriminating Sources and Flowpaths of Anthropogenic Nitrogen Discharges to Florida Springs, Streams and LakesEnvironmental & Engineering Geoscience, 2005
- Assessment and management of long-term nitrate pollution of ground water in agriculture-dominated watershedsJournal of Hydrology, 2004