Investigating the capabilities of evolutionary data-driven techniques using the challenging estimation of soil moisture content

Abstract
Soil moisture has a crucial role in both the global energy and hydrological cycles; it affects different ecosystem processes. Spatial and temporal variability of soil moisture add to its complex behaviour, which undermines the reliability of most current measurement methods. In this paper, two promising evolutionary data-driven techniques, namely (i) Evolutionary Polynomial Regression and (ii) Genetic Programming, are challenged with modelling the soil moisture response to the near surface atmospheric conditions. The utility of the proposed models is demonstrated through the prediction of the soil moisture response of three experimental soil covers, used for the restoration of watersheds that were disturbed by the mining industry. The results showed that the storage effect of the soil moisture response is the major challenging factor; it can be quantified using cumulative inputs better than time-lag inputs, which can be attributed to the effect of the soil layer moisture-holding capacity. This effect increases with the increase in the soil layer thickness. Three different modelling tools are tested to investigate the tool effect in data-driven modelling. Despite the promising results with regard to the prediction accuracy, the study demonstrates the need for adopting multiple data-driven modelling techniques and tools (modelling environments) to obtain reliable predictions.