Hydrological concept formation inside long short-term memory (LSTM) networks

Open Access

20 June 2022

journal article
research article
Published by Copernicus GmbH in Hydrology and Earth System Sciences

Vol. 26 (12), 3079-3101
https://doi.org/10.5194/hess-26-3079-2022

Abstract

Neural networks have been shown to be extremely effective rainfall-runoff models, where the river discharge is predicted from meteorological inputs. However, the question remains: what have these models learned? Is it possible to extract information about the learned relationships that map inputs to outputs, and do these mappings represent known hydrological concepts? Small-scale experiments have demonstrated that the internal states of long short-term memory networks (LSTMs), a particular neural network architecture predisposed to hydrological modelling, can be interpreted. By extracting the tensors which represent the learned translation from inputs (precipitation, temperature, and potential evapotranspiration) to outputs (discharge), this research seeks to understand what information the LSTM captures about the hydrological system. We assess the hypothesis that the LSTM replicates real-world processes and that we can extract information about these processes from the internal states of the LSTM. We examine the cell-state vector, which represents the memory of the LSTM, and explore the ways in which the LSTM learns to reproduce stores of water, such as soil moisture and snow cover. We use a simple regression approach to map the LSTM state vector to our target stores (soil moisture and snow). Good correlations (R²>0.8) between the probe outputs and the target variables of interest provide evidence that the LSTM contains information that reflects known hydrological processes comparable with the concept of variable-capacity soil moisture stores. The implications of this study are threefold: (1) LSTMs reproduce known hydrological processes. (2) While conceptual models have theoretical assumptions embedded in the model a priori, the LSTM derives these from the data. These learned representations are interpretable by scientists. (3) LSTMs can be used to gain an estimate of intermediate stores of water such as soil moisture. 
While machine learning interpretability is still a nascent field and our approach reflects a simple technique for exploring what the model has learned, the results are robust to different initial conditions and to a variety of benchmarking experiments. We therefore argue that deep learning approaches can be used to advance our scientific goals as well as our predictive goals.
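
The "simple regression approach" mentioned in the abstract can be illustrated with a linear probe. The following is only a minimal sketch, not the authors' implementation: the cell states and the soil moisture target are synthetic stand-ins, and the names `fit_linear_probe`, `r_squared`, `cell_states` and `soil_moisture` are hypothetical. It assumes the cell-state vectors have already been extracted from a trained LSTM as an array of shape (time steps, hidden units).

```python
import numpy as np


def fit_linear_probe(states, target):
    """Least-squares fit of target ~ states @ weights + bias."""
    # Append a column of ones so the bias is estimated with the weights.
    design = np.column_stack([states, np.ones(len(states))])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    return coef[:-1], coef[-1]


def r_squared(y_true, y_pred):
    """Coefficient of determination between observed and probed values."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-ins (assumption): in practice cell_states would come from
# a trained LSTM and soil_moisture from an independent data product.
rng = np.random.default_rng(0)
n_steps, n_hidden = 1000, 8
cell_states = rng.normal(size=(n_steps, n_hidden))
true_weights = np.linspace(-1.0, 1.0, n_hidden)
soil_moisture = cell_states @ true_weights + 0.1 * rng.normal(size=n_steps)

# Fit the probe on the first part of the series, evaluate on the rest.
split = 800
weights, bias = fit_linear_probe(cell_states[:split], soil_moisture[:split])
probe_output = cell_states[split:] @ weights + bias
r2 = r_squared(soil_moisture[split:], probe_output)
print(f"held-out R^2: {r2:.3f}")
```

Evaluating the probe on time steps it was not fitted to is a design choice of this sketch; a high held-out R² suggests that the target is linearly recoverable from the state vector rather than memorised by the probe.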

Funding Information

Natural Environment Research Council (NE/L002612/1)

