Time Series Data Cleaning: A Survey
Open Access
- 25 December 2019
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Access
- Vol. 8, 1866-1881
- https://doi.org/10.1109/access.2019.2962152
Abstract
Errors are prevalent in time series data, which is particularly common in the industrial field. Data with errors could not be stored in the database, which results in the loss of data assets. At present, to deal with these time series containing errors, besides keeping original erroneous data, discarding erroneous data and manually checking erroneous data, we can also use the cleaning algorithm widely used in the database to automatically clean the time series data. This survey provides a classification of time series data cleaning techniques and comprehensively reviews the state-of-the-art methods of each type. Besides we summarize data cleaning tools, systems and evaluation criteria from research and industry. Finally, we highlight possible directions time series data cleaning.Funding Information
- National Key Research and Development Plan (2017YFC0804307, 2019YFB1705301)
- National Natural Science Foundation of China (61572272, 71690231)
This publication has 105 references indexed in Scilit:
- Early classification on time seriesKnowledge and Information Systems, 2011
- The complexity and approximation of fixing numerical attributes in databases under integrity constraintsInformation Systems, 2008
- Automatic outlier detection for time series: an application to sensor dataKnowledge and Information Systems, 2006
- Extended Kalman filtering for battery management systems of LiPB-based HEV battery packs: Part 3. State and parameter estimationJournal of Power Sources, 2004
- Extended Kalman filtering for battery management systems of LiPB-based HEV battery packs: Part 2. Modeling and identificationJournal of Power Sources, 2004
- Time series forecasting using a hybrid ARIMA and neural network modelNeurocomputing, 2003
- Temporal data managementIEEE Transactions on Knowledge and Data Engineering, 1999
- Estimation of Time Series Parameters in the Presence of OutliersTechnometrics, 1988
- Order dependency in the relational modelTheoretical Computer Science, 1983
- A New Approach to Linear Filtering and Prediction ProblemsJournal of Basic Engineering, 1960