Improved Interpolation and Anomaly Detection for Personal PM2.5 Measurement
Open Access
- 11 January 2020
- journal article
- research article
- Published by MDPI AG in Applied Sciences
- Vol. 10 (2), 543
- https://doi.org/10.3390/app10020543
Abstract
With the development of technology, especially technologies related to artificial intelligence (AI), the fine-dust data acquired by various personal monitoring devices is of great value as training data for predicting future fine-dust concentrations and innovatively alerting people of potential danger. However, most of the fine-dust data obtained from those devices include either missing or abnormal data caused by various factors such as sensor malfunction, transmission errors, or storage errors. This paper presents methods to interpolate the missing data and detect anomalies in PM2.5 time-series data. We validated the performance of our method by comparing ours to well-known existing methods using our personal PM2.5 monitoring data. Our results showed that the proposed interpolation method achieves more than 25% improved results in root mean square error (RMSE) than do most existing methods, and the proposed anomaly detection method achieves fairly accurate results even for the case of the highly capricious fine-dust data. These proposed methods are expected to contribute greatly to improving the reliability of data.This publication has 7 references indexed in Scilit:
- Predictive and exposome analytics: A case study of asthma exacerbation managementJournal of Ambient Intelligence and Smart Environments, 2019
- Anomaly detection in the presence of missing values for weather data quality controlPublished by Association for Computing Machinery (ACM) ,2019
- The Impact of Air Pollution, Including Asian Sand Dust, on Respiratory Symptoms and Health-related Quality of Life in Outpatients With Chronic Respiratory Disease in Korea: A Panel StudyJournal of Preventive Medicine & Public Health, 2018
- imputeTS: Time Series Missing Value Imputation in RThe R Journal, 2017
- MissForest—non-parametric missing value imputation for mixed-type dataBioinformatics, 2011
- Finding the most unusual time series subsequence: algorithms and applicationsKnowledge and Information Systems, 2006
- Methods for imputation of missing values in air quality data setsAtmospheric Environment, 2004