Forecasting Daily of Surface Ozone Concentration in the Grand Casablanca Region Using Parametric and Nonparametric Statistical Models
Open Access
- 23 May 2021
- journal article
- research article
- Published by MDPI AG in Atmosphere
- Vol. 12 (6), 666
- https://doi.org/10.3390/atmos12060666
Abstract
Forecasting concentration levels is important for planning atmospheric protection strategies. In this paper, we focus on the daily average surface ozone (O) concentration with a short-time resolution (one day ahead) in the Grand Casablanca Region of Morocco. The database includes previous day O concentrations measured at Jahid station and various meteorological explanatory variables for 3 years (2013 to 2015). Taking into account the multicollinearity problem in the data, adapted statistical models based on parametric (SPLS and Lasso) and nonparametric (CART, Bagging, and RF) models were built and compared using the coefficient of determination and the root mean square error. We conclude that the parametric models predict better than nonparametric ones. Finally, from the explanatory variables stored by the SPLS and Lasso parametric models, we deduce that a very simple linear regression with five variables remains the most appropriate for the available data at Jahid station ( = 0.86 and = 9.60). This resulting model, with few explanatory variables to prevent missing data, has good predictive quality and is easily implementable. It is the first to be built to predict ozone pollution in the Grand Casablanca region of Morocco.
This publication has 47 references indexed in Scilit:
- Real-time air quality forecasting, part I: History, techniques, and current statusAtmospheric Environment, 2012
- Variable selection using random forestsPattern Recognition Letters, 2010
- An Estimate of the Global Burden of Anthropogenic Ozone and Fine Particulate Matter on Premature Human Mortality Using Atmospheric ModelingEnvironmental Health Perspectives, 2010
- Impact of meteorological factors on ozone concentrations modelled by time series analysis and multivariate statistical methodsEcological Informatics, 2009
- Multiple linear regression and artificial neural networks based on principal components to predict ozone concentrationsEnvironmental Modelling & Software, 2007
- Contribution of anthropogenic pollutants to the increase of tropospheric ozone levels in the Oporto Metropolitan Area, Portugal since the 19th centuryEnvironmental Pollution, 2006
- A model for the effect on health of repeated exposure to ozoneEnvironmental Modelling & Software, 2002
- A review of statistical methods for the meteorological adjustment of tropospheric ozoneAtmospheric Environment, 2000
- Forecasting severe ozone episodes in the Baltimore metropolitan areaAtmospheric Environment, 1995
- The Predictive Sample Reuse Method with ApplicationsJournal of the American Statistical Association, 1975