Using Supervised Principal Components Analysis to Assess Multiple Pollutant Effects
Open Access
- 1 December 2006
- journal article
- Published by Environmental Health Perspectives in Environmental Health Perspectives
- Vol. 114 (12), 1877-1882
- https://doi.org/10.1289/ehp.9226
Abstract
Many investigations of the adverse health effects of multiple air pollutants analyze the time series involved by simultaneously entering the multiple pollutants into a Poisson log-linear model. This method can yield unstable parameter estimates when the pollutants involved suffer high intercorrelation; therefore, traditional approaches to dealing with multicollinearity, such as principal component analysis (PCA), have been promoted in this context. A characteristic of PCA is that its construction does not consider the relationship between the covariates and the adverse health outcomes. A refined version of PCA, supervised principal components analysis (SPCA), is proposed that specifically addresses this issue. Models controlling for long-term trends and weather effects were used in conjunction with each SPCA and PCA to estimate the association between multiple air pollutants and mortality for U.S. cities. The methods were compared further via a simulation study. Simulation studies demonstrated that SPCA, unlike PCA, was successful in identifying the correct subset of multiple pollutants associated with mortality. Because of this property, SPCA and PCA returned different estimates for the relationship between air pollution and mortality. Although a number of methods for assessing the effects of multiple pollutants have been proposed, such methods can falter in the presence of high correlation among pollutants. Both PCA and SPCA address this issue. By allowing the exclusion of pollutants that are not associated with the adverse health outcomes from the mixture of pollutants selected, SPCA offers a critical improvement over PCA.Keywords
This publication has 30 references indexed in Scilit:
- Prediction by Supervised Principal ComponentsJournal of the American Statistical Association, 2006
- A critical assessment of shrinkage-based regression approaches for estimating the adverse health effects of multiple air pollutantsAtmospheric Environment, 2005
- Seasonal Analyses of Air Pollution and Mortality in 100 US CitiesAmerican Journal of Epidemiology, 2005
- Association of Ambient Air Pollution with Respiratory Hospitalization in a Government-Designated “Area of Concern”: The Case of Windsor, OntarioEnvironmental Health Perspectives, 2005
- Associations between ambient air pollution and daily mortality among persons with congestive heart failureEnvironmental Research, 2003
- Statistical issues in the study of air pollution involving airborne particulate matterEnvironmetrics, 2000
- Air pollution and daily mortality in three U.S. counties.Environmental Health Perspectives, 2000
- ASSOCIATION BETWEEN PARTICULATE- AND GAS-PHASE COMPONENTS OF URBAN AIR POLLUTION AND DAILY MORTALITY IN EIGHT CANADIAN CITIESInhalation Toxicology, 2000
- PM(10) exposure, gaseous pollutants, and daily mortality in Inchon, South Korea.Environmental Health Perspectives, 1999
- Latent Root Regression AnalysisTechnometrics, 1974