Multi-syndrome analysis of time series using PCA: A new concept for outbreak investigation

Abstract
Although time series data on multiple syndromes are widely available, multivariate analysis of syndromic data remains under‐explored. We present a non‐parametric multivariate framework for early detection of temporal anomalies, based on principal components analysis (PCA) of historical data on multiple syndromes. We introduce simulated outbreaks of different shapes and magnitudes into the historical data and compare the detection sensitivity and timeliness of the multi‐syndrome method with those of uni‐syndrome analysis. We find that the multi‐syndrome framework is a powerful tool for identifying the introduced abnormalities and significantly improves upon the detection sensitivity and timeliness of uni‐syndrome analysis. The proposed framework requires minimal preprocessing of the data and can be easily adopted in settings where temporal information on multiple syndromes is routinely collected and processed; it can therefore serve as an integral component of real‐time surveillance systems. Copyright © 2007 John Wiley & Sons, Ltd.
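
To make the general idea concrete, the following is a minimal sketch of one common way to monitor several count series with PCA. It is not the authors' implementation: the abstract does not specify the detection statistic, so the sketch assumes the standard pair of Hotelling's T² (on the retained components) and the squared prediction error (on the residual), with alarm thresholds taken as empirical quantiles of the historical data to keep the procedure non‐parametric. The function names (`fit_baseline`, `score`, `is_alarm`), the number of retained components, the 0.99 quantile, and the simulated Poisson counts are all illustrative assumptions.

```python
import numpy as np


def fit_baseline(history, n_components, quantile=0.99):
    """Fit PCA on a (days x syndromes) count matrix and set empirical alarm limits."""
    mean = history.mean(axis=0)
    std = history.std(axis=0, ddof=1)
    std[std == 0] = 1.0  # guard against constant series
    z = (history - mean) / std
    # Rows of vt are the principal axes; s**2 / (n - 1) are the component variances.
    _, s, vt = np.linalg.svd(z, full_matrices=False)
    model = {
        "mean": mean,
        "std": std,
        "axes": vt[:n_components],
        "var": s[:n_components] ** 2 / (len(history) - 1),
    }
    t2, spe = score(model, history)
    # Assumed non-parametric limits: empirical quantiles of the baseline statistics.
    model["t2_limit"] = np.quantile(t2, quantile)
    model["spe_limit"] = np.quantile(spe, quantile)
    return model


def score(model, counts):
    """Return (T2, SPE) for each row of counts."""
    z = (np.atleast_2d(counts) - model["mean"]) / model["std"]
    pcs = z @ model["axes"].T                    # scores on retained components
    t2 = np.sum(pcs**2 / model["var"], axis=1)   # Hotelling's T2
    residual = z - pcs @ model["axes"]           # part not explained by the components
    spe = np.sum(residual**2, axis=1)            # squared prediction error
    return t2, spe


def is_alarm(model, counts):
    """Flag rows whose T2 or SPE exceeds its baseline limit."""
    t2, spe = score(model, counts)
    return (t2 > model["t2_limit"]) | (spe > model["spe_limit"])


# Simulated history (an assumption): four syndromes sharing a seasonal pattern.
rng = np.random.default_rng(0)
days = np.arange(730)
season = 1.0 + 0.3 * np.sin(2 * np.pi * days / 365)
base = np.array([20.0, 15.0, 10.0, 5.0])
history = rng.poisson(season[:, None] * base)

model = fit_baseline(history, n_components=2)
typical_day = base            # counts at the long-run average
outbreak_day = base + 30      # hypothetical outbreak adding 30 cases per syndrome
alarms = is_alarm(model, np.vstack([typical_day, outbreak_day]))
print(alarms)
```

Monitoring both statistics means that an excess is flagged whether it lies along the dominant shared pattern of the syndromes or in a direction the retained components do not capture. A uni‐syndrome comparison would apply a threshold to each column of `history` separately.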