Robust Correlation Analyses: False Positive and Power Validation Using a New Open Source Matlab Toolbox
Open Access
- 1 January 2013
- journal article
- research article
- Published by Frontiers Media SA in Frontiers in Psychology
- Vol. 3, 606
- https://doi.org/10.3389/fpsyg.2012.00606
Abstract
Pearson’s correlation measures the strength of the association between two variables. The technique is, however, restricted to linear associations and is overly sensitive to outliers. Indeed, a single outlier can result in a highly inaccurate summary of the data. Yet, it remains the most commonly used measure of association in psychology research. Here we describe a free Matlab®-based toolbox (http://sourceforge.net/projects/robustcorrtool/) that computes robust measures of association between two or more random variables: the percentage-bend correlation and skipped-correlations. After illustrating how to use the toolbox, we show that robust methods, where outliers are down-weighted or removed and accounted for in significance testing, provide better estimates of the true association with accurate false positive control and without loss of power. The different correlation methods were tested with normal data and normal data contaminated with marginal or bivariate outliers. We report estimates of effect size, false positive rate and power, and advise on which technique to use depending on the data at hand.
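The abstract's claim that a single outlier can distort Pearson's correlation can be illustrated with a small numerical sketch. The code below is written in Python with NumPy rather than Matlab and is not taken from the toolbox. The function names (`pearson`, `percentage_bend`), the synthetic data, and the choice `beta=0.2` are assumptions made for illustration. The percentage-bend estimate follows the commonly published description of the method (median-centred deviations scaled by a quantile of the absolute deviations, then bounded to [-1, 1]) and omits significance testing.

```python
import numpy as np

def pearson(x, y):
    """Plain Pearson correlation coefficient."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    xc, yc = x - x.mean(), y - y.mean()
    return float(np.sum(xc * yc) / np.sqrt(np.sum(xc**2) * np.sum(yc**2)))

def _bend(v, beta):
    """Centre and scale v, then bound extreme values to [-1, 1]."""
    med = np.median(v)
    dev = np.sort(np.abs(v - med))
    m = int(np.floor((1 - beta) * len(v)))   # rank of the scaling quantile
    omega = dev[m - 1]                       # robust scale (assumed non-zero)
    psi = (v - med) / omega
    low, high = psi < -1, psi > 1
    i1, i2 = low.sum(), high.sum()
    # One-step location estimate that ignores the flagged extremes
    phi = (v[~(low | high)].sum() + omega * (i2 - i1)) / (len(v) - i1 - i2)
    return np.clip((v - phi) / omega, -1, 1)

def percentage_bend(x, y, beta=0.2):
    """Percentage-bend correlation (illustrative sketch, no p-value)."""
    a = _bend(np.asarray(x, float), beta)
    b = _bend(np.asarray(y, float), beta)
    return float(np.sum(a * b) / np.sqrt(np.sum(a**2) * np.sum(b**2)))

# Synthetic, strongly linear data (deterministic, for reproducibility)
x = np.arange(30.0)
y = x + 2 * np.sin(x)

# The same data with one bivariate outlier appended
x_out = np.append(x, 100.0)
y_out = np.append(y, -100.0)

r_clean = pearson(x, y)
r_outlier = pearson(x_out, y_out)
r_pb = percentage_bend(x_out, y_out)

print(f"Pearson, clean data:          {r_clean:.3f}")
print(f"Pearson, one outlier added:   {r_outlier:.3f}")
print(f"Percentage bend, one outlier: {r_pb:.3f}")
```

In this constructed example the single added point is enough to reverse the sign of Pearson's coefficient, whereas the percentage-bend estimate stays strongly positive because the outlier's contribution is bounded. The data were chosen to make the effect obvious and say nothing about false positive rate or power, which the paper evaluates by simulation.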
This publication has 5 references indexed in Scilit:
- Better Ways to Improve Standards in Brain-Behavior Correlation Analysis. Frontiers in Human Neuroscience, 2012
- Improving standards in brain-behavior correlation analyses. Frontiers in Human Neuroscience, 2012
- Modern robust statistical methods: An easy way to maximize the accuracy and power of your research. American Psychologist, 2008
- Inferences about correlations when there is heteroscedasticity. British Journal of Mathematical and Statistical Psychology, 2001
- Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics, 2000