Some power considerations when deciding to use transformations

15 March 1994

journal article
Published by Wiley in Statistics in Medicine

Vol. 13 (5-7), 769-783
https://doi.org/10.1002/sim.4780130537

Abstract

Conventional wisdom suggests that for small data sets having substantial skew, one should attempt to determine the correct distributional form, if possible, and apply statistical methods appropriate for that distribution. Transformations such as the log or square root are often used. If an appropriate distributional form cannot be determined, a distribution‐free procedure such as a rank transformation or a randomization test procedure can be used. To better appreciate the effect of such alternatives on both the type I error and the power of detecting differences between treatment groups, simulation studies were conducted for responses having specific gamma G(r, θ) and log‐normal LN(M, V) distributions. The gamma and log‐normal distributions were selected so that they had the same first two moments. A simple two‐group design was assumed. The reference group always had an average disease level μ = 3.0 (μ = rθ for gamma, μ = M for log‐normal), and the treatment group had means whose reductions ranged from 0 per cent to 50 per cent. The effect of distributional type and the degree of skewness was investigated by varying the population parameter values. Six statistical test procedures were compared for the gamma distributions. All test procedures were robust relative to the type I error. The UMP test based on a ratio of sample means produced the greatest power for all combinations of n, r and R_T. The power losses associated with the randomization test, the t‐test on the original scale, and the t‐test on the square root scale were very small (3 per cent to 6 per cent in absolute value) for n = 10 and 15, and less than 2 per cent for group sizes of 25 or more. The power loss associated with the t‐test on the log scale was much larger, with power 5 per cent to 10 per cent lower than that of the t‐test on the original scale. The Wilcoxon rank test produced results similar to those of the LOG t‐test for small samples. The power for the shifted LOG (X + c) test increased monotonically to the asymptotic value of the ORIG t‐test. The same five test procedures based on differences in sample means were then compared for the corresponding log‐normal distributions. The UMP test, that is, LOG(X), produced the highest power. There was very little power lost for the SQRT t‐test. The loss in power varied between 2 per cent and 5 per cent for the RANK test. The RANK test performed considerably better than the t‐test on the original scale. In contrast to the results for the gamma distributions, the power for the shifted LOG (X + c) test had its maximum for c = 0, and decreased monotonically to the asymptotic value of the ORIG t‐test. The results suggest that statistical inferences can be highly dependent on the distributional form and the scale of measurement of the response used in the statistical analysis.
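The kind of simulation the abstract describes can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' code. It assumes the gamma distribution is parameterized by shape r and scale θ, so that the mean is rθ. It assumes the treatment effect acts through the scale with r held fixed. It assumes a two‐sided pooled‐variance t‐test at the 5 per cent level. The function names, the log‐scale parameters (m, s) and the default simulation size are hypothetical choices made for the example.

```python
import math
import random
import statistics


def lognormal_params_matching_gamma(r, theta):
    """Return (m, s), the mean and SD of ln X, for a log-normal that has the
    same mean and variance as a gamma with shape r and scale theta.
    The (m, s) parameterization is an assumption made for this sketch."""
    mean = r * theta
    var = r * theta ** 2
    s2 = math.log(1.0 + var / mean ** 2)  # simplifies to log(1 + 1/r)
    m = math.log(mean) - s2 / 2.0
    return m, math.sqrt(s2)


def pooled_t(x, y):
    """Two-sample t statistic with pooled variance."""
    nx, ny = len(x), len(y)
    sp2 = ((nx - 1) * statistics.variance(x)
           + (ny - 1) * statistics.variance(y)) / (nx + ny - 2)
    return (statistics.fmean(x) - statistics.fmean(y)) / math.sqrt(
        sp2 * (1.0 / nx + 1.0 / ny))


# Two-sided 5% critical values of Student's t with 2n - 2 degrees of freedom,
# keyed by the per-group sample size n.
T_CRIT = {10: 2.101, 15: 2.048, 25: 2.011}


def estimate_power(n, r, reduction, transform=lambda v: v,
                   mu=3.0, n_sim=2000, seed=0):
    """Monte Carlo rejection rate of the t-test applied after `transform`.
    Assumes gamma responses with common shape r; the treatment mean is
    mu * (1 - reduction). With reduction = 0 this estimates type I error."""
    rng = random.Random(seed)
    theta_ref = mu / r
    theta_trt = mu * (1.0 - reduction) / r
    rejections = 0
    for _ in range(n_sim):
        ref = [transform(rng.gammavariate(r, theta_ref)) for _ in range(n)]
        trt = [transform(rng.gammavariate(r, theta_trt)) for _ in range(n)]
        if abs(pooled_t(ref, trt)) > T_CRIT[n]:
            rejections += 1
    return rejections / n_sim


if __name__ == "__main__":
    for name, f in [("ORIG", lambda v: v), ("SQRT", math.sqrt), ("LOG", math.log)]:
        print(name, estimate_power(n=10, r=2.0, reduction=0.5, transform=f))
```

Passing `math.sqrt` or `math.log` as `transform` gives rough analogues of the SQRT and LOG t‐tests. The UMP, randomization and rank procedures are not implemented here.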
