Confusion Over Measures of Evidence (p's) Versus Errors (α's) in Classical Statistical Testing
Top Cited Papers
- 1 August 2003
- journal article
- research article
- Published by Taylor & Francis Ltd in The American Statistician
- Vol. 57 (3), 171-178
- https://doi.org/10.1198/0003130031856
Abstract
Confusion surrounding the reporting and interpretation of results of classical statistical tests is widespread among applied researchers, most of whom erroneously believe that such tests are prescribed by a single coherent theory of statistical inference. This is not the case: Classical statistical testing is an anonymous hybrid of the competing and frequently contradictory approaches formulated by R. A. Fisher on the one hand, and Jerzy Neyman and Egon Pearson on the other. In particular, there is a widespread failure to appreciate the incompatibility of Fisher's evidential p value with the Type I error rate, α, of Neyman-Pearson statistical orthodoxy. The distinction between evidence (p's) and error (α's) is not trivial. Instead, it reflects the fundamental differences between Fisher's ideas on significance testing and inductive inference, and Neyman-Pearson's views on hypothesis testing and inductive behavior. The emphasis of the article is to expose this incompatibility, but we also briefly note a possible reconciliation.Keywords
This publication has 28 references indexed in Scilit:
- The Fisher, Neyman-Pearson Theories of Testing Hypotheses: One Theory or Two?Journal of the American Statistical Association, 1993
- R. A. Fisher: The Founder of Modern StatisticsStatistical Science, 1992
- A comment on replication, P‐values and evidenceStatistics in Medicine, 1992
- Tests of Significance Following R. A. Fisher1The British Journal for the Philosophy of Science, 1987
- Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence: CommentJournal of the American Statistical Association, 1987
- Testing a Point Null Hypothesis: The Irreconcilability of P Values and EvidenceJournal of the American Statistical Association, 1987
- Tests of Significance in Theory and PracticeJournal of the Royal Statistical Society: Series D (The Statistician), 1986
- Frequentist probability and frequentist statisticsSynthese, 1977
- R. A. Fisher (1890—1962): An AppreciationScience, 1967
- The problem of inductive inferenceCommunications on Pure and Applied Mathematics, 1955