How to gauge reliability of a binary classification result: a simple case
