A Heckman Selection-tModel

31 January 2012

journal article
research article
Published by Taylor & Francis Ltd in Journal of the American Statistical Association

Vol. 107 (497), 304-317
https://doi.org/10.1080/01621459.2012.656011

Abstract

Sample selection arises often in practice as a result of the partial observability of the outcome of interest in a study. In the presence of sample selection, the observed data do not represent a random sample from the population, even after controlling for explanatory variables. That is, data are missing not at random. Thus, standard analysis using only complete cases will lead to biased results. Heckman introduced a sample selection model to analyze such data and proposed a full maximum likelihood estimation method under the assumption of normality. The method was criticized in the literature because of its sensitivity to the normality assumption. In practice, data, such as income or expenditure data, often violate the normality assumption because of heavier tails. We first establish a new link between sample selection models and recently studied families of extended skew-elliptical distributions. Then, this allows us to introduce a selection-t (SLt) model, which models the error distribution using a Student's t distribution. We study its properties and investigate the finite-sample performance of the maximum likelihood estimators for this model. We compare the performance of the SLt model to the conventional Heckman selection-normal (SLN) model and apply it to analyze ambulatory expenditures. Unlike the SLN model, our analysis using the SLt model provides statistical evidence for the existence of sample selection bias in these data. We also investigate the performance of the test for sample selection bias based on the SLt model and compare it with the performances of several tests used with the SLN model. Our findings indicate that the latter tests can be misleading in the presence of heavy-tailed data.

Keywords

This publication has 27 references indexed in Scilit:

A unified view on skewed distributions arising from selections
The Canadian Journal of Statistics / La Revue Canadienne de Statistique, 2006
On the Unification of Families of Skew‐normal Distributions
Scandinavian Journal of Statistics, 2006
The Skew-normal Distribution and Related Multivariate Families*
Scandinavian Journal of Statistics, 2005
Simple Estimators for Treatment Parameters in a Latent-Variable Framework
The Review of Economics and Statistics, 2003
Distributions Generated by Perturbation of Symmetry with Emphasis on a Multivariate Skewt-Distribution
Journal of the Royal Statistical Society Series B: Statistical Methodology, 2003
The Heckman Correction for Sample Selection and Its Critique
Journal of Economic Surveys, 2000
The multivariate skew-normal distribution
Biometrika, 1996
Robust Statistical Modeling Using thetDistribution
Journal of the American Statistical Association, 1989
Generalized Econometric Models with Selectivity
Econometrica, 1983
Inference and missing data
Biometrika, 1976

Cited by 67 articles