Missing data in haplotype analysis: a study on the MILC method

Abstract
Given the enormous progress in the knowledge of the human genome, genetic markers are now available throughout the genome. Haplotype analysis, allowing the simultaneous use of information from several markers, has thus become increasingly popular. However, we often face the problem of missing data and of haplotype identification. We have proposed a haplotype based method for the genetic study of multifactorial diseases in founder populations, the MILC method (Bourgain et al. 2000). MILC is based on the contrast of identity length between haplotypes transmitted to affected offspring and haplotype non-transmitted. In this study, the impact of different strategies, regarding missing data, on the MLIC method are evaluated. A real situation is considered where data are derived from a genome screen for asthma susceptibility alleles in the Hutterites. Results are illustrated on this asthma data set.