Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
Open Access
- 1 June 2013
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 194 (2), 459-471
- https://doi.org/10.1534/genetics.113.150029
Abstract
Segments of indentity-by-descent (IBD) detected from high-density genetic data are useful for many applications, including long-range phase determination, phasing family data, imputation, IBD mapping, and heritability analysis in founder populations. We present Refined IBD, a new method for IBD segment detection. Refined IBD achieves both computational efficiency and highly accurate IBD segment reporting by searching for IBD in two steps. The first step (identification) uses the GERMLINE algorithm to find shared haplotypes exceeding a length threshold. The second step (refinement) evaluates candidate segments with a probabilistic approach to assess the evidence for IBD. Like GERMLINE, Refined IBD allows for IBD reporting on a haplotype level, which facilitates determination of multi-individual IBD and allows for haplotype-based downstream analyses. To investigate the properties of Refined IBD, we simulate SNP data from a model with recent superexponential population growth that is designed to match United Kingdom data. The simulation results show that Refined IBD achieves a better power/accuracy profile than fastIBD or GERMLINE. We find that a single run of Refined IBD achieves greater power than 10 runs of fastIBD. We also apply Refined IBD to SNP data for samples from the United Kingdom and from Northern Finland and describe the IBD sharing in these data sets. Refined IBD is powerful, highly accurate, and easy to use and is implemented in Beagle version 4.Keywords
This publication has 41 references indexed in Scilit:
- Length Distributions of Identity by Descent Reveal Fine-Scale Demographic HistoryAmerican Journal of Human Genetics, 2012
- Identity-by-descent-based heritability analysis in the Northern Finland Birth CohortHuman Genetics, 2012
- Identity by descent estimation with dense genome-wide genotype dataGenetic Epidemiology, 2011
- DASH: A Method for Identical-by-Descent Haplotype Mapping Uncovers Association with Recent VariationAmerican Journal of Human Genetics, 2011
- A Fast, Powerful Method for Detecting Identity by DescentAmerican Journal of Human Genetics, 2011
- Identification of regions of positive selection using Shared Genomic Segment analysisEuropean Journal of Human Genetics, 2011
- MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypesGenetic Epidemiology, 2010
- High-Resolution Detection of Identity by Descent in Unrelated IndividualsAmerican Journal of Human Genetics, 2010
- A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated IndividualsAmerican Journal of Human Genetics, 2009
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007