Accurate Prediction of Genetic Values for Complex Traits by Whole-Genome Resequencing

1 June 2010

journal article
Published by Oxford University Press (OUP) in Genetics

Vol. 185 (2), 623-631
https://doi.org/10.1534/genetics.110.116590

Abstract

Whole-genome resequencing technology has improved rapidly during recent years and is expected to improve further such that the sequencing of an entire human genome sequence for $1000 is within reach. Our main aim here is to use whole-genome sequence data for the prediction of genetic values of individuals for complex traits and to explore the accuracy of such predictions. This is relevant for the fields of plant and animal breeding and, in human genetics, for the prediction of an individual's risk for complex diseases. Here, population history and genomic architectures were simulated under the Wright–Fisher population and infinite-sites mutation model, and prediction of genetic value was by the genomic selection approach, where a Bayesian nonlinear model was used to predict the effects of individual SNPs. The Bayesian model assumed a priori that only few SNPs are causative, i.e., have an effect different from zero. When using whole-genome sequence data, accuracies of prediction of genetic value were >40% increased relative to the use of dense ∼30K SNP chips. At equal high density, the inclusion of the causative mutations yielded an extra increase of accuracy of 2.5–3.7%. Predictions of genetic value remained accurate even when the training and evaluation data were 10 generations apart. Best linear unbiased prediction (BLUP) of SNP effects does not take full advantage of the genome sequence data, and nonlinear predictions, such as the Bayesian method used here, are needed to achieve maximum accuracy. On the basis of theoretical work, the results could be extended to more realistic genome and population sizes.

Keywords

This publication has 20 references indexed in Scilit:

Accuracy of breeding values of 'unrelated' individuals predicted by dense SNP genotyping
Genetics Selection Evolution, 2009
Genome-based prediction of common diseases: advances and prospects
Human Molecular Genetics, 2008
Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach
PLOS ONE, 2008
Next-generation DNA sequencing
Nature Biotechnology, 2008
Next-Generation DNA Sequencing Methods
Annual Review of Genomics and Human Genetics, 2008
Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease
Nature Genetics, 2008
Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits
PLoS Genetics, 2008
The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values
Genetics, 2007
Sequence-Level Population Simulations Over Large Genomic Regions
Genetics, 2007
A Fast and Flexible Statistical Model for Large-Scale Population Genotype Data: Applications to Inferring Missing Genotypes and Haplotypic Phase
American Journal of Human Genetics, 2006

Cited by 335 articles