Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels
Top Cited Papers
Open Access
- 1 July 2012
- journal article
- Published by American Dairy Science Association in Journal of Dairy Science
- Vol. 95 (7), 4114-4129
- https://doi.org/10.3168/jds.2011-5019
Abstract
Achieving accurate genomic estimated breeding values for dairy cattle requires a very large reference population of genotyped and phenotyped individuals. Assembling such reference populations has been achieved for breeds such as Holstein, but is challenging for breeds with fewer individuals. An alternative is to use a multi-breed reference population, such that smaller breeds gain some advantage in accuracy of genomic estimated breeding values (GEBV) from information from larger breeds. However, this requires that marker-quantitative trait loci associations persist across breeds. Here, we assessed the gain in accuracy of GEBV in Jersey cattle as a result of using a combined Holstein and Jersey reference population, with either 39,745 or 624,213 single nucleotide polymorphism (SNP) markers. The surrogate used for accuracy was the correlation of GEBV with daughter trait deviations in a validation population. Two methods were used to predict breeding values, either a genomic BLUP (GBLUP_mod), or a new method, BayesR, which used a mixture of normal distributions as the prior for SNP effects, including one distribution that set SNP effects to zero. The GBLUP_mod method scaled both the genomic relationship matrix and the additive relationship matrix to a base at the time the breeds diverged, and regressed the genomic relationship matrix to account for sampling errors in estimating relationship coefficients due to a finite number of markers, before combining the 2 matrices. Although these modifications did result in less biased breeding values for Jerseys compared with an unmodified genomic relationship matrix, BayesR gave the highest accuracies of GEBV for the 3 traits investigated (milk yield, fat yield, and protein yield), with an average increase in accuracy compared with GBLUP_mod across the 3 traits of 0.05 for both Jerseys and Holsteins. The advantage was limited for either Jerseys or Holsteins in using 624,213 SNP rather than 39,745 SNP (0.01 for Holsteins and 0.03 for Jerseys, averaged across traits). Even this limited and nonsignificant advantage was only observed when BayesR was used. An alternative panel, which extracted the SNP in the transcribed part of the bovine genome from the 624,213 SNP panel (to give 58,532 SNP), performed better, with an increase in accuracy of 0.03 for Jerseys across traits. This panel captures much of the increased genomic content of the 624,213 SNP panel, with the advantage of a greatly reduced number of SNP effects to estimate. Taken together, using this panel, a combined breed reference and using BayesR rather than GBLUP_mod increased the accuracy of GEBV in Jerseys from 0.43 to 0.52, averaged across the 3 traitsKeywords
Funding Information
- Bundesministerium für Bildung und Forschung (0315526)
This publication has 31 references indexed in Scilit:
- Extension of the bayesian alphabet for genomic selectionBMC Bioinformatics, 2011
- Genetic Architecture of Complex Traits and Accuracy of Genomic Prediction: Coat Colour, Milk-Fat Percentage, and Type in Holstein Cattle as Contrasting Model TraitsPLoS Genetics, 2010
- Common SNPs explain a large proportion of the heritability for human heightNature Genetics, 2010
- The impact of genetic relationship information on genomic breeding values in German Holstein cattleGenetics Selection Evolution, 2010
- Accuracy of genomic breeding values in multi-breed dairy cattle populationsGenetics Selection Evolution, 2009
- Development and Characterization of a High Density SNP Genotyping Assay for CattlePLOS ONE, 2009
- Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle BreedsScience, 2009
- Increased accuracy of artificial selection by using the realized relationship matrixGenetics Research, 2009
- A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated IndividualsAmerican Journal of Human Genetics, 2009
- Prediction of individual genetic risk to disease from genome-wide association studiesGenome Research, 2007