A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers
Open Access
- 31 December 2009
- journal article
- research article
- Published by Springer Science and Business Media LLC in Genetics Selection Evolution
- Vol. 41 (1), 56
- https://doi.org/10.1186/1297-9686-41-56
Abstract
Genomic selection (GS) uses molecular breeding values (MBV) derived from dense markers across the entire genome for selection of young animals. The accuracy of MBV prediction is important for a successful application of GS. Recently, several methods have been proposed to estimate MBV. Initial simulation studies have shown that these methods can accurately predict MBV. In this study we compared the accuracies and possible bias of five different regression methods in an empirical application in dairy cattle. Genotypes of 7,372 SNP and highly accurate EBV of 1,945 dairy bulls were used to predict MBV for protein percentage (PPT) and a profit index (Australian Selection Index, ASI). Marker effects were estimated by least squares regression (FR-LS), Bayesian regression (Bayes-R), random regression best linear unbiased prediction (RR-BLUP), partial least squares regression (PLSR) and nonparametric support vector regression (SVR) in a training set of 1,239 bulls. Accuracy and bias of MBV prediction were calculated from cross-validation of the training set and tested against a test team of 706 young bulls. For both traits, FR-LS using a subset of SNP was significantly less accurate than all other methods which used all SNP. Accuracies obtained by Bayes-R, RR-BLUP, PLSR and SVR were very similar for ASI (0.39-0.45) and for PPT (0.55-0.61). Overall, SVR gave the highest accuracy. All methods resulted in biased MBV predictions for ASI, for PPT only RR-BLUP and SVR predictions were unbiased. A significant decrease in accuracy of prediction of ASI was seen in young test cohorts of bulls compared to the accuracy derived from cross-validation of the training set. This reduction was not apparent for PPT. Combining MBV predictions with pedigree based predictions gave 1.05 - 1.34 times higher accuracies compared to predictions based on pedigree alone. Some methods have largely different computational requirements, with PLSR and RR-BLUP requiring the least computing time. The four methods which use information from all SNP namely RR-BLUP, Bayes-R, PLSR and SVR generate similar accuracies of MBV prediction for genomic selection, and their use in the selection of immediate future generations in dairy cattle will be comparable. The use of FR-LS in genomic selection is not recommended.This publication has 40 references indexed in Scilit:
- Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle BreedsScience, 2009
- Reducing dimensionality for prediction of genome-wide breeding valuesGenetics Selection Evolution, 2009
- Genomic breeding value estimation using nonparametric additive regression modelsGenetics Selection Evolution, 2009
- Genome-assisted prediction of a quantitative trait measured in parents and progeny: application to food conversion rate in chickensGenetics Selection Evolution, 2009
- Genome-wide association analysis identifies 20 loci that influence adult heightNature Genetics, 2008
- Association scan of 14,500 nonsynonymous SNPs in four diseases identifies autoimmunity variantsNature Genetics, 2007
- Combined Genome Scans for Body Stature in 6,602 European Twins: Evidence for Common Caucasian LociPLoS Genetics, 2007
- A Primary Assembly of a Bovine Haplotype Block Map Based on a 15,036-Single-Nucleotide Polymorphism Panel Genotyped in Holstein–Friesian CattleGenetics, 2007
- Genome-wide genetic association of complex traits in heterogeneous stock miceNature Genetics, 2006
- Genomic-Assisted Prediction of Genetic Value With Semiparametric ProceduresGenetics, 2006