Maximizing efficiency of genomic selection in CIMMYT’s tropical maize breeding program
Open Access
- 10 October 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Theoretical and Applied Genetics
- Vol. 134 (1), 279-294
- https://doi.org/10.1007/s00122-020-03696-9
Abstract
Key message Historical data from breeding programs can be efficiently used to improve genomic selection accuracy, especially when the training set is optimized to subset individuals most informative of the target testing set. Abstract The current strategy for large-scale implementation of genomic selection (GS) at the International Maize and Wheat Improvement Center (CIMMYT) global maize breeding program has been to train models using information from full-sibs in a “test-half-predict-half approach.” Although effective, this approach has limitations, as it requires large full-sib populations and limits the ability to shorten variety testing and breeding cycle times. The primary objective of this study was to identify optimal experimental and training set designs to maximize prediction accuracy of GS in CIMMYT’s maize breeding programs. Training set (TS) design strategies were evaluated to determine the most efficient use of phenotypic data collected on relatives for genomic prediction (GP) using datasets containing 849 (DS1) and 1389 (DS2) DH-lines evaluated as testcrosses in 2017 and 2018, respectively. Our results show there is merit in the use of multiple bi-parental populations as TS when selected using algorithms to maximize relatedness between the training and prediction sets. In a breeding program where relevant past breeding information is not readily available, the phenotyping expenditure can be spread across connected bi-parental populations by phenotyping only a small number of lines from each population. This significantly improves prediction accuracy compared to within-population prediction, especially when the TS for within full-sib prediction is small. Finally, we demonstrate that prediction accuracy in either sparse testing or “test-half-predict-half” can further be improved by optimizing which lines are planted for phenotyping and which lines are to be only genotyped for advancement based on GP.Keywords
Funding Information
- Bill and Melinda Gates Foundation (OPP1134248, OPP1093167)
This publication has 41 references indexed in Scilit:
- Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased PredictorPLoS Genetics, 2013
- Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular MarkersCrop Science, 2012
- The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemesGenetics Selection Evolution, 2012
- Genomic Selection and Prediction in Plant BreedingJournal of Crop Improvement, 2011
- Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular MarkersGenetics, 2010
- The impact of genetic relationship information on genomic breeding values in German Holstein cattleGenetics Selection Evolution, 2010
- Efficient Methods to Compute Genomic PredictionsJournal of Dairy Science, 2008
- Linkage Disequilibrium in Related Breeding Lines of ChickensGenetics, 2007
- The Impact of Genetic Relationship Information on Genome-Assisted Breeding ValuesGenetics, 2007
- Precision and information in linear models of genetic evaluationGenetics Selection Evolution, 1993