Maximizing efficiency of genomic selection in CIMMYT’s tropical maize breeding program

Open Access

10 October 2020

journal article
research article
Published by Springer Science and Business Media LLC in Theoretical and Applied Genetics

Vol. 134 (1), 279-294
https://doi.org/10.1007/s00122-020-03696-9

Abstract

Key message Historical data from breeding programs can be efficiently used to improve genomic selection accuracy, especially when the training set is optimized to subset individuals most informative of the target testing set. Abstract The current strategy for large-scale implementation of genomic selection (GS) at the International Maize and Wheat Improvement Center (CIMMYT) global maize breeding program has been to train models using information from full-sibs in a “test-half-predict-half approach.” Although effective, this approach has limitations, as it requires large full-sib populations and limits the ability to shorten variety testing and breeding cycle times. The primary objective of this study was to identify optimal experimental and training set designs to maximize prediction accuracy of GS in CIMMYT’s maize breeding programs. Training set (TS) design strategies were evaluated to determine the most efficient use of phenotypic data collected on relatives for genomic prediction (GP) using datasets containing 849 (DS1) and 1389 (DS2) DH-lines evaluated as testcrosses in 2017 and 2018, respectively. Our results show there is merit in the use of multiple bi-parental populations as TS when selected using algorithms to maximize relatedness between the training and prediction sets. In a breeding program where relevant past breeding information is not readily available, the phenotyping expenditure can be spread across connected bi-parental populations by phenotyping only a small number of lines from each population. This significantly improves prediction accuracy compared to within-population prediction, especially when the TS for within full-sib prediction is small. Finally, we demonstrate that prediction accuracy in either sparse testing or “test-half-predict-half” can further be improved by optimizing which lines are planted for phenotyping and which lines are to be only genotyped for advancement based on GP.

Keywords

Funding Information

Bill and Melinda Gates Foundation (OPP1134248, OPP1093167)

This publication has 41 references indexed in Scilit:

Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor
PLoS Genetics, 2013
Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular Markers
Crop Science, 2012
The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes
Genetics Selection Evolution, 2012
Genomic Selection and Prediction in Plant Breeding
Journal of Crop Improvement, 2011
Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular Markers
Genetics, 2010
The impact of genetic relationship information on genomic breeding values in German Holstein cattle
Genetics Selection Evolution, 2010
Efficient Methods to Compute Genomic Predictions
Journal of Dairy Science, 2008
Linkage Disequilibrium in Related Breeding Lines of Chickens
Genetics, 2007
The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values
Genetics, 2007
Precision and information in linear models of genetic evaluation
Genetics Selection Evolution, 1993

Cited by 37 articles