Genomic prediction using training population design in interspecific soybean populations
- 31 January 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Molecular Breeding
- Vol. 41 (2), 1-15
- https://doi.org/10.1007/s11032-021-01203-6
Abstract
Agronomically important traits generally have a complex genetic architecture in which many genes each have a small, largely additive effect. Genomic prediction has been demonstrated to increase genetic gain and efficiency in plant breeding programs beyond marker-assisted selection and phenotypic selection. The objective of this study was to evaluate the impact of allelic origin, marker density, training population size, and cross-validation schemes on the accuracy of genomic prediction models in an interspecific soybean nested association mapping (NAM) panel. Three cross-validation schemes were used: (a) Within-Family (WF): training and prediction are performed exclusively within each family; (b) Across All families (AF): all the individuals from the three families are randomly assigned to either the training or the validation set; (c) Leave one Family out (LFO): each family is predicted using a training set that contains the other two families. Predictive abilities increased with training population size up to 350 individuals, but no significant gains were noted beyond 250 individuals in the training population. The number of markers had a limited impact on the observed predictive ability across traits; increasing the number of markers used in the model above 1000 produced no significant increase in prediction accuracy. Predictive abilities for AF were not significantly different from those for the WF method, and predictive abilities across populations for the WF method ranged from 0.58 to 0.70 for maturity, protein, meal, and oil. Our results also showed encouraging prediction accuracies for grain yield (0.58-0.69) using the WF method. Partitioning genomic prediction between G. max and G. soja alleles revealed useful information to select material with a larger allele contribution from both parents and could accelerate allele introgression from exotic germplasm into the elite soybean gene pool.
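The three cross-validation schemes named in the abstract differ only in how individuals are assigned to training and validation sets. The sketch below illustrates that assignment logic; it is not the authors' code. The function name `cv_splits`, the family labels, the 80/20 training fraction, and the single random partition per scheme are all assumptions made for illustration, since the abstract does not state them.

```python
import numpy as np


def cv_splits(families, scheme, train_frac=0.8, seed=0):
    """Yield (label, train_idx, valid_idx) tuples for one scheme.

    families   : 1-D sequence of family labels, one per individual.
    scheme     : "WF", "AF", or "LFO", as named in the abstract.
    train_frac : assumed training fraction for WF and AF (hypothetical).
    """
    families = np.asarray(families)
    rng = np.random.default_rng(seed)
    idx = np.arange(families.size)
    fams = np.unique(families)

    if scheme == "WF":
        # Within-Family: train and validate inside each family separately.
        for fam in fams:
            members = rng.permutation(idx[families == fam])
            cut = int(round(train_frac * members.size))
            yield fam, members[:cut], members[cut:]
    elif scheme == "AF":
        # Across All families: pool everyone, then split at random.
        shuffled = rng.permutation(idx)
        cut = int(round(train_frac * idx.size))
        yield "all", shuffled[:cut], shuffled[cut:]
    elif scheme == "LFO":
        # Leave one Family out: predict each family from the other families.
        for fam in fams:
            yield fam, idx[families != fam], idx[families == fam]
    else:
        raise ValueError(f"unknown scheme: {scheme!r}")


# Hypothetical panel: three families of ten individuals each.
families = np.repeat(["F1", "F2", "F3"], 10)
for label, train, valid in cv_splits(families, "LFO"):
    print(label, len(train), len(valid))
```

In practice each split would be fed to a prediction model and repeated over many random partitions; both steps are left out here because the abstract does not specify them.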
Funding Information
- United Soybean Board
- Missouri Soybean Merchandising Council