Research Article Support vector machines applied to the genetic classification problem of hybrid populations with high degrees of similarity
- 31 December 2017
- journal article
- research article
- Published by Genetics and Molecular Research in Evolution
- Vol. 17 (4)
- https://doi.org/10.4238/gmr18122
Abstract
Selection of appropriate genitors in breeding programs increases gains due to the variability found in the divergent groups; this allows quantification of the existing variability, saving time and resources. There are many methods for quantification and evaluation of diversity in population studies, among which we highlight methods that are based on multivariate statistical analyses, such as linear discriminant analysis (LDA) and cluster analysis. Here we propose and evaluate the use of Support Vector machine (SVM) and Artificial Neural Network (ANN) in an attempt to solve the problem of genetic classification of hybrid populations with high degrees of similarity. The results obtained, in terms of the apparent error rate (APER), were compared with those obtained using ANN analysis and LDA. In general, the lowest APER values were associated with scenarios with low degrees of genetic similarity between populations. Specifically, the best results obtained through SVM (ranging from 14.44 to 67.41%) were observed when the exponential radial base kernel function was used. The APERs obtained by the ANN were even lower than those of the linear discriminant function.Keywords
This publication has 14 references indexed in Scilit:
- Automatic Detection of Diseased Tomato Plants Using Thermal and Stereo Visible Light ImagesPLOS ONE, 2015
- Importance of Genetic Diversity Assessment in Crop Plants and Its Recent Advances: An Overview of Its Analytical PerspectivesGenetics Research International, 2015
- Identifying Two of Tomatoes Leaf Viruses Using Support Vector MachinePublished by Springer Science and Business Media LLC ,2015
- Superiority of artificial neural networks for a genetic classification procedureEvolution, 2015
- IntroductionPublished by Springer Science and Business Media LLC ,2013
- Use of support vector machines for disease risk prediction in genome-wide association studies: Concerns and opportunitiesHuman Mutation, 2012
- Application of support vector regression to genome-assisted prediction of quantitative traitsTheoretical and Applied Genetics, 2011
- Divergência genética entre acessos e cultivares de mamoneira por meio de estatística multivariadaPesquisa Agropecuária Brasileira, 2006
- Using discriminant analysis for multi-class classification: an experimental investigationKnowledge and Information Systems, 2006
- Identification of candidate markers associated with agronomic traits in rice using discriminant analysisTheoretical and Applied Genetics, 2005