Linkage Analysis and QTL Mapping Using SNP Dosage Data in a Tetraploid Potato Mapping Population
Open Access
- 21 May 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (5), e63939
- https://doi.org/10.1371/journal.pone.0063939
Abstract
New sequencing and genotyping technologies have enabled researchers to generate high density SNP genotype data for mapping populations. In polyploid species, SNP data usually contain a new type of information, the allele dosage, which is not used by current methodologies for linkage analysis and QTL mapping. Here we extend existing methodology to use dosage data on SNPs in an autotetraploid mapping population. The SNP dosages are inferred from allele intensity ratios using normal mixture models. The steps of the linkage analysis (testing for distorted segregation, clustering SNPs, calculation of recombination fractions and LOD scores, ordering of SNPs and inference of parental phase) are extended to use the dosage information. For QTL analysis, the probability of each possible offspring genotype is inferred at a grid of locations along the chromosome from the ordered parental genotypes and phases and the offspring dosages. A normal mixture model is then used to relate trait values to the offspring genotypes and to identify the most likely locations for QTLs. These methods are applied to analyse a tetraploid potato mapping population of parents and 190 offspring, genotyped using an Infinium 8300 Potato SNP Array. Linkage maps for each of the 12 chromosomes are constructed. The allele intensity ratios are mapped as quantitative traits to check that their position and phase agrees with that of the corresponding SNP. This analysis confirms most SNP positions, and eliminates some problem SNPs to give high-density maps for each chromosome, with between 74 and 152 SNPs mapped and between 100 and 300 further SNPs allocated to approximate bins. Low numbers of double reduction products were detected. Overall 3839 of the 5378 polymorphic SNPs can be assigned putative genetic locations. This methodology can be applied to construct high-density linkage maps in any autotetraploid species, and could also be extended to higher autopolyploids.This publication has 39 references indexed in Scilit:
- BeadArray Expression Analysis Using BioconductorPLoS Computational Biology, 2011
- Genome sequence and analysis of the tuber crop potatoNature, 2011
- Genotype calling in tetraploid species from bi-allelic marker data using mixture modelsBMC Bioinformatics, 2011
- High-resolution melting analysis for SNP genotyping and mapping in tetraploid alfalfa (Medicago sativa L.)Molecular Breeding, 2011
- Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD MarkersPLOS ONE, 2008
- Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratiosBMC Bioinformatics, 2008
- Statistical issues in the analysis of Illumina dataBMC Bioinformatics, 2008
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989
- Robust Locally Weighted Regression and Smoothing ScatterplotsJournal of the American Statistical Association, 1979
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978