A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
Open Access
- 1 January 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Genomics
- Vol. 14 (1), 137
- https://doi.org/10.1186/1471-2164-14-137
Abstract
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8,067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5,847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array, which is more SNPs than are needed to use genomic selection in tree breeding programs.
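The validation arithmetic quoted in the abstract can be checked with a short sketch. It assumes the ~200,000 estimate is a simple extrapolation of the array validation rate to the full set of putative SNPs; the abstract does not spell out the exact calculation, so this is an illustration and not the authors' stated method. The function and variable names are hypothetical. The ~69,000 genotypable-SNP figure depends on design criteria not given in the abstract and is not reproduced here.

```python
# Figures quoted in the abstract.
SNPS_ASSAYED = 8067        # SNPs placed on the Infinium array
SNPS_VALIDATED = 5847      # called successfully and polymorphic
PUTATIVE_SNPS = 278_979    # unique putative SNPs in the database
ILLUMINA_READS_MILLIONS = [64.00, 13.41, 80.45, 8.99]  # four Illumina datasets


def validation_rate(validated: int, assayed: int) -> float:
    """Fraction of assayed SNPs that were successfully called and polymorphic."""
    return validated / assayed


def extrapolate_true_snps(putative: int, rate: float) -> int:
    """Assumed extrapolation: scale the putative SNP count by the validation rate."""
    return round(putative * rate)


rate = validation_rate(SNPS_VALIDATED, SNPS_ASSAYED)
estimated_true = extrapolate_true_snps(PUTATIVE_SNPS, rate)
total_illumina = sum(ILLUMINA_READS_MILLIONS)

print(f"Validation rate: {rate:.1%}")
print(f"Extrapolated true SNPs: {estimated_true:,}")
print(f"Total Illumina reads: {total_illumina:.2f} million")
```

Under this assumption the extrapolated count lands close to the "~200,000 true SNPs" reported, which suggests the abstract's figure is a rounded version of the same scaling.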
Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.