Transcriptome sequencing and high-resolution melt analysis advance single nucleotide polymorphism discovery in duplicated salmonids
- 24 November 2010
- journal article
- research article
- Published by Wiley in Molecular Ecology Resources
- Vol. 11 (2), 335-348
- https://doi.org/10.1111/j.1755-0998.2010.02936.x
Abstract
Until recently, single nucleotide polymorphism (SNP) discovery in nonmodel organisms faced many challenges, often depending upon a targeted-gene approach and Sanger sequencing of many individuals. The advent of next-generation sequencing technologies has dramatically improved discovery, but validating and testing SNPs for use in population studies remain labour intensive. Here, we detail a SNP discovery and validation pipeline that incorporates 454 pyrosequencing, high-resolution melt analysis (HRMA) and 5' nuclease genotyping. We generated 4.59×10(8) bp of redundant sequence from transcriptomes of two individual chum salmon, a highly valued species across the Pacific Rim. Nearly 26000 putative SNPs were identified--some as heterozygotes and some as homozygous for different nucleotides in the two individuals. For validation, we selected 202 templates containing single putative SNPs and conducted HRMA on 10 individuals from each of 19 populations from across the species range. Finally, 5' nuclease genotyping validated 37 SNPs that conformed to Hardy-Weinberg equilibrium expectations. Putative SNPs expressed as heterozygotes in an ascertainment individual had more than twice the validation rate of those homozygous for different alleles in the two fish, suggesting that many of the latter may have been paralogous sequence variants. Overall, this validation rate of 37/202 suggests that we have found more than 4500 templates containing SNPs for use in this population set. We anticipate using this pipeline to significantly expand the number of SNPs available for the studies of population structure and mixture analyses as well as for the studies of adaptive genetic variation in nonmodel organisms.Keywords
This publication has 49 references indexed in Scilit:
- Generic genetic differences between farmed and wild Atlantic salmon identified from a 7K SNP‐chipMolecular Ecology Resources, 2011
- Identification of single nucleotide polymorphisms in candidate genes for growth and reproduction in a nonmodel organism; the Atlantic cod, Gadus morhuaMolecular Ecology Resources, 2011
- From Conservation Genetics to Conservation GenomicsAnnals of the New York Academy of Sciences, 2009
- High resolution melting analysis of almond SNPs derived from ESTsTheoretical and Applied Genetics, 2008
- Sequencing goes 454 and takes large‐scale genomics into the wildMolecular Ecology, 2008
- Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencingMolecular Ecology, 2008
- Quality scores and SNP detection in sequencing-by-synthesis systemsGenome Research, 2008
- genepop’007: a complete re‐implementation of the genepop software for Windows and LinuxMolecular Ecology Resources, 2008
- The medaka draft genome and insights into vertebrate genome evolutionNature, 2007
- Simple cDNA normalization using kamchatka crab duplex-specific nucleaseNucleic Acids Research, 2004