Transcriptome sequencing and high-resolution melt analysis advance single nucleotide polymorphism discovery in duplicated salmonids

24 November 2010

journal article
research article
Published by Wiley in Molecular Ecology Resources

Vol. 11 (2), 335-348
https://doi.org/10.1111/j.1755-0998.2010.02936.x

Abstract

Until recently, single nucleotide polymorphism (SNP) discovery in nonmodel organisms faced many challenges, often depending upon a targeted-gene approach and Sanger sequencing of many individuals. The advent of next-generation sequencing technologies has dramatically improved discovery, but validating and testing SNPs for use in population studies remain labour intensive. Here, we detail a SNP discovery and validation pipeline that incorporates 454 pyrosequencing, high-resolution melt analysis (HRMA) and 5' nuclease genotyping. We generated 4.59 × 10^8 bp of redundant sequence from transcriptomes of two individual chum salmon, a highly valued species across the Pacific Rim. Nearly 26 000 putative SNPs were identified, some as heterozygotes and some as homozygous for different nucleotides in the two individuals. For validation, we selected 202 templates containing single putative SNPs and conducted HRMA on 10 individuals from each of 19 populations from across the species range. Finally, 5' nuclease genotyping validated 37 SNPs that conformed to Hardy-Weinberg equilibrium expectations. Putative SNPs expressed as heterozygotes in an ascertainment individual had more than twice the validation rate of those homozygous for different alleles in the two fish, suggesting that many of the latter may have been paralogous sequence variants. Overall, this validation rate of 37/202 suggests that we have found more than 4500 templates containing SNPs for use in this population set. We anticipate using this pipeline to significantly expand the number of SNPs available for the studies of population structure and mixture analyses as well as for the studies of adaptive genetic variation in nonmodel organisms.
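The extrapolation in the abstract can be illustrated with a short calculation. The following is a minimal sketch, not the authors' method: the validated and tested counts (37 and 202) and the sampling design (10 individuals from each of 19 populations) come from the abstract, but the number of candidate templates is not stated there, so `ASSUMED_CANDIDATE_TEMPLATES` is a hypothetical value chosen only to lie below the "nearly 26 000" putative SNPs reported.

```python
from fractions import Fraction

# Figures taken from the abstract.
VALIDATED_SNPS = 37        # SNPs validated by 5' nuclease genotyping
TESTED_TEMPLATES = 202     # templates screened by HRMA
INDIVIDUALS_PER_POPULATION = 10
POPULATIONS = 19

# ASSUMPTION: the abstract does not give the number of candidate templates.
# This hypothetical value only needs to sit below the ~26 000 putative SNPs.
ASSUMED_CANDIDATE_TEMPLATES = 25_000


def validation_rate(validated: int, tested: int) -> Fraction:
    """Return the exact fraction of tested templates that validated."""
    if tested <= 0:
        raise ValueError("tested must be positive")
    return Fraction(validated, tested)


def extrapolate(candidates: int, rate: Fraction) -> int:
    """Project the number of usable templates, rounded down."""
    return (candidates * rate.numerator) // rate.denominator


rate = validation_rate(VALIDATED_SNPS, TESTED_TEMPLATES)
hrma_panel_size = INDIVIDUALS_PER_POPULATION * POPULATIONS
expected_templates = extrapolate(ASSUMED_CANDIDATE_TEMPLATES, rate)

print(f"validation rate: {float(rate):.3f}")
print(f"individuals screened per template: {hrma_panel_size}")
print(f"projected usable templates: {expected_templates}")
```

Under this assumed candidate count, the projection exceeds 4500, which is consistent with the figure quoted in the abstract; a different candidate count would shift the projection proportionally.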
