Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences
Top Cited Papers
Open Access
- 1 August 2011
- journal article
- Published by Oxford University Press (OUP) in G3 Genes|Genomes|Genetics
- Vol. 1 (3), 171-182
- https://doi.org/10.1534/g3.111.000240
Abstract
Advances in sequencing technology provide special opportunities for genotyping individuals with speed and thrift, but the lack of software to automate the calling of tens of thousands of genotypes over hundreds of individuals has hindered progress. Stacks is a software system that uses short-read sequence data to identify and genotype loci in a set of individuals either de novo or by comparison to a reference genome. From reduced representation Illumina sequence data, such as RAD-tags, Stacks can recover thousands of single nucleotide polymorphism (SNP) markers useful for the genetic analysis of crosses or populations. Stacks can generate markers for ultra-dense genetic linkage maps, facilitate the examination of population phylogeography, and help in reference genome assembly. We report here the algorithms implemented in Stacks and demonstrate their efficacy by constructing loci from simulated RAD-tags taken from the stickleback reference genome and by recapitulating and improving a genetic map of the zebrafish, Danio rerio.Keywords
This publication has 35 references indexed in Scilit:
- Ancestral polyploidy in seed plants and angiospermsNature, 2011
- Resolving postglacial phylogeography using high-throughput sequencingProceedings of the National Academy of Sciences of the United States of America, 2010
- Personal genome sequencing: current approaches and challengesJournal of Bone and Joint Surgery, 2010
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- A high density linkage map of the bovine genomeBMC Genomic Data, 2009
- A salmonid EST genomic study: genes, duplications, phylogeny and microarraysBMC Genomics, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphsGenome Research, 2008
- Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markersGenome Research, 2006
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997