SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data
Open Access
- 29 July 2015
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 31 (22), 3709-3711
- https://doi.org/10.1093/bioinformatics/btv449
Abstract
Summary: New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. Availability and implementation: SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. Contact:nelsoncw@email.sc.edu or austin@biol.sc.edu Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 12 references indexed in Scilit:
- Within-host nucleotide diversity of virus populations: Insights from next-generation sequencingInfection, Genetics and Evolution, 2014
- Population-Genetic Inference from Pooled-Sequencing DataGenome Biology and Evolution, 2014
- Selection on haemagglutinin imposes a bottleneck during mammalian transmission of reassortant H5N1 influenza virusesNature Communications, 2013
- MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0Molecular Biology and Evolution, 2013
- SNP calling by sequencing pooled samplesBMC Bioinformatics, 2012
- PoPoolation: A Toolbox for Population Genetic Analysis of Next Generation Sequencing Data from Pooled IndividualsPLOS ONE, 2011
- PolyAna: analyzing synonymous and nonsynonymous polymorphic sitesConservation Genetics Resources, 2010
- To Pool, or Not to Pool?Genetics, 2010
- The Next Generation of Molecular Markers From Massively Parallel Sequencing of Pooled DNA SamplesGenetics, 2010
- Widespread purifying selection at polymorphic sites in human protein-coding lociProceedings of the National Academy of Sciences of the United States of America, 2003