A fast and accurate SNP detection algorithm for next-generation sequencing data
Open Access
- 1 January 2012
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Communications
- Vol. 3 (1), 1258
- https://doi.org/10.1038/ncomms2256
Abstract
Various methods have been developed for calling single-nucleotide polymorphisms from next-generation sequencing data. However, for satisfactory performance, most of these methods require expensive high-depth sequencing. Here, we propose a fast and accurate single-nucleotide polymorphism detection program that uses a binomial distribution-based algorithm and a mutation probability. We extensively assess this program on normal and cancer next-generation sequencing data from The Cancer Genome Atlas project and pooled data from the 1,000 Genomes Project. We also compare the performance of several state-of-the-art programs for single-nucleotide polymorphism calling and evaluate their pros and cons. We demonstrate that our program is a fast and highly accurate single-nucleotide polymorphism detection method, particularly when the sequence depth is low. The program can finish single-nucleotide polymorphism calling within four hours for 10-fold human genome next-generation sequencing data (30 gigabases) on a standard desktop computer.Keywords
This publication has 30 references indexed in Scilit:
- Genotype and SNP calling from next-generation sequencing dataNature Reviews Genetics, 2011
- A framework for variation discovery and genotyping using next-generation DNA sequencing dataNature Genetics, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- Simultaneous Genotype Calling and Haplotype Phasing Improves Genotype Accuracy and Reduces False-Positive Associations for Genome-wide Association StudiesAmerican Journal of Human Genetics, 2009
- Comprehensive genomic characterization defines human glioblastoma genes and core pathwaysNature, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- A haplotype map of the human genomeNature, 2005
- Advanced sequencing technologies: methods and goalsNature Reviews Genetics, 2004
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001