Beta-Binomial Model for the Detection of Rare Mutations in Pooled Next-Generation Sequencing Experiments
- 1 April 2017
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 24 (4), 357-367
- https://doi.org/10.1089/cmb.2016.0106
Abstract
Despite diminishing costs, next-generation sequencing (NGS) remains expensive for studies with a large number of individuals. As a cost-saving measure, pools containing multiple samples can be sequenced together. Many software tools are currently available for the detection of single-nucleotide polymorphisms (SNPs). Their sensitivity and specificity depend on the model used and the data analyzed, indicating that all of them have room for improvement. We use a beta-binomial model to detect rare mutations in untagged pooled NGS experiments. We propose a multireference framework for pooled data with the ability to be specific for up to two patients affected by neuromuscular disorders (NMD). We assessed the results by comparison with the Genome Analysis Toolkit (GATK), CRISP, SNVer, and FreeBayes. Our results show that the multireference approach applying the beta-binomial model is accurate in predicting rare mutations at a fraction of 0.01. Finally, we explored the concordance of mutations between the model and the other software, checking their involvement in any NMD-related gene. We detected seven novel SNPs, for which the functional analysis produced enriched terms related to locomotion and musculature.
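
To make the core idea concrete, the following is a minimal sketch of how a beta-binomial model can flag a candidate rare variant at a single position. It is not the authors' multireference framework, which the abstract does not specify in detail. The function names, the shape parameters `alpha` and `beta` (assumed here to describe the background error rate, for example as estimated from a reference sample), and the example depth and count are all hypothetical.

```python
from math import exp, lgamma


def log_beta(a: float, b: float) -> float:
    """Natural log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def betabinom_pmf(k: int, n: int, alpha: float, beta: float) -> float:
    """P(X = k) for X ~ BetaBinomial(n, alpha, beta)."""
    log_choose = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
    return exp(log_choose + log_beta(k + alpha, n - k + beta) - log_beta(alpha, beta))


def upper_tail(k: int, n: int, alpha: float, beta: float) -> float:
    """P(X >= k): chance of seeing k or more non-reference reads from error alone."""
    return sum(betabinom_pmf(i, n, alpha, beta) for i in range(k, n + 1))


# Hypothetical example: background error with mean alpha / (alpha + beta) = 0.001,
# read depth 1000, and 10 non-reference reads (an observed fraction of 0.01).
p_value = upper_tail(k=10, n=1000, alpha=1.0, beta=999.0)
print(f"P(X >= 10 | error model) = {p_value:.3g}")
```

A small tail probability suggests that the observed non-reference reads are unlikely under the assumed error model alone. In practice the threshold, the parameter estimation, and any multiple-testing correction would need to follow the published method.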