VarMatch: robust matching of small variant datasets using flexible scoring schemes
- 30 December 2016
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 33 (9), 1301-1308
- https://doi.org/10.1093/bioinformatics/btw797
Abstract
Motivation: Small variant calling is an important component of many analyses, and, in many instances, it is important to determine the set of variants which appear in multiple callsets. Variant matching is complicated by variants that have multiple equivalent representations. Normalization and decomposition algorithms have been proposed, but are not robust to different representation of complex variants. Variant matching is also usually done to maximize the number of matches, as opposed to other optimization criteria.Funding Information
- NSF (DBI-1356529, CCF-1439057, IIS-1453527, IIS-1421908)
This publication has 22 references indexed in Scilit:
- Equivalent Indels – Ambiguous Functional Classes and Redundancy in DatabasesPLOS ONE, 2013
- An integrated map of genetic variation from 1,092 human genomesNature, 2012
- SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing dataNucleic Acids Research, 2011
- The variant call format and VCFtoolsBioinformatics, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing dataGenome Research, 2010
- Microindel detection in short-read sequence dataBioinformatics, 2010
- VarScan: variant detection in massively parallel sequencing of individual and pooled samplesBioinformatics, 2009
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- A Microhomology-Mediated Break-Induced Replication Model for the Origin of Human Copy Number VariationPLoS Genetics, 2009