The eSNV-detect: a computational system to identify expressed single nucleotide variants from transcriptome sequencing data
Open Access
- 28 October 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 42 (22), e172
- https://doi.org/10.1093/nar/gku1005
Abstract
Rapid development of next generation sequencing technology has enabled the identification of genomic alterations from short sequencing reads. There are a number of software pipelines available for calling single nucleotide variants from genomic DNA but, no comprehensive pipelines to identify, annotate and prioritize expressed SNVs (eSNVs) from non-directional paired-end RNA-Seq data. We have developed the eSNV-Detect, a novel computational system, which utilizes data from multiple aligners to call, even at low read depths, and rank variants from RNA-Seq. Multi-platform comparisons with the eSNV-Detect variant candidates were performed. The method was first applied to RNA-Seq from a lymphoblastoid cell-line, achieving 99.7% precision and 91.0% sensitivity in the expressed SNPs for the matching HumanOmni2.5 BeadChip data. Comparison of RNA-Seq eSNV candidates from 25 ER+ breast tumors from The Cancer Genome Atlas (TCGA) project with whole exome coding data showed 90.6–96.8% precision and 91.6–95.7% sensitivity. Contrasting single-cell mRNA-Seq variants with matching traditional multicellular RNA-Seq data for the MD-MB231 breast cancer cell-line delineated variant heterogeneity among the single-cells. Further, Sanger sequencing validation was performed for an ER+ breast tumor with paired normal adjacent tissue validating 29 out of 31 candidate eSNVs. The source code and user manuals of the eSNV-Detect pipeline for Sun Grid Engine and virtual machine are available at http://bioinformaticstools.mayo.edu/research/esnv-detect/.Keywords
This publication has 47 references indexed in Scilit:
- Comparing somatic mutation-callers: beyond Venn diagramsBMC Bioinformatics, 2013
- TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusionsGenome Biology, 2013
- Comprehensive molecular portraits of human breast tumoursNature, 2012
- Comprehensive genomic characterization of squamous cell lung cancersNature, 2012
- Comprehensive molecular characterization of human colon and rectal cancerNature, 2012
- The landscape of cancer genes and mutational processes in breast cancerNature, 2012
- Exome sequencing identifies frequent mutation of the SWI/SNF complex gene PBRM1 in renal carcinomaNature, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- RNA-Seq: a revolutionary tool for transcriptomicsNature Reviews Genetics, 2009
- Comprehensive genomic characterization defines human glioblastoma genes and core pathwaysNature, 2008