FusionMap: detecting fusion genes from next-generation sequencing data at base-pair resolution
Open Access
- 18 May 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (14), 1922-1928
- https://doi.org/10.1093/bioinformatics/btr310
Abstract
Motivation: Next generation sequencing technology generates high-throughput data, which allows us to detect fusion genes at both transcript and genomic levels. To detect fusion genes, the current bioinformatics tools heavily rely on paired-end approaches and overlook the importance of reads that span fusion junctions. Thus there is a need to develop an efficient aligner to detect fusion events by accurate mapping of these junction-spanning single reads, particularly when the read gets longer with the improvement in sequencing technology. Results: We present a novel method, FusionMap, which aligns fusion reads directly to the genome without prior knowledge of potential fusion regions. FusionMap can detect fusion events in both single- and paired-end datasets from either RNA-Seq or gDNA-Seq studies and characterize fusion junctions at base-pair resolution. We showed that FusionMap achieved high sensitivity and specificity in fusion detection on two simulated RNA-Seq datasets, which contained 75 nt paired-end reads. FusionMap achieved substantially higher sensitivity and specificity than the paired-end approach when the inner distance between read pairs was small. Using FusionMap to characterize fusion genes in K562 chronic myeloid leukemia cell line, we further demonstrated its accuracy in fusion detection in both single-end RNA-Seq and gDNA-Seq datasets. These combined results show that FusionMap provides an accurate and systematic solution to detecting fusion events through junction-spanning reads. Availability: FusionMap includes reference indexing, read filtering, fusion alignment and reporting in one package. The software is free for noncommercial use at (http://www.omicsoft.com/fusionmap). Contact:ge@amgen.com Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 24 references indexed in Scilit:
- Detection of splice junctions from paired-end RNA-seq data by SpliceMapNucleic Acids Research, 2010
- Integrative analysis of the melanoma transcriptomeGenome Research, 2010
- Fusion genes and chromosome translocations in the common epithelial cancersThe Journal of Pathology, 2009
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009
- Transcriptome sequencing to detect gene fusions in cancerNature, 2009
- Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcriptsGenome Biology, 2009
- A sequence-level map of chromosomal breakpoints in the MCF-7 breast cancer cell line yields insights into the evolution of a cancer genomeGenome Research, 2008
- Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencingNature Genetics, 2008
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Tyrosine Kinase Activity and Transformation Potency of bcr-abl Oncogene ProductsScience, 1990