Comprehensive assembly of novel transcripts from unmapped human RNA‐Seq data and their association with cancer
Open Access
- 7 August 2015
- journal article
- Published by EMBO in Molecular Systems Biology
- Vol. 11 (8), 826
- https://doi.org/10.15252/msb.156172
Abstract
Crucial parts of the genome including genes encoding microRNAs and noncoding RNAs went unnoticed for years, and even now, despite extensive annotation and assembly of the human genome, RNA‐sequencing continues to yield millions of unmappable and thus uncharacterized reads. Here, we examined > 300 billion reads from 536 normal donors and 1,873 patients encompassing 21 cancer types, identified ~300 million such uncharacterized reads, and using a distinctive approach de novo assembled 2,550 novel human transcripts, which mainly represent long noncoding RNAs. Of these, 230 exhibited relatively specific expression or non‐expression in certain cancer types, making them potential markers for those cancers, whereas 183 exhibited tissue specificity. Moreover, we used lentiviral‐mediated expression of three selected transcripts that had higher expression in normal than in cancer patients and found that each inhibited the growth of HepG2 cells. Our analysis provides a comprehensive and unbiased resource of unmapped human transcripts and reveals their associations with specific cancers, providing potentially important new genes for therapeutic targeting.Keywords
This publication has 57 references indexed in Scilit:
- Mapping and analysis of chromatin state dynamics in nine human cell typesNature, 2011
- Long Noncoding RNAs with Enhancer-like Function in Human CellsCell, 2010
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- Widespread transcription at neuronal activity-regulated enhancersNature, 2010
- Comprehensive genomic characterization defines human glioblastoma genes and core pathwaysNature, 2008
- Patterns of somatic mutation in human cancer genomesNature, 2007
- Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genomeNature Genetics, 2007
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990