Ab initio reconstruction of cell type–specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs
Top Cited Papers
Open Access
- 2 May 2010
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Biotechnology
- Vol. 28 (5), 503-510
- https://doi.org/10.1038/nbt.1633
Abstract
High-throughput sequencing of total cellular RNA by RNA-Seq promises rapid reconstruction of spliced transcripts in a cell population. Guttman et al. accomplish this using only paired-end RNA-seq data and an unannotated genome sequence, and apply the method to better define many new, conserved long intergenic noncoding RNAs (lincRNAs). Massively parallel cDNA sequencing (RNA-Seq) provides an unbiased way to study a transcriptome, including both coding and noncoding genes. Until now, most RNA-Seq studies have depended crucially on existing annotations and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We applied it to mouse embryonic stem cells, neuronal precursor cells and lung fibroblasts to accurately reconstruct the full-length gene structures for most known expressed genes. We identified substantial variation in protein coding genes, including thousands of novel 5′ start sites, 3′ ends and internal coding exons. We then determined the gene structures of more than a thousand large intergenic noncoding RNA (lincRNA) and antisense loci. Our results open the way to direct experimental manipulation of thousands of noncoding RNAs and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes.Keywords
This publication has 32 references indexed in Scilit:
- Dynamic transcriptomes during neural differentiation of human embryonic stem cells revealed by short, long, and paired-end sequencingProceedings of the National Academy of Sciences of the United States of America, 2010
- Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expressionProceedings of the National Academy of Sciences of the United States of America, 2009
- Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencingProceedings of the National Academy of Sciences of the United States of America, 2009
- Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammalsNature, 2009
- Transcriptome sequencing to detect gene fusions in cancerNature, 2009
- Alternative isoform regulation in human tissue transcriptomesNature, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- Highly Integrated Single-Base Resolution Maps of the Epigenome in ArabidopsisCell, 2008
- Genome-wide maps of chromatin state in pluripotent and lineage-committed cellsNature, 2007
- Functional Demarcation of Active and Silent Chromatin Domains in Human HOX Loci by Noncoding RNAsCell, 2007