A catalogue of novel bovine long noncoding RNA across 18 tissues
Open Access
- 23 October 2015
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 10 (10), e0141225
- https://doi.org/10.1371/journal.pone.0141225
Abstract
Long non-coding RNA (lncRNA) have been implicated in diverse biological roles including gene regulation and genomic imprinting. Identifying lncRNA in bovine across many differing tissue would contribute to the current repertoire of bovine lncRNA, and help further improve our understanding of the evolutionary importance and constraints of these transcripts. Additionally, it could aid in identifying sites in the genome outside of protein coding genes where mutations could contribute to variation in complex traits. This is particularly important in bovine as genomic predictions are increasingly used in genetic improvement for milk and meat production. Our aim was to identify and annotate novel long non coding RNA transcripts in the bovine genome captured from RNA Sequencing (RNA-Seq) data across 18 tissues, sampled in triplicate from a single cow. To address the main challenge in identifying lncRNA, namely distinguishing lncRNA transcripts from unannotated genes and protein coding genes, a lncRNA identification pipeline with a number of filtering steps was developed. A total of 9,778 transcripts passed the filtering pipeline. The bovine lncRNA catalogue includes MALAT1 and HOTAIR, both of which have been well described in human and mouse genomes. We attempted to validate the lncRNA in libraries from three additional cows. 726 (87.47%) liver and 1,668 (55.27%) blood class 3 lncRNA were validated with stranded liver and blood libraries respectively. Additionally, this study identified a large number of novel unknown transcripts in the bovine genome with high protein coding potential, illustrating a clear need for better annotations of protein coding genes.Keywords
This publication has 52 references indexed in Scilit:
- An integrated encyclopedia of DNA elements in the human genomeNature, 2012
- A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistanceNature Genetics, 2012
- Conserved Function of lincRNAs in Vertebrate Embryonic Development despite Rapid Sequence EvolutionCell, 2011
- Non-coding RNAs as regulators of gene expression and epigeneticsCardiovascular Research, 2011
- RNA sequencing: advances, challenges and opportunitiesNature Reviews Genetics, 2010
- Critical association of ncRNA with intronsNucleic Acids Research, 2010
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- edgeR: a Bioconductor package for differential expression analysis of digital gene expression dataBioinformatics, 2009
- CPC: assess the protein-coding potential of transcripts using sequence features and support vector machineNucleic Acids Research, 2007
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002