Identification and classification of ncRNA molecules using graph properties
Open Access
- 1 April 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (9), e66
- https://doi.org/10.1093/nar/gkp206
Abstract
The study of non-coding RNA genes has received increased attention in recent years fuelled by accumulating evidence that larger portions of genomes than previously acknowledged are transcribed into RNA molecules of mostly unknown function, as well as the discovery of novel non-coding RNA types and functional RNA elements. Here, we demonstrate that specific properties of graphs that represent the predicted RNA secondary structure reflect functional information. We introduce a computational algorithm and an associated web-based tool (GraPPLE) for classifying non-coding RNA molecules as functional and, furthermore, into Rfam families based on their graph properties. Unlike sequence-similarity-based methods and covariance models, GraPPLE is demonstrated to be more robust with regard to increasing sequence divergence, and when combined with existing methods, leads to a significant improvement of prediction accuracy. Furthermore, graph properties identified as most informative are shown to provide an understanding as to what particular structural features render RNA molecules functional. Thus, GraPPLE may offer a valuable computational filtering tool to identify potentially interesting RNA molecules among large candidate datasets.
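The abstract's central idea, that a predicted RNA secondary structure can be represented as a graph whose properties are then used as classification features, can be illustrated with a minimal sketch. This is not GraPPLE's implementation: the graph encoding (one node per nucleotide, edges for backbone neighbours and base pairs), the dot-bracket input format, the chosen properties, and the function names `dot_bracket_to_graph` and `graph_properties` are all assumptions made for illustration only.

```python
from collections import deque


def dot_bracket_to_graph(structure):
    """Build an adjacency map from a dot-bracket string (assumed input format).

    Assumed encoding: one node per nucleotide, one edge per backbone
    neighbour pair (i, i+1), and one edge per base pair.
    Returns (adjacency, number_of_base_pairs).
    """
    if not structure:
        raise ValueError("structure must not be empty")
    adj = {i: set() for i in range(len(structure))}
    for i in range(len(structure) - 1):  # backbone edges
        adj[i].add(i + 1)
        adj[i + 1].add(i)
    stack, pairs = [], 0
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            j = stack.pop()
            adj[i].add(j)  # base-pair edge
            adj[j].add(i)
            pairs += 1
        elif ch != ".":
            raise ValueError("unexpected character %r" % ch)
    if stack:
        raise ValueError("unbalanced '(' in structure")
    return adj, pairs


def _eccentricity(adj, source):
    """Longest shortest-path distance from source, found by breadth-first search."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return max(dist.values())


def graph_properties(structure):
    """Compute a few simple graph properties (an illustrative, hypothetical feature set)."""
    adj, pairs = dot_bracket_to_graph(structure)
    n_nodes = len(adj)
    n_edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    return {
        "nodes": n_nodes,
        "edges": n_edges,
        "base_pairs": pairs,
        "mean_degree": 2.0 * n_edges / n_nodes,
        "diameter": max(_eccentricity(adj, s) for s in adj),
    }


if __name__ == "__main__":
    # A small hairpin: two base pairs closing a three-nucleotide loop.
    print(graph_properties("((...))"))
```

A feature vector of this kind could then be passed to any standard classifier; which properties and which classifier GraPPLE actually uses is described in the article itself, not in this sketch.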