High-Accuracy ncRNA Function Prediction via Deep Learning Using Global and Local Sequence Information
Open Access
- 3 June 2023
- journal article
- research article
- Published by MDPI AG in Biomedicines
- Vol. 11 (6), 1631
- https://doi.org/10.3390/biomedicines11061631
Abstract
The prediction of the biological function of non-coding ribonucleic acid (ncRNA) is an important step towards understanding the regulatory mechanisms underlying many diseases. Since non-coding RNAs are present in great abundance in human cells and are functionally diverse, developing functional prediction tools is necessary. With recent advances in non-coding RNA biology and the availability of complete genome sequences for a large number of species, we now have a window of opportunity for studying non-coding RNA biology. However, the computational methods used to predict the non-coding RNA functions are mostly either scarcely accurate, when based on sequence information alone, or prohibitively expensive in terms of computational burden when a secondary structure prediction is needed. We propose a novel computational method to predict the biological function of non-coding RNA genes that is based on a collection of deep network architectures utilizing solely ncRNA sequence information and which does not rely on or require expensive secondary ncRNA structure information. The approach presented in this work exhibits comparable or superior accuracy to methods that employ both sequence and structural features, at a much lower computational cost.This publication has 33 references indexed in Scilit:
- Infernal 1.1: 100-fold faster RNA homology searchesBioinformatics, 2013
- An integrated encyclopedia of DNA elements in the human genomeNature, 2012
- Landscape of transcription in human cellsNature, 2012
- IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programmingBioinformatics, 2011
- Noncoding RNAs in gene regulationWires Systems Biology and Medicine, 2011
- deepBase: a database for deeply annotating and mining deep sequencing dataNucleic Acids Research, 2009
- Identification and classification of ncRNA molecules using graph propertiesNucleic Acids Research, 2009
- Noncoding RNAs database (ncRNAdb)Nucleic Acids Research, 2006
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Initial sequencing and analysis of the human genomeNature, 2001