tRNADB-CE: tRNA gene database well-timed in the era of big sequence data
Open Access
- 1 May 2014
- journal article
- review article
- Published by Frontiers Media SA in Frontiers in Genetics
- Vol. 5, 114
- https://doi.org/10.3389/fgene.2014.00114
Abstract
The tRNA Gene Data Base Curated by Experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses’, 121 chloroplasts’, and 12 eukaryotes’ genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that BLSOM with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.This publication has 21 references indexed in Scilit:
- Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significanceChromosome Research, 2013
- Decoding system for the AUA codon by tRNA Ile with the UAU anticodon in Mycoplasma mobileNucleic Acids Research, 2013
- MODOMICS: a database of RNA modification pathways—2013 updateNucleic Acids Research, 2012
- tRNADB-CE 2011: tRNA gene database curated manually by expertsNucleic Acids Research, 2010
- Discovery and characterization of tRNAIle lysidine synthetase (TilS)FEBS Letters, 2009
- GtRNAdb: a database of transfer RNA genes detected in genomic sequenceNucleic Acids Research, 2008
- tRNAdb 2009: compilation of tRNA sequences and tRNA genesNucleic Acids Research, 2008
- tRNADB-CE: tRNA gene database curated manually by expertsNucleic Acids Research, 2008
- Informatics for Unveiling Hidden Genome SignaturesGenome Research, 2003
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997