LocTree3 prediction of localization
Open Access
- 21 May 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 42 (W1), W350-W355
- https://doi.org/10.1093/nar/gku396
Abstract
The prediction of protein sub-cellular localization is an important step toward elucidating protein function. For each query protein sequence, LocTree2 applies machine learning (profile kernel SVM) to predict the native sub-cellular localization in 18 classes for eukaryotes, in six for bacteria and in three for archaea. The method outputs a score that reflects the reliability of each prediction. LocTree2 has performed on par with or better than any other state-of-the-art method. Here, we report the availability of LocTree3 as a public web server. The server includes the machine learning-based LocTree2 and improves over it through the addition of homology-based inference. Assessed on sequence-unique data, LocTree3 reached an 18-state accuracy Q18 = 80 ± 3% for eukaryotes and a six-state accuracy Q6 = 89 ± 4% for bacteria. The server accepts submissions ranging from single protein sequences to entire proteomes. Response time of the unloaded server is about 90 s for a 300-residue eukaryotic protein and a few hours for an entire eukaryotic proteome, not considering the generation of the alignments. For over 1000 entirely sequenced organisms, the predictions are directly available as downloads. The web server is available at http://www.rostlab.org/services/loctree3.
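The two ideas in the abstract, adding homology-based inference on top of a machine learning prediction and scoring the result as an n-state accuracy (Q18, Q6), can be illustrated with a minimal Python sketch. The abstract does not state the actual decision rule used by LocTree3, so the "use the homolog's label when one is available, otherwise fall back to the machine learning label" rule below is an assumption, as are the function names `combine` and `q_accuracy` and the toy data. The sketch also assumes that the n-state accuracy is the percentage of proteins whose predicted class matches the observed class.

```python
from typing import Optional, Sequence


def combine(homology_label: Optional[str], ml_label: str) -> str:
    """Assumed rule: prefer a label transferred from an annotated homolog,
    otherwise fall back to the machine learning label."""
    return homology_label if homology_label is not None else ml_label


def q_accuracy(observed: Sequence[str], predicted: Sequence[str]) -> float:
    """Percentage of proteins whose predicted class equals the observed class."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("need at least one protein")
    correct = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * correct / len(observed)


# Toy data, invented for illustration only (not taken from the paper).
observed = ["nucleus", "cytoplasm", "mitochondrion", "secreted"]
ml_labels = ["nucleus", "nucleus", "mitochondrion", "cytoplasm"]
# None marks a protein for which no annotated homolog was found.
homology_labels = [None, "cytoplasm", None, None]

combined = [combine(h, m) for h, m in zip(homology_labels, ml_labels)]

print(q_accuracy(observed, ml_labels))  # 50.0
print(q_accuracy(observed, combined))   # 75.0
```

In this toy case the single homology transfer corrects one machine learning error, which raises the accuracy from 50% to 75%. The uncertainty reported in the abstract (for example ± 3%) would need a separate error estimate that this sketch does not attempt.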