Computer-Assisted Protein Domain Boundary Prediction Using the Dom-Pred Server
- 1 April 2007
- journal article
- research article
- Published by Bentham Science Publishers Ltd. in Current Protein & Peptide Science
- Vol. 8 (2), 181-188
- https://doi.org/10.2174/138920307780363415
Abstract
Domain prediction from sequence is a particularly challenging task, and currently, a large variety of different methodologies are employed to tackle the task. Here we try to classify these diverse approaches into a number of broad categories. Completely automatic domain prediction from sequence alone is currently fraught with problems, but this should not be so surprising since human experts currently have significant disagreement on domain assignment even when given the structures. It can be argued that we should only test the domain prediction methods on benchmark data that human experts agree upon, and this is the approach we take in this paper. Even for the data sets on which human experts agree, automatic structure-based domain assignment still cannot always agree, and so again it is still unlikely that domain prediction methods will reliably obtain correct results completely automatically. We make the argument that computer-assisted domain prediction is a more achievable goal. With this aim in mind, we present the DomPred server. This server provides the user with the results from two completely different categories of method (DPS and DomSSEA). In this paper, each method is individually benchmarked against one of the latest domain prediction benchmarks to provide information about their respective reliabilities. A variety of different benchmark scores are employed since the accuracy of a domain prediction method depends critically on what types of results one wishes to obtain (single/multi-domain classification, domain number, residue linker positions, etc.). Also, both of these methods, implemented within the DomPred server, can suggest alternative domain predictions, allowing the user to make the final decision based on these results and applying their own background knowledge to the problem. The DomPred server is available from the URL: http://bioinf.cs.ucl.ac.uk/software.html.