A Two-Layer SVM Ensemble-Classifier to Predict Interface Residue Pairs of Protein Trimers
Open Access
- 22 September 2020
- Vol. 25 (19), 4353
- https://doi.org/10.3390/molecules25194353
Abstract
Study of interface residue pairs is important for understanding the interactions between monomers inside a trimer protein–protein complex. We developed a two-layer support vector machine (SVM) ensemble-classifier that considers physicochemical and geometric properties of amino acids and the influence of surrounding amino acids. Different descriptors and different combinations may give different prediction results. We propose feature combination engineering based on correlation coefficients and F-values. The accuracy of our method is 65.38% in independent test set, indicating biological significance. Our predictions are consistent with the experimental results. It shows the effectiveness and reliability of our method to predict interface residue pairs of protein trimers.This publication has 35 references indexed in Scilit:
- Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich eraProceedings of the National Academy of Sciences of the United States of America, 2013
- A novel method for protein–protein interaction site prediction using phylogenetic substitution modelsProteins: Structure, Function, and Bioinformatics, 2011
- Improving the prediction of protein binding sites by combining heterogeneous data and Voronoi diagramsBMC Bioinformatics, 2011
- Interactome Networks and Human DiseaseCell, 2011
- PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKaPredictionsJournal of Chemical Theory and Computation, 2011
- BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence featuresBMC Systems Biology, 2010
- Genome analysis with inter-nucleotide distancesBioinformatics, 2009
- Structural Basis for Antibody Discrimination between Two Hormones That Recognize the Parathyroid Hormone ReceptorOnline Journal of Public Health Informatics, 2009
- BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequencesNucleic Acids Research, 2006
- Environment and exposure to solvent of protein atoms. Lysozyme and insulinJournal of Molecular Biology, 1973