Propedia: a database for protein–peptide identification based on a hybrid clustering algorithm

Open Access

2 January 2021

journal article
research article
Published by Springer Science and Business Media LLC in BMC Bioinformatics

Vol. 22 (1), 1-20
https://doi.org/10.1186/s12859-020-03881-z

Abstract

Protein–peptide interactions play a fundamental role in a wide variety of biological processes, such as cell signaling, regulatory networks, immune responses, and enzyme inhibition. Peptides are characterized by low toxicity and small interface areas; therefore, they are good targets for therapeutic strategies, rational drug design and protein inhibition. Approximately 10% of the ethical pharmaceutical market is protein/peptide-based. Furthermore, it is estimated that 40% of protein interactions are mediated by peptides. Despite the fast increase in the volume of biological data, particularly on sequences and structures, there remains a lack of broad and comprehensive protein–peptide databases and tools that allow the retrieval, characterization and understanding of protein–peptide recognition and consequently support peptide design.

We introduce Propedia, a comprehensive and up-to-date database with a web interface that permits clustering, searching and visualizing of protein–peptide complexes according to varied criteria. Propedia comprises over 19,000 high-resolution structures from the Protein Data Bank, including structural and sequence information from protein–peptide complexes. The main advantage of Propedia over other peptide databases is that it allows a more comprehensive analysis of similarity and redundancy. It was constructed based on a hybrid clustering algorithm that compares and groups peptides by sequences, interface structures and binding sites. Propedia is available through a graphical, user-friendly and functional interface where users can retrieve and analyze complexes and download the data set from each search. We performed case studies and verified the utility of Propedia scores for ranking promising interacting peptides. In a study on predicting peptides that inhibit the SARS-CoV-2 main protease, we showed that Propedia scores related to the similarity between different peptide complexes with the SARS-CoV-2 main protease agree with molecular dynamics free energy calculations.

Propedia is a database and tool to support structure-based rational design of peptides for special purposes. Protein–peptide interactions can be useful for predicting, classifying and scoring complexes, as well as for designing new molecules. Propedia is up to date and available as a ready-to-use web server with a friendly and resourceful interface at: https://bioinfo.dcc.ufmg.br/propedia
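
The abstract does not spell out how the hybrid clustering works, so the following is only a minimal sketch of one ingredient it mentions: grouping peptides by sequence similarity. It is not Propedia's algorithm. The greedy "leader" strategy, the 0.8 threshold, the function names, and the use of Python's `difflib.SequenceMatcher` as a stand-in for a proper sequence-identity measure are all assumptions made for illustration, and the peptide strings are toy inputs rather than database entries. Interface-structure and binding-site comparisons are not covered.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Stand-in similarity in [0, 1]; NOT the measure used by Propedia."""
    return SequenceMatcher(None, a, b).ratio()


def greedy_cluster(peptides, threshold=0.8):
    """Greedy leader clustering (hypothetical helper).

    Each peptide joins the first cluster whose representative
    (the cluster's first member) is at least `threshold` similar;
    otherwise it starts a new cluster.
    """
    clusters = []
    for pep in peptides:
        for cluster in clusters:
            if similarity(pep, cluster[0]) >= threshold:
                cluster.append(pep)
                break
        else:
            # No existing representative was similar enough.
            clusters.append([pep])
    return clusters


# Toy sequences chosen for illustration only.
peptides = ["AVLQSGFR", "AVLQSGFK", "TSAVLQSG", "GGGGGGGG"]
clusters = greedy_cluster(peptides, threshold=0.8)
for i, cluster in enumerate(clusters):
    print(i, cluster)
```

A leader-style pass like this is order-dependent and only compares against one representative per cluster; a hybrid scheme would presumably combine such a sequence step with structural and binding-site comparisons, which this sketch leaves out.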

Funding Information

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (51/2013 - 23038.004007/2014-82)

