Deconvolving multiplexed protease signatures with substrate reduction and activity clustering
Open Access
- 3 September 2019
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 15 (9), e1006909
- https://doi.org/10.1371/journal.pcbi.1006909
Abstract
Proteases are multifunctional, promiscuous enzymes that degrade proteins as well as peptides and drive important processes in health and disease. Current technology has enabled the construction of libraries of peptide substrates that detect protease activity, which provides valuable biological information. An ideal library would be orthogonal, such that each protease only hydrolyzes one unique substrate, however this is impractical due to off-target promiscuity (i.e., one protease targets multiple different substrates). Therefore, when a library of probes is exposed to a cocktail of proteases, each protease activates multiple probes, producing a convoluted signature. Computational methods for parsing these signatures to estimate individual protease activities primarily use an extensive collection of all possible protease-substrate combinations, which require impractical amounts of training data when expanding to search for more candidate substrates. Here we provide a computational method for estimating protease activities efficiently by reducing the number of substrates and clustering proteases with similar cleavage activities into families. We envision that this method will be used to extract meaningful diagnostic information from biological samples. The activity of enzymatic proteins, which are called proteases, drives numerous important processes in health and disease: including cancer, immunity, and infectious disease. Many labs have developed useful diagnostics by designing sensors that measure the activity of these proteases. However, if we want to detect multiple proteases at the same time, it becomes impractical to design sensors that only detect one protease. This is due to a phenomenon called protease promiscuity, which means that proteases will activate multiple different sensors. Computational methods have been created to solve this problem, but the challenge is that these often require large amounts of training data. Further, completely different proteases may be detected by the same subset of sensors. In this work, we design a computational method to overcome this problem by clustering similar proteases into "subfamilies", which increases estimation accuracy. Further, our method tests multiple combinations of sensors to maintain accuracy while minimizing the number of sensors used. Together, we envision that this work will increase the amount of useful information we can extract from biological samples, which may lead to better clinical diagnostics.This publication has 53 references indexed in Scilit:
- Functional imaging of proteases: recent advances in the design and application of substrate-based and activity-based probesCurrent Opinion in Chemical Biology, 2011
- Proteolytic Activity Matrix Analysis (PrAMA) for simultaneous determination of multiple protease activitiesIntegrative Biology, 2011
- Activity-based protein profiling for biochemical pathway discovery in cancerNature Reviews Cancer, 2010
- Endolysosomal proteases and their inhibitors in immunityNature Reviews Immunology, 2009
- Monitoring peptidase activities in complex proteomes by MALDI-TOF mass spectrometryNature Protocols, 2009
- Proteases: Multifunctional Enzymes in Life and Disease*Journal of Biological Chemistry, 2008
- In search of partners: linking extracellular proteases to substratesNature Reviews Molecular Cell Biology, 2007
- Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging ReagentsMolecular & Cellular Proteomics, 2004
- Tandem Mass Tags: A Novel Quantification Strategy for Comparative Analysis of Complex Protein Mixtures by MS/MSAnalytical Chemistry, 2003
- Monomeric Structures of the Zymogen and Active Catalytic Domain of Complement Protease C1r: Further Insights into the C1 Activation MechanismStructure, 2002