Large-Scale Discovery of Microbial Fibrillar Adhesins and Identification of Novel Members of Adhesive Domain Families
Open Access
- 24 May 2022
- journal article
- research article
- Published by American Society for Microbiology in Journal of Bacteriology
- Vol. 204 (6), e0010722
- https://doi.org/10.1128/jb.00107-22
Abstract
Fibrillar adhesins are bacterial cell surface proteins that mediate interactions with the environment, including host cells during colonization or other bacteria during biofilm formation. These proteins are characterized by a stalk that projects the adhesive domain closer to the binding target. Fibrillar adhesins evolve quickly and thus can be difficult to computationally identify, yet they represent an important component for understanding bacterium-host interactions. To detect novel fibrillar adhesins, we developed a random forest prediction approach based on common characteristics we identified for this protein class. We applied this approach to Firmicutes and Actinobacteria proteomes, yielding over 6,500 confidently predicted fibrillar adhesins. To verify the approach, we investigated predicted fibrillar adhesins that lacked a known adhesive domain. Based on these proteins, we identified 24 sequence clusters representing potential novel members of adhesive domain families. We used AlphaFold to verify that 15 clusters showed structural similarity to known adhesive domains, such as the TED domain. Overall, our study has made a significant contribution to the number of known fibrillar adhesins and has enabled us to identify novel members of adhesive domain families involved in bacterial pathogenesis.
IMPORTANCE Fibrillar adhesins are a class of bacterial cell surface proteins that enable bacteria to interact with their environment. We developed a machine learning approach to identify fibrillar adhesins and applied this classification approach to the Firmicutes and Actinobacteria Reference Proteomes database. This method allowed us to detect a high number of novel fibrillar adhesins and also novel members of adhesive domain families. To confirm our predictions of these potential adhesin protein domains, we predicted their structure using the AlphaFold tool.
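The random forest idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the three features (protein length, number of repeated stalk-like domains, presence of a cell-wall anchor motif), the toy training rows, and all function names are hypothetical and chosen only for illustration. To stay dependency-free, each "tree" is a depth-1 tree (a stump) trained on a bootstrap sample with one randomly chosen feature; a real analysis would more likely use an established library such as scikit-learn.

```python
import random
from collections import Counter

# Hypothetical toy data (assumption, not from the paper):
# (length in residues, repeated stalk-like domains, anchor motif 0/1) -> label
TRAIN = [
    ((2100, 12, 1), 1), ((1500, 8, 1), 1), ((1800, 10, 1), 1), ((2600, 15, 1), 1),
    ((300, 0, 0), 0), ((450, 1, 0), 0), ((250, 0, 0), 0), ((600, 1, 0), 0),
]

def gini(labels):
    """Gini impurity of a list of binary labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def majority(labels, fallback):
    """Most common label, or the fallback when the list is empty."""
    return Counter(labels).most_common(1)[0][0] if labels else fallback

def fit_stump(rows, feature):
    """Fit a depth-1 tree on one feature by minimizing weighted Gini impurity."""
    overall = majority([y for _, y in rows], 0)
    values = sorted({x[feature] for x, _ in rows})
    # (impurity, threshold, left label, right label); constant stump by default
    best = (float("inf"), float("inf"), overall, overall)
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2  # candidate threshold: midpoint between neighbors
        left = [y for x, y in rows if x[feature] <= t]
        right = [y for x, y in rows if x[feature] > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
        if score < best[0]:
            best = (score, t, majority(left, overall), majority(right, overall))
    _, t, left_label, right_label = best
    return feature, t, left_label, right_label

def fit_forest(rows, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each with a randomly chosen feature."""
    rng = random.Random(seed)
    n_features = len(rows[0][0])
    forest = []
    for _ in range(n_trees):
        sample = rng.choices(rows, k=len(rows))  # bootstrap: sample with replacement
        forest.append(fit_stump(sample, rng.randrange(n_features)))
    return forest

def predict_score(forest, x):
    """Fraction of trees voting for the positive (adhesin-like) class."""
    votes = [left if x[f] <= t else right for f, t, left, right in forest]
    return sum(votes) / len(votes)

forest = fit_forest(TRAIN)
print(predict_score(forest, (2000, 11, 1)))  # long, repeat-rich, anchored protein
print(predict_score(forest, (350, 0, 0)))    # short protein without repeats
```

The vote fraction plays the role of a confidence score; a cutoff on such a score is one plausible way to separate "confidently predicted" proteins from borderline ones, though the threshold used in the study is not stated here.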