Discovery of novel community-relevant small proteins in a simplified human intestinal microbiome
Open Access
- 23 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Microbiome
- Vol. 9 (1), 1-19
- https://doi.org/10.1186/s40168-020-00981-z
Abstract
Background: The intestinal microbiota plays a crucial role in protecting the host from pathogenic microbes, modulating immunity and regulating metabolic processes. We studied the simplified human intestinal microbiota (SIHUMIx) consisting of eight bacterial species with a particular focus on the discovery of novel small proteins with less than 100 amino acids (= sProteins), some of which may contribute to shape the simplified human intestinal microbiota. Although sProteins carry out a wide range of important functions, they are still often missed in genome annotations, and little is known about their structure and function in individual microbes and especially in microbial communities. Results: We created a multi-species integrated proteogenomics search database (iPtgxDB) to enable a comprehensive identification of novel sProteins. Six of the eight SIHUMIx species, for which no complete genomes were available, were sequenced and de novo assembled. Several proteomics approaches including two earlier optimized sProtein enrichment strategies were applied to specifically increase the chances for novel sProtein discovery. The search of tandem mass spectrometry (MS/MS) data against the multi-species iPtgxDB enabled the identification of 31 novel sProteins, of which the expression of 30 was supported by metatranscriptomics data. Using synthetic peptides, we were able to validate the expression of 25 novel sProteins. The comparison of sProtein expression in each single strain versus a multi-species community cultivation showed that six of these sProteins were only identified in the SIHUMIx community indicating a potentially important role of sProteins in the organization of microbial communities. Two of these novel sProteins have a potential antimicrobial function. Metabolic modelling revealed that a third sProtein is located in a genomic region encoding several enzymes relevant for the community metabolism within SIHUMIx. Conclusions: We outline an integrated experimental and bioinformatics workflow for the discovery of novel sProteins in a simplified intestinal model system that can be generically applied to other microbial communities. The further analysis of novel sProteins uniquely expressed in the SIHUMIx multi-species community is expected to enable new insights into the role of sProteins on the functionality of bacterial communities such as those of the human intestinal tract.Keywords
Funding Information
- Deutsche Forschungsgemeinschaft (379643916)
- SNSF (156320, 188722)
- Deutsche Forschungsgemeinschaft (EXC 22167)
This publication has 96 references indexed in Scilit:
- Identification of new protein coding sequences and signal peptidase cleavage sites of Helicobacter pylori strain 26695 by proteogenomicsJournal of Proteomics, 2013
- A cross-platform toolkit for mass spectrometry and proteomicsNature Biotechnology, 2012
- Integrative Genomics Viewer (IGV): high-performance genomics data visualization and explorationBriefings in Bioinformatics, 2012
- Mannose 6-phosphate receptor homology (MRH) domain-containing lectins in the secretory pathwayBiochimica et Biophysica Acta (BBA) - General Subjects, 2011
- A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomicsJournal of Proteomics, 2010
- Optimization of parameters for coverage of low molecular weight proteinsAnalytical and Bioanalytical Chemistry, 2010
- Value of Using Multiple Proteases for Large-Scale Mass Spectrometry-Based ProteomicsJournal of Proteome Research, 2010
- Circos: An information aesthetic for comparative genomicsGenome Research, 2009
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteinsNucleic Acids Research, 2006