Function Prediction and Analysis of Mycobacterium tuberculosis Hypothetical Proteins
Open Access
- 13 June 2012
- journal article
- research article
- Published by MDPI AG in International Journal of Molecular Sciences
- Vol. 13 (6), 7283-7302
- https://doi.org/10.3390/ijms13067283
Abstract
High-throughput biology technologies have yielded complete genome sequences and functional genomics data for several organisms, including crucial microbial pathogens of humans, animals and plants. However, up to 50% of genes within a genome are often labeled “unknown”, “uncharacterized” or “hypothetical”, limiting our understanding of virulence and pathogenicity of these organisms. Even though biological functions of proteins encoded by these genes are not known, many of them have been predicted to be involved in key processes in these organisms. In particular, for Mycobacterium tuberculosis, some of these “hypothetical” proteins, for example those belonging to the Pro-Glu or Pro-Pro-Glu (PE/PPE) family, have been suspected to play a crucial role in the intracellular lifestyle of this pathogen, and may contribute to its survival in different environments. We have generated a functional interaction network for Mycobacterium tuberculosis proteins and used this to predict functions for many of its hypothetical proteins. Here we performed functional enrichment analysis of these proteins based on their predicted biological functions to identify annotations that are statistically relevant, and analysed and compared network properties of hypothetical proteins to the known proteins. From the statistically significant annotations and network information, we have tried to derive biologically meaningful annotations relatedto infection and disease. This quantitative analysis provides an overview of the functional contributions of Mycobacterium tuberculosis “hypothetical” proteins to many basic cellular functions, including its adaptability in the host system and its ability to evade the host immune response.This publication has 53 references indexed in Scilit:
- The IntAct molecular interaction database in 2012Nucleic Acids Research, 2011
- Proteomic Definition of the Cell Wall of Mycobacterium tuberculosisJournal of Proteome Research, 2010
- The Gene Ontology in 2010: extensions and refinementsNucleic Acids Research, 2009
- The IntAct molecular interaction database in 2010Nucleic Acids Research, 2009
- The Universal Protein Resource (UniProt) in 2010Nucleic Acids Research, 2009
- PPE and PE_PGRS proteins of Mycobacterium marinum are transported via the type VII secretion system ESX‐5Molecular Microbiology, 2009
- The GOA database in 2009--an integrated Gene Ontology Annotation resourceNucleic Acids Research, 2009
- DIMA 2.0 predicted and known domain interactionsNucleic Acids Research, 2007
- UniProt: the Universal Protein knowledgebaseNucleic Acids Research, 2004
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997