Functional assignment of metagenomic data: challenges and applications

Open Access

31 October 2012

journal article
research article
Published by Oxford University Press (OUP) in Briefings in Bioinformatics

Vol. 13 (6), 711-727
https://doi.org/10.1093/bib/bbs033

Abstract

Metagenomic sequencing provides a unique opportunity to explore Earth's limitless environments harboring scores of yet unknown and mostly unculturable microbes and other organisms. Functional analysis of the metagenomic data plays a central role in projects aiming to explore the most essential questions in microbiology, namely 'In a given environment, among the microbes present, what are they doing, and how are they doing it?' Toward this goal, several large-scale metagenomic projects have recently been conducted or are currently underway. Functional analysis of metagenomic data mainly suffers from the vast amount of data generated in these projects. The sheer amount of data requires much computational time and storage space. These problems are compounded by other factors potentially affecting the functional analysis, including sample preparation, sequencing method and average genome size of the metagenomic samples. In addition, the read-lengths generated during sequencing influence sequence assembly, gene prediction and subsequently the functional analysis. The level of confidence for functional predictions increases with increasing read-length. Usually, the most reliable functional annotations for metagenomic sequences are achieved using homology-based approaches against publicly available reference sequence databases. Here, we present an overview of the current state of functional analysis of metagenomic sequence data, bottlenecks frequently encountered and possible solutions in light of currently available resources and tools. Finally, we provide some examples of applications from recent metagenomic studies that have been successfully conducted in spite of the known difficulties.

This publication has 121 references indexed in Scilit.

Cited by 135 articles