A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes
Top Cited Papers
Open Access
- 24 July 2014
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Communications
- Vol. 5 (1), 4498
- https://doi.org/10.1038/ncomms5498
Abstract
Metagenomics, or sequencing of the genetic material from a complete microbial community, is a promising tool to discover novel microbes and viruses. Viral metagenomes typically contain many unknown sequences. Here we describe the discovery of a previously unidentified bacteriophage present in the majority of published human faecal metagenomes, which we refer to as crAssphage. Its ~97 kbp genome is six times more abundant in publicly available metagenomes than all other known phages together; it comprises up to 90% and 22% of all reads in virus-like particle (VLP)-derived metagenomes and total community metagenomes, respectively; and it totals 1.68% of all human faecal metagenomic sequencing reads in the public databases. The majority of crAssphage-encoded proteins match no known sequences in the database, which is why it was not detected before. Using a new co-occurrence profiling approach, we predict a Bacteroides host for this phage, consistent with Bacteroides-related protein homologues and a unique carbohydrate-binding domain encoded in the phage genome.This publication has 71 references indexed in Scilit:
- Bacteriophage adhering to mucus provide a non–host-derived immunityProceedings of the National Academy of Sciences of the United States of America, 2013
- Reference-independent comparative metagenomics using cross-assembly: crAssBioinformatics, 2012
- Human gut microbiome viewed across age and geographyNature, 2012
- Hypervariable loci in the human gut viromeProceedings of the National Academy of Sciences of the United States of America, 2012
- Java web tools for PCR, in silico PCR, and oligonucleotide assembly and analysisGenomics, 2011
- Statistical structure of host–phage interactionsProceedings of the National Academy of Sciences of the United States of America, 2011
- Viruses in the faecal microbiota of monozygotic twins and their mothersNature, 2010
- Identifying bacterial genes and endosymbiont DNA with GlimmerBioinformatics, 2007
- An obesity-associated gut microbiome with increased capacity for energy harvestNature, 2006
- Community structure and metabolism through reconstruction of microbial genomes from the environmentNature, 2004