Crass: identification and reconstruction of CRISPR from unassembled metagenomic data
Open Access
- 19 March 2013
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 41 (10), e105
- https://doi.org/10.1093/nar/gkt183
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR) constitute a bacterial and archaeal adaptive immune system that protects against bacteriophage (phage). Analysis of CRISPR loci reveals the history of phage infections and provides a direct link between phage and their hosts. All current tools for CRISPR identification have been developed to analyse completed genomes and are not well suited to the analysis of metagenomic data sets, where CRISPR loci are difficult to assemble owing to their repetitive structure and population heterogeneity. Here, we introduce a new algorithm, Crass, which is designed to identify and reconstruct CRISPR loci from raw metagenomic data without the need for assembly or prior knowledge of CRISPR in the data set. CRISPR in assembled data are often fragmented across many contigs/scaffolds and do not fully represent the population heterogeneity of CRISPR loci. Crass identified substantially more CRISPR in metagenomes previously analysed using assembly-based approaches. Using Crass, we were able to detect CRISPR that contained spacers with sequence homology to phage in the system, which would not have been identified using other approaches. The increased sensitivity, specificity and speed of Crass will facilitate comprehensive analysis of CRISPR in metagenomic data sets, increasing our understanding of phage-host interactions and co-evolution within microbial communities.