The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats
Open Access
- 23 May 2007
- journal article
- database
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 8 (1), 1-10
- https://doi.org/10.1186/1471-2105-8-172
Abstract
Background: In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description: We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion: It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. CRISPRdb is accessible at http://crispr.u-psud.fr/crisprKeywords
This publication has 24 references indexed in Scilit:
- The Repetitive DNA Elements Called CRISPRs and Their Associated Genes: Evidence of Horizontal Transfer Among ProkaryotesJournal of Molecular Evolution, 2006
- A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomesPLoS Computational Biology, 2005
- Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal originMicrobiology, 2005
- CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studiesMicrobiology, 2005
- Genus-Specific Protein Binding to the Large Clusters of DNA Repeats (Short Regularly Spaced Repeats) Present in Sulfolobus GenomesJournal of Bacteriology, 2003
- Identification of a Novel Family of Sequence Repeats among ProkaryotesOMICS: A Journal of Integrative Biology, 2002
- Biological significance of a family of regularly spaced repeats in the genomes of Archaea, Bacteria and mitochondriaMolecular Microbiology, 2000
- Rapid Molecular Genetic Subtyping of Serotype M1 Group AStreptococcusStrainsEmerging Infectious Diseases, 1999
- Long stretches of short tandem repeats are present in the largest replicons of the Archaea Haloferax mediterranei and Haloferax volcanii and could be involved in replicon partitioningMolecular Microbiology, 1995
- Nature of DNA polymorphism in the direct repeat cluster of Mycobacterium tuberculosis; application for strain differentiation by a novel typing methodMolecular Microbiology, 1993