Beware of mis-assembled genomes

Open Access

25 October 2005

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 21 (24), 4320-4321
https://doi.org/10.1093/bioinformatics/bti769

Abstract

With hundreds of genomes now in GenBank, researchers might be forgiven for assuming that genome sequence data are correct, at least at a large scale. Certainly there might be errors at some small rate, perhaps 1 in 50 000 or 100 000 bases (Schmutz et al., 2004; Read et al., 2002), but at a large scale these genomes are put together correctly, are not they? Well, not always.

Keywords

This publication has 9 references indexed in Scilit:

The Genome Assembly Archive: A New Public Resource
PLoS Biology, 2004
Quality assessment of the human genome sequence
Nature, 2004
The Atlas Genome Assembly System
Genome Research, 2004
PCAP: A Whole-Genome Assembly Program
Genome Research, 2003
Whole-Genome Sequence Assembly for Mammalian Genomes: Arachne 2
Genome Research, 2003
The Phusion Assembler
Genome Research, 2002
Whole-Genome Shotgun Assembly and Analysis of the Genome of Fugu rubripes
Science, 2002
Comparative Genome Sequencing for Discovery of Novel Polymorphisms in Bacillus anthracis
Science, 2002
A Whole-Genome Assembly of Drosophila
Science, 2000

Cited by 147 articles