Multiplex sequencing of plant chloroplast genomes using Solexa sequencing-by-synthesis technology
Open Access
- 27 August 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (19), e122
- https://doi.org/10.1093/nar/gkn502
Abstract
Organellar DNA sequences are widely used in evolutionary and population genetic studies, however, the conservative nature of chloroplast gene and genome evolution often limits phylogenetic resolution and statistical power. To gain maximal access to the historical record contained within chloroplast genomes, we have adapted multiplex sequencing-by-synthesis (MSBS) to simultaneously sequence multiple genomes using the Illumina Genome Analyzer. We PCR-amplified ∼120 kb plastomes from eight species (seven Pinus , one Picea ) in 35 reactions. Pooled products were ligated to modified adapters that included 3 bp indexing tags and samples were multiplexed at four genomes per lane. Tagged microreads were assembled by de novo and reference-guided assembly methods, using previously published Pinus plastomes as surrogate references. Assemblies for these eight genomes are estimated at 88–94% complete, with an average sequence depth of 55× to 186×. Mononucleotide repeats interrupt contig assembly with increasing repeat length, and we estimate that the limit for their assembly is 16 bp. Comparisons to 37 kb of Sanger sequence show a validated error rate of 0.056%, and conspicuous errors are evident from the assembly process. This efficient sequencing approach yields high-quality draft genomes and should have immediate applicability to genomes with comparable complexity.Keywords
This publication has 36 references indexed in Scilit:
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphsGenome Research, 2008
- Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplexNature Methods, 2008
- Small RNAs and the regulation of cis-natural antisense transcripts in ArabidopsisBMC Molecular Biology, 2008
- Direct selection of human genomic loci by microarray hybridizationNature Methods, 2007
- Microarray-based genomic selection for high-throughput resequencingNature Methods, 2007
- Targeted high-throughput sequencing of tagged nucleic acid samplesNucleic Acids Research, 2007
- Constrained hidden Markov models for population-based haplotypingBMC Bioinformatics, 2007
- The Use of Coded PCR Primers Enables High-Throughput Sequencing of Multiple Homolog Amplification Products by 454 Parallel SequencingPLOS ONE, 2007
- Genome-Wide Profiling and Analysis of Arabidopsis siRNAsPLoS Biology, 2007
- Widespread positive selection in the photosynthetic Rubisco enzymeBMC Evolutionary Biology, 2007