Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing
- 25 March 2008
- journal article
- Published by Wiley in Molecular Ecology
- Vol. 17 (7), 1636-1647
- https://doi.org/10.1111/j.1365-294x.2008.03666.x
Abstract
We present a de novo assembly of a eukaryote transcriptome using 454 pyrosequencing data. The Glanville fritillary butterfly (Melitaea cinxia; Lepidoptera: Nymphalidae) is a prominent species in population biology but had no previous genomic data. Sequencing runs using two normalized complementary DNA collections from a genetically diverse pool of larvae, pupae, and adults yielded 608 053 expressed sequence tags (mean length = 110 nucleotides), which assembled into 48 354 contigs (sets of overlapping DNA segments) and 59 943 singletons. blast comparisons confirmed the accuracy of the sequencing and assembly, and indicated the presence of c. 9000 unique genes, along with > 6000 additional microarray-confirmed unannotated contigs. Average depth of coverage was 6.5-fold for the longest 4800 contigs (348–2849 bp in length), sufficient for detecting large numbers of single nucleotide polymorphisms. Oligonucleotide microarray probes designed from the assembled sequences showed highly repeatable hybridization intensity and revealed biological differences among individuals. We conclude that 454 sequencing, when performed to provide sufficient coverage depth, allows de novo transcriptome assembly and a fast, cost-effective, and reliable method for development of functional genomic tools for nonmodel species. This development narrows the gap between approaches based on model organisms with rich genetic resources vs. species that are most tractable for ecological and evolutionary studies.Keywords
This publication has 56 references indexed in Scilit:
- SNP discovery via 454 transcriptome sequencingThe Plant Journal, 2007
- Sampling the Arabidopsis Transcriptome with Massively Parallel PyrosequencingPlant Physiology, 2007
- SNP discovery by mismatch-targeting of Mu transpositionNucleic Acids Research, 2007
- Different levels of alternative splicing among eukaryotesNucleic Acids Research, 2006
- Gene discovery and annotation using LCM-454 transcriptome sequencingGenome Research, 2006
- A tutorial on statistical methods for population association studiesNature Reviews Genetics, 2006
- A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomesProceedings of the National Academy of Sciences of the United States of America, 2006
- Simple cDNA normalization using kamchatka crab duplex-specific nucleaseNucleic Acids Research, 2004
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- The Genome Sequence of Drosophila melanogasterScience, 2000