Analysis of High-Throughput Sequencing and Annotation Strategies for Phage Genomes
Open Access
- 5 February 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 5 (2), e9083
- https://doi.org/10.1371/journal.pone.0009083
Abstract
Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage. To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling. These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.Keywords
This publication has 52 references indexed in Scilit:
- Layers of Evolvability in a Bacteriophage Life History TraitMolecular Biology and Evolution, 2009
- Viral photosynthetic reaction center genes and transcripts in the marine environmentThe ISME Journal, 2007
- RNAmmer: consistent and rapid annotation of ribosomal RNA genesNucleic Acids Research, 2007
- GISMO--gene identification using a support vector machine for ORF classificationNucleic Acids Research, 2006
- MetaGene: prokaryotic gene finding from environmental genome shotgun sequencesNucleic Acids Research, 2006
- A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomesProceedings of the National Academy of Sciences, 2006
- Viruses in the seaNature, 2005
- Bacterial photosynthesis genes in a virusNature, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997