Methods for virus classification and the challenge of incorporating metagenomic sequence data
- 1 June 2015
- journal article
- review article
- Published by Microbiology Society in Journal of General Virology
- Vol. 96 (6), 1193-1206
- https://doi.org/10.1099/jgv.0.000016
Abstract
The division of viruses into orders, families, genera and species provides a classification framework that seeks to organize and make sense of the diversity of viruses infecting animals, plants and bacteria. Classifications are based on similarities in genome structure and organization, the presence of homologous genes and sequence motifs and, at lower levels such as species, host range, nucleotide and antigenic relatedness and epidemiology. Classification below the level of family must also be consistent with phylogeny and virus evolutionary histories. Recently developed methods such as PASC, DEMaRC and NVR offer alternative strategies for genus and species assignments that are based purely on degrees of divergence between genome sequences. They offer the possibility of automating classification of the vast number of novel virus sequences being generated by next-generation metagenomic sequencing. However, distance-based methods struggle to deal with the complex evolutionary history of virus genomes that are shuffled by recombination and reassortment, and where taxonomic lineages evolve at different rates. In biological terms, classifications based on sequence distances alone are also arbitrary whereas the current system of virus taxonomy is of utility precisely because it is primarily based upon phenotypic characteristics. However, a separate system is clearly needed by which virus variants that lack biological information might be incorporated into the ICTV classification even if based solely on sequence relationships to existing taxa. For these, simplified taxonomic proposals and naming conventions represent a practical way to expand the existing virus classification and catalogue our rapidly increasing knowledge of virus diversity.
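To make the idea of distance-based assignment concrete, the following is a minimal Python sketch of the general principle the abstract describes: compute a pairwise divergence between two genome sequences and compare it with fixed cutoffs. It is not the PASC, DEMaRC or NVR algorithm. The function names, the assumption that sequences are already aligned, and both cutoff values are hypothetical and chosen only for illustration.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two pre-aligned sequences.

    Assumption: the inputs are already aligned to equal length and use
    '-' for gaps. Columns with a gap in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # ignore gapped columns
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def assign_rank(distance: float,
                species_cutoff: float = 0.10,
                genus_cutoff: float = 0.40) -> str:
    """Map a divergence value to a coarse taxonomic relationship.

    Both cutoffs are hypothetical placeholders, not published
    demarcation thresholds for any virus family.
    """
    if distance <= species_cutoff:
        return "same species"
    if distance <= genus_cutoff:
        return "same genus"
    return "different genus"


if __name__ == "__main__":
    reference = "ATGCATGCAT"
    query = "ATGAATTCAA"
    d = p_distance(reference, query)
    print(d, assign_rank(d))
```

A sketch like this also shows why the abstract is cautious about such methods: a single pair of fixed cutoffs takes no account of lineages that evolve at different rates, and one genome-wide distance can hide regions with different histories in recombinant or reassorted genomes.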