Genome-Wide Analysis of Wild-Type Epstein–Barr Virus Genomes Derived from Healthy Individuals of the 1000 Genomes Project
Open Access
- 28 March 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Genome Biology and Evolution
- Vol. 6 (4), 846-860
- https://doi.org/10.1093/gbe/evu054
Abstract
Most people in the world (∼90%) are infected by the Epstein–Barr virus (EBV), which establishes itself permanently in B cells. Infection by EBV is related to a number of diseases including infectious mononucleosis, multiple sclerosis, and different types of cancer. So far, only seven complete EBV strains have been described, all of them coming from donors presenting EBV-related diseases. To perform a detailed comparative genomic analysis of EBV including, for the first time, EBV strains derived from healthy individuals, we reconstructed EBV sequences infecting lymphoblastoid cell lines (LCLs) from the 1000 Genomes Project. As strain B95-8 was used to transform B cells to obtain LCLs, it is always present, but a specific deletion in its genome sets it apart from natural EBV strains. After studying hundreds of individuals, we determined the presence of natural EBV in at least 10 of them and obtained a set of variants specific to wild-type EBV. By mapping the natural EBV reads into the EBV reference genome (NC007605), we constructed nearly complete wild-type viral genomes from three individuals. Adding them to the five disease-derived EBV genomic sequences available in the literature, we performed an in-depth comparative genomic analysis. We found that latency genes harbor more nucleotide diversity than lytic genes and that six out of nine latency-related genes, as well as other genes involved in viral attachment and entry into host cells, packaging, and the capsid, present the molecular signature of accelerated protein evolution rates, suggesting rapid host–parasite coevolution.
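The comparison of nucleotide diversity between latency and lytic genes can be illustrated with a minimal Python sketch. It assumes diversity is measured as the average proportion of pairwise differences (π) across aligned sequences; the abstract does not state which estimator or software the authors used. The function name `nucleotide_diversity` and the toy sequences are hypothetical and are not data from the study.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Return pi: the mean proportion of differing sites over all sequence pairs.

    Assumes the sequences are already aligned and of equal length.
    Sites where either sequence has a gap or ambiguous base are skipped.
    """
    if len(aligned_seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be aligned to the same length")

    total = 0.0
    n_pairs = 0
    for a, b in combinations(aligned_seqs, 2):
        compared = 0
        diffs = 0
        for x, y in zip(a.upper(), b.upper()):
            if x in "ACGT" and y in "ACGT":
                compared += 1
                if x != y:
                    diffs += 1
        if compared:
            total += diffs / compared
            n_pairs += 1
    return total / n_pairs if n_pairs else 0.0


# Toy alignments (hypothetical, for illustration only).
latency_toy = ["ACGTACGT", "ACGTTCGT", "ACGAACGT"]
lytic_toy = ["ACGTACGT", "ACGTACGT", "ACGTACGA"]

pi_latency = nucleotide_diversity(latency_toy)
pi_lytic = nucleotide_diversity(lytic_toy)
print(f"latency pi = {pi_latency:.4f}, lytic pi = {pi_lytic:.4f}")
```

In a real analysis the inputs would be per-gene alignments extracted from the reconstructed genomes, and the per-gene values would then be compared between the two gene classes.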