Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
Open Access
- 19 February 2021
- preprint content
- research article
- Published by Cold Spring Harbor Laboratory
Abstract
Pathogenic mechanisms underlying severe SARS-CoV2 infection remain largely unelucidated. High-throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathogen- and host-derived sequences, providing a means of studying host-pathogen interactions. Recent studies have reported the presence of host-virus chimeric (HVC) RNA in RNA-seq data from SARS-CoV2 infected cells and interpreted these findings as evidence of viral integration in the human genome as a potential pathogenic mechanism. Because SARS-CoV2 is a positive-sense RNA virus that replicates in the cytoplasm and has no nuclear phase in its life cycle, it is biologically unlikely to be in a location where splicing events could result in genome integration. Here, we investigated the biological authenticity of HVC events. In contrast to true biological events such as mRNA splicing and genome rearrangement events, which generate reproducible chimeric sequencing fragments across different biological isolates, we found that HVC events across >100 RNA-seq libraries from patients with COVID-19 and infected cell lines were highly irreproducible. RNA-seq library preparation is inherently error-prone due to random template switching during reverse transcription of RNA to cDNA. By counting chimeric events observed when constructing an RNA-seq library from human RNA and spike-in RNA from an unrelated species, such as fruit fly, we estimated that ~1% of RNA-seq reads are artifactually chimeric. In SARS-CoV2 RNA-seq we found that the frequency of HVC events was, in fact, not greater than this background “noise”. Finally, we developed a novel experimental approach to enrich SARS-CoV2 sequences from bulk RNA of infected cells. This method enriched viral sequences but did not enrich for HVC events, suggesting that the majority of HVC events are, in all likelihood, artifacts of library construction. In conclusion, our findings indicate that HVC events observed in RNA-sequencing libraries from SARS-CoV2 infected cells are extremely rare and are likely artifacts arising from random template switching by reverse transcriptase, sequence alignment errors, or both. Therefore, the observed HVC events do not support SARS-CoV2 fusion to cellular genes or integration into human genomes.
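The background comparison described in the abstract can be illustrated with a short sketch. It assumes the chimeric rate is computed as chimeric reads divided by total reads in the same library; the abstract does not state the exact denominator, so that choice is an assumption. The function names and all read counts below are hypothetical and chosen only for illustration; they are not data from the study. Only the ~1% background figure comes from the abstract.

```python
def chimeric_fraction(chimeric_reads: int, total_reads: int) -> float:
    """Fraction of reads that are chimeric (assumed denominator: all reads counted)."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= chimeric_reads <= total_reads:
        raise ValueError("chimeric_reads must lie between 0 and total_reads")
    return chimeric_reads / total_reads


def exceeds_background(hvc_reads: int, total_reads: int,
                       background_rate: float = 0.01) -> bool:
    """True if the observed HVC fraction is above the artifactual background rate.

    The default of 0.01 mirrors the ~1% estimate reported in the abstract.
    """
    return chimeric_fraction(hvc_reads, total_reads) > background_rate


# Hypothetical spike-in control: human/fruit-fly chimeras among all reads.
background = chimeric_fraction(chimeric_reads=1_000, total_reads=100_000)

# Hypothetical infected-cell library: host-virus chimeras among all reads.
observed = chimeric_fraction(chimeric_reads=50, total_reads=20_000)

print(f"background rate: {background:.4f}")
print(f"observed HVC rate: {observed:.4f}")
print("above background:", exceeds_background(50, 20_000, background))
```

A plain threshold comparison like this ignores sampling variability; a real analysis would more likely use a statistical test across replicate libraries.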