Full-length sequencing of circular DNA viruses and extrachromosomal circular DNA using CIDER-Seq

Abstract
Circular DNA is ubiquitous in nature in the form of plasmids, circular DNA viruses, and extrachromosomal circular DNA (eccDNA) in eukaryotes. Sequencing of such molecules is essential to profiling virus distributions, discovering new viruses and understanding the roles of eccDNAs in eukaryotic cells. Circular DNA enrichment sequencing (CIDER-Seq) is a technique to enrich and accurately sequence circular DNA without the need for polymerase chain reaction amplification, cloning, or computational sequence assembly. The approach is based on randomly primed circular DNA amplification, which is followed by several enzymatic DNA repair steps and then by long-read sequencing. CIDER-Seq includes a custom data analysis package (CIDER-Seq Data Analysis Software 2) that implements the DeConcat algorithm to deconcatenate the long sequencing products of random circular DNA amplification into the intact sequences of the input circular DNA. The CIDER-Seq data analysis package can generate full-length annotated virus genomes, as well as circular DNA sequences of novel viruses. Applications of CIDER-Seq also include profiling of eccDNA molecules such as transposable elements (TEs) from biological samples. The method takes ~2 weeks to complete, depending on the computational resources available. Owing to the present constraints of long-read single-molecule sequencing, the accuracy of circular virus and eccDNA sequences generated by the CIDER-Seq method decreases with increasing sequence length, and the greatest accuracy is obtained for molecules <10 kb long.
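To illustrate what deconcatenation means in this context, the following is a minimal sketch in Python. It is not the DeConcat algorithm, and it does not use the CIDER-Seq Data Analysis Software 2 package. It assumes an idealized, error-free read made of exact tandem copies of one circular molecule, whereas real long reads contain sequencing errors and would need alignment-based comparison. The function names (`smallest_period`, `deconcatenate`, `canonical_rotation`) and the `min_unit` parameter are hypothetical and chosen only for this example.

```python
def smallest_period(read: str, min_unit: int = 1) -> int:
    """Return the smallest repeat length p >= min_unit that explains the read.

    A period p is accepted only if the read spans at least two full copies
    (len(read) >= 2 * p) and every base equals the base p positions later.
    If no such period exists, the read length itself is returned.
    Assumption: the read is error-free, so exact string comparison suffices.
    """
    n = len(read)
    for p in range(max(1, min_unit), n // 2 + 1):
        # The read has period p exactly when it matches itself shifted by p.
        if read[p:] == read[: n - p]:
            return p
    return n


def deconcatenate(read: str, min_unit: int = 1) -> str:
    """Cut one unit-length copy of the repeated sequence out of a concatemer."""
    return read[: smallest_period(read, min_unit)]


def canonical_rotation(seq: str) -> str:
    """Return the lexicographically smallest rotation of a circular sequence.

    A circular molecule has no fixed start, so two linear strings represent
    the same circle if their canonical rotations are identical.
    """
    if not seq:
        return seq
    return min(seq[i:] + seq[:i] for i in range(len(seq)))


if __name__ == "__main__":
    unit = "ACGTTGCA"                    # toy circular sequence (hypothetical)
    read = (unit * 3 + unit[:3])[2:]     # concatemer starting at an arbitrary offset
    recovered = deconcatenate(read)
    # The recovered unit is a rotation of the input circle.
    print(canonical_rotation(recovered) == canonical_rotation(unit))
```

Comparing canonical rotations rather than raw strings reflects the fact that randomly primed amplification can start anywhere on the circle, so the recovered unit is in general a rotation of the reference sequence.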
Funding Information
  • Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (Grant 181602)
  • Fonds De La Recherche Scientifique - FNRS (1.B456.20, M.i.S. F.4515.17)