A pipeline for complete characterization of complex germline rearrangements from long DNA reads
Open Access
- 31 July 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Genome Medicine
- Vol. 12 (1), 1-17
- https://doi.org/10.1186/s13073-020-00762-1
Abstract
Many genetic/genomic disorders are caused by genomic rearrangements. Standard methods can often characterize these variations only partly, e.g., copy number changes or breakpoints. To assess the pathogenicity of a rearrangement, it is important to fully understand the order and orientation of the rearranged fragments, with precise breakpoints.
We performed whole-genome-coverage nanopore sequencing of long DNA reads from four patients with chromosomal translocations. Using our newly developed analysis pipeline, we identified rearrangements relative to a reference human genome, subtracted rearrangements shared by any of 33 control individuals, and determined the order and orientation of the rearranged fragments.
We describe the full characterization of complex chromosomal rearrangements. Filtering out genomic rearrangements seen in controls without the same disease reduced the number of loci per patient from a few thousand to a few dozen. Breakpoint detection was very accurate; we usually see ~0 ± 1 base difference from Sanger sequencing-confirmed breakpoints. For one patient with two reciprocal chromosomal translocations, we find that the translocation points have complex rearrangements of multiple DNA fragments involving 5 chromosomes, which we could order and orient by an automatic algorithm, thereby fully reconstructing the rearrangement. A rearrangement is more than the sum of its parts: some properties, such as sequence loss, can be inferred only after reconstructing the whole rearrangement. In this patient, the rearrangements were evidently caused by shattering of the chromosomes into multiple fragments, which rejoined in a different order and orientation with loss of some fragments.
We developed an effective analytic pipeline that finds chromosomal aberrations in congenital diseases by filtering out benign changes, using long-read sequencing alone. Our algorithm for reconstruction of complex rearrangements is useful for interpreting rearrangements with many breakpoints, e.g., chromothripsis. Our approach promises to fully characterize many congenital germline rearrangements, provided they do not involve poorly understood loci such as centromeric repeats.
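The idea that fragments can be ordered and oriented, and that sequence loss only becomes visible after reconstruction, can be illustrated with a toy sketch. This is not the authors' pipeline or algorithm; it is a minimal illustration under simplifying assumptions. The function `chain_fragments`, the fragment names, the coordinates, and the junction list are all hypothetical. Each fragment is a reference interval, and each junction joins one end of a fragment (`"L"` or `"R"` in reference coordinates) to one end of another.

```python
from typing import Dict, List, Optional, Tuple

# One end of a fragment: (fragment name, side), side is "L" or "R"
# in reference coordinates. Hypothetical representation for this sketch.
End = Tuple[str, str]


def chain_fragments(junctions: List[Tuple[End, End]], start: End) -> List[Tuple[str, str]]:
    """Follow junctions from `start` and return [(fragment, orientation), ...].

    A fragment entered at its left end is read forward ("+");
    a fragment entered at its right end is read reversed ("-").
    """
    # Junctions are symmetric: each joined end points to its partner.
    link: Dict[End, End] = {}
    for a, b in junctions:
        link[a] = b
        link[b] = a

    path: List[Tuple[str, str]] = []
    visited = set()
    end: Optional[End] = start
    while end is not None and end[0] not in visited:
        name, side = end
        visited.add(name)
        path.append((name, "+" if side == "L" else "-"))
        # Leave through the opposite end and jump across the junction, if any.
        far_side = "R" if side == "L" else "L"
        end = link.get((name, far_side))
    return path


# Hypothetical toy data: three adjacent pieces of chr1 and one piece of chr7.
fragments = {
    "A": ("chr1", 0, 1000),
    "B": ("chr1", 1000, 1500),
    "C": ("chr1", 1500, 2500),
    "D": ("chr7", 4000, 9000),
}
# Observed junctions: A's right end joins C's right end (so C is inverted),
# and C's left end joins D's left end.
junctions = [
    (("A", "R"), ("C", "R")),
    (("C", "L"), ("D", "L")),
]

derivative = chain_fragments(junctions, start=("A", "L"))
used = {name for name, _ in derivative}
# Fragments absent from the reconstructed chain are inferred to be lost.
lost = {name: stop - begin
        for name, (_, begin, stop) in fragments.items()
        if name not in used}

print(derivative)  # [('A', '+'), ('C', '-'), ('D', '+')]
print(lost)        # {'B': 500}
```

In this toy case no single junction reveals that fragment B is missing; the 500-base loss only appears once the whole chain A, inverted C, D has been assembled. Real data would also need to handle several derivative chromosomes, ambiguous or missing junctions, and repeats, none of which this sketch attempts.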