Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA
Open Access
- 6 March 2017
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Genetics
- Vol. 49 (4), 635-642
- https://doi.org/10.1038/ng.3805
Abstract
Kun Zhang and colleagues present a metric called methylation haplotype load (MHL) that quantifies methylation patterns within blocks of tightly linked CpG dinucleotides. They show that the MHL can distinguish samples from different human somatic tissues and that it can be used to improve detection of cancer-derived circulating DNA and identify its tissue of origin. Kun Zhang and colleagues present a metric called methylation haplotype load (MHL) that quantifies methylation patterns within blocks of tightly linked CpG dinucleotides. They show that the MHL can distinguish samples from different human somatic tissues and that it can be used to improve detection of cancer-derived circulating DNA and identify its tissue of origin. Adjacent CpG sites in mammalian genomes can be co-methylated owing to the processivity of methyltransferases or demethylases, yet discordant methylation patterns have also been observed, which are related to stochastic or uncoordinated molecular processes. We focused on a systematic search and investigation of regions in the full human genome that show highly coordinated methylation. We defined 147,888 blocks of tightly coupled CpG sites, called methylation haplotype blocks, after analysis of 61 whole-genome bisulfite sequencing data sets and validation with 101 reduced-representation bisulfite sequencing data sets and 637 methylation array data sets. Using a metric called methylation haplotype load, we performed tissue-specific methylation analysis at the block level. Subsets of informative blocks were further identified for deconvolution of heterogeneous samples. Finally, using methylation haplotypes we demonstrated quantitative estimation of tumor load and tissue-of-origin mapping in the circulating cell-free DNA of 59 patients with lung or colorectal cancer.Keywords
This publication has 45 references indexed in Scilit:
- An integrated encyclopedia of DNA elements in the human genomeNature, 2012
- The accessible chromatin landscape of the human genomeNature, 2012
- Topological domains in mammalian genomes identified by analysis of chromatin interactionsNature, 2012
- Regulated Noise in the Epigenetic Landscape of Development and DiseaseCell, 2012
- Increased methylation variation in epigenetic domains across cancer typesNature Genetics, 2011
- TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelityNature, 2011
- The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shoresNature Genetics, 2009
- Large histone H3 lysine 9 dimethylated chromatin blocks distinguish differentiated from embryonic stem cellsNature Genetics, 2009
- Domain organization of human chromosomes revealed by mapping of nuclear lamina interactionsNature, 2008
- Comprehensive analysis of CpG islands in human chromosomes 21 and 22Proceedings of the National Academy of Sciences of the United States of America, 2002