Chromosome‐scale scaffolds for the Chinese hamster reference genome assembly to facilitate the study of the CHO epigenome
- 15 May 2020
- journal article
- research article
- Published by Wiley in Biotechnology & Bioengineering
- Vol. 117 (8), 2331-2339
- https://doi.org/10.1002/bit.27432
Abstract
The Chinese hamster genome serves as a reference genome for the study of Chinese hamster ovary (CHO) cells, the preferred host system for biopharmaceutical production. Recent re-sequencing of the Chinese hamster genome resulted in the RefSeq PICR meta-assembly, a set of highly accurate scaffolds that filled over 95% of the gaps in previous assembly versions. However, these scaffolds did not reach chromosome-scale due to the absence of long-range scaffolding information during the meta-assembly process. Here, long-range scaffolding of the PICR Chinese hamster genome assembly was performed using high-throughput chromosome conformation capture (Hi-C). This process resulted in a new “PICRH” genome, where 97% of the genome is contained in 11 mega-scaffolds corresponding to the Chinese hamster chromosomes (2n = 22) and the total number of scaffolds is reduced by three-fold from 1,830 scaffolds in PICR to 647 in PICRH. Continuity was improved while preserving accuracy, leading to quality scores higher than recent builds of mouse chromosomes and comparable to human chromosomes. The PICRH genome assembly will be an indispensable tool for designing advanced genetic engineering strategies in CHO cells and enabling systematic examination of genomic and epigenomic instability through comparative analysis of CHO cell lines on a common set of chromosomal coordinates.Funding Information
- National Institute of Standards and Technology (70NANB17H002)
- National Science Foundation (1736123)
This publication has 37 references indexed in Scilit:
- REAPR: a universal tool for genome assembly evaluationGenome Biology, 2013
- HiTC: exploration of high-throughput ‘C’ experimentsBioinformatics, 2012
- Topological domains in mammalian genomes identified by analysis of chromatin interactionsNature, 2012
- The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell lineNature Biotechnology, 2011
- BEDTools: a flexible suite of utilities for comparing genomic featuresBioinformatics, 2010
- Fast and accurate long-read alignment with Burrows–Wheeler transformBioinformatics, 2010
- Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human GenomeScience, 2009
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- Fast and accurate short read alignment with Burrows–Wheeler transformBioinformatics, 2009
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002