Chromosome genome assembly and annotation of the yellowbelly pufferfish with PacBio and Hi-C sequencing data
Open Access
- 8 November 2019
- journal article
- research article
- Published by Springer Science and Business Media LLC in Scientific Data
- Vol. 6 (1), 1-8
- https://doi.org/10.1038/s41597-019-0279-z
Abstract
Pufferfish are ideal models for vertebrate chromosome evolution studies. The yellowbelly pufferfish, Takifugu flavidus, is an important marine fish species in the aquaculture industry and ecology of East Asia. The chromosome assembly of the species could facilitate the study of chromosome evolution and functional gene mapping. To this end, 44, 27 and 50 Gb reads were generated for genome assembly using Illumina, PacBio and Hi-C sequencing technologies, respectively. More than 13 Gb full-length transcripts were sequenced on the PacBio platform. A 366 Mb genome was obtained with the contig of 4.4 Mb and scaffold N50 length of 15.7 Mb. 266 contigs were reliably assembled into 22 chromosomes, representing 95.9% of the total genome. A total of 29,416 protein-coding genes were predicted and 28,071 genes were functionally annotated. More than 97.7% of the BUSCO genes were successfully detected in the genome. The genome resource in this work will be used for the conservation and population genetics of the yellowbelly pufferfish, as well as in vertebrate chromosome evolution studies.Keywords
This publication has 39 references indexed in Scilit:
- A fast, lock-free approach for efficient parallel counting of occurrences of k-mersBioinformatics, 2011
- Aligning Short Sequencing Reads with BowtieCurrent Protocols in Bioinformatics, 2010
- TopHat: discovering splice junctions with RNA-SeqBioinformatics, 2009
- MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomesGenome Research, 2007
- Adaptation in bacterial flagellar and motility systems: from regulon members to ‘foraging’-like behavior in E. coliNucleic Acids Research, 2007
- LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposonsNucleic Acids Research, 2007
- AUGUSTUS: ab initio prediction of alternative transcriptsNucleic Acids Research, 2006
- Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics researchBioinformatics, 2005
- De novo identification of repeat families in large genomesBioinformatics, 2005
- Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotypeNature, 2004