Population genomic analyses from low‐coverage RAD‐Seq data: a case study on the non‐model cucurbit bottle gourd
- 10 December 2013
- journal article
- Published by Wiley in The Plant Journal
- Vol. 77 (3), 430-442
- https://doi.org/10.1111/tpj.12370
Abstract
Restriction site-associated DNA sequencing (RAD-Seq), a next-generation sequencing-based genome 'complexity reduction' protocol, has been useful in population genomics in species with a reference genome. However, the application of this protocol to natural populations of genomically underinvestigated species, particularly under low-to-medium sequencing depth, has not been well justified. In this study, a Bayesian method was developed for calling genotypes from an F₂ population of bottle gourd [Lagenaria siceraria (Mol.) Standl.] to construct a high-density genetic map. Low-depth genome shotgun sequencing allowed the assembly of scaffolds/contigs comprising approximately 50% of the estimated genome, of which 922 were anchored for identifying syntenic regions between species. RAD-Seq genotyping of a natural population comprising 80 accessions identified 3226 single nuclear polymorphisms (SNPs), based on which two sub-gene pools were suggested for association with fruit shape. The two sub-gene pools were moderately differentiated, as reflected by the Hudson's F(ST) value of 0.14, and they represent regions on LG7 with strikingly elevated F(ST) values. Seven-fold reduction in heterozygosity and two times increase in LD (r²) were observed in the same region for the round-fruited sub-gene pool. Outlier test suggested the locus LX3405 on LG7 to be a candidate site under selection. Comparative genomic analysis revealed that the cucumber genome region syntenic to the high FST island on LG7 harbors an ortholog of the tomato fruit shape gene OVATE. Our results point to a bright future of applying RAD-Seq to population genomic studies for non-model species even under low-to-medium sequencing efforts. The genomic resources provide valuable information for cucurbit genome research.Keywords
This publication has 65 references indexed in Scilit:
- Draft Genome Sequence, and a Sequence-Defined Genetic Linkage Map of the Legume Crop Species Lupinus angustifolius LPLOS ONE, 2013
- A map of rice genome variation reveals the origin of cultivated riceNature, 2012
- Rainbow: an integrated tool for efficient clustering and assembling RAD-seq readsBioinformatics, 2012
- The genome of melon ( Cucumis melo L.)Proceedings of the National Academy of Sciences of the United States of America, 2012
- Genotype and SNP calling from next-generation sequencing dataNature Reviews Genetics, 2011
- MEGA5: Molecular Evolutionary Genetics Analysis Using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony MethodsMolecular Biology and Evolution, 2011
- Resolving postglacial phylogeography using high-throughput sequencingProceedings of the National Academy of Sciences of the United States of America, 2010
- De novo assembly of human genomes with massively parallel short read sequencingGenome Research, 2009
- Circos: An information aesthetic for comparative genomicsGenome Research, 2009
- SNP detection for massively parallel whole-genome resequencingGenome Research, 2009