A novel DNA sequence database for analyzing human demographic history
- 20 May 2008
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (8), 1354-1361
- https://doi.org/10.1101/gr.075630.107
Abstract
While there are now extensive databases of human genomic sequences from both private and public efforts to catalog human nucleotide variation, there are very few large-scale surveys designed for the purpose of analyzing human population history. Demographic inference from patterns of SNP variation in current large public databases is complicated by ascertainment biases associated with SNP discovery and the ways that populations and regions of the genome are sampled. Here, we present results from a resequencing survey of 40 independent intergenic regions on the autosomes and X chromosome comprising ∼210 kb from each of 90 humans from six geographically diverse populations (i.e., a total of ∼18.9 Mb). Unlike other public DNA sequence databases, we include multiple indigenous populations that serve as important reservoirs of human genetic diversity, such as the San of Namibia, the Biaka of the Central African Republic, and Melanesians from Papua New Guinea. In fact, only 20% of the SNPs that we find are contained in the HapMap database. We identify several key differences in patterns of variability in our database compared with other large public databases, including higher levels of nucleotide diversity within populations, greater levels of differentiation between populations, and significant differences in the frequency spectrum. Because variants at loci included in this database are less likely to be subject to ascertainment biases or linked to sites under selection, these data will be more useful for accurately reconstructing past changes in size and structure of human populations.Keywords
This publication has 41 references indexed in Scilit:
- Worldwide Human Relationships Inferred from Genome-Wide Patterns of VariationScience, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- Measurement of the human allele frequency spectrum demonstrates greater genetic drift in East Asians than in EuropeansNature Genetics, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- A worldwide survey of haplotype variation and linkage disequilibrium in the human genomeNature Genetics, 2006
- Standardized Subsets of the HGDP‐CEPH Human Genome Diversity Cell Line Panel, Accounting for Atypical and Duplicated Samples and Pairs of Close RelativesAnnals of Human Genetics, 2006
- A haplotype map of the human genomeNature, 2005
- Population History and Natural Selection Shape Patterns of Genetic Variation in 132 GenesPLoS Biology, 2004
- The Human Genome Browser at UCSCGenome Research, 2002
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997