Accurate determination of microsatellite allele frequencies in pooled DNA samples
- 11 August 2004
- journal article
- research article
- Published by Springer Science and Business Media LLC in European Journal of Human Genetics
- Vol. 12 (11), 925-934
- https://doi.org/10.1038/sj.ejhg.5201234
Abstract
Pooling of DNA samples instead of individual genotyping can speed up genetic association studies. However, for microsatellite markers, the electrophoretic pattern of DNA pools can be complex, and procedures for deriving allele frequencies are often confounded by PCR-induced stutter artefacts. We have developed a mathematical procedure to remove stutter noise and accurately determine allele frequencies in pools. A stutter correction model can be reliably derived from one standard 'training set' of the same 10 individual DNA samples for each marker, which can also include heterozygous patterns with partially overlapping peaks. Compared with earlier methods, this reduces the number of genotypes needed in the training set considerably, and allows standardization of analyses for different markers. Moreover, the use of a procedure that fits all data simultaneously makes the method less sensitive to aberrant data. The model was tested with 34 markers, 18 of which were newly defined from human sequence data. Allele frequencies derived from stutter-corrected DNA pool patterns were compared with the summed individual genotyping results of all the individuals in the pools (n=109 and n=64). We show that the model is robust and accurately extracts allele frequencies from pooled DNA samples for 32 of the 34 microsatellite markers tested. Finally, we performed a case–control study in celiac disease and found that weakly associated disease alleles, identified by individual genotyping, were only detectable in pools after stutter correction. This efficient method for correcting stutter artefacts in microsatellite markers enables large-scale genetic association studies using DNA pools to be performed.
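The abstract does not spell out the mathematical procedure, so the following is only an illustrative sketch of the general idea of stutter correction, not the authors' model. It assumes a simple linear picture in which each allele adds a main peak plus stutter peaks at fixed, hypothetical ratios one and two repeats shorter, and it recovers pool frequencies by least squares. The function names, the stutter ratios, and the example frequencies are all made up for illustration.

```python
import numpy as np

def stutter_matrix(n_alleles, stutter_ratios):
    """Build a matrix whose column j is the peak pattern of a pure allele j.

    Alleles are ordered from shortest (index 0) to longest.
    stutter_ratios[k-1] is the assumed (hypothetical) height of the stutter
    peak k repeats shorter than the main peak, relative to the main peak.
    """
    S = np.zeros((n_alleles, n_alleles))
    for j in range(n_alleles):
        S[j, j] = 1.0  # main peak of allele j
        for k, ratio in enumerate(stutter_ratios, start=1):
            if j - k >= 0:
                S[j - k, j] = ratio  # stutter peak k repeats shorter
    return S

def correct_pool(peaks, S):
    """Estimate allele frequencies from observed pool peak heights.

    Solves S @ x = peaks in the least-squares sense, clips small negative
    values caused by noise, and normalizes so the frequencies sum to 1.
    """
    x, *_ = np.linalg.lstsq(S, peaks, rcond=None)
    x = np.clip(x, 0.0, None)
    return x / x.sum()

# Made-up example: four alleles with known frequencies in a pool.
true_freq = np.array([0.1, 0.2, 0.4, 0.3])
S = stutter_matrix(4, stutter_ratios=[0.3, 0.05])  # assumed ratios

observed = S @ true_freq            # simulated pool pattern with stutter
naive = observed / observed.sum()   # frequencies read directly from peaks
estimated = correct_pool(observed, S)

print("naive:    ", np.round(naive, 3))
print("corrected:", np.round(estimated, 3))
```

In this toy setting the naive reading overestimates short alleles, because they collect stutter from longer ones, while the corrected estimate recovers the simulated frequencies. In practice the stutter ratios would have to be estimated from individual genotypes, which is the role the abstract assigns to the training set.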