Precursor arrays for triplet repeat expansion at the fragile X locus

Abstract
To determine factors governing triplet repeat expansion at FMR1, we need to understand the basis of normal variation. We have sequenced the FMR1 repeat from 102 normal X chromosomes and show that most are interrupted with a regularly spaced AGG trinucleotide giving an ordered structure to the array. Five types of arrays were identified consisting of varying numbers of a core unit with consensus [AGG(CGG)g]. Additional variation in the length of the (CGG)n portion within each unit generates the continuum of lengths seen on normal chromosomes. Ten per cent contain long, uninterrupted tracts of (CGG)n, and their lengths suggest they have arisen by the loss of AGG triplets from longer interrupted arrays. Haplotype analysis of arrays carrying long, uninterrupted (CGG)n tracts suggests that they occur more frequently on genetic backgrounds which are more highly represented on fragile X chromosomes. These arrays may well be precursors from which the larger fragile X associated arrays have arisen by further expansion.