Incongruent Patterns of Local and Global Genome Size Evolution in Cotton

Abstract
Genome sizes in plants vary over several orders of magnitude, reflecting a combination of differentially acting local and global forces such as biases in indel accumulation and transposable element proliferation or removal. To gain insight into the relative role of these and other forces, ∼105 kb of contiguous sequence surrounding the cellulose synthase gene CesA1 was compared for the two coresident genomes (AT and DT) of the allopolyploid cotton species, Gossypium hirsutum. These two genomes differ approximately twofold in size, having diverged from a common ancestor ∼5–10 million years ago (Mya) and been reunited in the same nucleus at the time of polyploid formation, ∼1–2 Mya. Gene content, order, and spacing are largely conserved between the two genomes, although a few transposable elements and a single cpDNA fragment distinguish the two homoeologs. Sequence conservation is high in both intergenic and genic regions, with 14 conserved genes detected in both genomes yielding a density of 1 gene every 7.5 kb. In contrast to the twofold overall difference in DNA content, no disparity in size was observed for this 105-kb region, and 555 indels were detected that distinguish the two homoeologous BACs, approximately equally distributed between AT and DT in number and aggregate size. The data demonstrate that genome size evolution at this phylogenetic scale is not primarily caused by mechanisms that operate uniformly across different genomic regions and components; instead, the twofold overall difference in DNA content must reflect locally operating forces between gene islands or in largely gene-free regions.