Retrotransposition of gene transcripts leads to structural variation in mammalian genomes
Open Access
- 1 January 2013
- journal article
- Published by Springer Science and Business Media LLC in Genome Biology
- Vol. 14 (3), R22
- https://doi.org/10.1186/gb-2013-14-3-r22
Abstract
Retroposed processed gene transcripts are an important source of material for new gene formation on evolutionary timescales. Most prior work on gene retrocopy discovery compared copies in reference genome assemblies to their source genes. Here, we explore gene retrocopy insertion polymorphisms (GRIPs) that are present in the germlines of individual humans, mice, and chimpanzees, and we identify novel gene retrocopy insertions in cancerous somatic tissues that are absent from patient-matched non-cancer genomes. Through analysis of whole-genome sequence data, we found evidence for 48 GRIPs in the genomes of one or more humans sequenced as part of the 1,000 Genomes Project and The Cancer Genome Atlas, but which were not in the human reference assembly. Similarly, we found evidence for 755 GRIPs at distinct locations in one or more of 17 inbred mouse strains but which were not in the mouse reference assembly, and 19 GRIPs across a cohort of 10 chimpanzee genomes, which were not in the chimpanzee reference genome assembly. Many of these insertions are new members of existing gene families whose source genes are highly and widely expressed, and the majority have detectable hallmarks of processed gene retrocopy formation. We estimate the rate of novel gene retrocopy insertions in humans and chimps at roughly one new gene retrocopy insertion for every 6,000 individuals. We find that gene retrocopy polymorphisms are a widespread phenomenon, present a multi-species analysis of these events, and provide a method for their ascertainment.Keywords
This publication has 77 references indexed in Scilit:
- Genome-Wide Analysis of Wild-Type Epstein–Barr Virus Genomes Derived from Healthy Individuals of the 1000 Genomes ProjectGenome Biology and Evolution, 2014
- Detection of structural variants and indels within exome dataNature Methods, 2011
- The UCSC Genome Browser database: update 2011Nucleic Acids Research, 2010
- Adaptive evolution of young gene duplicates in mammalsGenome Research, 2009
- Independent genesis of chimeric TRIM5-cyclophilin proteins in two primate speciesProceedings of the National Academy of Sciences of the United States of America, 2008
- LINE-mediated retrotransposition of marked Alu sequencesNature Genetics, 2003
- Nature and Structure of Human Genes that Generate RetropseudogenesGenome Research, 2000
- Human LINE retrotransposons generate processed pseudogenesNature Genetics, 2000
- Human L1 Retrotransposon Encodes a Conserved Endonuclease Required for RetrotranspositionCell, 1996
- On the number of segregating sites in genetical models without recombinationTheoretical Population Biology, 1975