Evolutionary and Expression Signatures of Pseudogenes in Arabidopsis and Rice
Open Access
- 29 July 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Plant Physiology
- Vol. 151 (1), 3-15
- https://doi.org/10.1104/pp.109.140632
Abstract
Pseudogenes (Ψ) are nonfunctional genomic sequences resembling functional genes. Knowledge of Ψs can improve genome annotation and our understanding of genome evolution. However, there has been relatively little systemic study of Ψs in plants. In this study, we characterized the evolution and expression patterns of Ψs in Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa). In contrast to animal Ψs, many plant Ψs experienced much stronger purifying selection. In addition, plant Ψs experiencing stronger selective constraints tend to be derived from relatively ancient duplicates, suggesting that they were functional for a relatively long time but became Ψs recently. Interestingly, the regions 5′ to the first stops in the Ψs have experienced stronger selective constraints compared with 3′ regions, suggesting that the 5′ regions were functional for a longer period of time after the premature stops appeared. We found that few Ψs have expression evidence, and their expression levels tend to be lower compared with annotated genes. Furthermore, Ψs with expressed sequence tags tend to be derived from relatively recent duplication events, indicating that Ψ expression may be due to insufficient time for complete degeneration of regulatory signals. Finally, larger protein domain families have significantly more Ψs in general. However, while families involved in environmental stress responses have a significant excess of Ψs, transcription factors and receptor-like kinases have lower than expected numbers of Ψs, consistent with their elevated retention rate in plant genomes. Our findings illustrate peculiar properties of plant Ψs, providing additional insight into the evolution of duplicate genes and benefiting future genome annotation.Keywords
This publication has 67 references indexed in Scilit:
- Trans-natural antisense transcripts including noncoding RNAs in 10 species: implications for expression regulationNucleic Acids Research, 2008
- PlantGDB: a resource for comparative plant genomicsNucleic Acids Research, 2007
- Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolutionGenome Research, 2007
- The putatively functional Mkrn1-p1 pseudogene is neither expressed nor imprinted, nor does it regulate its source gene in transProceedings of the National Academy of Sciences of the United States of America, 2006
- High Rate of Chimeric Gene Origination by Retroposition in Plant GenomesPlant Cell, 2006
- The map-based sequence of the rice genomeNature, 2005
- The Pfam protein families databaseNucleic Acids Research, 2004
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinasesProceedings of the National Academy of Sciences of the United States of America, 2001
- Initial sequencing and analysis of the human genomeNature, 2001