Identification and phylogenetic analysis of RNA binding domain abundant in apicomplexans or RAP proteins
Open Access
- 1 March 2021
- journal article
- research article
- Published by Microbiology Society in Microbial Genomics
- Vol. 7 (3), 000541
- https://doi.org/10.1099/mgen.0.000541
Abstract
The RNA binding domain abundant in apicomplexans (RAP) is a protein domain identified in a diverse group of proteins, called RAP proteins, many of which have been shown to be involved in RNA binding. To understand the expansion and potential function of the RAP proteins, we conducted a hidden Markov model based screen among the proteomes of 54 eukaryotes, 17 bacteria and 12 archaea. We demonstrated that the domain is present in closely and distantly related organisms with particular expansions in Alveolata and Chlorophyta, and are not unique to Apicomplexa as previously believed. All RAP proteins identified can be decomposed into two parts. In the N-terminal region, the presence of variable helical repeats seems to participate in the specific targeting of diverse RNAs, while the RAP domain is mostly identified in the C-terminal region and is highly conserved across the different phylogenetic groups studied. Several conserved residues defining the signature motif could be crucial to ensure the function(s) of the RAP proteins. Modelling of RAP domains in apicomplexan parasites confirmed an ⍺/β structure of a restriction endonuclease-like fold. The phylogenetic trees generated from multiple alignment of RAP domains and full-length proteins from various distantly related eukaryotes indicated a complex evolutionary history of this family. We further discuss these results to assess the potential function of this protein family in apicomplexan parasites.Funding Information
- National Institute of Allergy and Infectious Diseases (R01 AI142743)
- National Institute of General Medical Sciences (R35 GM118187)
- Academic Senate, University of California, Riverside (NIFA-Hatch-225935)
This publication has 84 references indexed in Scilit:
- Sequence, structure and functional diversity of PD-(D/E)XK phosphodiesterase superfamilyNucleic Acids Research, 2012
- The Oxytricha trifallax Mitochondrial GenomeGenome Biology and Evolution, 2011
- Fast kinase domain-containing protein 3 is a mitochondrial protein essential for cellular respirationBiochemical and Biophysical Research Communications, 2010
- A common red algal origin of the apicomplexan, dinoflagellate, and heterokont plastidsProceedings of the National Academy of Sciences of the United States of America, 2010
- The mitochondrial genomes of the ciliates Euplotes minuta and Euplotes crassusBMC Genomics, 2009
- Comparative genomics of the neglected human malaria parasite Plasmodium vivaxNature, 2008
- The dinoflagellates Durinskia baltica and Kryptoperidinium foliaceum retain functionally overlapping mitochondria from two evolutionarily distinct lineagesBMC Evolutionary Biology, 2007
- Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotationBioinformatics, 2006
- Genome of the Host-Cell Transforming Parasite Theileria annulata Compared with T. parvaScience, 2005
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004