Identification of small RNA pathway genes using patterns of phylogenetic conservation and divergence

Abstract
To identify comprehensively factors involved in RNAi and microRNA-mediated gene expression regulation, this study performed a phylogenetic analysis of 86 eukaryotic species; the candidates this approach highlighted were subjected to Bayesian analysis with transcriptional and proteomic interaction data, identifying protein orthologues of already known RNAi silencing factors, as well as other hits involved in splicing, suggesting a connection between the two processes. In order to identify factors involved in RNA interference (RNAi) and microRNA-mediated gene-expression regulation, Gary Ruvkun and colleagues performed a phylogenetic analysis of 86 eukaryotic species. The resulting candidates were subjected to Bayesian analysis with transcriptional and proteomic interaction data, to estimate the probability of their involvement in small RNA regulation. About half of the small RNA cofactors identified are required for RNAi silencing, and many of the others are involved in splicing, suggesting a connection between the two processes. Genetic and biochemical analyses of RNA interference (RNAi) and microRNA (miRNA) pathways have revealed proteins such as Argonaute and Dicer as essential cofactors that process and present small RNAs to their targets. Well-validated small RNA pathway cofactors such as these show distinctive patterns of conservation or divergence in particular animal, plant, fungal and protist species. We compared 86 divergent eukaryotic genome sequences to discern sets of proteins that show similar phylogenetic profiles with known small RNA cofactors. A large set of additional candidate small RNA cofactors have emerged from functional genomic screens for defects in miRNA- or short interfering RNA (siRNA)-mediated repression in Caenorhabditis elegans and Drosophila melanogaster1,2, and from proteomic analyses of proteins co-purifying with validated small RNA pathway proteins3,4. The phylogenetic profiles of many of these candidate small RNA pathway proteins are similar to those of known small RNA cofactor proteins. We used a Bayesian approach to integrate the phylogenetic profile analysis with predictions from diverse transcriptional coregulation and proteome interaction data sets to assign a probability for each protein for a role in a small RNA pathway. Testing high-confidence candidates from this analysis for defects in RNAi silencing, we found that about one-half of the predicted small RNA cofactors are required for RNAi silencing. Many of the newly identified small RNA pathway proteins are orthologues of proteins implicated in RNA splicing. In support of a deep connection between the mechanism of RNA splicing and small-RNA-mediated gene silencing, the presence of the Argonaute proteins and other small RNA components in the many species analysed strongly correlates with the number of introns in those species.