Predicting transcriptional responses to cold stress across plant species
Open Access
- 3 March 2021
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 118 (10)
- https://doi.org/10.1073/pnas.2026330118
Abstract
Although genome-sequence assemblies are available for a growing number of plant species, gene-expression responses to stimuli have been cataloged for only a subset of these species. Many genes show altered transcription patterns in response to abiotic stresses. However, orthologous genes in related species often exhibit different responses to a given stress. Accordingly, data on the regulation of gene expression in one species are not reliable predictors of orthologous gene responses in a related species. Here, we trained a supervised classification model to identify genes that transcriptionally respond to cold stress. A model trained with only features calculated directly from genome assemblies exhibited only modest decreases in performance relative to models trained by using genomic, chromatin, and evolution/diversity features. Models trained with data from one species successfully predicted which genes would respond to cold stress in other related species. Cross-species predictions remained accurate when training was performed in cold-sensitive species and predictions were performed in cold-tolerant species and vice versa. Models trained with data on gene expression in multiple species provided at least equivalent performance to models trained and tested in a single species and outperformed single-species models in cross-species prediction. These results suggest that classifiers trained on stress data from well-studied species may suffice for predicting gene-expression patterns in related, less-studied species with sequenced genomes.