GSVA: gene set variation analysis for microarray and RNA-Seq data
Top Cited Papers
Open Access
- 16 January 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 14 (1), 7
- https://doi.org/10.1186/1471-2105-14-7
Abstract
Background: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. Results: To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. Conclusions: GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.Keywords
This publication has 62 references indexed in Scilit:
- Understanding mechanisms underlying human gene expression variation with RNA sequencingNature, 2010
- Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA, IDH1, EGFR, and NF1Cancer Cell, 2010
- Biological and Molecular Heterogeneity of Breast Cancers Correlates with Their Cancer Stem Cell ContentCell, 2010
- Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1Nature, 2009
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- An integrative genomics approach to infer causal associations between gene expression and diseaseNature Genetics, 2005
- X-inactivation profile reveals extensive variability in X-linked gene expression in femalesNature, 2005
- The male-specific region of the human Y chromosome is a mosaic of discrete sequence classesNature, 2003
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- Comparing partitionsJournal of Classification, 1985