Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
- 30 September 2005
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 102 (43), 15545-15550
- https://doi.org/10.1073/pnas.0506580102
Abstract
Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
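The abstract describes scoring whole gene sets rather than single genes but does not spell out the statistic. As an illustration only, the sketch below computes a weighted running-sum enrichment score of the kind commonly associated with GSEA: walk down a list of genes ranked by their association with a phenotype, step up at each gene in the set, step down otherwise, and report the largest deviation from zero. The function name `enrichment_score`, the weighting exponent `p`, and the toy gene names and scores are assumptions made for this example; they are not taken from the abstract or from the GSEA software package.

```python
def enrichment_score(ranked_genes, scores, gene_set, p=1.0):
    """Illustrative running-sum enrichment score for one gene set.

    ranked_genes: gene names ordered from most to least associated
                  with the phenotype (assumed to be pre-sorted).
    scores:       ranking metric for each gene, in the same order.
    gene_set:     collection of gene names forming the set of interest.
    p:            exponent weighting each hit by |score| ** p (assumed default 1).
    """
    members = set(gene_set)
    in_set = [gene in members for gene in ranked_genes]
    n_miss = len(ranked_genes) - sum(in_set)
    hit_total = sum(abs(s) ** p for s, hit in zip(scores, in_set) if hit)
    if hit_total == 0 or n_miss == 0:
        raise ValueError("gene set must overlap the ranked list without covering it")

    running = 0.0
    best = 0.0
    for s, hit in zip(scores, in_set):
        if hit:
            # Step up, in proportion to how strongly this gene is ranked.
            running += abs(s) ** p / hit_total
        else:
            # Step down by a constant amount for every gene outside the set.
            running -= 1.0 / n_miss
        # Keep the deviation from zero with the largest magnitude.
        if abs(running) > abs(best):
            best = running
    return best


# Toy data (hypothetical gene names and scores).
genes = ["A", "B", "C", "D", "E", "F"]
metric = [3.0, 2.0, 1.0, -1.0, -2.0, -3.0]
top_set_score = enrichment_score(genes, metric, {"A", "B"})
bottom_set_score = enrichment_score(genes, metric, {"E", "F"})
```

In this toy case the set concentrated at the top of the ranking gets a positive score and the set concentrated at the bottom gets a negative one. A real analysis would also need a significance estimate, typically obtained by permutation, and a correction for testing many gene sets; neither is shown here.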