CanProVar: a human cancer proteome variation database
Open Access
- 5 January 2010
- journal article
- databases
- Published by Hindawi Limited in Human Mutation
- Vol. 31 (3), 219-228
- https://doi.org/10.1002/humu.21176
Abstract
Identification and annotation of mutated genes or proteins involved in oncogenesis and tumor progression are crucial for both cancer biology and clinical applications. We have developed a human Cancer Proteome Variation Database (CanProVar) by integrating information on protein sequence variations from various public resources, with a focus on cancer‐related variations (crVAR). We have also built a user‐friendly interface for querying the database. The current version of CanProVar comprises 8,570 crVARs in 2,921 proteins derived from existing genome variation databases and recently published large‐scale cancer genome resequencing studies. It also includes 41,541 non‐cancer specific variations (ncsVARs) in 30,322 proteins derived from the dbSNP database. CanProVar provides quick access to known crVARs in protein sequences along with related cancer samples, relevant publications, data sources, and functional information such as Gene Ontology (GO) annotations for the proteins, protein domains in which the variation occurs, and protein interaction partners with crVARs. CanProVar also helps reveal functional characteristics of crVARs and proteins bearing these variations. Our analysis showed that crVARs were enriched in certain protein domains. We also showed that proteins bearing crVARs were more likely to interact with each other in the protein interaction network. CanProVar can be accessed from http://bioinfo.vanderbilt.edu/canprovar. Hum Mutat 30:1–10, 2010.Keywords
This publication has 74 references indexed in Scilit:
- The caBIG terminology review processJournal of Biomedical Informatics, 2009
- Comprehensive genomic characterization defines human glioblastoma genes and core pathwaysNature, 2008
- Menin Critically Links MLL Proteins with LEDGF on Cancer-Associated Target GenesCancer Cell, 2008
- Patterns of somatic mutation in human cancer genomesNature, 2007
- Modeling the Evolution of Protein Domain Architectures Using Maximum ParsimonyJournal of Molecular Biology, 2007
- EGF receptor gene mutations are common in lung cancers from “never smokers” and are associated with sensitivity of tumors to gefitinib and erlotinibProceedings of the National Academy of Sciences of the United States of America, 2004
- The COSMIC (Catalogue of Somatic Mutations in Cancer) database and websiteBritish Journal of Cancer, 2004
- The Swiss-Prot variant page and the ModSNP database: A resource for sequence and structure information on human protein variantsHuman Mutation, 2004
- A census of human cancer genesNature Reviews Cancer, 2004
- Human Gene Mutation Database (HGMD®): 2003 updateHuman Mutation, 2003