Pan-cancer detection of driver genes at the single-patient resolution
Open Access
- 1 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Genome Medicine
- Vol. 13 (1), 1-14
- https://doi.org/10.1186/s13073-021-00830-0
Abstract
Identifying the complete repertoire of genes that drive cancer in individual patients is crucial for precision oncology. Most established methods identify driver genes that are recurrently altered across patient cohorts. However, mapping these genes back to patients leaves a sizeable fraction with few or no drivers, hindering our understanding of cancer mechanisms and limiting the choice of therapeutic interventions. We present sysSVM2, a machine learning software that integrates cancer genetic alterations with gene systems-level properties to predict drivers in individual patients. Using simulated pan-cancer data, we optimise sysSVM2 for application to any cancer type. We benchmark its performance on real cancer data and validate its applicability to a rare cancer type with few known driver genes. We show that drivers predicted by sysSVM2 have a low false-positive rate, are stable and disrupt well-known cancer-related pathways. sysSVM2 can be used to identify driver alterations in patients lacking sufficient canonical drivers or belonging to rare cancer types for which assembling a large enough cohort is challenging, furthering the goals of precision oncology. As resources for the community, we provide the code to implement sysSVM2 and the pre-trained models in all TCGA cancer types (https://github.com/ciccalab/sysSVM2).Keywords
Funding Information
- Cancer Research UK (C43634/A25487)
- Cancer Research UK (C604/A25135, C7893/A26233)
- H2020 Marie Skłodowska-Curie Actions (CONTRA-766030)
- Engineering and Physical Sciences Research Council (EP/L015854/1)
This publication has 41 references indexed in Scilit:
- OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genesBioinformatics, 2013
- Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer driversMolecular Systems Biology, 2013
- Integrated analysis of recurrent properties of cancer genes to identify novel driversGenome Biology, 2013
- Functional impact bias reveals cancer driversNucleic Acids Research, 2012
- MuSiC: Identifying mutational significance in cancer genomesGenome Research, 2012
- GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancersGenome Biology, 2011
- Modification of Gene Duplicability during the Evolution of Protein Interaction NetworkPLoS Computational Biology, 2011
- A similarity measure for indefinite rankingsACM Transactions on Information Systems, 2010
- CORUM: the comprehensive resource of mammalian protein complexes—2009Nucleic Acids Research, 2009
- Low duplicability and network fragility of cancer genesTrends in Genetics, 2008