Protein dynamics analysis identifies candidate cancer driver genes and mutations in TCGA data

Abstract
Recently, it has been showed that cancer missense mutations selectively target the neighborhood of hinge residues, which are key sites in protein dynamics. Here, we show that this approach can be extended to find previously unknown candidate mutations and genes. To this aim, we developed a computational pipeline to detect significantly enriched three-dimensional (3D) clustering of missense mutations around hinge residues. The hinge residues were detected by applying a Gaussian network model. By systematically analyzing the PanCancer compendium of somatic mutations in nearly 10 000 tumors from the Cancer Genome Atlas, we identified candidate genes and mutations in addition to well known ones. For instance, we found significantly enriched 3D clustering of missense mutations in known cancer genes including CDK4, CDKN2A, TCL1A, and MAPK1. Beside these known genes, we also identified significantly enriched 3D clustering of missense mutations around hinge residues in PLA2G4A, which may lead to excessive phosphorylation of the extracellular signal-regulated kinases. Furthermore, we demonstrated that hinge-based features improves pathogenicity prediction for missense mutations. Our results show that the consideration of clustering around hinge residues can help us explain the functional role of the mutations in known cancer genes and identify candidate genes.
Funding Information
  • Bilim Akademisi (BAGEP)
  • Türkiye Bilimler Akademisi (TÜBA‐GEBİP)
  • Türkiye Bilimsel ve Teknolojik Araştırma Kurumu (MAG 115M418)