A CATH domain functional family based approach to identify putative cancer driver genes and driver mutations

Sci Rep. 2019 Jan 22;9(1):263. doi: 10.1038/s41598-018-36401-4.

Abstract

Tumour sequencing identifies highly recurrent point mutations in cancer driver genes, but rare functional mutations are hard to distinguish from large numbers of passengers. We developed a novel computational platform applying a multi-modal approach to filter out passengers and more robustly identify putative driver genes. The primary filter identifies enrichment of cancer mutations in CATH functional families (CATH-FunFams) - structurally and functionally coherent sets of evolutionary related domains. Using structural representatives from CATH-FunFams, we subsequently seek enrichment of mutations in 3D and show that these mutation clusters have a very significant tendency to lie close to known functional sites or conserved sites predicted using CATH-FunFams. Our third filter identifies enrichment of putative driver genes in functionally coherent protein network modules confirmed by literature analysis to be cancer associated. Our approach is complementary to other domain enrichment approaches exploiting Pfam families, but benefits from more functionally coherent groupings of domains. Using a set of mutations from 22 cancers we detect 151 putative cancer drivers, of which 79 are not listed in cancer resources and include recently validated cancer associated genes EPHA7, DCC netrin-1 receptor and zinc-finger protein ZNF479.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • DCC Receptor / genetics
  • DCC Receptor / metabolism
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Databases, Genetic / statistics & numerical data
  • Datasets as Topic
  • Humans
  • Neoplasms / genetics*
  • Oncogenes / genetics*
  • Point Mutation
  • Protein Interaction Mapping / methods
  • Protein Interaction Maps / genetics*
  • Receptor, EphA7 / genetics
  • Receptor, EphA7 / metabolism
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • DCC Receptor
  • DCC protein, human
  • DNA-Binding Proteins
  • Transcription Factors
  • ZNF479 protein, human
  • Receptor, EphA7