Interaction-based discovery of functionally important genes in cancers

Nucleic Acids Res. 2014 Feb;42(3):e18. doi: 10.1093/nar/gkt1305. Epub 2013 Dec 19.

Abstract

A major challenge in cancer genomics is uncovering genes with an active role in tumorigenesis from a potentially large pool of mutated genes across patient samples. Here we focus on the interactions that proteins make with nucleic acids, small molecules, ions and peptides, and show that residues within proteins that are involved in these interactions are more frequently affected by mutations observed in large-scale cancer genomic data than are other residues. We leverage this observation to predict genes that play a functionally important role in cancers by introducing a computational pipeline (http://canbind.princeton.edu) for mapping large-scale cancer exome data across patients onto protein structures, and automatically extracting proteins with an enriched number of mutations affecting their nucleic acid, small molecule, ion or peptide binding sites. Using this computational approach, we show that many previously known genes implicated in cancers are enriched in mutations within the binding sites of their encoded proteins. By focusing on functionally relevant portions of proteins--specifically those known to be involved in molecular interactions--our approach is particularly well suited to detect infrequent mutations that may nonetheless be important in cancer, and should aid in expanding our functional understanding of the genomic landscape of cancer.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Binding Sites / genetics
  • DEAD-box RNA Helicases / chemistry
  • DEAD-box RNA Helicases / genetics
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • Genes, Neoplasm*
  • Humans
  • Mutation, Missense
  • Neoplasms / genetics
  • Polymorphism, Genetic
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / metabolism
  • Proto-Oncogene Proteins c-akt / chemistry
  • Proto-Oncogene Proteins c-akt / genetics
  • Ribonuclease III / chemistry
  • Ribonuclease III / genetics
  • Software
  • Transcription Factors / chemistry
  • Transcription Factors / genetics
  • Ubiquitin-Protein Ligases / chemistry
  • Ubiquitin-Protein Ligases / genetics
  • beta-Transducin Repeat-Containing Proteins / chemistry
  • beta-Transducin Repeat-Containing Proteins / genetics

Substances

  • DNA-Binding Proteins
  • FBXW11 protein, human
  • MBD1 protein, human
  • Proteins
  • Transcription Factors
  • beta-Transducin Repeat-Containing Proteins
  • Ubiquitin-Protein Ligases
  • AKT3 protein, human
  • Proto-Oncogene Proteins c-akt
  • DICER1 protein, human
  • Ribonuclease III
  • DEAD-box RNA Helicases