LowMACA: exploiting protein family analysis for the identification of rare driver mutations in cancer

BMC Bioinformatics. 2016 Feb 9:17:80. doi: 10.1186/s12859-016-0935-7.

Abstract

Background: The increasing availability of resequencing data has led to a better understanding of the most important genes in cancer development. Nevertheless, the mutational landscape of many tumor types is heterogeneous and encompasses a long tail of potential driver genes that are systematically excluded by currently available methods due to the low frequency of their mutations. We developed LowMACA (Low frequency Mutations Analysis via Consensus Alignment), a method that combines the mutations of various proteins sharing the same functional domains to identify conserved residues that harbor clustered mutations in multiple sequence alignments. LowMACA is designed to visualize and statistically assess potential driver genes through the identification of their mutational hotspots.

Results: We analyzed the Ras superfamily exploiting the known driver mutations of the trio K-N-HRAS, identifying new putative driver mutations and genes belonging to less known members of the Rho, Rab and Rheb subfamilies. Furthermore, we applied the same concept to a list of known and candidate driver genes, and observed that low confidence genes show similar patterns of mutation compared to high confidence genes of the same protein family.

Conclusions: LowMACA is a software for the identification of gain-of-function mutations in putative oncogenic families, increasing the amount of information on functional domains and their possible role in cancer. In this context LowMACA emphasizes the role of genes mutated at low frequency otherwise undetectable by classical single gene analysis. LowMACA is an R package available at http://www.bioconductor.org/packages/release/bioc/html/LowMACA.html. It is also available as a GUI standalone downloadable at: https://cgsb.genomics.iit.it/wiki/projects/LowMACA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Mutational Analysis / methods*
  • Humans
  • Mutation / genetics*
  • Neoplasms / genetics*
  • Neoplasms / metabolism*
  • Proteins / genetics
  • Proteins / metabolism*
  • Sequence Analysis, Protein / methods*
  • Software*

Substances

  • Proteins