A mixture model for signature discovery from sparse mutation data

Genome Med. 2021 Nov 1;13(1):173. doi: 10.1186/s13073-021-00988-7.

Abstract

Mutational signatures are key to understanding the processes that shape cancer genomes, yet their analysis requires relatively rich whole-genome or whole-exome mutation data. Recently, orders-of-magnitude sparser gene-panel-sequencing data have become increasingly available in the clinic. To deal with such sparse data, we suggest a novel mixture model, Mix. In application to simulated and real gene-panel sequences, Mix is shown to outperform current approaches and yield mutational signatures and patient stratifications that are in higher agreement with the literature. We further demonstrate its utility in several clinical settings, successfully predicting therapy benefit and patient groupings from MSK-IMPACT pan-cancer data. Availability: https://github.com/itaysason/Mix-MMM .

Keywords: Gene panel sequencing; Mutational signatures; Probabilistic modeling.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Exome
  • Exome Sequencing
  • Humans
  • Lung Neoplasms / genetics
  • Models, Genetic
  • Mutation*
  • Neoplasms / genetics*