gpps: an ILP-based approach for inferring cancer progression with mutation losses from single cell data

BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):413. doi: 10.1186/s12859-020-03736-7.

Abstract

Background: Cancer progression reconstruction is an important development stemming from the phylogenetics field. In this context, the reconstruction of the phylogeny representing the evolutionary history presents some peculiar aspects that depend on the technology used to obtain the data to analyze: Single Cell DNA Sequencing data have great specificity, but are affected by moderate false negative and missing value rates. Moreover, there has been some recent evidence of back mutations in cancer: this phenomenon is currently widely ignored.

Results: We present a new tool, gpps, that reconstructs a tumor phylogeny from Single Cell Sequencing data, allowing each mutation to be lost at most a fixed number of times. The General Parsimony Phylogeny from Single cell (gpps) tool is open source and available at https://github.com/AlgoLab/gpps .

Conclusions: gpps provides new insights to the analysis of intra-tumor heterogeneity by proposing a new progression model to the field of cancer phylogeny reconstruction on Single Cell data.

Keywords: Hill climbing; Integer linear programming; Phylogeny; Single cell sequencing.

MeSH terms

  • Base Sequence
  • Computational Biology / methods*
  • DNA Mutational Analysis*
  • Disease Progression*
  • Evolution, Molecular
  • Humans
  • Mutation*
  • Neoplasms / genetics*
  • Neoplasms / pathology*
  • Phylogeny
  • Single-Cell Analysis