PESM: A novel approach of tumor purity estimation based on sample specific methylation sites

J Bioinform Comput Biol. 2020 Oct;18(5):2050027. doi: 10.1142/S0219720020500274. Epub 2020 Aug 6.

Abstract

Background: Tumor purity is of great significance for the study of tumor genotyping and the prediction of recurrence, which is significantly affected by tumor heterogeneity. Tumor heterogeneity is the basis of drug resistance in various cancer treatments, and DNA methylation plays a core role in the generation of tumor heterogeneity. Almost all types of cancer cells are associated with abnormal DNA methylation in certain regions of the genome. The selection of tumor-related differential methylation sites, which can be used as an indicator of tumor purity, has important implications for purity assessment. At present, the selection of information sites mostly focuses on inter-tumor heterogeneity and ignores the heterogeneity of tumor growth space that is sample specificity. Results: Considering the specificity of tumor samples and the information gain of individual tumor sample relative to the normal samples, we present an approach, PESM, to evaluate the tumor purity through the specificity difference methylation sites of tumor samples. Applied to more than 200 tumor samples of Prostate adenocarcinoma (PRAD) and Kidney renal clear cell carcinoma (KIRC), it shows that the tumor purity estimated by PESM is highly consistent with other existing methods. In addition, PESM performs better than the method that uses the integrated signal of methylation sites to estimate purity. Therefore, different information sites selection methods have an important impact on the estimation of tumor purity, and the selection of sample specific information sites has a certain significance for accurate identification of tumor purity of samples.

Keywords: DNA methylation; information gain; specific methylation sites; tumor purity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / genetics
  • Adenocarcinoma / pathology*
  • Carcinoma, Renal Cell / genetics
  • Carcinoma, Renal Cell / pathology*
  • Computational Biology / methods*
  • CpG Islands
  • DNA Methylation*
  • Databases, Factual
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Kidney Neoplasms / genetics
  • Kidney Neoplasms / pathology*
  • Male
  • Prostatic Neoplasms / genetics
  • Prostatic Neoplasms / pathology*