Machine Learning Identifies Robust Matrisome Markers and Regulatory Mechanisms in Cancer

Int J Mol Sci. 2020 Nov 22;21(22):8837. doi: 10.3390/ijms21228837.

Abstract

The expression and regulation of matrisome genes-the ensemble of extracellular matrix, ECM, ECM-associated proteins and regulators as well as cytokines, chemokines and growth factors-is of paramount importance for many biological processes and signals within the tumor microenvironment. The availability of large and diverse multi-omics data enables mapping and understanding of the regulatory circuitry governing the tumor matrisome to an unprecedented level, though such a volume of information requires robust approaches to data analysis and integration. In this study, we show that combining Pan-Cancer expression data from The Cancer Genome Atlas (TCGA) with genomics, epigenomics and microenvironmental features from TCGA and other sources enables the identification of "landmark" matrisome genes and machine learning-based reconstruction of their regulatory networks in 74 clinical and molecular subtypes of human cancers and approx. 6700 patients. These results, enriched for prognostic genes and cross-validated markers at the protein level, unravel the role of genetic and epigenetic programs in governing the tumor matrisome and allow the prioritization of tumor-specific matrisome genes (and their regulators) for the development of novel therapeutic approaches.

Keywords: big data; bioinformatics; cancer; extracellular matrix; matrisome; regulatory networks.

MeSH terms

  • Biomarkers
  • Chemokines / metabolism
  • Cytokines / metabolism
  • Extracellular Matrix
  • Extracellular Matrix Proteins / metabolism*
  • Gene Regulatory Networks
  • Humans
  • Intercellular Signaling Peptides and Proteins / metabolism
  • Machine Learning
  • Neoplasms / genetics
  • Neoplasms / metabolism*
  • Proteomics
  • Signal Transduction*
  • Tumor Microenvironment*

Substances

  • Biomarkers
  • Chemokines
  • Cytokines
  • Extracellular Matrix Proteins
  • Intercellular Signaling Peptides and Proteins