Robust inflammatory breast cancer gene signature using nonparametric random forest analysis

Breast Cancer Res. 2021 Sep 27;23(1):92. doi: 10.1186/s13058-021-01467-y.

Abstract

Inflammatory breast cancer (IBC) is a rare, aggressive cancer found in all the molecular breast cancer subtypes. Despite extensive previous efforts to screen for transcriptional differences between IBC and non-IBC patients, a robust IBC-specific molecular signature has been elusive. We report a novel IBC-specific gene signature (59 genes; G59) that achieves 100% accuracy in discovery and validation samples (45/45 correct classification) and remarkably only misclassified one sample (60/61 correct classification) in an independent dataset. G59 is independent of ER/HER2 status, molecular subtypes and is specific to untreated IBC samples, with most of the genes being enriched for plasma membrane cellular component proteins, interleukin (IL), and chemokine signaling pathways. Our finding suggests the existence of an IBC-specific molecular signature, paving the way for the identification and validation of targetable genomic drivers of IBC.

Keywords: Breast cancer; IBC; IBC signature; Machine learning; Random forest.

MeSH terms

  • Biomarkers, Tumor / genetics
  • Databases, Genetic
  • Female
  • Humans
  • Inflammatory Breast Neoplasms / genetics*
  • Interleukins / genetics
  • Machine Learning
  • Membrane Proteins / genetics
  • Signal Transduction / genetics
  • Statistics, Nonparametric

Substances

  • Biomarkers, Tumor
  • Interleukins
  • Membrane Proteins