Generalizable biomarker prediction from cancer pathology slides with self-supervised deep learning: A retrospective multi-centric study

Cell Rep Med. 2023 Apr 18;4(4):100980. doi: 10.1016/j.xcrm.2023.100980. Epub 2023 Mar 22.

Abstract

Deep learning (DL) can predict microsatellite instability (MSI) from routine histopathology slides of colorectal cancer (CRC). However, it is unclear whether DL can also predict other biomarkers with high performance and whether DL predictions generalize to external patient populations. Here, we acquire CRC tissue samples from two large multi-centric studies. We systematically compare six different state-of-the-art DL architectures to predict biomarkers from pathology slides, including MSI and mutations in BRAF, KRAS, NRAS, and PIK3CA. Using a large external validation cohort to provide a realistic evaluation setting, we show that models using self-supervised, attention-based multiple-instance learning consistently outperform previous approaches while offering explainable visualizations of the indicative regions and morphologies. While the prediction of MSI and BRAF mutations reaches a clinical-grade performance, mutation prediction of PIK3CA, KRAS, and NRAS was clinically insufficient.

Keywords: artificial intelligence; attention heatmaps; attention-based multiple-instance learning; biomarker; colorectal cancer; computational pathology; multi-input models; oncogenic mutation; self-supervised learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers
  • Class I Phosphatidylinositol 3-Kinases / genetics
  • Colorectal Neoplasms* / genetics
  • Colorectal Neoplasms* / pathology
  • Deep Learning*
  • Humans
  • Microsatellite Instability
  • Proto-Oncogene Proteins B-raf / genetics
  • Proto-Oncogene Proteins p21(ras) / genetics
  • Retrospective Studies

Substances

  • Proto-Oncogene Proteins B-raf
  • Proto-Oncogene Proteins p21(ras)
  • Biomarkers
  • Class I Phosphatidylinositol 3-Kinases