Weakly supervised annotation-free cancer detection and prediction of genotype in routine histopathology

J Pathol. 2022 Jan;256(1):50-60. doi: 10.1002/path.5800. Epub 2021 Oct 22.

Abstract

Deep learning is a powerful tool in computational pathology: it can be used for tumor detection and for predicting genetic alterations based on histopathology images alone. Conventionally, tumor detection and prediction of genetic alterations are two separate workflows. Newer methods have combined them, but require complex, manually engineered computational pipelines, restricting reproducibility and robustness. To address these issues, we present a new method for simultaneous tumor detection and prediction of genetic alterations: The Slide-Level Assessment Model (SLAM) uses a single off-the-shelf neural network to predict molecular alterations directly from routine pathology slides without any manual annotations, improving upon previous methods by automatically excluding normal and non-informative tissue regions. SLAM requires only standard programming libraries and is conceptually simpler than previous approaches. We have extensively validated SLAM for clinically relevant tasks using two large multicentric cohorts of colorectal cancer patients, Darmkrebs: Chancen der Verhütung durch Screening (DACHS) from Germany and Yorkshire Cancer Research Bowel Cancer Improvement Programme (YCR-BCIP) from the UK. We show that SLAM yields reliable slide-level classification of tumor presence with an area under the receiver operating curve (AUROC) of 0.980 (confidence interval 0.975, 0.984; n = 2,297 tumor and n = 1,281 normal slides). In addition, SLAM can detect microsatellite instability (MSI)/mismatch repair deficiency (dMMR) or microsatellite stability/mismatch repair proficiency with an AUROC of 0.909 (0.888, 0.929; n = 2,039 patients) and BRAF mutational status with an AUROC of 0.821 (0.786, 0.852; n = 2,075 patients). The improvement with respect to previous methods was validated in a large external testing cohort in which MSI/dMMR status was detected with an AUROC of 0.900 (0.864, 0.931; n = 805 patients). In addition, SLAM provides human-interpretable visualization maps, enabling the analysis of multiplexed network predictions by human experts. In summary, SLAM is a new simple and powerful method for computational pathology that could be applied to multiple disease contexts. © 2021 The Authors. The Journal of Pathology published by John Wiley & Sons, Ltd. on behalf of The Pathological Society of Great Britain and Ireland.

Keywords: Lynch syndrome; artificial intelligence; colorectal cancer; computational pathology; deep learning; digital pathology; microsatellite instability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Brain Neoplasms / diagnosis
  • Brain Neoplasms / genetics*
  • Brain Neoplasms / pathology*
  • Cohort Studies
  • Colorectal Neoplasms / diagnosis
  • Colorectal Neoplasms / genetics*
  • Colorectal Neoplasms / pathology*
  • Deep Learning
  • Female
  • Genotype
  • Humans
  • Male
  • Microsatellite Instability*
  • Middle Aged
  • Mutation / genetics*
  • Neoplastic Syndromes, Hereditary / diagnosis
  • Neoplastic Syndromes, Hereditary / genetics*
  • Neoplastic Syndromes, Hereditary / pathology*
  • Reproducibility of Results

Supplementary concepts

  • Turcot syndrome