Improved accuracy in colorectal cancer tissue decomposition through refinement of established deep learning solutions

Sci Rep. 2023 Sep 23;13(1):15879. doi: 10.1038/s41598-023-42357-x.

Abstract

Hematoxylin and eosin-stained biopsy slides are regularly available for colorectal cancer patients. These slides are often not used to define objective biomarkers for patient stratification and treatment selection. Standard biomarkers often rely on costly and slow genetic tests. However, recent work has shown that relevant biomarkers can be extracted from these images using convolutional neural networks (CNNs). The CNN-based biomarkers predicted colorectal cancer patient outcomes comparably to gold standards. Extracting CNN-based biomarkers is fast, automatic, and of minimal cost. CNN-based biomarkers rely on the ability of CNNs to recognize distinct tissue types in microscope whole slide images. The quality of these biomarkers (coined 'Deep Stroma') depends on the accuracy of CNNs in decomposing all relevant tissue classes. Improving tissue decomposition accuracy is essential for improving the prognostic potential of CNN-based biomarkers. In this study, we implemented a novel training strategy to refine an established CNN model, which then surpassed all previous solutions. We obtained a 95.6% average accuracy in the external test set and 99.5% in the internal test set. Our approach reduced errors in biomarker-relevant classes, such as Lymphocytes, and was the first to include interpretability methods. These methods were used to better understand our model's limitations and capabilities.
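
The abstract reports an average accuracy and errors in individual tissue classes without stating how either is computed. As a minimal sketch, assuming tile-level labels and predictions are available as plain lists, the two quantities could be derived as follows. The function names, the class names other than Lymphocytes, and the toy labels are hypothetical and are not taken from the study. Whether "average accuracy" means overall accuracy or the mean over classes is also an assumption, so both are shown.

```python
from collections import Counter


def per_class_accuracy(y_true, y_pred):
    """Fraction of tiles of each true class that were predicted correctly."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {cls: correct[cls] / total[cls] for cls in total}


def overall_accuracy(y_true, y_pred):
    """Fraction of all tiles predicted correctly, regardless of class."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_class_accuracy(y_true, y_pred):
    """Unweighted mean of the per-class accuracies."""
    acc = per_class_accuracy(y_true, y_pred)
    return sum(acc.values()) / len(acc)


# Toy labels invented for illustration; not data from the study.
y_true = ["Lymphocytes", "Lymphocytes", "Stroma", "Stroma",
          "Tumor", "Tumor", "Tumor", "Tumor"]
y_pred = ["Lymphocytes", "Stroma", "Stroma", "Stroma",
          "Tumor", "Tumor", "Tumor", "Tumor"]

if __name__ == "__main__":
    print(per_class_accuracy(y_true, y_pred))
    print(overall_accuracy(y_true, y_pred))
    print(mean_class_accuracy(y_true, y_pred))
```

In this toy case a single misclassified Lymphocytes tile barely moves the overall accuracy but halves the accuracy of that class, which is why per-class errors matter for biomarker-relevant classes.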

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biopsy
  • Colorectal Neoplasms*
  • Deep Learning*
  • Eosine Yellowish-(YS)
  • Genetic Testing
  • Humans

Substances

  • Eosine Yellowish-(YS)