Deep learning for survival analysis in breast cancer with whole slide image data

Bioinformatics. 2022 Jul 11;38(14):3629-3637. doi: 10.1093/bioinformatics/btac381.

Abstract

Motivation: Whole slide tissue images contain detailed data on the sub-cellular structure of cancer. Quantitative analyses of this data can lead to novel biomarkers for better cancer diagnosis and prognosis and can improve our understanding of cancer mechanisms. Such analyses are challenging to execute because of the sizes and complexity of whole slide image data and relatively limited volume of training data for machine learning methods.

Results: We propose and experimentally evaluate a multi-resolution deep learning method for breast cancer survival analysis. The proposed method integrates image data at multiple resolutions and tumor, lymphocyte and nuclear segmentation results from deep learning models. Our results show that this approach can significantly improve the deep learning model performance compared to using only the original image data. The proposed approach achieves a c-index value of 0.706 compared to a c-index value of 0.551 from an approach that uses only color image data at the highest image resolution. Furthermore, when clinical features (sex, age and cancer stage) are combined with image data, the proposed approach achieves a c-index of 0.773.

Availability and implementation: https://github.com/SBU-BMI/deep_survival_analysis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms* / diagnostic imaging
  • Breast Neoplasms* / pathology
  • Deep Learning*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Machine Learning
  • Survival Analysis