A new optical density granulometry-based descriptor for the classification of prostate histological images using shallow and deep Gaussian processes

Comput Methods Programs Biomed. 2019 Sep:178:303-317. doi: 10.1016/j.cmpb.2019.07.003. Epub 2019 Jul 4.

Abstract

Background and objective: Prostate cancer is one of the most common male tumors. The increasing use of whole slide digital scanners has led to an enormous interest in the application of machine learning techniques to histopathological image classification. Here we introduce a novel family of morphological descriptors which, extracted in the appropriate image space and combined with shallow and deep Gaussian process based classifiers, improves early prostate cancer diagnosis.

Method: We decompose the acquired RGB image in its RGB and optical density hematoxylin and eosin components. Then, we define two novel granulometry-based descriptors which work in both, RGB and optical density, spaces but perform better when used on the latter. In this space they clearly encapsulate knowledge used by pathologists to identify cancer lesions. The obtained features become the inputs to shallow and deep Gaussian process classifiers which achieve an accurate prediction of cancer.

Results: We have used a real and unique dataset. The dataset is composed of 60 Whole Slide Images. For a five fold cross validation, shallow and deep Gaussian Processes obtain area under ROC curve values higher than 0.98. They outperform current state of the art patch based shallow classifiers and are very competitive to the best performing deep learning method. Models were also compared on 17 Whole Slide test Images using the FROC curve. With the cost of one false positive, the best performing method, the one layer Gaussian process, identifies 83.87% (sensitivity) of all annotated cancer in the Whole Slide Image. This result corroborates the quality of the extracted features, no more than a layer is needed to achieve excellent generalization results.

Conclusion: Two new descriptors to extract morphological features from histological images have been proposed. They collect very relevant information for cancer detection. From these descriptors, shallow and deep Gaussian Processes are capable of extracting the complex structure of prostate histological images. The new space/descriptor/classifier paradigm outperforms state-of-art shallow classifiers. Furthermore, despite being much simpler, it is competitive to state-of-art CNN architectures both on the proposed SICAPv1 database and on an external database.

Keywords: Deep Gaussian processes; Gaussian processes; Granulometries; Histopathological images; Prostate cancer; Variational inference.

MeSH terms

  • Algorithms
  • Area Under Curve
  • Color
  • Databases, Factual
  • Diagnosis, Computer-Assisted
  • False Positive Reactions
  • Hospitals
  • Humans
  • Image Processing, Computer-Assisted*
  • Machine Learning
  • Male
  • Normal Distribution
  • Probability
  • Prostate / diagnostic imaging*
  • Prostatic Neoplasms / diagnostic imaging*
  • ROC Curve