Deep learning-based histopathological segmentation for whole slide images of colorectal cancer in a compressed domain

Sci Rep. 2021 Nov 18;11(1):22520. doi: 10.1038/s41598-021-01905-z.

Abstract

Automatic pattern recognition using deep learning techniques has become increasingly important. Unfortunately, due to limited system memory, general preprocessing methods for high-resolution images in the spatial domain can lose important data information such as high-frequency information and the region of interest. To overcome these limitations, we propose an image segmentation approach in the compressed domain based on principal component analysis (PCA) and discrete wavelet transform (DWT). After inference for each tile using neural networks, a whole prediction image was reconstructed by wavelet weighted ensemble (WWE) based on inverse discrete wavelet transform (IDWT). The training and validation were performed using 351 colorectal biopsy specimens, which were pathologically confirmed by two pathologists. For 39 test datasets, the average Dice score, the pixel accuracy, and the Jaccard score were 0.804 ± 0.125, 0.957 ± 0.025, and 0.690 ± 0.174, respectively. We can train the networks for the high-resolution image with the large region of interest compared to the result in the low-resolution and the small region of interest in the spatial domain. The average Dice score, pixel accuracy, and Jaccard score are significantly increased by 2.7%, 0.9%, and 2.7%, respectively. We believe that our approach has great potential for accurate diagnosis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Colorectal Neoplasms / diagnostic imaging*
  • Computer Graphics
  • Computers
  • Deep Learning*
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Machine Learning
  • Neural Networks, Computer
  • Predictive Value of Tests
  • Principal Component Analysis
  • Probability
  • Reproducibility of Results
  • Software
  • User-Computer Interface
  • Wavelet Analysis