A Clinical Dataset and Various Baselines for Chromosome Instance Segmentation

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):31-39. doi: 10.1109/TCBB.2021.3089507. Epub 2022 Feb 3.

Abstract

Background: In medicine, chromosome karyotyping analysis plays a crucial role in prenatal diagnosis for diagnosing whether a fetus has severe defects or genetic diseases. However, chromosome instance segmentation is the most critical obstacle to automatic chromosome karyotyping analysis due to the complicated morphological characteristics of chromosome clusters, restricting chromosome karyotyping analysis to highly depend on skilled clinical analysts.

Method: In this paper, we build a clinical dataset and propose multiple segmentation baselines to tackle the chromosome instance segmentation problem of various overlapping and touching chromosome clusters. First, we construct a clinical dataset for deep learning-based chromosome instance segmentation models by collecting and annotating 1,655 privacy-removal chromosome clusters. After that, we design a chromosome instance labeled dataset augmentation (CILA) algorithm for the clinical dataset to improve the generalization performance of deep learning-based models. Last, we propose a chromosome instance segmentation framework and implement multiple baselines for the proposed framework based on various instance segmentation models.

Results and conclusions: Experiments evaluated on the clinical dataset show that the best baseline of the proposed framework based on the Mask-RCNN model yields an outstanding result with 77% mAP, 97.5% AP50, and 95.5% AP75 segmentation precision, and 95.38% accuracy, which exceeds results reported in current chromosome instance segmentation methods. The quantitative evaluation results demonstrate the effectiveness and advancement of the proposed method for the chromosome instance segmentation problem. The experimental code and privacy-removal clinical dataset can be found at Github.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosomes*
  • Image Processing, Computer-Assisted*