Automated classification of cytogenetic abnormalities in hematolymphoid neoplasms

Bioinformatics. 2022 Feb 7;38(5):1420-1426. doi: 10.1093/bioinformatics/btab822.

Abstract

Motivation: Algorithms for classifying chromosomes, like convolutional deep neural networks (CNNs), show promise to augment cytogeneticists' workflows; however, a critical limitation is their inability to accurately classify various structural chromosomal abnormalities. In hematopathology, recurrent structural cytogenetic abnormalities herald diagnostic, prognostic and therapeutic implications, but are laborious for expert cytogeneticists to identify. Non-recurrent cytogenetic abnormalities also occur frequently cancerous cells. Here, we demonstrate the feasibility of using CNNs to accurately classify many recurrent cytogenetic abnormalities while being able to reliably detect non-recurrent, spurious abnormal chromosomes, as well as provide insights into dataset assembly, model selection and training methodology that improve overall generalizability and performance for chromosome classification.

Results: Our top-performing model achieved a mean weighted F1 score of 96.86% on the validation set and 94.03% on the test set. Gradient class activation maps indicated that our model learned biologically meaningful feature maps, reinforcing the clinical utility of our proposed approach. Altogether, this work: proposes a new dataset framework for training chromosome classifiers for use in a clinical environment, reveals that residual CNNs and cyclical learning rates confer superior performance, and demonstrates the feasibility of using this approach to automatically screen for many recurrent cytogenetic abnormalities while adeptly classifying non-recurrent abnormal chromosomes.

Availability and implementation: Software is freely available at https://github.com/DaehwanKimLab/Chromosome-ReAd. The data underlying this article cannot be shared publicly due to it being protected patient information.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosome Aberrations
  • Humans
  • Neoplasms*
  • Neural Networks, Computer*
  • Software