C-NMC: B-lineage acute lymphoblastic leukaemia: A blood cancer dataset

Med Eng Phys. 2022 May:103:103793. doi: 10.1016/j.medengphy.2022.103793. Epub 2022 Mar 26.

Abstract

Development of computer-aided cancer diagnostic tools is an active research area owing to the advancements in deep-learning domain. Such technological solutions provide affordable and easily deployable diagnostic tools. Leukaemia, or blood cancer, is one of the leading cancers causing more than 0.3 million deaths every year. In order to aid the development of such an AI-enabled tool, we collected and curated a microscopic image dataset, namely C-NMC, of more than 15000 cancer cell images at a very high resolution of B-Lineage Acute Lymphoblastic Leukaemia (B-ALL). The dataset is prepared at the subject-level and contains images of both healthy and cancer patients. So far, this is the largest (as well as curated) dataset on B-ALL cancer in the public domain. C-NMC is available at The Cancer Imaging Archive (TCIA), USA and can be helpful for the research community worldwide for the development of B-ALL cancer diagnostic tools. This dataset was utilized in an international medical imaging challenge held at ISBI 2019 conference in Venice, Italy. In this paper, we present a detailed description and challenges of this dataset. We also present benchmarking results of all the methods applied so far on this dataset.

Keywords: Acute lymphoblastic leukaemia; Cancer dataset; Cancer diagnostics; Computer aided diagnosis; Image database; Microscopic image.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Diagnostic Imaging
  • Humans
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma*