Hybrid Techniques for the Diagnosis of Acute Lymphoblastic Leukemia Based on Fusion of CNN Features

Diagnostics (Basel). 2023 Mar 8;13(6):1026. doi: 10.3390/diagnostics13061026.

Abstract

Acute lymphoblastic leukemia (ALL) is one of the deadliest forms of leukemia, in which the bone marrow produces an excessive number of white blood cells (WBC). ALL is one of the most common types of cancer in children and adults. Doctors determine the treatment of leukemia according to its stage and its spread in the body, relying on the analysis of blood samples under a microscope. Pathologists face challenges such as the similarity between infected and normal WBC in the early stages. Manual diagnosis is prone to errors and differences of opinion, and there are too few experienced pathologists for the number of patients. Thus, computer-assisted systems play an essential role in assisting pathologists in the early detection of ALL. In this study, highly efficient and accurate systems were developed to analyze the images of the C-NMC 2019 and ALL-IDB2 datasets. In all proposed systems, blood micrographs were enhanced and then fed to the active contour method to extract WBC-only regions for further analysis by three CNN models (DenseNet121, ResNet50, and MobileNet). The first strategy for analyzing ALL images of the two datasets is the hybrid technique of CNN-RF and CNN-XGBoost. The DenseNet121, ResNet50, and MobileNet models extract deep feature maps. CNN models produce high-dimensional features that include redundant and non-significant ones. Therefore, the CNN deep feature maps were fed to Principal Component Analysis (PCA) to select highly representative features, which were then sent to Random Forest (RF) and XGBoost classifiers for classification, motivated by the high similarity between infected and normal WBC in the early stages. The second strategy analyzes ALL images using serially fused features of the CNN models: the deep feature maps of DenseNet121-ResNet50, ResNet50-MobileNet, DenseNet121-MobileNet, and DenseNet121-ResNet50-MobileNet were merged and then classified by the RF and XGBoost classifiers.
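As an illustration of the serial fusion and PCA steps described above, the following is a minimal sketch and not the authors' implementation. It assumes each CNN yields one feature vector per image, and it uses random arrays in place of real deep features. The feature widths (1024, 2048, 1024), the number of retained components (20), and the function names `fuse_features` and `pca_reduce` are all hypothetical. The abstract does not state whether PCA is applied before or after merging; this sketch applies it after. The reduced matrix would then be passed to an RF or XGBoost classifier, which is omitted here.

```python
import numpy as np


def fuse_features(*feature_sets):
    """Serially fuse feature matrices of shape (n_samples, d_i) by joining columns."""
    n_samples = feature_sets[0].shape[0]
    if any(f.shape[0] != n_samples for f in feature_sets):
        raise ValueError("all feature sets must describe the same samples")
    return np.concatenate(feature_sets, axis=1)


def pca_reduce(features, n_components):
    """Project features onto their top principal components via SVD."""
    centered = features - features.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


# Placeholder data standing in for deep features (assumed widths, not from the paper).
rng = np.random.default_rng(0)
n_images = 40
densenet_feats = rng.normal(size=(n_images, 1024))
resnet_feats = rng.normal(size=(n_images, 2048))
mobilenet_feats = rng.normal(size=(n_images, 1024))

fused = fuse_features(densenet_feats, resnet_feats, mobilenet_feats)
reduced = pca_reduce(fused, n_components=20)
print(fused.shape, reduced.shape)
```

Concatenating along the feature axis keeps one row per image, so labels stay aligned with the fused matrix regardless of how many CNN models contribute.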
The RF classifier with fused features for DenseNet121-ResNet50-MobileNet reached an AUC of 99.1%, accuracy of 98.8%, sensitivity of 98.45%, precision of 98.7%, and specificity of 98.85% for the C-NMC 2019 dataset. On the ALL-IDB2 dataset, the hybrid systems achieved 100% for AUC, accuracy, sensitivity, precision, and specificity.
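For readers unfamiliar with the reported metrics, the sketch below shows how accuracy, sensitivity, precision, and specificity follow from the counts of a binary confusion matrix, using their standard definitions. The counts are hypothetical and are not taken from the study; the function name `binary_metrics` is likewise illustrative. AUC is not included because it requires classifier scores rather than counts.

```python
def binary_metrics(tp, fn, fp, tn):
    """Compute standard binary classification metrics from confusion-matrix counts."""
    total = tp + fn + fp + tn
    return {
        "accuracy": (tp + tn) / total,   # correct predictions over all samples
        "sensitivity": tp / (tp + fn),   # recall on the positive (ALL) class
        "precision": tp / (tp + fp),     # fraction of positive calls that are correct
        "specificity": tn / (tn + fp),   # recall on the negative (normal) class
    }


# Hypothetical counts for illustration only.
metrics = binary_metrics(tp=90, fn=10, fp=5, tn=95)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```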

Keywords: ALL; CNN; PCA; RF; XGBoost; hybrid method.