Lung cancer diagnosis using deep attention-based multiple instance learning and radiomics

Med Phys. 2022 May;49(5):3134-3143. doi: 10.1002/mp.15539. Epub 2022 Mar 3.

Abstract

Background: Early diagnosis of lung cancer is a key intervention for the treatment of lung cancer in which computer-aided diagnosis (CAD) can play a crucial role. Most published CAD methods perform lung cancer diagnosis by classifying each lung nodule in isolation. However, this does not reflect clinical practice, where clinicians diagnose a patient based on a set of images of nodules, instead of looking at one nodule at a time. Besides, the low interpretability of the output provided by these methods presents an important barrier for their adoption.

Method: In this article, we treat lung cancer diagnosis as a multiple instance learning (MIL) problem, which better reflects the diagnosis process in the clinical setting and provides higher interpretability of the output. We selected radiomics as the source of input features and deep attention-based MIL as the classification algorithm. The attention mechanism provides higher interpretability by estimating the importance of each instance in the set for the final diagnosis. To improve the model's performance in a small imbalanced dataset, we propose a new bag simulation method for MIL.

Results and conclusion: The results show that our method can achieve a mean accuracy of 0.807 with a standard error of the mean (SEM) of 0.069, a recall of 0.870 (SEM 0.061), a positive predictive value of 0.928 (SEM 0.078), a negative predictive value of 0.591 $0.591$ (SEM 0.155), and an area under the curve (AUC) of 0.842 (SEM 0.074), outperforming other MIL methods. Additional experiments show that the proposed oversampling strategy significantly improves the model's performance. In addition, experiments show that our method provides a good indication of the importance of each nodule in determining the diagnosis, which combined with the well-defined radiomic features, to make the results more interpretable and acceptable for doctors and patients.

Keywords: attention mechanism; lung cancer diagnosis; multiple instance learning; radiomics.

MeSH terms

  • Diagnosis, Computer-Assisted
  • Humans
  • Lung
  • Lung Neoplasms* / diagnostic imaging
  • Thorax
  • Tomography, X-Ray Computed* / methods