Random forest based lung nodule classification aided by clustering

Comput Med Imaging Graph. 2010 Oct;34(7):535-42. doi: 10.1016/j.compmedimag.2010.03.006. Epub 2010 Apr 28.

Abstract

An automated lung nodule detection system can help spot lung abnormalities in CT lung images. Lung nodule detection can be achieved using template-based, segmentation-based, and classification-based methods. Existing systems that include a classification component in their structure have demonstrated better performance than their counterparts. Ensemble learners combine the decisions of multiple classifiers to form an integrated output. To improve the performance of automated lung nodule detection, an ensemble classification aided by clustering (CAC) method is proposed. The method takes advantage of the random forest algorithm and offers a structure for a hybrid random forest based lung nodule classification aided by clustering. Several experiments are carried out involving the proposed method as well as two other existing methods. The parameters of the classifiers are varied to identify the best-performing classifiers. The experiments are conducted using lung scans of 32 patients, comprising 5721 images in which nodule locations are marked by expert radiologists. Overall, a best sensitivity of 98.33% and a best specificity of 97.11% have been recorded for the proposed system. A high area under the receiver operating characteristic (ROC) curve, A(z), of 0.9786 has also been achieved.
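For readers unfamiliar with the reported metrics, the following minimal sketch shows how sensitivity, specificity, and an empirical area under the ROC curve can be computed from labels and classifier scores. It is an illustration only: the function names, the toy labels and scores, and the 0.5 decision threshold are assumptions made here, not data or code from the study, and the rank-based area shown is not necessarily how the authors estimated A(z).

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int],
                            y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels (1 = nodule)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # nodules found
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # nodules missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-nodules rejected
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present")
    return tp / (tp + fn), tn / (tn + fp)


def empirical_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based (Mann-Whitney) estimate of the area under the ROC curve."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = nodule, 0 = non-nodule.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

sens, spec = sensitivity_specificity(y_true, y_pred)
auc = empirical_auc(y_true, scores)
print(f"sensitivity={sens:.4f} specificity={spec:.4f} AUC={auc:.4f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the area under the ROC curve summarizes performance across all thresholds, which is why abstracts such as this one commonly report both.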

MeSH terms

  • Cluster Analysis
  • Humans
  • Lung / diagnostic imaging
  • Pattern Recognition, Automated*
  • Radiographic Image Interpretation, Computer-Assisted*
  • Sensitivity and Specificity
  • Solitary Pulmonary Nodule / classification*
  • Solitary Pulmonary Nodule / diagnostic imaging*
  • Tomography, X-Ray Computed*