Automatic Feature Construction Based on Genetic Programming for Survival Prediction in Lung Cancer Using CT Images

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul:2022:3797-3800. doi: 10.1109/EMBC48229.2022.9871039.

Abstract

In the radiomics workflow, machine learning builds classification models from a set of input features. However, some features can be irrelevant and redundant, reducing the classification performance. This paper proposes using the Genetic Programming (GP) algorithm to automatically construct a reduced number of independent and relevant radiomic features. The proposed method is applied to patients affected by Non-Small Cell Lung Cancer (NSCLC) with pre-operative computed tomography (CT) images to predict the two-year survival by the use of linear classifiers. The model built using GP features is compared with benchmark models built using traditional features. The use of the GP algorithm increased classification performance: [Formula: see text] for the proposed model vs. [Formula: see text] and 0.64 for the benchmark models. Hence, the proposed approach better stratifies patients at high and low risk according to their overall postoperative survival time.

MeSH terms

  • Benchmarking
  • Carcinoma, Non-Small-Cell Lung* / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung* / genetics
  • Humans
  • Lung Neoplasms* / diagnostic imaging
  • Lung Neoplasms* / genetics
  • Machine Learning
  • Tomography, X-Ray Computed