Radiomic Detection of EGFR Mutations in NSCLC

Cancer Res. 2021 Feb 1;81(3):724-731. doi: 10.1158/0008-5472.CAN-20-0999. Epub 2020 Nov 4.

Abstract

Radiomics is defined as the use of automated or semi-automated post-processing and analysis of multiple features derived from imaging exams. Models built on the extracted features might predict the molecular profile of solid tumors. The aim of this study was to develop a predictive algorithm to determine the mutational status of EGFR in treatment-naïve patients with advanced non-small cell lung cancer (NSCLC). CT scans from 109 treatment-naïve patients with NSCLC (21 EGFR-mutant and 88 EGFR-wild type [WT]) underwent radiomics analysis to develop a machine learning model able to distinguish EGFR-mutant from EGFR-WT patients on the basis of CT scans. A "test-retest" approach was used to identify stable radiomics features. The accuracy of the model was tested on an external validation set from another institution and on a dataset from the Cancer Imaging Archive (TCIA). The machine learning model that considered both radiomic and clinical features (gender and smoking status) reached a diagnostic accuracy of 88.1% in our dataset, with an area under the ROC curve (AUC) of 0.85, whereas the accuracy values in the datasets from TCIA and the external institution were 76.6% and 83.3%, respectively. Furthermore, 17 distinct radiomics features detected at baseline CT scan were associated with subsequent development of T790M during treatment with an EGFR inhibitor. In conclusion, our machine learning model was able to identify EGFR-mutant patients in multiple validation sets with globally good accuracy, especially after data optimization. More comprehensive training sets might result in further improvement of radiomics-based algorithms. SIGNIFICANCE: These findings demonstrate that data normalization and "test-retest" methods might improve the performance of machine learning models on radiomics images and increase their reliability when used on external validation datasets.
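The abstract does not state which statistic was used to judge feature stability in the "test-retest" step. As an illustration only, the sketch below assumes Lin's concordance correlation coefficient (CCC) with a cutoff of 0.9. Both the metric and the cutoff are assumptions, the function names `concordance_ccc` and `stable_features` are hypothetical, and the data are synthetic rather than taken from the study.

```python
import numpy as np


def concordance_ccc(x, y):
    """Lin's concordance correlation coefficient between two 1-D arrays."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    mean_x, mean_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()  # population variances (ddof=0)
    cov_xy = ((x - mean_x) * (y - mean_y)).mean()
    denom = var_x + var_y + (mean_x - mean_y) ** 2
    return 2.0 * cov_xy / denom if denom > 0 else 0.0


def stable_features(test, retest, threshold=0.9):
    """Return indices of feature columns whose test/retest CCC >= threshold.

    `test` and `retest` are (n_patients, n_features) arrays holding the same
    features extracted from two repeated scans of the same patients.
    The 0.9 threshold is an assumed value, not one reported in the abstract.
    """
    test = np.asarray(test, dtype=float)
    retest = np.asarray(retest, dtype=float)
    ccc = np.array(
        [concordance_ccc(test[:, j], retest[:, j]) for j in range(test.shape[1])]
    )
    return np.flatnonzero(ccc >= threshold), ccc


# Synthetic example: three features measured twice on 30 hypothetical patients.
rng = np.random.default_rng(0)
n_patients = 30
test = rng.normal(size=(n_patients, 3))
retest = test.copy()
retest[:, 0] += rng.normal(scale=0.05, size=n_patients)  # small noise: reproducible
retest[:, 1] = rng.normal(size=n_patients)               # unrelated on repeat: unstable
retest[:, 2] += 2.0                                      # systematic shift: penalized by CCC

kept, ccc = stable_features(test, retest)
print("stable feature indices:", kept.tolist())
print("CCC per feature:", np.round(ccc, 3).tolist())
```

Only the features that survive such a filter would then be passed to the classifier. CCC is used here because, unlike Pearson correlation, it also penalizes a systematic offset between the two measurements, as the third synthetic feature shows.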

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / diagnostic imaging
  • Adenocarcinoma / drug therapy
  • Adenocarcinoma / genetics
  • Adenocarcinoma / pathology
  • Algorithms*
  • Area Under Curve
  • Carcinoma, Non-Small-Cell Lung / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung / drug therapy
  • Carcinoma, Non-Small-Cell Lung / genetics*
  • Carcinoma, Non-Small-Cell Lung / pathology
  • ErbB Receptors / antagonists & inhibitors
  • ErbB Receptors / genetics*
  • Female
  • Humans
  • Lung Neoplasms / diagnostic imaging
  • Lung Neoplasms / drug therapy
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / pathology
  • Machine Learning*
  • Male
  • Mutation*
  • ROC Curve
  • Reproducibility of Results
  • Tomography, X-Ray Computed / methods

Substances

  • ErbB Receptors