Machine learning helps identifying volume-confounding effects in radiomics

Phys Med. 2020 Mar:71:24-30. doi: 10.1016/j.ejmp.2020.02.010. Epub 2020 Feb 20.

Abstract

Highlighting the risk of biases in radiomics-based models will help improve their quality and increase usage as decision support systems in the clinic. In this study we use machine learning-based methods to identify the presence of volume-confounding effects in radiomics features. Methods 841 radiomics features were extracted from two retrospective publicly available datasets of lung and head neck cancers using open source software. Unsupervised hierarchical clustering and principal component analysis (PCA) identified relations between radiomics and clinical outcomes (overall survival). Bootstrapping techniques with logistic regression verified features' prognostic power and robustness. Results Over 80% of the features had large pairwise correlations. Nearly 30% of the features presented strong correlations with tumor volume. Using volume-independent features for clustering and PCA did not allow risk stratification of patients. Clinical predictors outperformed radiomics features in bootstrapping and logistic regression. Conclusions The adoption of safeguards in radiomics is imperative to improve the quality of radiomics studies. We proposed machine learning (ML) - based methods for robust radiomics signatures development.

Keywords: Head and neck; Lung; Machine learning; Predictions; Radiomics.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms
  • Carcinoma, Non-Small-Cell Lung / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung / radiotherapy
  • Carcinoma, Squamous Cell / diagnostic imaging
  • Carcinoma, Squamous Cell / radiotherapy
  • Cluster Analysis
  • Databases, Factual
  • Decision Support Systems, Clinical
  • Female
  • Humans
  • Laryngeal Neoplasms / diagnostic imaging
  • Laryngeal Neoplasms / radiotherapy
  • Lung Neoplasms / diagnostic imaging*
  • Lung Neoplasms / radiotherapy
  • Machine Learning*
  • Male
  • Middle Aged
  • Oropharyngeal Neoplasms / diagnostic imaging
  • Oropharyngeal Neoplasms / radiotherapy
  • Principal Component Analysis
  • Radiometry / methods*
  • Regression Analysis
  • Retrospective Studies
  • Software
  • Squamous Cell Carcinoma of Head and Neck / diagnostic imaging*
  • Squamous Cell Carcinoma of Head and Neck / radiotherapy
  • Tomography, X-Ray Computed