Efficient clinical data analysis for prediction of coal workers' pneumoconiosis using machine learning algorithms

Clin Respir J. 2023 Jul;17(7):684-693. doi: 10.1111/crj.13657. Epub 2023 Jun 28.

Abstract

Purpose: The purpose of this study is to propose an efficient coal workers' pneumoconiosis (CWP) clinical prediction system and put it into clinical use for clinical diagnosis of pneumoconiosis.

Methods: Patients with CWP and dust-exposed workers who were enrolled from August 2021 to December 2021 were included in this study. Firstly, we chose the embedded method through using three feature selection approaches to perform the prediction analysis. Then, we performed the machine learning algorithms as the model backbone and combined them with three feature selection methods, respectively, to determine the optimal predictive model for CWP.

Results: Through applying three feature selection approaches based on machine learning algorithms, it was found that AaDO2 and some pulmonary function indicators played an important role in prediction for identifying CWP of early stage. The support vector machine (SVM) algorithm was proved as the optimal machine learning model for predicting CWP, with the ROC curves obtained from three feature selection methods using SVM algorithm whose AUC values of 97.78%, 93.7%, and 95.56%, respectively.

Conclusion: We developed the optimal model (SVM algorithm) through comparisons and analyses among the performances of different models for the prediction of CWP as a clinical application.

Keywords: coal workers' pneumoconiosis clinical prediction; feature selection; machine learning.

MeSH terms

  • Anthracosis*
  • Coal
  • Coal Mining*
  • Dust / analysis
  • Humans
  • Pneumoconiosis*

Substances

  • Dust
  • Coal