Development and application of a deep learning-based comprehensive early diagnostic model for chronic obstructive pulmonary disease

Respir Res. 2024 Apr 18;25(1):167. doi: 10.1186/s12931-024-02793-3.

Abstract

Background: Chronic obstructive pulmonary disease (COPD) is a frequently diagnosed yet treatable condition, provided it is identified early and managed effectively. This study aims to develop an advanced COPD diagnostic model by integrating deep learning and radiomics features.

Methods: We utilized a dataset comprising CT images from 2,983 participants, of which 2,317 participants also provided epidemiological data through questionnaires. Deep learning features were extracted using a Variational Autoencoder, and radiomics features were obtained using the PyRadiomics package. Multi-Layer Perceptrons were used to construct models based on deep learning and radiomics features independently, as well as a fusion model integrating both. Subsequently, epidemiological questionnaire data were incorporated to establish a more comprehensive model. The diagnostic performance of standalone models, the fusion model and the comprehensive model was evaluated and compared using metrics including accuracy, precision, recall, F1-score, Brier score, receiver operating characteristic curves, and area under the curve (AUC).

Results: The fusion model exhibited outstanding performance with an AUC of 0.952, surpassing the standalone models based solely on deep learning features (AUC = 0.844) or radiomics features (AUC = 0.944). Notably, the comprehensive model, incorporating deep learning features, radiomics features, and questionnaire variables demonstrated the highest diagnostic performance among all models, yielding an AUC of 0.971.

Conclusion: We developed and implemented a data fusion strategy to construct a state-of-the-art COPD diagnostic model integrating deep learning features, radiomics features, and questionnaire variables. Our data fusion strategy proved effective, and the model can be easily deployed in clinical settings.

Trial registration: Not applicable. This study is NOT a clinical trial, it does not report the results of a health care intervention on human participants.

Keywords: COPD; Data fusion; Deep learning; Diagnostic.

MeSH terms

  • Area Under Curve
  • Deep Learning*
  • Humans
  • Neural Networks, Computer
  • Pulmonary Disease, Chronic Obstructive* / diagnostic imaging
  • Pulmonary Disease, Chronic Obstructive* / epidemiology
  • ROC Curve
  • Retrospective Studies