An efficient and robust Phonocardiography (PCG)-based Valvular Heart Diseases (VHD) detection framework using Vision Transformer (ViT)

Comput Biol Med. 2023 May:158:106734. doi: 10.1016/j.compbiomed.2023.106734. Epub 2023 Mar 2.

Abstract

Background and objectives: Valvular heart diseases (VHDs) are one of the dominant causes of cardiovascular abnormalities that have been associated with high mortality rates globally. Rapid and accurate diagnosis of the early stage of VHD based on cardiac phonocardiogram (PCG) signal is critical that allows for optimum medication and reduction of mortality rate.

Methods: To this end, the current study proposes novel deep learning (DL)-based high-performance VHD detection frameworks that are relatively simpler in terms of network structures, yet effective for accurately detecting multiple VHDs. We present three different frameworks considering both 1D and 2D PCG raw signals. For 1D PCG, Mel frequency cepstral coefficients (MFCC) and linear prediction cepstral coefficients (LPCC) features, whereas, for 2D PCG, various deep convolutional neural networks (D-CNNs) features are extracted. Additionally, nature/bio-inspired algorithms (NIA/BIA) including particle swarm optimization (PSO) and genetic algorithm (GA) have been utilized for automatic and efficient feature selection directly from the raw PCG signal. To further improve the performance of the classifier, vision transformer (ViT) has been implemented levering the self-attention mechanism on the time frequency representation (TFR) of 2D PCG signal. Our extensive study presents a comparative performance analysis and the scope of enhancement for the combination of different descriptors, classifiers, and feature selection algorithms.

Main results: Among all classifiers, ViT provides the best performance by achieving mean average accuracy Acc of 99.90 % and F1-score of 99.95 % outperforming current state-of-the-art VHD classification models.

Conclusions: The present research provides a robust and efficient DL-based end-to-end PCG signal classification framework for designing a automated high-performance VHD diagnosis system.

Keywords: D-CNNs; Deep learning (DL); Machine learning (ML); Phonocardiography (PCG); Valvular heart diseases (VHD); Vision Transformer (ViT).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Heart Sounds*
  • Heart Valve Diseases* / diagnostic imaging
  • Humans
  • Neural Networks, Computer
  • Phonocardiography
  • Signal Processing, Computer-Assisted