Comparative Analysis of CNN and RNN for Voice Pathology Detection

Biomed Res Int. 2021 Apr 14:2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.

Abstract

Diagnosis on the basis of a computerized acoustic examination may play an incredibly important role in early diagnosis and in monitoring and even improving effective pathological speech diagnostics. Various acoustic metrics test the health of the voice. The precision of these parameters also has to do with algorithms for the detection of speech noise. The idea is to detect the disease pathology from the voice. First, we apply the feature extraction on the SVD dataset. After the feature extraction, the system input goes into the 27 neuronal layer neural networks that are convolutional and recurrent neural network. We divided the dataset into training and testing, and after 10 k-fold validation, the reported accuracies of CNN and RNN are 87.11% and 86.52%, respectively. A 10-fold cross-validation is used to evaluate the performance of the classifier. On a Linux workstation with one NVidia Titan X GPU, program code was written in Python using the TensorFlow package.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Databases as Topic
  • Humans
  • Models, Theoretical
  • Neural Networks, Computer*
  • Voice Disorders / diagnosis*