Comparative Analysis of CNN and RNN for Voice Pathology Detection

Sidra Abid Syed; Munaf Rashid; Samreen Hussain; Hira Zahid

doi:10.1155/2021/6635964

Comparative Analysis of CNN and RNN for Voice Pathology Detection

Biomed Res Int. 2021 Apr 14:2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.

Authors

Sidra Abid Syed¹, Munaf Rashid², Samreen Hussain³, Hira Zahid⁴

Affiliations

¹ Department of Biomedical Engineering and Department of Electrical Engineering, Ziauddin University Faculty of Engineering Science, Technology, and Management, Karachi, Pakistan.
² Department of Electrical Engineering and Department of Software Engineering, Ziauddin University Faculty of Engineering Science, Technology, and Management, Karachi, Pakistan.
³ Vice Chancellor, Begum Nusrat Bhutto Women University, Sukkur, Pakistan.
⁴ Department of Biomedical Engineering, Ziauddin University Faculty of Engineering Science, Technology, and Management, Karachi, Pakistan.

Abstract

Diagnosis on the basis of a computerized acoustic examination may play an incredibly important role in early diagnosis and in monitoring and even improving effective pathological speech diagnostics. Various acoustic metrics test the health of the voice. The precision of these parameters also has to do with algorithms for the detection of speech noise. The idea is to detect the disease pathology from the voice. First, we apply the feature extraction on the SVD dataset. After the feature extraction, the system input goes into the 27 neuronal layer neural networks that are convolutional and recurrent neural network. We divided the dataset into training and testing, and after 10 k-fold validation, the reported accuracies of CNN and RNN are 87.11% and 86.52%, respectively. A 10-fold cross-validation is used to evaluate the performance of the classifier. On a Linux workstation with one NVidia Titan X GPU, program code was written in Python using the TensorFlow package.

Publication types

Comparative Study

MeSH terms

Algorithms
Databases as Topic
Humans
Models, Theoretical
Neural Networks, Computer*
Voice Disorders / diagnosis*