Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients

J Voice. 2024 May;38(3):796.e1-796.e13. doi: 10.1016/j.jvoice.2021.11.004. Epub 2021 Nov 26.

Abstract

Many virological tests have been implemented during the Coronavirus Disease 2019 (COVID-19) pandemic for diagnostic purposes, but they appear unsuitable for screening purposes. Furthermore, current screening strategies are not accurate enough to effectively curb the spread of the disease. Therefore, the present study was conducted within a controlled clinical environment to determine eventual detectable variations in the voice of COVID-19 patients, recovered and healthy subjects, and also to determine whether machine learning-based voice assessment (MLVA) can accurately discriminate between them, thus potentially serving as a more effective mass-screening tool. Three different subpopulations were consecutively recruited: positive COVID-19 patients, recovered COVID-19 patients and healthy individuals as controls. Positive patients were recruited within 10 days from nasal swab positivity. Recovery from COVID-19 was established clinically, virologically and radiologically. Healthy individuals reported no COVID-19 symptoms and yielded negative results at serological testing. All study participants provided three trials for multiple vocal tasks (sustained vowel phonation, speech, cough). All recordings were initially divided into three different binary classifications with a feature selection, ranking and cross-validated RBF-SVM pipeline. This brough a mean accuracy of 90.24%, a mean sensitivity of 91.15%, a mean specificity of 89.13% and a mean AUC of 0.94 across all tasks and all comparisons, and outlined the sustained vowel as the most effective vocal task for COVID discrimination. Moreover, a three-way classification was carried out on an external test set comprised of 30 subjects, 10 per class, with a mean accuracy of 80% and an accuracy of 100% for the detection of positive subjects. Within this assessment, recovered individuals proved to be the most difficult class to identify, and all the misclassified subjects were declared positive; this might be related to mid and short-term vocal traces of COVID-19, even after the clinical resolution of the infection. In conclusion, MLVA may accurately discriminate between positive COVID-19 patients, recovered COVID-19 patients and healthy individuals. Further studies should test MLVA among larger populations and asymptomatic positive COVID-19 patients to validate this novel screening technology and test its potential application as a potentially more effective surveillance strategy for COVID-19.

Keywords: Accuracy; Cough; SARS-CoV-2; Screening test; Sensitivity; Surveillance.

MeSH terms

  • Adult
  • Aged
  • COVID-19 Testing / methods
  • COVID-19* / diagnosis
  • Case-Control Studies
  • Cough / physiopathology
  • Cough / virology
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Phonation
  • Predictive Value of Tests
  • Reproducibility of Results
  • SARS-CoV-2
  • Speech Acoustics
  • Speech Production Measurement / methods
  • Voice Disorders / diagnosis
  • Voice Disorders / physiopathology
  • Voice Quality*