Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients

Carlo Robotti; Giovanni Costantini; Giovanni Saggio; Valerio Cesarini; Anna Calastri; Eugenia Maiorano; Davide Piloni; Tiziano Perrone; Umberto Sabatini; Virginia Valeria Ferretti; Irene Cassaniti; Fausto Baldanti; Andrea Gravina; Ahmed Sakib; Elena Alessi; Filomena Pietrantonio; Matteo Pascucci; Daniele Casali; Zakarya Zarezadeh; Vincenzo Del Zoppo; Antonio Pisani; Marco Benazzo

doi:10.1016/j.jvoice.2021.11.004

J Voice. 2024 May;38(3):796.e1-796.e13. doi: 10.1016/j.jvoice.2021.11.004. Epub 2021 Nov 26.

Authors

Carlo Robotti¹, Giovanni Costantini², Giovanni Saggio³, Valerio Cesarini⁴, Anna Calastri⁵, Eugenia Maiorano⁵, Davide Piloni⁶, Tiziano Perrone⁷, Umberto Sabatini⁷, Virginia Valeria Ferretti⁸, Irene Cassaniti⁹, Fausto Baldanti¹⁰, Andrea Gravina¹¹, Ahmed Sakib¹¹, Elena Alessi¹², Filomena Pietrantonio¹², Matteo Pascucci¹², Daniele Casali⁴, Zakarya Zarezadeh⁴, Vincenzo Del Zoppo⁴, Antonio Pisani¹³, Marco Benazzo¹⁴

Affiliations

¹ Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy. Electronic address: carlorobotti@gmail.com.
² Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy. Electronic address: costantini@uniroma2.it.
³ Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy. Electronic address: saggio@uniroma2.it.
⁴ Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy.
⁵ Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.
⁶ Pneumology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.
⁷ Department of Internal Medicine, Fondazione IRCCS Policlinico San Matteo, University of Pavia, Pavia, Italy.
⁸ Clinical Epidemiology and Biometry Unit, Fondazione IRCCS Policlinico San Matteo Foundation, Pavia, Italy.
⁹ Molecular Virology Unit, Microbiology and Virology Department, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.
¹⁰ Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy; Molecular Virology Unit, Microbiology and Virology Department, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.
¹¹ Otorhinolaryngology Department, University of Rome Tor Vergata, Rome, Italy.
¹² Internal Medicine Unit, Ospedale dei Castelli ASL Roma 6, Ariccia, Italy.
¹³ Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy; IRCCS Mondino Foundation, Pavia, Italy.
¹⁴ Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy.

Abstract

Many virological tests have been implemented during the Coronavirus Disease 2019 (COVID-19) pandemic for diagnosis, but they appear unsuitable for screening. Furthermore, current screening strategies are not accurate enough to effectively curb the spread of the disease. Therefore, the present study was conducted within a controlled clinical environment to determine whether there are detectable variations in the voice of COVID-19 patients, recovered subjects and healthy subjects, and whether machine learning-based voice assessment (MLVA) can accurately discriminate between them, thus potentially serving as a more effective mass-screening tool. Three different subpopulations were consecutively recruited: positive COVID-19 patients, recovered COVID-19 patients and healthy individuals as controls. Positive patients were recruited within 10 days from nasal swab positivity. Recovery from COVID-19 was established clinically, virologically and radiologically. Healthy individuals reported no COVID-19 symptoms and yielded negative results at serological testing. All study participants provided three trials for each of multiple vocal tasks (sustained vowel phonation, speech, cough). All recordings were first analyzed in three different binary classifications using a pipeline of feature selection, feature ranking and a cross-validated radial basis function support vector machine (RBF-SVM). This yielded a mean accuracy of 90.24%, a mean sensitivity of 91.15%, a mean specificity of 89.13% and a mean AUC of 0.94 across all tasks and all comparisons, and identified the sustained vowel as the most effective vocal task for COVID-19 discrimination. Moreover, a three-way classification was carried out on an external test set comprising 30 subjects, 10 per class, with a mean accuracy of 80% and an accuracy of 100% for the detection of positive subjects. Within this assessment, recovered individuals proved to be the most difficult class to identify, and all the misclassified subjects were declared positive; this might be related to short- and mid-term vocal traces of COVID-19 that persist even after the clinical resolution of the infection. In conclusion, MLVA may accurately discriminate between positive COVID-19 patients, recovered COVID-19 patients and healthy individuals. Further studies should test MLVA among larger populations and asymptomatic positive COVID-19 patients to validate this novel screening technology and assess its potential as a more effective surveillance strategy for COVID-19.
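
The performance figures in the abstract (accuracy, sensitivity, specificity) follow the standard confusion-matrix definitions. The minimal Python sketch below illustrates how such values are computed for one binary comparison. The counts in `example` are hypothetical and chosen only for illustration; they are not data from the study, and the class and function names are assumptions introduced here. The only figures taken from the abstract are the 30-subject external test set and its 80% mean accuracy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BinaryCounts:
    """Confusion-matrix counts for one binary comparison (hypothetical helper)."""
    tp: int  # positive-class subjects classified as positive
    fn: int  # positive-class subjects classified as negative
    tn: int  # negative-class subjects classified as negative
    fp: int  # negative-class subjects classified as positive


def accuracy(c: BinaryCounts) -> float:
    """Fraction of all subjects classified correctly."""
    return (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)


def sensitivity(c: BinaryCounts) -> float:
    """Fraction of positive-class subjects that were detected."""
    return c.tp / (c.tp + c.fn)


def specificity(c: BinaryCounts) -> float:
    """Fraction of negative-class subjects that were correctly rejected."""
    return c.tn / (c.tn + c.fp)


# Hypothetical counts for illustration only; NOT results from the study.
example = BinaryCounts(tp=9, fn=1, tn=8, fp=2)

# From the abstract: 80% accuracy on 30 external test subjects
# corresponds to 24 correctly classified subjects.
three_way_correct = round(0.80 * 30)

if __name__ == "__main__":
    print(f"accuracy    = {accuracy(example):.2f}")
    print(f"sensitivity = {sensitivity(example):.2f}")
    print(f"specificity = {specificity(example):.2f}")
    print(f"correct subjects in three-way test = {three_way_correct}")
```

The "mean" values reported in the abstract would then be averages of such per-comparison, per-task metrics; the exact averaging scheme is not specified in the abstract.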

Keywords: Accuracy; Cough; SARS-CoV-2; Screening test; Sensitivity; Surveillance.

MeSH terms

Adult
Aged
COVID-19 Testing / methods
COVID-19* / diagnosis
Case-Control Studies
Cough / physiopathology
Cough / virology
Female
Humans
Machine Learning*
Male
Middle Aged
Phonation
Predictive Value of Tests
Reproducibility of Results
SARS-CoV-2
Speech Acoustics
Speech Production Measurement / methods
Voice Disorders / diagnosis
Voice Disorders / physiopathology
Voice Quality*