Considerations and Challenges for Real-World Deployment of an Acoustic-Based COVID-19 Screening System

Drew Grant; Ian McLane; Valerie Rennoll; James West

doi:10.3390/s22239530

Considerations and Challenges for Real-World Deployment of an Acoustic-Based COVID-19 Screening System

Sensors (Basel). 2022 Dec 6;22(23):9530. doi: 10.3390/s22239530.

Authors

Drew Grant¹, Ian McLane¹, Valerie Rennoll¹, James West¹

Affiliation

¹ Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.

Abstract

Coronavirus disease 2019 (COVID-19) has led to countless deaths and widespread global disruptions. Acoustic-based artificial intelligence (AI) tools could provide a simple, scalable, and prompt method to screen for COVID-19 using easily acquirable physiological sounds. These systems have been demonstrated previously and have shown promise but lack robust analysis of their deployment in real-world settings when faced with diverse recording equipment, noise environments, and test subjects. The primary aim of this work is to begin to understand the impacts of these real-world deployment challenges on the system performance. Using Mel-Frequency Cepstral Coefficients (MFCC) and RelAtive SpecTrAl-Perceptual Linear Prediction (RASTA-PLP) features extracted from cough, speech, and breathing sounds in a crowdsourced dataset, we present a baseline classification system that obtains an average receiver operating characteristic area under the curve (AUC-ROC) of 0.77 when discriminating between COVID-19 and non-COVID subjects. The classifier performance is then evaluated on four additional datasets, resulting in performance variations between 0.64 and 0.87 AUC-ROC, depending on the sound type. By analyzing subsets of the available recordings, it is noted that the system performance degrades with certain recording devices, noise contamination, and with symptom status. Furthermore, performance degrades when a uniform classification threshold from the training data is subsequently used across all datasets. However, the system performance is robust to confounding factors, such as gender, age group, and the presence of other respiratory conditions. Finally, when analyzing multiple speech recordings from the same subjects, the system achieves promising performance with an AUC-ROC of 0.78, though the classification does appear to be impacted by natural speech variations. Overall, the proposed system, and by extension other acoustic-based diagnostic aids in the literature, could provide comparable accuracy to rapid antigen testing but significant deployment challenges need to be understood and addressed prior to clinical use.

Keywords: COVID-19; acoustics; digital forensics; healthcare; machine learning; respiratory diagnosis; telemedicine.

Publication types

Dataset

MeSH terms

Acoustics
Artificial Intelligence*
COVID-19* / diagnosis
Humans
Respiratory Sounds
Sound

Grants and funding

This research received no external funding.