Machine Learning for Automated Classification of Abnormal Lung Sounds Obtained from Public Databases: A Systematic Review

Juan P Garcia-Mendez; Amos Lal; Svetlana Herasevich; Aysun Tekin; Yuliya Pinevich; Kirill Lipatov; Hsin-Yi Wang; Shahraz Qamar; Ivan N Ayala; Ivan Khapov; Danielle J Gerberi; Daniel Diedrich; Brian W Pickering; Vitaly Herasevich

doi:10.3390/bioengineering10101155

Bioengineering (Basel). 2023 Oct 2;10(10):1155. doi: 10.3390/bioengineering10101155.

Authors

Juan P Garcia-Mendez¹, Amos Lal², Svetlana Herasevich¹, Aysun Tekin¹, Yuliya Pinevich¹,³, Kirill Lipatov⁴, Hsin-Yi Wang¹,⁵,⁶, Shahraz Qamar¹, Ivan N Ayala¹, Ivan Khapov¹, Danielle J Gerberi⁷, Daniel Diedrich¹, Brian W Pickering¹, Vitaly Herasevich¹

Affiliations

¹ Department of Anesthesiology and Perioperative Medicine, Division of Critical Care, Mayo Clinic, Rochester, MN 55905, USA.
² Department of Medicine, Division of Pulmonary and Critical Care Medicine, Mayo Clinic, Rochester, MN 55905, USA.
³ Department of Cardiac Anesthesiology and Intensive Care, Republican Clinical Medical Center, 223052 Minsk, Belarus.
⁴ Division of Pulmonary Medicine, Mayo Clinic Health Systems, Essentia Health, Duluth, MN 55805, USA.
⁵ Department of Anesthesiology, Taipei Veterans General Hospital, National Yang Ming Chiao Tung University, Taipei 11217, Taiwan.
⁶ Department of Biomedical Sciences and Engineering, National Central University, Taoyuan 320317, Taiwan.
⁷ Mayo Clinic Libraries, Mayo Clinic, Rochester, MN 55905, USA.

Abstract

Pulmonary auscultation is essential for detecting abnormal lung sounds during physical assessments, but its reliability depends on the operator. Machine learning (ML) models offer an alternative by automatically classifying lung sounds. ML models require substantial data, and public databases aim to address this limitation. This systematic review compares the characteristics, diagnostic accuracy, concerns, and data sources of existing models in the literature. Papers retrieved from five major databases and published between 1990 and 2022 were assessed. Quality assessment was performed with a modified QUADAS-2 tool. The review encompassed 62 studies that used ML models and public-access databases for lung sound classification. Artificial neural networks (ANN) and support vector machines (SVM) were frequently employed as the ML classifiers. Accuracy ranged from 49.43% to 100% for discriminating abnormal sound types and from 69.40% to 99.62% for disease classification. Seventeen public databases were identified, with the ICBHI 2017 database being the most used (66%). The majority of studies exhibited a high risk of bias and concerns related to patient selection and reference standards. In summary, ML models can effectively classify abnormal lung sounds using publicly available data sources. Nevertheless, inconsistent reporting and methodologies limit progress in the field; public databases should therefore adhere to standardized recording and labeling procedures.

Keywords: deep learning (DL); electronic auscultation; lung sounds; machine learning (ML); public databases.

Publication types

Review

Grants and funding

This research received no external funding.