Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1

PLoS One. 2021 Jul 1;16(7):e0254134. doi: 10.1371/journal.pone.0254134. eCollection 2021.

Abstract

A reliable, remote, and continuous real-time respiratory sound monitor with automated respiratory sound analysis ability is urgently required in many clinical scenarios, such as monitoring the disease progression of coronavirus disease 2019, to replace conventional auscultation with a handheld stethoscope. However, a robust computerized respiratory sound analysis algorithm for breath phase detection and adventitious sound detection at the recording level has not yet been validated in practical applications. In this study, we developed a lung sound database (HF_Lung_V1) comprising 9,765 audio files of lung sounds (duration of 15 s each), 34,095 inhalation labels, 18,349 exhalation labels, 13,883 continuous adventitious sound (CAS) labels (comprising 8,457 wheeze labels, 686 stridor labels, and 4,740 rhonchus labels), and 15,606 discontinuous adventitious sound labels (all crackles). We conducted benchmark tests using long short-term memory (LSTM), gated recurrent unit (GRU), bidirectional LSTM (BiLSTM), bidirectional GRU (BiGRU), convolutional neural network (CNN)-LSTM, CNN-GRU, CNN-BiLSTM, and CNN-BiGRU models for breath phase detection and adventitious sound detection. We also compared the performance of the LSTM-based models against the GRU-based models, the unidirectional models against the bidirectional models, and the models with a CNN against those without. The results revealed that these models exhibited adequate performance in lung sound analysis. The GRU-based models outperformed the LSTM-based models in terms of F1 scores and areas under the receiver operating characteristic curves in most of the defined tasks. Furthermore, all bidirectional models outperformed their unidirectional counterparts. Finally, the addition of a CNN improved the accuracy of lung sound analysis, especially in the CAS detection tasks.
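To make the benchmarked architecture families concrete, the sketch below shows one plausible CNN-BiGRU frame-level detector of the kind compared in the abstract. It is a minimal illustration, not the authors' implementation: the spectrogram input shape, layer sizes, and the single-target detection head are assumptions chosen for clarity, and the hyperparameters are not taken from the paper.

```python
# Minimal sketch of a CNN-BiGRU frame-level detector (PyTorch), assuming a
# mel-spectrogram input of shape (batch, 1, n_mels, n_frames) and per-frame
# binary labels (e.g., inhalation present/absent). Hyperparameters are
# illustrative only and do not reflect the paper's settings.
import torch
import torch.nn as nn


class CNNBiGRU(nn.Module):
    def __init__(self, n_mels: int = 64, cnn_channels: int = 32,
                 hidden_size: int = 128):
        super().__init__()
        # CNN front end: convolve over frequency and time, but pool only along
        # frequency so the time resolution of the frame-level labels is kept.
        self.cnn = nn.Sequential(
            nn.Conv2d(1, cnn_channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(cnn_channels),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=(2, 1)),
            nn.Conv2d(cnn_channels, cnn_channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(cnn_channels),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=(2, 1)),
        )
        # Bidirectional GRU over the time axis; dropping bidirectional=True or
        # swapping nn.GRU for nn.LSTM yields the other benchmarked variants.
        self.rnn = nn.GRU(
            input_size=cnn_channels * (n_mels // 4),
            hidden_size=hidden_size,
            batch_first=True,
            bidirectional=True,
        )
        # Per-frame logit for a single detection target.
        self.head = nn.Linear(2 * hidden_size, 1)

    def forward(self, spec: torch.Tensor) -> torch.Tensor:
        # spec: (batch, 1, n_mels, n_frames)
        x = self.cnn(spec)                    # (batch, C, n_mels // 4, n_frames)
        x = x.permute(0, 3, 1, 2).flatten(2)  # (batch, n_frames, C * n_mels // 4)
        x, _ = self.rnn(x)                    # (batch, n_frames, 2 * hidden_size)
        return self.head(x).squeeze(-1)       # (batch, n_frames) frame-level logits


if __name__ == "__main__":
    model = CNNBiGRU()
    dummy = torch.randn(4, 1, 64, 938)  # e.g., 15 s clips as mel spectrograms
    logits = model(dummy)
    print(logits.shape)                 # torch.Size([4, 938])
```

Frame-level logits like these can be thresholded and post-processed into event segments, which is how recording-level detection of breath phases, CAS, and crackles is typically evaluated.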

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Benchmarking
  • COVID-19 / diagnosis
  • COVID-19 / physiopathology*
  • Databases, Factual
  • Disease Progression
  • Female
  • Humans
  • Lung / physiopathology*
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Respiration
  • Respiratory Sounds / physiopathology*

Grants and funding

Raising Children Medical Foundation, Taiwan, fully funded the lung sound collection and contributed the recordings to the Taiwan Society of Emergency and Critical Care Medicine. Heroic Faith Medical Science Co., Ltd., Taipei, Taiwan, provided the lung sound recording device (HF-Type-1) free of charge for the study and fully sponsored the data labeling and deep learning model training. No additional external funding was received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.