A multi-branch convolutional neural network for snoring detection based on audio

Hao Dong; Haitao Wu; Guan Yang; Junming Zhang; Keqin Wan

doi:10.1080/10255842.2024.2317438

A multi-branch convolutional neural network for snoring detection based on audio

Comput Methods Biomech Biomed Engin. 2024 Feb 19:1-12. doi: 10.1080/10255842.2024.2317438. Online ahead of print.

Authors

Hao Dong^{1

2}, Haitao Wu^{2

3}, Guan Yang¹, Junming Zhang^{2

3

4

5}, Keqin Wan²

Affiliations

¹ School of Computer Science, Zhongyuan University of Technology, Henan, China.
² School of Computing and Artificial Intelligence, Huanghuai University, Henan, China.
³ Henan Key Laboratory of Smart Lighting, Henan, China.
⁴ Henan Joint International Research Laboratory of Behavior Optimization Control for Smart Robots, Henan, China.
⁵ Zhumadian Artificial Intelligence and Medical Engineering Technical Research Centre, Henan, China.

PMID: 38372231
DOI: 10.1080/10255842.2024.2317438

Abstract

Obstructive sleep apnea (OSA) is associated with various health complications, and snoring is a prominent characteristic of this disorder. Therefore, the exploration of a concise and effective method for detecting snoring has consistently been a crucial aspect of sleep medicine. As the easily accessible data, the identification of snoring through sound analysis offers a more convenient and straightforward method. The objective of this study was to develop a convolutional neural network (CNN) for classifying snoring and non-snoring events based on audio. This study utilized Mel-frequency cepstral coefficients (MFCCs) as a method for extracting features during the preprocessing of raw data. In order to extract multi-scale features from the frequency domain of sound sources, this study proposes the utilization of a multi-branch convolutional neural network (MBCNN) for the purpose of classification. The network utilized asymmetric convolutional kernels to acquire additional information, while the adoption of one-hot encoding labels aimed to mitigate the impact of labels. The experiment tested the network's performance by utilizing a publicly available dataset consisting of 1,000 sound samples. The test results indicate that the MBCNN achieved a snoring detection accuracy of 99.5%. The integration of multi-scale features and the implementation of MBCNN, based on audio data, have demonstrated a substantial improvement in the performance of snoring classification.

Keywords: Obstructive sleep apnea; convolutional neural network; deep learning; multi-scale features; snore detection.