Noise profiling for speech enhancement employing machine learning models

Krzysztof Kąkol; Gražina Korvel; Bożena Kostek

doi:10.1121/10.0016495

Noise profiling for speech enhancement employing machine learning models

J Acoust Soc Am. 2022 Dec;152(6):3595. doi: 10.1121/10.0016495.

Authors

Krzysztof Kąkol¹, Gražina Korvel², Bożena Kostek³

Affiliations

¹ PGS Software, Wrocław, 50-086, Poland.
² Institute of Data Science and Digital Technologies, Vilnius University, Vilnius, 08412, Lithuania.
³ Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Gdańsk, 80-233, Poland.

PMID: 36586827
DOI: 10.1121/10.0016495

Abstract

This paper aims to propose a noise profiling method that can be performed in near real time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features derived from the Aurora noise dataset. This is to select the best-performing classifier in the context of noise profiling. Therefore, a comparison of all classifier outcomes is shown based on effectiveness metrics. Also, confusion matrices prepared for all tested models are presented. The second part of the experiment consists of selecting the algorithm that scored the best, i.e., Naive Bayes, resulting in an accuracy of 96.76%, and using it in a noise-type recognition model to demonstrate that it can perform in a stable way. Classification results are derived from the real-life recordings performed in momentary and averaging modes. The key contribution is discussed regarding speech intelligibility improvements in the presence of noise, where identifying the type of noise is crucial. Finally, conclusions deliver the overall findings and future work directions.

Publication types

Review

MeSH terms

Algorithms
Bayes Theorem
Machine Learning
Noise* / adverse effects
Speech Intelligibility*