Automated Speech Audiometry for Integrated Voice Over Internet Protocol Communication Services

Am J Audiol. 2022 Sep 21;31(3S):980-992. doi: 10.1044/2022_AJA-21-00217. Epub 2022 Aug 22.

Abstract

Purpose: Problems in speech recognition are often apparent in telecommunication situations. For ecologically valid assessments of such conditions, it is important to quantify the impact of real environments including acoustic conditions at a far-end communication device and all paths of transmission degradation. This study presents an automated matrix sentence test procedure based on automatic speech recognition (ASR) integrated in a Voice over Internet Protocol (VoIP) infrastructure and compares the individual effects of transmission degradations with results from laboratory measurements.

Method: Speech recognition thresholds (SRTs) were measured in 16 normal-hearing subjects in four test conditions: (a) a laboratory condition guided by a human experimenter, (b) a laboratory condition with reduced bandwidth and (c) additionally reduced headset quality to simulate typical communication systems, and (d) an automated, ASR-controlled adaptive test procedure over a real VoIP infrastructure. Errors of the ASR system were analyzed to show possible effects on measurement outcome Results: Measured SRTs showed a highly significant correlation (r = .93) between the fully automatic and "laboratory" conditions, with a constant bias of about 1 dB indicating a linear shift of the data without affecting the distribution around the mean. The individual impact of the different system degradations on SRTs could be quantified Conclusions: This study provides a proof of concept for automated ASR-based SRT measurements over VoIP systems for speech audiometric testing in real communication systems, as it produced results comparable to traditional laboratory settings for this group of 16 normal-hearing subjects. This makes VoIP services a promising candidate for speech audiometric testing in real communication systems.

MeSH terms

  • Audiometry, Speech
  • Humans
  • Internet
  • Language
  • Speech
  • Speech Perception*