Word Categorization of Vowel Durational Changes in Speech-Modulated Bone-Conducted Ultrasound

Audiol Res. 2021 Jul 14;11(3):357-364. doi: 10.3390/audiolres11030033.

Abstract

Ultrasound can deliver speech information when it is amplitude-modulated with speech and presented via bone conduction. This speech-modulated bone-conducted ultrasound (SM-BCU) can also transmit prosodic information. However, there is insufficient research on the recognition of vowel duration in SM-BCU. The aim of this study was to investigate the categorization of vowel durational changes in SM-BCU using a behavioral test. Eight Japanese-speaking participants with normal hearing participated in a forced-choice behavioral task to discriminate between "hato" (pigeon) and "haato" (heart). Speech signal stimuli were presented in seven duration grades from 220 ms to 340 ms. The threshold at which 50% of responses were "haato" was calculated and compared for air-conducted audible sound (ACAS) and SM-BCU. The boundary width was also evaluated. Although the SM-BCU threshold (mean: 274.6 ms) was significantly longer than the ACAS threshold (mean: 269.6 ms), there were no differences in boundary width. These results suggest that SM-BCU can deliver prosodic information about vowel duration with a similar difference limen to that of ACAS in normal hearing.

Keywords: amplitude modulation; bone-conduction; prosody; ultrasonic perception; ultrasound; vowel.