Word Categorization of Vowel Durational Changes in Speech-Modulated Bone-Conducted Ultrasound

Tadao Okayasu; Tadashi Nishimura; Akinori Yamashita; Yoshiki Nagatani; Takashi Inoue; Yuka Uratani; Toshiaki Yamanaka; Hiroshi Hosoi; Tadashi Kitahara

doi:10.3390/audiolres11030033

Audiol Res. 2021 Jul 14;11(3):357-364. doi: 10.3390/audiolres11030033.

Authors

Tadao Okayasu¹, Tadashi Nishimura¹, Akinori Yamashita¹, Yoshiki Nagatani², Takashi Inoue³, Yuka Uratani¹, Toshiaki Yamanaka¹, Hiroshi Hosoi⁴, Tadashi Kitahara¹

Affiliations

¹ Department of Otolaryngology-Head and Neck Surgery, Nara Medical University, 840 Shijo-cho, Kashihara 634-8522, Japan.
² Pixie Dust Technologies, 3F, 4F, Sumitomo Fudosan Suidobashi Nisiguchi Bldg, 2-20-5, Kanda-Misakicho, Chiyoda-ku, Tokyo 101-0061, Japan.
³ Institute for Clinical and Translational Science, Nara Medical University, 840 Shijo-cho, Kashihara 634-8522, Japan.
⁴ MBT (Medicine-Based Town) Institute, Nara Medical University, 840 Shijo-cho, Kashihara 634-8522, Japan.

Abstract

Ultrasound can deliver speech information when it is amplitude-modulated with speech and presented via bone conduction. This speech-modulated bone-conducted ultrasound (SM-BCU) can also transmit prosodic information. However, there is insufficient research on the recognition of vowel duration in SM-BCU. The aim of this study was to investigate the categorization of vowel durational changes in SM-BCU using a behavioral test. Eight Japanese-speaking participants with normal hearing participated in a forced-choice behavioral task to discriminate between "hato" (pigeon) and "haato" (heart). Speech signal stimuli were presented in seven duration grades from 220 ms to 340 ms. The threshold at which 50% of responses were "haato" was calculated and compared for air-conducted audible sound (ACAS) and SM-BCU. The boundary width was also evaluated. Although the SM-BCU threshold (mean: 274.6 ms) was significantly longer than the ACAS threshold (mean: 269.6 ms), there were no differences in boundary width. These results suggest that SM-BCU can deliver prosodic information about vowel duration with a similar difference limen to that of ACAS in normal hearing.

Keywords: amplitude modulation; bone-conduction; prosody; ultrasonic perception; ultrasound; vowel.
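
The two measures named in the abstract, the 50% "haato" threshold and the boundary width, can be illustrated with a minimal sketch. It is not the authors' analysis: the abstract does not state how either measure was estimated. The sketch assumes that the seven duration grades are equally spaced in 20 ms steps, that the threshold is found by linear interpolation between grades, and that the boundary width is the distance between the 25% and 75% points. The response proportions are invented for illustration, and the function name `crossing` is hypothetical.

```python
def crossing(durations_ms, proportions, level):
    """Linearly interpolate the duration at which the proportion of
    "haato" responses first rises through `level`."""
    points = list(zip(durations_ms, proportions))
    for (d0, p0), (d1, p1) in zip(points, points[1:]):
        if p0 <= level <= p1 and p1 > p0:
            return d0 + (level - p0) * (d1 - d0) / (p1 - p0)
    raise ValueError("response curve never crosses the requested level")


# Assumption: seven equally spaced grades from 220 ms to 340 ms.
durations_ms = [220, 240, 260, 280, 300, 320, 340]

# Invented proportions of "haato" responses per grade (not study data).
haato_proportion = [0.0, 0.05, 0.30, 0.70, 0.95, 1.0, 1.0]

# Threshold: duration at which 50% of responses are "haato".
threshold_ms = crossing(durations_ms, haato_proportion, 0.50)

# Assumed definition of boundary width: 75% point minus 25% point.
boundary_width_ms = (
    crossing(durations_ms, haato_proportion, 0.75)
    - crossing(durations_ms, haato_proportion, 0.25)
)

print(f"threshold: {threshold_ms:.1f} ms")
print(f"boundary width: {boundary_width_ms:.1f} ms")
```

With these invented proportions the threshold comes out at about 270 ms and the boundary width at about 28 ms. Running the same calculation separately on ACAS and SM-BCU responses would give the pair of values that the study compares.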
