Adaptive redundant speech transmission over wireless multimedia sensor networks based on estimation of perceived speech quality

Jin Ah Kang; Hong Kook Kim

doi:10.3390/s110908469

Adaptive redundant speech transmission over wireless multimedia sensor networks based on estimation of perceived speech quality

Sensors (Basel). 2011;11(9):8469-84. doi: 10.3390/s110908469. Epub 2011 Aug 31.

Authors

Jin Ah Kang¹, Hong Kook Kim

Affiliation

¹ School of Information and Communications, Gwangju Institute of Science and Technology, Gwangju 500-712, Korea. jinari@gist.ac.kr

Abstract

An adaptive redundant speech transmission (ARST) approach to improve the perceived speech quality (PSQ) of speech streaming applications over wireless multimedia sensor networks (WMSNs) is proposed in this paper. The proposed approach estimates the PSQ as well as the packet loss rate (PLR) from the received speech data. Subsequently, it decides whether the transmission of redundant speech data (RSD) is required in order to assist a speech decoder to reconstruct lost speech signals for high PLRs. According to the decision, the proposed ARST approach controls the RSD transmission, then it optimizes the bitrate of speech coding to encode the current speech data (CSD) and RSD bitstream in order to maintain the speech quality under packet loss conditions. The effectiveness of the proposed ARST approach is then demonstrated using the adaptive multirate-narrowband (AMR-NB) speech codec and ITU-T Recommendation P.563 as a scalable speech codec and the PSQ estimation, respectively. It is shown from the experiments that a speech streaming application employing the proposed ARST approach significantly improves speech quality under packet loss conditions in WMSNs.

Keywords: AMR-NB; packet loss; redundant speech transmission; speech quality estimation; speech streaming; wireless multimedia sensor network.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Humans
Radio Waves*
Speech*