Robust speech recognition from binary masks

Arun Narayanan; DeLiang Wang

doi:10.1121/1.3497358

Robust speech recognition from binary masks

J Acoust Soc Am. 2010 Nov;128(5):EL217-22. doi: 10.1121/1.3497358.

Authors

Arun Narayanan¹, DeLiang Wang

Affiliation

¹ Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio 43210, USA. narayaar@cse.ohio-state.edu

PMID: 21110529
DOI: 10.1121/1.3497358

Abstract

Inspired by recent evidence that a binary pattern may provide sufficient information for human speech recognition, this letter proposes a fundamentally different approach to robust automatic speech recognition. Specifically, recognition is performed by classifying binary masks corresponding to a word utterance. The proposed method is evaluated using a subset of the TIDigits corpus to perform isolated digit recognition. Despite dramatic reduction of speech information encoded in a binary mask, the proposed system performs surprisingly well. The system is compared with a traditional HMM based approach and is shown to perform well under low SNR conditions.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Humans
Noise
Phonetics*
Reproducibility of Results
Software Design*
Speech Recognition Software / standards*