A hierarchical framework approach for voice activity detection and speech enhancement

ScientificWorldJournal. 2014:2014:723643. doi: 10.1155/2014/723643. Epub 2014 May 12.

Abstract

Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.

MeSH terms

  • Humans
  • Noise*
  • Speech Perception / physiology
  • Voice*