An initial investigation into the real-time conversion of facial surface EMG signals to audible speech

Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug:2016:888-891. doi: 10.1109/EMBC.2016.7590843.

Abstract

This paper presents early-stage results of our investigations into the direct conversion of facial surface electromyographic (EMG) signals into audible speech in a real-time setting, which opens novel avenues for research and system improvement through real-time feedback. The system uses a pipeline approach with five stages: online acquisition of EMG data, extraction of EMG features, mapping of EMG features to audio features, synthesis of audio waveforms from audio features, and output of the audio waveforms via speakers or headphones. Our system performs EMG-to-Speech conversion with low latency on a continuous stream of EMG data, enabling near-instantaneous audio output during audible as well as silent speech production. In this paper, we analyze the latency incurred by each of our system's components, as well as the tradeoffs between conversion quality, latency, and required training duration.
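
The staged structure described above, together with per-component latency accounting, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the synthetic sine-wave input, the mean/RMS features, the scaling "mapping", the sample-hold "synthesis", and all function names are hypothetical placeholders chosen only to show how a streaming pipeline can be timed stage by stage.

```python
import math
import time
from typing import Callable, Dict, Iterable, Iterator, List, Tuple

Frame = List[float]
Stage = Tuple[str, Callable[[Frame], Frame]]


def acquire(n_frames: int, frame_len: int = 8) -> Iterator[Frame]:
    """Stand-in for online EMG acquisition: yields synthetic sine frames."""
    for i in range(n_frames):
        yield [math.sin(0.1 * (i * frame_len + k)) for k in range(frame_len)]


def extract_features(frame: Frame) -> Frame:
    """Placeholder EMG features (assumption): frame mean and RMS."""
    mean = sum(frame) / len(frame)
    rms = math.sqrt(sum(x * x for x in frame) / len(frame))
    return [mean, rms]


def map_features(feats: Frame) -> Frame:
    """Placeholder for a trained EMG-to-audio feature mapping."""
    return [2.0 * f for f in feats]


def synthesize(audio_feats: Frame) -> Frame:
    """Placeholder synthesis: hold the first audio feature for 4 samples."""
    return [audio_feats[0]] * 4


# Processing stages applied to every incoming frame, in order.
STAGES: List[Stage] = [
    ("extract", extract_features),
    ("map", map_features),
    ("synthesize", synthesize),
]


def run_pipeline(
    frames: Iterable[Frame], stages: List[Stage]
) -> Tuple[Frame, Dict[str, float]]:
    """Push each frame through all stages, accumulating seconds per stage."""
    latency = {name: 0.0 for name, _ in stages}
    output: Frame = []
    for frame in frames:
        data = frame
        for name, stage in stages:
            start = time.perf_counter()
            data = stage(data)
            latency[name] += time.perf_counter() - start
        output.extend(data)  # a real system would hand this to an audio device
    return output, latency


if __name__ == "__main__":
    samples, latency = run_pipeline(acquire(5), STAGES)
    print(f"{len(samples)} output samples")
    for name, seconds in latency.items():
        print(f"{name}: {seconds * 1e3:.3f} ms total")
```

Because frames are processed one at a time as they arrive, the output can begin before the input stream ends, and the per-stage totals show which component dominates the delay. In a real system the acquisition buffer length and audio output buffering would add to these figures.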

MeSH terms

  • Electromyography / methods*
  • Humans
  • Signal Processing, Computer-Assisted*
  • Speech / physiology*