Low-Cost Human-Machine Interface for Computer Control with Facial Landmark Detection and Voice Commands

Sensors (Basel). 2022 Nov 29;22(23):9279. doi: 10.3390/s22239279.

Abstract

Computers are now central to daily life, which puts people with upper limb disabilities at a real disadvantage because conventional interaction relies on the hands. This work develops an interface for emulating mouse and keyboard functions (EMKEY) that applies artificial vision and voice recognition to replace the use of hands. Pointer control is achieved by head movement, whereas voice recognition is used to perform interface functionalities, including speech-to-text transcription. To evaluate the interface's usability and usefulness, two studies were carried out. The first study was performed with 30 participants without physical disabilities; it found significant correlations between the emulator's usability and aspects such as adaptability, execution time, and the participant's age. The second study analyzed use of the emulator by four participants with motor disabilities. The interface was best used by the participant with cerebral palsy, followed by the participants with upper limb paralysis, spina bifida, and muscular dystrophy. In general, the results show that the proposed interface is easy to use, practical, fairly accurate, and works on a wide range of computers.
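
The abstract does not say how head movement is converted into pointer motion. As a rough illustration only, the sketch below assumes a single tracked facial landmark (here a nose tip in normalized image coordinates), a calibrated neutral position, a dead zone that ignores small jitter, and a linear gain. The names `PointerConfig`, `pointer_delta`, and `move_pointer`, and all numeric values, are hypothetical and are not taken from EMKEY.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PointerConfig:
    # Both values are illustrative assumptions, not parameters from the paper.
    dead_zone: float = 0.02  # normalized offset treated as jitter and ignored
    gain: float = 800.0      # pixels moved per unit of normalized offset


def pointer_delta(nose, neutral, cfg=PointerConfig()):
    """Map a landmark offset from the neutral pose to a pointer step in pixels."""
    def axis(value, center):
        offset = value - center
        if abs(offset) <= cfg.dead_zone:
            return 0
        # Subtract the dead zone so motion starts smoothly at its edge.
        sign = 1 if offset > 0 else -1
        return round(sign * (abs(offset) - cfg.dead_zone) * cfg.gain)

    return axis(nose[0], neutral[0]), axis(nose[1], neutral[1])


def move_pointer(position, delta, screen):
    """Apply a pointer step and clamp the result to the screen bounds."""
    x = min(max(position[0] + delta[0], 0), screen[0] - 1)
    y = min(max(position[1] + delta[1], 0), screen[1] - 1)
    return x, y
```

In a real system the landmark would come from a face-tracking library and the resulting position would be passed to an OS-level pointer API; both are left out here so the mapping itself stays self-contained.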

Keywords: H/M interface; face tracking; facial landmarks; handicap; keyboard; mouse; speech recognition; voice commands.

MeSH terms

  • Computers
  • Disabled Persons*
  • Humans
  • Speech
  • User-Computer Interface
  • Voice*