Deep Learning-Based Violin Bowing Action Recognition

Shih-Wei Sun; Bao-Yun Liu; Pao-Chi Chang

doi:10.3390/s20205732

Deep Learning-Based Violin Bowing Action Recognition

Sensors (Basel). 2020 Oct 9;20(20):5732. doi: 10.3390/s20205732.

Authors

Shih-Wei Sun^{1

2}, Bao-Yun Liu³, Pao-Chi Chang³

Affiliations

¹ Deptartment of New Media Art, Taipei National University of the Arts, Taipei 11201, Taiwan.
² Computer Center, Taipei National University of the Arts, Taipei 11201, Taiwan.
³ Deptartment of Communication Engineering, National Central University, Taoyuan 32001, Taiwan.

Abstract

We propose a violin bowing action recognition system that can accurately recognize distinct bowing actions in classical violin performance. This system can recognize bowing actions by analyzing signals from a depth camera and from inertial sensors that are worn by a violinist. The contribution of this study is threefold: (1) a dataset comprising violin bowing actions was constructed from data captured by a depth camera and multiple inertial sensors; (2) data augmentation was achieved for depth-frame data through rotation in three-dimensional world coordinates and for inertial sensing data through yaw, pitch, and roll angle transformations; and, (3) bowing action classifiers were trained using different modalities, to compensate for the strengths and weaknesses of each modality, based on deep learning methods with a decision-level fusion process. In experiments, large external motions and subtle local motions produced from violin bow manipulations were both accurately recognized by the proposed system (average accuracy > 80%).

Keywords: action recognition; decision level fusion; deep learning applications; depth camera; human perceptual cognition; inertial sensor; violin bowing actions.

MeSH terms

Deep Learning*
Movement*
Music

Abstract

MeSH terms

Grants and funding